{"doi":"10.1093/nar/gkae1007","title":"GutMetaNet: an integrated database for exploring horizontal gene transfer and functional redundancy in the human gut microbiome","abstract":"Metagenomic studies have revealed the critical roles of complex microbial interactions, including horizontal gene transfer (HGT) and functional redundancy (FR), in shaping the gut microbiome's functional capacity and resilience. However, the lack of comprehensive data integration and systematic analysis approaches has limited the in-depth exploration of HGT and FR dynamics across large-scale gut microbiome datasets. To address this gap, we present GutMetaNet (https://gutmetanet.deepomics.org/), a first-of-its-kind database integrating extensive human gut microbiome data with comprehensive HGT and FR analyses. GutMetaNet contains 21 567 human gut metagenome samples with whole-genome shotgun sequencing data related to various health conditions. Through systematic analysis, we have characterized the taxonomic profiles and FR profiles, and identified 14 636 HGT events using a shared reference genome database across the collected samples. These HGT events have been curated into 8049 clusters, which are annotated with categorized mobile genetic elements, including transposons, prophages, integrative mobilizable elements, genomic islands, integrative conjugative elements and group II introns. Additionally, GutMetaNet incorporates automated analyses and visualizations for the HGT events and FR, serving as an efficient platform for in-depth exploration of the interactions among gut microbiome taxa and their implications for human health.","journal":"Nucleic Acids Research","year":2024,"id":9047,"datarank":0.42668645768126945,"base_score":2.1972245773362196,"endowment":2.1972245773362196,"self_citation_contribution":0.32958368660043297,"citation_network_contribution":0.09710277108083651,"self_endowment_contribution":0.32958368660043297,"citer_contribution":0.09710277108083651,"corpus_percentile":50.44751830756713,"corpus_rank":610,"citation_count":11,"citer_count":10,"citers_with_citation_signal":6,"citers_with_endowment":6,"datacite_reuse_total":0,"is_dataset":true,"is_dataset_confidence":0.8465,"is_oa":true,"file_count":0,"downloads":0,"has_version_chain":false,"published_date":"2024-11-11","fair_score":52.9167,"fair_percentile":79.11169744942832,"algorithm_id":"datarank_citation_only_1hop_v6","ranking_scope":"data_only","authors":[{"id":77844,"name":"Yanfei Wang","orcid":"0000-0003-0745-3670","position":1,"is_corresponding":false},{"id":77845,"name":"Lijia Che","orcid":null,"position":2,"is_corresponding":false},{"id":72536,"name":"Shuo Yang","orcid":"0009-0008-0296-0183","position":3,"is_corresponding":false},{"id":77846,"name":"Xianglilan Zhang","orcid":"0000-0002-4946-4880","position":4,"is_corresponding":false},{"id":52518,"name":"Yu Lin","orcid":"0000-0003-2620-0345","position":5,"is_corresponding":false},{"id":77847,"name":"Yucheng Shi","orcid":null,"position":6,"is_corresponding":false},{"id":77848,"name":"Nanhe Zou","orcid":null,"position":7,"is_corresponding":false},{"id":77849,"name":"Shuai Wang","orcid":"0000-0002-1922-4878","position":8,"is_corresponding":false},{"id":77850,"name":"Yuanzheng Zhang","orcid":null,"position":9,"is_corresponding":false},{"id":77851,"name":"Zicheng Zhao","orcid":"0009-0009-9974-6403","position":10,"is_corresponding":false},{"id":77852,"name":"Shuai Cheng Li","orcid":"0000-0001-6246-6349","position":11,"is_corresponding":false},{"id":77853,"name":"Yuan Shi","orcid":"0000-0002-4571-4424","position":12,"is_corresponding":false},{"id":77854,"name":"Ning Zou","orcid":"0000-0003-2167-0612","position":13,"is_corresponding":false},{"id":77843,"name":"Yiqi Jiang","orcid":"0000-0003-4950-937X","position":0,"is_corresponding":true}],"reference_count":153,"raw_metadata":null,"created_at":"2026-03-01T18:20:47.508186Z","pmid":"39526401","pmcid":"PMC11701528","fwci":null,"citation_percentile":null,"influential_citations":0,"oa_status":null,"license":null,"views":0,"total_file_size_bytes":0,"version_count":0,"fair_f":65.0,"fair_a":67.5,"fair_i":37.5,"fair_r":41.6667,"fair_zscore":0.697,"fair_rationale":{"fair_score":52.92,"has_llm":true,"dimensions":{"F":{"name":"Findable","score":65.0,"criteria":[{"key":"f_has_doi","label":"Has a persistent DOI","kind":"deterministic","weight":1.0,"fraction":1.0,"signal":"DOI present","rationale":null},{"key":"f_repository_presence","label":"Indexed in repositories / literature DBs","kind":"deterministic","weight":1.0,"fraction":1.0,"signal":"datacite=0, pmcid=True, pmid=True","rationale":null},{"key":"f_persistent_ids","label":"Resolvable scholarly identifiers (OpenAlex)","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"no OpenAlex id","rationale":null},{"key":"f_metadata_richness","label":"Rich, machine-readable metadata","kind":"llm","weight":1.0,"fraction":0.5,"signal":null,"rationale":"The paper describes metadata fields (e.g., MeSH, country codes) but does not provide machine-readable metadata or use standard metadata schemas."}]},"A":{"name":"Accessible","score":67.5,"criteria":[{"key":"a_open_access","label":"Open Access / files deposited","kind":"deterministic","weight":1.5,"fraction":1.0,"signal":"Open Access","rationale":null},{"key":"a_retrievable","label":"Free full text retrievable","kind":"deterministic","weight":1.0,"fraction":0.0,"signal":"0 OA location(s)","rationale":null},{"key":"a_access_protocol","label":"Clear data/code access protocol","kind":"llm","weight":1.0,"fraction":0.75,"signal":null,"rationale":"A clear URL is given for data access, but no detailed protocol (e.g., API, authentication) or download instructions are provided."}]},"I":{"name":"Interoperable","score":37.5,"criteria":[{"key":"i_linked_data","label":"Linked datasets / DataCite relations","kind":"deterministic","weight":1.0,"fraction":0.0,"signal":"linked_datasets=0, datacite=0","rationale":null},{"key":"i_standard_ids","label":"References data via standard accessions","kind":"deterministic","weight":1.0,"fraction":0.0,"signal":"accessions=0, trials=0","rationale":null},{"key":"i_standards","label":"Standard formats, vocabularies & identifiers","kind":"llm","weight":1.0,"fraction":0.75,"signal":null,"rationale":"Standard identifiers (MeSH, KEGG, UHGG) and formats (MPA, FASTQ) are used, but the paper does not confirm that all data are in standard community formats."}]},"R":{"name":"Reusable","score":41.67,"criteria":[{"key":"r_license","label":"Clear, open reuse license","kind":"deterministic","weight":1.5,"fraction":0.0,"signal":"no license","rationale":null},{"key":"r_downloads","label":"Demonstrated reuse (downloads)","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"downloads=0","rationale":null},{"key":"r_version","label":"Versioned / maintained","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"no version chain","rationale":null},{"key":"r_dataset","label":"Classified as a data resource","kind":"deterministic","weight":0.5,"fraction":1.0,"signal":"is_dataset","rationale":null},{"key":"r_reusability","label":"Data-availability statement, license & reproducibility","kind":"llm","weight":2.0,"fraction":0.667,"signal":null,"rationale":"A data-availability statement and CC BY-NC license are present, and methods are detailed, but analysis code is not provided, limiting full reproducibility."}]}},"suggestions":["Provide metadata in machine-readable formats (e.g., JSON-LD) with standard schemas like MIxS.","Include an API or detailed download instructions for programmatic access to the data.","Deposit analysis scripts (e.g., for HGT detection and FR calculation) in a public repository with a DOI.","Assign a persistent identifier (e.g., DOI) to the database and cite it in the paper.","Ensure all downloadable data files use standard community formats (e.g., BIOM for taxonomic profiles)."],"model":"deepseek/deepseek-v4-flash","agent_version":"fair_agent_v2","fulltext_source":"epmc_xml"},"fair_model":"deepseek/deepseek-v4-flash","fair_agent_version":"fair_agent_v2","fair_fulltext_source":"epmc_xml","fair_has_llm":true,"fair_computed_at":"2026-06-18T00:50:06.377398Z","clinical_trials":[],"software_tools":[],"db_accessions":[],"linked_datasets":[],"topics":[]}