{"doi":"10.1093/nar/gky1008","title":"gcMeta: a Global Catalogue of Metagenomics platform to support the archiving, standardization and analysis of microbiome data","abstract":"Meta-omics approaches have been increasingly used to study the structure and function of the microbial communities. A variety of large-scale collaborative projects are being conducted to encompass samples from diverse environments and habitats. This change has resulted in enormous demands for long-term data maintenance and capacity for data analysis. The Global Catalogue of Metagenomics (gcMeta) is a part of the 'Chinese Academy of Sciences Initiative of Microbiome (CAS-CMI)', which focuses on studying the human and environmental microbiome, establishing depositories of samples, strains and data, as well as promoting international collaboration. To accommodate and rationally organize massive datasets derived from several thousands of human and environmental microbiome samples, gcMeta features a database management system for archiving and publishing data in a standardized way. Another main feature is the integration of more than ninety web-based data analysis tools and workflows through a Docker platform which enables data analysis by using various operating systems. This platform has been rapidly expanding, and now hosts data from the CAS-CMI and a number of other ongoing research projects. In conclusion, this platform presents a powerful and user-friendly service to support worldwide collaborative efforts in the field of meta-omics research. This platform is freely accessible at https://gcmeta.wdcm.org/.","journal":"Nucleic Acids Research","year":2018,"id":5087,"datarank":3.5630677617268067,"base_score":4.700480365792417,"endowment":4.700480365792417,"self_citation_contribution":0.7050720548688626,"citation_network_contribution":2.857995706857944,"self_endowment_contribution":0.7050720548688626,"citer_contribution":2.857995706857944,"corpus_percentile":68.91781936533768,"corpus_rank":383,"citation_count":111,"citer_count":83,"citers_with_citation_signal":66,"citers_with_endowment":66,"datacite_reuse_total":0,"is_dataset":true,"is_dataset_confidence":0.9319,"is_oa":true,"file_count":0,"downloads":0,"has_version_chain":false,"published_date":"2018-10-26","fair_score":52.9167,"fair_percentile":79.11169744942832,"algorithm_id":"datarank_citation_only_1hop_v6","ranking_scope":"data_only","authors":[{"id":51677,"name":"Heyuan Qi","orcid":null,"position":1,"is_corresponding":false},{"id":51678,"name":"Qinglan Sun","orcid":"0000-0002-8451-760X","position":2,"is_corresponding":false},{"id":51679,"name":"Guomei Fan","orcid":null,"position":3,"is_corresponding":false},{"id":4805,"name":"Shuang-jiang Liu","orcid":null,"position":4,"is_corresponding":false},{"id":6308,"name":"Jun Wang","orcid":"0000-0003-2509-9599","position":5,"is_corresponding":false},{"id":326,"name":"Baoli Zhu","orcid":"0000-0001-5326-9503","position":6,"is_corresponding":false},{"id":4806,"name":"Hongwei Liu","orcid":"0000-0001-6471-131X","position":7,"is_corresponding":false},{"id":19565,"name":"Fangqing Zhao","orcid":"0000-0002-6216-1235","position":8,"is_corresponding":false},{"id":12525,"name":"Xiaochen Wang","orcid":"0000-0003-1507-6465","position":9,"is_corresponding":false},{"id":51681,"name":"Xiaoxuan Hu","orcid":"0000-0002-0907-065X","position":10,"is_corresponding":false},{"id":51113,"name":"Wei Li","orcid":"0000-0002-0693-3536","position":11,"is_corresponding":false},{"id":13744,"name":"Jia Liu","orcid":"0000-0002-2070-7754","position":12,"is_corresponding":false},{"id":51682,"name":"Ye Tian","orcid":"0000-0002-7675-9270","position":13,"is_corresponding":false},{"id":51683,"name":"Linhuan Wu","orcid":"0000-0002-5255-1846","position":14,"is_corresponding":false},{"id":51684,"name":"Juncai Ma","orcid":"0000-0001-6382-8014","position":15,"is_corresponding":false},{"id":4808,"name":"Shuang‐Jiang Liu","orcid":"0000-0002-7585-310X","position":16,"is_corresponding":false},{"id":51676,"name":"Wenyu Shi","orcid":"0000-0001-7036-8917","position":0,"is_corresponding":true}],"reference_count":109,"raw_metadata":null,"created_at":"2026-03-01T18:20:47.508186Z","pmid":"30365027","pmcid":"PMC6324004","fwci":null,"citation_percentile":null,"influential_citations":0,"oa_status":"gold","license":"cc-by","views":0,"total_file_size_bytes":0,"version_count":0,"fair_f":65.0,"fair_a":67.5,"fair_i":37.5,"fair_r":41.6667,"fair_zscore":0.697,"fair_rationale":{"fair_score":52.92,"has_llm":true,"dimensions":{"F":{"name":"Findable","score":65.0,"criteria":[{"key":"f_has_doi","label":"Has a persistent DOI","kind":"deterministic","weight":1.0,"fraction":1.0,"signal":"DOI present","rationale":null},{"key":"f_repository_presence","label":"Indexed in repositories / literature DBs","kind":"deterministic","weight":1.0,"fraction":1.0,"signal":"datacite=0, pmcid=True, pmid=True","rationale":null},{"key":"f_persistent_ids","label":"Resolvable scholarly identifiers (OpenAlex)","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"no OpenAlex id","rationale":null},{"key":"f_metadata_richness","label":"Rich, machine-readable metadata","kind":"llm","weight":1.0,"fraction":0.5,"signal":null,"rationale":"The paper mentions adopting MIMS/MIMARKS/MIxS standards and ENVO ontologies (95 controlled terms), but does not describe machine-readable metadata (e.g., JSON-LD, schema.org) nor provide a formal metadata schema accessible via API or downloadable file."}]},"A":{"name":"Accessible","score":67.5,"criteria":[{"key":"a_open_access","label":"Open Access / files deposited","kind":"deterministic","weight":1.5,"fraction":1.0,"signal":"Open Access","rationale":null},{"key":"a_retrievable","label":"Free full text retrievable","kind":"deterministic","weight":1.0,"fraction":0.0,"signal":"0 OA location(s)","rationale":null},{"key":"a_access_protocol","label":"Clear data/code access protocol","kind":"llm","weight":1.0,"fraction":0.75,"signal":null,"rationale":"The platform URL (https://gcmeta.wdcm.org/) is provided and states free access, but no detailed machine-access protocol (e.g., public API, SPARQL endpoint, direct download without login for public data) is described, and some data require login/submission."}]},"I":{"name":"Interoperable","score":37.5,"criteria":[{"key":"i_linked_data","label":"Linked datasets / DataCite relations","kind":"deterministic","weight":1.0,"fraction":0.0,"signal":"linked_datasets=0, datacite=0","rationale":null},{"key":"i_standard_ids","label":"References data via standard accessions","kind":"deterministic","weight":1.0,"fraction":0.0,"signal":"accessions=0, trials=0","rationale":null},{"key":"i_standards","label":"Standard formats, vocabularies & identifiers","kind":"llm","weight":1.0,"fraction":0.75,"signal":null,"rationale":"Uses community standards (MIxS, ENVO) and a persistent identifier (PID) system, but text does not specify use of standard file formats (e.g., MAGE-TAB, ISA-Tab) or linked-data principles (e.g., RDF) for interoperability."}]},"R":{"name":"Reusable","score":41.67,"criteria":[{"key":"r_license","label":"Clear, open reuse license","kind":"deterministic","weight":1.5,"fraction":0.0,"signal":"no license","rationale":null},{"key":"r_downloads","label":"Demonstrated reuse (downloads)","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"downloads=0","rationale":null},{"key":"r_version","label":"Versioned / maintained","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"no version chain","rationale":null},{"key":"r_dataset","label":"Classified as a data resource","kind":"deterministic","weight":0.5,"fraction":1.0,"signal":"is_dataset","rationale":null},{"key":"r_reusability","label":"Data-availability statement, license & reproducibility","kind":"llm","weight":2.0,"fraction":0.667,"signal":null,"rationale":"License (CC BY 4.0) is stated, data are downloadable after publication, and PIDs support citation, but no explicit data-availability statement for underlying paper data and no reproducibility details (e.g., versioned software, Docker images for workflows) are provided."}]}},"suggestions":["Publish a formal, machine-readable metadata schema (e.g., JSON-LD) with controlled vocabularies accessible via an API.","Provide a public REST API or SPARQL endpoint for programmatic access to public data without requiring login.","Deposit the paper's underlying datasets in a recognized repository (e.g., INSDC) and include a data-availability statement with DOIs.","Make analysis workflows fully reproducible by providing Docker images with version numbers and input test data.","Add explicit adoption of standard file formats (e.g., ISA-Tab for metadata, FASTQ for sequences) to improve interoperability."],"model":"deepseek/deepseek-v4-flash","agent_version":"fair_agent_v2","fulltext_source":"epmc_xml"},"fair_model":"deepseek/deepseek-v4-flash","fair_agent_version":"fair_agent_v2","fair_fulltext_source":"epmc_xml","fair_has_llm":true,"fair_computed_at":"2026-06-18T00:39:40.370303Z","clinical_trials":[],"software_tools":[],"db_accessions":[],"linked_datasets":[],"topics":[]}