{"doi":"10.1093/nar/gkab1062","title":"HMDB 5.0: the Human Metabolome Database for 2022","abstract":"The Human Metabolome Database or HMDB (https://hmdb.ca) has been providing comprehensive reference information about human metabolites and their associated biological, physiological and chemical properties since 2007. Over the past 15 years, the HMDB has grown and evolved significantly to meet the needs of the metabolomics community and respond to continuing changes in internet and computing technology. This year's update, HMDB 5.0, brings a number of important improvements and upgrades to the database. These should make the HMDB more useful and more appealing to a larger cross-section of users. In particular, these improvements include: (i) a significant increase in the number of metabolite entries (from 114 100 to 217 920 compounds); (ii) enhancements to the quality and depth of metabolite descriptions; (iii) the addition of new structure, spectral and pathway visualization tools; (iv) the inclusion of many new and much more accurately predicted spectral data sets, including predicted NMR spectra, more accurately predicted MS spectra, predicted retention indices and predicted collision cross section data and (v) enhancements to the HMDB's search functions to facilitate better compound identification. Many other minor improvements and updates to the content, the interface, and general performance of the HMDB website have also been made. Overall, we believe these upgrades and updates should greatly enhance the HMDB's ease of use and its potential applications not only in human metabolomics but also in exposomics, lipidomics, nutritional science, biochemistry and clinical chemistry.","journal":"Nucleic Acids Research","year":2021,"id":5104,"datarank":8.837164152652724,"base_score":7.680637427560936,"endowment":7.680637427560936,"self_citation_contribution":1.1520956141341405,"citation_network_contribution":7.685068538518583,"self_endowment_contribution":1.1520956141341405,"citer_contribution":7.685068538518583,"corpus_percentile":76.56631407648494,"corpus_rank":289,"citation_count":2377,"citer_count":189,"citers_with_citation_signal":189,"citers_with_endowment":189,"datacite_reuse_total":0,"is_dataset":true,"is_dataset_confidence":0.9553,"is_oa":true,"file_count":0,"downloads":0,"has_version_chain":false,"published_date":"2021-11-19","fair_score":69.5833,"fair_percentile":99.0325417766051,"algorithm_id":"datarank_citation_only_1hop_v6","ranking_scope":"data_only","authors":[{"id":51860,"name":"AnChi Guo","orcid":null,"position":1,"is_corresponding":false},{"id":51861,"name":"Eponine Oler","orcid":null,"position":2,"is_corresponding":false},{"id":51129,"name":"Fei Wang","orcid":"0000-0001-6712-3468","position":3,"is_corresponding":false},{"id":51862,"name":"Afia Anjum","orcid":"0000-0002-8349-7811","position":4,"is_corresponding":false},{"id":51863,"name":"Harrison Peters","orcid":null,"position":5,"is_corresponding":false},{"id":51864,"name":"Raynard Dizon","orcid":null,"position":6,"is_corresponding":false},{"id":51865,"name":"Zinat Sayeeda","orcid":null,"position":7,"is_corresponding":false},{"id":51866,"name":"Siyang Tian","orcid":"0000-0002-7298-2520","position":8,"is_corresponding":false},{"id":51867,"name":"Brian L. Lee","orcid":null,"position":9,"is_corresponding":false},{"id":51868,"name":"Mark Berjanskii","orcid":null,"position":10,"is_corresponding":false},{"id":51869,"name":"Robert Mah","orcid":null,"position":11,"is_corresponding":false},{"id":51870,"name":"Mai Yamamoto","orcid":"0000-0003-0344-2747","position":12,"is_corresponding":false},{"id":51871,"name":"Juan Jovel","orcid":null,"position":13,"is_corresponding":false},{"id":51872,"name":"Claudia Torres-Calzada","orcid":"0000-0001-9372-7230","position":14,"is_corresponding":false},{"id":51873,"name":"Mickel Hiebert-Giesbrecht","orcid":"0000-0003-2947-187X","position":15,"is_corresponding":false},{"id":51874,"name":"Vicki W Lui","orcid":null,"position":16,"is_corresponding":false},{"id":51875,"name":"Dorna Varshavi","orcid":null,"position":17,"is_corresponding":false},{"id":51876,"name":"Dorsa Varshavi","orcid":"0000-0002-1425-8171","position":18,"is_corresponding":false},{"id":51877,"name":"Dana Allen","orcid":null,"position":19,"is_corresponding":false},{"id":51878,"name":"David Arndt","orcid":"0000-0003-0703-8469","position":20,"is_corresponding":false},{"id":51879,"name":"Nitya Khetarpal","orcid":"0000-0002-0881-4020","position":21,"is_corresponding":false},{"id":51880,"name":"Aadhavya Sivakumaran","orcid":"0000-0002-3975-1077","position":22,"is_corresponding":false},{"id":51881,"name":"Karxena Harford","orcid":null,"position":23,"is_corresponding":false},{"id":51882,"name":"Selena Sanford","orcid":null,"position":24,"is_corresponding":false},{"id":51883,"name":"Kristen Yee","orcid":null,"position":25,"is_corresponding":false},{"id":51884,"name":"Xuan Cao","orcid":"0000-0001-9713-4192","position":26,"is_corresponding":false},{"id":51885,"name":"Zachary Budinski","orcid":null,"position":27,"is_corresponding":false},{"id":51886,"name":"Jaanus Liigand","orcid":"0000-0002-8814-9111","position":28,"is_corresponding":false},{"id":51887,"name":"Lun Zhang","orcid":"0000-0002-0928-7568","position":29,"is_corresponding":false},{"id":51888,"name":"Jiamin Zheng","orcid":"0000-0002-6120-7035","position":30,"is_corresponding":false},{"id":51889,"name":"Rupasri Mandal","orcid":null,"position":31,"is_corresponding":false},{"id":51890,"name":"Naama Karu","orcid":"0000-0001-8005-0726","position":32,"is_corresponding":false},{"id":51891,"name":"Maija Dambrova","orcid":"0000-0002-1739-0928","position":33,"is_corresponding":false},{"id":51892,"name":"Helgi B. Schiöth","orcid":"0000-0001-7112-0921","position":34,"is_corresponding":false},{"id":51893,"name":"Russell Greiner","orcid":"0000-0001-8327-934X","position":35,"is_corresponding":false},{"id":51894,"name":"Vasuk Gautam","orcid":"0000-0002-9204-1963","position":36,"is_corresponding":false},{"id":51895,"name":"Dana G. Allen","orcid":null,"position":37,"is_corresponding":false},{"id":51896,"name":"Kristen S. Yee","orcid":null,"position":38,"is_corresponding":false},{"id":51859,"name":"David S. Wishart","orcid":"0000-0002-3207-2434","position":0,"is_corresponding":true}],"reference_count":45,"raw_metadata":{"citation_network_status":"fetched"},"created_at":"2026-03-01T18:20:47.508186Z","pmid":"34986597","pmcid":"PMC8728138","fwci":null,"citation_percentile":null,"influential_citations":0,"oa_status":"gold","license":"cc-by","views":0,"total_file_size_bytes":0,"version_count":0,"fair_f":90.0,"fair_a":80.0,"fair_i":50.0,"fair_r":58.3333,"fair_zscore":2.2046,"fair_rationale":{"fair_score":69.58,"has_llm":true,"dimensions":{"F":{"name":"Findable","score":90.0,"criteria":[{"key":"f_has_doi","label":"Has a persistent DOI","kind":"deterministic","weight":1.0,"fraction":1.0,"signal":"DOI present","rationale":null},{"key":"f_repository_presence","label":"Indexed in repositories / literature DBs","kind":"deterministic","weight":1.0,"fraction":1.0,"signal":"datacite=0, pmcid=True, pmid=True","rationale":null},{"key":"f_persistent_ids","label":"Resolvable scholarly identifiers (OpenAlex)","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"no OpenAlex id","rationale":null},{"key":"f_metadata_richness","label":"Rich, machine-readable metadata","kind":"llm","weight":1.0,"fraction":1.0,"signal":null,"rationale":"The paper explicitly states that all entries have unique, persistent HMDB identifiers and that metadata is provided in multiple machine-readable formats (e.g., SMILES, InChI, JSON, XML, OWL/OBO for ChemFOnt), ensuring rich machine-readable metadata."}]},"A":{"name":"Accessible","score":80.0,"criteria":[{"key":"a_open_access","label":"Open Access / files deposited","kind":"deterministic","weight":1.5,"fraction":1.0,"signal":"Open Access","rationale":null},{"key":"a_retrievable","label":"Free full text retrievable","kind":"deterministic","weight":1.0,"fraction":0.0,"signal":"0 OA location(s)","rationale":null},{"key":"a_access_protocol","label":"Clear data/code access protocol","kind":"llm","weight":1.0,"fraction":1.0,"signal":null,"rationale":"The paper states the HMDB website is open and free, data download is compatible with all modern web browsers, and spectral data files are available in universally readable formats (nmrML, mzML, CSV, PNG, etc.), providing a clear access protocol."}]},"I":{"name":"Interoperable","score":50.0,"criteria":[{"key":"i_linked_data","label":"Linked datasets / DataCite relations","kind":"deterministic","weight":1.0,"fraction":0.0,"signal":"linked_datasets=0, datacite=0","rationale":null},{"key":"i_standard_ids","label":"References data via standard accessions","kind":"deterministic","weight":1.0,"fraction":0.0,"signal":"accessions=0, trials=0","rationale":null},{"key":"i_standards","label":"Standard formats, vocabularies & identifiers","kind":"llm","weight":1.0,"fraction":1.0,"signal":null,"rationale":"The paper explicitly states use of standard formats (e.g., SMILES, InChI, FASTA, mzML, nmrML, JCAMP-DX, SBML, BioPax, OWL/OBO) and standard vocabularies/ontologies (e.g., ChemFOnt, SPLASH keys), ensuring high interoperability."}]},"R":{"name":"Reusable","score":58.33,"criteria":[{"key":"r_license","label":"Clear, open reuse license","kind":"deterministic","weight":1.5,"fraction":0.0,"signal":"no license","rationale":null},{"key":"r_downloads","label":"Demonstrated reuse (downloads)","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"downloads=0","rationale":null},{"key":"r_version","label":"Versioned / maintained","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"no version chain","rationale":null},{"key":"r_dataset","label":"Classified as a data resource","kind":"deterministic","weight":0.5,"fraction":1.0,"signal":"is_dataset","rationale":null},{"key":"r_reusability","label":"Data-availability statement, license & reproducibility","kind":"llm","weight":2.0,"fraction":1.0,"signal":null,"rationale":"The paper provides a data availability statement (all data freely available), a clear license (Creative Commons Attribution-NonCommercial 4.0 International), and extensive provenance information with references, supporting full reusability and reproducibility."}]}},"suggestions":["Explicitly state the use of a formal machine-readable metadata schema (e.g., DCAT or schema.org) for the dataset description to improve findability.","Provide a direct link to the data download page in the abstract or introduction to make the access protocol more immediately discoverable.","Include a versioning policy for the database (e.g., DOI for each version) to ensure clear version tracking and reusability.","Add a statement about the use of persistent identifiers (e.g., ORCID) for all authors to improve findability of contributors.","Specify the exact software dependencies and versions used for generating predicted data to enhance reproducibility."],"model":"deepseek/deepseek-v4-flash","agent_version":"fair_agent_v2","fulltext_source":"epmc_xml"},"fair_model":"deepseek/deepseek-v4-flash","fair_agent_version":"fair_agent_v2","fair_fulltext_source":"epmc_xml","fair_has_llm":true,"fair_computed_at":"2026-06-18T00:29:59.039053Z","clinical_trials":[],"software_tools":[],"db_accessions":[],"linked_datasets":[],"topics":[]}