{"doi":"10.1038/s41467-020-19045-9","title":"A high-stringency blueprint of the human proteome","abstract":"The Human Proteome Organization (HUPO) launched the Human Proteome Project (HPP) in 2010, creating an international framework for global collaboration, data sharing, quality assurance and enhancing accurate annotation of the genome-encoded proteome. During the subsequent decade, the HPP established collaborations, developed guidelines and metrics, and undertook reanalysis of previously deposited community data, continuously increasing the coverage of the human proteome. On the occasion of the HPP’s tenth anniversary, we here report a 90.4% complete high-stringency human proteome blueprint. This knowledge is essential for discerning molecular processes in health and disease, as we demonstrate by highlighting potential roles the human proteome plays in our understanding, diagnosis and treatment of cancers, cardiovascular and infectious diseases.","journal":"Nature Communications","year":2020,"id":24986,"datarank":4.349729820895478,"base_score":5.384495062789089,"endowment":5.384495062789089,"self_citation_contribution":0.8076742594183635,"citation_network_contribution":3.542055561477114,"self_endowment_contribution":0.8076742594183635,"citer_contribution":3.542055561477114,"corpus_percentile":71.7,"corpus_rank":384,"citation_count":217,"citer_count":165,"citers_with_citation_signal":135,"citers_with_endowment":135,"datacite_reuse_total":0,"is_dataset":true,"is_dataset_confidence":null,"is_oa":false,"file_count":0,"downloads":0,"has_version_chain":false,"published_date":null,"fair_score":54.1667,"fair_percentile":80.05716798592788,"algorithm_id":"datarank_citation_only_1hop_v6","ranking_scope":"data_only","authors":[{"id":148288,"name":"Edouard C. Nice","orcid":null,"position":1,"is_corresponding":false},{"id":6362,"name":"Eric W. 
Deutsch","orcid":"0000-0001-8732-0928","position":2,"is_corresponding":false},{"id":121997,"name":"Lydie Lane","orcid":"0000-0002-9818-3030","position":3,"is_corresponding":false},{"id":32989,"name":"Gilbert S. Omenn","orcid":"0000-0002-8976-6074","position":4,"is_corresponding":false},{"id":148289,"name":"Stephen R. Pennington","orcid":null,"position":5,"is_corresponding":false},{"id":148290,"name":"Young-Ki Paik","orcid":null,"position":6,"is_corresponding":false},{"id":145274,"name":"Christopher M. Overall","orcid":"0000-0001-5844-2731","position":7,"is_corresponding":false},{"id":148291,"name":"Fernando J. Corrales","orcid":null,"position":8,"is_corresponding":false},{"id":148292,"name":"Ileana M. Cristea","orcid":null,"position":9,"is_corresponding":false},{"id":112291,"name":"Jennifer E. Van Eyk","orcid":"0000-0001-9050-148X","position":10,"is_corresponding":false},{"id":96344,"name":"Mathias Uhlén","orcid":"0000-0002-4858-8056","position":11,"is_corresponding":false},{"id":148293,"name":"Cecilia Lindskog","orcid":"0000-0001-5611-1015","position":12,"is_corresponding":false},{"id":51795,"name":"Daniel W. Chan","orcid":"0000-0002-6398-6597","position":13,"is_corresponding":false},{"id":11681,"name":"Amos Bairoch","orcid":"0000-0003-2826-6444","position":14,"is_corresponding":false},{"id":148294,"name":"James C. Waddington","orcid":null,"position":15,"is_corresponding":false},{"id":148295,"name":"Joshua L. 
Justice","orcid":null,"position":16,"is_corresponding":false},{"id":148296,"name":"Joshua LaBaer","orcid":null,"position":17,"is_corresponding":false},{"id":787,"name":"Henry Rodriguez","orcid":"0000-0002-4593-4232","position":18,"is_corresponding":false},{"id":148297,"name":"Fuchu He","orcid":null,"position":19,"is_corresponding":false},{"id":148298,"name":"Markus Kostrzewa","orcid":null,"position":20,"is_corresponding":false},{"id":64665,"name":"Peipei Ping","orcid":"0000-0003-3583-3881","position":21,"is_corresponding":false},{"id":148300,"name":"Rebekah L. Gundry","orcid":"0000-0002-9263-833X","position":22,"is_corresponding":false},{"id":148301,"name":"Peter Stewart","orcid":null,"position":23,"is_corresponding":false},{"id":148302,"name":"Sanjeeva Srivastava","orcid":null,"position":24,"is_corresponding":false},{"id":77400,"name":"Sudhir Srivastava","orcid":"0000-0002-7798-9772","position":25,"is_corresponding":false},{"id":148303,"name":"Fabio C. S. Nogueira","orcid":null,"position":26,"is_corresponding":false},{"id":148304,"name":"Gilberto B. Domont","orcid":null,"position":27,"is_corresponding":false},{"id":148305,"name":"Yves Vandenbrouck","orcid":"0000-0002-1292-373X","position":28,"is_corresponding":false},{"id":148306,"name":"Maggie P. Y. Lam","orcid":null,"position":29,"is_corresponding":false},{"id":148307,"name":"Sara Wennersten","orcid":null,"position":30,"is_corresponding":false},{"id":17887,"name":"Juan Antonio Vizcaino","orcid":"0000-0002-3905-4335","position":31,"is_corresponding":false},{"id":148308,"name":"Marc Wilkins","orcid":"0000-0002-5700-5684","position":32,"is_corresponding":false},{"id":148309,"name":"Jochen M. 
Schwenk","orcid":"0000-0001-8141-8449","position":33,"is_corresponding":false},{"id":44848,"name":"Emma Lundberg","orcid":"0000-0001-7034-0850","position":34,"is_corresponding":false},{"id":84381,"name":"Nuno Bandeira","orcid":"0000-0001-8385-3655","position":35,"is_corresponding":false},{"id":148310,"name":"Gyorgy Marko-Varga","orcid":null,"position":36,"is_corresponding":false},{"id":123432,"name":"Susan T. Weintraub","orcid":null,"position":37,"is_corresponding":false},{"id":148311,"name":"Charles Pineau","orcid":null,"position":38,"is_corresponding":false},{"id":148312,"name":"Ulrike Kusebauch","orcid":"0000-0001-6162-7577","position":39,"is_corresponding":false},{"id":78845,"name":"Robert L. Moritz","orcid":"0000-0002-3216-9447","position":40,"is_corresponding":false},{"id":148313,"name":"Seong Beom Ahn","orcid":null,"position":41,"is_corresponding":false},{"id":148314,"name":"Magnus Palmblad","orcid":null,"position":42,"is_corresponding":false},{"id":5648,"name":"Michael P. Snyder","orcid":"0000-0003-0784-7987","position":43,"is_corresponding":false},{"id":74631,"name":"Ruedi Aebersold","orcid":"0000-0002-9576-3267","position":44,"is_corresponding":false},{"id":148315,"name":"Mark S. 
Baker","orcid":"0000-0001-5858-4035","position":45,"is_corresponding":false},{"id":148287,"name":"Subash Adhikari","orcid":null,"position":0,"is_corresponding":false}],"reference_count":0,"raw_metadata":{"has_enrichment":true,"base_score":5.384495062789089,"endowment":5.384495062789089,"datacite_reuse_total":0,"file_count":0,"downloads":0,"views":0,"has_version_chain":false,"is_dataset":false,"is_oa":false,"pmid":"33067450","pmcid":"PMC7568584","openalex_id":"https://openalex.org/W3092794487","authors":[],"funders":[{"funder_name":"NIGMS NIH HHS","grant_id":"R01 GM114141","title":null},{"funder_name":"NIEHS NIH HHS","grant_id":"P30 ES017885","title":null},{"funder_name":"NIA NIH HHS","grant_id":"U19 AG023122","title":null},{"funder_name":"NCI NIH HHS","grant_id":"U24 CA210967","title":null},{"funder_name":"Wellcome Trust","grant_id":"WT101477MA","title":null},{"funder_name":"NIGMS NIH HHS","grant_id":"R01 GM087221","title":null},{"funder_name":"Wellcome Trust","grant_id":"208391/Z/17/Z","title":null},{"funder_name":"NIGMS NIH HHS","grant_id":"R24 GM127667","title":null},{"funder_name":"NCI NIH HHS","grant_id":"U24 CA115102","title":null},{"funder_name":"NCI NIH HHS","grant_id":"U24 CA210985","title":null},{"funder_name":"NHLBI NIH HHS","grant_id":"R01 HL111362","title":null},{"funder_name":"Canadian Institutes of Health Research","grant_id":"unidentified","title":"unidentified"},{"funder_name":"National Institutes of Health","grant_id":"1R24GM127667-01","title":"Advancing data and metadata standards for proteomics mass spectra"},{"funder_name":"Wellcome Trust","grant_id":"101477","title":"PRIDE Atlas."},{"funder_name":"National Institutes of Health","grant_id":"5R01GM087221-10","title":"Development of Trans Proteomic Pipeline, an Analysis Suite for Mass Spectrometry"},{"funder_name":"National Institutes of Health","grant_id":"5U24CA210985-03","title":"The Comprehensive Proteome Characterization Center at Johns Hopkins: High Precision Discovery and Confirmation of 
Genoproteomic Targets"},{"funder_name":"National Health and Medical Research Council (NHMRC)","grant_id":"1010303","title":"Colorectal Cancer Membrane Protein Interactomics [A Major Discriminator of Clinical Outcome]"},{"funder_name":"National Institutes of Health","grant_id":"2U24CA115102-06","title":"Clinical and Analytical Validation of Cancer Biomarkers"},{"funder_name":"National Science Foundation","grant_id":"1933311","title":"CIBR:  PTMexchange:  Globally harmonized re-analysis and sharing of data on post-translational modifications"},{"funder_name":"National Institutes of Health","grant_id":"5U24CA210967-04","title":"University of Michigan Proteogenomics Data Analysis Center"},{"funder_name":"National Institutes of Health","grant_id":"5U19AG023122-04","title":"Consortium to Study the Genetics of Longevity"},{"funder_name":"Wellcome Trust","grant_id":"208391","title":"The PRIDE database: A proteomics data hub in the life sciences"},{"funder_name":"National Institutes of Health","grant_id":"2P30ES017885-10A1","title":"Michigan Center on Lifestage Environmental Exposures and Disease (M-LEEaD)"},{"funder_name":"National Institutes of Health","grant_id":"1ZIABC011347-01","title":"Genetic Correlation with Efficacy and Toxicity of Brain Tumor Therapies"},{"funder_name":"Wellcome Trust","grant_id":"","title":null},{"funder_name":"Wellcome 
Trust","grant_id":"","title":null}],"total_grants":26,"fwci":16.8136,"citation_percentile":0.99657973,"influential_citations":4,"citation_trend":[{"year":2020,"count":5},{"year":2021,"count":41},{"year":2022,"count":61},{"year":2023,"count":42},{"year":2024,"count":40},{"year":2025,"count":23},{"year":2026,"count":5}],"oa_status":"gold","license":"cc-by","oa_locations":[{"url":"https://www.nature.com/articles/s41467-020-19045-9.pdf","host_type":"journal"},{"url":"https://www.nature.com/articles/s41467-020-19045-9.pdf","host_type":"GOLD"},{"url":"https://www.nature.com/articles/s41467-020-19045-9.pdf","host_type":"publisher"},{"url":"https://www.nature.com/articles/s41467-020-19045-9","host_type":"publisher"},{"url":"https://doi.org/10.1038/s41467-020-19045-9","host_type":"journal"},{"url":"https://pubmed.ncbi.nlm.nih.gov/33067450","host_type":"repository"},{"url":"https://archive-ouverte.unige.ch/unige:144197","host_type":"repository"},{"url":"http://hdl.handle.net/20.500.11850/463671","host_type":"repository"},{"url":"http://hdl.handle.net/1887/3182293","host_type":"repository"},{"url":"http://urn.kb.se/resolve?urn=urn:nbn:se:uu:diva-425495","host_type":"repository"},{"url":"https://hal.science/hal-03004830","host_type":"repository"},{"url":"http://hdl.handle.net/10261/228661","host_type":"repository"},{"url":"https://doaj.org/article/bb037a5bb053476e9a3946b7578e5277","host_type":"repository"},{"url":"https://lup.lub.lu.se/record/386225d4-c788-4d7a-a52c-6c832e7face9","host_type":"repository"},{"url":"https://www.ncbi.nlm.nih.gov/pmc/articles/7568584","host_type":"repository"},{"url":"https://hdl.handle.net/1887/3182293","host_type":"repository"},{"url":"https://doi.org/10.3929/ethz-b-000463671","host_type":"repository"},{"url":"https://europepmc.org/articles/PMC7568584","host_type":"Europe_PMC"},{"url":"https://europepmc.org/articles/PMC7568584?pdf=render","host_type":"Europe_PMC"},{"url":"http://dx.doi.org/10.1038/s41467-020-19045-9","host_type":""},{"url":"https:
//dx.doi.org/10.1038/s41467-020-19045-9","host_type":""},{"url":"https://sonar.ch/global/documents/76861","host_type":""},{"url":"https://hal.science/hal-03004830v1/document","host_type":""},{"url":"https://hal.science/hal-03004830v1","host_type":""},{"url":"https://publications.scilifelab.se/publication/3b8d1f21730c444f9488bdde02b7966a","host_type":""},{"url":"https://doi.org/https://doi.org/10.1038/s41467-020-19045-9","host_type":""}],"fields_of_study":["Advanced Proteomics Techniques and Applications","vaccines and immunoinformatics approaches","Genetics, Bioinformatics, and Biomedical Research","Biology","Medicine","Materials Science","ddc:616","0301 basic medicine","0303 health sciences","03 medical and health sciences","Disease","Human Genome Project","Humans","Proteome","Proteomics"],"mesh_terms":["Disease","Humans","Human Genome Project","Proteome","Proteomics"],"keywords":["Human proteome project","Proteome","Blueprint","Computational biology","Human genome","Biology","Data science","Genome","Bioinformatics","Computer science","Proteomics","Genetics","Gene","Engineering","Molekylärbiologi","Molecular medicine","[SDV]Life Sciences [q-bio]","Science","Q","Biochemistry and Molecular Biology","Proteomic analysis","Biochemistry","576","[SDV] Life Sciences [q-bio]","616","Perspective","Human Genome Project","Humans","Disease","Biokemi","Molecular Biology","Biokemi och molekylärbiologi"],"sdg_mappings":[{"sdg_number":3,"sdg_label":"3. 
Good health"},{"sdg_number":0,"sdg_label":"Partnerships for the goals"}],"linked_datasets":[],"clinical_trials":[],"software_tools":[],"database_accessions":[],"source":"live","citation_network_status":"fetched"},"created_at":"2026-06-07T23:41:17.451093Z","pmid":"33067450","pmcid":"PMC7568584","fwci":null,"citation_percentile":null,"influential_citations":0,"oa_status":"gold","license":"cc-by","views":0,"total_file_size_bytes":0,"version_count":0,"fair_f":65.0,"fair_a":72.5,"fair_i":37.5,"fair_r":41.6667,"fair_zscore":0.8101,"fair_rationale":{"fair_score":54.17,"has_llm":true,"dimensions":{"F":{"name":"Findable","score":65.0,"criteria":[{"key":"f_has_doi","label":"Has a persistent DOI","kind":"deterministic","weight":1.0,"fraction":1.0,"signal":"DOI present","rationale":null},{"key":"f_repository_presence","label":"Indexed in repositories / literature DBs","kind":"deterministic","weight":1.0,"fraction":1.0,"signal":"datacite=0, pmcid=True, pmid=True","rationale":null},{"key":"f_persistent_ids","label":"Resolvable scholarly identifiers (OpenAlex)","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"no OpenAlex id","rationale":null},{"key":"f_metadata_richness","label":"Rich, machine-readable metadata","kind":"llm","weight":1.0,"fraction":0.5,"signal":null,"rationale":"The paper mentions FAIR principles and neXtProt as a knowledgebase, but does not describe machine-readable metadata or structured metadata formats for the data itself."}]},"A":{"name":"Accessible","score":72.5,"criteria":[{"key":"a_open_access","label":"Open Access / files deposited","kind":"deterministic","weight":1.5,"fraction":0.5,"signal":"files/OA location present but not flagged OA","rationale":null},{"key":"a_retrievable","label":"Free full text retrievable","kind":"deterministic","weight":1.0,"fraction":1.0,"signal":"26 OA location(s)","rationale":null},{"key":"a_access_protocol","label":"Clear data/code access 
protocol","kind":"llm","weight":1.0,"fraction":0.75,"signal":null,"rationale":"The paper states that data are deposited in ProteomeXchange with PXD identifiers and that neXtProt is publicly accessible, but does not provide a direct link or explicit protocol for accessing the underlying data."}]},"I":{"name":"Interoperable","score":37.5,"criteria":[{"key":"i_linked_data","label":"Linked datasets / DataCite relations","kind":"deterministic","weight":1.0,"fraction":0.0,"signal":"linked_datasets=0, datacite=0","rationale":null},{"key":"i_standard_ids","label":"References data via standard accessions","kind":"deterministic","weight":1.0,"fraction":0.0,"signal":"accessions=0, trials=0","rationale":null},{"key":"i_standards","label":"Standard formats, vocabularies & identifiers","kind":"llm","weight":1.0,"fraction":0.75,"signal":null,"rationale":"The paper uses standard identifiers (PXD, UniProt, neXtProt) and mentions community standards (MIAPE, HPP guidelines), but does not specify use of standard vocabularies or formats for all data types."}]},"R":{"name":"Reusable","score":41.67,"criteria":[{"key":"r_license","label":"Clear, open reuse license","kind":"deterministic","weight":1.5,"fraction":0.0,"signal":"no license","rationale":null},{"key":"r_downloads","label":"Demonstrated reuse (downloads)","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"downloads=0","rationale":null},{"key":"r_version","label":"Versioned / maintained","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"no version chain","rationale":null},{"key":"r_dataset","label":"Classified as a data resource","kind":"deterministic","weight":0.5,"fraction":1.0,"signal":"is_dataset","rationale":null},{"key":"r_reusability","label":"Data-availability statement, license & reproducibility","kind":"llm","weight":2.0,"fraction":0.667,"signal":null,"rationale":"The paper includes a Creative Commons Attribution 4.0 license and mentions FAIR principles, but lacks a clear data-availability statement 
specifying how to access the exact dataset and does not provide code or detailed reproducibility steps."}]}},"suggestions":["Include a dedicated data-availability statement with direct URLs to the deposited datasets and code.","Provide metadata in a machine-readable format (e.g., JSON-LD or RDF) for the key results.","Specify the exact version of the neXtProt release and include a persistent identifier (e.g., DOI) for the dataset.","Add a reproducibility section describing software versions, parameters, and analysis workflows.","Use standard ontologies (e.g., OBI, PSI-MOD) for describing experimental methods and results."],"model":"deepseek/deepseek-v4-flash","agent_version":"fair_agent_v2","fulltext_source":"epmc_xml"},"fair_model":"deepseek/deepseek-v4-flash","fair_agent_version":"fair_agent_v2","fair_fulltext_source":"epmc_xml","fair_has_llm":true,"fair_computed_at":"2026-06-18T00:36:48.508519Z","clinical_trials":[],"software_tools":[],"db_accessions":[],"linked_datasets":[],"topics":[]}