{"doi":"10.1101/2022.03.10.483747","title":"An integrated cell atlas of the human lung in health and disease","abstract":"<h4>ABSTRACT</h4> Organ- and body-scale cell atlases have the potential to transform our understanding of human biology. To capture the variability present in the population, these atlases must include diverse demographics such as age and ethnicity from both healthy and diseased individuals. The growth in both size and number of single-cell datasets, combined with recent advances in computational techniques, for the first time makes it possible to generate such comprehensive large-scale atlases through integration of multiple datasets. Here, we present the integrated Human Lung Cell Atlas (HLCA) combining 46 datasets of the human respiratory system into a single atlas spanning over 2.2 million cells from 444 individuals across health and disease. The HLCA contains a consensus re-annotation of published and newly generated datasets, resolving under- or misannotation of 59% of cells in the original datasets. The HLCA enables recovery of rare cell types, provides consensus marker genes for each cell type, and uncovers gene modules associated with demographic covariates and anatomical location within the respiratory system. To facilitate the use of the HLCA as a reference for single-cell lung research and allow rapid analysis of new data, we provide an interactive web portal to project datasets onto the HLCA. Finally, we demonstrate the value of the HLCA reference for interpreting disease-associated changes. Thus, the HLCA outlines a roadmap for the development and use of organ-scale cell atlases within the Human Cell Atlas.","journal":null,"year":2022,"id":3465,"datarank":2.724036571064559,"base_score":4.564348191467836,"endowment":4.564348191467836,"self_citation_contribution":0.6846522287201755,"citation_network_contribution":2.0393843423443836,"self_endowment_contribution":0.6846522287201755,"citer_contribution":2.0393843423443836,"corpus_percentile":66.88364524003255,"corpus_rank":408,"citation_count":97,"citer_count":76,"citers_with_citation_signal":68,"citers_with_endowment":68,"datacite_reuse_total":0,"is_dataset":true,"is_dataset_confidence":0.89,"is_oa":true,"file_count":0,"downloads":0,"has_version_chain":false,"published_date":"2022-03-11","fair_score":27.9167,"fair_percentile":8.839050131926122,"algorithm_id":"datarank_citation_only_1hop_v6","ranking_scope":"data_only","authors":[{"id":11446,"name":"Daniel Strobl","orcid":"0000-0002-5516-7057","position":1,"is_corresponding":false},{"id":11447,"name":"Luke Zappia","orcid":"0000-0001-7744-8565","position":2,"is_corresponding":false},{"id":12113,"name":"Nikolay S. Markov","orcid":"0000-0002-3659-4387","position":4,"is_corresponding":false},{"id":12114,"name":"Laure-Emmanuelle Zaragosi","orcid":"0000-0001-6747-7928","position":5,"is_corresponding":false},{"id":12116,"name":"Marie-Jeanne Arguel","orcid":"0000-0002-8725-2555","position":7,"is_corresponding":false},{"id":12119,"name":"Christophe Bécavin","orcid":"0000-0003-1555-3153","position":9,"is_corresponding":false},{"id":12120,"name":"Marijn Berg","orcid":"0000-0002-9870-1571","position":10,"is_corresponding":false},{"id":36181,"name":"A Collin","orcid":null,"position":13,"is_corresponding":false},{"id":36182,"name":"A.C.A. Gay","orcid":null,"position":14,"is_corresponding":false},{"id":12125,"name":"Baharak Hooshiar Kashani","orcid":"0000-0001-8665-8935","position":15,"is_corresponding":false},{"id":12126,"name":"Manu Jain","orcid":"0000-0003-1534-5629","position":16,"is_corresponding":false},{"id":36183,"name":"T Kapellos","orcid":null,"position":17,"is_corresponding":false},{"id":12179,"name":"Tessa Kole","orcid":"0000-0002-5176-6300","position":18,"is_corresponding":false},{"id":12131,"name":"Michael von Papen","orcid":"0000-0001-5030-1643","position":20,"is_corresponding":false},{"id":36185,"name":"L Peter","orcid":null,"position":21,"is_corresponding":false},{"id":69428,"name":"Zoe Piran","orcid":"0000-0003-0241-8948","position":22,"is_corresponding":false},{"id":2921,"name":"Janine Gote-Schniering","orcid":"0000-0001-7869-4936","position":23,"is_corresponding":false},{"id":10424,"name":"C. Taylor","orcid":"0000-0001-6816-8051","position":24,"is_corresponding":false},{"id":12135,"name":"Chuan Xu","orcid":"0000-0002-6265-999X","position":26,"is_corresponding":false},{"id":36188,"name":"LT Bui","orcid":null,"position":27,"is_corresponding":false},{"id":12137,"name":"Carlo De Donno","orcid":"0000-0002-9553-0121","position":28,"is_corresponding":false},{"id":12138,"name":"Leander Dony","orcid":"0000-0001-5697-6991","position":29,"is_corresponding":false},{"id":12139,"name":"Minzhe Guo","orcid":"0000-0002-5502-9172","position":30,"is_corresponding":false},{"id":36189,"name":"AJ Gutierrez","orcid":null,"position":31,"is_corresponding":false},{"id":3563,"name":"Lukas Heumos","orcid":"0000-0002-8937-3457","position":32,"is_corresponding":false},{"id":12141,"name":"Ni Huang","orcid":"0000-0001-8849-038X","position":33,"is_corresponding":false},{"id":12142,"name":"Ignacio Ibarra Del Río","orcid":"0000-0002-0582-002X","position":34,"is_corresponding":false},{"id":12144,"name":"Preetish Kadur Lakshminarasimha Murthy","orcid":"0000-0002-9762-8376","position":36,"is_corresponding":false},{"id":1694,"name":"Mohammad Lotfollahi","orcid":"0000-0001-6858-7985","position":37,"is_corresponding":false},{"id":1700,"name":"Carlos Talavera‐López","orcid":"0000-0001-8590-2393","position":39,"is_corresponding":false},{"id":12147,"name":"Kyle J. Travaglini","orcid":"0000-0003-3164-6448","position":40,"is_corresponding":false},{"id":12149,"name":"Kaylee B. Worlock","orcid":"0000-0002-5656-7634","position":42,"is_corresponding":false},{"id":12150,"name":"Masahiro Yoshida","orcid":"0000-0002-3521-5322","position":43,"is_corresponding":false},{"id":11069,"name":"Yuexin Chen","orcid":"0000-0001-6280-4918","position":44,"is_corresponding":false},{"id":12161,"name":"Tushar J. Desai","orcid":"0000-0002-8794-5319","position":45,"is_corresponding":false},{"id":572,"name":"Oliver Eickelberg","orcid":"0000-0001-7170-0360","position":46,"is_corresponding":false},{"id":12175,"name":"Christine S. Falk","orcid":"0000-0003-1376-7318","position":47,"is_corresponding":false},{"id":12162,"name":"Naftali Kaminski","orcid":"0000-0001-5917-4601","position":48,"is_corresponding":false},{"id":7274,"name":"Robert Lafyatis","orcid":"0000-0002-9398-5034","position":50,"is_corresponding":false},{"id":5347,"name":"Nancy L. Pedersen","orcid":"0000-0001-8057-3543","position":51,"is_corresponding":false},{"id":36193,"name":"J Powell","orcid":"0000-0001-9031-6356","position":52,"is_corresponding":false},{"id":3190,"name":"Jayaraj Rajagopal","orcid":"0000-0002-4122-177X","position":53,"is_corresponding":false},{"id":36194,"name":"O Rozenblatt-Rosen","orcid":null,"position":54,"is_corresponding":false},{"id":12167,"name":"Max A. Seibold","orcid":"0000-0002-8685-4263","position":55,"is_corresponding":false},{"id":12169,"name":"Douglas P. Shepherd","orcid":"0000-0001-9087-0832","position":57,"is_corresponding":false},{"id":3586,"name":"Sarah A. Teichmann","orcid":"0000-0002-6294-6366","position":58,"is_corresponding":false},{"id":12171,"name":"Alexander M. Tsankov","orcid":"0000-0002-7955-4414","position":59,"is_corresponding":false},{"id":36197,"name":"J Whitsett","orcid":null,"position":60,"is_corresponding":false},{"id":27137,"name":"Y. Xu","orcid":"0000-0001-9563-4804","position":61,"is_corresponding":false},{"id":12172,"name":"Nicholas E. Banovich","orcid":"0000-0003-2604-3247","position":62,"is_corresponding":false},{"id":29918,"name":"Omer Ali Bayraktar","orcid":"0000-0001-6055-277X","position":63,"is_corresponding":false},{"id":12174,"name":"Thu Elizabeth Duong","orcid":"0000-0001-7122-4448","position":64,"is_corresponding":false},{"id":12177,"name":"Jonathan A. Kropski","orcid":"0000-0002-8923-1344","position":66,"is_corresponding":false},{"id":20865,"name":"Paul A. Reyfman","orcid":"0000-0002-6435-6001","position":67,"is_corresponding":false},{"id":573,"name":"Herbert B. Schiller","orcid":"0000-0001-9498-7034","position":68,"is_corresponding":false},{"id":12178,"name":"Purushothama Rao Tata","orcid":"0000-0003-4837-0337","position":69,"is_corresponding":false},{"id":4590,"name":"Joachim L. Schultze","orcid":"0000-0003-2812-9853","position":70,"is_corresponding":false},{"id":1699,"name":"Alexander V. Misharin","orcid":"0000-0003-2879-3789","position":71,"is_corresponding":false},{"id":2958,"name":"Martijn C. Nawijn","orcid":"0000-0003-3372-6521","position":72,"is_corresponding":false},{"id":3578,"name":"Malte D. Luecken","orcid":"0000-0001-7464-7921","position":73,"is_corresponding":false},{"id":42,"name":"Fabian Joachim Theis","orcid":"0000-0002-2419-1943","position":74,"is_corresponding":false},{"id":12112,"name":"Elo Madissoon","orcid":"0000-0003-2405-4911","position":75,"is_corresponding":false},{"id":537,"name":"Meshal Ansari","orcid":"0000-0002-8819-7965","position":76,"is_corresponding":false},{"id":12117,"name":"Leonie Apperloo","orcid":null,"position":77,"is_corresponding":false},{"id":12121,"name":"Evgeny Chichelnitskiy","orcid":"0000-0002-6341-4177","position":78,"is_corresponding":false},{"id":12122,"name":"Mei-i Chung","orcid":null,"position":79,"is_corresponding":false},{"id":36198,"name":"Anne Collin","orcid":"0000-0002-3410-6108","position":80,"is_corresponding":false},{"id":36199,"name":"Theodoros Kapellos","orcid":null,"position":81,"is_corresponding":false},{"id":578,"name":"Christoph H. Mayr","orcid":"0000-0001-5353-4768","position":82,"is_corresponding":false},{"id":36200,"name":"Peter N. Le Souëf","orcid":"0000-0003-0930-1654","position":83,"is_corresponding":false},{"id":7247,"name":"Ciro Ramírez-Suástegui","orcid":"0000-0001-8126-710X","position":84,"is_corresponding":false},{"id":12133,"name":"Chase J. Taylor","orcid":"0000-0002-7942-0483","position":85,"is_corresponding":false},{"id":12134,"name":"Thomas Walzthoeni","orcid":null,"position":86,"is_corresponding":false},{"id":12143,"name":"Nathan D. Jackson","orcid":"0009-0008-2675-021X","position":87,"is_corresponding":false},{"id":12145,"name":"Tracy Tabib","orcid":"0000-0003-4053-988X","position":88,"is_corresponding":false},{"id":12148,"name":"Anna Wilbrey-Clark","orcid":"0000-0002-5399-7997","position":89,"is_corresponding":false},{"id":12165,"name":"Joseph E. Powell","orcid":"0000-0002-5070-4124","position":90,"is_corresponding":false},{"id":2697,"name":"Orit Rozenblatt–Rosen","orcid":"0000-0001-6313-3570","position":91,"is_corresponding":false},{"id":12168,"name":"Dean Sheppard","orcid":"0000-0002-6277-2036","position":92,"is_corresponding":false},{"id":36201,"name":"J. A. Whitsett","orcid":null,"position":93,"is_corresponding":false},{"id":3062,"name":"Yan Xu","orcid":"0000-0002-2832-2664","position":94,"is_corresponding":false},{"id":52013,"name":"Avi Srivastava","orcid":"0000-0001-9798-2079","position":0,"is_corresponding":true}],"reference_count":100,"raw_metadata":null,"created_at":"2026-03-01T18:20:47.508186Z","pmid":null,"pmcid":null,"fwci":null,"citation_percentile":null,"influential_citations":0,"oa_status":"green","license":"cc-by-nd","views":0,"total_file_size_bytes":0,"version_count":0,"fair_f":37.0,"fair_a":53.0,"fair_i":5.0,"fair_r":16.6667,"fair_zscore":-1.5644,"fair_rationale":{"fair_score":27.92,"has_llm":true,"dimensions":{"F":{"name":"Findable","score":37.0,"criteria":[{"key":"f_has_doi","label":"Has a persistent DOI","kind":"deterministic","weight":1.0,"fraction":1.0,"signal":"DOI present","rationale":null},{"key":"f_repository_presence","label":"Indexed in repositories / literature DBs","kind":"deterministic","weight":1.0,"fraction":0.0,"signal":"datacite=0, pmcid=False, pmid=False","rationale":null},{"key":"f_persistent_ids","label":"Resolvable scholarly identifiers (OpenAlex)","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"no OpenAlex id","rationale":null},{"key":"f_metadata_richness","label":"Rich, machine-readable metadata","kind":"llm","weight":1.0,"fraction":0.25,"signal":null,"rationale":"The paper mentions a web portal for projection but does not describe any machine-readable metadata or structured metadata standards."}]},"A":{"name":"Accessible","score":53.0,"criteria":[{"key":"a_open_access","label":"Open Access / files deposited","kind":"deterministic","weight":1.5,"fraction":1.0,"signal":"Open Access","rationale":null},{"key":"a_retrievable","label":"Free full text retrievable","kind":"deterministic","weight":1.0,"fraction":0.0,"signal":"0 OA location(s)","rationale":null},{"key":"a_access_protocol","label":"Clear data/code access protocol","kind":"llm","weight":1.0,"fraction":0.25,"signal":null,"rationale":"The paper states an interactive web portal is provided but does not specify a clear protocol for accessing the underlying data or code."}]},"I":{"name":"Interoperable","score":5.0,"criteria":[{"key":"i_linked_data","label":"Linked datasets / DataCite relations","kind":"deterministic","weight":1.0,"fraction":0.0,"signal":"linked_datasets=0, datacite=0","rationale":null},{"key":"i_standard_ids","label":"References data via standard accessions","kind":"deterministic","weight":1.0,"fraction":0.0,"signal":"accessions=0, trials=0","rationale":null},{"key":"i_standards","label":"Standard formats, vocabularies & identifiers","kind":"llm","weight":1.0,"fraction":0.25,"signal":null,"rationale":"The paper does not mention use of standard formats, controlled vocabularies, or persistent identifiers for the integrated data."}]},"R":{"name":"Reusable","score":16.67,"criteria":[{"key":"r_license","label":"Clear, open reuse license","kind":"deterministic","weight":1.5,"fraction":0.0,"signal":"no license","rationale":null},{"key":"r_downloads","label":"Demonstrated reuse (downloads)","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"downloads=0","rationale":null},{"key":"r_version","label":"Versioned / maintained","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"no version chain","rationale":null},{"key":"r_dataset","label":"Classified as a data resource","kind":"deterministic","weight":0.5,"fraction":1.0,"signal":"is_dataset","rationale":null},{"key":"r_reusability","label":"Data-availability statement, license & reproducibility","kind":"llm","weight":2.0,"fraction":0.167,"signal":null,"rationale":"The paper lacks a data-availability statement, license information, and details on reproducibility beyond the web portal."}]}},"suggestions":["Provide a data-availability statement with a persistent identifier (e.g., DOI) for the integrated atlas and code.","Include a clear license (e.g., CC-BY) for the data and code to enable reuse.","Describe the use of standard file formats (e.g., HDF5, AnnData) and controlled vocabularies (e.g., Cell Ontology) for cell types.","Add machine-readable metadata (e.g., structured JSON-LD) to the web portal for automated discovery.","Specify a protocol for accessing raw data and code, such as a GitHub repository with versioning and documentation."],"model":"deepseek/deepseek-v4-flash","agent_version":"fair_agent_v2","fulltext_source":"abstract_only"},"fair_model":"deepseek/deepseek-v4-flash","fair_agent_version":"fair_agent_v2","fair_fulltext_source":"abstract_only","fair_has_llm":true,"fair_computed_at":"2026-06-18T00:40:01.219381Z","clinical_trials":[],"software_tools":[],"db_accessions":[],"linked_datasets":[],"topics":[]}