{"doi":"10.1183/13993003.02057-2021","title":"The discovAIR project: a roadmap towards the Human Lung Cell Atlas","abstract":"The Human Cell Atlas (HCA) consortium aims to establish an atlas of all organs in the healthy human body at single-cell resolution to increase our understanding of basic biological processes that govern development, physiology and anatomy, and to accelerate diagnosis and treatment of disease. The Lung Biological Network of the HCA aims to generate the Human Lung Cell Atlas as a reference for the cellular repertoire, molecular cell states and phenotypes, and cell-cell interactions that characterise normal lung homeostasis in healthy lung tissue. Such a reference atlas of the healthy human lung will facilitate mapping the changes in the cellular landscape in disease. The discovAIR project is one of six pilot actions for the HCA funded by the European Commission in the context of the H2020 framework programme. discovAIR aims to establish the first draft of an integrated Human Lung Cell Atlas, combining single-cell transcriptional and epigenetic profiling with spatially resolving techniques on matched tissue samples, as well as including a number of chronic and infectious diseases of the lung. The integrated Human Lung Cell Atlas will be available as a resource for the wider respiratory community, including basic and translational scientists, clinical medicine, and the private sector, as well as for patients with lung disease and the interested lay public. We anticipate that the Human Lung Cell Atlas will be the founding stone for a more detailed understanding of the pathogenesis of lung diseases, guiding the design of novel diagnostics and preventive or curative interventions.","journal":"European Respiratory Journal","year":2022,"id":2126,"datarank":0.7862636270088788,"base_score":3.295836866004329,"endowment":3.295836866004329,"self_citation_contribution":0.4943755299006494,"citation_network_contribution":0.29188809710822927,"self_endowment_contribution":0.4943755299006494,"citer_contribution":0.29188809710822927,"corpus_percentile":55.899104963384865,"corpus_rank":543,"citation_count":26,"citer_count":16,"citers_with_citation_signal":15,"citers_with_endowment":15,"datacite_reuse_total":0,"is_dataset":true,"is_dataset_confidence":0.95,"is_oa":true,"file_count":0,"downloads":0,"has_version_chain":false,"published_date":"2022-01-27","fair_score":38.3333,"fair_percentile":19.217238346525946,"algorithm_id":"datarank_citation_only_1hop_v6","ranking_scope":"data_only","authors":[{"id":12114,"name":"Laure-Emmanuelle Zaragosi","orcid":"0000-0001-6747-7928","position":1,"is_corresponding":false},{"id":12112,"name":"Elo Madissoon","orcid":"0000-0003-2405-4911","position":2,"is_corresponding":false},{"id":52013,"name":"Avi Srivastava","orcid":"0000-0001-9798-2079","position":3,"is_corresponding":false},{"id":24362,"name":"Alexandra B. Firsova","orcid":"0000-0002-7345-7429","position":4,"is_corresponding":false},{"id":24363,"name":"Elena De Domenico","orcid":"0000-0003-0336-8284","position":5,"is_corresponding":false},{"id":24364,"name":"Louis Kümmerle","orcid":null,"position":6,"is_corresponding":false},{"id":24365,"name":"Adem Saglam","orcid":null,"position":7,"is_corresponding":false},{"id":12120,"name":"Marijn Berg","orcid":"0000-0002-9870-1571","position":8,"is_corresponding":false},{"id":2921,"name":"Janine Gote-Schniering","orcid":"0000-0001-7869-4936","position":10,"is_corresponding":false},{"id":578,"name":"Christoph H. Mayr","orcid":"0000-0001-5353-4768","position":11,"is_corresponding":false},{"id":24367,"name":"Xesús M. Abalo","orcid":"0000-0002-1643-0705","position":12,"is_corresponding":false},{"id":24368,"name":"Ludvig Larsson","orcid":"0000-0003-4209-2911","position":13,"is_corresponding":false},{"id":24369,"name":"Alexandros Sountoulidis","orcid":"0000-0002-8837-4642","position":14,"is_corresponding":false},{"id":3586,"name":"Sarah A. Teichmann","orcid":"0000-0002-6294-6366","position":15,"is_corresponding":false},{"id":24370,"name":"Karen van Eunen","orcid":"0000-0001-5603-1883","position":16,"is_corresponding":false},{"id":20869,"name":"Gerard H. Koppelman","orcid":"0000-0001-8567-3252","position":17,"is_corresponding":false},{"id":12129,"name":"Sylvie Leroy","orcid":"0000-0002-3465-8180","position":19,"is_corresponding":false},{"id":24372,"name":"Pippa Powell","orcid":"0000-0003-1828-8332","position":20,"is_corresponding":false},{"id":2956,"name":"Wim Timens","orcid":"0000-0002-4146-6363","position":22,"is_corresponding":false},{"id":481,"name":"Joakim Lundeberg","orcid":"0000-0003-4313-1601","position":23,"is_corresponding":false},{"id":2957,"name":"Maarten van den Berge","orcid":"0000-0002-9336-7340","position":24,"is_corresponding":false},{"id":24374,"name":"Mats Nilsson","orcid":"0000-0001-9985-0387","position":25,"is_corresponding":false},{"id":12153,"name":"Peter Horvath","orcid":null,"position":26,"is_corresponding":false},{"id":24376,"name":"Jessica Denning","orcid":null,"position":27,"is_corresponding":false},{"id":13898,"name":"Irene Papatheodorou","orcid":"0000-0001-7270-5470","position":28,"is_corresponding":false},{"id":4590,"name":"Joachim L. Schultze","orcid":"0000-0003-2812-9853","position":29,"is_corresponding":false},{"id":573,"name":"Herbert B. Schiller","orcid":"0000-0001-9498-7034","position":30,"is_corresponding":false},{"id":29918,"name":"Omer Ali Bayraktar","orcid":"0000-0001-6055-277X","position":31,"is_corresponding":false},{"id":24377,"name":"Ilya Petoukhov","orcid":null,"position":32,"is_corresponding":false},{"id":1699,"name":"Alexander V. Misharin","orcid":"0000-0003-2879-3789","position":33,"is_corresponding":false},{"id":20867,"name":"Ian M. Adcock","orcid":"0000-0003-2101-8843","position":34,"is_corresponding":false},{"id":12131,"name":"Michael von Papen","orcid":"0000-0001-5030-1643","position":35,"is_corresponding":false},{"id":42,"name":"Fabian Joachim Theis","orcid":"0000-0002-2419-1943","position":36,"is_corresponding":false},{"id":24378,"name":"Christos Samakovlis","orcid":"0000-0002-9153-6040","position":37,"is_corresponding":false},{"id":12175,"name":"Christine S. Falk","orcid":"0000-0003-1376-7318","position":38,"is_corresponding":false},{"id":2958,"name":"Martijn C. Nawijn","orcid":"0000-0003-3372-6521","position":39,"is_corresponding":false},{"id":12124,"name":"Aurore C. A. Gay","orcid":"0000-0002-1593-2674","position":40,"is_corresponding":false},{"id":24379,"name":"Kourosh Saeb‐Parsy","orcid":"0000-0002-0633-3696","position":41,"is_corresponding":false},{"id":19027,"name":"Uğis Sarkans","orcid":"0000-0001-9227-8488","position":42,"is_corresponding":false},{"id":12180,"name":"Péter Horváth","orcid":"0000-0002-4492-1798","position":43,"is_corresponding":false},{"id":3578,"name":"Malte D. Luecken","orcid":"0000-0001-7464-7921","position":0,"is_corresponding":true}],"reference_count":42,"raw_metadata":null,"created_at":"2026-03-01T18:20:47.508186Z","pmid":"35086829","pmcid":"PMC9386332","fwci":null,"citation_percentile":null,"influential_citations":0,"oa_status":"hybrid","license":"cc-by-nc","views":0,"total_file_size_bytes":0,"version_count":0,"fair_f":52.5,"fair_a":42.5,"fair_i":25.0,"fair_r":33.3333,"fair_zscore":-0.6222,"fair_rationale":{"fair_score":38.33,"has_llm":true,"dimensions":{"F":{"name":"Findable","score":52.5,"criteria":[{"key":"f_has_doi","label":"Has a persistent DOI","kind":"deterministic","weight":1.0,"fraction":1.0,"signal":"DOI present","rationale":null},{"key":"f_repository_presence","label":"Indexed in repositories / literature DBs","kind":"deterministic","weight":1.0,"fraction":1.0,"signal":"datacite=0, pmcid=True, pmid=True","rationale":null},{"key":"f_persistent_ids","label":"Resolvable scholarly identifiers (OpenAlex)","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"no OpenAlex id","rationale":null},{"key":"f_metadata_richness","label":"Rich, machine-readable metadata","kind":"llm","weight":1.0,"fraction":0.25,"signal":null,"rationale":"The paper describes the project but does not provide or link to any machine-readable metadata; only URLs to portals and GitHub are mentioned."}]},"A":{"name":"Accessible","score":42.5,"criteria":[{"key":"a_open_access","label":"Open Access / files deposited","kind":"deterministic","weight":1.5,"fraction":1.0,"signal":"Open Access","rationale":null},{"key":"a_retrievable","label":"Free full text retrievable","kind":"deterministic","weight":1.0,"fraction":0.0,"signal":"0 OA location(s)","rationale":null},{"key":"a_access_protocol","label":"Clear data/code access protocol","kind":"llm","weight":1.0,"fraction":0.25,"signal":null,"rationale":"The paper states data will be 'freely available' and mentions repositories like ArrayExpress, but no specific access protocol, persistent identifiers, or direct download links for the final data are provided."}]},"I":{"name":"Interoperable","score":25.0,"criteria":[{"key":"i_linked_data","label":"Linked datasets / DataCite relations","kind":"deterministic","weight":1.0,"fraction":0.0,"signal":"linked_datasets=0, datacite=0","rationale":null},{"key":"i_standard_ids","label":"References data via standard accessions","kind":"deterministic","weight":1.0,"fraction":0.0,"signal":"accessions=0, trials=0","rationale":null},{"key":"i_standards","label":"Standard formats, vocabularies & identifiers","kind":"llm","weight":1.0,"fraction":0.5,"signal":null,"rationale":"The paper references ontologies (e.g., lung ontologies, cell-type labels) and standard formats (e.g., FASTQ) implicitly but does not confirm that all outputs use community-standard vocabularies or formats with explicit references."}]},"R":{"name":"Reusable","score":33.33,"criteria":[{"key":"r_license","label":"Clear, open reuse license","kind":"deterministic","weight":1.5,"fraction":0.0,"signal":"no license","rationale":null},{"key":"r_downloads","label":"Demonstrated reuse (downloads)","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"downloads=0","rationale":null},{"key":"r_version","label":"Versioned / maintained","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"no version chain","rationale":null},{"key":"r_dataset","label":"Classified as a data resource","kind":"deterministic","weight":0.5,"fraction":1.0,"signal":"is_dataset","rationale":null},{"key":"r_reusability","label":"Data-availability statement, license & reproducibility","kind":"llm","weight":2.0,"fraction":0.5,"signal":null,"rationale":"The manuscript includes an open-access license (CC BY-NC 4.0) and mentions sharing methods via protocols.io and GitHub, but lacks a formal data-availability statement for the final atlas, no software versioning, and no explicit reproducibility instructions for analyses."}]}},"suggestions":["Deposit final atlas data in a FAIR-aligned repository (e.g., ArrayExpress, GEO) with persistent identifiers and structured metadata.","Provide a formal data-availability statement specifying exact repository names, accession numbers, and terms of reuse.","Define and use community-standard ontologies for all cell-type and anatomical annotations in machine-readable form.","Publish the exact computational workflow (e.g., as a container or Snakemake workflow) with version-controlled code to ensure reproducibility."],"model":"deepseek/deepseek-v4-flash","agent_version":"fair_agent_v2","fulltext_source":"epmc_xml"},"fair_model":"deepseek/deepseek-v4-flash","fair_agent_version":"fair_agent_v2","fair_fulltext_source":"epmc_xml","fair_has_llm":true,"fair_computed_at":"2026-06-18T00:46:10.122107Z","clinical_trials":[],"software_tools":[],"db_accessions":[],"linked_datasets":[],"topics":[]}