{"doi":"10.5281/zenodo.7157285","title":"The Translational Data Catalog - discoverable biomedical datasets","abstract":"The discoverability of datasets resulting from the diverse range of translational and biomedical projects remains sporadic. It is especially difficult for datasets emerging from pre-competitive projects, often due to the legal constraints of data-sharing agreements, and the different priorities of the private and public sectors. The Translational Data Catalog is a single discovery point for the projects and datasets produced by a number of major research programmes funded by the European Commission. Funded by and rooted in a number of these European private-public partnership projects, the Data Catalog is built on FAIR-enabling community standards, and its mission is to ensure that datasets are findable and accessible by machines. Here we present its creation, content, value and adoption, as well as the next steps for sustainability within the ELIXIR ecosystem.","journal":"Zenodo (CERN European Organization for Nuclear Research)","year":2022,"id":6007,"datarank":0.10397207708399181,"base_score":0.6931471805599453,"endowment":0.6931471805599453,"self_citation_contribution":0.10397207708399181,"citation_network_contribution":0.0,"self_endowment_contribution":0.10397207708399181,"citer_contribution":0.0,"corpus_percentile":37.91700569568755,"corpus_rank":716,"citation_count":1,"citer_count":0,"citers_with_citation_signal":0,"citers_with_endowment":0,"datacite_reuse_total":0,"is_dataset":true,"is_dataset_confidence":0.9013,"is_oa":true,"file_count":1,"downloads":201,"has_version_chain":false,"published_date":"2022-10-07","fair_score":31.25,"fair_percentile":12.708883025505717,"algorithm_id":"datarank_citation_only_1hop_v6","ranking_scope":"data_only","authors":[{"id":289,"name":"Rocca-Serra, Philippe","orcid":"0000-0001-9853-5668","position":1,"is_corresponding":false},{"id":2540,"name":"Daniel J. B. Clarke","orcid":"0000-0003-3471-7416","position":2,"is_corresponding":false},{"id":56840,"name":"Nirmeen Sallam","orcid":null,"position":3,"is_corresponding":false},{"id":56841,"name":"François Ancien","orcid":"0000-0002-0895-1746","position":4,"is_corresponding":false},{"id":56842,"name":"Abetare Shabani","orcid":null,"position":5,"is_corresponding":false},{"id":56843,"name":"Saeideh Asariardakani","orcid":null,"position":6,"is_corresponding":false},{"id":31883,"name":"Pinar Alper","orcid":"0000-0002-2224-0780","position":7,"is_corresponding":false},{"id":56844,"name":"Soumyabrata Ghosh","orcid":"0000-0003-0659-6733","position":8,"is_corresponding":false},{"id":2487,"name":"Tony Burdett","orcid":"0000-0002-2513-5396","position":9,"is_corresponding":false},{"id":290,"name":"Susanna‐Assunta Sansone","orcid":"0000-0001-5306-5690","position":10,"is_corresponding":false},{"id":2482,"name":"Wei Gu","orcid":"0000-0003-3951-6680","position":11,"is_corresponding":false},{"id":35903,"name":"Venkata Satagopam","orcid":"0000-0002-6532-5880","position":12,"is_corresponding":false},{"id":2498,"name":"Danielle Welter","orcid":"0000-0003-1058-2668","position":0,"is_corresponding":false}],"reference_count":0,"raw_metadata":{"citation_network_status":"fetched"},"created_at":"2026-03-01T18:20:47.508186Z","pmid":null,"pmcid":null,"fwci":null,"citation_percentile":null,"influential_citations":0,"oa_status":"green","license":"cc-by","views":0,"total_file_size_bytes":0,"version_count":0,"fair_f":37.0,"fair_a":53.0,"fair_i":5.0,"fair_r":30.0,"fair_zscore":-1.2629,"fair_rationale":{"fair_score":31.25,"has_llm":true,"dimensions":{"F":{"name":"Findable","score":37.0,"criteria":[{"key":"f_has_doi","label":"Has a persistent DOI","kind":"deterministic","weight":1.0,"fraction":1.0,"signal":"DOI present","rationale":null},{"key":"f_repository_presence","label":"Indexed in repositories / literature DBs","kind":"deterministic","weight":1.0,"fraction":0.0,"signal":"datacite=0, pmcid=False, pmid=False","rationale":null},{"key":"f_persistent_ids","label":"Resolvable scholarly identifiers (OpenAlex)","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"no OpenAlex id","rationale":null},{"key":"f_metadata_richness","label":"Rich, machine-readable metadata","kind":"llm","weight":1.0,"fraction":0.25,"signal":null,"rationale":"The paper mentions FAIR-enabling community standards and findability/accessibility by machines, but provides no details on specific metadata elements or machine-readable formats."}]},"A":{"name":"Accessible","score":53.0,"criteria":[{"key":"a_open_access","label":"Open Access / files deposited","kind":"deterministic","weight":1.5,"fraction":1.0,"signal":"Open Access","rationale":null},{"key":"a_retrievable","label":"Free full text retrievable","kind":"deterministic","weight":1.0,"fraction":0.0,"signal":"0 OA location(s)","rationale":null},{"key":"a_access_protocol","label":"Clear data/code access protocol","kind":"llm","weight":1.0,"fraction":0.25,"signal":null,"rationale":"The paper describes the catalog as a discovery point but does not specify any access protocol (e.g., API, SPARQL endpoint) or terms for data access."}]},"I":{"name":"Interoperable","score":5.0,"criteria":[{"key":"i_linked_data","label":"Linked datasets / DataCite relations","kind":"deterministic","weight":1.0,"fraction":0.0,"signal":"linked_datasets=0, datacite=0","rationale":null},{"key":"i_standard_ids","label":"References data via standard accessions","kind":"deterministic","weight":1.0,"fraction":0.0,"signal":"accessions=0, trials=0","rationale":null},{"key":"i_standards","label":"Standard formats, vocabularies & identifiers","kind":"llm","weight":1.0,"fraction":0.25,"signal":null,"rationale":"The paper vaguely refers to FAIR-enabling community standards without naming specific vocabularies, formats, or persistent identifiers used."}]},"R":{"name":"Reusable","score":30.0,"criteria":[{"key":"r_license","label":"Clear, open reuse license","kind":"deterministic","weight":1.5,"fraction":0.0,"signal":"no license","rationale":null},{"key":"r_downloads","label":"Demonstrated reuse (downloads)","kind":"deterministic","weight":0.5,"fraction":1.0,"signal":"downloads=201","rationale":null},{"key":"r_version","label":"Versioned / maintained","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"no version chain","rationale":null},{"key":"r_dataset","label":"Classified as a data resource","kind":"deterministic","weight":0.5,"fraction":1.0,"signal":"is_dataset","rationale":null},{"key":"r_reusability","label":"Data-availability statement, license & reproducibility","kind":"llm","weight":2.0,"fraction":0.167,"signal":null,"rationale":"No data-availability statement, license, or reproducibility details are provided; the paper only discusses discovery, not reuse."}]}},"suggestions":["Provide explicit descriptions of the metadata schema, including properties like title, description, creator, and license.","Specify the machine-accessible protocol (e.g., REST API), authentication method, and access conditions.","Name the standard vocabularies (e.g., EDAM, Ontology for Biomedical Investigations) and identifier systems (e.g., DOI, ORCID) used.","Include a clear data-availability statement with an open license (e.g., CC0) and link to the catalog's terms of use.","Add reproducibility instructions, such as a versioned code repository, to enable reuse of the catalog's implementation."],"model":"deepseek/deepseek-v4-flash","agent_version":"fair_agent_v2","fulltext_source":"abstract_only"},"fair_model":"deepseek/deepseek-v4-flash","fair_agent_version":"fair_agent_v2","fair_fulltext_source":"abstract_only","fair_has_llm":true,"fair_computed_at":"2026-06-18T04:57:13.047416Z","clinical_trials":[],"software_tools":[],"db_accessions":[],"linked_datasets":[],"topics":[]}