{"doi":"10.1038/s41597-022-01899-x","title":"MIMIC-IV, a freely accessible electronic health record dataset","abstract":"Digital data collection during routine clinical practice is now ubiquitous within hospitals. The data contains valuable information on the care of patients and their response to treatments, offering exciting opportunities for research. Typically, data are stored within archival systems that are not intended to support research. These systems are often inaccessible to researchers and structured for optimal storage, rather than interpretability and analysis. Here we present MIMIC-IV, a publicly available database sourced from the electronic health record of the Beth Israel Deaconess Medical Center. Information available includes patient measurements, orders, diagnoses, procedures, treatments, and deidentified free-text clinical notes. MIMIC-IV is intended to support a wide array of research studies and educational material, helping to reduce barriers to conducting clinical research.","journal":"Scientific Data","year":2023,"id":4854,"datarank":13.03519131955881,"base_score":7.720905251936779,"endowment":7.720905251936779,"self_citation_contribution":1.158135787790517,"citation_network_contribution":11.877055531768294,"self_endowment_contribution":1.158135787790517,"citer_contribution":11.877055531768294,"corpus_percentile":84.62164361269325,"corpus_rank":190,"citation_count":2640,"citer_count":190,"citers_with_citation_signal":190,"citers_with_endowment":190,"datacite_reuse_total":25,"is_dataset":true,"is_dataset_confidence":0.927,"is_oa":true,"file_count":0,"downloads":0,"has_version_chain":false,"published_date":"2023-01-03","fair_score":59.1667,"fair_percentile":92.10642040457344,"algorithm_id":"datarank_citation_only_1hop_v6","ranking_scope":"data_only","authors":[{"id":50502,"name":"Lucas Bulgarelli","orcid":"0000-0001-5456-2170","position":1,"is_corresponding":false},{"id":50503,"name":"Lu Shen","orcid":null,"position":2,"is_corresponding":false},{"id":50504,"name":"Alvin Gayles","orcid":null,"position":3,"is_corresponding":false},{"id":50505,"name":"Ayad Shammout","orcid":null,"position":4,"is_corresponding":false},{"id":50506,"name":"Steven Horng","orcid":"0000-0002-0958-1820","position":5,"is_corresponding":false},{"id":14627,"name":"Tom J. Pollard","orcid":"0000-0002-5676-7898","position":6,"is_corresponding":false},{"id":50507,"name":"Sicheng Hao","orcid":"0000-0002-8905-005X","position":7,"is_corresponding":false},{"id":50508,"name":"Benjamin Moody","orcid":null,"position":8,"is_corresponding":false},{"id":50509,"name":"Brian Gow","orcid":"0000-0002-7682-1943","position":9,"is_corresponding":false},{"id":50510,"name":"Li-wei H. Lehman","orcid":"0000-0002-3782-9977","position":10,"is_corresponding":false},{"id":4662,"name":"Leo Anthony Celi","orcid":"0000-0001-6712-6626","position":11,"is_corresponding":false},{"id":50511,"name":"Roger G. Mark","orcid":"0000-0002-6318-2978","position":12,"is_corresponding":false},{"id":50501,"name":"Alistair E. W. Johnson","orcid":"0000-0002-8735-3014","position":0,"is_corresponding":true}],"reference_count":28,"raw_metadata":{"citation_network_status":"fetched"},"created_at":"2026-03-01T18:20:47.508186Z","pmid":"36596836","pmcid":"PMC9810617","fwci":null,"citation_percentile":null,"influential_citations":0,"oa_status":"gold","license":"cc-by","views":0,"total_file_size_bytes":0,"version_count":0,"fair_f":52.5,"fair_a":67.5,"fair_i":75.0,"fair_r":41.6667,"fair_zscore":1.2623,"fair_rationale":{"fair_score":59.17,"has_llm":true,"dimensions":{"F":{"name":"Findable","score":52.5,"criteria":[{"key":"f_has_doi","label":"Has a persistent DOI","kind":"deterministic","weight":1.0,"fraction":1.0,"signal":"DOI present","rationale":null},{"key":"f_repository_presence","label":"Indexed in repositories / literature DBs","kind":"deterministic","weight":1.0,"fraction":1.0,"signal":"datacite=25, pmcid=True, pmid=True","rationale":null},{"key":"f_persistent_ids","label":"Resolvable scholarly identifiers (OpenAlex)","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"no OpenAlex id","rationale":null},{"key":"f_metadata_richness","label":"Rich, machine-readable metadata","kind":"llm","weight":1.0,"fraction":0.25,"signal":null,"rationale":"The paper does not describe any machine-readable metadata (e.g., schema.org, DCAT) for the dataset."}]},"A":{"name":"Accessible","score":67.5,"criteria":[{"key":"a_open_access","label":"Open Access / files deposited","kind":"deterministic","weight":1.5,"fraction":1.0,"signal":"Open Access","rationale":null},{"key":"a_retrievable","label":"Free full text retrievable","kind":"deterministic","weight":1.0,"fraction":0.0,"signal":"0 OA location(s)","rationale":null},{"key":"a_access_protocol","label":"Clear data/code access protocol","kind":"llm","weight":1.0,"fraction":0.75,"signal":null,"rationale":"The access protocol is clearly described (PhysioNet, training, DUA), but it requires registration and is not fully open."}]},"I":{"name":"Interoperable","score":75.0,"criteria":[{"key":"i_linked_data","label":"Linked datasets / DataCite relations","kind":"deterministic","weight":1.0,"fraction":1.0,"signal":"linked_datasets=0, datacite=25","rationale":null},{"key":"i_standard_ids","label":"References data via standard accessions","kind":"deterministic","weight":1.0,"fraction":0.0,"signal":"accessions=0, trials=0","rationale":null},{"key":"i_standards","label":"Standard formats, vocabularies & identifiers","kind":"llm","weight":1.0,"fraction":1.0,"signal":null,"rationale":"The dataset uses standard formats (CSV, SQL), standard vocabularies (ICD, DRG, HCPCS), and consistent identifiers (subject_id, hadm_id)."}]},"R":{"name":"Reusable","score":41.67,"criteria":[{"key":"r_license","label":"Clear, open reuse license","kind":"deterministic","weight":1.5,"fraction":0.0,"signal":"no license","rationale":null},{"key":"r_downloads","label":"Demonstrated reuse (downloads)","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"downloads=0","rationale":null},{"key":"r_version","label":"Versioned / maintained","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"no version chain","rationale":null},{"key":"r_dataset","label":"Classified as a data resource","kind":"deterministic","weight":0.5,"fraction":1.0,"signal":"is_dataset","rationale":null},{"key":"r_reusability","label":"Data-availability statement, license & reproducibility","kind":"llm","weight":2.0,"fraction":0.667,"signal":null,"rationale":"The data is available under a CC BY 4.0 license, with DOIs and a code repository, but the full build process is not publicly reproducible."}]}},"suggestions":["Add machine-readable metadata (e.g., JSON-LD with schema.org/Dataset) to the PhysioNet landing page.","Publish a synthetic version of the data or detailed build scripts to improve reproducibility.","Provide a machine-readable data dictionary (e.g., CSV or JSON) for all tables and columns.","Ensure all code repositories have persistent identifiers and versioning (e.g., Zenodo DOIs for each release)."],"model":"deepseek/deepseek-v4-flash","agent_version":"fair_agent_v2","fulltext_source":"epmc_xml"},"fair_model":"deepseek/deepseek-v4-flash","fair_agent_version":"fair_agent_v2","fair_fulltext_source":"epmc_xml","fair_has_llm":true,"fair_computed_at":"2026-06-18T00:30:05.411982Z","clinical_trials":[],"software_tools":[],"db_accessions":[],"linked_datasets":[],"topics":[]}