{"doi":"10.1186/gb-2003-4-5-p3","title":"DAVID: Database for Annotation, Visualization, and Integrated Discovery","abstract":"<h4>Background</h4>Functional annotation of differentially expressed genes is a necessary and critical step in the analysis of microarray data. The distributed nature of biological knowledge frequently requires researchers to navigate through numerous web-accessible databases gathering information one gene at a time. A more judicious approach is to provide query-based access to an integrated database that disseminates biologically rich information across large datasets and displays graphic summaries of functional information.<h4>Results</h4>Database for Annotation, Visualization, and Integrated Discovery (DAVID; http://www.david.niaid.nih.gov) addresses this need via four web-based analysis modules: 1) Annotation Tool - rapidly appends descriptive data from several public databases to lists of genes; 2) GoCharts - assigns genes to Gene Ontology functional categories based on user selected classifications and term specificity level; 3) KeggCharts - assigns genes to KEGG metabolic processes and enables users to view genes in the context of biochemical pathway maps; and 4) DomainCharts - groups genes according to PFAM conserved protein domains.<h4>Conclusions</h4>Analysis results and graphical displays remain dynamically linked to primary data and external data repositories, thereby furnishing in-depth as well as broad-based data coverage. The functionality provided by DAVID accelerates the analysis of genome-scale datasets by facilitating the transition from data collection to biological meaning.","journal":"Genome Biology","year":2003,"id":6570,"datarank":23.823749014802118,"base_score":9.143986871426161,"endowment":9.143986871426161,"self_citation_contribution":1.3715980307139244,"citation_network_contribution":22.452150984088195,"self_endowment_contribution":1.3715980307139244,"citer_contribution":22.452150984088195,"corpus_percentile":97.39625711960944,"corpus_rank":33,"citation_count":9481,"citer_count":191,"citers_with_citation_signal":191,"citers_with_endowment":191,"datacite_reuse_total":0,"is_dataset":true,"is_dataset_confidence":0.7037,"is_oa":true,"file_count":0,"downloads":0,"has_version_chain":false,"published_date":"2003-04-03","fair_score":29.1667,"fair_percentile":10.29023746701847,"algorithm_id":"datarank_citation_only_1hop_v6","ranking_scope":"data_only","authors":[{"id":60621,"name":"Brad T. Sherman","orcid":"0000-0001-5815-7359","position":1,"is_corresponding":false},{"id":60622,"name":"Douglas A Hosack","orcid":null,"position":2,"is_corresponding":false},{"id":50203,"name":"JUN YANG","orcid":"0000-0002-7920-2777","position":3,"is_corresponding":false},{"id":60623,"name":"Wei Gao","orcid":"0000-0002-1391-8334","position":4,"is_corresponding":false},{"id":60624,"name":"H. Clifford Lane","orcid":"0000-0001-9509-1045","position":5,"is_corresponding":false},{"id":60625,"name":"Richard A. Lempicki","orcid":"0000-0002-7059-409X","position":6,"is_corresponding":false},{"id":60620,"name":"Glynn Dennis","orcid":null,"position":0,"is_corresponding":true}],"reference_count":13,"raw_metadata":{"citation_network_status":"fetched"},"created_at":"2026-03-01T18:20:47.508186Z","pmid":null,"pmcid":null,"fwci":null,"citation_percentile":null,"influential_citations":0,"oa_status":null,"license":null,"views":0,"total_file_size_bytes":0,"version_count":0,"fair_f":32.5,"fair_a":55.0,"fair_i":12.5,"fair_r":16.6667,"fair_zscore":-1.4513,"fair_rationale":{"fair_score":29.17,"has_llm":true,"dimensions":{"F":{"name":"Findable","score":32.5,"criteria":[{"key":"f_has_doi","label":"Has a persistent DOI","kind":"deterministic","weight":1.0,"fraction":1.0,"signal":"DOI present","rationale":null},{"key":"f_repository_presence","label":"Indexed in repositories / literature DBs","kind":"deterministic","weight":1.0,"fraction":0.0,"signal":"datacite=0, pmcid=False, pmid=False","rationale":null},{"key":"f_persistent_ids","label":"Resolvable scholarly identifiers (OpenAlex)","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"no OpenAlex id","rationale":null},{"key":"f_metadata_richness","label":"Rich, machine-readable metadata","kind":"llm","weight":1.0,"fraction":0.25,"signal":null,"rationale":"The paper provides a basic description of the DAVID database, but lacks explicit machine-readable metadata or structured metadata such as schema.org annotations."}]},"A":{"name":"Accessible","score":55.0,"criteria":[{"key":"a_open_access","label":"Open Access / files deposited","kind":"deterministic","weight":1.5,"fraction":1.0,"signal":"Open Access","rationale":null},{"key":"a_retrievable","label":"Free full text retrievable","kind":"deterministic","weight":1.0,"fraction":0.0,"signal":"0 OA location(s)","rationale":null},{"key":"a_access_protocol","label":"Clear data/code access protocol","kind":"llm","weight":1.0,"fraction":0.5,"signal":null,"rationale":"The paper states that the database is free of charge and provides a URL (http://genomebiology.com/2003/4/5/P3), but does not specify a clear access protocol or authentication requirements for the data/code."}]},"I":{"name":"Interoperable","score":12.5,"criteria":[{"key":"i_linked_data","label":"Linked datasets / DataCite relations","kind":"deterministic","weight":1.0,"fraction":0.0,"signal":"linked_datasets=0, datacite=0","rationale":null},{"key":"i_standard_ids","label":"References data via standard accessions","kind":"deterministic","weight":1.0,"fraction":0.0,"signal":"accessions=0, trials=0","rationale":null},{"key":"i_standards","label":"Standard formats, vocabularies & identifiers","kind":"llm","weight":1.0,"fraction":0.25,"signal":null,"rationale":"The paper mentions standard bioinformatics terms but does not explicitly state use of standard formats, controlled vocabularies, or persistent identifiers beyond the article URL."}]},"R":{"name":"Reusable","score":16.67,"criteria":[{"key":"r_license","label":"Clear, open reuse license","kind":"deterministic","weight":1.5,"fraction":0.0,"signal":"no license","rationale":null},{"key":"r_downloads","label":"Demonstrated reuse (downloads)","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"downloads=0","rationale":null},{"key":"r_version","label":"Versioned / maintained","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"no version chain","rationale":null},{"key":"r_dataset","label":"Classified as a data resource","kind":"deterministic","weight":0.5,"fraction":1.0,"signal":"is_dataset","rationale":null},{"key":"r_reusability","label":"Data-availability statement, license & reproducibility","kind":"llm","weight":2.0,"fraction":0.167,"signal":null,"rationale":"No data-availability statement, license, or explicit reproducibility information is provided; the article is a preprint without peer-reviewed data sharing details."}]}},"suggestions":["Include a formal data availability statement specifying where and how to access the database and any associated source code or data.","Add machine-readable metadata in the HTML (e.g., schema.org Dataset) to facilitate automated discovery.","Clearly indicate the license under which the database and its contents are released.","Provide standard identifiers (e.g., DOIs, accession numbers) for all datasets and tools mentioned.","Describe the use of standard file formats and controlled vocabularies (e.g., Gene Ontology) explicitly in the methods."],"model":"deepseek/deepseek-v4-flash","agent_version":"fair_agent_v2","fulltext_source":"unpaywall_pdf"},"fair_model":"deepseek/deepseek-v4-flash","fair_agent_version":"fair_agent_v2","fair_fulltext_source":"unpaywall_pdf","fair_has_llm":true,"fair_computed_at":"2026-06-18T00:25:47.749872Z","clinical_trials":[],"software_tools":[],"db_accessions":[],"linked_datasets":[],"topics":[]}