{"doi":"10.1186/1471-2105-9-548","title":"GeneChaser: Identifying all biological and clinical conditions in which genes of interest are differentially expressed","abstract":"<h4>Background</h4>The amount of gene expression data in the public repositories, such as NCBI Gene Expression Omnibus (GEO) has grown exponentially, and provides a gold mine for bioinformaticians, but has not been easily accessible by biologists and clinicians.<h4>Results</h4>We developed an automated approach to annotate and analyze all GEO data sets, including 1,515 GEO data sets from 231 microarray types across 42 species, and performed 12,658 group versus group comparisons of 24 GEO-specified types. We then built GeneChaser, a web server that enables biologists and clinicians without bioinformatics skills to easily identify biological and clinical conditions in which a gene or set of genes was differentially expressed. GeneChaser displays these conditions in graphs, gives statistical comparisons, allows sort/filter functions and provides access to the original studies.We performed a single gene search for Nanog and a multiple gene search for Nanog, Oct4, Sox2 and LIN28, confirmed their roles in embryonic stem cell development, identified several drugs that regulate their expression, and suggested their potential roles in sex determination, abnormal sperm morphology, malaria infection, and cancer.<h4>Conclusion</h4>We demonstrated that GeneChaser is a powerful tool to elucidate information on function, transcriptional regulation, drug-response and clinical implications for genes of interest.","journal":"BMC Bioinformatics","year":2008,"id":11948,"datarank":1.9486660128280378,"base_score":3.5553480614894135,"endowment":3.5553480614894135,"self_citation_contribution":0.5333022092234121,"citation_network_contribution":1.4153638036046257,"self_endowment_contribution":0.5333022092234121,"citer_contribution":1.4153638036046257,"corpus_percentile":64.27990235964198,"corpus_rank":440,"citation_count":34,"citer_count":29,"citers_with_citation_signal":26,"citers_with_endowment":26,"datacite_reuse_total":0,"is_dataset":true,"is_dataset_confidence":0.9152,"is_oa":true,"file_count":0,"downloads":0,"has_version_chain":false,"published_date":"2008-12-01","fair_score":39.375,"fair_percentile":20.030782761653473,"algorithm_id":"datarank_citation_only_1hop_v6","ranking_scope":"data_only","authors":[{"id":72955,"name":"Rohan Mallelwar","orcid":null,"position":1,"is_corresponding":false},{"id":95684,"name":"Ajit Thosar","orcid":null,"position":2,"is_corresponding":false},{"id":14908,"name":"Shivkumar Venkatasubrahmanyam","orcid":null,"position":3,"is_corresponding":false},{"id":51,"name":"Atul Janardhan Butte","orcid":"0000-0002-7433-2740","position":4,"is_corresponding":false},{"id":50,"name":"Rong Chen","orcid":"0000-0001-6322-0340","position":0,"is_corresponding":true}],"reference_count":23,"raw_metadata":{"citation_network_status":"fetched"},"created_at":"2026-03-01T18:20:47.508186Z","pmid":"19094235","pmcid":"PMC2629779","fwci":null,"citation_percentile":null,"influential_citations":0,"oa_status":"gold","license":"cc-by","views":0,"total_file_size_bytes":0,"version_count":0,"fair_f":52.5,"fair_a":55.0,"fair_i":25.0,"fair_r":25.0,"fair_zscore":-0.5279,"fair_rationale":{"fair_score":39.38,"has_llm":true,"dimensions":{"F":{"name":"Findable","score":52.5,"criteria":[{"key":"f_has_doi","label":"Has a persistent DOI","kind":"deterministic","weight":1.0,"fraction":1.0,"signal":"DOI present","rationale":null},{"key":"f_repository_presence","label":"Indexed in repositories / literature DBs","kind":"deterministic","weight":1.0,"fraction":1.0,"signal":"datacite=0, pmcid=True, pmid=True","rationale":null},{"key":"f_persistent_ids","label":"Resolvable scholarly identifiers (OpenAlex)","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"no OpenAlex id","rationale":null},{"key":"f_metadata_richness","label":"Rich, machine-readable metadata","kind":"llm","weight":1.0,"fraction":0.25,"signal":null,"rationale":"The paper describes the tool and its outputs but does not provide machine-readable metadata (e.g., structured JSON-LD, schema.org annotations) for the data or code."}]},"A":{"name":"Accessible","score":55.0,"criteria":[{"key":"a_open_access","label":"Open Access / files deposited","kind":"deterministic","weight":1.5,"fraction":1.0,"signal":"Open Access","rationale":null},{"key":"a_retrievable","label":"Free full text retrievable","kind":"deterministic","weight":1.0,"fraction":0.0,"signal":"0 OA location(s)","rationale":null},{"key":"a_access_protocol","label":"Clear data/code access protocol","kind":"llm","weight":1.0,"fraction":0.5,"signal":null,"rationale":"The paper states the web server is accessible at http://genechaser.stanford.edu and that results can be downloaded as tabbed text files, but does not specify a persistent identifier or a formal access protocol for the underlying data or code."}]},"I":{"name":"Interoperable","score":25.0,"criteria":[{"key":"i_linked_data","label":"Linked datasets / DataCite relations","kind":"deterministic","weight":1.0,"fraction":0.0,"signal":"linked_datasets=0, datacite=0","rationale":null},{"key":"i_standard_ids","label":"References data via standard accessions","kind":"deterministic","weight":1.0,"fraction":0.0,"signal":"accessions=0, trials=0","rationale":null},{"key":"i_standards","label":"Standard formats, vocabularies & identifiers","kind":"llm","weight":1.0,"fraction":0.5,"signal":null,"rationale":"The paper uses standard identifiers (Entrez Gene, Homologene) and formats (SOFT files, tabbed text), but does not adopt community-standard vocabularies or linked-data principles for the output."}]},"R":{"name":"Reusable","score":25.0,"criteria":[{"key":"r_license","label":"Clear, open reuse license","kind":"deterministic","weight":1.5,"fraction":0.0,"signal":"no license","rationale":null},{"key":"r_downloads","label":"Demonstrated reuse (downloads)","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"downloads=0","rationale":null},{"key":"r_version","label":"Versioned / maintained","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"no version chain","rationale":null},{"key":"r_dataset","label":"Classified as a data resource","kind":"deterministic","weight":0.5,"fraction":1.0,"signal":"is_dataset","rationale":null},{"key":"r_reusability","label":"Data-availability statement, license & reproducibility","kind":"llm","weight":2.0,"fraction":0.333,"signal":null,"rationale":"The paper provides a data-availability statement (Creative Commons Attribution License) and mentions the tool is updated quarterly, but does not provide a persistent identifier, versioned data/code repository, or detailed reproducibility instructions."}]}},"suggestions":["Provide a machine-readable metadata file (e.g., JSON-LD) describing the dataset and tool outputs.","Assign a persistent identifier (e.g., DOI) to the underlying data and code, and deposit them in a public repository.","Document the exact software versions, parameters, and execution environment to enable full reproducibility.","Use standard ontologies (e.g., OBI, EDAM) for experimental conditions and data types in the output.","Include a formal license for the code (e.g., MIT) and a clear citation recommendation for reuse."],"model":"deepseek/deepseek-v4-flash","agent_version":"fair_agent_v2","fulltext_source":"epmc_xml"},"fair_model":"deepseek/deepseek-v4-flash","fair_agent_version":"fair_agent_v2","fair_fulltext_source":"epmc_xml","fair_has_llm":true,"fair_computed_at":"2026-06-18T00:44:19.227777Z","clinical_trials":[],"software_tools":[],"db_accessions":[],"linked_datasets":[],"topics":[]}