{"doi":"10.1186/1471-2164-11-645","title":"Bovine Genome Database: supporting community annotation and analysis of the Bos taurus genome","abstract":"<jats:title>Abstract</jats:title><jats:sec><jats:title>Background</jats:title><jats:p>A goal of the Bovine Genome Database (BGD;<jats:ext-link xmlns:xlink=\"http://www.w3.org/1999/xlink\" xlink:href=\"http://BovineGenome.org\" ext-link-type=\"uri\">http://BovineGenome.org</jats:ext-link>) has been to support the Bovine Genome Sequencing and Analysis Consortium (BGSAC) in the annotation and analysis of the bovine genome. We were faced with several challenges, including the need to maintain consistent quality despite diversity in annotation expertise in the research community, the need to maintain consistent data formats, and the need to minimize the potential duplication of annotation effort. With new sequencing technologies allowing many more eukaryotic genomes to be sequenced, the demand for collaborative annotation is likely to increase. Here we present our approach, challenges and solutions facilitating a large distributed annotation project.</jats:p></jats:sec><jats:sec><jats:title>Results and Discussion</jats:title><jats:p>BGD has provided annotation tools that supported 147 members of the BGSAC in contributing 3,871 gene models over a fifteen-week period, and these annotations have been integrated into the bovine Official Gene Set. Our approach has been to provide an annotation system, which includes a BLAST site, multiple genome browsers, an annotation portal, and the Apollo Annotation Editor configured to connect directly to our Chado database. In addition to implementing and integrating components of the annotation system, we have performed computational analyses to create gene evidence tracks and a consensus gene set, which can be viewed on individual gene pages at BGD.</jats:p></jats:sec><jats:sec><jats:title>Conclusions</jats:title><jats:p>We have provided annotation tools that alleviate challenges associated with distributed annotation. Our system provides a consistent set of data to all annotators and eliminates the need for annotators to format data. Involving the bovine research community in genome annotation has allowed us to leverage expertise in various areas of bovine biology to provide biological insight into the genome sequence.</jats:p></jats:sec>","journal":"BMC Genomics","year":2010,"id":30985,"datarank":1.3847071266185877,"base_score":3.295836866004329,"endowment":3.295836866004329,"self_citation_contribution":0.4943755299006494,"citation_network_contribution":0.8903315967179384,"self_endowment_contribution":0.4943755299006494,"citer_contribution":0.8903315967179384,"corpus_percentile":61.5,"corpus_rank":537,"citation_count":26,"citer_count":22,"citers_with_citation_signal":20,"citers_with_endowment":20,"datacite_reuse_total":9,"is_dataset":true,"is_dataset_confidence":null,"is_oa":false,"file_count":0,"downloads":0,"has_version_chain":false,"published_date":null,"fair_score":63.5417,"fair_percentile":95.99824098504837,"algorithm_id":"datarank_citation_only_1hop_v6","ranking_scope":"data_only","authors":[{"id":167071,"name":"Christopher P Childers","orcid":null,"position":1,"is_corresponding":false},{"id":167072,"name":"Jaideep P Sundaram","orcid":null,"position":2,"is_corresponding":false},{"id":167073,"name":"C Michael Dickens","orcid":null,"position":3,"is_corresponding":false},{"id":167074,"name":"Kevin L Childs","orcid":null,"position":4,"is_corresponding":false},{"id":167075,"name":"Donald C Vile","orcid":null,"position":5,"is_corresponding":false},{"id":167076,"name":"Christine G Elsik","orcid":null,"position":6,"is_corresponding":false},{"id":37898,"name":"Justin T Reese","orcid":"0000-0002-2170-2250","position":0,"is_corresponding":false}],"reference_count":0,"raw_metadata":{"has_enrichment":true,"base_score":3.295836866004329,"endowment":3.295836866004329,"datacite_reuse_total":9,"file_count":0,"downloads":0,"views":0,"has_version_chain":false,"is_dataset":false,"is_oa":false,"pmid":"21092105","pmcid":"PMC3012608","openalex_id":"https://openalex.org/W2162782320","authors":[],"funders":[],"total_grants":0,"fwci":1.5237,"citation_percentile":0.84647753,"influential_citations":0,"citation_trend":[{"year":2012,"count":2},{"year":2013,"count":2},{"year":2014,"count":5},{"year":2015,"count":4},{"year":2016,"count":1},{"year":2017,"count":4},{"year":2020,"count":1},{"year":2021,"count":2},{"year":2022,"count":1}],"oa_status":"gold","license":"cc-by","oa_locations":[{"url":"https://bmcgenomics.biomedcentral.com/counter/pdf/10.1186/1471-2164-11-645","host_type":"journal"},{"url":"https://bmcgenomics.biomedcentral.com/counter/pdf/10.1186/1471-2164-11-645","host_type":"GOLD"},{"url":"https://bmcgenomics.biomedcentral.com/counter/pdf/10.1186/1471-2164-11-645","host_type":"publisher"},{"url":"https://link.springer.com/content/pdf/10.1186/1471-2164-11-645.pdf","host_type":"publisher"},{"url":"https://link.springer.com/article/10.1186/1471-2164-11-645/fulltext.html","host_type":"publisher"},{"url":"https://doi.org/10.1186/1471-2164-11-645","host_type":"journal"},{"url":"https://pubmed.ncbi.nlm.nih.gov/21092105","host_type":"repository"},{"url":"https://doaj.org/article/41edcbe0f9ad4b73852b62ce04232e25","host_type":"repository"},{"url":"https://hdl.handle.net/1969.1/183754","host_type":"repository"},{"url":"https://www.ncbi.nlm.nih.gov/pmc/articles/3012608","host_type":"repository"},{"url":"https://bmcgenomics.biomedcentral.com/track/pdf/10.1186/1471-2164-11-645","host_type":"Unpaywall"},{"url":"http://www.biomedcentral.com/1471-2164/11/645/abstract","host_type":"BioMedCentral"},{"url":"http://www.biomedcentral.com/content/pdf/1471-2164-11-645.pdf","host_type":"BioMedCentral"},{"url":"http://www.biomedcentral.com/1471-2164/11/645","host_type":"BioMedCentral"},{"url":"https://europepmc.org/articles/PMC3012608","host_type":"Europe_PMC"},{"url":"https://europepmc.org/articles/PMC3012608?pdf=render","host_type":"Europe_PMC"}],"fields_of_study":["Genetic and phenotypic traits in livestock","Genetic Mapping and Diversity in Plants and Animals","Genomics and Phylogenetic Studies","Biology","Medicine","Computer Science","Environmental Science","Animals","Cattle","Databases, Genetic","Genome","Internet","Molecular Sequence Annotation","Statistics as Topic"],"mesh_terms":["Animals","Cattle","Statistics as Topic","Genome","Internet","Databases, Genetic","Molecular Sequence Annotation"],"keywords":["Annotation","Genome project","Genome","Gene Annotation","Leverage (statistics)","Computational biology","Biology","Computer science","Bovine genome","Database","Gene","Genetics","Artificial intelligence"],"sdg_mappings":[{"sdg_number":0,"sdg_label":"Partnerships for the goals"}],"linked_datasets":[{"doi":"10.6084/m9.figshare.14434619.v1","title":"Additional file of Bovine Genome Database: supporting community annotation and analysis of the Bos taurus genome","publisher":"figshare","resource_type":"Presentation"},{"doi":"10.6084/m9.figshare.14434619.v2","title":"Additional file of Bovine Genome Database: supporting community annotation and analysis of the Bos taurus genome","publisher":"figshare","resource_type":"Presentation"},{"doi":"10.6084/m9.figshare.14434619","title":"Additional file of Bovine Genome Database: supporting community annotation and analysis of the Bos taurus genome","publisher":"figshare","resource_type":"Presentation"},{"doi":"10.6084/m9.figshare.14434616.v1","title":"Additional file 2 of Bovine Genome Database: supporting community annotation and analysis of the Bos taurus genome","publisher":"figshare","resource_type":"Presentation"},{"doi":"10.6084/m9.figshare.14434616","title":"Additional file 2 of Bovine Genome Database: supporting community annotation and analysis of the Bos taurus genome","publisher":"figshare","resource_type":"Presentation"},{"doi":"10.6084/m9.figshare.14434613.v1","title":"Additional file 1 of Bovine Genome Database: supporting community annotation and analysis of the Bos taurus genome","publisher":"figshare","resource_type":"Presentation"},{"doi":"10.6084/m9.figshare.14434613","title":"Additional file 1 of Bovine Genome Database: supporting community annotation and analysis of the Bos taurus genome","publisher":"figshare","resource_type":"Presentation"},{"doi":"10.6084/m9.figshare.c.4872633.v1","title":"Bovine Genome Database: supporting community annotation and analysis of the Bos taurus genome","publisher":"figshare","resource_type":"Collection"},{"doi":"10.6084/m9.figshare.c.4872633","title":"Bovine Genome Database: supporting community annotation and analysis of the Bos taurus genome","publisher":"figshare","resource_type":"Collection"}],"clinical_trials":[],"software_tools":[],"database_accessions":[],"source":"live","citation_network_status":"fetched"},"created_at":"2026-06-09T06:09:53.200112Z","pmid":"21092105","pmcid":"PMC3012608","fwci":null,"citation_percentile":null,"influential_citations":0,"oa_status":"gold","license":"cc-by","views":0,"total_file_size_bytes":0,"version_count":0,"fair_f":65.0,"fair_a":72.5,"fair_i":75.0,"fair_r":41.6667,"fair_zscore":1.6581,"fair_rationale":{"fair_score":63.54,"has_llm":true,"dimensions":{"F":{"name":"Findable","score":65.0,"criteria":[{"key":"f_has_doi","label":"Has a persistent DOI","kind":"deterministic","weight":1.0,"fraction":1.0,"signal":"DOI present","rationale":null},{"key":"f_repository_presence","label":"Indexed in repositories / literature DBs","kind":"deterministic","weight":1.0,"fraction":1.0,"signal":"datacite=9, pmcid=True, pmid=True","rationale":null},{"key":"f_persistent_ids","label":"Resolvable scholarly identifiers (OpenAlex)","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"no OpenAlex id","rationale":null},{"key":"f_metadata_richness","label":"Rich, machine-readable metadata","kind":"llm","weight":1.0,"fraction":0.5,"signal":null,"rationale":"The paper describes use of controlled vocabularies (SO, GO) and schemas (Chado), but lacks explicit machine-readable metadata (e.g., JSON-LD, RDF) for datasets."}]},"A":{"name":"Accessible","score":72.5,"criteria":[{"key":"a_open_access","label":"Open Access / files deposited","kind":"deterministic","weight":1.5,"fraction":0.5,"signal":"files/OA location present but not flagged OA","rationale":null},{"key":"a_retrievable","label":"Free full text retrievable","kind":"deterministic","weight":1.0,"fraction":1.0,"signal":"16 OA location(s)","rationale":null},{"key":"a_access_protocol","label":"Clear data/code access protocol","kind":"llm","weight":1.0,"fraction":0.75,"signal":null,"rationale":"BGD is publicly accessible at given URL, code at repositories, but some tools require registration and downloading full data may need Apollo software."}]},"I":{"name":"Interoperable","score":75.0,"criteria":[{"key":"i_linked_data","label":"Linked datasets / DataCite relations","kind":"deterministic","weight":1.0,"fraction":1.0,"signal":"linked_datasets=0, datacite=9","rationale":null},{"key":"i_standard_ids","label":"References data via standard accessions","kind":"deterministic","weight":1.0,"fraction":0.0,"signal":"accessions=0, trials=0","rationale":null},{"key":"i_standards","label":"Standard formats, vocabularies & identifiers","kind":"llm","weight":1.0,"fraction":1.0,"signal":null,"rationale":"Extensive use of standard formats (GFF3, FASTA, Chado-XML, SO, GO) and identifiers (RefSeq, Ensembl) is explicitly described."}]},"R":{"name":"Reusable","score":41.67,"criteria":[{"key":"r_license","label":"Clear, open reuse license","kind":"deterministic","weight":1.5,"fraction":0.0,"signal":"no license","rationale":null},{"key":"r_downloads","label":"Demonstrated reuse (downloads)","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"downloads=0","rationale":null},{"key":"r_version","label":"Versioned / maintained","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"no version chain","rationale":null},{"key":"r_dataset","label":"Classified as a data resource","kind":"deterministic","weight":0.5,"fraction":1.0,"signal":"is_dataset","rationale":null},{"key":"r_reusability","label":"Data-availability statement, license & reproducibility","kind":"llm","weight":2.0,"fraction":0.667,"signal":null,"rationale":"Data is publicly accessible, article is CC BY, and code repositories are provided, but no explicit data license or statement on long-term preservation."}]}},"suggestions":["Include an explicit data availability statement with a license for the data (e.g., CC0 or ODbL).","Provide machine-readable metadata (e.g., JSON-LD or schema.org) for datasets and annotations.","Make data downloadable in bulk without registration to improve accessibility.","Document the exact software versions and database dumps used for reproducibility.","Use persistent identifiers (e.g., DOIs) for specific dataset releases (e.g., OGSv2)."],"model":"deepseek/deepseek-v4-flash","agent_version":"fair_agent_v2","fulltext_source":"epmc_xml"},"fair_model":"deepseek/deepseek-v4-flash","fair_agent_version":"fair_agent_v2","fair_fulltext_source":"epmc_xml","fair_has_llm":true,"fair_computed_at":"2026-06-18T00:46:03.272980Z","clinical_trials":[],"software_tools":[],"db_accessions":[],"linked_datasets":[],"topics":[]}