{"doi":"10.1038/s41598-024-56705-y","title":"Enriched atlas of lncRNA and protein-coding genes for the GRCg7b chicken assembly and its functional annotation across 47 tissues","abstract":"Gene atlases for livestock are steadily improving thanks to new genome assemblies and new expression data improving the gene annotation. However, gene content varies across databases due to differences in RNA sequencing data and bioinformatics pipelines, especially for long non-coding RNAs (lncRNAs) which have higher tissue and developmental specificity and are harder to consistently identify compared to protein coding genes (PCGs). As done previously in 2020 for chicken assemblies galgal5 and GRCg6a, we provide a new gene atlas, lncRNA-enriched, for the latest GRCg7b chicken assembly, integrating \"NCBI RefSeq\", \"EMBL-EBI Ensembl/GENCODE\" reference annotations and other resources such as FAANG and NONCODE. As a result, the number of PCGs increases from 18,022 (RefSeq) and 17,007 (Ensembl) to 24,102, and that of lncRNAs from 5789 (RefSeq) and 11,944 (Ensembl) to 44,428. Using 1400 public RNA-seq transcriptome representing 47 tissues, we provided expression evidence for 35,257 (79%) lncRNAs and 22,468 (93%) PCGs, supporting the relevance of this atlas. Further characterization including tissue-specificity, sex-differential expression and gene configurations are provided. We also identified conserved miRNA-hosting genes with human counterparts, suggesting common function. 
The annotated atlas is available at gega.sigenae.org","journal":"Scientific Reports","year":2024,"id":30988,"datarank":0.6256035286409051,"base_score":3.1780538303479458,"endowment":3.1780538303479458,"self_citation_contribution":0.47670807455219194,"citation_network_contribution":0.1488954540887131,"self_endowment_contribution":0.47670807455219194,"citer_contribution":0.1488954540887131,"corpus_percentile":52.7,"corpus_rank":660,"citation_count":23,"citer_count":14,"citers_with_citation_signal":8,"citers_with_endowment":8,"datacite_reuse_total":0,"is_dataset":true,"is_dataset_confidence":null,"is_oa":false,"file_count":0,"downloads":0,"has_version_chain":false,"published_date":null,"fair_score":56.25,"fair_percentile":91.64467897977133,"algorithm_id":"datarank_citation_only_1hop_v6","ranking_scope":"data_only","authors":[{"id":123000,"name":"Mathieu Charles","orcid":"0000-0001-6491-1928","position":1,"is_corresponding":false},{"id":11752,"name":"Sylvain Foissac","orcid":"0000-0002-2631-5356","position":2,"is_corresponding":false},{"id":167082,"name":"Haijuan Zhou","orcid":null,"position":3,"is_corresponding":false},{"id":123001,"name":"Dailu Guan","orcid":"0000-0001-8800-3158","position":4,"is_corresponding":false},{"id":31097,"name":"Lingzhao Fang","orcid":"0000-0003-1103-3679","position":5,"is_corresponding":false},{"id":122144,"name":"Christophe Klopp","orcid":"0000-0001-7126-5477","position":6,"is_corresponding":false},{"id":123002,"name":"Coralie Allain","orcid":"0009-0009-7673-0644","position":7,"is_corresponding":false},{"id":123003,"name":"Laetitia Lagoutte","orcid":null,"position":8,"is_corresponding":false},{"id":123004,"name":"Frédéric Lecerf","orcid":"0000-0002-6471-1771","position":9,"is_corresponding":false},{"id":123005,"name":"Hervé Acloque","orcid":"0000-0003-4761-1055","position":10,"is_corresponding":false},{"id":123006,"name":"Elisabetta 
Giuffra","orcid":"0000-0001-9568-2056","position":11,"is_corresponding":false},{"id":123007,"name":"Frédérique Pitel","orcid":"0000-0002-1477-7633","position":12,"is_corresponding":false},{"id":122143,"name":"Sandrine Lagarrigue","orcid":"0000-0002-4887-7245","position":13,"is_corresponding":false},{"id":122999,"name":"Fabien Degalez","orcid":"0000-0001-8252-6425","position":0,"is_corresponding":false}],"reference_count":0,"raw_metadata":{"has_enrichment":true,"base_score":3.1354942159291497,"endowment":3.1354942159291497,"datacite_reuse_total":0,"file_count":0,"downloads":0,"views":0,"has_version_chain":false,"is_dataset":false,"is_oa":false,"pmid":"38504112","pmcid":"PMC10951430","openalex_id":"https://openalex.org/W4392949890","authors":[],"funders":[{"funder_name":"European Commission","grant_id":"101000236","title":"GEroNIMO: Genome and Epigenome eNabled breedIng in MOnogastrics"}],"total_grants":1,"fwci":5.6032,"citation_percentile":0.96698606,"influential_citations":0,"citation_trend":[{"year":2024,"count":6},{"year":2025,"count":10},{"year":2026,"count":6}],"oa_status":"gold","license":"cc-by","oa_locations":[{"url":"https://www.nature.com/articles/s41598-024-56705-y.pdf","host_type":"journal"},{"url":"https://www.nature.com/articles/s41598-024-56705-y.pdf","host_type":"GOLD"},{"url":"https://www.nature.com/articles/s41598-024-56705-y.pdf","host_type":"publisher"},{"url":"https://www.nature.com/articles/s41598-024-56705-y","host_type":"publisher"},{"url":"https://doi.org/10.1038/s41598-024-56705-y","host_type":"journal"},{"url":"https://pubmed.ncbi.nlm.nih.gov/38504112","host_type":"repository"},{"url":"https://pure.au.dk/portal/en/publications/5197dc6f-47a4-4591-aaf5-d4ecf78aa121","host_type":""},{"url":"https://hal.inrae.fr/hal-04575157","host_type":"repository"},{"url":"https://escholarship.org/uc/item/1rx769hm","host_type":"repository"},{"url":"https://www.ncbi.nlm.nih.gov/pmc/articles/10951430","host_type":"repository"},{"url":"https://doaj.org/article/
289590f9430a4752b5aaa9be20f46cfd","host_type":"repository"},{"url":"https://hal.inrae.fr/hal-04575157/document","host_type":"repository"},{"url":"https://escholarship.org/content/qt1rx769hm/qt1rx769hm.pdf","host_type":"repository"},{"url":"https://pmc.ncbi.nlm.nih.gov/articles/PMC10951430/pdf/41598_2024_Article_56705.pdf","host_type":"repository"},{"url":"https://europepmc.org/articles/PMC10951430","host_type":"Europe_PMC"},{"url":"https://europepmc.org/articles/PMC10951430?pdf=render","host_type":"Europe_PMC"},{"url":"https://doi.org/10.1101/2023.08.18.553750","host_type":""},{"url":"http://dx.doi.org/10.1038/s41598-024-56705-y","host_type":""},{"url":"https://hal.inrae.fr/hal-04575157v1","host_type":""},{"url":"https://hal.inrae.fr/hal-04575157v1/document","host_type":""},{"url":"https://zenodo.org/records/13121780","host_type":""},{"url":"https://www.scopus.com/pages/publications/85188156983","host_type":""},{"url":"https://doi.org/https://doi.org/10.1038/s41598-024-56705-y","host_type":""}],"fields_of_study":["Cancer-related molecular mechanisms research","RNA modifications and cancer","RNA Research and Splicing","Biology","Medicine","0301 basic medicine","03 medical and health sciences","0303 health sciences","Animals","Humans","RNA, Long Noncoding","Chickens","Transcriptome","Molecular Sequence Annotation","Sequence Analysis, RNA"],"mesh_terms":["Animals","Chickens","Humans","Sequence Analysis, RNA","Molecular Sequence Annotation","Transcriptome","RNA, Long Noncoding"],"keywords":["Annotation","Atlas (anatomy)","Computational biology","Gene","Biology","Coding (social sciences)","Human Protein Atlas","Genetics","Bioinformatics","Computer science","Anatomy","Protein expression","Tissue specificity","Genome annotation","Chicken","miRNA","Co-expression","Long Non Coding Rnas","Gene Atlas","[SDV.BIO]Life Sciences [q-bio]/Biotechnology","Science","[SDV.BBM.GTP]Life Sciences [q-bio]/Biochemistry","Article","576","Animals","Humans","Molecular Biology/Genomics 
[q-bio.GN]","[SDV.BA.MVSA]Life Sciences [q-bio]/Animal biology/Veterinary medicine and animal Health","Sequence Analysis, RNA","Q","[SDV.BA.MVSA] Life Sciences [q-bio]/Animal biology/Veterinary medicine and animal Health","R","500","Molecular Sequence Annotation","[SDV.BIO] Life Sciences [q-bio]/Biotechnology","[SDV.AEN] Life Sciences [q-bio]/Food and Nutrition","RNA","[SDV.BBM.GTP] Life Sciences [q-bio]/Biochemistry, Molecular Biology/Genomics [q-bio.GN]","Medicine","Long Noncoding","RNA, Long Noncoding","Transcriptome","[SDV.AEN]Life Sciences [q-bio]/Food and Nutrition","Sequence Analysis","Chickens"],"sdg_mappings":[{"sdg_number":2,"sdg_label":"2. Zero hunger"}],"linked_datasets":[],"clinical_trials":[],"software_tools":[],"database_accessions":[{"name":"ensembl"}],"source":"live","citation_network_status":"fetched"},"created_at":"2026-06-09T06:10:08.973098Z","pmid":"38504112","pmcid":"PMC10951430","fwci":null,"citation_percentile":null,"influential_citations":0,"oa_status":"gold","license":"cc-by","views":0,"total_file_size_bytes":0,"version_count":0,"fair_f":65.0,"fair_a":72.5,"fair_i":37.5,"fair_r":50.0,"fair_zscore":0.9985,"fair_rationale":{"fair_score":56.25,"has_llm":true,"dimensions":{"F":{"name":"Findable","score":65.0,"criteria":[{"key":"f_has_doi","label":"Has a persistent DOI","kind":"deterministic","weight":1.0,"fraction":1.0,"signal":"DOI present","rationale":null},{"key":"f_repository_presence","label":"Indexed in repositories / literature DBs","kind":"deterministic","weight":1.0,"fraction":1.0,"signal":"datacite=0, pmcid=True, pmid=True","rationale":null},{"key":"f_persistent_ids","label":"Resolvable scholarly identifiers (OpenAlex)","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"no OpenAlex id","rationale":null},{"key":"f_metadata_richness","label":"Rich, machine-readable metadata","kind":"llm","weight":1.0,"fraction":0.5,"signal":null,"rationale":"The paper provides a DOI and mentions a dedicated website (gega.sigenae.org) but does 
not state that metadata is provided in a machine-readable format like structured data or JSON-LD."}]},"A":{"name":"Accessible","score":72.5,"criteria":[{"key":"a_open_access","label":"Open Access / files deposited","kind":"deterministic","weight":1.5,"fraction":0.5,"signal":"files/OA location present but not flagged OA","rationale":null},{"key":"a_retrievable","label":"Free full text retrievable","kind":"deterministic","weight":1.0,"fraction":1.0,"signal":"23 OA location(s)","rationale":null},{"key":"a_access_protocol","label":"Clear data/code access protocol","kind":"llm","weight":1.0,"fraction":0.75,"signal":null,"rationale":"The paper states that data and annotation files are publicly available via URLs (fragencode.org, gega.sigenae.org) and supplementary files, but does not specify a formal access protocol or persistent identifier for the code."}]},"I":{"name":"Interoperable","score":37.5,"criteria":[{"key":"i_linked_data","label":"Linked datasets / DataCite relations","kind":"deterministic","weight":1.0,"fraction":0.0,"signal":"linked_datasets=0, datacite=0","rationale":null},{"key":"i_standard_ids","label":"References data via standard accessions","kind":"deterministic","weight":1.0,"fraction":0.0,"signal":"accessions=0, trials=0","rationale":null},{"key":"i_standards","label":"Standard formats, vocabularies & identifiers","kind":"llm","weight":1.0,"fraction":0.75,"signal":null,"rationale":"The paper uses standard formats (GTF, FASTA, bigBed) and references standard identifiers (RefSeq, Ensembl), but does not explicitly state use of standard vocabularies or ontologies for functional annotation."}]},"R":{"name":"Reusable","score":50.0,"criteria":[{"key":"r_license","label":"Clear, open reuse license","kind":"deterministic","weight":1.5,"fraction":0.0,"signal":"no license","rationale":null},{"key":"r_downloads","label":"Demonstrated reuse 
(downloads)","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"downloads=0","rationale":null},{"key":"r_version","label":"Versioned / maintained","kind":"deterministic","weight":0.5,"fraction":0.0,"signal":"no version chain","rationale":null},{"key":"r_dataset","label":"Classified as a data resource","kind":"deterministic","weight":0.5,"fraction":1.0,"signal":"is_dataset","rationale":null},{"key":"r_reusability","label":"Data-availability statement, license & reproducibility","kind":"llm","weight":2.0,"fraction":0.833,"signal":null,"rationale":"The paper includes a data availability statement with a Creative Commons license (CC BY 4.0), provides supplementary tables and a website, and describes reproducibility steps, but does not explicitly state that all code is available or that the analysis pipeline is fully documented for reuse."}]}},"suggestions":["Provide machine-readable metadata (e.g., schema.org markup) for the dataset and its description.","Include a persistent identifier (e.g., DOI) for the code repository and specify a formal access protocol.","Use standard ontologies (e.g., Gene Ontology) for functional annotations and state their use explicitly.","Make the full analysis pipeline (e.g., scripts, workflows) available in a public repository with a clear license.","Provide a formal data citation with a DOI or other persistent identifier for the supplementary data files."],"model":"deepseek/deepseek-v4-flash","agent_version":"fair_agent_v2","fulltext_source":"epmc_xml"},"fair_model":"deepseek/deepseek-v4-flash","fair_agent_version":"fair_agent_v2","fair_fulltext_source":"epmc_xml","fair_has_llm":true,"fair_computed_at":"2026-06-18T00:46:56.889525Z","clinical_trials":[],"software_tools":[],"db_accessions":[],"linked_datasets":[],"topics":[]}