{"doi":"10.1101/2023.11.09.566363","title":"Are most human specific proteins encoded by long non-coding RNA ?","abstract":"<jats:title>Abstract</jats:title>\n                <jats:p>By looking for a lack of homologues in a reference database of 27 well-annotated proteomes of primates and 52 well-annotated proteomes of other mammals, 170 putative human-specific proteins were identified. Among them, only 2 are known at the protein level and 23 at the transcript level, according to Uniprot. Though 21 of these 25 proteins are found encoded by an open reading frame of a long non-coding RNA, 60% of them are predicted to be at least 90% globular, with a single structural domain. However, there is a near complete lack of structural knowledge about these proteins, with no tridimensional structure presently available in the Protein Databank and a fair prediction for a single of them in the AlphaFold Protein Structure Database. Moreover, the knowledge about the function of these possibly key proteins remains scarce.</jats:p>","journal":null,"year":null,"id":23779,"datarank":0.0,"base_score":0.0,"endowment":0.0,"self_citation_contribution":0.0,"citation_network_contribution":0.0,"self_endowment_contribution":0.0,"citer_contribution":0.0,"corpus_percentile":null,"corpus_rank":null,"citation_count":0,"citer_count":0,"citers_with_citation_signal":0,"citers_with_endowment":0,"datacite_reuse_total":0,"is_dataset":false,"is_dataset_confidence":null,"is_oa":false,"file_count":0,"downloads":0,"has_version_chain":false,"published_date":null,"fair_score":null,"fair_percentile":null,"algorithm_id":"datarank_citation_only_1hop_v6","ranking_scope":"data_only","authors":[{"id":144772,"name":"Yves-Henri Sanejouand","orcid":"0000-0001-8589-2666","position":0,"is_corresponding":false}],"reference_count":0,"raw_metadata":{"has_enrichment":true,"base_score":0.0,"endowment":0.0,"datacite_reuse_total":0,"file_count":0,"downloads":0,"views":0,"has_version_chain":false,"is_dataset":false,"is_oa":false,"pmid":"18998881","pmcid":null,"openalex_id":"https://openalex.org/W4388621152","authors":[],"funders":[],"total_grants":0,"fwci":null,"citation_percentile":null,"influential_citations":0,"citation_trend":[],"oa_status":"green","license":"cc-by","oa_locations":[{"url":"https://www.biorxiv.org/content/biorxiv/early/2023/11/13/2023.11.09.566363.full.pdf","host_type":"repository"},{"url":"https://www.biorxiv.org/content/biorxiv/early/2023/11/13/2023.11.09.566363.full.pdf","host_type":"repository"},{"url":"https://syndication.highwire.org/content/doi/10.1101/2023.11.09.566363","host_type":"publisher"},{"url":"http://dx.doi.org/10.1101/2023.11.09.566363","host_type":"repository"},{"url":"https://hal.science/hal-04289887","host_type":"repository"},{"url":"https://hal.science/hal-04289887v1/document","host_type":"repository"}],"fields_of_study":["RNA and protein synthesis mechanisms","Cancer-related molecular mechanisms research","Genomics and Phylogenetic Studies"],"mesh_terms":[],"keywords":["UniProt","Human proteome project","Computational biology","Proteome","Human proteins","RNA","Open reading frame","Biology","Genetics","Proteomics","Peptide sequence","Gene"],"sdg_mappings":[],"linked_datasets":[],"clinical_trials":[],"software_tools":[],"database_accessions":[],"source":"live","citation_network_status":"fetched"},"created_at":"2026-06-07T20:30:50.126330Z","pmid":null,"pmcid":null,"fwci":null,"citation_percentile":null,"influential_citations":0,"oa_status":null,"license":null,"views":0,"total_file_size_bytes":0,"version_count":0,"fair_f":null,"fair_a":null,"fair_i":null,"fair_r":null,"fair_zscore":null,"fair_rationale":null,"fair_model":null,"fair_agent_version":null,"fair_fulltext_source":null,"fair_has_llm":null,"fair_computed_at":null,"clinical_trials":[],"software_tools":[],"db_accessions":[],"linked_datasets":[],"topics":[]}