{"doi":"10.1042/bj2750529","title":"Amino acid distributions around <i>O</i>-linked glycosylation sites","abstract":"<jats:p>To study the sequence requirements for addition of O-linked N-acetylgalactosamine to proteins, amino acid distributions around 174 O-glycosylation sites were compared with distributions around non-glycosylated sites. In comparison with non-glycosylated serine and threonine residues, the most prominent feature in the vicinity of O-glycosylated sites is a significantly increased frequency of proline residues, especially at positions -1 and +3 relative to the glycosylated residues. Alanine, serine and threonine are also significantly increased. The high serine and threonine content of O-glycosylated regions is due to the presence of clusters of several closely spaced glycosylated hydroxy amino acids in many O-glycosylated proteins. Such clusters can be predicted from the primary sequence in some cases, but there is no apparent possibility of predicting isolated O-glycosylation sites from primary sequence data.</jats:p>","journal":"Biochemical Journal","year":1991,"id":17404,"datarank":17.28923019944907,"base_score":5.634789603169249,"endowment":5.634789603169249,"self_citation_contribution":0.8452184404753875,"citation_network_contribution":16.44401175897368,"self_endowment_contribution":0.8452184404753875,"citer_contribution":16.44401175897368,"corpus_percentile":null,"corpus_rank":null,"citation_count":279,"citer_count":200,"citers_with_citation_signal":200,"citers_with_endowment":200,"datacite_reuse_total":0,"is_dataset":false,"is_dataset_confidence":null,"is_oa":false,"file_count":0,"downloads":0,"has_version_chain":false,"published_date":null,"algorithm_id":"datarank_citation_only_1hop_v6","ranking_scope":"data_only","authors":[{"id":124652,"name":"Y Gavel","orcid":null,"position":1,"is_corresponding":false},{"id":124653,"name":"G von Heijne","orcid":null,"position":2,"is_corresponding":false},{"id":124651,"name":"I B Wilson","orcid":null,"position":0,"is_corresponding":false}],"reference_count":0,"raw_metadata":{"has_enrichment":true,"base_score":5.634789603169249,"endowment":5.634789603169249,"datacite_reuse_total":0,"file_count":0,"downloads":0,"views":0,"has_version_chain":false,"is_dataset":false,"is_oa":false,"pmid":"2025231","pmcid":null,"openalex_id":"https://openalex.org/W1870828328","authors":[],"funders":[],"total_grants":0,"fwci":7.2284,"citation_percentile":0.97898289,"influential_citations":5,"citation_trend":[{"year":2012,"count":7},{"year":2013,"count":5},{"year":2014,"count":7},{"year":2015,"count":6},{"year":2016,"count":4},{"year":2017,"count":6},{"year":2018,"count":1},{"year":2020,"count":3},{"year":2021,"count":1},{"year":2022,"count":2},{"year":2024,"count":2},{"year":2025,"count":2}],"oa_status":"bronze","license":null,"oa_locations":[{"url":"https://portlandpress.com/biochemj/article-pdf/275/2/529/602474/bj2750529.pdf","host_type":"journal"},{"url":"https://europepmc.org/articles/pmc1150083?pdf=render","host_type":"GREEN"},{"url":"https://portlandpress.com/biochemj/article-pdf/275/2/529/602474/bj2750529.pdf","host_type":"publisher"},{"url":"https://doi.org/10.1042/bj2750529","host_type":"journal"},{"url":"https://pubmed.ncbi.nlm.nih.gov/2025231","host_type":"repository"},{"url":"https://www.ncbi.nlm.nih.gov/pmc/articles/1150083","host_type":"repository"}],"fields_of_study":["Glycosylation and Glycoproteins Research","Carbohydrate Chemistry and Synthesis","Machine Learning in Bioinformatics","Medicine","Biology","Chemistry","Acetylgalactosamine","Amino Acid Sequence","Animals","Databases, Factual","Glycoproteins","Glycosylation","Humans","Sequence Homology, Nucleic Acid"],"mesh_terms":["Acetylgalactosamine","Amino Acid Sequence","Animals","Glycoproteins","Glycosylation","Humans","Sequence Homology, Nucleic Acid","Databases, Factual"],"keywords":["Threonine","Serine","Glycosylation","Alanine","Amino acid","Biochemistry","Peptide sequence","Chemistry","Sequence (biology)","Proline","Amino acid residue","Protein primary structure","Biology","Phosphorylation"],"sdg_mappings":[{"sdg_number":0,"sdg_label":"Clean water and sanitation"}],"linked_datasets":[],"clinical_trials":[],"software_tools":[],"database_accessions":[],"source":"live","citation_network_status":"fetched"},"created_at":"2026-06-02T18:44:36.649595Z","pmid":null,"pmcid":null,"fwci":null,"citation_percentile":null,"influential_citations":0,"oa_status":null,"license":null,"views":0,"total_file_size_bytes":0,"version_count":0,"clinical_trials":[],"software_tools":[],"db_accessions":[],"linked_datasets":[],"topics":[]}