{"doi":"10.1002/prot.20541","title":"Structural analysis of a set of proteins resulting from a bacterial genomics project","abstract":"<jats:title>Abstract</jats:title><jats:p>The targets of the Structural GenomiX (SGX) bacterial genomics project were proteins conserved in multiple prokaryotic organisms with no obvious sequence homolog in the Protein Data Bank of known structures. The outcome of this work was 80 structures, covering 60 unique sequences and 49 different genes. Experimental phase determination from proteins incorporating Se‐Met was carried out for 45 structures with most of the remainder solved by molecular replacement using members of the experimentally phased set as search models. An automated tool was developed to deposit these structures in the Protein Data Bank, along with the associated X‐ray diffraction data (including refined experimental phases) and experimentally confirmed sequences. BLAST comparisons of the SGX structures with structures that had appeared in the Protein Data Bank over the intervening 3.5 years since the SGX target list had been compiled identified homologs for 49 of the 60 unique sequences represented by the SGX structures. This result indicates that, for bacterial structures that are relatively easy to express, purify, and crystallize, the structural coverage of gene space is proceeding rapidly. More distant sequence‐structure relationships between the SGX and PDB structures were investigated using PDB‐BLAST and Combinatorial Extension (CE). Only one structure, SufD, has a truly unique topology compared to all folds in the PDB. Proteins 2005. © 2005 Wiley‐Liss, Inc.</jats:p>","journal":"Proteins: Structure, Function, and Bioinformatics","year":2005,"id":15110,"datarank":13.787699405690157,"base_score":5.5134287461649825,"endowment":5.5134287461649825,"self_citation_contribution":0.8270143119247475,"citation_network_contribution":12.96068509376541,"self_endowment_contribution":0.8270143119247475,"citer_contribution":12.96068509376541,"corpus_percentile":85.8,"corpus_rank":183,"citation_count":247,"citer_count":200,"citers_with_citation_signal":200,"citers_with_endowment":200,"datacite_reuse_total":4,"is_dataset":true,"is_dataset_confidence":null,"is_oa":false,"file_count":0,"downloads":0,"has_version_chain":false,"published_date":null,"algorithm_id":"datarank_citation_only_1hop_v6","ranking_scope":"data_only","authors":[{"id":116596,"name":"J.M. Sauder","orcid":null,"position":1,"is_corresponding":false},{"id":116597,"name":"J.M. Adams","orcid":null,"position":2,"is_corresponding":false},{"id":116598,"name":"S. Antonysamy","orcid":null,"position":3,"is_corresponding":false},{"id":116599,"name":"K. Bain","orcid":null,"position":4,"is_corresponding":false},{"id":116600,"name":"M.G. Bergseid","orcid":null,"position":5,"is_corresponding":false},{"id":116601,"name":"S.G. Buchanan","orcid":null,"position":6,"is_corresponding":false},{"id":116602,"name":"M.D. Buchanan","orcid":null,"position":7,"is_corresponding":false},{"id":116603,"name":"Y. Batiyenko","orcid":null,"position":8,"is_corresponding":false},{"id":116604,"name":"J.A. Christopher","orcid":null,"position":9,"is_corresponding":false},{"id":116605,"name":"S. Emtage","orcid":null,"position":10,"is_corresponding":false},{"id":116606,"name":"A. Eroshkina","orcid":null,"position":11,"is_corresponding":false},{"id":116607,"name":"I. Feil","orcid":null,"position":12,"is_corresponding":false},{"id":116608,"name":"E.B. Furlong","orcid":null,"position":13,"is_corresponding":false},{"id":116609,"name":"K.S. Gajiwala","orcid":null,"position":14,"is_corresponding":false},{"id":116610,"name":"X. Gao","orcid":null,"position":15,"is_corresponding":false},{"id":116611,"name":"D. He","orcid":null,"position":16,"is_corresponding":false},{"id":116612,"name":"J. Hendle","orcid":null,"position":17,"is_corresponding":false},{"id":116613,"name":"A. Huber","orcid":null,"position":18,"is_corresponding":false},{"id":116614,"name":"K. Hoda","orcid":null,"position":19,"is_corresponding":false},{"id":116615,"name":"P. Kearins","orcid":null,"position":20,"is_corresponding":false},{"id":116616,"name":"C. Kissinger","orcid":null,"position":21,"is_corresponding":false},{"id":116617,"name":"B. Laubert","orcid":null,"position":22,"is_corresponding":false},{"id":116618,"name":"H.A. Lewis","orcid":null,"position":23,"is_corresponding":false},{"id":58902,"name":"J. Lin","orcid":"0000-0002-3353-1559","position":24,"is_corresponding":false},{"id":116619,"name":"K. Loomis","orcid":null,"position":25,"is_corresponding":false},{"id":116620,"name":"D. Lorimer","orcid":null,"position":26,"is_corresponding":false},{"id":116621,"name":"G. Louie","orcid":null,"position":27,"is_corresponding":false},{"id":116622,"name":"M. Maletic","orcid":null,"position":28,"is_corresponding":false},{"id":116623,"name":"C.D. Marsh","orcid":null,"position":29,"is_corresponding":false},{"id":116624,"name":"I. Miller","orcid":null,"position":30,"is_corresponding":false},{"id":116625,"name":"J. Molinari","orcid":null,"position":31,"is_corresponding":false},{"id":116626,"name":"H.J. Muller‐Dieckmann","orcid":null,"position":32,"is_corresponding":false},{"id":116627,"name":"J.M. Newman","orcid":null,"position":33,"is_corresponding":false},{"id":116628,"name":"B.W. Noland","orcid":null,"position":34,"is_corresponding":false},{"id":116629,"name":"B. Pagarigan","orcid":null,"position":35,"is_corresponding":false},{"id":116630,"name":"F. Park","orcid":null,"position":36,"is_corresponding":false},{"id":116631,"name":"T.S. Peat","orcid":null,"position":37,"is_corresponding":false},{"id":116632,"name":"K.W. Post","orcid":null,"position":38,"is_corresponding":false},{"id":116633,"name":"S. Radojicic","orcid":null,"position":39,"is_corresponding":false},{"id":116634,"name":"A. Ramos","orcid":null,"position":40,"is_corresponding":false},{"id":116635,"name":"R. Romero","orcid":null,"position":41,"is_corresponding":false},{"id":116636,"name":"M.E. Rutter","orcid":null,"position":42,"is_corresponding":false},{"id":116637,"name":"W.E. Sanderson","orcid":null,"position":43,"is_corresponding":false},{"id":116638,"name":"K.D. Schwinn","orcid":null,"position":44,"is_corresponding":false},{"id":116639,"name":"J. Tresser","orcid":null,"position":45,"is_corresponding":false},{"id":116640,"name":"J. Winhoven","orcid":null,"position":46,"is_corresponding":false},{"id":116641,"name":"T.A. Wright","orcid":null,"position":47,"is_corresponding":false},{"id":116643,"name":"L. Wu","orcid":null,"position":48,"is_corresponding":false},{"id":18961,"name":"J. Xu","orcid":null,"position":49,"is_corresponding":false},{"id":116646,"name":"T.J.R. Harris","orcid":null,"position":50,"is_corresponding":false},{"id":116595,"name":"J. Badger","orcid":null,"position":0,"is_corresponding":false}],"reference_count":0,"raw_metadata":{"has_enrichment":true,"base_score":5.5134287461649825,"endowment":5.5134287461649825,"datacite_reuse_total":4,"file_count":0,"downloads":0,"views":0,"has_version_chain":false,"is_dataset":false,"is_oa":false,"pmid":"16021622","pmcid":null,"openalex_id":"https://openalex.org/W2116774744","authors":[],"funders":[],"total_grants":0,"fwci":10.753,"citation_percentile":0.99116281,"influential_citations":7,"citation_trend":[{"year":2012,"count":25},{"year":2013,"count":16},{"year":2014,"count":16},{"year":2015,"count":11},{"year":2016,"count":6},{"year":2017,"count":12},{"year":2018,"count":8},{"year":2019,"count":8},{"year":2020,"count":13},{"year":2021,"count":5},{"year":2022,"count":3},{"year":2023,"count":4},{"year":2024,"count":16},{"year":2025,"count":10},{"year":2026,"count":2}],"oa_status":"closed","license":"http://onlinelibrary.wiley.com/termsAndConditions#vor","oa_locations":[{"url":"https://api.wiley.com/onlinelibrary/tdm/v1/articles/10.1002%2Fprot.20541","host_type":"publisher"},{"url":"https://onlinelibrary.wiley.com/doi/pdf/10.1002/prot.20541","host_type":"publisher"},{"url":"https://doi.org/10.1002/prot.20541","host_type":"journal"},{"url":"https://pubmed.ncbi.nlm.nih.gov/16021622","host_type":"repository"}],"fields_of_study":["Enzyme Structure and Function","Protein Structure and Dynamics","RNA and protein synthesis mechanisms","Medicine","Biology","Computer Science","Databases, Protein","Enzymes","Escherichia coli","Escherichia coli Proteins","Genome, Bacterial","Genomics","Models, Molecular","Protein Conformation","Regression Analysis","X-Ray Diffraction"],"mesh_terms":["Enzymes","Escherichia coli","Models, Molecular","Protein Conformation","Regression Analysis","X-Ray Diffraction","Genome, Bacterial","Genomics","Escherichia coli Proteins","Databases, Protein"],"keywords":["Protein Data Bank","Structural genomics","Protein Data Bank (RCSB PDB)","Protein structure database","Computational biology","Molecular replacement","Protein structure","Genomics","Sequence (biology)","Sequence alignment","Data mining","Computer science","Crystallography","Gene","Biology","Genetics","Peptide sequence","Chemistry","Genome","Biochemistry","Sequence database"],"sdg_mappings":[],"linked_datasets":[{"doi":"10.6084/m9.figshare.26709893.v1","title":"Additional file 1 of Cloning, heterologous expression and purification of the novel thermo-alkalistable cellulase from Geobacillus sp. TP-3 and its molecular characterisation","publisher":"figshare","resource_type":"Presentation"},{"doi":"10.6084/m9.figshare.26709893","title":"Additional file 1 of Cloning, heterologous expression and purification of the novel thermo-alkalistable cellulase from Geobacillus sp. TP-3 and its molecular characterisation","publisher":"figshare","resource_type":"Presentation"},{"doi":"10.6084/m9.figshare.26709896.v1","title":"Additional file 2 of Cloning, heterologous expression and purification of the novel thermo-alkalistable cellulase from Geobacillus sp. TP-3 and its molecular characterisation","publisher":"figshare","resource_type":"JournalArticle"},{"doi":"10.6084/m9.figshare.26709896","title":"Additional file 2 of Cloning, heterologous expression and purification of the novel thermo-alkalistable cellulase from Geobacillus sp. TP-3 and its molecular characterisation","publisher":"figshare","resource_type":"JournalArticle"}],"clinical_trials":[],"software_tools":[],"database_accessions":[],"source":"live","citation_network_status":"fetched"},"created_at":"2026-06-01T16:25:28.201478Z","pmid":null,"pmcid":null,"fwci":null,"citation_percentile":null,"influential_citations":0,"oa_status":null,"license":null,"views":0,"total_file_size_bytes":0,"version_count":0,"clinical_trials":[],"software_tools":[],"db_accessions":[],"linked_datasets":[],"topics":[]}