{"doi":"10.1158/1055-9965.epi-04-0832","title":"Examining Population Stratification via Individual Ancestry Estimates versus Self-Reported Race","abstract":"<jats:title>Abstract</jats:title>\n               <jats:p>Population stratification has the potential to affect the results of genetic marker studies. Estimating individual ancestry provides a continuous measure to assess population structure in case-control studies of complex disease, instead of using self-reported racial groups. We estimate individual ancestry using the Federal Bureau of Investigation CODIS Core short tandem repeat set of 13 loci using two different analysis methods in a case-control study of early-onset lung cancer. Individual ancestry proportions were estimated for “European” and “West African” groups using published allele frequencies. The majority of Caucasian, non-Hispanics had &gt;50% European ancestry, whereas the majority of African Americans had &lt;20% European ancestry, regardless of ancestry estimation method, although significant overlap by self-reported race and ancestry also existed. When we further investigated the effect of ancestry and self-reported race on the frequency of a lung cancer risk genotype, we found that the frequency of the GSTM1 null genotype varies by individual European ancestry and case-control status within self-reported race (particularly for African Americans). Genetic risk models showed that adjusting for individual European ancestry provided a better fit to the data compared with the model with no group adjustment or adjustment for self-reported race. 
This study suggests that significant population substructure differences exist that self-reported race alone does not capture and that individual ancestry may be confounded with disease status and/or a candidate gene risk genotype.</jats:p>","journal":"Cancer Epidemiology, Biomarkers & Prevention","year":2005,"id":19166,"datarank":5.18989015640365,"base_score":4.430816798843313,"endowment":4.430816798843313,"self_citation_contribution":0.6646225198264971,"citation_network_contribution":4.525267636577153,"self_endowment_contribution":0.6646225198264971,"citer_contribution":4.525267636577153,"corpus_percentile":null,"corpus_rank":null,"citation_count":83,"citer_count":74,"citers_with_citation_signal":66,"citers_with_endowment":66,"datacite_reuse_total":0,"is_dataset":false,"is_dataset_confidence":null,"is_oa":false,"file_count":0,"downloads":0,"has_version_chain":false,"published_date":null,"fair_score":null,"fair_percentile":null,"algorithm_id":"datarank_citation_only_1hop_v6","ranking_scope":"data_only","authors":[{"id":130677,"name":"Ranajit Chakraborty","orcid":null,"position":1,"is_corresponding":false},{"id":6444,"name":"Thomas A. Sellers","orcid":"0000-0002-7832-0405","position":2,"is_corresponding":false},{"id":71792,"name":"Ann G. Schwartz","orcid":"0000-0002-9525-1157","position":3,"is_corresponding":false},{"id":130676,"name":"Jill S. 
Barnholtz-Sloan","orcid":null,"position":0,"is_corresponding":false}],"reference_count":0,"raw_metadata":{"has_enrichment":true,"base_score":4.430816798843313,"endowment":4.430816798843313,"datacite_reuse_total":0,"file_count":0,"downloads":0,"views":0,"has_version_chain":false,"is_dataset":false,"is_oa":false,"pmid":"15941970","pmcid":null,"openalex_id":"https://openalex.org/W2143211805","authors":[],"funders":[{"funder_name":"NCI NIH HHS","grant_id":"K07 CA91849","title":null},{"funder_name":"NCI NIH HHS","grant_id":"N01 PC35145","title":null},{"funder_name":"NCI NIH HHS","grant_id":"CA60691","title":null}],"total_grants":3,"fwci":5.7897,"citation_percentile":0.96577017,"influential_citations":1,"citation_trend":[{"year":2012,"count":3},{"year":2013,"count":4},{"year":2014,"count":6},{"year":2015,"count":3},{"year":2016,"count":2},{"year":2017,"count":2},{"year":2018,"count":3},{"year":2019,"count":2},{"year":2020,"count":1},{"year":2021,"count":2},{"year":2022,"count":2},{"year":2023,"count":3},{"year":2024,"count":1},{"year":2025,"count":1}],"oa_status":"bronze","license":null,"oa_locations":[{"url":"https://aacrjournals.org/cebp/article-pdf/14/6/1545/1744932/1545-1551.pdf","host_type":"journal"},{"url":"https://aacrjournals.org/cebp/article-pdf/14/6/1545/1744932/1545-1551.pdf","host_type":"BRONZE"},{"url":"https://aacrjournals.org/cebp/article-pdf/14/6/1545/1744932/1545-1551.pdf","host_type":"publisher"},{"url":"https://doi.org/10.1158/1055-9965.epi-04-0832","host_type":"journal"},{"url":"https://pubmed.ncbi.nlm.nih.gov/15941970","host_type":"repository"}],"fields_of_study":["Genetic Associations and Epidemiology","BRCA gene mutations in cancer","Race, Genetics, and Society","Biology","Medicine","Black People","Case-Control Studies","Genotype","Glutathione Transferase","Humans","Lung Neoplasms","Models, Theoretical","Pedigree","Reproducibility of Results","Risk Assessment","Tandem Repeat Sequences","White People","Black or African 
American"],"mesh_terms":["Black or African American","Genotype","Glutathione Transferase","Humans","Lung Neoplasms","Models, Theoretical","Pedigree","Reproducibility of Results","Case-Control Studies","Risk Assessment","Tandem Repeat Sequences","Black People","White People"],"keywords":["Ancestry-informative marker","Population stratification","Genetic genealogy","Race (biology)","Population","Demography","Genotype","Allele frequency","Allele","Biology","Genetics","Single-nucleotide polymorphism","Gene"],"sdg_mappings":[{"sdg_number":0,"sdg_label":"Good health and well-being"}],"linked_datasets":[],"clinical_trials":[],"software_tools":[],"database_accessions":[],"source":"live","citation_network_status":"fetched"},"created_at":"2026-06-04T02:18:51.480491Z","pmid":null,"pmcid":null,"fwci":null,"citation_percentile":null,"influential_citations":0,"oa_status":null,"license":null,"views":0,"total_file_size_bytes":0,"version_count":0,"fair_f":null,"fair_a":null,"fair_i":null,"fair_r":null,"fair_zscore":null,"fair_rationale":null,"fair_model":null,"fair_agent_version":null,"fair_fulltext_source":null,"fair_has_llm":null,"fair_computed_at":null,"clinical_trials":[],"software_tools":[],"db_accessions":[],"linked_datasets":[],"topics":[]}