{"doi":"10.1101/gr.120477.111","title":"De novo discovery of mutated driver pathways in cancer","abstract":"<jats:p>Next-generation DNA sequencing technologies are enabling genome-wide measurements of somatic mutations in large numbers of cancer patients. A major challenge in the interpretation of these data is to distinguish functional “driver mutations” important for cancer development from random “passenger mutations.” A common approach for identifying driver mutations is to find genes that are mutated at significant frequency in a large cohort of cancer genomes. This approach is confounded by the observation that driver mutations target multiple cellular signaling and regulatory pathways. Thus, each cancer patient may exhibit a different combination of mutations that are sufficient to perturb these pathways. This mutational heterogeneity presents a problem for predicting driver mutations solely from their frequency of occurrence. We introduce two combinatorial properties, coverage and exclusivity, that distinguish driver pathways, or groups of genes containing driver mutations, from groups of genes with passenger mutations. We derive two algorithms, called Dendrix, to find driver pathways de novo from somatic mutation data. We apply Dendrix to analyze somatic mutation data from 623 genes in 188 lung adenocarcinoma patients, 601 genes in 84 glioblastoma patients, and 238 known mutations in 1000 patients with various cancers. In all data sets, we find groups of genes that are mutated in large subsets of patients and whose mutations are approximately exclusive. Our Dendrix algorithms scale to whole-genome analysis of thousands of patients and thus will prove useful for larger data sets to come from The Cancer Genome Atlas (TCGA) and other large-scale cancer genome sequencing projects.</jats:p>","journal":"Genome Research","year":2012,"id":32741,"datarank":11.995767252371962,"base_score":6.131226489483141,"endowment":6.131226489483141,"self_citation_contribution":0.9196839734224713,"citation_network_contribution":11.076083278949492,"self_endowment_contribution":0.9196839734224713,"citer_contribution":11.076083278949492,"corpus_percentile":null,"corpus_rank":null,"citation_count":459,"citer_count":200,"citers_with_citation_signal":200,"citers_with_endowment":200,"datacite_reuse_total":18,"is_dataset":false,"is_dataset_confidence":null,"is_oa":false,"file_count":0,"downloads":0,"has_version_chain":false,"published_date":null,"algorithm_id":"datarank_citation_only_1hop_v6","ranking_scope":"data_only","authors":[{"id":78053,"name":"Eli Upfal","orcid":"0000-0002-9321-9460","position":1,"is_corresponding":false},{"id":106817,"name":"Benjamin J. Raphael","orcid":"0000-0003-1274-048X","position":2,"is_corresponding":false},{"id":1794,"name":"Fabio Vandin","orcid":"0000-0003-2244-2320","position":0,"is_corresponding":false}],"reference_count":0,"raw_metadata":{"has_enrichment":true,"base_score":6.131226489483141,"endowment":6.131226489483141,"datacite_reuse_total":18,"file_count":0,"downloads":0,"views":0,"has_version_chain":false,"is_dataset":false,"is_oa":false,"pmid":"21653252","pmcid":"PMC3266044","openalex_id":"https://openalex.org/W2121443461","authors":[],"funders":[],"total_grants":0,"fwci":null,"citation_percentile":null,"influential_citations":0,"citation_trend":[{"year":2012,"count":15},{"year":2013,"count":27},{"year":2014,"count":41},{"year":2015,"count":53},{"year":2016,"count":57},{"year":2017,"count":43},{"year":2018,"count":24},{"year":2019,"count":42},{"year":2020,"count":34},{"year":2021,"count":36},{"year":2022,"count":30},{"year":2023,"count":18},{"year":2024,"count":21},{"year":2025,"count":11},{"year":2026,"count":6}],"oa_status":"bronze","license":"other-oa","oa_locations":[{"url":"https://genome.cshlp.org/content/22/2/375.full.pdf","host_type":"journal"},{"url":"https://genome.cshlp.org/content/22/2/375.full.pdf","host_type":"publisher"},{"url":"https://syndication.highwire.org/content/doi/10.1101/gr.120477.111","host_type":"publisher"},{"url":"https://doi.org/10.1101/gr.120477.111","host_type":"journal"},{"url":"https://pubmed.ncbi.nlm.nih.gov/21653252","host_type":"repository"},{"url":"http://hdl.handle.net/11577/3179973","host_type":"repository"},{"url":"http://genome.cshlp.org/cgi/content/short/22/2/375","host_type":"repository"},{"url":"https://www.ncbi.nlm.nih.gov/pmc/articles/3266044","host_type":"repository"},{"url":"http://genome.cshlp.org/content/22/2/375.full.pdf","host_type":"repository"}],"fields_of_study":["Cancer Genomics and Diagnostics","Bioinformatics and Genomic Networks","Genomics and Rare Diseases","Algorithms","Computational Biology","Computer Simulation","Humans","Internet","Models, Genetic","Mutation","Neoplasms","Signal Transduction","Software"],"mesh_terms":["Algorithms","Computer Simulation","Humans","Models, Genetic","Mutation","Neoplasms","Software","Signal Transduction","Computational Biology","Internet"],"keywords":["Biology","Genetics","Gene","Genome","Mutation","Cancer genome sequencing","Cancer","Germline mutation","Computational biology","Somatic cell","Genomics"],"sdg_mappings":[],"linked_datasets":[{"doi":"10.6084/m9.figshare.19619538.v1","title":"Additional file 1 of De novo identification of maximally deregulated subnetworks based on multi-omics data with DeRegNet","publisher":"figshare","resource_type":"JournalArticle"},{"doi":"10.6084/m9.figshare.19619538","title":"Additional file 1 of De novo identification of maximally deregulated subnetworks based on multi-omics data with DeRegNet","publisher":"figshare","resource_type":"JournalArticle"},{"doi":"10.6084/m9.figshare.19619541.v1","title":"Additional file 2 of De novo identification of maximally deregulated subnetworks based on multi-omics data with DeRegNet","publisher":"figshare","resource_type":"JournalArticle"},{"doi":"10.6084/m9.figshare.19619541","title":"Additional file 2 of De novo identification of maximally deregulated subnetworks based on multi-omics data with DeRegNet","publisher":"figshare","resource_type":"JournalArticle"},{"doi":"10.6084/m9.figshare.25357839.v1","title":"Additional file 1 of Gsw-fi: a GLM model incorporating shrinkage and double-weighted strategies for identifying cancer driver genes with functional impact","publisher":"figshare","resource_type":"JournalArticle"},{"doi":"10.6084/m9.figshare.25357839","title":"Additional file 1 of Gsw-fi: a GLM model incorporating shrinkage and double-weighted strategies for identifying cancer driver genes with functional impact","publisher":"figshare","resource_type":"JournalArticle"},{"doi":"10.6084/m9.figshare.26689595.v1","title":"Additional file 5 of Gsw-fi: a GLM model incorporating shrinkage and double-weighted strategies for identifying cancer driver genes with functional impact","publisher":"figshare","resource_type":"Dataset"},{"doi":"10.6084/m9.figshare.26689592","title":"Additional file 4 of Gsw-fi: a GLM model incorporating shrinkage and double-weighted strategies for identifying cancer driver genes with functional impact","publisher":"figshare","resource_type":"Dataset"},{"doi":"10.6084/m9.figshare.26689598.v1","title":"Additional file 6 of Gsw-fi: a GLM model incorporating shrinkage and double-weighted strategies for identifying cancer driver genes with functional impact","publisher":"figshare","resource_type":"Dataset"},{"doi":"10.6084/m9.figshare.26689592.v1","title":"Additional file 4 of Gsw-fi: a GLM model incorporating shrinkage and double-weighted strategies for identifying cancer driver genes with functional impact","publisher":"figshare","resource_type":"Dataset"},{"doi":"10.6084/m9.figshare.26689598","title":"Additional file 6 of Gsw-fi: a GLM model incorporating shrinkage and double-weighted strategies for identifying cancer driver genes with functional impact","publisher":"figshare","resource_type":"Dataset"},{"doi":"10.6084/m9.figshare.26689601","title":"Additional file 7 of Gsw-fi: a GLM model incorporating shrinkage and double-weighted strategies for identifying cancer driver genes with functional impact","publisher":"figshare","resource_type":"Dataset"},{"doi":"10.6084/m9.figshare.25357955.v1","title":"Additional file 2 of Gsw-fi: a GLM model incorporating shrinkage and double-weighted strategies for identifying cancer driver genes with functional impact","publisher":"figshare","resource_type":"Dataset"},{"doi":"10.6084/m9.figshare.26689589","title":"Additional file 3 of Gsw-fi: a GLM model incorporating shrinkage and double-weighted strategies for identifying cancer driver genes with functional impact","publisher":"figshare","resource_type":"Dataset"},{"doi":"10.6084/m9.figshare.26689595","title":"Additional file 5 of Gsw-fi: a GLM model incorporating shrinkage and double-weighted strategies for identifying cancer driver genes with functional impact","publisher":"figshare","resource_type":"Dataset"},{"doi":"10.6084/m9.figshare.26689601.v1","title":"Additional file 7 of Gsw-fi: a GLM model incorporating shrinkage and double-weighted strategies for identifying cancer driver genes with functional impact","publisher":"figshare","resource_type":"Dataset"},{"doi":"10.6084/m9.figshare.25357955","title":"Additional file 2 of Gsw-fi: a GLM model incorporating shrinkage and double-weighted strategies for identifying cancer driver genes with functional impact","publisher":"figshare","resource_type":"Dataset"},{"doi":"10.6084/m9.figshare.26689589.v1","title":"Additional file 3 of Gsw-fi: a GLM model incorporating shrinkage and double-weighted strategies for identifying cancer driver genes with functional impact","publisher":"figshare","resource_type":"Dataset"}],"clinical_trials":[],"software_tools":[],"database_accessions":[],"source":"live","citation_network_status":"fetched"},"created_at":"2026-06-09T14:23:39.638245Z","pmid":null,"pmcid":null,"fwci":null,"citation_percentile":null,"influential_citations":0,"oa_status":null,"license":null,"views":0,"total_file_size_bytes":0,"version_count":0,"clinical_trials":[],"software_tools":[],"db_accessions":[],"linked_datasets":[],"topics":[]}