{"doi":"10.1101/gr.169243.113","title":"Dissection of thousands of cell type-specific enhancers identifies dinucleotide repeat motifs as general enhancer features","abstract":"<jats:p>Gene expression is determined by genomic elements called enhancers, which contain short motifs bound by different transcription factors (TFs). However, how enhancer sequences and TF motifs relate to enhancer activity is unknown, and general sequence requirements for enhancers or comprehensive sets of important enhancer sequence elements have remained elusive. Here, we computationally dissect thousands of functional enhancer sequences from three different <jats:italic>Drosophila</jats:italic> cell lines. We find that the enhancers display distinct <jats:italic>cis</jats:italic>-regulatory sequence signatures, which are predictive of the enhancers’ cell type-specific or broad activities. These signatures contain transcription factor motifs and a novel class of enhancer sequence elements, dinucleotide repeat motifs (DRMs). DRMs are highly enriched in enhancers, particularly in enhancers that are broadly active across different cell types. We experimentally validate the importance of the identified TF motifs and DRMs for enhancer function and show that they can be sufficient to create an active enhancer de novo from a nonfunctional sequence. 
The function of DRMs as a novel class of general enhancer features that are also enriched in human regulatory regions might explain their implication in several diseases and provides important insights into gene regulation.</jats:p>","journal":"Genome Research","year":2014,"id":14195,"datarank":6.277835811123183,"base_score":5.049856007249537,"endowment":5.049856007249537,"self_citation_contribution":0.7574784010874307,"citation_network_contribution":5.520357410035753,"self_endowment_contribution":0.7574784010874307,"citer_contribution":5.520357410035753,"corpus_percentile":null,"corpus_rank":null,"citation_count":155,"citer_count":133,"citers_with_citation_signal":121,"citers_with_endowment":121,"datacite_reuse_total":10,"is_dataset":false,"is_dataset_confidence":null,"is_oa":false,"file_count":0,"downloads":0,"has_version_chain":false,"published_date":null,"algorithm_id":"datarank_citation_only_1hop_v6","ranking_scope":"data_only","authors":[{"id":113278,"name":"Cosmas D. Arnold","orcid":null,"position":1,"is_corresponding":false},{"id":113279,"name":"Gerald Stampfel","orcid":null,"position":2,"is_corresponding":false},{"id":113280,"name":"Łukasz M. Boryń","orcid":null,"position":3,"is_corresponding":false},{"id":113281,"name":"Daniel Gerlach","orcid":null,"position":4,"is_corresponding":false},{"id":113282,"name":"Martina Rath","orcid":null,"position":5,"is_corresponding":false},{"id":14954,"name":"Alexander Stark","orcid":"0000-0003-2611-0841","position":6,"is_corresponding":false},{"id":113277,"name":"J. 
Omar Yáñez-Cuna","orcid":null,"position":0,"is_corresponding":false}],"reference_count":0,"raw_metadata":{"has_enrichment":true,"base_score":5.049856007249537,"endowment":5.049856007249537,"datacite_reuse_total":10,"file_count":0,"downloads":0,"views":0,"has_version_chain":false,"is_dataset":false,"is_oa":false,"pmid":"24714811","pmcid":"PMC4079970","openalex_id":"https://openalex.org/W2118646732","authors":[],"funders":[{"funder_name":"European Research Council","grant_id":"242922","title":"Regulatory Genomics in Drosophila"}],"total_grants":1,"fwci":null,"citation_percentile":null,"influential_citations":10,"citation_trend":[{"year":2014,"count":7},{"year":2015,"count":15},{"year":2016,"count":14},{"year":2017,"count":19},{"year":2018,"count":16},{"year":2019,"count":9},{"year":2020,"count":12},{"year":2021,"count":22},{"year":2022,"count":11},{"year":2023,"count":10},{"year":2024,"count":8},{"year":2025,"count":11},{"year":2026,"count":1}],"oa_status":"bronze","license":"CC BY NC","oa_locations":[{"url":"http://genome.cshlp.org/content/24/7/1147.full.pdf","host_type":"journal"},{"url":"http://genome.cshlp.org/content/24/7/1147.full.pdf","host_type":"HYBRID"},{"url":"http://genome.cshlp.org/content/24/7/1147.full.pdf","host_type":"publisher"},{"url":"https://syndication.highwire.org/content/doi/10.1101/gr.169243.113","host_type":"publisher"},{"url":"https://doi.org/10.1101/gr.169243.113","host_type":"journal"},{"url":"https://pubmed.ncbi.nlm.nih.gov/24714811","host_type":"repository"},{"url":"https://www.ncbi.nlm.nih.gov/pmc/articles/4079970","host_type":"repository"},{"url":"http://genome.cshlp.org/cgi/content/short/24/7/1147","host_type":"repository"},{"url":"https://europepmc.org/articles/PMC4079970","host_type":"Europe_PMC"},{"url":"https://europepmc.org/articles/PMC4079970?pdf=render","host_type":"Europe_PMC"},{"url":"http://dx.doi.org/10.1101/gr.169243.113","host_type":""},{"url":"https://dx.doi.org/10.1101/gr.169243.113","host_type":""}],"fields_of_study":[
"Genomics and Chromatin Dynamics","RNA Research and Splicing","RNA and protein synthesis mechanisms","Medicine","Biology","0301 basic medicine","0303 health sciences","03 medical and health sciences","Animals","Base Sequence","Cell Line","Dinucleotide Repeats","Drosophila melanogaster","Enhancer Elements, Genetic","Gene Expression Regulation","Models, Biological","Nucleotide Motifs","Organ Specificity","Transcription Factors"],"mesh_terms":["Animals","Base Sequence","Cell Line","Drosophila melanogaster","Enhancer Elements, Genetic","Gene Expression Regulation","Models, Biological","Organ Specificity","Transcription Factors","Dinucleotide Repeats","Nucleotide Motifs"],"keywords":["Enhancer","Biology","Enhancer RNAs","Transcription factor","Computational biology","Genetics","Gene","Regulatory sequence","Sequence motif","Regulation of gene expression","Base Sequence","Research","Models, Biological","Cell Line","Drosophila melanogaster","Enhancer Elements, Genetic","Gene Expression Regulation","Organ Specificity","Animals","Nucleotide Motifs","Dinucleotide Repeats","Transcription Factors"],"sdg_mappings":[],"linked_datasets":[{"doi":"10.6084/m9.figshare.14168590.v1","title":"Additional file 1 of Identification of super enhancer-associated key genes for prognosis of germinal center B-cell type diffuse large B-cell lymphoma by integrated analysis","publisher":"figshare","resource_type":"JournalArticle"},{"doi":"10.6084/m9.figshare.14168590","title":"Additional file 1 of Identification of super enhancer-associated key genes for prognosis of germinal center B-cell type diffuse large B-cell lymphoma by integrated analysis","publisher":"figshare","resource_type":"JournalArticle"},{"doi":"10.6084/m9.figshare.16960406","title":"Additional file 1 of An explainable artificial intelligence approach for decoding the enhancer histone modifications code and identification of novel enhancers in 
Drosophila","publisher":"figshare","resource_type":"JournalArticle"},{"doi":"10.6084/m9.figshare.16960406.v1","title":"Additional file 1 of An explainable artificial intelligence approach for decoding the enhancer histone modifications code and identification of novel enhancers in Drosophila","publisher":"figshare","resource_type":"JournalArticle"},{"doi":"10.6084/m9.figshare.16960409.v1","title":"Additional file 2 of An explainable artificial intelligence approach for decoding the enhancer histone modifications code and identification of novel enhancers in Drosophila","publisher":"figshare","resource_type":"JournalArticle"},{"doi":"10.6084/m9.figshare.16960409","title":"Additional file 2 of An explainable artificial intelligence approach for decoding the enhancer histone modifications code and identification of novel enhancers in Drosophila","publisher":"figshare","resource_type":"JournalArticle"},{"doi":"10.6084/m9.figshare.20336728.v1","title":"Additional file 1 of High-throughput techniques enable advances in the roles of DNA and RNA secondary structures in transcriptional and post-transcriptional gene regulation","publisher":"figshare","resource_type":"JournalArticle"},{"doi":"10.6084/m9.figshare.20336728","title":"Additional file 1 of High-throughput techniques enable advances in the roles of DNA and RNA secondary structures in transcriptional and post-transcriptional gene regulation","publisher":"figshare","resource_type":"JournalArticle"},{"doi":"10.6084/m9.figshare.25876085.v1","title":"Additional file 1 of Su(Hw) interacts with Combgap to establish long-range chromatin contacts","publisher":"figshare","resource_type":"JournalArticle"},{"doi":"10.6084/m9.figshare.25876085","title":"Additional file 1 of Su(Hw) interacts with Combgap to establish long-range chromatin 
contacts","publisher":"figshare","resource_type":"JournalArticle"}],"clinical_trials":[],"software_tools":[],"database_accessions":[{"name":"geo"},{"name":"igsr"}],"source":"live","citation_network_status":"fetched"},"created_at":"2026-06-01T05:54:42.590622Z","pmid":null,"pmcid":null,"fwci":null,"citation_percentile":null,"influential_citations":0,"oa_status":null,"license":null,"views":0,"total_file_size_bytes":0,"version_count":0,"clinical_trials":[],"software_tools":[],"db_accessions":[],"linked_datasets":[],"topics":[]}