{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,30]],"date-time":"2025-10-30T22:16:32Z","timestamp":1761862592895},"reference-count":29,"publisher":"Oxford University Press (OUP)","issue":"23","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2006,12,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Predicting cis-regulatory modules (CRMs) in higher eukaryotes is a challenging computational task. Commonly used methods to predict CRMs based on the signal of transcription factor binding sites (TFBS) are limited by prior information about transcription factor specificity. More general methods that bypass the reliance on TFBS models are needed for comprehensive CRM prediction.<\/jats:p>\n               <jats:p>Results: We have developed a method to predict CRMs called CisPlusFinder that identifies high density regions of perfect local ungapped sequences (PLUSs) based on multiple species conservation. By assuming that PLUSs contain core TFBS motifs that are locally overrepresented, the method attempts to capture the expected features of CRM structure and evolution. Applied to a benchmark dataset of CRMs involved in early Drosophila development, CisPlusFinder predicts more annotated CRMs than all other methods tested. Using the REDfly database, we find that some \u2018false positive\u2019 predictions in the benchmark dataset correspond to recently annotated CRMs. Our work demonstrates that CRM prediction methods that combine comparative genomic data with statistical properties of DNA may achieve reasonable performance when applied genome-wide in the absence of an a priori set of known TFBS motifs.<\/jats:p>\n               <jats:p>Availability: The program CisPlusFinder can be downloaded at . All software is licensed under the Lesser GNU Public License (LGPL).<\/jats:p>\n               <jats:p>Contact: \u00a0nora.pierstorff@uni-koeln.de.<\/jats:p>\n               <jats:p>Supplementary information: Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btl499","type":"journal-article","created":{"date-parts":[[2006,10,11]],"date-time":"2006-10-11T02:51:11Z","timestamp":1160535071000},"page":"2858-2864","source":"Crossref","is-referenced-by-count":30,"title":["Identifying <i>cis<\/i>-regulatory modules by combining comparative and compositional analysis of DNA"],"prefix":"10.1093","volume":"22","author":[{"given":"Nora","family":"Pierstorff","sequence":"first","affiliation":[{"name":"Institute for Genetics, University of Cologne 1 \u00a0 1 \u00a0 \u00a0 Zuelpicher Strasse 47, 50674 Cologne, Germany"},{"name":"Faculty of Life Sciences, University of Manchester 2 \u00a0 2 \u00a0 \u00a0 Michael Smith Building, Oxford Road, M13 9PT Manchester, UK"}]},{"given":"Casey M.","family":"Bergman","sequence":"additional","affiliation":[{"name":"Faculty of Life Sciences, University of Manchester 2 \u00a0 2 \u00a0 \u00a0 Michael Smith Building, Oxford Road, M13 9PT Manchester, UK"}]},{"given":"Thomas","family":"Wiehe","sequence":"additional","affiliation":[{"name":"Institute for Genetics, University of Cologne 1 \u00a0 1 \u00a0 \u00a0 Zuelpicher Strasse 47, 50674 Cologne, Germany"}]}],"member":"286","published-online":{"date-parts":[[2006,10,10]]},"reference":[{"key":"2023012409221766800_b1","doi-asserted-by":"crossref","first-page":"3596","DOI":"10.1093\/bioinformatics\/bti609","article-title":"JIGSAW: integration of multiple sources of evidence for gene prediction","volume":"21","author":"Allen","year":"2005","journal-title":"Bioinformatics"},{"key":"2023012409221766800_b2","doi-asserted-by":"crossref","first-page":"403","DOI":"10.1016\/S0022-2836(05)80360-2","article-title":"Basic local alignment search tool","volume":"215","author":"Altschul","year":"1990","journal-title":"J. Mol. Biol."},{"key":"2023012409221766800_b3","doi-asserted-by":"crossref","first-page":"II16","DOI":"10.1093\/bioinformatics\/btg1054","article-title":"Searching for statistically significant regulatory modules","volume":"19","author":"Bailey","year":"2003","journal-title":"Bioinformatics"},{"key":"2023012409221766800_b4","doi-asserted-by":"crossref","first-page":"1747","DOI":"10.1093\/bioinformatics\/bti173","article-title":"Drosophila DNase I footprint database: a systematic genome annotation of transcription factor binding sites in the fruitfly, D.melanogaster","volume":"21","author":"Bergman","year":"2005","journal-title":"Bioinformatics"},{"key":"2023012409221766800_b5","first-page":"757","article-title":"Exploiting transcription factor binding site clustering to identify cis-regulatory modules involved in pattern formation in the Drosophila genome","volume":"2","author":"Berman","year":"2004","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012409221766800_b6","doi-asserted-by":"crossref","first-page":"1391","DOI":"10.1126\/science.1081331","article-title":"Phylogenetic shadowing of primate sequences to find functional regions of the human genome","volume":"299","author":"Bofelli","year":"2003","journal-title":"Science"},{"key":"2023012409221766800_b7","doi-asserted-by":"crossref","first-page":"262","DOI":"10.1186\/1471-2105-6-262","article-title":"Using hexamers to predict cis-regulatory motifs in Drosophila","volume":"6","author":"Chan","year":"2005","journal-title":"BMC Bioinformatics"},{"key":"2023012409221766800_b8","doi-asserted-by":"crossref","first-page":"13850","DOI":"10.1074\/jbc.270.23.13850","article-title":"Evidence for functional binding and stable sliding of the TATA binding protein on nonspecific DNA","volume":"270","author":"Coleman","year":"1995","journal-title":"J. Biol. Chem."},{"key":"2023012409221766800_b9","doi-asserted-by":"crossref","first-page":"57","DOI":"10.1186\/1471-2105-4-57","article-title":"Conservation of regulatory elements between two species of Drosophila","volume":"4","author":"Emberly","year":"2003","journal-title":"BMC Bioinformatics"},{"key":"2023012409221766800_b10","doi-asserted-by":"crossref","first-page":"3666","DOI":"10.1093\/nar\/gkg540","article-title":"Cluster-Buster: finding dense clusters of motifs in DNA sequences","volume":"31","author":"Frith","year":"2003","journal-title":"Nucleic Acids Res."},{"key":"2023012409221766800_b11","doi-asserted-by":"crossref","first-page":"381","DOI":"10.1093\/bioinformatics\/bti794","article-title":"REDfly: a regulatory element database for Drosophila","volume":"22","author":"Gallo","year":"2005","journal-title":"Bioinformatics"},{"key":"2023012409221766800_b12","doi-asserted-by":"crossref","first-page":"2738","DOI":"10.1093\/bioinformatics\/bth320","article-title":"Prediction of similarly acting cis-regulatory modules by subsequence profiling and comparative genomics in Drosophila melanogaster and D. pseudoobscura","volume":"20","author":"Grad","year":"2004","journal-title":"Bioinformatics"},{"key":"2023012409221766800_b13","doi-asserted-by":"crossref","DOI":"10.1017\/CBO9780511574931","volume-title":"Algorithms on Strings, Trees and Sequences","author":"Gusfield","year":"1997"},{"key":"2023012409221766800_b14","doi-asserted-by":"crossref","first-page":"160","DOI":"10.1007\/BF02101694","article-title":"Dating of the human-ape splitting by a molecular clock of mitochondrial DNA","volume":"22","author":"Hasegawa","year":"1985","journal-title":"J. Mol. Evol."},{"key":"2023012409221766800_b15","first-page":"169","article-title":"Identification of functional clusters of transcription factor binding motifs in genomic sequences: the MSCAN algorithm","volume":"19","author":"Johansson","year":"2003","journal-title":"Nucleic Acids Res."},{"key":"2023012409221766800_b16","doi-asserted-by":"crossref","first-page":"996","DOI":"10.1101\/gr.229102","article-title":"The human genome browser at UCSC","volume":"12","author":"Kent","year":"2002","journal-title":"Genome Res."},{"key":"2023012409221766800_b17","doi-asserted-by":"crossref","first-page":"R356","DOI":"10.1016\/j.cub.2006.03.082","article-title":"Positive selction on gene expression in the human brain","volume":"16","author":"Khaitovich","year":"2006","journal-title":"Curr. Biol."},{"key":"2023012409221766800_b18","doi-asserted-by":"crossref","first-page":"149","DOI":"10.1016\/0022-2836(87)90517-1","article-title":"Kinetic studies on Cro repressor-operator DNA interaction","volume":"196","author":"Kim","year":"1987","journal-title":"J. Mol. Biol."},{"key":"2023012409221766800_b19","doi-asserted-by":"crossref","first-page":"1051","DOI":"10.1101\/gr.3642605","article-title":"Evaluation of regulatory potential and conservation scores for detecting cis-regulatory modules in aligned mammalian genome sequences","volume":"15","author":"King","year":"2005","journal-title":"Genome Res."},{"key":"2023012409221766800_b20","doi-asserted-by":"crossref","first-page":"55","DOI":"10.1016\/0167-4781(90)90120-Q","article-title":"Lac repressor-operator interaction: DNA length dependence","volume":"1087","author":"Khory","year":"1990","journal-title":"Biochim. Biophys. Acta"},{"key":"2023012409221766800_b21","doi-asserted-by":"crossref","first-page":"564","DOI":"10.1038\/35000615","article-title":"Evidence for stabilizing selection in a eukaryotic enhancer element","volume":"403","author":"Ludwig","year":"2000","journal-title":"Nature"},{"key":"2023012409221766800_b22","doi-asserted-by":"crossref","first-page":"4966","DOI":"10.1073\/pnas.0409414102","article-title":"Quantitative analysis of binding motifs meditating diverse spatial readouts of the Dorsal gradient in the Drosophila embryo","volume":"102","author":"Papatsenko","year":"2005","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012409221766800_b23","doi-asserted-by":"crossref","first-page":"470","DOI":"10.1101\/gr.212502","article-title":"Extraction of functional binding sites from unique regulatory regions: the Drosophila early developmental enhancers","volume":"12","author":"Papatsenko","year":"2002","journal-title":"Genome Res."},{"key":"2023012409221766800_b24","doi-asserted-by":"crossref","first-page":"376","DOI":"10.1186\/1471-2105-7-376","article-title":"Detecting the limits of regulatory element conservation and divergence estimation using pairwise and multiple alignments","volume":"7","author":"Pollard","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023012409221766800_b25","doi-asserted-by":"crossref","first-page":"30","DOI":"10.1186\/1471-2105-3-30","article-title":"Computational detection of genomic cis-regulatory modules, applied to body patterning in the early Drosophila embryo","volume":"3","author":"Rajewsky","year":"2002","journal-title":"BMC Bioinformatics"},{"key":"2023012409221766800_b26","doi-asserted-by":"crossref","first-page":"1034","DOI":"10.1101\/gr.3715005","article-title":"Evolutionary conserved elements in vertebrate, insect, worm and yeast genomes","volume":"15","author":"Siepel","year":"2005","journal-title":"Genome Res."},{"key":"2023012409221766800_b27","doi-asserted-by":"crossref","first-page":"129","DOI":"10.1186\/1471-2105-5-129","article-title":"Cross-species comparison significantly improves genome-wide prediction of cis-regulatory modules in Drosophila","volume":"5","author":"Sinha","year":"2004","journal-title":"BMC Bioinformatics"},{"key":"2023012409221766800_b28","doi-asserted-by":"crossref","first-page":"575","DOI":"10.1016\/S0959-437X(00)00130-1","article-title":"Evolution of transcriptional regulation","volume":"10","author":"Tautz","year":"2000","journal-title":"Curr. Opin. Genet. Dev."},{"key":"2023012409221766800_b29","first-page":"117","article-title":"Evolutionary importance of gene regulation","volume":"7","author":"Wilson","year":"1975","journal-title":"Stadler Genet. Symp."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/22\/23\/2858\/48842751\/bioinformatics_22_23_2858.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/22\/23\/2858\/48842751\/bioinformatics_22_23_2858.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,24]],"date-time":"2023-01-24T10:03:26Z","timestamp":1674554606000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/22\/23\/2858\/278959"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2006,10,10]]},"references-count":29,"journal-issue":{"issue":"23","published-print":{"date-parts":[[2006,12,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btl499","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2006,12,1]]},"published":{"date-parts":[[2006,10,10]]}}}