{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,29]],"date-time":"2025-10-29T19:26:08Z","timestamp":1761765968983},"reference-count":44,"publisher":"Oxford University Press (OUP)","issue":"23","license":[{"start":{"date-parts":[[2016,10,2]],"date-time":"2016-10-02T00:00:00Z","timestamp":1475366400000},"content-version":"vor","delay-in-days":421,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2015,12,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: The accurate discovery and annotation of regulatory elements remains a challenging problem. The growing number of sequenced genomes creates new opportunities for comparative approaches to motif discovery. Putative binding sites are then considered to be functional if they are conserved in orthologous promoter sequences of multiple related species. Existing methods for comparative motif discovery usually rely on pregenerated multiple sequence alignments, which are difficult to obtain for more diverged species such as plants. As a consequence, misaligned regulatory elements often remain undetected.<\/jats:p>\n               <jats:p>Results: We present a novel algorithm that supports both alignment-free and alignment-based motif discovery in the promoter sequences of related species. Putative motifs are exhaustively enumerated as words over the IUPAC alphabet and screened for conservation using the branch length score. Additionally, a confidence score is established in a genome-wide fashion. In order to take advantage of a cloud computing infrastructure, the MapReduce programming model is adopted. The method is applied to four monocotyledon plant species and it is shown that high-scoring motifs are significantly enriched for open chromatin regions in Oryza sativa and for transcription factor binding sites inferred through protein-binding microarrays in O.sativa and Zea mays. Furthermore, the method is shown to recover experimentally profiled ga2ox1-like KN1 binding sites in Z.mays.<\/jats:p>\n               <jats:p>Availability and implementation: BLSSpeller was written in Java. Source code and manual are available at http:\/\/bioinformatics.intec.ugent.be\/blsspeller<\/jats:p>\n               <jats:p>Contact: \u00a0Klaas.Vandepoele@psb.vib-ugent.be or jan.fostier@intec.ugent.be<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btv466","type":"journal-article","created":{"date-parts":[[2015,8,8]],"date-time":"2015-08-08T23:59:31Z","timestamp":1439078371000},"page":"3758-3766","source":"Crossref","is-referenced-by-count":15,"title":["BLSSpeller: exhaustive comparative discovery of conserved <i>cis<\/i>-regulatory elements"],"prefix":"10.1093","volume":"31","author":[{"given":"Dieter","family":"De Witte","sequence":"first","affiliation":[{"name":"1 Department of Information Technology (INTEC), Ghent University-iMinds, Ghent, Belgium,"}]},{"given":"Jan","family":"Van de Velde","sequence":"additional","affiliation":[{"name":"2 Department of Plant Systems Biology, VIB and"},{"name":"3 Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium"}]},{"given":"Dries","family":"Decap","sequence":"additional","affiliation":[{"name":"1 Department of Information Technology (INTEC), Ghent University-iMinds, Ghent, Belgium,"}]},{"given":"Michiel","family":"Van Bel","sequence":"additional","affiliation":[{"name":"2 Department of Plant Systems Biology, VIB and"},{"name":"3 Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium"}]},{"given":"Pieter","family":"Audenaert","sequence":"additional","affiliation":[{"name":"1 Department of Information Technology (INTEC), Ghent University-iMinds, Ghent, Belgium,"}]},{"given":"Piet","family":"Demeester","sequence":"additional","affiliation":[{"name":"1 Department of Information Technology (INTEC), Ghent University-iMinds, Ghent, Belgium,"}]},{"given":"Bart","family":"Dhoedt","sequence":"additional","affiliation":[{"name":"1 Department of Information Technology (INTEC), Ghent University-iMinds, Ghent, Belgium,"}]},{"given":"Klaas","family":"Vandepoele","sequence":"additional","affiliation":[{"name":"2 Department of Plant Systems Biology, VIB and"},{"name":"3 Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium"}]},{"given":"Jan","family":"Fostier","sequence":"additional","affiliation":[{"name":"1 Department of Information Technology (INTEC), Ghent University-iMinds, Ghent, Belgium,"}]}],"member":"286","published-online":{"date-parts":[[2015,8,8]]},"reference":[{"key":"2023020202411674600_btv466-B1","doi-asserted-by":"crossref","first-page":"W202","DOI":"10.1093\/nar\/gkp335","article-title":"MEME Suite: tools for motif discovery and searching","volume":"37","author":"Bailey","year":"2009","journal-title":"Nucleic Acids Res."},{"key":"2023020202411674600_btv466-B2","doi-asserted-by":"crossref","first-page":"259","DOI":"10.1016\/0168-9525(93)90001-X","article-title":"Grasses as a single genetic system: genome composition, collinearity and compatibility","volume":"9","author":"Benntzin","year":"1993","journal-title":"Trends Genet."},{"key":"2023020202411674600_btv466-B3","doi-asserted-by":"crossref","first-page":"170","DOI":"10.1101\/gr.1642804","article-title":"CONREAL: conserved regulatory elements anchored alignment algorithm for identification of transcription factor binding sites by phylogenetic footprinting","volume":"14","author":"Berezikov","year":"2004","journal-title":"Genome Res."},{"key":"2023020202411674600_btv466-B4","doi-asserted-by":"crossref","first-page":"739","DOI":"10.1101\/gr.6902","article-title":"Discovery of regulatory elements by a computational method for phylogenetic footprinting","volume":"12","author":"Blanchette","year":"2002","journal-title":"Genome Res."},{"key":"2023020202411674600_btv466-B5","doi-asserted-by":"crossref","first-page":"1647","DOI":"10.1105\/tpc.109.068221","article-title":"The maize transcription factor knotted1 directly regulates the gibberellin catabolism gene ga2ox1","volume":"21","author":"Bolduc","year":"2009","journal-title":"Plant Cell"},{"key":"2023020202411674600_btv466-B6","doi-asserted-by":"crossref","first-page":"1685","DOI":"10.1101\/gad.193433.112","article-title":"Unraveling the KNOTTED1 regulatory network in maize meristems","volume":"26","author":"Bolduc","year":"2012","journal-title":"Genes Dev."},{"key":"2023020202411674600_btv466-B7","doi-asserted-by":"crossref","first-page":"e1000343+","DOI":"10.1371\/journal.pbio.1000343","article-title":"Binding site turnover produces pervasive quantitative changes in transcription factor binding between closely related Drosophila species","volume":"8","author":"Bradley","year":"2010","journal-title":"PLoS Biol."},{"key":"2023020202411674600_btv466-B8","doi-asserted-by":"crossref","first-page":"1+","DOI":"10.1186\/1748-7188-2-1","article-title":"PhyloScan: identification of transcription factor binding sites using cross-species evidence","volume":"2","author":"Carmack","year":"2007","journal-title":"Algorithms Mol. Biol."},{"key":"2023020202411674600_btv466-B9","doi-asserted-by":"crossref","first-page":"3021","DOI":"10.1093\/nar\/13.9.3021","article-title":"Nomenclature for incompletely specified bases in nucleic acid sequences: recommendations 1984","volume":"13","author":"Cornish-Bowden","year":"1985","journal-title":"Nucleic Acids Res."},{"key":"2023020202411674600_btv466-B10","doi-asserted-by":"crossref","first-page":"S21+","DOI":"10.1186\/1471-2105-8-S7-S21","article-title":"A survey of DNA motif finding algorithms","volume":"8","author":"Das","year":"2007","journal-title":"BMC Bioinformatics"},{"key":"2023020202411674600_btv466-B11","doi-asserted-by":"crossref","first-page":"268","DOI":"10.1007\/978-3-642-55195-6_25","article-title":"A parallel, distributed-memory framework for comparative motif discovery","volume":"8385","author":"De Witte","year":"2013","journal-title":"Parallel Process. Appl. Math."},{"key":"2023020202411674600_btv466-B12","first-page":"137","article-title":"MapReduce: simplified data processing on large clusters","volume":"53","author":"Dean","year":"2004","journal-title":"Operat. Syst. Des. Implement."},{"key":"2023020202411674600_btv466-B13","doi-asserted-by":"crossref","first-page":"R18+","DOI":"10.1186\/gb-2005-6-2-r18","article-title":"Fast and systematic genome-wide discovery of conserved regulatory elements using a non-alignment based approach","volume":"6","author":"Elemento","year":"2005","journal-title":"Genome Biol."},{"key":"2023020202411674600_btv466-B14","first-page":"354","article-title":"Finding composite regulatory patterns in DNA sequences","volume":"18","author":"Eskin","year":"2002","journal-title":"Bioinformatics (Oxford, England)"},{"key":"2023020202411674600_btv466-B15","doi-asserted-by":"crossref","first-page":"R104","DOI":"10.1186\/gb-2005-6-12-r104","article-title":"The discovery, positioning and verification of a set of transcription-associated motifs in vertebrates","volume":"6","author":"Ettwiller","year":"2005","journal-title":"Genome Biol."},{"key":"2023020202411674600_btv466-B16","doi-asserted-by":"crossref","first-page":"30","DOI":"10.1007\/3-540-48318-7_5","article-title":"Efficient implementation of lazy suffix trees","volume-title":"International Workshop on Algorithm Engineering","author":"Giegerich","year":"1999"},{"key":"2023020202411674600_btv466-B17","doi-asserted-by":"crossref","first-page":"e90","DOI":"10.1093\/nar\/gkp1166","article-title":"Finding regulatory DNA motifs using alignment-free evolutionary conservation information","volume":"38","author":"Gord\u00e2n","year":"2010","journal-title":"Nucleic Acids Res."},{"key":"2023020202411674600_btv466-B18","doi-asserted-by":"crossref","first-page":"1205","DOI":"10.1006\/jmbi.2000.3519","article-title":"Computational identification of Cis-regulatory elements associated with groups of functionally related genes in Saccharomyces cerevisiae","volume":"296","author":"Hughes","year":"2000","journal-title":"J. Mol. Biol."},{"key":"2023020202411674600_btv466-B19","doi-asserted-by":"crossref","first-page":"241","DOI":"10.1038\/nature01644","article-title":"Sequencing and comparison of yeast species to identify genes and regulatory elements","volume":"423","author":"Kellis","year":"2003","journal-title":"Nature"},{"key":"2023020202411674600_btv466-B20","doi-asserted-by":"crossref","first-page":"208+","DOI":"10.1186\/1471-2164-11-208","article-title":"Systematic discovery of regulatory motifs in Fusarium graminearum by comparing four Fusarium genomes","volume":"11","author":"Kumar","year":"2010","journal-title":"BMC Genomics"},{"key":"2023020202411674600_btv466-B21","doi-asserted-by":"crossref","first-page":"47","DOI":"10.1142\/S0219720004000466","article-title":"cWINNOWER algorithm for finding fuzzy dna motifs","volume":"2","author":"Liang","year":"2004","journal-title":"J. Bioinform. Comput. Biol."},{"key":"2023020202411674600_btv466-B22","first-page":"127","article-title":"BioProspector: discovering conserved DNA motifs in upstream regulatory regions of co-expressed genes","volume":"6","author":"Liu","year":"2001","journal-title":"Pac. Symp. Biocomput."},{"key":"2023020202411674600_btv466-B23","doi-asserted-by":"crossref","first-page":"345","DOI":"10.1089\/106652700750050826","article-title":"Algorithms for extracting structured motifs using a suffix tree with an application to promoter and regulatory site consensus identification","volume":"7","author":"Marsan","year":"2000","journal-title":"J. Comput. Biol."},{"key":"2023020202411674600_btv466-B24","first-page":"356","article-title":"Efficient exact motif discovery","volume":"25","author":"Marschall","year":"2009","journal-title":"Bioinformatics (Oxford, England)"},{"key":"2023020202411674600_btv466-B25","first-page":"S207","article-title":"An algorithm for finding signals of unknown length in DNA sequences","volume":"17","author":"Pavesi","year":"2001","journal-title":"Bioinformatics (Oxford, England)"},{"key":"2023020202411674600_btv466-B26","doi-asserted-by":"crossref","first-page":"6+","DOI":"10.1186\/1471-2105-5-6","article-title":"Benchmarking tools for the alignment of functional noncoding DNA","volume":"5","author":"Pollard","year":"2004","journal-title":"BMC Bioinformatics"},{"key":"2023020202411674600_btv466-B27","doi-asserted-by":"crossref","first-page":"3718","DOI":"10.1105\/tpc.109.071506","article-title":"PLAZA: a comparative genomics resource to study gene and genome evolution in plants","volume":"21","author":"Proost","year":"2009","journal-title":"Plant Cell Online"},{"key":"2023020202411674600_btv466-B28","doi-asserted-by":"crossref","first-page":"6029","DOI":"10.1093\/nar\/gkr179","article-title":"Evolutionary divergence and limits of conserved non-coding sequence detection in plant genomes","volume":"39","author":"Reineke","year":"2011","journal-title":"Nucleic Acids Res."},{"key":"2023020202411674600_btv466-B29","first-page":"662","article-title":"Pruner: algorithms for finding monad patterns in DNA sequences","volume-title":"CSB","author":"Satya","year":"2004"},{"key":"2023020202411674600_btv466-B30","doi-asserted-by":"crossref","first-page":"3053","DOI":"10.1073\/pnas.0813264106","article-title":"Comparative genomics allows the discovery of cis-regulatory elements in mosquitoes","volume":"106","author":"Sieglaff","year":"2009","journal-title":"Proc. Natl. Acad. Sci."},{"key":"2023020202411674600_btv466-B31","doi-asserted-by":"crossref","first-page":"214","DOI":"10.1016\/j.gde.2005.02.004","article-title":"Computational methods for transcriptional regulation","volume":"15","author":"Siggia","year":"2005","journal-title":"Curr. Opin. Genet. Dev."},{"key":"2023020202411674600_btv466-B32","doi-asserted-by":"crossref","first-page":"219","DOI":"10.1038\/nature06340","article-title":"Discovery of functional elements in 12 Drosophila genomes using evolutionary signatures","volume":"450","author":"Stark","year":"2007","journal-title":"Nature"},{"key":"2023020202411674600_btv466-B33","doi-asserted-by":"crossref","first-page":"6+","DOI":"10.1186\/1748-7188-3-6","article-title":"DIALIGN-TX: greedy and progressive approaches for segment-based multiple sequence alignment","volume":"3","author":"Subramanian","year":"2008","journal-title":"Algorithms Mol. Biol. AMB"},{"key":"2023020202411674600_btv466-B34","doi-asserted-by":"crossref","first-page":"447","DOI":"10.1089\/10665270252935566","article-title":"A Gibbs sampling method to detect overrepresented motifs in the upstream regions of coexpressed genes","volume":"9","author":"Thijs","year":"2002","journal-title":"J. Comput. Biol."},{"key":"2023020202411674600_btv466-B35","doi-asserted-by":"crossref","first-page":"W119","DOI":"10.1093\/nar\/gkn304","article-title":"RSAT: regulatory sequence analysis tools","volume":"36","author":"Thomas-Chollier","year":"2008","journal-title":"Nucleic Acids Res."},{"key":"2023020202411674600_btv466-B36","doi-asserted-by":"crossref","first-page":"590","DOI":"10.1104\/pp.111.189514","article-title":"Dissecting plant genomes with the PLAZA comparative genomics platform","volume":"158","author":"Van Bel","year":"2012","journal-title":"Plant Physiol."},{"key":"2023020202411674600_btv466-B37","doi-asserted-by":"crossref","first-page":"1808","DOI":"10.1093\/nar\/28.8.1808","article-title":"Discovering regulatory elements in non-coding sequences by analysis of spaced dyads","volume":"28","author":"van Helden","year":"2000","journal-title":"Nucleic Acids Res."},{"key":"2023020202411674600_btv466-B38","first-page":"851","article-title":"Is transcription factor binding site turnover a sufficient explanation for cis-regulatory sequence divergence? Genome Biol","volume":"2","author":"Venkataram","year":"2010","journal-title":"Evol."},{"key":"2023020202411674600_btv466-B39","doi-asserted-by":"crossref","first-page":"17400","DOI":"10.1073\/pnas.0505147102","article-title":"Identifying the conserved network of cis-regulatory sites of a eukaryotic genome","volume":"102","author":"Wang","year":"2005","journal-title":"Proc. Natl. Acad. Sci."},{"key":"2023020202411674600_btv466-B40","doi-asserted-by":"crossref","first-page":"131","DOI":"10.1016\/S1672-0229(07)60023-0","article-title":"Comparative analysis of regulatory motif discovery tools for transcription factor binding sites","volume":"5","author":"Wei","year":"2007","journal-title":"Genomics Proteomics Bioinf."},{"key":"2023020202411674600_btv466-B41","doi-asserted-by":"crossref","first-page":"1431","DOI":"10.1016\/j.cell.2014.08.009","article-title":"Determination and inference of eukaryotic transcription factor sequence specificity","volume":"158","author":"Weirauch","year":"2014","journal-title":"Cell"},{"key":"2023020202411674600_btv466-B42","doi-asserted-by":"crossref","first-page":"1843","DOI":"10.1093\/bioinformatics\/btn348","article-title":"Discovering regulatory motifs in the Plasmodium genome using comparative genomics","volume":"24","author":"Wu","year":"2008","journal-title":"Bioinformatics (Oxford, England)"},{"key":"2023020202411674600_btv466-B43","doi-asserted-by":"crossref","first-page":"338","DOI":"10.1038\/nature03441","article-title":"Systematic discovery of regulatory motifs in human promoters and 3[prime] UTRs by comparison of several mammals","volume":"434","author":"Xie","year":"2005","journal-title":"Nature"},{"key":"2023020202411674600_btv466-B44","doi-asserted-by":"crossref","first-page":"151","DOI":"10.1101\/gr.131342.111","article-title":"High-resolution mapping of open chromatin in the rice genome","volume":"22","author":"Zhang","year":"2012","journal-title":"Genome Res."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/31\/23\/3758\/49036095\/bioinformatics_31_23_3758.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/31\/23\/3758\/49036095\/bioinformatics_31_23_3758.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,2]],"date-time":"2023-02-02T03:56:36Z","timestamp":1675310196000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/31\/23\/3758\/209257"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2015,8,8]]},"references-count":44,"journal-issue":{"issue":"23","published-print":{"date-parts":[[2015,12,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btv466","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2015,12,1]]},"published":{"date-parts":[[2015,8,8]]}}}