{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,19]],"date-time":"2025-09-19T07:28:44Z","timestamp":1758266924408},"reference-count":34,"publisher":"Oxford University Press (OUP)","issue":"20","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2008,10,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Locating transcription factor binding sites (motifs) is a key step in understanding gene regulation. Based on Tompa's benchmark study, the performance of current de novo motif finders is far from satisfactory (with sensitivity \u22640.222 and precision \u22640.307). The same study also shows that no motif finder performs consistently well over all datasets. Hence, it is not clear which finder one should use for a given dataset. To address this issue, a class of algorithms called ensemble methods have been proposed. Though the existing ensemble methods overall perform better than stand-alone motif finders, the improvement gained is not substantial. Our study reveals that these methods do not fully exploit the information obtained from the results of individual finders, resulting in minor improvement in sensitivity and poor precision.<\/jats:p>\n               <jats:p>Results: In this article, we identify several key observations on how to utilize the results from individual finders and design a novel ensemble method, MotifVoter, to predict the motifs and binding sites. Evaluations on 186 datasets show that MotifVoter can locate more than 95% of the binding sites found by its component motif finders. In terms of sensitivity and precision, MotifVoter outperforms stand-alone motif finders and ensemble methods significantly on Tompa's benchmark, Escherichia coli, and ChIP-Chip datasets. MotifVoter is available online via a web server with several biologist-friendly features.<\/jats:p>\n               <jats:p>Availability: \u00a0http:\/\/www.comp.nus.edu.sg\/~bioinfo\/MotifVoter<\/jats:p>\n               <jats:p>Contact: \u00a0ksung@comp.nus.edu.sg<\/jats:p>\n               <jats:p>supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btn420","type":"journal-article","created":{"date-parts":[[2008,8,13]],"date-time":"2008-08-13T02:35:24Z","timestamp":1218594924000},"page":"2288-2295","source":"Crossref","is-referenced-by-count":32,"title":["MotifVoter: a novel ensemble method for fine-grained integration of generic motif finders"],"prefix":"10.1093","volume":"24","author":[{"given":"Edward","family":"Wijaya","sequence":"first","affiliation":[{"name":"1 School of Computing, National University of Singapore, Singapore 119260, 2Institute for Infocomm Research, 21 Heng Mui Keng Terrace, Singapore 119613, 3Department of Computer Science, The University of Hong Kong, Pokfulam Road, Hong Kong and 4Genome Institute of Singapore, 60 Biopolis Street, #02-01 Genome, Singapore 138672"},{"name":"1 School of Computing, National University of Singapore, Singapore 119260, 2Institute for Infocomm Research, 21 Heng Mui Keng Terrace, Singapore 119613, 3Department of Computer Science, The University of Hong Kong, Pokfulam Road, Hong Kong and 4Genome Institute of Singapore, 60 Biopolis Street, #02-01 Genome, Singapore 138672"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Siu-Ming","family":"Yiu","sequence":"additional","affiliation":[{"name":"1 School of Computing, National University of Singapore, Singapore 119260, 2Institute for Infocomm Research, 21 Heng Mui Keng Terrace, Singapore 119613, 3Department of Computer Science, The University of Hong Kong, Pokfulam Road, Hong Kong and 4Genome Institute of Singapore, 60 Biopolis Street, #02-01 Genome, Singapore 138672"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ngo Thanh","family":"Son","sequence":"additional","affiliation":[{"name":"1 School of Computing, National University of Singapore, Singapore 119260, 2Institute for Infocomm Research, 21 Heng Mui Keng Terrace, Singapore 119613, 3Department of Computer Science, The University of Hong Kong, Pokfulam Road, Hong Kong and 4Genome Institute of Singapore, 60 Biopolis Street, #02-01 Genome, Singapore 138672"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Rajaraman","family":"Kanagasabai","sequence":"additional","affiliation":[{"name":"1 School of Computing, National University of Singapore, Singapore 119260, 2Institute for Infocomm Research, 21 Heng Mui Keng Terrace, Singapore 119613, 3Department of Computer Science, The University of Hong Kong, Pokfulam Road, Hong Kong and 4Genome Institute of Singapore, 60 Biopolis Street, #02-01 Genome, Singapore 138672"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Wing-Kin","family":"Sung","sequence":"additional","affiliation":[{"name":"1 School of Computing, National University of Singapore, Singapore 119260, 2Institute for Infocomm Research, 21 Heng Mui Keng Terrace, Singapore 119613, 3Department of Computer Science, The University of Hong Kong, Pokfulam Road, Hong Kong and 4Genome Institute of Singapore, 60 Biopolis Street, #02-01 Genome, Singapore 138672"},{"name":"1 School of Computing, National University of Singapore, Singapore 119260, 2Institute for Infocomm Research, 21 Heng Mui Keng Terrace, Singapore 119613, 3Department of Computer Science, The University of Hong Kong, Pokfulam Road, Hong Kong and 4Genome Institute of Singapore, 60 Biopolis Street, #02-01 Genome, Singapore 138672"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2008,8,7]]},"reference":[{"key":"2023020211260016500_B1","doi-asserted-by":"crossref","first-page":"1743","DOI":"10.1126\/science.1102216","article-title":"Environmentally induced foregut remodeling by PHA-4\/FoxA and DAF-12\/NHR","volume":"305","author":"Ao","year":"2004","journal-title":"Science"},{"key":"2023020211260016500_B2","doi-asserted-by":"crossref","first-page":"51","DOI":"10.1007\/BF00993379","article-title":"Unsupervised learning of multiple motifs in biopolymers using expectation maximization","volume":"21","author":"Bailey","year":"1995","journal-title":"Mach. Learn."},{"key":"2023020211260016500_B3","doi-asserted-by":"crossref","first-page":"947","DOI":"10.1016\/j.cell.2005.08.020","article-title":"Core transcriptional regulatory circuitry in human embryonic stem cells","volume":"122","author":"Boyer","year":"2005","journal-title":"Cell"},{"key":"2023020211260016500_B4","doi-asserted-by":"crossref","first-page":"502","DOI":"10.1038\/sj.emboj.7600958","article-title":"Global and gene-specific analyses show distinct roles for myod and myog at a common set of promoters","volume":"25","author":"Cao","year":"2006","journal-title":"EMBO J."},{"key":"2023020211260016500_B5","first-page":"29","article-title":"A parameter-free algorithm for improved de novo identification of transcription factor binding sites","volume":"8","author":"Chakravarty","year":"2007","journal-title":"BMC Bioinformatics"},{"issue":"Suppl.7","key":"2023020211260016500_B6","article-title":"A survey of DNA motif finding algorithms","volume":"8","author":"Das","year":"2007","journal-title":"BMC Bioinformatics"},{"key":"2023020211260016500_B7","doi-asserted-by":"crossref","first-page":"2909","DOI":"10.1093\/bioinformatics\/bti425","article-title":"BEST: binding-site estimation suite tools","volume":"21","author":"Dongsheng","year":"2005","journal-title":"Bioinformatics"},{"key":"2023020211260016500_B8","doi-asserted-by":"crossref","first-page":"1792","DOI":"10.1093\/nar\/gkh340","article-title":"MUSCLE: multiple sequence alignment with high accuracy and high throughput","volume":"32","author":"Edgar","year":"2004","journal-title":"Nucleic Acids Res."},{"issue":"suppl. 1","key":"2023020211260016500_B9","doi-asserted-by":"crossref","first-page":"S354","DOI":"10.1093\/bioinformatics\/18.suppl_1.S354","article-title":"Finding composite regulatory patterns in DNA sequences","volume":"18","author":"Eskin","year":"2002","journal-title":"Bioinformatics"},{"key":"2023020211260016500_B10","doi-asserted-by":"crossref","first-page":"3164","DOI":"10.1093\/bioinformatics\/bti481","article-title":"TAMO: a flexible, object oriented framework for analyzing transcriptional regulation using DNA-sequences motifs","volume":"21","author":"Gordon","year":"2005","journal-title":"Bioinformatics"},{"key":"2023020211260016500_B11","doi-asserted-by":"crossref","first-page":"99","DOI":"10.1038\/nature02800","article-title":"Transcription regulatory code of a eukaryotic genome","volume":"431","author":"Harbison","year":"2004","journal-title":"Nature"},{"key":"2023020211260016500_B12","doi-asserted-by":"crossref","first-page":"342","DOI":"10.1186\/1471-2105-7-342","article-title":"EMD: an ensemble algorithm for discovering regulatory motifs in DNA sequences","volume":"7","author":"Hu","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023020211260016500_B13","doi-asserted-by":"crossref","first-page":"4899","DOI":"10.1093\/nar\/gki791","article-title":"Limitations and potentials of current motif discovery algorithms","volume":"33","author":"Hu","year":"2005","journal-title":"Nucleic Acids Res."},{"key":"2023020211260016500_B14","doi-asserted-by":"crossref","first-page":"1948","DOI":"10.1093\/nar\/gkh345","article-title":"Identifying transcriptional regulatory sites in the human genome using an integrated system","volume":"32","author":"Huang","year":"2004","journal-title":"Nucleic Acids Res."},{"key":"2023020211260016500_B15","doi-asserted-by":"crossref","first-page":"229","DOI":"10.1186\/1471-2105-7-229","article-title":"Meta-analysis discovery of tissue specific DNA sequence motifs from mammalian gene expression data","volume":"7","author":"Huber","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023020211260016500_B16","doi-asserted-by":"crossref","first-page":"1205","DOI":"10.1006\/jmbi.2000.3519","article-title":"Computational identification of cis-regulatory elements associated with functionally coherent groups of genes in saccharomyces cerevisiae","volume":"296","author":"Hughes","year":"2000","journal-title":"J. Mol. Biol."},{"key":"2023020211260016500_B17","doi-asserted-by":"crossref","first-page":"1557","DOI":"10.1093\/bioinformatics\/bth127","article-title":"BioOptimizer: a Bayesian scoring function approach to motif discovery","volume":"20","author":"Jensen","year":"2006","journal-title":"Bioinformatics"},{"key":"2023020211260016500_B18","first-page":"127","article-title":"BioProspector: discovering DNA motifs in upstream regulatory regions of co-expressed genes","volume-title":"Proceedings of the Seventh Pacific Symposium of Biocomputing (PSB)","author":"Liu","year":"2001"},{"key":"2023020211260016500_B19","doi-asserted-by":"crossref","first-page":"835","DOI":"10.1038\/nbt717","article-title":"An algorithm for finding protein-DNA binding sites with application to chromatin-immunoprecipitation microarray experiments","volume":"20","author":"Liu","year":"2002","journal-title":"Nat. Biotechnol."},{"key":"2023020211260016500_B20","doi-asserted-by":"crossref","first-page":"201","DOI":"10.1371\/journal.pcbi.0020036","article-title":"Practical strategies for discovering regulatory DNA sequence motifs","volume":"2","author":"MacIsaac","year":"2006","journal-title":"PloS Comput. Biol."},{"key":"2023020211260016500_B21","doi-asserted-by":"crossref","first-page":"e61+1","DOI":"10.1371\/journal.pcbi.0030061","article-title":"DNA familial binding profiles made easy: comparison of various motif alignment and clustering strategies","volume":"3","author":"Mahony","year":"2007","journal-title":"PLoS Comput. Biol."},{"issue":"Suppl. 6","key":"2023020211260016500_B22","doi-asserted-by":"crossref","first-page":"S4","DOI":"10.1186\/1471-2105-8-S6-S4","article-title":"Finding regulatory elements and regulatory motifs a general probabilistic framework","volume":"8","author":"Nimwegen","year":"2007","journal-title":"BMC Bioinformatics"},{"key":"2023020211260016500_B23","doi-asserted-by":"crossref","first-page":"1378","DOI":"10.1126\/science.1089769","article-title":"Control of pancreas and liver gene expression by HNF transcription factors","volume":"303","author":"Odom","year":"2004","journal-title":"Science"},{"key":"2023020211260016500_B24","doi-asserted-by":"crossref","first-page":"18261","DOI":"10.1073\/pnas.0606108103","article-title":"NOTCH1 directly regulates c-MYC and activates a feed-forward-loop transcriptional network promoting leukemic cell growth","volume":"103","author":"Palomero","year":"2006","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023020211260016500_B25","doi-asserted-by":"crossref","first-page":"S207","DOI":"10.1093\/bioinformatics\/17.suppl_1.S207","article-title":"An algorithm for finding signals of unknown length in DNA sequencess","volume":"17","author":"Pavesi","year":"2001","journal-title":"Bioinformatics"},{"key":"2023020211260016500_B26","doi-asserted-by":"crossref","first-page":"245","DOI":"10.1101\/gad.949802","article-title":"CE2F integrates cell cycle progression with DNA repair, replication and G2\/M checkpoints","volume":"16","author":"Ren","year":"2002","journal-title":"Genes Dev."},{"key":"2023020211260016500_B27","doi-asserted-by":"crossref","first-page":"W217","DOI":"10.1093\/nar\/gkm376","article-title":"WebMOTIFS: automated discovery, filtering, and scoring of DNA sequence motifs using multple programs and bayesian approaches","volume":"35","author":"Romer","year":"2007","journal-title":"Nucleic Acids Res"},{"key":"2023020211260016500_B28","doi-asserted-by":"crossref","first-page":"D303","DOI":"10.1093\/nar\/gkh140","article-title":"Regulon DB (version 4.0): transcriptional regulation, operon organization and growth conditions in escherichia coli k-12","volume":"32","author":"Salgado","year":"2004","journal-title":"Nucleic Acids Res"},{"key":"2023020211260016500_B29","doi-asserted-by":"crossref","first-page":"5899","DOI":"10.1073\/pnas.0510996103","article-title":"Coordinated binding of NFKB family members in the response of human cells to lipopolysaccharide","volume":"103","author":"Schrieber","year":"2006","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023020211260016500_B30","doi-asserted-by":"crossref","first-page":"1113","DOI":"10.1093\/bioinformatics\/17.12.1113","article-title":"A higher-order background model improves the detection of promoter regulatory elements by gibbs sampling","volume":"17","author":"Thijs","year":"2001","journal-title":"Bioinformatics"},{"key":"2023020211260016500_B31","doi-asserted-by":"crossref","first-page":"137","DOI":"10.1038\/nbt1053","article-title":"Assessing computational tools for the discovery of transcription factor binding sites","volume":"23","author":"Tompa","year":"2005","journal-title":"Nat. Biotechnol."},{"key":"2023020211260016500_B32","doi-asserted-by":"crossref","first-page":"1476","DOI":"10.1093\/bioinformatics\/btm118","article-title":"Detection of generic spaced motifs using submotif pattern mining","volume":"23","author":"Wijaya","year":"2007","journal-title":"Bioinformatics"},{"key":"2023020211260016500_B33","first-page":"467","article-title":"ANN-Spec: a method for discovering transcription factor binding sites with improved specificity","volume-title":"Proceedings of Pacific Symposium of Biocomputing (PSB)","author":"Workman","year":"2000"},{"key":"2023020211260016500_B34","doi-asserted-by":"crossref","first-page":"4459","DOI":"10.1073\/pnas.0501076102","article-title":"Genome-wide analysis of camp-response element binding protein occupancy, phosphorylation, and target gene activation in human tissues","volume":"102","author":"Zhang","year":"2005","journal-title":"Proc. Natl Acad. Sci. USA"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/24\/20\/2288\/49052228\/bioinformatics_24_20_2288.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/24\/20\/2288\/49052228\/bioinformatics_24_20_2288.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,2]],"date-time":"2023-02-02T14:12:54Z","timestamp":1675347174000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/24\/20\/2288\/258515"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2008,8,7]]},"references-count":34,"journal-issue":{"issue":"20","published-print":{"date-parts":[[2008,10,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btn420","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2008,10,15]]},"published":{"date-parts":[[2008,8,7]]}}}