{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,3,7]],"date-time":"2025-03-07T07:40:30Z","timestamp":1741333230036,"version":"3.38.0"},"reference-count":37,"publisher":"Oxford University Press (OUP)","issue":"19","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2011,10,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: ChIP-seq data are enriched in binding sites for the protein immunoprecipitated. Some sequences may also contain binding sites for a coregulator. Biologists are interested in knowing which coregulatory factor motifs may be present in the sequences bound by the protein ChIP'ed.<\/jats:p><jats:p>Results: We present a finite mixture framework with an expectation\u2013maximization algorithm that considers two motifs jointly and simultaneously determines which sequences contain both motifs, either one or neither of them. Tested on 10 simulated ChIP-seq datasets, our method performed better than repeated application of MEME in predicting sequences containing both motifs. When applied to a mouse liver Foxa2 ChIP-seq dataset involving ~ 12 000 400-bp sequences, coMOTIF identified co-occurrence of Foxa2 with Hnf4a, Cebpa, E-box, Ap1\/Maf or Sp1 motifs in ~6\u201333% of these sequences. These motifs are either known as liver-specific transcription factors or have an important role in liver function.<\/jats:p><jats:p>Availability: Freely available at http:\/\/www.niehs.nih.gov\/research\/resources\/software\/comotif\/.<\/jats:p><jats:p>Contact: \u00a0li3@niehs.nih.gov<\/jats:p><jats:p>Supplementary Information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btr397","type":"journal-article","created":{"date-parts":[[2011,7,21]],"date-time":"2011-07-21T00:15:34Z","timestamp":1311207334000},"page":"2625-2632","source":"Crossref","is-referenced-by-count":6,"title":["coMOTIF: a mixture framework for identifying transcription factor and a coregulator motif in ChIP-seq Data"],"prefix":"10.1093","volume":"27","author":[{"given":"Mengyuan","family":"Xu","sequence":"first","affiliation":[{"name":"Biostatistics Branch, National Institute of Environmental Health Sciences, NIH, Research Triangle Park, NC 27709, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Clarice R.","family":"Weinberg","sequence":"additional","affiliation":[{"name":"Biostatistics Branch, National Institute of Environmental Health Sciences, NIH, Research Triangle Park, NC 27709, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"David M.","family":"Umbach","sequence":"additional","affiliation":[{"name":"Biostatistics Branch, National Institute of Environmental Health Sciences, NIH, Research Triangle Park, NC 27709, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Leping","family":"Li","sequence":"additional","affiliation":[{"name":"Biostatistics Branch, National Institute of Environmental Health Sciences, NIH, Research Triangle Park, NC 27709, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2011,7,19]]},"reference":[{"key":"2023012512003162100_B1","doi-asserted-by":"crossref","first-page":"67","DOI":"10.1111\/j.2517-6161.1985.tb01331.x","article-title":"Estimation and hypothesis testing in finite mixture models","volume":"47","author":"Aitkin","year":"1985","journal-title":"J. R. Statist. Soc. B"},{"key":"2023012512003162100_B2","first-page":"28","article-title":"Fitting a mixture model by expectation maximization to discover motifs in biopolymers","volume":"2","author":"Bailey","year":"1994","journal-title":"Proc. Int. Conf. Intell. Syst. Mol. Biol."},{"key":"2023012512003162100_B3","doi-asserted-by":"crossref","first-page":"51","DOI":"10.1007\/BF00993379","article-title":"Unsupervised learning of multiple motifs in biopolymers using expectation maximization","volume":"21","author":"Bailey","year":"1995","journal-title":"Mach. Learn. J."},{"key":"2023012512003162100_B4","doi-asserted-by":"crossref","first-page":"607","DOI":"10.1093\/bioinformatics\/btg037","article-title":"Greedy mixture learning for multiple motif discovery in biological sequences","volume":"19","author":"Blekas","year":"2003","journal-title":"Bioinformatics"},{"key":"2023012512003162100_B5","doi-asserted-by":"crossref","first-page":"2715","DOI":"10.1093\/bioinformatics\/btp490","article-title":"info-gibbs: a motif discovery algorithm that directly optimizes information content during sampling","volume":"25","author":"Defrance","year":"2009","journal-title":"Bioinformatics"},{"key":"2023012512003162100_B6","doi-asserted-by":"crossref","first-page":"1445","DOI":"10.1093\/nar\/gki282","article-title":"NestedMICA, sensitive inference of over-represented","volume":"33","author":"Down","year":"2005","journal-title":"Nucleic Acids Res."},{"key":"2023012512003162100_B7","volume-title":"Pattern Classification.","author":"Duda","year":"2000","edition":"2"},{"key":"2023012512003162100_B8","doi-asserted-by":"crossref","first-page":"189","DOI":"10.1093\/nar\/gkh169","article-title":"Finding functional sequence elements by multiple local alignment","volume":"32","author":"Frith","year":"2004","journal-title":"Nucleic Acids Res."},{"key":"2023012512003162100_B9","doi-asserted-by":"crossref","first-page":"1017","DOI":"10.1093\/bioinformatics\/btr064","article-title":"FIMO: scanning for occurrences of a given motif","volume":"27","author":"Grant","year":"2011","journal-title":"Bioinformatics"},{"key":"2023012512003162100_B10","doi-asserted-by":"crossref","first-page":"7079","DOI":"10.1073\/pnas.0408743102","article-title":"De novo cis-regulatory module elicitation for eukaryotic genomes","volume":"102","author":"Gupta","year":"2005","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012512003162100_B11","doi-asserted-by":"crossref","first-page":"1037","DOI":"10.1101\/gr.104356.109","article-title":"Locus co-occupancy, nucleosome positioning, and H3K4me1 regulate the functionality of FOXA2-, HNF4A-, and PDX1-bound loci in islets and liver","volume":"20","author":"Hoffman","year":"2010","journal-title":"Genome Res."},{"key":"2023012512003162100_B12","doi-asserted-by":"crossref","first-page":"2154","DOI":"10.1093\/nar\/gkp1180","article-title":"On the detection and refinement of transcription factor binding sites using ChIP-Seq data","volume":"38","author":"Hu","year":"2010","journal-title":"Nucleic Acids Res."},{"key":"2023012512003162100_B13","doi-asserted-by":"crossref","first-page":"262","DOI":"10.1186\/1471-2105-9-262","article-title":"Finding sequence motifs with Bayesian models incorporating positional information: an application to transcription factor binding sites","volume":"9","author":"Kim","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023012512003162100_B14","doi-asserted-by":"crossref","first-page":"41","DOI":"10.1002\/prot.340070105","article-title":"An expectation maximization EM algorithm for the identification and characterization of common sites in unaligned biopolymer sequences","volume":"7","author":"Lawrence","year":"1990","journal-title":"Proteins"},{"key":"2023012512003162100_B15","doi-asserted-by":"crossref","first-page":"208","DOI":"10.1126\/science.8211139","article-title":"Detecting subtle sequence signals: a Gibbs sampling strategy for multiple alignment","volume":"262","author":"Lawrence","year":"1993","journal-title":"Science"},{"key":"2023012512003162100_B16","doi-asserted-by":"crossref","first-page":"317","DOI":"10.1089\/cmb.2008.16TT","article-title":"GADEM: a genetic algorithm guided formation of spaced dyads coupled with an EM algorithm for motif discovery","volume":"16","author":"Li","year":"2009","journal-title":"J. Comput. Biol."},{"key":"2023012512003162100_B17","doi-asserted-by":"crossref","first-page":"629","DOI":"10.1093\/bioinformatics\/btn009","article-title":"fdrMotif: identifying cis-elements by an EM algorithm coupled with false discovery rate control","volume":"24","author":"Li","year":"2008","journal-title":"Bioinformatics"},{"key":"2023012512003162100_B18","doi-asserted-by":"crossref","first-page":"1156","DOI":"10.1080\/01621459.1995.10476622","article-title":"Bayesian models for multiple local sequence alignment and Gibbs sampling strategies","volume":"90","author":"Liu","year":"1995","journal-title":"J. Am. Stat. Assoc."},{"key":"2023012512003162100_B19","first-page":"127","article-title":"BioProspector, discovering conserved DNA motifs in upstream regulatory regions of co-expressed genes","volume":"6","author":"Liu","year":"2001","journal-title":"Pac. Symp. Biocomput."},{"key":"2023012512003162100_B20","doi-asserted-by":"crossref","first-page":"2826","DOI":"10.1093\/bioinformatics\/btq546","article-title":"Identification of context-dependent motifs by contrasting ChIP binding data","volume":"26","author":"Mason","year":"2010","journal-title":"Bioinformatics"},{"key":"2023012512003162100_B21","doi-asserted-by":"crossref","first-page":"D77","DOI":"10.1093\/nar\/gkn660","article-title":"UniPROBE: an online database of protein binding microarray data on protein\u2013DNA interactions","volume":"37","author":"Newburger","year":"2008","journal-title":"Nucleic Acid Res."},{"key":"2023012512003162100_B22","doi-asserted-by":"crossref","first-page":"2256","DOI":"10.1093\/bioinformatics\/btn408","article-title":"GIMSAN: a Gibbs motif finder with significance analysis","volume":"24","author":"Ng","year":"2008","journal-title":"Bioinformatics"},{"key":"2023012512003162100_B23","doi-asserted-by":"crossref","first-page":"939","DOI":"10.1038\/nbt1098-939","article-title":"Finding DNA regulatory motifs within unaligned noncoding sequences clustered by whole genome mRNA quantitation","volume":"16","author":"Roth","year":"1998","journal-title":"Nat. Biotechnol."},{"key":"2023012512003162100_B24","doi-asserted-by":"crossref","first-page":"1036","DOI":"10.1126\/science.1186176","article-title":"Five-vertebrate ChIP-seq reveals the evolutionary dynamics of transcription factor binding","volume":"328","author":"Schmidt","year":"2010","journal-title":"Science"},{"key":"2023012512003162100_B25","doi-asserted-by":"crossref","first-page":"i273","DOI":"10.1093\/bioinformatics\/btg1038","article-title":"Genome-wide discovery of transcriptional modules from DNA sequence and gene expression","volume":"19","author":"Segal","year":"2003","journal-title":"Bioinformatics"},{"key":"2023012512003162100_B26","first-page":"89","article-title":"Methods for calculating the probabilities of finding patterns in sequences","volume":"5","author":"Staden","year":"1989","journal-title":"Comput. Appl. Biosci."},{"key":"2023012512003162100_B27","doi-asserted-by":"crossref","first-page":"85","DOI":"10.1007\/978-1-60761-854-6_6","article-title":"Motif discovery using expectation maximization and Gibbs' sampling","volume":"674","author":"Stormo","year":"2010","journal-title":"Methods Mol. Biol."},{"key":"2023012512003162100_B28","doi-asserted-by":"crossref","first-page":"751","DOI":"10.1038\/nrg2845","article-title":"Determining the specificity of protein-DNA interactions","volume":"11","author":"Stormo","year":"2010","journal-title":"Nat. Rev. Genet."},{"key":"2023012512003162100_B29","doi-asserted-by":"crossref","first-page":"1347","DOI":"10.1089\/cmb.2007.0176","article-title":"BayesMD: flexible biological modeling for motif discovery","volume":"15","author":"Tang","year":"2008","journal-title":"J. Comput. Biol."},{"key":"2023012512003162100_B30","doi-asserted-by":"crossref","first-page":"1113","DOI":"10.1093\/bioinformatics\/17.12.1113","article-title":"A higher-order background model improves the detection of promoter regulatory elements by Gibbs sampling","volume":"17","author":"Thijs","year":"2001","journal-title":"Bioinformatics"},{"key":"2023012512003162100_B31","doi-asserted-by":"crossref","first-page":"3580","DOI":"10.1093\/nar\/gkg608","article-title":"Gibbs Recursive Sampler: finding transcription factor binding sites","volume":"31","author":"Thompson","year":"2003","journal-title":"Nucleic Acids Res."},{"key":"2023012512003162100_B32","doi-asserted-by":"crossref","first-page":"W232","DOI":"10.1093\/nar\/gkm265","article-title":"The Gibbs Centroid Sampler","volume":"35","author":"Thompson","year":"2007","journal-title":"Nucleic Acids Res."},{"key":"2023012512003162100_B33","doi-asserted-by":"crossref","first-page":"137","DOI":"10.1038\/nbt1053","article-title":"Assessing computational tools for the discovery of transcription factor binding sites","volume":"23","author":"Tompa","year":"2005","journal-title":"Nat. Biotechnol."},{"key":"2023012512003162100_B34","doi-asserted-by":"crossref","first-page":"4549","DOI":"10.1093\/nar\/gkn382","article-title":"Global analysis of in vivo Foxa2-binding sites in mouse adult liver using massively parallel sequencing","volume":"36","author":"Wederell","year":"2008","journal-title":"Nucleic Acids Res."},{"key":"2023012512003162100_B35","doi-asserted-by":"crossref","first-page":"326","DOI":"10.1093\/bib\/bbn016","article-title":"The transfac project as an example of framework technology that supports the analysis of genomic regulation","volume":"9","author":"Wingender","year":"2008","journal-title":"Brief. Bioinformatics"},{"key":"2023012512003162100_B36","doi-asserted-by":"crossref","first-page":"e1000590","DOI":"10.1371\/journal.pcbi.1000590","article-title":"Inferring binding energies from selected binding sites","volume":"5","author":"Zhao","year":"2009","journal-title":"PLoS Comput. Biol."},{"key":"2023012512003162100_B37","doi-asserted-by":"crossref","first-page":"12114","DOI":"10.1073\/pnas.0402858101","article-title":"CisModule: De novo discovery of cis-regulatory modules by hierarchical mixture modeling","volume":"101","author":"Zhou","year":"2004","journal-title":"Proc. Natl Acad. Sci. USA"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/27\/19\/2625\/48866591\/bioinformatics_27_19_2625.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/27\/19\/2625\/48866591\/bioinformatics_27_19_2625.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,3,7]],"date-time":"2025-03-07T07:03:53Z","timestamp":1741331033000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/27\/19\/2625\/230627"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,7,19]]},"references-count":37,"journal-issue":{"issue":"19","published-print":{"date-parts":[[2011,10,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btr397","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"type":"electronic","value":"1367-4811"},{"type":"print","value":"1367-4803"}],"subject":[],"published-other":{"date-parts":[[2011,10,1]]},"published":{"date-parts":[[2011,7,19]]}}}