{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,25]],"date-time":"2026-03-25T07:48:09Z","timestamp":1774424889700,"version":"3.50.1"},"reference-count":49,"publisher":"Oxford University Press (OUP)","issue":"2","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2012,1,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Identifying the target genes regulated by transcription factors (TFs) is the most basic step in understanding gene regulation. Recent advances in high-throughput sequencing technology, together with chromatin immunoprecipitation (ChIP), enable mapping TF binding sites genome wide, but it is not possible to infer function from binding alone. This is especially true in mammalian systems, where regulation often occurs through long-range enhancers in gene-rich neighborhoods, rather than proximal promoters, preventing straightforward assignment of a binding site to a target gene.<\/jats:p>\n               <jats:p>Results: We present EMBER (Expectation Maximization of Binding and Expression pRofiles), a method that integrates high-throughput binding data (e.g. ChIP-chip or ChIP-seq) with gene expression data (e.g. DNA microarray) via an unsupervised machine learning algorithm for inferring the gene targets of sets of TF binding sites. Genes selected are those that match overrepresented expression patterns, which can be used to provide information about multiple TF regulatory modes. We apply the method to genome-wide human breast cancer data and demonstrate that EMBER confirms a role for the TFs estrogen receptor alpha, retinoic acid receptors alpha and gamma in breast cancer development, whereas the conventional approach of assigning regulatory targets based on proximity does not. Additionally, we compare several predicted target genes from EMBER to interactions inferred previously, examine combinatorial effects of TFs on gene regulation and illustrate the ability of EMBER to discover multiple modes of regulation.<\/jats:p>\n               <jats:p>Availability: All code used for this work is available at http:\/\/dinner-group.uchicago.edu\/downloads.html<\/jats:p>\n               <jats:p>Contact: \u00a0dinner@uchicago.edu<\/jats:p>\n               <jats:p>Supplementary Information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btr628","type":"journal-article","created":{"date-parts":[[2011,11,15]],"date-time":"2011-11-15T15:18:21Z","timestamp":1321370301000},"page":"206-213","source":"Crossref","is-referenced-by-count":32,"title":["Discovering transcription factor regulatory targets using gene expression and binding data"],"prefix":"10.1093","volume":"28","author":[{"given":"Mark","family":"Maienschein-Cline","sequence":"first","affiliation":[{"name":"1 Department of Chemistry, 2Department of Human Genetics, 3Institute for Genomics and Systems Biology and 4Department of Surgery, The University of Chicago, Chicago, IL 60637, USA"}]},{"given":"Jie","family":"Zhou","sequence":"additional","affiliation":[{"name":"1 Department of Chemistry, 2Department of Human Genetics, 3Institute for Genomics and Systems Biology and 4Department of Surgery, The University of Chicago, Chicago, IL 60637, USA"}]},{"given":"Kevin P.","family":"White","sequence":"additional","affiliation":[{"name":"1 Department of Chemistry, 2Department of Human Genetics, 3Institute for Genomics and Systems Biology and 4Department of Surgery, The University of Chicago, Chicago, IL 60637, USA"},{"name":"1 Department of Chemistry, 2Department of Human Genetics, 3Institute for Genomics and Systems Biology and 4Department of Surgery, The University of Chicago, Chicago, IL 60637, USA"}]},{"given":"Roger","family":"Sciammas","sequence":"additional","affiliation":[{"name":"1 Department of Chemistry, 2Department of Human Genetics, 3Institute for Genomics and Systems Biology and 4Department of Surgery, The University of Chicago, Chicago, IL 60637, USA"}]},{"given":"Aaron R.","family":"Dinner","sequence":"additional","affiliation":[{"name":"1 Department of Chemistry, 2Department of Human Genetics, 3Institute for Genomics and Systems Biology and 4Department of Surgery, The University of Chicago, Chicago, IL 60637, USA"},{"name":"1 Department of Chemistry, 2Department of Human Genetics, 3Institute for Genomics and Systems Biology and 4Department of Surgery, The University of Chicago, Chicago, IL 60637, USA"}]}],"member":"286","published-online":{"date-parts":[[2011,11,13]]},"reference":[{"key":"2023012511501363700_B1","doi-asserted-by":"crossref","first-page":"890","DOI":"10.1002\/jcb.20352","article-title":"Transcriptional enhancers: intelligent enhanceosomes or flexible billboards?","volume":"94","author":"Arnosti","year":"2005","journal-title":"J. Cell Biochem."},{"key":"2023012511501363700_B2","first-page":"28","article-title":"Fitting a mixture model by expectation maximization to discover motifs in biopolymers","volume-title":"Proceedings of the Second International Conference on Intelligent Systems for Molecular Biology.","author":"Bailey","year":"1994"},{"key":"2023012511501363700_B3","doi-asserted-by":"crossref","first-page":"299","DOI":"10.1016\/0092-8674(81)90413-X","article-title":"Expression of a \u03b2-globin gene is enhanced by remote SV40 DNA sequences","volume":"27","author":"Banerji","year":"1981","journal-title":"Cell"},{"key":"2023012511501363700_B4","doi-asserted-by":"crossref","first-page":"1337","DOI":"10.1038\/nbt890","article-title":"Computational discovery of gene modules and regulatory networks","volume":"21","author":"Bar-Joseph","year":"2003","journal-title":"Nat. Biotechnol."},{"key":"2023012511501363700_B5","doi-asserted-by":"crossref","first-page":"823","DOI":"10.1016\/j.cell.2007.05.009","article-title":"High-resolution profiling of histone methylations in the human genome","volume":"129","author":"Barski","year":"2007","journal-title":"Cell"},{"key":"2023012511501363700_B6","first-page":"245","article-title":"Protein binding microarrays (PBMs) for rapid, high-throughput characterization of the sequence specificities of DNA binding proteins","volume":"338","author":"Berger","year":"2006","journal-title":"Methods Mol. Biol."},{"key":"2023012511501363700_B7","doi-asserted-by":"crossref","first-page":"169","DOI":"10.1016\/j.cell.2005.01.001","article-title":"Genomic maps and comparative analysis of histone modifications in human and mouse","volume":"120","author":"Bernstein","year":"2005","journal-title":"Cell"},{"key":"2023012511501363700_B8","doi-asserted-by":"crossref","first-page":"799","DOI":"10.1038\/nature05874","article-title":"Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project","volume":"447","author":"Birney","year":"2007","journal-title":"Nature"},{"key":"2023012511501363700_B9","doi-asserted-by":"crossref","first-page":"23","DOI":"10.1186\/1742-4682-2-23","article-title":"Predicting transcription factor activities from combined analysis of microarray and ChIP data: a partial least squares approach","volume":"2","author":"Boulesteix","year":"2005","journal-title":"Theor. Biol. Med. Model"},{"key":"2023012511501363700_B10","doi-asserted-by":"crossref","first-page":"1314","DOI":"10.1073\/pnas.91.4.1314","article-title":"Dual promoter activation by the human beta-globin locus control region","volume":"91","author":"Bresnick","year":"1994","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012511501363700_B11","doi-asserted-by":"crossref","first-page":"1300","DOI":"10.1038\/ng.235","article-title":"Structure and function of a transcriptional network activated by the MAPK Hog1","volume":"40","author":"Capaldi","year":"2008","journal-title":"Nat. Genet."},{"key":"2023012511501363700_B12","doi-asserted-by":"crossref","first-page":"499","DOI":"10.1016\/S0092-8674(04)00127-8","article-title":"Unbiased mapping of transcription factor binding sites along human chromosomes 21 and 22 points to widespread regulation of noncoding RNAs","volume":"116","author":"Cawley","year":"2004","journal-title":"Cell"},{"key":"2023012511501363700_B13","doi-asserted-by":"crossref","first-page":"1306","DOI":"10.1126\/science.1067799","article-title":"Capturing chromosome conformation","volume":"295","author":"Dekker","year":"2002","journal-title":"Science"},{"key":"2023012511501363700_B14","doi-asserted-by":"crossref","first-page":"988","DOI":"10.1038\/nprot.2007.116","article-title":"Mapping networks of physical interactions between genomic elements using 5C technology","volume":"2","author":"Dostie","year":"2007","journal-title":"Nat. Protoc."},{"key":"2023012511501363700_B15","doi-asserted-by":"crossref","first-page":"14863","DOI":"10.1073\/pnas.95.25.14863","article-title":"Cluster analysis and display of genome-wide expression patterns","volume":"95","author":"Eisen","year":"1998","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012511501363700_B16","doi-asserted-by":"crossref","first-page":"43","DOI":"10.1038\/nature09906","article-title":"Mapping and analysis of chromatin state dynamics in nine human cell types","volume":"473","author":"Ernst","year":"2011","journal-title":"Nature"},{"key":"2023012511501363700_B17","doi-asserted-by":"crossref","first-page":"667","DOI":"10.1016\/j.molcel.2009.11.001","article-title":"Discovering hematopoietic mechanisms through genome-wide analysis of GATA factor chromatin occupancy","volume":"36","author":"Fujiwara","year":"2009","journal-title":"Mol. Cell"},{"key":"2023012511501363700_B18","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1186\/1471-2105-5-31","article-title":"Defining transcriptional networks through integrative modeling of mRNA expression and transcription factor binding data","volume":"5","author":"Gao","year":"2004","journal-title":"BMC Bioinformatics"},{"key":"2023012511501363700_B19","doi-asserted-by":"crossref","first-page":"173","DOI":"10.1038\/nature04768","article-title":"Systems biology approaches identify ATF3 as a negative regulator of Toll-like receptor 4","volume":"441","author":"Gilchrist","year":"2006","journal-title":"Nature"},{"key":"2023012511501363700_B20","doi-asserted-by":"crossref","first-page":"77","DOI":"10.1016\/j.cell.2007.05.042","article-title":"A chromatin landmark and transcription initiation at most promoters in human cells","volume":"130","author":"Guenther","year":"2007","journal-title":"Cell"},{"key":"2023012511501363700_B21","doi-asserted-by":"crossref","first-page":"622","DOI":"10.1016\/j.cell.2011.03.042","article-title":"A rapid, extensive, and transient transcriptional response to estrogen signaling in breast cancer cells","volume":"145","author":"Hah","year":"2011","journal-title":"Cell"},{"key":"2023012511501363700_B22","doi-asserted-by":"crossref","first-page":"311","DOI":"10.1038\/ng1966","article-title":"Distinct and predictive chromatin signatures of transcriptional promoters and enhancers in the human genome","volume":"39","author":"Heintzman","year":"2007","journal-title":"Nat. Genet."},{"key":"2023012511501363700_B23","doi-asserted-by":"crossref","first-page":"108","DOI":"10.1038\/nature07829","article-title":"Histone modifications at human enhancers reflect global cell-type-specific gene expression","volume":"459","author":"Heintzman","year":"2009","journal-title":"Nature"},{"key":"2023012511501363700_B24","doi-asserted-by":"crossref","first-page":"188","DOI":"10.1038\/msb.2008.25","article-title":"Genomic analysis of estrogen cascade reveals histone variant H2A.Z associated with breast cancer progression","volume":"4","author":"Hua","year":"2008","journal-title":"Mol. Syst. Biol."},{"key":"2023012511501363700_B25","doi-asserted-by":"crossref","first-page":"1259","DOI":"10.1016\/j.cell.2009.04.043","article-title":"Genomic antagonism between retinoic acid and estrogen signaling in breast cancer","volume":"137","author":"Hua","year":"2009","journal-title":"Cell"},{"key":"2023012511501363700_B26","doi-asserted-by":"crossref","first-page":"1497","DOI":"10.1126\/science.1141319","article-title":"Genome-wide mapping of in vivo protein-DNA interactions","volume":"316","author":"Johnson","year":"2007","journal-title":"Science"},{"key":"2023012511501363700_B27","doi-asserted-by":"crossref","first-page":"426","DOI":"10.1126\/science.1171347","article-title":"Transcriptional regulatory circuits: predicting numbers from alphabets","volume":"325","author":"Kim","year":"2009","journal-title":"Science"},{"key":"2023012511501363700_B28","doi-asserted-by":"crossref","first-page":"42","DOI":"10.1038\/ni1148","article-title":"Hypersensitive site 7 of the TH2 locus control region is essential for expressing TH2 cytokine genes and for long-range intrachromosomal interactions","volume":"6","author":"Lee","year":"2005","journal-title":"Nat. Immunol."},{"key":"2023012511501363700_B29","doi-asserted-by":"crossref","first-page":"120","DOI":"10.1007\/0-387-21679-0_5","article-title":"DNA-Chip analyzer (dChip)","volume-title":"The Analysis of Gene Expression Data.","author":"Li","year":"2003"},{"key":"2023012511501363700_B30","doi-asserted-by":"crossref","first-page":"16985","DOI":"10.1074\/jbc.273.27.16985","article-title":"Interaction of BAG-1 with retinoic acid receptor and its inhibition of retinoic acid-induced apoptosis in cancer cells","volume":"273","author":"Liu","year":"1998","journal-title":"J. Biol. Chem."},{"key":"2023012511501363700_B31","first-page":"281","article-title":"Some methods for classification and analysis of multivariate observations","volume-title":"Proceedings of Fifth Berkeley Symposium on Mathematical Statistics and Probability","author":"MacQueen","year":"1967"},{"key":"2023012511501363700_B32","doi-asserted-by":"crossref","first-page":"1212","DOI":"10.1038\/ni.2136","article-title":"Epigenetic repression of the Igk locus by STAT5-mediated recruitment of the histone methyltransferase Ezh2","volume":"12","author":"Mandal","year":"2011","journal-title":"Nat. Immunol."},{"key":"2023012511501363700_B33","doi-asserted-by":"crossref","first-page":"495","DOI":"10.1038\/nbt.1630","article-title":"GREAT improves functional interpretation of cis-regulatory regions","volume":"28","author":"McLean","year":"2010","journal-title":"Nat. Biotechnol."},{"key":"2023012511501363700_B34","doi-asserted-by":"crossref","first-page":"553","DOI":"10.1038\/nature06008","article-title":"Genome-wide maps of chromatin state in pluripotent and lineage-committed cells","volume":"448","author":"Mikkelsen","year":"2007","journal-title":"Nature"},{"key":"2023012511501363700_B35","doi-asserted-by":"crossref","first-page":"S22","DOI":"10.1038\/nmeth.1371","article-title":"Computation for ChIP-seq and RNA-seq studies","volume":"6","author":"Pepke","year":"2009","journal-title":"Nat. Methods"},{"key":"2023012511501363700_B36","doi-asserted-by":"crossref","first-page":"13090","DOI":"10.1073\/pnas.93.23.13090","article-title":"Long-range disruption of gene expression by a selectable marker cassette","volume":"93","author":"Pham","year":"1996","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012511501363700_B37","volume-title":"A Genetic Switch: Phage \u03bb and Higher Organisms.","author":"Ptashne","year":"1992"},{"key":"2023012511501363700_B38","doi-asserted-by":"crossref","first-page":"651","DOI":"10.1038\/nmeth1068","article-title":"Genome-wide profiles of STAT1 DNA association using chromatin immunoprecipitation and massively parallel sequencing","volume":"4","author":"Robertson","year":"2007","journal-title":"Nat. Methods"},{"key":"2023012511501363700_B39","first-page":"1337","article-title":"Probabilistic inference of transcription factor concentrations and gene-specific regulatory activities","volume":"21","author":"Sanguinetti","year":"2003","journal-title":"Nat. Biotechnol."},{"key":"2023012511501363700_B40","doi-asserted-by":"crossref","first-page":"1753","DOI":"10.1093\/bioinformatics\/btl154","article-title":"A probabilistic dynamical model for quantitative inference of the regulatory mechanism of transcription","volume":"22","author":"Sanguinetti","year":"2006","journal-title":"Bioinformatics"},{"key":"2023012511501363700_B41","doi-asserted-by":"crossref","DOI":"10.2202\/1544-6115.1027","article-title":"Linear models and empirical bayes methods for assessing differential expression in microarray experiments","volume":"3","author":"Smyth","year":"2004","journal-title":"Stat. Appl. Genet. Mol. Biol."},{"key":"2023012511501363700_B42","first-page":"1252","article-title":"Evolutionary optimization in quantitative structure-activity relationship: an application of genetic neural networks","volume":"39","author":"So","year":"1996","journal-title":"J. Med. Chem."},{"key":"2023012511501363700_B43","doi-asserted-by":"crossref","DOI":"10.1186\/gb-2002-3-12-research0071","article-title":"Bayesian analysis of gene expression levels: statistical quantification of relative mRNA level across multiple strains or treatments","volume":"3","author":"Townsend","year":"2002","journal-title":"Genome Biol."},{"key":"2023012511501363700_B44","doi-asserted-by":"crossref","first-page":"829","DOI":"10.1038\/nmeth.1246","article-title":"Genome-wide analysis of transcription factor binding sites based on ChIP-seq data","volume":"5","author":"Valouev","year":"2008","journal-title":"Nat. Methods"},{"key":"2023012511501363700_B45","doi-asserted-by":"crossref","first-page":"713","DOI":"10.1016\/j.devcel.2010.10.006","article-title":"Differentiation-specific histone modifications reveal dynamic chromatin interactions and partners for the intestinal transcription factor CDX2","volume":"19","author":"Verzi","year":"2010","journal-title":"Dev. Cell"},{"key":"2023012511501363700_B46","doi-asserted-by":"crossref","first-page":"120","DOI":"10.1007\/0-387-21679-0_14","article-title":"MAANOVA: a software package for the analysis of spotted cDNA microarray experiments","volume-title":"The Analysis of Gene Expression Data","author":"Wu","year":"2003"},{"key":"2023012511501363700_B47","doi-asserted-by":"crossref","first-page":"270","DOI":"10.1056\/NEJMra050776","article-title":"Estrogen carcinogenesis in breast cancer","volume":"354","author":"Yager","year":"2006","journal-title":"N. Engl. J. Med."},{"key":"2023012511501363700_B48","doi-asserted-by":"crossref","first-page":"682","DOI":"10.1016\/j.molcel.2009.11.002","article-title":"Insights into GATA-1-mediated gene activation versus repression via genome-wide chromatin occupancy analysis","volume":"36","author":"Yu","year":"2009","journal-title":"Mol. Cell"},{"key":"2023012511501363700_B49","doi-asserted-by":"crossref","first-page":"107","DOI":"10.1038\/sj.onc.1201142","article-title":"Inhibition of cyclin D expression in human breast cancer carcinoma cells by retinoids in vitro","volume":"15","author":"Zhou","year":"1997","journal-title":"Oncogene"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/28\/2\/206\/48869473\/bioinformatics_28_2_206.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/28\/2\/206\/48869473\/bioinformatics_28_2_206.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,25]],"date-time":"2023-01-25T11:51:53Z","timestamp":1674647513000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/28\/2\/206\/197146"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,11,13]]},"references-count":49,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2012,1,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btr628","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2012,1,15]]},"published":{"date-parts":[[2011,11,13]]}}}