{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,26]],"date-time":"2026-01-26T09:52:52Z","timestamp":1769421172728,"version":"3.49.0"},"reference-count":41,"publisher":"Oxford University Press (OUP)","issue":"21","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2012,11,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Modelling the regulation of gene expression can provide insight into the regulatory roles of individual transcription factors (TFs) and histone modifications. Recently, Ouyang et al. in 2009 modelled gene expression levels in mouse embryonic stem (mES) cells using in vivo ChIP-seq measurements of TF binding. ChIP-seq TF binding data, however, are tissue-specific and relatively difficult to obtain. This limits the applicability of gene expression models that rely on ChIP-seq TF binding data.<\/jats:p>\n               <jats:p>Results: In this study, we build regression-based models that relate gene expression to the binding of 12 different TFs, 7 histone modifications and chromatin accessibility (DNase I hypersensitivity) in two different tissues. We find that expression models based on computationally predicted TF binding can achieve similar accuracy to those using in vivo TF binding data and that including binding at weak sites is critical for accurate prediction of gene expression. We also find that incorporating histone modification and chromatin accessibility data results in additional accuracy. Surprisingly, we find that models that use no TF binding data at all, but only histone modification and chromatin accessibility data, can be as (or more) accurate than those based on in vivo TF binding data.<\/jats:p>\n               <jats:p>Availability and implementation: All scripts, motifs and data presented in this article are available online at http:\/\/research.imb.uq.edu.au\/t.bailey\/supplementary_data\/McLeay2011a.<\/jats:p>\n               <jats:p>Contact: \u00a0t.bailey@imb.uq.edu.au<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/bts529","type":"journal-article","created":{"date-parts":[[2012,9,7]],"date-time":"2012-09-07T01:09:09Z","timestamp":1346980149000},"page":"2789-2796","source":"Crossref","is-referenced-by-count":52,"title":["Genome-wide <i>in silico<\/i> prediction of gene expression"],"prefix":"10.1093","volume":"28","author":[{"given":"Robert C.","family":"McLeay","sequence":"first","affiliation":[{"name":"1 Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD 4072, Australia and 2D\u00e9partement d'Informatique, Universit\u00e9 Bordeaux 1 Sciences et Technologies, 33405 Talence, France"}]},{"given":"Tom","family":"Lesluyes","sequence":"additional","affiliation":[{"name":"1 Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD 4072, Australia and 2D\u00e9partement d'Informatique, Universit\u00e9 Bordeaux 1 Sciences et Technologies, 33405 Talence, France"},{"name":"1 Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD 4072, Australia and 2D\u00e9partement d'Informatique, Universit\u00e9 Bordeaux 1 Sciences et Technologies, 33405 Talence, France"}]},{"given":"Gabriel","family":"Cuellar Partida","sequence":"additional","affiliation":[{"name":"1 Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD 4072, Australia and 2D\u00e9partement d'Informatique, Universit\u00e9 Bordeaux 1 Sciences et Technologies, 33405 Talence, France"}]},{"given":"Timothy L.","family":"Bailey","sequence":"additional","affiliation":[{"name":"1 Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD 4072, Australia and 2D\u00e9partement d'Informatique, Universit\u00e9 Bordeaux 1 Sciences et Technologies, 33405 Talence, France"}]}],"member":"286","published-online":{"date-parts":[[2012,9,6]]},"reference":[{"key":"2023012513151043200_bts529-B1","doi-asserted-by":"crossref","first-page":"440","DOI":"10.3109\/10409238.2010.504700","article-title":"Decoding the histone h4 lysine 20 methylation mark","volume":"45","author":"Balakrishnan","year":"2010","journal-title":"Crit. Rev. Biochem. Mol. Biol."},{"key":"2023012513151043200_bts529-B2","doi-asserted-by":"crossref","first-page":"1742","DOI":"10.1101\/gr.090951.109","article-title":"Chromatin poises mirna- and protein-coding genes for expression","volume":"19","author":"Barski","year":"2009","journal-title":"Genome Res."},{"key":"2023012513151043200_bts529-B3","doi-asserted-by":"crossref","first-page":"393","DOI":"10.1038\/nprot.2008.195","article-title":"Universal protein-binding microarrays for the comprehensive characterization of the DNA-binding specificities of transcription factors","volume":"4","author":"Berger","year":"2009","journal-title":"Nat. Protoc."},{"key":"2023012513151043200_bts529-B4","doi-asserted-by":"crossref","first-page":"1045","DOI":"10.1038\/nbt1010-1045","article-title":"The nih roadmap epigenomics mapping consortium","volume":"28","author":"Bernstein","year":"2010","journal-title":"Nat. Biotechnol."},{"key":"2023012513151043200_bts529-B5","doi-asserted-by":"crossref","first-page":"595","DOI":"10.1101\/gr.4887606","article-title":"Unbiased location analysis of E2F1-binding sites suggests a widespread role for E2F1 in the human genome","volume":"16","author":"Bieda","year":"2006","journal-title":"Genome Res."},{"key":"2023012513151043200_bts529-B6","doi-asserted-by":"crossref","first-page":"185","DOI":"10.1093\/bioinformatics\/19.2.185","article-title":"A comparison of normalization methods for high density oligonucleotide array data based on variance and bias","volume":"19","author":"Bolstad","year":"2003","journal-title":"Bioinformatics"},{"key":"2023012513151043200_bts529-B7","doi-asserted-by":"crossref","first-page":"1039","DOI":"10.1126\/science.1076997","article-title":"Role of histone h3 lysine 27 methylation in polycomb-group silencing","volume":"298","author":"Cao","year":"2002","journal-title":"Science"},{"key":"2023012513151043200_bts529-B8","doi-asserted-by":"crossref","first-page":"1106","DOI":"10.1016\/j.cell.2008.04.043","article-title":"Integration of external signaling pathways with the core transcriptional network in embryonic stem cells","volume":"133","author":"Chen","year":"2008","journal-title":"Cell"},{"key":"2023012513151043200_bts529-B9","doi-asserted-by":"crossref","first-page":"613","DOI":"10.1038\/nmeth.1223","article-title":"Stem cell transcriptome profiling via massive-scale mrna sequencing","volume":"5","author":"Cloonan","year":"2008","journal-title":"Nat. Methods"},{"key":"2023012513151043200_bts529-B10","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1093\/bioinformatics\/btr614","article-title":"Epigenetic priors for identifying active transcription factor binding sites","volume":"28","author":"Cuellar Partida","year":"2012","journal-title":"Bioinformatics"},{"key":"2023012513151043200_bts529-B11","doi-asserted-by":"crossref","first-page":"16234","DOI":"10.1073\/pnas.0407365101","article-title":"Interacting models of cooperative gene regulation","volume":"101","author":"Das","year":"2004","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012513151043200_bts529-B12","doi-asserted-by":"crossref","first-page":"2006.0029","DOI":"10.1038\/msb4100067","article-title":"Adaptively inferring human transcriptional subnetworks","volume":"2","author":"Das","year":"2006","journal-title":"Mol. Syst. Biol."},{"key":"2023012513151043200_bts529-B13","doi-asserted-by":"crossref","first-page":"e1001046","DOI":"10.1371\/journal.pbio.1001046","article-title":"A user\u2019s guide to the encyclopedia of DNA elements (ENCODE)","volume":"9","author":"ENCODE Project Consortium (2011).","year":"2011","journal-title":"PLoS Biol."},{"key":"2023012513151043200_bts529-B14","doi-asserted-by":"crossref","first-page":"43","DOI":"10.1038\/nature09906","article-title":"Mapping and analysis of chromatin state dynamics in nine human cell types","volume":"473","author":"Ernst","year":"2011","journal-title":"Nature"},{"key":"2023012513151043200_bts529-B15","doi-asserted-by":"crossref","first-page":"1729","DOI":"10.1093\/bioinformatics\/btn305","article-title":"Findpeaks 3.1: a tool for identifying areas of enrichment from massively parallel short-read sequencing technology","volume":"24","author":"Fejes","year":"2008","journal-title":"Bioinformatics"},{"key":"2023012513151043200_bts529-B16","doi-asserted-by":"crossref","first-page":"1775","DOI":"10.1126\/science.1196914","article-title":"Integrative analysis of the caenorhabditis elegans genome by the modencode project","volume":"330","author":"Gerstein","year":"2010","journal-title":"Science"},{"key":"2023012513151043200_bts529-B17","doi-asserted-by":"crossref","first-page":"1017","DOI":"10.1093\/bioinformatics\/btr064","article-title":"Fimo: scanning for occurrences of a given motif","volume":"27","author":"Grant","year":"2011","journal-title":"Bioinformatics"},{"key":"2023012513151043200_bts529-B18","doi-asserted-by":"crossref","first-page":"2926","DOI":"10.1073\/pnas.0909344107","article-title":"Histone modification levels are predictive for gene expression","volume":"107","author":"Karli\u0107","year":"2010","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012513151043200_bts529-B19","doi-asserted-by":"crossref","first-page":"693","DOI":"10.1016\/j.cell.2007.02.005","article-title":"Chromatin modifications and their function","volume":"128","author":"Kouzarides","year":"2007","journal-title":"Cell"},{"key":"2023012513151043200_bts529-B20","doi-asserted-by":"crossref","first-page":"R25","DOI":"10.1186\/gb-2009-10-3-r25","article-title":"Ultrafast and memory-efficient alignment of short dna sequences to the human genome","volume":"10","author":"Langmead","year":"2009","journal-title":"Genome Biol."},{"key":"2023012513151043200_bts529-B21","doi-asserted-by":"crossref","first-page":"374","DOI":"10.1093\/nar\/gkg108","article-title":"Transfac: transcriptional regulation, from patterns to profiles","volume":"31","author":"Matys","year":"2003","journal-title":"Nucleic Acids Res."},{"key":"2023012513151043200_bts529-B22","doi-asserted-by":"crossref","first-page":"766","DOI":"10.1038\/nature07107","article-title":"Genome-scale dna methylation maps of pluripotent and differentiated cells","volume":"454","author":"Meissner","year":"2008","journal-title":"Nature"},{"key":"2023012513151043200_bts529-B23","doi-asserted-by":"crossref","first-page":"553","DOI":"10.1038\/nature06008","article-title":"Genome-wide maps of chromatin state in pluripotent and lineage-committed cells","volume":"448","author":"Mikkelsen","year":"2007","journal-title":"Nature"},{"key":"2023012513151043200_bts529-B24","doi-asserted-by":"crossref","first-page":"621","DOI":"10.1038\/nmeth.1226","article-title":"Mapping and quantifying mammalian transcriptomes by RNA-Seq","volume":"5","author":"Mortazavi","year":"2008","journal-title":"Nat. Methods"},{"key":"2023012513151043200_bts529-B25","doi-asserted-by":"crossref","first-page":"21521","DOI":"10.1073\/pnas.0904863106","article-title":"Chip-seq of transcription factors predicts absolute and differential gene expression in embryonic stem cells","volume":"106","author":"Ouyang","year":"2009","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012513151043200_bts529-B26","doi-asserted-by":"crossref","first-page":"S50","DOI":"10.1186\/1471-2105-12-S1-S50","article-title":"A regression analysis of gene expression in es cells reveals two gene classes that are significantly different in epigenetic patterns","volume":"12","author":"Park","year":"2011","journal-title":"BMC Bioinformatics"},{"key":"2023012513151043200_bts529-B27","doi-asserted-by":"crossref","first-page":"4958","DOI":"10.1093\/nar\/gkq244","article-title":"Characterization of an antagonistic switch between histone h3 lysine 27 methylation and acetylation in the transcriptional regulation of polycomb group target genes","volume":"38","author":"Pasini","year":"2010","journal-title":"Nucleic Acids Res."},{"key":"2023012513151043200_bts529-B28","doi-asserted-by":"crossref","first-page":"D105","DOI":"10.1093\/nar\/gkp950","article-title":"Jaspar 2010: the greatly expanded open-access database of transcription factor binding profiles","volume":"38","author":"Portales-Casamar","year":"2010","journal-title":"Nucleic Acids Res."},{"key":"2023012513151043200_bts529-B29","volume-title":"R: A Language and Environment for Statistical Computing","author":"R Development Core Team. (2008)","year":"2008"},{"key":"2023012513151043200_bts529-B30","doi-asserted-by":"crossref","first-page":"D620","DOI":"10.1093\/nar\/gkp961","article-title":"ENCODE whole-genome data in the UCSC Genome Browser","volume":"38","author":"Rosenbloom","year":"2010","journal-title":"Nucleic Acids Res."},{"key":"2023012513151043200_bts529-B31","doi-asserted-by":"crossref","first-page":"e1001023","DOI":"10.1371\/journal.pgen.1001023","article-title":"Chd7 targets active gene enhancer elements to modulate es cell-specific gene expression","volume":"6","author":"Schnetz","year":"2010","journal-title":"PLoS Genet."},{"key":"2023012513151043200_bts529-B32","doi-asserted-by":"crossref","first-page":"1251","DOI":"10.1101\/gad.300704","article-title":"A silencing pathway to induce h3-k9 and h4-k20 trimethylation at constitutive heterochromatin","volume":"18","author":"Schotta","year":"2004","journal-title":"Genes Dev."},{"key":"2023012513151043200_bts529-B33","doi-asserted-by":"crossref","first-page":"1105","DOI":"10.1093\/bioinformatics\/btp120","article-title":"Tophat: discovering splice junctions with RNA-seq","volume":"25","author":"Trapnell","year":"2009","journal-title":"Bioinformatics"},{"key":"2023012513151043200_bts529-B34","doi-asserted-by":"crossref","first-page":"829","DOI":"10.1038\/nmeth.1246","article-title":"Genome-wide analysis of transcription factor binding sites based on ChIP-Seq data","volume":"5","author":"Valouev","year":"2008","journal-title":"Nat. Methods"},{"key":"2023012513151043200_bts529-B35","doi-asserted-by":"crossref","first-page":"1245","DOI":"10.1016\/j.str.2008.04.015","article-title":"Structural insight into the recognition of the H3K4me3 mark by the TFIID subunit TAF3","volume":"16","author":"van Ingen","year":"2008","journal-title":"Structure"},{"key":"2023012513151043200_bts529-B36","doi-asserted-by":"crossref","first-page":"252","DOI":"10.1038\/nrg2538","article-title":"A census of human transcription factors: function, expression and evolution","volume":"10","author":"Vaquerizas","year":"2009","journal-title":"Nat. Rev. Genet."},{"key":"2023012513151043200_bts529-B37","doi-asserted-by":"crossref","first-page":"897","DOI":"10.1038\/ng.154","article-title":"Combinatorial patterns of histone acetylations and methylations in the human genome","volume":"40","author":"Wang","year":"2008","journal-title":"Nat. Genet."},{"key":"2023012513151043200_bts529-B38","doi-asserted-by":"crossref","first-page":"276","DOI":"10.1038\/nrg1315","article-title":"Applied bioinformatics for the identification of regulatory elements","volume":"5","author":"Wasserman","year":"2004","journal-title":"Nat. Rev. Genet."},{"key":"2023012513151043200_bts529-B39","doi-asserted-by":"crossref","first-page":"14","DOI":"10.1093\/nar\/gkn866","article-title":"High-throughput chromatin information enables accurate tissue-specific prediction of transcription factor binding sites","volume":"37","author":"Whitington","year":"2009","journal-title":"Nucleic Acids Res."},{"key":"2023012513151043200_bts529-B40","doi-asserted-by":"crossref","first-page":"167","DOI":"10.1093\/bioinformatics\/btn605","article-title":"MotifMap: a human genome-wide map of candidate regulatory motif sites","volume":"25","author":"Xie","year":"2009","journal-title":"Bioinformatics"},{"key":"2023012513151043200_bts529-B41","doi-asserted-by":"crossref","first-page":"R137","DOI":"10.1186\/gb-2008-9-9-r137","article-title":"Model-based analysis of ChIP-Seq (MACS)","volume":"9","author":"Zhang","year":"2008","journal-title":"Genome Biol."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/28\/21\/2789\/48873383\/bioinformatics_28_21_2789.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/28\/21\/2789\/48873383\/bioinformatics_28_21_2789.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,25]],"date-time":"2023-01-25T19:18:55Z","timestamp":1674674335000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/28\/21\/2789\/236714"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,9,6]]},"references-count":41,"journal-issue":{"issue":"21","published-print":{"date-parts":[[2012,11,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/bts529","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2012,11,1]]},"published":{"date-parts":[[2012,9,6]]}}}