{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,29]],"date-time":"2025-10-29T19:18:57Z","timestamp":1761765537574},"reference-count":57,"publisher":"Oxford University Press (OUP)","issue":"8","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2012,4,15]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: A system-wide approach to revealing the underlying molecular state of a cell is a long-standing biological challenge. Developed over the last decade, gene expression profiles possess the characteristics of such an assay. They have the capacity to reveal both underlying molecular events as well as broader phenotypes such as clinical outcomes. To interpret these profiles, many gene sets have been developed that characterize biological processes. However, the full potential of these gene sets has not yet been achieved. Since the advent of gene expression databases, many have posited that they can reveal properties of activities that are not evident from individual datasets, analogous to how the expression of a single gene generally cannot reveal the activation of a biological process.<\/jats:p><jats:p>Results: To address this issue, we have developed a high-throughput method to mine gene expression databases for the regulation of gene sets. Given a set of genes, we scored it against each gene expression dataset by looking for enrichment of co-regulated genes relative to an empirical null distribution. After validating the method, we applied it to address two biological problems. First, we deciphered the E2F transcriptional network. We confirmed that true transcriptional targets exhibit a distinct regulatory profile across a database. Second, we leveraged the patterns of regulation across a database of gene sets to produce an automatically generated catalog of biological processes. These demonstrations revealed the power of a global analysis of the data contained within gene expression databases, and the potential for using them to address biological questions.<\/jats:p><jats:p>Contact: \u00a0jeffrey.t.chang@uth.tmc.edu<\/jats:p><jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/bts112","type":"journal-article","created":{"date-parts":[[2012,3,10]],"date-time":"2012-03-10T01:39:55Z","timestamp":1331343595000},"page":"1122-1129","source":"Crossref","is-referenced-by-count":3,"title":["Deriving transcriptional programs and functional processes from gene expression databases"],"prefix":"10.1093","volume":"28","author":[{"given":"Jeffrey T.","family":"Chang","sequence":"first","affiliation":[]}],"member":"286","published-online":{"date-parts":[[2012,3,8]]},"reference":[{"key":"2023012711533589200_B1","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1038\/75556","article-title":"Gene ontology: tool for the unification of biology. The Gene Ontology Consortium","volume":"25","author":"Ashburner","year":"2000","journal-title":"Nat. Genet."},{"key":"2023012711533589200_B2","doi-asserted-by":"crossref","first-page":"7024","DOI":"10.1093\/nar\/gkg894","article-title":"Identifying cooperativity among transcription factors controlling the cell cycle in yeast","volume":"31","author":"Banerjee","year":"2003","journal-title":"Nucleic Acids Res."},{"key":"2023012711533589200_B3","doi-asserted-by":"crossref","first-page":"955","DOI":"10.1073\/pnas.0704723105","article-title":"Genome-wide transcriptional analysis of the human cell cycle identifies genes differentially regulated in normal and cancer cells","volume":"105","author":"Bar-Joseph","year":"2008","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012711533589200_B4","doi-asserted-by":"crossref","first-page":"108","DOI":"10.1038\/nature08460","article-title":"Systematic RNA interference reveals that oncogenic KRAS-driven cancers require TBK1","volume":"462","author":"Barbie","year":"2009","journal-title":"Nature"},{"key":"2023012711533589200_B5","doi-asserted-by":"crossref","first-page":"D1005","DOI":"10.1093\/nar\/gkq1184","article-title":"NCBI GEO: archive for functional genomics data sets\u201410 years on","volume":"39","author":"Barrett","year":"2011","journal-title":"Nucleic Acids Res."},{"key":"2023012711533589200_B6","doi-asserted-by":"crossref","first-page":"51","DOI":"10.1038\/4478","article-title":"Gene expression informatics\u2014it's all in your mine","volume":"21","author":"Bassett","year":"1999","journal-title":"Nat. Genet."},{"key":"2023012711533589200_B7","doi-asserted-by":"crossref","first-page":"185","DOI":"10.1016\/S0092-8674(04)00304-6","article-title":"Predicting gene expression from sequence","volume":"117","author":"Beer","year":"2004","journal-title":"Cell"},{"key":"2023012711533589200_B8","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1111\/j.2517-6161.1995.tb02031.x","article-title":"Controlling the false discovery rate: a practical and powerful approach to multiple testing","volume":"57","author":"Benjamini","year":"1995","journal-title":"J.Roy. Stat. Soc."},{"key":"2023012711533589200_B9","doi-asserted-by":"crossref","first-page":"1266","DOI":"10.1016\/j.cell.2008.05.024","article-title":"Variation in homeodomain DNA binding revealed by high-resolution analysis of sequence preferences","volume":"133","author":"Berger","year":"2008","journal-title":"Cell"},{"key":"2023012711533589200_B10","doi-asserted-by":"crossref","first-page":"353","DOI":"10.1038\/nature04296","article-title":"Oncogenic pathway signatures in human cancers as a guide to targeted therapies","volume":"439","author":"Bild","year":"2006","journal-title":"Nature"},{"key":"2023012711533589200_B11","doi-asserted-by":"crossref","first-page":"D102","DOI":"10.1093\/nar\/gkm955","article-title":"JASPAR, the open access database of transcription factor-binding profiles: new content and tools in the 2008 update","volume":"36","author":"Bryne","year":"2008","journal-title":"Nucleic Acids Res."},{"key":"2023012711533589200_B12","doi-asserted-by":"crossref","first-page":"2926","DOI":"10.1093\/bioinformatics\/btl483","article-title":"GATHER: a systems approach to interpreting genomic signatures","volume":"22","author":"Chang","year":"2006","journal-title":"Bioinformatics"},{"key":"2023012711533589200_B13","doi-asserted-by":"crossref","first-page":"2810","DOI":"10.1038\/sj.onc.1208612","article-title":"The E2F transcriptional network: old acquaintances with new faces","volume":"24","author":"Dimova","year":"2005","journal-title":"Oncogene"},{"key":"2023012711533589200_B14","doi-asserted-by":"crossref","first-page":"107","DOI":"10.1214\/07-AOAS101","article-title":"On testing the significance of sets of genes","volume":"1","author":"Efron","year":"2007","journal-title":"Ann. Appl. Stat."},{"key":"2023012711533589200_B15","doi-asserted-by":"crossref","first-page":"171","DOI":"10.1093\/bioinformatics\/bth469","article-title":"Outcome signature genes in breast cancer: is there a unique set?","volume":"21","author":"Ein-Dor","year":"2005","journal-title":"Bioinformatics"},{"key":"2023012711533589200_B16","doi-asserted-by":"crossref","first-page":"932","DOI":"10.1016\/j.jbi.2010.07.001","article-title":"Independent component analysis: mining microarray data for fundamental human gene expression modules","volume":"43","author":"Engreitz","year":"2010","journal-title":"J. Biomed. Inform."},{"key":"2023012711533589200_B17","doi-asserted-by":"crossref","first-page":"603","DOI":"10.1186\/1471-2105-11-603","article-title":"Content-based microarray search using differential expression profiles","volume":"11","author":"Engreitz","year":"2010","journal-title":"BMC Bioinformatics"},{"key":"2023012711533589200_B18","doi-asserted-by":"crossref","first-page":"2174","DOI":"10.1016\/j.bbamcr.2008.07.018","article-title":"Nuclear architecture and gene regulation","volume":"1783","author":"Fedorova","year":"2008","journal-title":"Biochim. Biophys. Acta."},{"key":"2023012711533589200_B19","doi-asserted-by":"crossref","first-page":"972","DOI":"10.1126\/science.1136800","article-title":"Clustering by passing messages between data points","volume":"315","author":"Frey","year":"2007","journal-title":"Science"},{"key":"2023012711533589200_B20","doi-asserted-by":"crossref","first-page":"6994","DOI":"10.1073\/pnas.0912708107","article-title":"A pathway-based classification of human breast cancer","volume":"107","author":"Gatza","year":"2010","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012711533589200_B21","doi-asserted-by":"crossref","first-page":"3707","DOI":"10.1128\/MCB.23.11.3707-3720.2003","article-title":"Identification of E-box factor TFE3 as a functional partner for the E2F3 transcription factor","volume":"23","author":"Giangrande","year":"2003","journal-title":"Mol. Cell. Biol."},{"key":"2023012711533589200_B22","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1016\/j.ccr.2007.11.031","article-title":"An E2F1-dependent gene expression program that determines the balance between proliferation and cell death","volume":"13","author":"Hallstrom","year":"2008","journal-title":"Cancer Cell"},{"key":"2023012711533589200_B23","doi-asserted-by":"crossref","first-page":"99","DOI":"10.1038\/nature02800","article-title":"Transcriptional regulatory code of a eukaryotic genome","volume":"431","author":"Harbison","year":"2004","journal-title":"Nature"},{"key":"2023012711533589200_B24","doi-asserted-by":"crossref","first-page":"2692","DOI":"10.1093\/bioinformatics\/btm403","article-title":"Exploring the functional landscape of gene expression: directed search of large microarray compendia","volume":"23","author":"Hibbs","year":"2007","journal-title":"Bioinformatics"},{"key":"2023012711533589200_B25","doi-asserted-by":"crossref","first-page":"i222","DOI":"10.1093\/bioinformatics\/btm222","article-title":"Systematic discovery of functional modules and context-specific functional annotation of human genome","volume":"23","author":"Huang","year":"2007","journal-title":"Bioinformatics"},{"key":"2023012711533589200_B26","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1093\/nar\/gkn923","article-title":"Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists","volume":"37","author":"Huang","year":"2009","journal-title":"Nucleic Acids Res."},{"key":"2023012711533589200_B27","doi-asserted-by":"crossref","first-page":"44","DOI":"10.1038\/nprot.2008.211","article-title":"Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources","volume":"4","author":"Huang","year":"2009","journal-title":"Nat. Protoc."},{"key":"2023012711533589200_B28","doi-asserted-by":"crossref","first-page":"109","DOI":"10.1016\/S0092-8674(00)00015-5","article-title":"Functional discovery via a compendium of expression profiles","volume":"102","author":"Hughes","year":"2000","journal-title":"Cell"},{"issue":"Suppl. 1","key":"2023012711533589200_B29","doi-asserted-by":"crossref","first-page":"S115","DOI":"10.1093\/bioinformatics\/17.suppl_1.S115","article-title":"GEST: a gene expression search tool based on a novel Bayesian similarity metric","volume":"17","author":"Hunter","year":"2001","journal-title":"Bioinformatics"},{"key":"2023012711533589200_B30","doi-asserted-by":"crossref","first-page":"4684","DOI":"10.1128\/MCB.21.14.4684-4699.2001","article-title":"Role for E2F in control of both DNA replication and mitotic functions as revealed from DNA microarray analysis","volume":"21","author":"Ishida","year":"2001","journal-title":"Mol. Cell. Biol."},{"key":"2023012711533589200_B31","doi-asserted-by":"crossref","first-page":"1497","DOI":"10.1126\/science.1141319","article-title":"Genome-wide mapping of in vivo protein-DNA interactions","volume":"316","author":"Johnson","year":"2007","journal-title":"Science"},{"key":"2023012711533589200_B32","doi-asserted-by":"crossref","first-page":"1659","DOI":"10.1128\/MCB.16.4.1659","article-title":"Interaction of Sp1 with the growth- and cell cycle-regulated transcription factor E2F","volume":"16","author":"Karlseder","year":"1996","journal-title":"Mol. Cell. Biol."},{"key":"2023012711533589200_B33","doi-asserted-by":"crossref","first-page":"2491","DOI":"10.1128\/jvi.69.4.2491-2500.1995","article-title":"E2F1 overexpression in quiescent fibroblasts leads to induction of cellular DNA synthesis and apoptosis","volume":"69","author":"Kowalik","year":"1995","journal-title":"J. Virol."},{"key":"2023012711533589200_B34","doi-asserted-by":"crossref","first-page":"1929","DOI":"10.1126\/science.1132939","article-title":"The Connectivity Map: using gene-expression signatures to connect small molecules, genes, and disease","volume":"313","author":"Lamb","year":"2006","journal-title":"Science"},{"key":"2023012711533589200_B35","doi-asserted-by":"crossref","first-page":"1739","DOI":"10.1093\/bioinformatics\/btr260","article-title":"Molecular signatures database (MSigDB) 3.0","volume":"27","author":"Liberzon","year":"2011","journal-title":"Bioinformatics"},{"key":"2023012711533589200_B36","doi-asserted-by":"crossref","first-page":"R180","DOI":"10.1186\/gb-2008-9-12-r180","article-title":"Singular value decomposition-based regression identifies activation of endogenous signaling pathways in vivo","volume":"9","author":"Liu","year":"2008","journal-title":"Genome Biol."},{"key":"2023012711533589200_B37","doi-asserted-by":"crossref","first-page":"D108","DOI":"10.1093\/nar\/gkj143","article-title":"TRANSFAC and its module TRANSCompel: transcriptional gene regulation in eukaryotes","volume":"34","author":"Matys","year":"2006","journal-title":"Nucleic Acids Res."},{"key":"2023012711533589200_B38","doi-asserted-by":"crossref","first-page":"13550","DOI":"10.1073\/pnas.0506230102","article-title":"An expression signature for p53 status in human breast cancer predicts mutation status, transcriptional effects, and patient survival","volume":"102","author":"Miller","year":"2005","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012711533589200_B39","first-page":"585","article-title":"Toward an understanding of the functional complexity of the E2F and retinoblastoma families","volume":"9","author":"Nevins","year":"1998","journal-title":"Cell Growth Differ."},{"key":"2023012711533589200_B40","doi-asserted-by":"crossref","first-page":"747","DOI":"10.1038\/35021093","article-title":"Molecular portraits of human breast tumours","volume":"406","author":"Perou","year":"2000","journal-title":"Nature"},{"key":"2023012711533589200_B41","doi-asserted-by":"crossref","first-page":"153","DOI":"10.1038\/ng724","article-title":"Identifying regulatory networks by combinatorial analysis of promoter elements","volume":"29","author":"Pilpel","year":"2001","journal-title":"Nat. Genet."},{"key":"2023012711533589200_B42","doi-asserted-by":"crossref","first-page":"D542","DOI":"10.1093\/nar\/gkh107","article-title":"CleanEx: a database of heterogeneous gene expression data based on a consistent gene nomenclature","volume":"32","author":"Praz","year":"2004","journal-title":"Nucleic Acids Res."},{"key":"2023012711533589200_B43","doi-asserted-by":"crossref","first-page":"744","DOI":"10.1016\/j.cell.2010.01.044","article-title":"An atlas of combinatorial transcriptional regulation in mouse and man","volume":"140","author":"Ravasi","year":"2010","journal-title":"Cell"},{"key":"2023012711533589200_B44","doi-asserted-by":"crossref","first-page":"166","DOI":"10.1593\/neo.07112","article-title":"Oncomine 3.0: genes, pathways, and networks in a collection of 18,000 cancer gene expression profiles","volume":"9","author":"Rhodes","year":"2007","journal-title":"Neoplasia"},{"key":"2023012711533589200_B45","doi-asserted-by":"crossref","first-page":"5775","DOI":"10.1093\/emboj\/cdf577","article-title":"Interaction of YY1 with E2Fs, mediated by RYBP, provides a mechanism for specificity of E2F function","volume":"21","author":"Schlisio","year":"2002","journal-title":"EMBO J."},{"key":"2023012711533589200_B46","doi-asserted-by":"crossref","first-page":"7360","DOI":"10.1093\/nar\/gkm885","article-title":"TREMOR\u2014a tool for retrieving transcriptional modules by incorporating motif covariance","volume":"35","author":"Singh","year":"2007","journal-title":"Nucleic Acids Res."},{"key":"2023012711533589200_B47","doi-asserted-by":"crossref","first-page":"539","DOI":"10.1186\/1471-2407-10-539","article-title":"Genome profiling of ERBB2-amplified breast cancers","volume":"10","author":"Sircoulomb","year":"2010","journal-title":"BMC Cancer"},{"key":"2023012711533589200_B48","first-page":"369","article-title":"Prediction and uncertainty in the analysis of gene expression profiles","volume":"2","author":"Spang","year":"2002","journal-title":"In Silico Biol."},{"key":"2023012711533589200_B49","doi-asserted-by":"crossref","first-page":"1385","DOI":"10.1126\/science.1683715","article-title":"Regulation of a segmentation stripe by overlapping activators and repressors in the Drosophila embryo","volume":"254","author":"Stanojevic","year":"1991","journal-title":"Science"},{"key":"2023012711533589200_B50","doi-asserted-by":"crossref","first-page":"15545","DOI":"10.1073\/pnas.0506580102","article-title":"Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles","volume":"102","author":"Subramanian","year":"2005","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012711533589200_B51","doi-asserted-by":"crossref","first-page":"281","DOI":"10.1038\/10343","article-title":"Systematic determination of genetic network architecture","volume":"22","author":"Tavazoie","year":"1999","journal-title":"Nat. Genet."},{"key":"2023012711533589200_B52","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1038\/nrm714","article-title":"Sibling rivalry in the E2F family","volume":"3","author":"Trimarchi","year":"2002","journal-title":"Nat. Rev. Mol. Cell Biol."},{"key":"2023012711533589200_B53","doi-asserted-by":"crossref","first-page":"1977","DOI":"10.1091\/mbc.02-02-0030","article-title":"Identification of genes periodically expressed in the human cell cycle and their expression in tumors","volume":"13","author":"Whitfield","year":"2002","journal-title":"Mol. Biol. Cell"},{"key":"2023012711533589200_B54","doi-asserted-by":"crossref","first-page":"264","DOI":"10.1186\/1471-2105-10-264","article-title":"GEOGLE: context mining tool for the correlation between gene expression and the phenotypic distinction","volume":"10","author":"Yu","year":"2009","journal-title":"BMC Bioinformatics"},{"key":"2023012711533589200_B55","doi-asserted-by":"crossref","first-page":"67","DOI":"10.1016\/j.ccr.2009.05.017","article-title":"Latent bone metastasis in breast cancer tied to Src-dependent survival signals","volume":"16","author":"Zhang","year":"2009","journal-title":"Cancer Cell"},{"key":"2023012711533589200_B56","doi-asserted-by":"crossref","first-page":"4615","DOI":"10.1038\/sj.emboj.7600459","article-title":"E2Fs link the control of G1\/S and G2\/M transcription","volume":"23","author":"Zhu","year":"2004","journal-title":"EMBO J."},{"key":"2023012711533589200_B57","doi-asserted-by":"crossref","first-page":"2798","DOI":"10.1093\/bioinformatics\/btn520","article-title":"GEOmetadb: powerful alternative search engine for the Gene Expression Omnibus","volume":"24","author":"Zhu","year":"2008","journal-title":"Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/28\/8\/1122\/48930321\/bioinformatics_28_8_1122.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/28\/8\/1122\/48930321\/bioinformatics_28_8_1122.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,4,21]],"date-time":"2024-04-21T08:10:15Z","timestamp":1713687015000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/28\/8\/1122\/196399"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,3,8]]},"references-count":57,"journal-issue":{"issue":"8","published-print":{"date-parts":[[2012,4,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/bts112","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2012,4]]},"published":{"date-parts":[[2012,3,8]]}}}