{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,26]],"date-time":"2026-06-26T07:49:04Z","timestamp":1782460144142,"version":"3.54.5"},"reference-count":26,"publisher":"Oxford University Press (OUP)","issue":"8","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2009,4,15]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: Microarray data analysis has expanded from testing individual genes for differential expression to testing gene sets for differential expression. The tests at the gene set level may focus on multivariate expression changes or on the differential expression of at least one gene in the gene set. These tests may be powerful at detecting subtle changes in expression, but findings at the gene set level need to be examined further to understand whether they are informative and if so how.<\/jats:p><jats:p>Results: We propose to first test for differential expression at the gene set level but then proceed to test for differential expression of individual genes within discovered gene sets. We introduce the overall false discovery rate (OFDR) as an appropriate error rate to control when testing multiple gene sets and genes. 
We illustrate the advantage of this procedure over procedures that only test gene sets or individual genes.<\/jats:p><jats:p>Availability: R code (www.r-project.org) for implementing our approach is included as supplementary material.<\/jats:p><jats:p>Contact: ruheller@wharton.upenn.edu<\/jats:p>","DOI":"10.1093\/bioinformatics\/btp076","type":"journal-article","created":{"date-parts":[[2009,2,13]],"date-time":"2009-02-13T01:13:32Z","timestamp":1234487612000},"page":"1019-1025","source":"Crossref","is-referenced-by-count":37,"title":["A flexible two-stage procedure for identifying gene sets that are differentially expressed"],"prefix":"10.1093","volume":"25","author":[{"given":"Ruth","family":"Heller","sequence":"first","affiliation":[{"name":"1 Department of Statistics, Wharton School, University of Pennsylvania, Philadelphia, PA 19104-6340, 2Computational Biology and Informatics Laboratory, Center for Bioinformatics, University of Pennsylvania, Philadelphia, PA 19104-6021 and 3Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Elisabetta","family":"Manduchi","sequence":"additional","affiliation":[{"name":"1 Department of Statistics, Wharton School, University of Pennsylvania, Philadelphia, PA 19104-6340, 2Computational Biology and Informatics Laboratory, Center for Bioinformatics, University of Pennsylvania, Philadelphia, PA 19104-6021 and 3Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Gregory R.","family":"Grant","sequence":"additional","affiliation":[{"name":"1 Department of Statistics, Wharton School, University of Pennsylvania, Philadelphia, PA 19104-6340, 2Computational Biology and Informatics Laboratory, Center for Bioinformatics, University of Pennsylvania, Philadelphia, PA 19104-6021 and 3Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, 
USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Warren J.","family":"Ewens","sequence":"additional","affiliation":[{"name":"1 Department of Statistics, Wharton School, University of Pennsylvania, Philadelphia, PA 19104-6340, 2Computational Biology and Informatics Laboratory, Center for Bioinformatics, University of Pennsylvania, Philadelphia, PA 19104-6021 and 3Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"286","published-online":{"date-parts":[[2009,2,11]]},"reference":[{"key":"2023051607020847700_B1","doi-asserted-by":"crossref","first-page":"2133","DOI":"10.1002\/(SICI)1097-0258(19980930)17:18<2133::AID-SIM901>3.0.CO;2-2","article-title":"Testing strategies in multi-dose experiments including active control","volume":"17","author":"Bauer","year":"1998","journal-title":"Stat. Med."},{"key":"2023051607020847700_B2","doi-asserted-by":"crossref","first-page":"1215","DOI":"10.1111\/j.1541-0420.2007.00984.x","article-title":"Screening for partial conjunction hypotheses","volume":"64","author":"Benjamini","year":"2008","journal-title":"Biometrics"},{"key":"2023051607020847700_B3","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1111\/j.2517-6161.1995.tb02031.x","article-title":"Controlling the false discovery rate - a practical and powerful approach to multiple testing","volume":"57","author":"Benjamini","year":"1995","journal-title":"J. Roy. Stat. Soc. 
B Met."},{"key":"2023051607020847700_B4","doi-asserted-by":"crossref","first-page":"783","DOI":"10.1534\/genetics.104.036699","article-title":"Quantitative trait loci analysis using the false discovery rate","volume":"171","author":"Benjamini","year":"2005","journal-title":"Genetics"},{"key":"2023051607020847700_B5","doi-asserted-by":"crossref","first-page":"1165","DOI":"10.1214\/aos\/1013699998","article-title":"The control of the false discovery rate in multiple testing under dependency","volume":"29","author":"Benjamini","year":"2001","journal-title":"Annal Stat."},{"key":"2023051607020847700_B6","doi-asserted-by":"crossref","first-page":"491","DOI":"10.1093\/biomet\/93.3.491","article-title":"Adaptive linear step-up false discovery rate controlling procedures","volume":"93","author":"Benjamini","year":"2006","journal-title":"Biometrika"},{"key":"2023051607020847700_B7","doi-asserted-by":"crossref","first-page":"2771","DOI":"10.1182\/blood-2003-09-3243","article-title":"Gene expression profile of adult t-cell acute lymphocytic leukemia identifies distinct subsets of patients with different response to therapy and survival","volume":"103","author":"Chiaretti","year":"2004","journal-title":"Blood"},{"key":"2023051607020847700_B8"},{"key":"2023051607020847700_B9","first-page":"166","article-title":"Multiple testing procedures for controlling tail probability error rates","volume-title":"Technical report.","author":"Dudoit","year":"2004"},{"key":"2023051607020847700_B10","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1007\/BF02595811","article-title":"Resampling-based multiple testing for microarray data analysis","volume":"12","author":"Ge","year":"2003","journal-title":"Test"},{"key":"2023051607020847700_B11","doi-asserted-by":"crossref","first-page":"980","DOI":"10.1093\/bioinformatics\/btm051","article-title":"Analyzing gene expression data in terms of gene sets: methodological 
issues","volume":"23","author":"Goeman","year":"2007","journal-title":"Bioinformatics"},{"key":"2023051607020847700_B12","doi-asserted-by":"crossref","first-page":"93","DOI":"10.1093\/bioinformatics\/btg382","article-title":"A global test for groups of genes: testing association with a clinical outcome","volume":"20","author":"Goeman","year":"2004","journal-title":"Bioinformatics"},{"key":"2023051607020847700_B13","doi-asserted-by":"crossref","first-page":"800","DOI":"10.1093\/biomet\/75.4.800","article-title":"A sharper bonferroni procedure for multiple tests of significance","volume":"75","author":"Hochberg","year":"1988","journal-title":"Biometrika"},{"key":"2023051607020847700_B14","first-page":"65","article-title":"A simple sequentially rejective multiple test procedure","volume":"6","author":"Holm","year":"1979","journal-title":"Scand. J. Stat."},{"key":"2023051607020847700_B15","doi-asserted-by":"crossref","first-page":"1225","DOI":"10.1093\/bioinformatics\/btm092","article-title":"Domain-enhanced analysis of microarray data using go annotations","volume":"23","author":"Liu","year":"2007","journal-title":"Bioinformatics"},{"key":"2023051607020847700_B16","doi-asserted-by":"crossref","first-page":"189","DOI":"10.1093\/bib\/bbn001","article-title":"Gene-set approach for expression pattern analysis","volume":"9","author":"Nam","year":"2008","journal-title":"Brief. Bioinform."},{"key":"2023051607020847700_B17","doi-asserted-by":"crossref","first-page":"192","DOI":"10.1093\/bioinformatics\/btm583","article-title":"Identification of differentially expressed gene categories in microarray studies using nonparametric multivariate analysis","volume":"24","author":"Nettleton","year":"2008","journal-title":"Bioinformatics"},{"key":"2023051607020847700_B18","author":"Pollard","year":"2008","journal-title":"Resampling-based multiple hypothesis testing. 
r bioconductor package version 1.20.0."},{"key":"2023051607020847700_B19","doi-asserted-by":"crossref","first-page":"368","DOI":"10.1093\/bioinformatics\/btf877","article-title":"Identifying differentially expressed genes using false discovery rate controlling procedures","volume":"19","author":"Reiner","year":"2003","journal-title":"Bioinformatics"},{"key":"2023051607020847700_B20","doi-asserted-by":"crossref","first-page":"107","DOI":"10.1002\/bimj.200510313","article-title":"Fdr control by the bh procedure for two-sided correlated tests with implications to gene expression data analysis","volume":"49","author":"Reiner","year":"2007","journal-title":"Biometrical J."},{"key":"2023051607020847700_B21","doi-asserted-by":"crossref","first-page":"187","DOI":"10.1111\/j.1467-9868.2004.00439.x","article-title":"Strong control, conservative point estimation, and simultaneous conservative consistency of false discovery rates: A unified approach","volume":"66","author":"Storey","year":"2004","journal-title":"J. R. Stat. Soci. S. B"},{"key":"2023051607020847700_B22","doi-asserted-by":"crossref","first-page":"15545","DOI":"10.1073\/pnas.0506580102","article-title":"Gene Set Enrichment Analysis: a knowledge-based approach for interpreting genome-wide expression profiles","volume":"102","author":"Subramanian","year":"2005","journal-title":"Proc. Natl Acad. sci. USA"},{"key":"2023051607020847700_B23","doi-asserted-by":"crossref","first-page":"13544","DOI":"10.1073\/pnas.0506577102","article-title":"Discovering statistically significant pathways in expression profiling studies","volume":"102","author":"Tian","year":"2005","journal-title":"Proc. Natl Acad. sci. 
USA"},{"key":"2023051607020847700_B24","doi-asserted-by":"crossref","DOI":"10.2202\/1544-6115.1042","article-title":"Augmentation procedures for control of the generalized family-wise error rate and tail probabilities for the proportion of false positives","volume":"3","author":"Van Der Laan","year":"2004","journal-title":"Stat. Appl. Genet. Mol. Biol."},{"key":"2023051607020847700_B25","volume-title":"Resampling-Based Multiple Testing: Examples and Methods for P-Value Adjustment.","author":"Westfall","year":"1993"},{"key":"2023051607020847700_B26","doi-asserted-by":"crossref","first-page":"309","DOI":"10.1198\/016214507000001373","article-title":"Hierarchical false discovery rate-controlling methodology","volume":"103","author":"Yekutieli","year":"2008","journal-title":"J. Am. Stat. Assoc."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/25\/8\/1019\/50287411\/bioinformatics_25_8_1019.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/25\/8\/1019\/50287411\/bioinformatics_25_8_1019.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,2,7]],"date-time":"2025-02-07T17:49:08Z","timestamp":1738950548000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/25\/8\/1019\/323842"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2009,2,11]]},"references-count":26,"journal-issue":{"issue":"8","published-print":{"date-parts":[[2009,4,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btp076","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2009,4,15]]},"published":{"date-parts":[[2009,2,11]
]}}}