{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,9]],"date-time":"2026-04-09T02:38:29Z","timestamp":1775702309698,"version":"3.50.1"},"reference-count":30,"publisher":"Oxford University Press (OUP)","issue":"18","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2009,9,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: Recently, many univariate and several multivariate approaches have been suggested for testing differential expression of gene sets between different phenotypes. However, despite a wealth of literature studying their performance on simulated and real biological data, still there is a need to quantify their relative performance when they are testing different null hypotheses.<\/jats:p>\n               <jats:p>Results: In this article, we compare the performance of univariate and multivariate tests on both simulated and biological data. In the simulation study we demonstrate that high correlations equally affect the power of both, univariate as well as multivariate tests. In addition, for most of them the power is similarly affected by the dimensionality of the gene set and by the percentage of genes in the set, for which expression is changing between two phenotypes. The application of different test statistics to biological data reveals that three statistics (sum of squared t-tests, Hotelling's T2, N-statistic), testing different null hypotheses, find some common but also some complementing differentially expressed gene sets under specific settings. This demonstrates that due to complementing null hypotheses each test projects on different aspects of the data and for the analysis of biological data it is beneficial to use all three tests simultaneously instead of focusing exclusively on just one.<\/jats:p>\n               <jats:p>Contact: \u00a0Galina_Glazko@urmc.rochester.edu<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btp406","type":"journal-article","created":{"date-parts":[[2009,7,3]],"date-time":"2009-07-03T00:34:52Z","timestamp":1246581292000},"page":"2348-2354","source":"Crossref","is-referenced-by-count":95,"title":["Unite and conquer: univariate and multivariate approaches for finding differentially expressed gene sets"],"prefix":"10.1093","volume":"25","author":[{"given":"Galina V.","family":"Glazko","sequence":"first","affiliation":[{"name":"1 Department of Biostatistics and Computational Biology, University of Rochester Medical Center, Rochester, NY 14642, USA and 2Computational Biology and Machine Learning, Center for Cancer Research and Cell Biology, School of Medicine, Dentistry and Biomedical Sciences, Queen's University Belfast, 97 Lisburn Road, Belfast BT9 7BL, UK"}]},{"given":"Frank","family":"Emmert-Streib","sequence":"additional","affiliation":[{"name":"1 Department of Biostatistics and Computational Biology, University of Rochester Medical Center, Rochester, NY 14642, USA and 2Computational Biology and Machine Learning, Center for Cancer Research and Cell Biology, School of Medicine, Dentistry and Biomedical Sciences, Queen's University Belfast, 97 Lisburn Road, Belfast BT9 7BL, UK"}]}],"member":"286","published-online":{"date-parts":[[2009,7,2]]},"reference":[{"key":"2023013112122059900_B1","doi-asserted-by":"crossref","first-page":"47","DOI":"10.1186\/1471-2105-10-47","article-title":"A general modular framework for gene set enrichment analysis","volume":"10","author":"Ackermann","year":"2009","journal-title":"BMC Bioinformatics"},{"key":"2023013112122059900_B2","doi-asserted-by":"crossref","first-page":"190","DOI":"10.1016\/S0047-259X(03)00079-4","article-title":"On a new multivariate two-sample test","volume":"88","author":"Baringhaus","year":"2004","journal-title":"J. Multivariate Anal."},{"key":"2023013112122059900_B3","doi-asserted-by":"crossref","first-page":"286","DOI":"10.1214\/07-AOAS146","article-title":"A statistical framework for testing functional categories in microarray data","volume":"2","author":"Barry","year":"2008","journal-title":"Ann. Appl. Stat."},{"key":"2023013112122059900_B4","doi-asserted-by":"crossref","first-page":"995","DOI":"10.1214\/aoms\/1177706437","article-title":"A high dimentional two sample significance test","volume":"29","author":"Dempster","year":"1958","journal-title":"Ann. Math. Statist."},{"key":"2023013112122059900_B5","doi-asserted-by":"crossref","DOI":"10.1007\/978-0-387-49317-6","volume-title":"Multiple Testing Procedures with Applications to Genomics.","author":"Dudoit","year":"2008"},{"key":"2023013112122059900_B6","doi-asserted-by":"crossref","first-page":"961","DOI":"10.1089\/cmb.2007.0041","article-title":"The chronic fatigue syndrome: a comparative pathway analysis","volume":"14","author":"Emmert-Streib","year":"2007","journal-title":"J. Comput. Biol."},{"key":"2023013112122059900_B7","doi-asserted-by":"crossref","DOI":"10.1002\/9783527622818","volume-title":"Analysis of Microarray Data: A Net-work-Based Approach.","author":"Emmert-Streib","year":"2008"},{"key":"2023013112122059900_B8","doi-asserted-by":"crossref","first-page":"980","DOI":"10.1093\/bioinformatics\/btm051","article-title":"Analyzing gene expression data in terms of gene sets: methodological issues","volume":"23","author":"Goeman","year":"2007","journal-title":"Bioinformatics"},{"issue":"Suppl. 1","key":"2023013112122059900_B9","doi-asserted-by":"crossref","first-page":"S96","DOI":"10.1093\/bioinformatics\/18.suppl_1.S96","article-title":"Variance stabilization applied to microarray data calibration and to the quantification of differential expression","volume":"18","author":"Huber","year":"2002","journal-title":"Bioinformatics"},{"key":"2023013112122059900_B10","doi-asserted-by":"crossref","first-page":"78","DOI":"10.1093\/bioinformatics\/btm531","article-title":"GlobalANCOVA: exploration and assessment of gene group effects","volume":"24","author":"Hummel","year":"2008","journal-title":"Bioinformatics"},{"key":"2023013112122059900_B11","doi-asserted-by":"crossref","first-page":"306","DOI":"10.1093\/bioinformatics\/btl599","article-title":"Extensions to gene set enrichment","volume":"23","author":"Jiang","year":"2007","journal-title":"Bioinformatics"},{"key":"2023013112122059900_B12","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1093\/nar\/28.1.27","article-title":"KEGG: kyoto encyclopedia of genes and genomes","volume":"28","author":"Kanehisa","year":"2000","journal-title":"Nucleic Acids Res."},{"key":"2023013112122059900_B13","doi-asserted-by":"crossref","first-page":"144","DOI":"10.1186\/1471-2105-6-144","article-title":"PAGE: parametric analysis of gene set enrichment","volume":"6","author":"Kim","year":"2005","journal-title":"BMC Bioinformatics"},{"key":"2023013112122059900_B14","doi-asserted-by":"crossref","DOI":"10.7551\/mitpress\/3087.001.0001","volume-title":"Foundations of Systems Biology.","author":"Kitano","year":"2001"},{"key":"2023013112122059900_B15","doi-asserted-by":"crossref","first-page":"1139","DOI":"10.1142\/S0219720007003041","article-title":"A multivariate extension of the gene set enrichment analysis","volume":"5","author":"Klebanov","year":"2007","journal-title":"J. Bioinform. Comput. Biol."},{"key":"2023013112122059900_B16","doi-asserted-by":"crossref","first-page":"2373","DOI":"10.1093\/bioinformatics\/btl401","article-title":"A multivariate approach for integrating genome-wide expression data and biological knowledge","volume":"22","author":"Kong","year":"2006","journal-title":"Bioinformatics"},{"key":"2023013112122059900_B17","doi-asserted-by":"crossref","first-page":"431","DOI":"10.1186\/1471-2105-8-431","article-title":"Comparative evaluation of gene-set analysis methods","volume":"8","author":"Liu","year":"2007","journal-title":"BMC Bioinformatics"},{"key":"2023013112122059900_B18","doi-asserted-by":"crossref","first-page":"3105","DOI":"10.1093\/bioinformatics\/bti496","article-title":"Hotelling's T2 multivariate profiling for detecting differential expression in microarrays","volume":"21","author":"Lu","year":"2005","journal-title":"Bioinformatics"},{"key":"2023013112122059900_B19","doi-asserted-by":"crossref","first-page":"267","DOI":"10.1038\/ng1180","article-title":"PGC-1alpha-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes","volume":"34","author":"Mootha","year":"2003","journal-title":"Nat. Genet."},{"key":"2023013112122059900_B20","doi-asserted-by":"crossref","first-page":"192","DOI":"10.1093\/bioinformatics\/btm583","article-title":"Identification of differentially expressed gene categories in microarray studies using nonparametric multivariate analysis","volume":"24","author":"Nettleton","year":"2008","journal-title":"Bioinformatics"},{"key":"2023013112122059900_B21","doi-asserted-by":"crossref","DOI":"10.1017\/CBO9780511790515","volume-title":"Systems Biology: Properties of Reconstructed Networks.","author":"Palsson","year":"2006"},{"issue":"Article 32","key":"2023013112122059900_B22","article-title":"A shrinkage approach to large-scale covariance matrix estimation and implications for functional genomics","volume":"4","author":"Schafer","year":"2005","journal-title":"Stat. Appl. Genet. Mol. Biol."},{"key":"2023013112122059900_B23","doi-asserted-by":"crossref","first-page":"502","DOI":"10.1186\/1471-2105-9-502","article-title":"Microarray-based gene set analysis: a comparison of current methods","volume":"9","author":"Song","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023013112122059900_B24","doi-asserted-by":"crossref","first-page":"386","DOI":"10.1016\/j.jmva.2006.11.002","article-title":"A test for the mean vector with fewer observations than the dimension","volume":"99","author":"Srivastava","year":"2008","journal-title":"J Multivariate Anal."},{"key":"2023013112122059900_B25","doi-asserted-by":"crossref","first-page":"15545","DOI":"10.1073\/pnas.0506580102","article-title":"Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles","volume":"102","author":"Subramanian","year":"2005","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023013112122059900_B26","doi-asserted-by":"crossref","first-page":"13544","DOI":"10.1073\/pnas.0506577102","article-title":"Discovering statistically significant pathways in expression profiling studies","volume":"102","author":"Tian","year":"2005","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023013112122059900_B27","doi-asserted-by":"crossref","first-page":"469","DOI":"10.1186\/1471-2105-9-469","article-title":"Gene set analyses for interpreting microarray experiments on prokaryotic organisms","volume":"9","author":"Tintle","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023013112122059900_B28","doi-asserted-by":"crossref","first-page":"225","DOI":"10.1186\/1471-2105-6-225","article-title":"Pathway level analysis of gene expression using singular value decomposition","volume":"6","author":"Tomfohr","year":"2005","journal-title":"BMC Bioinformatics"},{"key":"2023013112122059900_B29","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4757-3121-7","volume-title":"Modern Applied Statistics with S-PLUS.","author":"Venables","year":"1999"},{"key":"2023013112122059900_B30","doi-asserted-by":"crossref","first-page":"919","DOI":"10.1093\/bioinformatics\/btl034","article-title":"Non-linear tests for identifying differentially expressed genes or genetic networks","volume":"22","author":"Xiong","year":"2006","journal-title":"Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/25\/18\/2348\/48995452\/bioinformatics_25_18_2348.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/25\/18\/2348\/48995452\/bioinformatics_25_18_2348.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,31]],"date-time":"2023-01-31T21:36:25Z","timestamp":1675200985000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/25\/18\/2348\/196468"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2009,7,2]]},"references-count":30,"journal-issue":{"issue":"18","published-print":{"date-parts":[[2009,9,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btp406","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2009,9,15]]},"published":{"date-parts":[[2009,7,2]]}}}