{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,8]],"date-time":"2026-04-08T06:48:01Z","timestamp":1775630881248,"version":"3.50.1"},"reference-count":33,"publisher":"Oxford University Press (OUP)","issue":"23","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2012,12,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: The analysis of differentially expressed gene sets became a routine in the analyses of gene expression data. There is a multitude of tests available, ranging from aggregation tests that summarize gene-level statistics for a gene set to true multivariate tests, accounting for intergene correlations. Most of them detect complex departures from the null hypothesis but when the null hypothesis is rejected, the specific alternative leading to the rejection is not easily identifiable.<\/jats:p><jats:p>Results: In this article we compare the power and Type I error rates of minimum-spanning tree (MST)-based non-parametric multivariate tests with several multivariate and aggregation tests, which are frequently used for pathway analyses. In our simulation study, we demonstrate that MST-based tests have power that is for many settings comparable with the power of conventional approaches, but outperform them in specific regions of the parameter space corresponding to biologically relevant configurations. Further, we find for simulated and for gene expression data that MST-based tests discriminate well against shift and scale alternatives. As a general result, we suggest a two-step practical analysis strategy that may increase the interpretability of experimental data: first, apply the most powerful multivariate test to find the subset of pathways for which the null hypothesis is rejected and second, apply MST-based tests to these pathways to select those that support specific alternative hypotheses.<\/jats:p><jats:p>Contact: \u00a0gvglazko@uams.edu or yrahmatallah@uams.edu<\/jats:p><jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/bts579","type":"journal-article","created":{"date-parts":[[2012,10,9]],"date-time":"2012-10-09T00:42:28Z","timestamp":1349743348000},"page":"3073-3080","source":"Crossref","is-referenced-by-count":23,"title":["Gene set analysis for self-contained tests: complex null and specific alternative hypotheses"],"prefix":"10.1093","volume":"28","author":[{"given":"Y.","family":"Rahmatallah","sequence":"first","affiliation":[]},{"given":"F.","family":"Emmert-Streib","sequence":"additional","affiliation":[]},{"given":"G.","family":"Glazko","sequence":"additional","affiliation":[]}],"member":"286","published-online":{"date-parts":[[2012,10,7]]},"reference":[{"key":"2023062411492654200_bts579-B1","doi-asserted-by":"crossref","first-page":"47","DOI":"10.1186\/1471-2105-10-47","article-title":"A general modular framework for gene set enrichment analysis","volume":"10","author":"Ackermann","year":"2009","journal-title":"BMC Bioinformatics"},{"key":"2023062411492654200_bts579-B2","doi-asserted-by":"crossref","first-page":"55","DOI":"10.1038\/nrg1749","article-title":"Microarray data analysis: from disarray to consolidation and consensus","volume":"7","author":"Allison","year":"2006","journal-title":"Nat. Rev. Genet."},{"key":"2023062411492654200_bts579-B3","doi-asserted-by":"crossref","first-page":"509","DOI":"10.1093\/bioinformatics\/17.6.509","article-title":"A Bayesian framework for the analysis of microarray expression data: regularized t -test and statistical inferences of gene changes","volume":"17","author":"Baldi","year":"2001","journal-title":"Bioinformatics"},{"key":"2023062411492654200_bts579-B4","doi-asserted-by":"crossref","first-page":"190","DOI":"10.1016\/S0047-259X(03)00079-4","article-title":"On a new multivariate two-sample test","volume":"88","author":"Baringhaus","year":"2004","journal-title":"J. Multivariate Anal."},{"key":"2023062411492654200_bts579-B5","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1111\/j.2517-6161.1995.tb02031.x","article-title":"Controlling the false discovery rate: A practical and powerful approach to multiple testing","volume":"57","author":"Benjamini","year":"1995","journal-title":"J. Roy. Statist. Soc. B"},{"key":"2023062411492654200_bts579-B6","doi-asserted-by":"crossref","first-page":"9546","DOI":"10.1073\/pnas.0914005107","article-title":"Independent filtering increases detection power for high-throughput experiments","volume":"107","author":"Bourgon","year":"2010","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023062411492654200_bts579-B7","doi-asserted-by":"crossref","first-page":"483","DOI":"10.1186\/1471-2105-11-483","article-title":"Asymmetric microarray data produces gene lists highly predictive of research literature on multiple cancer types","volume":"11","author":"Dawany","year":"2010","journal-title":"BMC Bioinformatics"},{"key":"2023062411492654200_bts579-B8","doi-asserted-by":"crossref","first-page":"242","DOI":"10.1186\/1471-2105-8-242","article-title":"Improving gene set analysis of microarray data by SAM-GS","volume":"8","author":"Dinu","year":"2007","journal-title":"BMC Bioinformatics"},{"key":"2023062411492654200_bts579-B9","doi-asserted-by":"crossref","first-page":"24","DOI":"10.1093\/bib\/bbn042","article-title":"Gene-set analysis and reduction","volume":"10","author":"Dinu","year":"2009","journal-title":"Brief. Bioinform."},{"key":"2023062411492654200_bts579-B10","doi-asserted-by":"crossref","first-page":"961","DOI":"10.1089\/cmb.2007.0041","article-title":"The chronic fatigue syndrome: a comparative pathway analysis","volume":"14","author":"Emmert-Streib","year":"2007","journal-title":"J. Comput. Biol."},{"key":"2023062411492654200_bts579-B11","doi-asserted-by":"crossref","first-page":"e1002053","DOI":"10.1371\/journal.pcbi.1002053","article-title":"Pathway analysis of expression data: deciphering functional building blocks of complex diseases","volume":"7","author":"Emmert-Streib","year":"2011","journal-title":"PLoS Comput. Biol."},{"key":"2023062411492654200_bts579-B12","doi-asserted-by":"crossref","first-page":"697","DOI":"10.1214\/aos\/1176344722","article-title":"Multivariate generalization of the Wald-Wolfowitz and Smirnov two-sample tests","volume":"7","author":"Friedman","year":"1979","journal-title":"Ann. Stat."},{"key":"2023062411492654200_bts579-B13","doi-asserted-by":"crossref","first-page":"574","DOI":"10.1186\/1471-2164-11-574","article-title":"Heading down the wrong pathway: on the influence of correlation within gene sets","volume":"11","author":"Gatti","year":"2010","journal-title":"BMC Genomics"},{"key":"2023062411492654200_bts579-B14","doi-asserted-by":"crossref","first-page":"2348","DOI":"10.1093\/bioinformatics\/btp406","article-title":"Unite and conquer: univariate and multivariate approaches for finding differentially expressed gene sets","volume":"25","author":"Glazko","year":"2009","journal-title":"Bioinformatics"},{"key":"2023062411492654200_bts579-B15","doi-asserted-by":"crossref","first-page":"980","DOI":"10.1093\/bioinformatics\/btm051","article-title":"Analyzing gene expression data in terms of gene sets: methodological issues","volume":"23","author":"Goeman","year":"2007","journal-title":"Bioinformatics"},{"key":"2023062411492654200_bts579-B16","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1186\/1471-2105-10-11","article-title":"Filtering for increased power for microarray data analysis","volume":"10","author":"Hackstadt","year":"2009","journal-title":"BMC Bioinformatics"},{"key":"2023062411492654200_bts579-B17","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1093\/nar\/gkn923","article-title":"Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists","volume":"37","author":"Huang da","year":"2009","journal-title":"Nucleic Acids Res."},{"key":"2023062411492654200_bts579-B18","doi-asserted-by":"crossref","first-page":"281","DOI":"10.1093\/bib\/bbr049","article-title":"Gene set enrichment analysis: performance evaluation and usage guidelines","volume":"13","author":"Hung","year":"2012","journal-title":"Brief. Bioinform."},{"key":"2023062411492654200_bts579-B19","doi-asserted-by":"crossref","first-page":"306","DOI":"10.1093\/bioinformatics\/btl599","article-title":"Extensions to gene set enrichment","volume":"23","author":"Jiang","year":"2007","journal-title":"Bioinformatics"},{"key":"2023062411492654200_bts579-B20","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1093\/nar\/28.1.27","article-title":"KEGG: kyoto encyclopedia of genes and genomes","volume":"28","author":"Kanehisa","year":"2000","journal-title":"Nucleic Acids Res."},{"key":"2023062411492654200_bts579-B21","doi-asserted-by":"crossref","first-page":"1139","DOI":"10.1142\/S0219720007003041","article-title":"A multivariate extension of the gene set enrichment analysis","volume":"5","author":"Klebanov","year":"2007","journal-title":"J. Bioinform. Comput. Biol."},{"key":"2023062411492654200_bts579-B22","doi-asserted-by":"crossref","first-page":"2373","DOI":"10.1093\/bioinformatics\/btl401","article-title":"A multivariate approach for integrating genome-wide expression data and biological knowledge","volume":"22","author":"Kong","year":"2006","journal-title":"Bioinformatics"},{"key":"2023062411492654200_bts579-B23","doi-asserted-by":"crossref","first-page":"431","DOI":"10.1186\/1471-2105-8-431","article-title":"Comparative evaluation of gene-set analysis methods","volume":"8","author":"Liu","year":"2007","journal-title":"BMC Bioinformatics"},{"key":"2023062411492654200_bts579-B24","doi-asserted-by":"crossref","first-page":"3105","DOI":"10.1093\/bioinformatics\/bti496","article-title":"Hotelling's T2 multivariate profiling for detecting differential expression in microarrays","volume":"21","author":"Lu","year":"2005","journal-title":"Bioinformatics"},{"key":"2023062411492654200_bts579-B25","doi-asserted-by":"crossref","first-page":"197","DOI":"10.1186\/1471-2164-10-197","article-title":"Gene set internal coherence in the context of functional profiling","volume":"10","author":"Montaner","year":"2009","journal-title":"BMC Genomics"},{"key":"2023062411492654200_bts579-B26","doi-asserted-by":"crossref","first-page":"267","DOI":"10.1038\/ng1180","article-title":"PGC-1alpha-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes","volume":"34","author":"Mootha","year":"2003","journal-title":"Nat. Genet."},{"key":"2023062411492654200_bts579-B27","doi-asserted-by":"crossref","first-page":"Article3","DOI":"10.2202\/1544-6115.1027","article-title":"Linear models and empirical bayes methods for assessing differential expression in microarray experiments","volume":"3","author":"Smyth","year":"2004","journal-title":"Stat. Appl. Genet. Mol. Biol."},{"key":"2023062411492654200_bts579-B28","doi-asserted-by":"crossref","first-page":"13544","DOI":"10.1073\/pnas.0506577102","article-title":"Discovering statistically significant pathways in expression profiling studies","volume":"102","author":"Tian","year":"2005","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023062411492654200_bts579-B29","doi-asserted-by":"crossref","first-page":"e37510","DOI":"10.1371\/journal.pone.0037510","article-title":"Assessment method for a power analysis to identify differentially expressed pathways","volume":"7","author":"Tripathi","year":"2012","journal-title":"PLoS One"},{"key":"2023062411492654200_bts579-B30","doi-asserted-by":"crossref","first-page":"5116","DOI":"10.1073\/pnas.091062498","article-title":"Significance analysis of microarrays applied to the ionizing radiation response","volume":"98","author":"Tusher","year":"2001","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023062411492654200_bts579-B31","doi-asserted-by":"crossref","first-page":"Article 13","DOI":"10.2202\/1544-6115.1641","article-title":"Linear combination test for hierarchical gene set analysis","volume":"10","author":"Wang","year":"2011","journal-title":"Stat. Appl. Genet. Mol. Biol."},{"key":"2023062411492654200_bts579-B32","doi-asserted-by":"crossref","first-page":"2176","DOI":"10.1093\/bioinformatics\/btq401","article-title":"ROAST: rotation gene set tests for complex microarray experiments","volume":"26","author":"Wu","year":"2010","journal-title":"Bioinformatics"},{"key":"2023062411492654200_bts579-B33","doi-asserted-by":"crossref","first-page":"919","DOI":"10.1093\/bioinformatics\/btl034","article-title":"Non-linear tests for identifying differentially expressed genes or genetic networks","volume":"22","author":"Xiong","year":"2006","journal-title":"Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/28\/23\/3073\/50695256\/bioinformatics_28_23_3073.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/28\/23\/3073\/50695256\/bioinformatics_28_23_3073.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,4,30]],"date-time":"2024-04-30T04:02:18Z","timestamp":1714449738000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/28\/23\/3073\/193652"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,10,7]]},"references-count":33,"journal-issue":{"issue":"23","published-print":{"date-parts":[[2012,12,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/bts579","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2012,12]]},"published":{"date-parts":[[2012,10,7]]}}}