{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,10]],"date-time":"2026-04-10T09:54:19Z","timestamp":1775814859698,"version":"3.50.1"},"reference-count":32,"publisher":"Oxford University Press (OUP)","issue":"2","license":[{"start":{"date-parts":[[2016,10,2]],"date-time":"2016-10-02T00:00:00Z","timestamp":1475366400000},"content-version":"vor","delay-in-days":3232,"URL":"http:\/\/creativecommons.org\/licenses\/by-nc\/2.0\/uk\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2008,1,15]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: The field of microarray data analysis is shifting emphasis from methods for identifying differentially expressed genes to methods for identifying differentially expressed gene categories. The latter approaches utilize a priori information about genes to group genes into categories and enhance the interpretation of experiments aimed at identifying expression differences across treatments. While almost all of the existing approaches for identifying differentially expressed gene categories are practically useful, they suffer from a variety of drawbacks. Perhaps most notably, many popular tools are based exclusively on gene-specific statistics that cannot detect many types of multivariate expression change.<\/jats:p><jats:p>Results: We have developed a nonparametric multivariate method for identifying gene categories whose multivariate expression distribution differs across two or more conditions. We illustrate our approach and compare its performance to several existing procedures via the analysis of a real data set and a unique data-based simulation study designed to capture the challenges and complexities of practical data analysis. We show that our method has good power for differentiating between differentially expressed and non-differentially expressed gene categories, and we utilize a resampling based strategy for controling the false discovery rate when testing multiple categories.<\/jats:p><jats:p>Availability: R code (www.r-project.org) for implementing our approach is available from the first author by request.<\/jats:p><jats:p>Contact: \u00a0dnett@iastate.edu<\/jats:p><jats:p>Supplementary information: Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btm583","type":"journal-article","created":{"date-parts":[[2007,11,28]],"date-time":"2007-11-28T01:13:15Z","timestamp":1196212395000},"page":"192-201","source":"Crossref","is-referenced-by-count":51,"title":["Identification of differentially expressed gene categories in microarray studies using nonparametric multivariate analysis"],"prefix":"10.1093","volume":"24","author":[{"given":"Dan","family":"Nettleton","sequence":"first","affiliation":[{"name":"1 Department of Statistics, Iowa State University, Ames, Iowa 50011-1210, USA, 2Eli Lilly and Company, Lilly Research Laboratories, P.O. Box 708, Greenfield, Indiana 46140 and 3Department of Animal Science, Iowa State University, Ames, Iowa 50011-3150, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Justin","family":"Recknor","sequence":"additional","affiliation":[{"name":"1 Department of Statistics, Iowa State University, Ames, Iowa 50011-1210, USA, 2Eli Lilly and Company, Lilly Research Laboratories, P.O. Box 708, Greenfield, Indiana 46140 and 3Department of Animal Science, Iowa State University, Ames, Iowa 50011-3150, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"James M.","family":"Reecy","sequence":"additional","affiliation":[{"name":"1 Department of Statistics, Iowa State University, Ames, Iowa 50011-1210, USA, 2Eli Lilly and Company, Lilly Research Laboratories, P.O. Box 708, Greenfield, Indiana 46140 and 3Department of Animal Science, Iowa State University, Ames, Iowa 50011-3150, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2007,11,27]]},"reference":[{"key":"2023020209483109900_B1","doi-asserted-by":"crossref","first-page":"578","DOI":"10.1093\/bioinformatics\/btg455","article-title":"Fatigo: a web tool for finding significant associations of gene ontology terms with groups of genes","volume":"20","author":"Al-Shahrour","year":"2004","journal-title":"Bioinformatics"},{"key":"2023020209483109900_B2","first-page":"55","article-title":"Microarray data analysis: from disarray to consolidation and consensus","volume":"7","author":"Allison","year":"2006","journal-title":"Nature"},{"key":"2023020209483109900_B3","doi-asserted-by":"crossref","first-page":"1943","DOI":"10.1093\/bioinformatics\/bti260","article-title":"Significance analysis of functional categories in gene expression studies: a structured permutation approach","volume":"21","author":"Barry","year":"2005","journal-title":"Bioinformatics"},{"key":"2023020209483109900_B4","doi-asserted-by":"crossref","first-page":"1464","DOI":"10.1093\/bioinformatics\/bth088","article-title":"Gostat: find statistically overrepresented gene ontologies within a group of genes","volume":"20","author":"Beibarth","year":"2004","journal-title":"Bioinformatics"},{"key":"2023020209483109900_B5","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1111\/j.2517-6161.1995.tb02031.x","article-title":"Controlling false discovery rate: a practical and powerful approach to multiple testing","volume":"57","author":"Benjamini","year":"1995","journal-title":"J. R. Stat. Soc. B"},{"key":"2023020209483109900_B6","doi-asserted-by":"crossref","first-page":"2502","DOI":"10.1093\/bioinformatics\/btg363","article-title":"Characterizing gene sets with funcassociate","volume":"19","author":"Berriz","year":"2003","journal-title":"Bioinformatics"},{"key":"2023020209483109900_B7","doi-asserted-by":"crossref","first-page":"365","DOI":"10.1093\/nar\/gkg095","article-title":"The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003","volume":"31","author":"Boeckmann","year":"2003","journal-title":"Nucleic Acids Res."},{"key":"2023020209483109900_B8","doi-asserted-by":"crossref","first-page":"1462","DOI":"10.1093\/bioinformatics\/bth087","article-title":"NetAffx Gene Ontology Mining Tool: a visual approach for microarray data analysis","volume":"20","author":"Cheng","year":"2004","journal-title":"Bioinformatics"},{"key":"2023020209483109900_B9","doi-asserted-by":"crossref","first-page":"2771","DOI":"10.1182\/blood-2003-09-3243","article-title":"Gene expression profile of adult T-cell acute lymphocytic leukemia identifies distinct subsets of patients with different response to therapy and survival","volume":"103","author":"Chiaretti","year":"2004","journal-title":"Blood"},{"key":"2023020209483109900_B10","doi-asserted-by":"crossref","first-page":"7","DOI":"10.1186\/gb-2003-4-1-r7","article-title":"MAPPFinder: using gene ontology and GenMAPP to create a global gene-expression profile from microarray data","volume":"4","author":"Doniger","year":"2003","journal-title":"Genome Biol."},{"key":"2023020209483109900_B11","first-page":"98","article-title":"Global functional profiling of gene expression","volume":"81","author":"Dr\u0103ghici","year":"2003","journal-title":"Genomics"},{"key":"2023020209483109900_B12","volume-title":"Randomization Tests","author":"Edgington","year":"1995","edition":"3rd"},{"key":"2023020209483109900_B13","doi-asserted-by":"crossref","first-page":"107","DOI":"10.1214\/07-AOAS101","article-title":"On testing the significance of sets of genes","volume":"1","author":"Efron","year":"2007","journal-title":"Ann. Appl. Stat."},{"key":"2023020209483109900_B14","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1038\/75556","article-title":"Gene Ontology: tool for the unification of biology","volume":"25","author":"The Gene Ontology Consortium","year":"2000","journal-title":"Nat. Genet."},{"key":"2023020209483109900_B15","doi-asserted-by":"crossref","first-page":"980","DOI":"10.1093\/bioinformatics\/btm051","article-title":"Analyzing gene expression data in terms of gene sets: methodological issues","volume":"23","author":"Goeman","year":"2007","journal-title":"Bioinformatics"},{"key":"2023020209483109900_B16","doi-asserted-by":"crossref","first-page":"93","DOI":"10.1093\/bioinformatics\/btg382","article-title":"A global test for groups of genes: testing association with a clinical outcome","volume":"20","author":"Goeman","year":"2004","journal-title":"Bioinformatics"},{"key":"2023020209483109900_B17","doi-asserted-by":"crossref","first-page":"306","DOI":"10.1093\/bioinformatics\/btl599","article-title":"Extensions to gene set enrichment","volume":"23","author":"Jiang","year":"2007","journal-title":"Bioinformatics"},{"key":"2023020209483109900_B18","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1093\/nar\/28.1.27","article-title":"KEGG: Kyoto Encyclopedia of Genes and Genomes","volume":"28","author":"Kanehisa","year":"2000","journal-title":"Nucleic Acids Res."},{"key":"2023020209483109900_B19","doi-asserted-by":"crossref","first-page":"3587","DOI":"10.1093\/bioinformatics\/bti565","article-title":"Ontological analysis of gene expression data: current tools, limitations, and open problems","volume":"21","author":"Khatri","year":"2005","journal-title":"Bioinformatics"},{"key":"2023020209483109900_B20","doi-asserted-by":"crossref","first-page":"1225","DOI":"10.1093\/bioinformatics\/btm092","article-title":"Domain enhanced analysis of microarray data using GO annotations","volume":"23","author":"Liu","year":"2007","journal-title":"Bioinformatics"},{"key":"2023020209483109900_B21","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4757-3449-2","volume-title":"Permutation methods: A Distance Function Approach","author":"Mielke","year":"2001"},{"key":"2023020209483109900_B22","doi-asserted-by":"crossref","first-page":"337","DOI":"10.1198\/108571106X129135","article-title":"Estimating the number of true null hypotheses from a histogram of p-values","volume":"11","author":"Nettleton","year":"2006","journal-title":"J. Agri., Bio., Environ. Stat."},{"key":"2023020209483109900_B23","doi-asserted-by":"crossref","first-page":"85","DOI":"10.1214\/07-AOAS104","article-title":"Random-set methods identify distinct aspects of the enrichment signal in gene-set analysis","volume":"1","author":"Newton","year":"2007","journal-title":"Ann. Appl. Stat."},{"key":"2023020209483109900_B24","volume-title":"Multivariate Permutation Tests with Applications in Biostatistics","author":"Pesarin","year":"2001"},{"key":"2023020209483109900_B25","doi-asserted-by":"crossref","first-page":"368","DOI":"10.1093\/bioinformatics\/btf877","article-title":"Identifying differentially expressed genes using false discovery rate controlling procedures","volume":"19","author":"Reiner","year":"2003","journal-title":"Bioinformatics"},{"key":"2023020209483109900_B26","doi-asserted-by":"crossref","first-page":"405","DOI":"10.1002\/(SICI)1097-0134(199707)28:3<405::AID-PROT10>3.0.CO;2-L","article-title":"Pfam: A Comprehensive Database of Protein Families Based on Seed Alignments","volume":"28","author":"Sonnhammer","year":"1997","journal-title":"Proteins"},{"key":"2023020209483109900_B27","doi-asserted-by":"crossref","first-page":"580","DOI":"10.1096\/fj.05-5125fje","article-title":"Transcriptional profiling of myostatin-knockout mice implicates Wnt signaling in postnatal skeletal muscle growth and hypertrophy","volume":"20","author":"Steelman","year":"2006","journal-title":"FASEB J."},{"key":"2023020209483109900_B28","doi-asserted-by":"crossref","first-page":"15545","DOI":"10.1073\/pnas.0506580102","article-title":"Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles","volume":"102","author":"Subramanian","year":"2005","journal-title":"Proc. Natl. Acad. Sci."},{"key":"2023020209483109900_B29","doi-asserted-by":"crossref","first-page":"225","DOI":"10.1186\/1471-2105-6-225","article-title":"Pathway level analysis of gene expression using singular value decomposition","volume":"6","author":"Tomfohr","year":"2005","journal-title":"BMC Bioinformatics"},{"key":"2023020209483109900_B30","article-title":"Resampling-Based Multiple Testing: Examples and Methods for p-Value Adjustment","author":"Westfall","year":"1993"},{"key":"2023020209483109900_B31","doi-asserted-by":"crossref","first-page":"171","DOI":"10.1016\/S0378-3758(99)00041-5","article-title":"Resampling-based false discovery rate controlling multiple test procedures for correlated test statistics","volume":"82","author":"Yekutieli","year":"1999","journal-title":"J. Stat. Plan Inference"},{"key":"2023020209483109900_B32","doi-asserted-by":"crossref","first-page":"133","DOI":"10.1016\/S1535-6108(02)00032-6","article-title":"Classification, subtype discovery, and prediction of outcome in pediatric acute lymphoblastic leukemia by gene expression profiling","volume":"1","author":"Yeoh","year":"2002","journal-title":"Cancer Cell"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/24\/2\/192\/49044451\/bioinformatics_24_2_192.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/24\/2\/192\/49044451\/bioinformatics_24_2_192.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,1,23]],"date-time":"2025-01-23T00:01:41Z","timestamp":1737590501000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/24\/2\/192\/228532"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2007,11,27]]},"references-count":32,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2008,1,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btm583","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2008,1,15]]},"published":{"date-parts":[[2007,11,27]]}}}