{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,7]],"date-time":"2026-01-07T23:24:07Z","timestamp":1767828247077,"version":"3.49.0"},"reference-count":28,"publisher":"Oxford University Press (OUP)","issue":"2","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2009,1,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: The elucidation of biological pathways enriched with differentially expressed genes has become an integral part of the analysis and interpretation of microarray data. Several statistical methods are commonly used in this context, but the question of the optimal approach has still not been resolved.<\/jats:p>\n               <jats:p>Results: We present a logistic regression-based method (LRpath) for identifying predefined sets of biologically related genes enriched with (or depleted of) differentially expressed transcripts in microarray experiments. We functionally relate the odds of gene set membership with the significance of differential expression, and calculate adjusted P-values as a measure of statistical significance. The new approach is compared with Fisher's exact test and other relevant methods in a simulation study and in the analysis of two breast cancer datasets. Overall results were concordant between the simulation study and the experimental data analysis, and provide useful information to investigators seeking to choose the appropriate method. LRpath displayed robust behavior and improved statistical power compared with tested alternatives. It is applicable in experiments involving two or more sample types, and accepts significance statistics of the investigator's choice as input.<\/jats:p>\n               <jats:p>Availability: An R function implementing LRpath can be downloaded from http:\/\/eh3.uc.edu\/lrpath.<\/jats:p>\n               <jats:p>Contact: \u00a0mario.medvedovic@uc.edu<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online and at http:\/\/eh3.uc.edu\/lrpath.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btn592","type":"journal-article","created":{"date-parts":[[2008,11,28]],"date-time":"2008-11-28T01:24:25Z","timestamp":1227835465000},"page":"211-217","source":"Crossref","is-referenced-by-count":153,"title":["LRpath: a logistic regression approach for identifying enriched biological groups in gene expression data"],"prefix":"10.1093","volume":"25","author":[{"given":"Maureen A.","family":"Sartor","sequence":"first","affiliation":[{"name":"1 Center for Computational Medicine and Biology, University of Michigan, Ann Arbor, MI, 2Department Environmental and Occupational Health, University of Pittsburgh, Pittsburgh, PA, 3Department of Environmental Health and 4Center for Environmental Genetics, University of Cincinnati, Cincinnati, OH, USA"}]},{"given":"George D.","family":"Leikauf","sequence":"additional","affiliation":[{"name":"1 Center for Computational Medicine and Biology, University of Michigan, Ann Arbor, MI, 2Department Environmental and Occupational Health, University of Pittsburgh, Pittsburgh, PA, 3Department of Environmental Health and 4Center for Environmental Genetics, University of Cincinnati, Cincinnati, OH, USA"}]},{"given":"Mario","family":"Medvedovic","sequence":"additional","affiliation":[{"name":"1 Center for Computational Medicine and Biology, University of Michigan, Ann Arbor, MI, 2Department Environmental and Occupational Health, University of Pittsburgh, Pittsburgh, PA, 3Department of Environmental Health and 4Center for Environmental Genetics, University of Cincinnati, Cincinnati, OH, USA"},{"name":"1 Center for Computational Medicine and Biology, University of Michigan, Ann Arbor, MI, 2Department Environmental and Occupational Health, University of Pittsburgh, Pittsburgh, PA, 3Department of Environmental Health and 4Center for Environmental Genetics, University of Cincinnati, Cincinnati, OH, USA"}]}],"member":"286","published-online":{"date-parts":[[2008,11,27]]},"reference":[{"key":"2023013109595718400_B1","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1038\/75556","article-title":"Gene ontology: tool for the unification of biology. The Gene Ontology Consortium.","volume":"25","author":"Ashburner","year":"2000","journal-title":"Nat. Genet"},{"key":"2023013109595718400_B2","first-page":"289","article-title":"Controlling the false discovery rate: a practical and powerful approach to multiple testing.","volume":"57","author":"Benjamini","year":"1995","journal-title":"J. R. Stat. Soc. B"},{"key":"2023013109595718400_B3","doi-asserted-by":"crossref","first-page":"2502","DOI":"10.1093\/bioinformatics\/btg363","article-title":"Characterizing gene sets with FuncAssociate","volume":"19","author":"Berriz","year":"2003","journal-title":"Bioinformatics"},{"key":"2023013109595718400_B4","doi-asserted-by":"crossref","first-page":"429","DOI":"10.1016\/j.tibtech.2005.05.011","article-title":"Pathways to the analysis of microarray data","volume":"23","author":"Curtis","year":"2005","journal-title":"Trends Biotechnol"},{"issue":"3","key":"2023013109595718400_B5","article-title":"DAVID: Database for Annotation, Visualization, and Integrated Discovery.","volume":"4","author":"Dennis","year":"2003","journal-title":"Genome Biol"},{"key":"2023013109595718400_B6","first-page":"98","article-title":"Global functional profiling of gene expression.","volume":"81","author":"Draghici","year":"2003","journal-title":"Genomics"},{"key":"2023013109595718400_B7","unstructured":"Gentleman\n              RC\n            \n          \n          Bioconductor package, GOstats vignette.\n          2005\n          Available at http:\/\/www.bioconductor.org\/repository\/devel\/vignette\/GOstats.pdf."},{"key":"2023013109595718400_B8","doi-asserted-by":"crossref","first-page":"D258","DOI":"10.1093\/nar\/gkh036","article-title":"The Gene Ontology (GO) database and informatics resource","volume":"32","author":"Harris","year":"2004","journal-title":"Nucleic Acids Res"},{"key":"2023013109595718400_B9","doi-asserted-by":"crossref","first-page":"R70","DOI":"10.1186\/gb-2003-4-10-r70","article-title":"Identifying biological themes within lists of genes with EASE.","volume":"4","author":"Hosack","year":"2003","journal-title":"Genome Biol"},{"key":"2023013109595718400_B10","first-page":"299","article-title":"R: a language for data analysis and graphics.","volume":"5","author":"Ihaka","year":"1996","journal-title":"J. Comput. Graph. Stat"},{"key":"2023013109595718400_B11","doi-asserted-by":"crossref","first-page":"D354","DOI":"10.1093\/nar\/gkj102","article-title":"From genomics to chemical genomics: new developments in KEGG.","volume":"34","author":"Kanehisa","year":"2006","journal-title":"Nucleic Acids Res"},{"key":"2023013109595718400_B12","doi-asserted-by":"crossref","first-page":"3587","DOI":"10.1093\/bioinformatics\/bti565","article-title":"Ontological analysis of gene expression data: current tools, limitations and open problems","volume":"21","author":"Khatri","year":"2005","journal-title":"Bioinformatics"},{"key":"2023013109595718400_B13","doi-asserted-by":"crossref","first-page":"W762","DOI":"10.1093\/nar\/gki472","article-title":"Recent additions and improvements to the onto-tools","volume":"33","author":"Khatri","year":"2005","journal-title":"Nucleic Acids Res"},{"key":"2023013109595718400_B14","doi-asserted-by":"crossref","first-page":"13550","DOI":"10.1073\/pnas.0506230102","article-title":"An expression signature for p53 status in human breast cancer predicts mutation status, transcriptional effects and patient survival.","volume":"102","author":"Miller","year":"2005","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023013109595718400_B15","doi-asserted-by":"crossref","first-page":"85","DOI":"10.1214\/07-AOAS104","article-title":"Random-set methods identify distinct aspects of the enrichment signal in gene-set analysis","volume":"1","author":"Newton","year":"2007","journal-title":"Ann. of Applied Stat"},{"key":"2023013109595718400_B16","doi-asserted-by":"crossref","first-page":"8961","DOI":"10.1073\/pnas.0502674102","article-title":"Effects of threshold choice on biological conclusions reached during analysis of gene expression by DNA microarrays.","volume":"102","author":"Pan","year":"2005","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023013109595718400_B17","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pmed.0020251","article-title":"Up-regulation and profibrotic role of osteopontin in human idiopathic pulmonary fibrosis.","volume":"2","author":"Pardo","year":"2005","journal-title":"PLoS Med"},{"key":"2023013109595718400_B18","doi-asserted-by":"crossref","first-page":"401","DOI":"10.1093\/bioinformatics\/btl633","article-title":"Enrichment or depletion of a GO category within a class of genes: which test?","volume":"23","author":"Rivals","year":"2007","journal-title":"Bioinformatics"},{"key":"2023013109595718400_B19","doi-asserted-by":"crossref","first-page":"538","DOI":"10.1186\/1471-2105-7-538","article-title":"Intensity-based hierarchical Bayes method improves testing for differentially expressed genes in microarray experiments.","volume":"7","author":"Sartor","year":"2006","journal-title":"BMC. Bioinformatics"},{"key":"2023013109595718400_B20","doi-asserted-by":"crossref","first-page":"262","DOI":"10.1093\/jnci\/djj052","article-title":"Gene expression profiling in breast cancer: understanding the molecular basis of histologic grade to improve prognosis","volume":"98","author":"Sotiriou","year":"2006","journal-title":"J. Natl Cancer Inst"},{"key":"2023013109595718400_B21","doi-asserted-by":"crossref","first-page":"9440","DOI":"10.1073\/pnas.1530509100","article-title":"Statistical significance for genomewide studies.","volume":"100","author":"Storey","year":"2003","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023013109595718400_B22","doi-asserted-by":"crossref","first-page":"15545","DOI":"10.1073\/pnas.0506580102","article-title":"Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles.","volume":"102","author":"Subramanian","year":"2005","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023013109595718400_B23","doi-asserted-by":"crossref","first-page":"281","DOI":"10.1038\/10343","article-title":"Systematic determination of genetic network architecture","volume":"22","author":"Tavazoie","year":"1999","journal-title":"Nat. Genet"},{"key":"2023013109595718400_B24","doi-asserted-by":"crossref","first-page":"13544","DOI":"10.1073\/pnas.0506577102","article-title":"Discovering statistically significant pathways in expression profiling studies.","volume":"102","author":"Tian","year":"2005","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023013109595718400_B25","doi-asserted-by":"crossref","first-page":"86","DOI":"10.1186\/1471-2105-7-86","article-title":"BayGO: Bayesian analysis of ontology term enrichment in microarray data.","volume":"7","author":"Vencio","year":"2006","journal-title":"BMC. Bioinformatics"},{"key":"2023013109595718400_B26","doi-asserted-by":"crossref","first-page":"383","DOI":"10.1186\/1471-2105-8-383","article-title":"ProbCD: enrichment analysis accounting for categorization uncertainty","volume":"8","author":"Vencio","year":"2007","journal-title":"BMC. Bioinformatics"},{"key":"2023013109595718400_B27","doi-asserted-by":"crossref","first-page":"R28","DOI":"10.1186\/gb-2003-4-4-r28","article-title":"GoMiner: a resource for biological interpretation of genomic and proteomic data.","volume":"4","author":"Zeeberg","year":"2003","journal-title":"Genome Biol"},{"key":"2023013109595718400_B28","doi-asserted-by":"crossref","first-page":"168","DOI":"10.1186\/1471-2105-6-168","article-title":"High-throughput GoMiner, an \u2018industrial-strength\u2019 integrative gene ontology tool for interpretation of multiple-microarray experiments, with application to studies of Common Variable Immune Deficiency (CVID)","volume":"6","author":"Zeeberg","year":"2005","journal-title":"BMC. Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/25\/2\/211\/48982441\/bioinformatics_25_2_211.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/25\/2\/211\/48982441\/bioinformatics_25_2_211.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,31]],"date-time":"2023-01-31T18:38:34Z","timestamp":1675190314000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/25\/2\/211\/218259"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2008,11,27]]},"references-count":28,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2009,1,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btn592","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2009,1,15]]},"published":{"date-parts":[[2008,11,27]]}}}