{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,19]],"date-time":"2025-10-19T21:16:10Z","timestamp":1760908570012},"reference-count":17,"publisher":"Oxford University Press (OUP)","issue":"4","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2011,2,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: It is well known that patterns of differential gene expression across biological conditions are often shared by many genes, particularly those within functional groups. Taking advantage of these patterns can lead to increased statistical power and biological clarity when testing for differential expression in a microarray experiment. The optimal discovery procedure (ODP), which maximizes the expected number of true positives for each fixed number of expected false positives, is a framework aimed at this goal. Storey et al. introduced an estimator of the ODP for identifying differentially expressed genes. However, their ODP estimator grows quadratically in computational time with respect to the number of genes. Reducing this computational burden is a key step in making the ODP practical for usage in a variety of high-throughput problems.<\/jats:p>\n               <jats:p>Results: Here, we propose a new estimate of the ODP called the modular ODP (mODP). The existing \u2018full ODP\u2019 requires that the likelihood function for each gene be evaluated according to the parameter estimates for all genes. The mODP assigns genes to modules according to a Kullback\u2013Leibler distance, and then evaluates the statistic only at the module-averaged parameter estimates. We show that the mODP is relatively insensitive to the choice of the number of modules, but dramatically reduces the computational complexity from quadratic to linear in the number of genes. We compare the full ODP algorithm and mODP on simulated data and gene expression data from a recent study of Morrocan Amazighs. The mODP and full ODP algorithm perform very similarly across a range of comparisons.<\/jats:p>\n               <jats:p>Availability: The mODP methodology has been implemented into EDGE, a comprehensive gene expression analysis software package in R, available at http:\/\/genomine.org\/edge\/.<\/jats:p>\n               <jats:p>Contact: \u00a0jstorey@princeton.edu<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btq701","type":"journal-article","created":{"date-parts":[[2010,12,25]],"date-time":"2010-12-25T02:05:53Z","timestamp":1293242753000},"page":"509-515","source":"Crossref","is-referenced-by-count":12,"title":["A computationally efficient modular optimal discovery procedure"],"prefix":"10.1093","volume":"27","author":[{"given":"Sangsoon","family":"Woo","sequence":"first","affiliation":[{"name":"1 Department of Biostatistics, University of Washington, Seattle, WA 98195, 2Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD 21205 and 3Lewis-Sigler Institute for Integrative Genomics and Department of Molecular Biology, Princeton University, Princeton, NJ 08544, USA"}]},{"given":"Jeffrey T.","family":"Leek","sequence":"additional","affiliation":[{"name":"1 Department of Biostatistics, University of Washington, Seattle, WA 98195, 2Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD 21205 and 3Lewis-Sigler Institute for Integrative Genomics and Department of Molecular Biology, Princeton University, Princeton, NJ 08544, USA"}]},{"given":"John D.","family":"Storey","sequence":"additional","affiliation":[{"name":"1 Department of Biostatistics, University of Washington, Seattle, WA 98195, 2Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD 21205 and 3Lewis-Sigler Institute for Integrative Genomics and Department of Molecular Biology, Princeton University, Princeton, NJ 08544, USA"}]}],"member":"286","published-online":{"date-parts":[[2010,12,24]]},"reference":[{"key":"2023012511573880800_B1","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1093\/biostatistics\/kxh018","article-title":"Improved statistical tests for differential gene expression by shrinking variance components estimates","volume":"6","author":"Cui","year":"2005","journal-title":"Biostatistics"},{"key":"2023012511573880800_B2","doi-asserted-by":"crossref","first-page":"1151","DOI":"10.1198\/016214501753382129","article-title":"Empirical bayes analysis of a microarray experiment","volume":"96","author":"Efron","year":"2001","journal-title":"J. Am. Stat. Assoc."},{"key":"2023012511573880800_B3","doi-asserted-by":"crossref","first-page":"905","DOI":"10.1111\/j.1467-9868.2009.00714.x","article-title":"A bayesian discovery procedure","volume":"71","author":"Guindani","year":"2009","journal-title":"J. Roy. Stat. Soc. Ser. B"},{"key":"2023012511573880800_B4","doi-asserted-by":"crossref","first-page":"e1000052","DOI":"10.1371\/journal.pgen.1000052","article-title":"A gemone-wide gene expression signature of environmental geography in leukocytes of moroccan amazighs","volume":"4","author":"Idaghdour","year":"2008","journal-title":"PLoS Genet."},{"key":"2023012511573880800_B5","doi-asserted-by":"crossref","first-page":"79","DOI":"10.1214\/aoms\/1177729694","article-title":"On information and sufficiency","volume":"22","author":"Kullback","year":"1961","journal-title":"Ann. Math. Stat."},{"key":"2023012511573880800_B6","doi-asserted-by":"crossref","first-page":"507","DOI":"10.1093\/bioinformatics\/btk005","article-title":"EDGE: extraction and analysis of differential gene expression","volume":"22","author":"Leek","year":"2006","journal-title":"Bioinformatics"},{"key":"2023012511573880800_B7","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4757-1923-9","volume-title":"Testing Statistical Hypotheses","author":"Lehmann","year":"1986","edition":"2"},{"key":"2023012511573880800_B8","first-page":"31","article-title":"Replicated microarray data","volume":"12","author":"Lonnstedt","year":"2002","journal-title":"Stat. Sin."},{"key":"2023012511573880800_B9","doi-asserted-by":"crossref","first-page":"155","DOI":"10.1093\/biostatistics\/5.2.155","article-title":"Detecting differential gene expression with a semiparametric hierarchical mixture method","volume":"5","author":"Newton","year":"2004","journal-title":"Biostatistics"},{"key":"2023012511573880800_B10","doi-asserted-by":"crossref","first-page":"164","DOI":"10.1007\/978-3-642-00826-9_7","article-title":"Clustering multivariate normal distributions","volume-title":"Emerging Trends in Visual Computing","author":"Nielsen","year":"2009"},{"key":"2023012511573880800_B11","doi-asserted-by":"crossref","DOI":"10.2202\/1544-6115.1027","article-title":"Linear models and empirical bayes methods for assessing differential expression in microarray experiments","volume":"3","author":"Smyth","year":"2004","journal-title":"Stat. Appl. Genet. Mol. Biol."},{"key":"2023012511573880800_B12","doi-asserted-by":"crossref","first-page":"9440","DOI":"10.1073\/pnas.1530509100","article-title":"Statistical significance for genome-wide studies","volume":"100","author":"Storey","year":"2003","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012511573880800_B13","doi-asserted-by":"crossref","first-page":"12837","DOI":"10.1073\/pnas.0504609102","article-title":"Significance analysis of time course microarray experiments","volume":"102","author":"Storey","year":"2005","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012511573880800_B14","doi-asserted-by":"crossref","first-page":"414","DOI":"10.1093\/biostatistics\/kxl019","article-title":"The optimal discovery procedure for large significance testing, with applications to comparative microarray experiments","volume":"8","author":"Storey","year":"2007","journal-title":"Biostatistics"},{"key":"2023012511573880800_B15","doi-asserted-by":"crossref","first-page":"347","DOI":"10.1111\/j.1467-9868.2007.005592.x","article-title":"The optimal discovery procedure: A new approach to simultaneous significance testing","volume":"69","author":"Storey","year":"2007","journal-title":"J. Roy. Stat. Soc. Ser. B"},{"key":"2023012511573880800_B16","doi-asserted-by":"crossref","first-page":"5116","DOI":"10.1073\/pnas.091062498","article-title":"Significance analysis of microarrays applied to the ionizing radiation response","volume":"98","author":"Tusher","year":"2001","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012511573880800_B17","doi-asserted-by":"crossref","DOI":"10.2202\/1544-6115.1128","article-title":"A general framework for weighted gene co-expression network analysis","volume":"4","author":"Zhang","year":"2005","journal-title":"Stat. Appl. Genet. Mol. Biol."}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/27\/4\/509\/48866530\/bioinformatics_27_4_509.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/27\/4\/509\/48866530\/bioinformatics_27_4_509.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,25]],"date-time":"2023-01-25T12:46:26Z","timestamp":1674650786000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/27\/4\/509\/198547"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,12,24]]},"references-count":17,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2011,2,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btq701","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2011,2,15]]},"published":{"date-parts":[[2010,12,24]]}}}