{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,21]],"date-time":"2025-02-21T22:40:11Z","timestamp":1740177611060,"version":"3.37.3"},"reference-count":43,"publisher":"Oxford University Press (OUP)","issue":"15","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2010,8,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: The multifactor-dimensionality reduction (MDR) method has been widely used in multi-locus interaction analysis. It reduces dimensionality by partitioning the multi-locus genotypes into a high-risk group and a low-risk group according to whether the genotype-specific risk ratio exceeds a fixed threshold or not. Alternatively, one can maximize the \u03c72 value exhaustively over all possible ways of partitioning the multi-locus genotypes into two groups, and we aim to show that this is computationally feasible.<\/jats:p><jats:p>Methods: We advocate finding the optimal MDR (OMDR) that would have resulted from an exhaustive search over all possible ways of partitioning the multi-locus genotypes into two groups. It is shown that this optimal MDR can be obtained efficiently using an ordered combinatorial partitioning (OCP) method, which differs from the existing MDR method in the use of a data-driven rather than fixed threshold. The generalized extreme value distribution (GEVD) theory is applied to find the optimal order of gene combination and assess statistical significance of interactions.<\/jats:p><jats:p>Results: The computational complexity of OCP strategy is linear in the number of multi-locus genotypes in contrast with an exponential order for the naive exhaustive search strategy. Simulation studies show that OMDR can be more powerful than MDR with substantial power gain possible when the partitioning of OMDR is different from that of MDR. The analysis results of a breast cancer dataset show that the use of GEVD accelerates the determination of interaction order and reduces the time cost for P-value calculation by more than 10-fold.<\/jats:p><jats:p>Availability: C++ program is available at http:\/\/home.ustc.edu.cn\/\u223czhanghan\/ocp\/ocp.html<\/jats:p><jats:p>Contact: \u00a0zhanghan@mail.ustc.edu.cn<\/jats:p><jats:p>Supplementary Information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btq290","type":"journal-article","created":{"date-parts":[[2010,6,11]],"date-time":"2010-06-11T00:51:37Z","timestamp":1276217497000},"page":"1871-1878","source":"Crossref","is-referenced-by-count":10,"title":["Testing multiple gene interactions by the ordered combinatorial partitioning method in case\u2013control studies"],"prefix":"10.1093","volume":"26","author":[{"given":"Xing","family":"Hua","sequence":"first","affiliation":[{"name":"1 Department of Statistics and Finance, University of Science and Technology of China, Hefei, Anhui 230026, China, 2Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD 20892, USA and 3Department of Statistics and Applied Probability, National University of Singapore, Singapore 117546"}]},{"given":"Han","family":"Zhang","sequence":"additional","affiliation":[{"name":"1 Department of Statistics and Finance, University of Science and Technology of China, Hefei, Anhui 230026, China, 2Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD 20892, USA and 3Department of Statistics and Applied Probability, National University of Singapore, Singapore 117546"}]},{"given":"Hong","family":"Zhang","sequence":"additional","affiliation":[{"name":"1 Department of Statistics and Finance, University of Science and Technology of China, Hefei, Anhui 230026, China, 2Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD 20892, USA and 3Department of Statistics and Applied Probability, National University of Singapore, Singapore 117546"},{"name":"1 Department of Statistics and Finance, University of Science and Technology of China, Hefei, Anhui 230026, China, 2Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD 20892, USA and 3Department of Statistics and Applied Probability, National University of Singapore, Singapore 117546"}]},{"given":"Yaning","family":"Yang","sequence":"additional","affiliation":[{"name":"1 Department of Statistics and Finance, University of Science and Technology of China, Hefei, Anhui 230026, China, 2Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD 20892, USA and 3Department of Statistics and Applied Probability, National University of Singapore, Singapore 117546"}]},{"given":"Anthony Y.C.","family":"Kuk","sequence":"additional","affiliation":[{"name":"1 Department of Statistics and Finance, University of Science and Technology of China, Hefei, Anhui 230026, China, 2Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD 20892, USA and 3Department of Statistics and Applied Probability, National University of Singapore, Singapore 117546"}]}],"member":"286","published-online":{"date-parts":[[2010,6,10]]},"reference":[{"key":"2023012507595109300_B1","doi-asserted-by":"crossref","first-page":"82","DOI":"10.1159\/000083029","article-title":"MDR and PRP: a comparison of methods for high-order genotype-phenotype associations","volume":"58","author":"Bastone","year":"2004","journal-title":"Hum. Hered."},{"key":"2023012507595109300_B2","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1111\/j.2517-6161.1995.tb02031.x","article-title":"Controlling the false discovery rate: a practical approach and powerful approach for multiple testing","volume":"57","author":"Benjamini","year":"1995","journal-title":"J. R. Stat. Soc. B"},{"volume-title":"Classification and regression trees.","year":"1984","author":"Breiman","key":"2023012507595109300_B3"},{"key":"2023012507595109300_B4","first-page":"44","article-title":"Can neural network constraints in GP provide power to detect genes associated with human disease?","volume":"3449","author":"Bush","year":"2005","journal-title":"Appl. Evol. Comp. Proceed."},{"key":"2023012507595109300_B5","doi-asserted-by":"crossref","first-page":"238","DOI":"10.1186\/1471-2105-9-238","article-title":"Alternative contingency table measures improve the power and detection of multifactor dimensionality reduction","volume":"9","author":"Bush","year":"2008","journal-title":"BMC Bioinformatics"},{"key":"2023012507595109300_B6","doi-asserted-by":"crossref","first-page":"6532","DOI":"10.1002\/sim.3431","article-title":"Improving strategies for detecting genetic patterns of disease susceptibility in association studies","volume":"27","author":"Calle","year":"2008","journal-title":"Stat. Med."},{"key":"2023012507595109300_B7","doi-asserted-by":"crossref","first-page":"1002","DOI":"10.1086\/509704","article-title":"Powerful multilocus tests of genetic association in the presence of gene-gene and gene-environment interactions","volume":"79","author":"Chatterjee","year":"2006","journal-title":"Am. J. Hum. Genet."},{"key":"2023012507595109300_B8","doi-asserted-by":"crossref","first-page":"152","DOI":"10.1002\/gepi.20272","article-title":"A support vector machine approach for detecting gene-gene interaction","volume":"32","author":"Chen","year":"2008","journal-title":"Genet. Epidemiol."},{"key":"2023012507595109300_B9","doi-asserted-by":"crossref","first-page":"549","DOI":"10.1007\/s00125-003-1321-3","article-title":"Multifactor-dimensionality reduction shows a two-locus interaction associated with type 2 diabetes mellitus","volume":"47","author":"Cho","year":"2004","journal-title":"Diabetologia"},{"key":"2023012507595109300_B10","doi-asserted-by":"crossref","first-page":"71","DOI":"10.1093\/bioinformatics\/btl557","article-title":"Odds ratio based multifactor-dimensionality reduction method for detecting gene\u2013gene interactions","volume":"23","author":"Chung","year":"2007","journal-title":"Bioinformatics"},{"key":"2023012507595109300_B11","doi-asserted-by":"crossref","first-page":"392","DOI":"10.1038\/nrg2579","article-title":"Detecting gene-gene interactions that underlie human diseases","volume":"10","author":"Cordell","year":"2009","journal-title":"Nat. Rev. Genet."},{"key":"2023012507595109300_B12","doi-asserted-by":"crossref","first-page":"461","DOI":"10.1086\/338759","article-title":"A perspective on epistasis: limits of models displaying no main effect","volume":"70","author":"Culverhouse","year":"2002","journal-title":"Am. J. Hum. Genet."},{"key":"2023012507595109300_B13","doi-asserted-by":"crossref","first-page":"910","DOI":"10.1002\/gepi.20251","article-title":"Analysis of multiple SNPs in genetic association studies: comparison of three multi-locus methods to prioritize and select SNPs","volume":"31","author":"Heidema","year":"2007","journal-title":"Genet. Epidemiol."},{"key":"2023012507595109300_B14","doi-asserted-by":"crossref","first-page":"701","DOI":"10.1038\/nrg1155","article-title":"Mathematical multi-locus approaches to localizing complex human trait genes","volume":"4","author":"Hoh","year":"2003","journal-title":"Nat. Genet. Rev."},{"key":"2023012507595109300_B15","doi-asserted-by":"crossref","first-page":"251","DOI":"10.1080\/00401706.1985.10488049","article-title":"Estimation of the generalized extreme value distribution by the method of probability-weighted moments","volume":"27","author":"Hosking","year":"1985","journal-title":"Technometrics"},{"key":"2023012507595109300_B16","doi-asserted-by":"crossref","first-page":"10529","DOI":"10.1073\/pnas.0403794101","article-title":"Tree-structured supervised learning and the genetics of hypertension","volume":"101","author":"Huang","year":"2004","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023012507595109300_B17","doi-asserted-by":"crossref","first-page":"158","DOI":"10.1002\/qj.49708134804","article-title":"The frequency distribution of the annual maximum (or minimum) of meteorological elements","volume":"81","author":"Jenkinson","year":"1955","journal-title":"Q. J. R. Meteorol. Soc."},{"key":"2023012507595109300_B18","doi-asserted-by":"crossref","first-page":"R375","DOI":"10.1186\/bcr801","article-title":"The Breast Cancer Family Registry: an infrastructure for cooperative multinational, interdisciplinary and translational studies of the genetic epidemiology of breast cancer","volume":"6","author":"John","year":"2004","journal-title":"Breast Cancer Res."},{"key":"2023012507595109300_B19","doi-asserted-by":"crossref","first-page":"6","DOI":"10.1016\/j.ygeno.2007.03.011","article-title":"Identification of a two-loci epistatic interaction associated with susceptibility to rheumatoid arthritis through reverse engineering and multifactor dimensionality reduction","volume":"90","author":"Julia","year":"2007","journal-title":"Genomics"},{"key":"2023012507595109300_B20","doi-asserted-by":"crossref","first-page":"2589","DOI":"10.1093\/bioinformatics\/btm396","article-title":"Log-linear model based multifactor dimensionality reduction method to detect gene\u2013gene interactions","volume":"23","author":"Lee","year":"2007","journal-title":"Bioinformatics"},{"key":"2023012507595109300_B21","doi-asserted-by":"crossref","first-page":"1125","DOI":"10.1086\/518312","article-title":"A generalized combinatorial approach for detecting gene-by-gene and gene-by-environment interactions with application to nicotine dependence","volume":"80","author":"Lou","year":"2007","journal-title":"Am. J. Hum. Genet."},{"key":"2023012507595109300_B22","doi-asserted-by":"crossref","first-page":"413","DOI":"10.1038\/ng1537","article-title":"Genome-wide strategies for detecting multiple loci that influence complex diseases","volume":"37","author":"Marchini","year":"2005","journal-title":"Nat. Genet."},{"issue":"Suppl. 1","key":"2023012507595109300_B23","doi-asserted-by":"crossref","first-page":"S145","DOI":"10.1186\/1471-2156-6-S1-S145","article-title":"Extension of multifactor dimensionality reduction for identifying multilocus effects in the GAW14 simulated data","volume":"6","author":"Mei","year":"2005","journal-title":"BMC Genet."},{"key":"2023012507595109300_B24","doi-asserted-by":"crossref","first-page":"15","DOI":"10.1086\/498850","article-title":"A testing framework for identifying susceptibility genes in the presence of epistasis","volume":"78","author":"Millstein","year":"2006","journal-title":"Am. J. Hum. Genet."},{"key":"2023012507595109300_B25","doi-asserted-by":"crossref","first-page":"297","DOI":"10.1093\/bib\/bbl028","article-title":"Statistical methods in genetics","volume":"7","author":"Montana","year":"2006","journal-title":"Brief. Bioinform."},{"key":"2023012507595109300_B26","doi-asserted-by":"crossref","first-page":"252","DOI":"10.1016\/j.jtbi.2005.11.036","article-title":"A flexible computational framework for detecting, characterizing, and interpreting statistical patterns of epistasis in genetic studies of human disease susceptibility","volume":"241","author":"Moore","year":"2006","journal-title":"J. Theor. Biol."},{"key":"2023012507595109300_B27","doi-asserted-by":"crossref","first-page":"445","DOI":"10.1093\/bioinformatics\/btp713","article-title":"Bioinformatics challenges for genome-wide association studies","volume":"26","author":"Moore","year":"2010","journal-title":"Bioinformatics"},{"key":"2023012507595109300_B28","doi-asserted-by":"crossref","first-page":"767","DOI":"10.1002\/gepi.20345","article-title":"A comparison of analytical methods for genetic association studies","volume":"32","author":"Motsinger-Reif","year":"2008","journal-title":"Genet. Epidemiol."},{"key":"2023012507595109300_B29","doi-asserted-by":"crossref","first-page":"338","DOI":"10.1093\/bioinformatics\/btn629","article-title":"New evaluation measures for multifactor dimensionality reduction classifiers in gene-gene interaction analysis","volume":"25","author":"Namkung","year":"2009","journal-title":"Bioinformatics"},{"key":"2023012507595109300_B30","doi-asserted-by":"crossref","first-page":"114","DOI":"10.1186\/1471-2407-6-114","article-title":"SNP-SNP interactions in breast cancer susceptibility","volume":"6","author":"Onay","year":"2006","journal-title":"BMC Cancer"},{"key":"2023012507595109300_B31","doi-asserted-by":"crossref","first-page":"30","DOI":"10.1093\/biostatistics\/kxm010","article-title":"Penalized logistic regression for detecting gene interactions","volume":"9","author":"Park","year":"2008","journal-title":"Biostatistics"},{"key":"2023012507595109300_B32","doi-asserted-by":"crossref","first-page":"87","DOI":"10.1002\/gepi.20360","article-title":"A computationally efficient hypothesis testing method for epistasis analysis using multifactor dimensionality reduction","volume":"33","author":"Pattin","year":"2008","journal-title":"Genet. Epidemiol."},{"key":"2023012507595109300_B33","doi-asserted-by":"crossref","first-page":"748","DOI":"10.1002\/gepi.20238","article-title":"Power of genome-wide association studies in the presence of interacting loci","volume":"31","author":"Pickrell","year":"2007","journal-title":"Genet. Epidemiol."},{"key":"2023012507595109300_B34","doi-asserted-by":"crossref","first-page":"138","DOI":"10.1086\/321276","article-title":"Multifactor-dimensionality reduction reveals high-order interactions among estrogen-metabolism genes in sporadic breast cancer","volume":"69","author":"Ritchie","year":"2001","journal-title":"Am. J. Hum. Genet."},{"key":"2023012507595109300_B35","doi-asserted-by":"crossref","first-page":"150","DOI":"10.1002\/gepi.10218","article-title":"Power of multifactor dimensionality reduction for detecting gene-gene interactions in the presence of genotyping error, missing data, phenocopy, and genetic heterogeneity","volume":"24","author":"Ritchie","year":"2003","journal-title":"Genet. Epidemiol."},{"key":"2023012507595109300_B36","doi-asserted-by":"crossref","first-page":"309","DOI":"10.1023\/A:1008920224518","article-title":"Families of splitting criteria for classification trees","volume":"9","author":"Shih","year":"1999","journal-title":"Stat. Comput."},{"key":"2023012507595109300_B37","doi-asserted-by":"crossref","first-page":"341","DOI":"10.1016\/S0167-7152(00)00188-7","article-title":"Selecting the best splits for classification trees with categorical variables","volume":"54","author":"Shih","year":"2001","journal-title":"Stat. Probab. Lett."},{"key":"2023012507595109300_B38","doi-asserted-by":"crossref","first-page":"67","DOI":"10.1093\/biomet\/72.1.67","article-title":"Maximum likelihood estimation in a class of nonregular cases","volume":"72","author":"Smith","year":"1985","journal-title":"Biometrika"},{"key":"2023012507595109300_B39","doi-asserted-by":"crossref","first-page":"421","DOI":"10.1093\/jnci\/djh094","article-title":"Betting odds and genetic associations","volume":"96","author":"Thomas","year":"2004","journal-title":"J. Natl Cancer Inst."},{"key":"2023012507595109300_B40","doi-asserted-by":"crossref","first-page":"172","DOI":"10.1016\/j.atherosclerosis.2006.09.014","article-title":"Renin-angiotensin system gene polymorphisms and coronary artery disease in a large angiographic cohort: detection of high order gene-gene interaction","volume":"195","author":"Tsai","year":"2007","journal-title":"Atherosclerosis"},{"key":"2023012507595109300_B41","doi-asserted-by":"crossref","first-page":"306","DOI":"10.1002\/gepi.20211","article-title":"A balanced accuracy function for epistasis modeling in imbalanced datasets using multifactor dimensionality reduction","volume":"31","author":"Velez","year":"2007","journal-title":"Genet. Epidemiol."},{"key":"2023012507595109300_B42","doi-asserted-by":"crossref","first-page":"434","DOI":"10.1093\/jnci\/djh075","article-title":"Assessing the probability that a positive report is false: an approach for molecular epidemiology studies","volume":"96","author":"Wacholder","year":"2004","journal-title":"J. Natl Cancer Inst."},{"key":"2023012507595109300_B43","article-title":"Epistasis as a genetic constraint within populations and an accelerant of adaptive divergence among them","volume-title":"Epistasis and Evolutionary Process.","author":"Wade","year":"2000"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/15\/1871\/48855038\/bioinformatics_26_15_1871.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/26\/15\/1871\/48855038\/bioinformatics_26_15_1871.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,2,21]],"date-time":"2025-02-21T22:19:57Z","timestamp":1740176397000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/26\/15\/1871\/188582"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,6,10]]},"references-count":43,"journal-issue":{"issue":"15","published-print":{"date-parts":[[2010,8,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btq290","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"type":"electronic","value":"1367-4811"},{"type":"print","value":"1367-4803"}],"subject":[],"published-other":{"date-parts":[[2010,8,1]]},"published":{"date-parts":[[2010,6,10]]}}}