{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,20]],"date-time":"2026-01-20T11:45:51Z","timestamp":1768909551088,"version":"3.49.0"},"reference-count":53,"publisher":"Oxford University Press (OUP)","issue":"12","license":[{"start":{"date-parts":[[2020,11,30]],"date-time":"2020-11-30T00:00:00Z","timestamp":1606694400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/100010663","name":"European Research Council","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100010663","id-type":"DOI","asserted-by":"publisher"}]},{"name":"European Union\u2019s Horizon 2020 research and innovation programme","award":["777111"],"award-info":[{"award-number":["777111"]}]},{"name":"German Federal Ministry of Education and Research","award":["01ZX1908A"],"award-info":[{"award-number":["01ZX1908A"]}]},{"name":"German Science Foundation"},{"name":"Collaborative Research Center","award":["SFB1371"],"award-info":[{"award-number":["SFB1371"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,7,19]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Recently, various tools for detecting single nucleotide polymorphisms (SNPs) involved in epistasis have been developed. However, no studies evaluate the employed statistical epistasis models such as the \u03c72-test or quadratic regression independently of the tools that use them. Such an independent evaluation is crucial for developing improved epistasis detection tools, for it allows to decide if a tool\u2019s performance should be attributed to the epistasis model or to the optimization strategy run on top of it.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>We present a protocol for evaluating epistasis models independently of the tools they are used in and generalize existing models designed for dichotomous phenotypes to the categorical and quantitative case. In addition, we propose a new model which scores candidate SNP sets by computing maximum likelihood distributions for the observed phenotypes in the cells of their penetrance tables. Extensive experiments show that the proposed maximum likelihood model outperforms three widely used epistasis models in most cases. The experiments also provide valuable insights into the properties of existing models, for instance, that quadratic regression perform particularly well on instances with quantitative phenotypes.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>The evaluation protocol and all compared models are implemented in C++ and are supported under Linux and macOS. They are available at https:\/\/github.com\/baumbachlab\/genepiseeker\/, along with test datasets and scripts to reproduce the experiments.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btaa990","type":"journal-article","created":{"date-parts":[[2020,11,16]],"date-time":"2020-11-16T12:17:45Z","timestamp":1605529065000},"page":"1708-1716","source":"Crossref","is-referenced-by-count":9,"title":["A framework for modeling epistatic interaction"],"prefix":"10.1093","volume":"37","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-8651-750X","authenticated-orcid":false,"given":"David B","family":"Blumenthal","sequence":"first","affiliation":[{"name":"Chair of Experimental Bioinformatics, TUM School of Life Sciences, Technical University of Munich , 85354 Freising, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jan","family":"Baumbach","sequence":"additional","affiliation":[{"name":"Chair of Experimental Bioinformatics, TUM School of Life Sciences, Technical University of Munich , 85354 Freising, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1920-288X","authenticated-orcid":false,"given":"Markus","family":"Hoffmann","sequence":"additional","affiliation":[{"name":"Chair of Experimental Bioinformatics, TUM School of Life Sciences, Technical University of Munich , 85354 Freising, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5393-2413","authenticated-orcid":false,"given":"Tim","family":"Kacprowski","sequence":"additional","affiliation":[{"name":"Chair of Experimental Bioinformatics, TUM School of Life Sciences, Technical University of Munich , 85354 Freising, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0941-4168","authenticated-orcid":false,"given":"Markus","family":"List","sequence":"additional","affiliation":[{"name":"Chair of Experimental Bioinformatics, TUM School of Life Sciences, Technical University of Munich , 85354 Freising, Germany"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2020,11,30]]},"reference":[{"key":"2023051709461940900_btaa990-B1","doi-asserted-by":"crossref","first-page":"716","DOI":"10.1109\/TAC.1974.1100705","article-title":"A new look at the statistical model identification","volume":"19","author":"Akaike","year":"1974","journal-title":"IEEE Trans. Autom. Control"},{"key":"2023051709461940900_btaa990-B2","doi-asserted-by":"crossref","first-page":"5078","DOI":"10.1093\/bioinformatics\/btz463","article-title":"New algorithms for detecting multi-effect and multi-way epistatic interactions","volume":"35","author":"Ansarifar","year":"2019","journal-title":"Bioinformatics"},{"key":"2023051709461940900_btaa990-B3","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btaa245","article-title":"EpiGEN: an epistasis simulation pipeline","author":"Blumenthal","year":"2020","journal-title":"Bioinformatics"},{"key":"2023051709461940900_btaa990-B4","doi-asserted-by":"crossref","first-page":"e93379","DOI":"10.1371\/journal.pone.0093379","article-title":"Exploiting SNP correlations within random forest for genome-wide association studies","volume":"9","author":"Botta","year":"2014","journal-title":"PLoS One"},{"key":"2023051709461940900_btaa990-B5","doi-asserted-by":"crossref","first-page":"e1002822","DOI":"10.1371\/journal.pcbi.1002822","article-title":"Chapter 11: genome-wide association studies","volume":"8","author":"Bush","year":"2012","journal-title":"PLoS Comput. Biol"},{"key":"2023051709461940900_btaa990-B6","doi-asserted-by":"crossref","first-page":"719","DOI":"10.1002\/humu.23951","article-title":"DualWMDR: detecting epistatic interaction with dual screening and multifactor dimensionality reduction","volume":"41","author":"Cao","year":"2020","journal-title":"Hum. Mutat"},{"key":"2023051709461940900_btaa990-B7","first-page":"243","volume-title":"RECOMB 2020, Volume 12074 of LNCS","author":"Caylak","year":"2020"},{"key":"2023051709461940900_btaa990-B8","doi-asserted-by":"crossref","first-page":"231","DOI":"10.1186\/s12859-018-2229-8","article-title":"Performance of epistasis detection methods in semi-simulated GWAS","volume":"19","author":"Chatelain","year":"2018","journal-title":"BMC Bioinformatics"},{"key":"2023051709461940900_btaa990-B9","doi-asserted-by":"crossref","first-page":"813","DOI":"10.21037\/atm.2019.12.87","article-title":"Gene-gene interaction: the curse of dimensionality","volume":"7","author":"Chattopadhyay","year":"2019","journal-title":"Ann. Transl. Med"},{"key":"2023051709461940900_btaa990-B10","doi-asserted-by":"crossref","first-page":"152","DOI":"10.1002\/gepi.20272","article-title":"A support vector machine approach for detecting gene-gene interaction","volume":"32","author":"Chen","year":"2008","journal-title":"Genet. Epidemiol"},{"key":"2023051709461940900_btaa990-B11","doi-asserted-by":"crossref","first-page":"309","DOI":"10.1007\/BF00994110","article-title":"A Bayesian method for the induction of probabilistic networks from data","volume":"9","author":"Cooper","year":"1992","journal-title":"Mach. Learn"},{"key":"2023051709461940900_btaa990-B12","doi-asserted-by":"crossref","first-page":"2463","DOI":"10.1093\/hmg\/11.20.2463","article-title":"Epistasis: what it means, what it doesn\u2019t mean, and statistical methods to detect it in humans","volume":"11","author":"Cordell","year":"2002","journal-title":"Hum. Mol. Genet"},{"key":"2023051709461940900_btaa990-B13","doi-asserted-by":"crossref","first-page":"921","DOI":"10.1126\/science.8346443","article-title":"Gene dose of apolipoprotein E type 4 allele and the risk of Alzheimer\u2019s disease in late onset families","volume":"261","author":"Corder","year":"1993","journal-title":"Science"},{"key":"2023051709461940900_btaa990-B14","doi-asserted-by":"crossref","first-page":"180","DOI":"10.1038\/ng0694-180","article-title":"Protective effect of apolipoprotein E type 2 allele for late onset Alzheimer disease","volume":"7","author":"Corder","year":"1994","journal-title":"Nat. Genet"},{"key":"2023051709461940900_btaa990-B15","doi-asserted-by":"crossref","first-page":"e131","DOI":"10.1093\/nar\/gkx505","article-title":"Prioritizing tests of epistasis through hierarchical representation of genomic redundancies","volume":"45","author":"Cowman","year":"2017","journal-title":"Nucleic Acids Res"},{"key":"2023051709461940900_btaa990-B16","doi-asserted-by":"crossref","first-page":"817","DOI":"10.1016\/j.ajhg.2016.07.022","article-title":"Are interactions between cis-regulatory variants evidence for biological epistasis or statistical artifacts?","volume":"99","author":"Fish","year":"2016","journal-title":"Am. J. Hum. Genet"},{"key":"2023051709461940900_btaa990-B17","doi-asserted-by":"crossref","first-page":"293","DOI":"10.1093\/bib\/bbv038","article-title":"A roadmap to multifactor dimensionality reduction methods","volume":"17","author":"Gola","year":"2016","journal-title":"Brief. Bioinform"},{"key":"2023051709461940900_btaa990-B18","doi-asserted-by":"crossref","first-page":"114","DOI":"10.3390\/genes10020114","article-title":"Self-adjusting ant colony optimization based on information entropy for detecting epistatic interactions","volume":"10","author":"Guan","year":"2019","journal-title":"Genes"},{"key":"2023051709461940900_btaa990-B19","doi-asserted-by":"crossref","first-page":"354","DOI":"10.1016\/j.compbiolchem.2018.11.001","article-title":"Ant colony optimization with an automatic adjustment mechanism for detecting epistatic interactions","volume":"77","author":"Guan","year":"2018","journal-title":"Comput. Biol. Chem"},{"key":"2023051709461940900_btaa990-B20","doi-asserted-by":"crossref","first-page":"S14","DOI":"10.1186\/1752-0509-6-S3-S14","article-title":"Genetic studies of complex human diseases: characterizing SNP-disease associations using Bayesian networks","volume":"6","author":"Han","year":"2012","journal-title":"BMC Syst. Biol"},{"key":"2023051709461940900_btaa990-B21","doi-asserted-by":"publisher","first-page":"299","DOI":"10.1145\/2001576.2001618","volume-title":"GECCO 2011","author":"Hardison","year":"2011"},{"key":"2023051709461940900_btaa990-B22","doi-asserted-by":"crossref","first-page":"S65","DOI":"10.1186\/1471-2105-10-S1-S65","article-title":"A random forest approach to the detection of epistatic interactions in case-control studies","volume":"10","author":"Jiang","year":"2009","journal-title":"BMC Bioinformatics"},{"key":"2023051709461940900_btaa990-B23","doi-asserted-by":"crossref","first-page":"89","DOI":"10.1186\/1471-2105-12-89","article-title":"Learning genetic epistasis using Bayesian network scoring criteria","volume":"12","author":"Jiang","year":"2011","journal-title":"BMC Bioinformatics"},{"key":"2023051709461940900_btaa990-B24","doi-asserted-by":"crossref","first-page":"634","DOI":"10.1093\/bioinformatics\/btu702","article-title":"MACOED: a multi-objective ant colony optimization algorithm for SNP epistasis detection in genome-wide association studies","volume":"31","author":"Jing","year":"2015","journal-title":"Bioinformatics"},{"key":"2023051709461940900_btaa990-B25","doi-asserted-by":"crossref","first-page":"e28415","DOI":"10.1371\/journal.pone.0028415","article-title":"Comparison of strategies to detect epistasis from eQTL data","volume":"6","author":"Kapur","year":"2011","journal-title":"PLoS One"},{"key":"2023051709461940900_btaa990-B26","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1155\/2013\/432375","article-title":"A review for detecting gene\u2013gene interactions using machine learning methods in genetic epidemiology","volume":"2013","author":"Koo","year":"2013","journal-title":"Biomed. Res. Int"},{"key":"2023051709461940900_btaa990-B27","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1186\/s13040-018-0189-1","article-title":"Unified Cox model based multifactor dimensionality reduction method for gene\u2013gene interaction analysis of the survival phenotype","volume":"11","author":"Lee","year":"2018","journal-title":"BioData Min"},{"key":"2023051709461940900_btaa990-B28","doi-asserted-by":"crossref","first-page":"D896","DOI":"10.1093\/nar\/gkw1133","article-title":"The new NHGRI-EBI catalog of published genome-wide association studies (GWAS Catalog)","volume":"45","author":"MacArthur","year":"2017","journal-title":"Nucleic Acids Res"},{"key":"2023051709461940900_btaa990-B29","doi-asserted-by":"crossref","first-page":"125","DOI":"10.1186\/gm561","article-title":"Why epistasis is important for tackling complex human disease genetics","volume":"6","author":"Mackay","year":"2014","journal-title":"Genome Med"},{"key":"2023051709461940900_btaa990-B30","doi-asserted-by":"crossref","first-page":"747","DOI":"10.1038\/nature08494","article-title":"Finding the missing heritability of complex diseases","volume":"461","author":"Manolio","year":"2009","journal-title":"Nature"},{"key":"2023051709461940900_btaa990-B31","doi-asserted-by":"crossref","first-page":"2478","DOI":"10.1093\/bioinformatics\/btp435","article-title":"An algorithm for learning maximum entropy probability models of disease risk that efficiently searches and sparingly encodes multilocus genomic interactions","volume":"25","author":"Miller","year":"2009","journal-title":"Bioinformatics"},{"key":"2023051709461940900_btaa990-B32","doi-asserted-by":"crossref","first-page":"285","DOI":"10.3389\/fgene.2015.00285","article-title":"A survey about methods dedicated to epistasis detection","volume":"6","author":"Niel","year":"2015","journal-title":"Front. Genet"},{"key":"2023051709461940900_btaa990-B33","doi-asserted-by":"crossref","first-page":"79","DOI":"10.1159\/000085222","article-title":"Application of logistic regression to case\u2013control association studies involving two causative loci","volume":"59","author":"North","year":"2005","journal-title":"Hum. Hered"},{"key":"2023051709461940900_btaa990-B34","doi-asserted-by":"crossref","first-page":"702","DOI":"10.1038\/ng.3285","article-title":"Meta-analysis of the heritability of human traits based on fifty years of twin studies","volume":"47","author":"Polderman","year":"2015","journal-title":"Nat. Genet"},{"key":"2023051709461940900_btaa990-B35","doi-asserted-by":"crossref","first-page":"713","DOI":"10.1016\/j.neuron.2007.05.022","article-title":"GAB2 alleles modify Alzheimer\u2019s risk in APOE epsilon4 carriers","volume":"54","author":"Reiman","year":"2007","journal-title":"Neuron"},{"key":"2023051709461940900_btaa990-B36","doi-asserted-by":"crossref","first-page":"138","DOI":"10.1086\/321276","article-title":"Multifactor-dimensionality reduction reveals high-order interactions among estrogen-metabolism genes in sporadic breast cancer","volume":"69","author":"Ritchie","year":"2001","journal-title":"Am. J. Hum. Genet"},{"key":"2023051709461940900_btaa990-B37","doi-asserted-by":"crossref","first-page":"471","DOI":"10.1016\/j.asoc.2006.01.013","article-title":"Genetic programming neural networks: a powerful bioinformatics tool for human genetics","volume":"7","author":"Ritchie","year":"2007","journal-title":"Appl. Soft Comput"},{"key":"2023051709461940900_btaa990-B38","doi-asserted-by":"crossref","first-page":"124","DOI":"10.1016\/j.tig.2014.02.003","article-title":"Explaining additional genetic variation in complex traits","volume":"30","author":"Robinson","year":"2014","journal-title":"Trends Genet"},{"key":"2023051709461940900_btaa990-B39","doi-asserted-by":"crossref","first-page":"1468","DOI":"10.1093\/bioinformatics\/btq147","article-title":"FastEpistasis: a high performance computing solution for quantitative trait epistasis","volume":"26","author":"Sch\u00fcpbach","year":"2010","journal-title":"Bioinformatics"},{"key":"2023051709461940900_btaa990-B40","doi-asserted-by":"crossref","first-page":"463","DOI":"10.1504\/IJDMB.2012.049300","article-title":"Support vector machines with L1 penalty for detecting gene\u2013gene interactions","volume":"6","author":"Shen","year":"2012","journal-title":"Int. J. Data Min. Bioinform"},{"key":"2023051709461940900_btaa990-B41","doi-asserted-by":"publisher","first-page":"215","DOI":"10.1145\/1830483.1830523","volume-title":"GECCO 2010","author":"Sinnott-Armstrong","year":"2010"},{"key":"2023051709461940900_btaa990-B42","doi-asserted-by":"crossref","first-page":"134","DOI":"10.1016\/j.ijmedinf.2018.09.003","article-title":"A deep hybrid model to detect multi-locus interacting SNPs in the presence of noise","volume":"119","author":"Uppu","year":"2018","journal-title":"Int. J. Med. Inform"},{"key":"2023051709461940900_btaa990-B43","doi-asserted-by":"crossref","first-page":"16","DOI":"10.1186\/1756-0381-5-16","article-title":"GAMETES: a fast, direct algorithm for generating pure, strict, epistatic models with random architectures","volume":"5","author":"Urbanowicz","year":"2012","journal-title":"BioData Min"},{"key":"2023051709461940900_btaa990-B44","doi-asserted-by":"crossref","first-page":"325","DOI":"10.1016\/j.ajhg.2010.07.021","article-title":"BOOST: a fast approach to detecting gene\u2013gene interactions in genome-wide case-control studies","volume":"87","author":"Wan","year":"2010","journal-title":"Am. J. Hum. Genet"},{"key":"2023051709461940900_btaa990-B45","doi-asserted-by":"crossref","first-page":"269","DOI":"10.1159\/000321967","article-title":"The meaning of interaction","volume":"70","author":"Wang","year":"2010","journal-title":"Hum. Hered"},{"key":"2023051709461940900_btaa990-B46","doi-asserted-by":"crossref","first-page":"117","DOI":"10.1186\/1756-0500-3-117","article-title":"AntEpiSeeker: detecting epistatic interactions for case-control studies using a two-stage ant colony optimization algorithm","volume":"3","author":"Wang","year":"2010","journal-title":"BMC Res. Notes"},{"key":"2023051709461940900_btaa990-B47","doi-asserted-by":"crossref","first-page":"256","DOI":"10.1016\/j.tig.2017.01.009","article-title":"Epistasis in neuropsychiatric disorders","volume":"33","author":"Webber","year":"2017","journal-title":"Trends Genet"},{"key":"2023051709461940900_btaa990-B48","doi-asserted-by":"crossref","first-page":"722","DOI":"10.1038\/nrg3747","article-title":"Detecting epistasis in human complex traits","volume":"15","author":"Wei","year":"2014","journal-title":"Nat. Rev. Genet"},{"key":"2023051709461940900_btaa990-B49","doi-asserted-by":"crossref","first-page":"714","DOI":"10.1093\/bioinformatics\/btp041","article-title":"Genome-wide association analysis by lasso penalized logistic regression","volume":"25","author":"Wu","year":"2009","journal-title":"Bioinformatics"},{"key":"2023051709461940900_btaa990-B50","doi-asserted-by":"crossref","first-page":"565","DOI":"10.1038\/ng.608","article-title":"Common SNPs explain a large proportion of the heritability for human height","volume":"42","author":"Yang","year":"2010","journal-title":"Nat. Genet"},{"key":"2023051709461940900_btaa990-B51","doi-asserted-by":"crossref","first-page":"469","DOI":"10.1186\/1471-2105-12-469","article-title":"SNPInterForest: a new method for detecting epistatic interactions","volume":"12","author":"Yoshida","year":"2011","journal-title":"BMC Bioinformatics"},{"key":"2023051709461940900_btaa990-B52","doi-asserted-by":"crossref","first-page":"e1008222","DOI":"10.1371\/journal.pgen.1008222","article-title":"Solving the missing heritability problem","volume":"15","author":"Young","year":"2019","journal-title":"PLoS Genet"},{"key":"2023051709461940900_btaa990-B53","doi-asserted-by":"crossref","first-page":"1167","DOI":"10.1038\/ng2110","article-title":"Bayesian inference of epistatic interactions in case\u2013control studies","volume":"39","author":"Zhang","year":"2007","journal-title":"Nat. Genet"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btaa990\/38614930\/btaa990.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/12\/1708\/50361302\/btaa990.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/12\/1708\/50361302\/btaa990.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,5,17]],"date-time":"2023-05-17T10:34:09Z","timestamp":1684319649000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/37\/12\/1708\/6012351"}},"subtitle":[],"editor":[{"given":"Russell","family":"Schwartz","sequence":"additional","affiliation":[],"role":[{"role":"editor","vocabulary":"crossref"}]}],"short-title":[],"issued":{"date-parts":[[2020,11,30]]},"references-count":53,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2021,7,19]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btaa990","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2021,6,15]]},"published":{"date-parts":[[2020,11,30]]}}}