{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,26]],"date-time":"2025-10-26T22:50:46Z","timestamp":1761519046982},"reference-count":23,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2012,9,26]],"date-time":"2012-09-26T00:00:00Z","timestamp":1348617600000},"content-version":"unspecified","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BioData Mining"],"published-print":{"date-parts":[[2012,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>Algorithms designed to detect complex genetic disease associations are initially evaluated using simulated datasets. Typical evaluations vary constraints that influence the correct detection of underlying models (i.e. number of loci, heritability, and minor allele frequency). Such studies neglect to account for model architecture (i.e. the unique specification and arrangement of penetrance values comprising the genetic model), which alone can influence the detectability of a model. In order to design a simulation study which efficiently takes architecture into account, a reliable metric is needed for model selection.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>We evaluate three metrics as predictors of relative model detection difficulty derived from previous works: (1) Penetrance table variance (PTV), (2) customized odds ratio (COR), and (3) our own Ease of Detection Measure (EDM), calculated from the penetrance values and respective genotype frequencies of each simulated genetic model. We evaluate the reliability of these metrics across three very different data search algorithms, each with the capacity to detect epistatic interactions. We find that a model\u2019s EDM and COR are each stronger predictors of model detection success than heritability.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusions<\/jats:title>\n            <jats:p>This study formally identifies and evaluates metrics which quantify model detection difficulty. We utilize these metrics to intelligently select models from a population of potential architectures. This allows for an improved simulation study design which accounts for differences in detection difficulty attributed to model architecture. We implement the calculation and utilization of EDM and COR into GAMETES, an algorithm which rapidly and precisely generates pure, strict, <jats:italic>n<\/jats:italic>-locus epistatic models.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1756-0381-5-15","type":"journal-article","created":{"date-parts":[[2012,9,27]],"date-time":"2012-09-27T06:49:10Z","timestamp":1348728550000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":25,"title":["Predicting the difficulty of pure, strict, epistatic models: metrics for simulated model selection"],"prefix":"10.1186","volume":"5","author":[{"given":"Ryan J","family":"Urbanowicz","sequence":"first","affiliation":[]},{"given":"Jeff","family":"Kiralis","sequence":"additional","affiliation":[]},{"given":"Jonathan M","family":"Fisher","sequence":"additional","affiliation":[]},{"given":"Jason H","family":"Moore","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2012,9,26]]},"reference":[{"issue":"20","key":"73_CR1","doi-asserted-by":"publisher","first-page":"2463","DOI":"10.1093\/hmg\/11.20.2463","volume":"11","author":"H Cordell","year":"2002","unstructured":"Cordell H: Epistasis: what it means, what it doesn\u2019t mean, and statistical methods to detect it in humans. Human Mol Genet. 2002, 11 (20): 2463-10.1093\/hmg\/11.20.2463.","journal-title":"Human Mol Genet"},{"issue":"2","key":"73_CR2","doi-asserted-by":"publisher","first-page":"77","DOI":"10.2165\/00822942-200605020-00002","volume":"5","author":"B McKinney","year":"2006","unstructured":"McKinney B, Reif D, Ritchie M, Moore J: Machine learning for detecting gene-gene interactions: a review. Appl Bioinformatics. 2006, 5 (2): 77-88. 10.2165\/00822942-200605020-00002.","journal-title":"Appl Bioinformatics"},{"issue":"6","key":"73_CR3","doi-asserted-by":"publisher","first-page":"392","DOI":"10.1038\/nrg2579","volume":"10","author":"H Cordell","year":"2009","unstructured":"Cordell H: Detecting gene\u2013gene interactions that underlie human diseases. Nat Rev Genet. 2009, 10 (6): 392-404.","journal-title":"Nat Rev Genet"},{"issue":"4","key":"73_CR4","doi-asserted-by":"publisher","first-page":"445","DOI":"10.1093\/bioinformatics\/btp713","volume":"26","author":"J Moore","year":"2010","unstructured":"Moore J, Asselbergs F, Williams S: Bioinformatics challenges for genome-wide association studies. Bioinformatics. 2010, 26 (4): 445-10.1093\/bioinformatics\/btp713.","journal-title":"Bioinformatics"},{"key":"73_CR5","doi-asserted-by":"publisher","DOI":"10.5962\/bhl.title.44575","volume-title":"Mendel\u2019s Principles of Heredity","author":"W Bateson","year":"1909","unstructured":"Bateson W: Mendel\u2019s Principles of Heredity. 1909, Cambridge University Press"},{"key":"73_CR6","doi-asserted-by":"publisher","first-page":"399","DOI":"10.1017\/S0080456800012163","volume":"52","author":"R Fisher","year":"1918","unstructured":"Fisher R: The Correlation Between Relatives on the Supposition of Mendelian Inheritance. Trans R Soc Edinburgh. 1918, 52: 399-433.","journal-title":"Trans R Soc Edinburgh"},{"issue":"6","key":"73_CR7","doi-asserted-by":"publisher","first-page":"334","DOI":"10.1159\/000022939","volume":"50","author":"W Li","year":"2000","unstructured":"Li W, Reich J: A Complete Enumeration and Classification of Two-Locus Disease Models. Hum Hered. 2000, 50 (6): 334-349. 10.1159\/000022939.","journal-title":"Hum Hered"},{"key":"73_CR8","doi-asserted-by":"publisher","first-page":"17","DOI":"10.1186\/1471-2156-9-17","volume":"9","author":"I Hallgr\u00edmsd\u00f3ttir","year":"2008","unstructured":"Hallgr\u00edmsd\u00f3ttir I, Yuster D: A complete classification of epistatic two-locus models. BMC Genet. 2008, 9: 17-","journal-title":"BMC Genet"},{"key":"73_CR9","first-page":"1155","volume-title":"Proceedings of the Genetic and Evolutionary Computation Conference. New York, USA","author":"J Moore","year":"2002","unstructured":"Moore J, Hahn L, Ritchie M, Thornton T, White B: Application Of Genetic Algorithms To The Discovery Of Complex Models For Simulation Studies In Human Genetics. Proceedings of the Genetic and Evolutionary Computation Conference. New York, USA. 2002, 1155-1155."},{"key":"73_CR10","doi-asserted-by":"publisher","first-page":"79","DOI":"10.1016\/j.asoc.2003.08.003","volume":"4","author":"J Moore","year":"2004","unstructured":"Moore J, Hahn L, Ritchie M, Thornton T, White B: Routine discovery of complex genetic models using genetic algorithms. Appl Soft Comput. 2004, 4: 79-86. 10.1016\/j.asoc.2003.08.003.","journal-title":"Appl Soft Comput"},{"issue":"8","key":"73_CR11","doi-asserted-by":"publisher","first-page":"767","DOI":"10.1002\/gepi.20345","volume":"32","author":"A Motsinger-Reif","year":"2008","unstructured":"Motsinger-Reif A, Reif D, Fanelli T, Ritchie M: A comparison of analytical methods for genetic association studies. Genet Epidemiol. 2008, 32 (8): 767-778. 10.1002\/gepi.20345.","journal-title":"Genet Epidemiol"},{"key":"73_CR12","doi-asserted-by":"publisher","first-page":"183","DOI":"10.1159\/000181157","volume":"67","author":"T Edwards","year":"2009","unstructured":"Edwards T, Lewis K, Digna T, Dudek R, Ritchie M: Exploring the performance of multifactor dimensionality reduction in large scale SNP studies and in the presence of genetic heterogeneity among epistatic disease models. Hum Hered. 2009, 67: 183-192. 10.1159\/000181157.","journal-title":"Hum Hered"},{"key":"73_CR13","volume-title":"BioData Mining","author":"R Urbanowicz","year":"2012,","unstructured":"Urbanowicz R, Kiralis J, Sinnott-Armstrong N, Heberling T, Fisher J, Moore J: GAMETES: A Fast, Direct Algorithm for Generating Pure, Strict, Epistatic Models with Random Architectures. BioData Mining. 2012,, In Press"},{"key":"73_CR14","first-page":"3","volume-title":"Epistasis and the Evolutionary Process","author":"E Brodie III","year":"2000","unstructured":"Brodie III E: Why evolutionary genetics does not always add up. Epistasis and the Evolutionary Process. 2000, 3-19."},{"issue":"2","key":"73_CR15","doi-asserted-by":"publisher","first-page":"461","DOI":"10.1086\/338759","volume":"70","author":"R Culverhouse","year":"2002","unstructured":"Culverhouse R, Suarez B, Lin J, Reich T: A perspective on epistasis: limits of models displaying no main effect. Am J Human Genet. 2002, 70 (2): 461-471. 10.1086\/338759.","journal-title":"Am J Human Genet"},{"key":"73_CR16","doi-asserted-by":"publisher","first-page":"5","DOI":"10.1186\/1756-0381-2-5","volume":"2","author":"C Greene","year":"2009","unstructured":"Greene C, Penrod N, Kiralis J, Moore J: Spatially Uniform ReliefF (SURF) for computationally-efficient filtering of gene-gene interactions. BioData Mining. 2009, 2: 5-10.1186\/1756-0381-2-5.","journal-title":"BioData Mining"},{"key":"73_CR17","volume-title":"Principles of Population Genetics","author":"D Hartl","year":"1997","unstructured":"Hartl D, Clark A, Clark A: Principles of Population Genetics. 1997, Sunderland: Sinauer Associates"},{"key":"73_CR18","doi-asserted-by":"publisher","first-page":"138","DOI":"10.1086\/321276","volume":"69","author":"M Ritchie","year":"2001","unstructured":"Ritchie M, Hahn L, Roodi N, Bailey L, Dupont W, Parl F, Moore J: Multifactor-dimensionality reduction reveals high-order interactions among estrogen-metabolism genes in sporadic breast cancer. Am J Human Genet. 2001, 69: 138-147. 10.1086\/321276.","journal-title":"Am J Human Genet"},{"issue":"3","key":"73_CR19","doi-asserted-by":"publisher","first-page":"209","DOI":"10.1162\/106365603322365289","volume":"11","author":"E Bernad\u00f3-Mansilla","year":"2003","unstructured":"Bernad\u00f3-Mansilla E, Garrell-Guiu J: Accuracy-based learning classifier systems: models, analysis and applications to classification tasks. Evolutionary Comput. 2003, 11 (3): 209-238. 10.1162\/106365603322365289.","journal-title":"Evolutionary Comput"},{"key":"73_CR20","volume-title":"J Artif Evol Appl","author":"R Urbanowicz","year":"2009","unstructured":"Urbanowicz R, Moore J: Learning Classifier Systems: A Complete Introduction, Review, and Roadmap. J Artif Evol Appl. 2009, 2009:"},{"key":"73_CR21","doi-asserted-by":"publisher","first-page":"195","DOI":"10.1145\/1830483.1830518","volume-title":"Proceedings of the 12th Annual Conference on Genetic and Evolutionary Computation. Portland, OR","author":"R Urbanowicz","year":"2010","unstructured":"Urbanowicz R, Moore J: The application of michigan-style learning classifier systems to address genetic heterogeneity and epistasis in association studies. Proceedings of the 12th Annual Conference on Genetic and Evolutionary Computation. Portland, OR. 2010, ACM, 195-202."},{"key":"73_CR22","volume-title":"Computational Intelligence: Special Issue Bioinformatics","author":"R Urbanowicz","year":"2012,","unstructured":"Urbanowicz R, Granizo-Mackenzie A, Moore J: An Analysis Pipeline with Visualization-Guided Knowledge Discovery for Michigan-Style Learning Classifier Systems. Computational Intelligence: Special Issue Bioinformatics. 2012,, In Press"},{"key":"73_CR23","volume-title":"R: A Language and Environment for Statistical Computing","author":"R Development Core Team","year":"2010","unstructured":"R Development Core Team: R: A Language and Environment for Statistical Computing. 2010, Vienna: R Foundation for Statistical Computing, [http:\/\/www.R-project.org]. [ISBN 3-900051-07-0],"}],"container-title":["BioData Mining"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/1756-0381-5-15.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1186\/1756-0381-5-15\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/1756-0381-5-15","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1756-0381-5-15.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T21:28:40Z","timestamp":1630531720000},"score":1,"resource":{"primary":{"URL":"https:\/\/biodatamining.biomedcentral.com\/articles\/10.1186\/1756-0381-5-15"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,9,26]]},"references-count":23,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2012,12]]}},"alternative-id":["73"],"URL":"https:\/\/doi.org\/10.1186\/1756-0381-5-15","relation":{},"ISSN":["1756-0381"],"issn-type":[{"value":"1756-0381","type":"electronic"}],"subject":[],"published":{"date-parts":[[2012,9,26]]},"assertion":[{"value":"24 April 2012","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"14 September 2012","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"26 September 2012","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"15"}}