{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,25]],"date-time":"2026-03-25T19:25:53Z","timestamp":1774466753493,"version":"3.50.1"},"reference-count":31,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2012,10,1]],"date-time":"2012-10-01T00:00:00Z","timestamp":1349049600000},"content-version":"unspecified","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BioData Mining"],"published-print":{"date-parts":[[2012,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>Geneticists who look beyond single locus disease associations require additional strategies for the detection of complex multi-locus effects. Epistasis, a multi-locus masking effect, presents a particular challenge, and has been the target of bioinformatic development. Thorough evaluation of new algorithms calls for simulation studies in which known disease models are sought. To date, the best methods for generating simulated multi-locus epistatic models rely on genetic algorithms. However, such methods are computationally expensive, difficult to adapt to multiple objectives, and unlikely to yield models with a precise form of epistasis which we refer to as pure and strict. Purely and strictly epistatic models constitute the worst-case in terms of detecting disease associations, since such associations may only be observed if all <jats:italic>n<\/jats:italic>-loci are included in the disease model. This makes them an attractive gold standard for simulation studies considering complex multi-locus effects.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>We introduce GAMETES, a user-friendly software package and algorithm which generates complex biallelic single nucleotide polymorphism (SNP) disease models for simulation studies. GAMETES rapidly and precisely generates random, pure, strict <jats:italic>n<\/jats:italic>-locus models with specified genetic constraints. These constraints include heritability, minor allele frequencies of the SNPs, and population prevalence. GAMETES also includes a simple dataset simulation strategy which may be utilized to rapidly generate an archive of simulated datasets for given genetic models. We highlight the utility and limitations of GAMETES with an example simulation study using MDR, an algorithm designed to detect epistasis.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusions<\/jats:title>\n            <jats:p>GAMETES is a fast, flexible, and precise tool for generating complex <jats:italic>n<\/jats:italic>-locus models with random architectures. While GAMETES has a limited ability to generate models with higher heritabilities, it is proficient at generating the lower heritability models typically used in simulation studies evaluating new algorithms. In addition, the GAMETES modeling strategy may be flexibly combined with any dataset simulation strategy. Beyond dataset simulation, GAMETES could be employed to pursue theoretical characterization of genetic models and epistasis.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1756-0381-5-16","type":"journal-article","created":{"date-parts":[[2012,10,1]],"date-time":"2012-10-01T18:01:16Z","timestamp":1349114476000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":188,"title":["GAMETES: a fast, direct algorithm for generating pure, strict, epistatic models with random architectures"],"prefix":"10.1186","volume":"5","author":[{"given":"Ryan J","family":"Urbanowicz","sequence":"first","affiliation":[]},{"given":"Jeff","family":"Kiralis","sequence":"additional","affiliation":[]},{"given":"Nicholas A","family":"Sinnott-Armstrong","sequence":"additional","affiliation":[]},{"given":"Tamra","family":"Heberling","sequence":"additional","affiliation":[]},{"given":"Jonathan M","family":"Fisher","sequence":"additional","affiliation":[]},{"given":"Jason H","family":"Moore","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2012,10,1]]},"reference":[{"issue":"5833","key":"80_CR1","doi-asserted-by":"publisher","first-page":"1840c","DOI":"10.1126\/science.316.5833.1840c","volume":"316","author":"D Shriner","year":"2007","unstructured":"Shriner D, Vaughan L, Padilla M: Problems with genome-wide association studies. Science. 2007, 316 (5833): 1840c-10.1126\/science.316.5833.1840c.","journal-title":"Science"},{"issue":"6","key":"80_CR2","doi-asserted-by":"publisher","first-page":"446","DOI":"10.1038\/nrg2809","volume":"11","author":"E Eichler","year":"2010","unstructured":"Eichler E, Flint J, Gibson G, Kong A, Leal S, Moore J, Nadeau J: Missing heritability and strategies for finding the underlying causes of complex disease. Nat Rev Genet. 2010, 11 (6): 446-450. 10.1038\/nrg2809.","journal-title":"Nat Rev Genet"},{"issue":"12","key":"80_CR3","doi-asserted-by":"publisher","first-page":"640","DOI":"10.1016\/j.tig.2004.09.007","volume":"20","author":"T Thornton-Wells","year":"2004","unstructured":"Thornton-Wells T, Moore J, Haines J: Genetics, statistics and human disease: analytical retooling for complexity. TRENDS in Genetics. 2004, 20 (12): 640-647. 10.1016\/j.tig.2004.09.007.","journal-title":"TRENDS in Genetics"},{"key":"80_CR4","doi-asserted-by":"publisher","DOI":"10.5962\/bhl.title.44575","volume-title":"Mendel\u2019s Principles of Heredity","author":"W Bateson","year":"1909","unstructured":"Bateson W: Mendel\u2019s Principles of Heredity. 1909, Cambridge University Press"},{"key":"80_CR5","doi-asserted-by":"publisher","first-page":"399","DOI":"10.1017\/S0080456800012163","volume":"52","author":"R Fisher","year":"1918","unstructured":"Fisher R: The Correlation Between Relatives on the Supposition of Mendelian Inheritance. Trans R Soc of Edinburgh. 1918, 52: 399-433.","journal-title":"Trans R Soc of Edinburgh"},{"issue":"20","key":"80_CR6","doi-asserted-by":"publisher","first-page":"2463","DOI":"10.1093\/hmg\/11.20.2463","volume":"11","author":"H Cordell","year":"2002","unstructured":"Cordell H: Epistasis: what it means, what it doesn\u2019t mean, and statistical methods to detect it in humans. Human Mol Genet. 2002, 11 (20): 2463-10.1093\/hmg\/11.20.2463.","journal-title":"Human Mol Genet"},{"issue":"2","key":"80_CR7","doi-asserted-by":"publisher","first-page":"77","DOI":"10.2165\/00822942-200605020-00002","volume":"5","author":"B McKinney","year":"2006","unstructured":"McKinney B, Reif D, Ritchie M, Moore J: Machine learning for detecting gene-gene interactions: a review. Appl Bioinf. 2006, 5 (2): 77-88. 10.2165\/00822942-200605020-00002.","journal-title":"Appl Bioinf"},{"issue":"6","key":"80_CR8","doi-asserted-by":"publisher","first-page":"392","DOI":"10.1038\/nrg2579","volume":"10","author":"H Cordell","year":"2009","unstructured":"Cordell H: Detecting gene\u2013gene interactions that underlie human diseases. Nat Rev Genet. 2009, 10 (6): 392-404. 10.1038\/nrg2579.","journal-title":"Nat Rev Genet"},{"issue":"4","key":"80_CR9","doi-asserted-by":"publisher","first-page":"445","DOI":"10.1093\/bioinformatics\/btp713","volume":"26","author":"J Moore","year":"2010","unstructured":"Moore J, Asselbergs F, Williams S: Bioinformatics challenges for genome-wide association studies. Bioinformatics. 2010, 26 (4): 445-10.1093\/bioinformatics\/btp713.","journal-title":"Bioinformatics"},{"issue":"4","key":"80_CR10","first-page":"2000","volume":"155","author":"O Carlborg","year":"2003","unstructured":"Carlborg O, Andersson L, Kinghorn B: The use of a genetic algorithm for simultaneous mapping of multiple interacting quantitative trait loci. Genetics. 2003, 155 (4): 2000-","journal-title":"Genetics"},{"issue":"4","key":"80_CR11","first-page":"543","volume":"44","author":"L Ploughman","year":"1989","unstructured":"Ploughman L, Boehnke M: Estimating the power of a proposed linkage study for a complex genetic trait. Am J Human Genet. 1989, 44 (4): 543-","journal-title":"Am J Human Genet"},{"issue":"3","key":"80_CR12","first-page":"A204","volume":"47","author":"D Weeks","year":"1990","unstructured":"Weeks D, Ott J, Lathrop G: SLINK: a general simulation program for linkage analysis. Am J Hum Genet. 1990, 47 (3): A204-","journal-title":"Am J Hum Genet"},{"key":"80_CR13","first-page":"93","volume-title":"Pacific Symposium on Biocomputing Hawaii","author":"M Bass","year":"2004","unstructured":"Bass M, Martin E, Mauser E: Pedigree generation for analysis of genetic linkage and association. Pacific Symposium on Biocomputing Hawaii. 2004, World Scientific Pub Co Inc, USA, 93-93."},{"key":"80_CR14","first-page":"1133","volume":"4","author":"M Schmidt","year":"2005","unstructured":"Schmidt M, Hauser E, Martin E, Schmidt S: Extension of the SIMLA package for generating pedigrees with complex inheritance patterns: environmental covariates, gene-gene and gene-environment interaction. Stat App Genet and Mol Biol. 2005, 4: 1133-","journal-title":"Stat App Genet and Mol Biol"},{"key":"80_CR15","doi-asserted-by":"publisher","first-page":"40","DOI":"10.1186\/1471-2156-7-40","volume":"7","author":"M Lemire","year":"2006","unstructured":"Lemire M: SUP: an extension to SLINK to allow a larger number of marker loci to be simulated in pedigrees conditional on trait values. BMC Genet. 2006, 7: 40-","journal-title":"BMC Genet"},{"issue":"suppl 4","key":"80_CR16","first-page":"A2363","volume":"71","author":"M Nothnagel","year":"2002","unstructured":"Nothnagel M: Simulation of LD block-structured SNP haplotype data and its use for the analysis of case-control data by supervised learning methods. Am J Hum Genet. 2002, 71 (suppl 4): A2363-","journal-title":"Am J Hum Genet"},{"key":"80_CR17","first-page":"1155","volume-title":"Proceedings of the Genetic and Evolutionary Computation Conference New York","author":"J Moore","year":"2002","unstructured":"Moore J, Hahn L, Ritchie M, Thornton T, White B: Application Of Genetic Algorithms To The Discovery Of Complex Models For Simulation Studies In Human Genetics. Proceedings of the Genetic and Evolutionary Computation Conference New York. 2002, Morgan Kaufmann Publishers Inc, USA, 1155-1155."},{"key":"80_CR18","doi-asserted-by":"publisher","first-page":"79","DOI":"10.1016\/j.asoc.2003.08.003","volume":"4","author":"J Moore","year":"2004","unstructured":"Moore J, Hahn L, Ritchie M, Thornton T, White B: Routine discovery of complex genetic models using genetic algorithms. Appl Soft Comput. 2004, 4: 79-86. 10.1016\/j.asoc.2003.08.003.","journal-title":"Appl Soft Comput"},{"key":"80_CR19","doi-asserted-by":"publisher","first-page":"252","DOI":"10.1186\/1471-2105-6-252","volume":"6","author":"T Mailund","year":"2005","unstructured":"Mailund T, Schierup M, Pedersen C, Mechlenborg P, Madsen J, Schauser L: CoaSim: a flexible environment for simulating genetic data under coalescent models. BMC Bioinf. 2005, 6: 252-10.1186\/1471-2105-6-252.","journal-title":"BMC Bioinf"},{"key":"80_CR20","first-page":"499","volume-title":"Pacific Symposium on Biocomputing","author":"S Dudek","year":"2006","unstructured":"Dudek S, Motsinger A, Velez D, Williams S, Ritchie M: Data simulation software for whole-genome association and other studies in human genetics. Pacific Symposium on Biocomputing: Hawaii, USA. 2006, 11:, 499-510."},{"key":"80_CR21","doi-asserted-by":"publisher","first-page":"140","DOI":"10.1093\/bioinformatics\/btm549","volume":"24","author":"C Li","year":"2008","unstructured":"Li C, Li M: GWAsimulator: a rapid whole-genome simulation program. Bioinformatics. 2008, 24: 140-10.1093\/bioinformatics\/btm549.","journal-title":"Bioinformatics"},{"key":"80_CR22","doi-asserted-by":"publisher","first-page":"44","DOI":"10.1186\/1471-2105-9-44","volume":"9","author":"J Li","year":"2008","unstructured":"Li J, Chen Y: Generating samples for association studies based on HapMap data. BMC Bioinf. 2008, 9: 44-10.1186\/1471-2105-9-44.","journal-title":"BMC Bioinf"},{"key":"80_CR23","doi-asserted-by":"publisher","first-page":"74","DOI":"10.1007\/978-3-642-12211-8_7","volume":"6023","author":"C Greene","year":"2010","unstructured":"Greene C, Himmelstein D, Moore J: A Model Free Method to Generate Human Genetics Datasets with Complex Gene-Disease Relationships. Evol Comput, Machine Learning and Data Mining in Bioinformatics. 2010, 6023: 74-85. 10.1007\/978-3-642-12211-8_7.","journal-title":"Evol Comput, Machine Learning and Data Mining in Bioinformatics"},{"issue":"6","key":"80_CR24","doi-asserted-by":"publisher","first-page":"334","DOI":"10.1159\/000022939","volume":"50","author":"W Li","year":"2000","unstructured":"Li W, Reich J: A Complete Enumeration and Classification of Two-Locus Disease Models. Human Heredity. 2000, 50 (6): 334-349. 10.1159\/000022939.","journal-title":"Human Heredity"},{"key":"80_CR25","doi-asserted-by":"publisher","first-page":"17","DOI":"10.1186\/1471-2156-9-17","volume":"9","author":"I Hallgr\u00edmsd\u00f3ttir","year":"2008","unstructured":"Hallgr\u00edmsd\u00f3ttir I, Yuster D: A complete classification of epistatic two-locus models. BMC Genet. 2008, 9: 17-","journal-title":"BMC Genet"},{"issue":"2","key":"80_CR26","doi-asserted-by":"publisher","first-page":"461","DOI":"10.1086\/338759","volume":"70","author":"R Culverhouse","year":"2002","unstructured":"Culverhouse R, Suarez B, Lin J, Reich T: A perspective on epistasis: limits of models displaying no main effect. Am J Human Genet. 2002, 70 (2): 461-471. 10.1086\/338759.","journal-title":"Am J Human Genet"},{"key":"80_CR27","first-page":"51","volume":"2","author":"T Motzkin","year":"1953","unstructured":"Motzkin T, Ralffa H, Thompson G, Thrall R: The Double Description Method. In: Kuhn, HW, Tucker AW (eds) Contributions to theory of games. 1953, 2: 51-73.","journal-title":"In: Kuhn, HW, Tucker AW (eds) Contributions to theory of games"},{"key":"80_CR28","doi-asserted-by":"publisher","first-page":"138","DOI":"10.1086\/321276","volume":"69","author":"M Ritchie","year":"2001","unstructured":"Ritchie M, Hahn L, Roodi N, Bailey L, Dupont W, Parl F, Moore J: Multifactor-dimensionality reduction reveals high-order interactions among estrogen-metabolism genes in sporadic breast cancer. Am J Human Genet. 2001, 69: 138-147. 10.1086\/321276.","journal-title":"Am J Human Genet"},{"key":"80_CR29","volume-title":"Principles of population genetics","author":"D Hartl","year":"1997","unstructured":"Hartl D, Clark A, Clark A: Principles of population genetics. 1997, Sinauer associates Sunderland, MA"},{"key":"80_CR30","volume-title":"Why evolutionary genetics does not always add up. Epistasis and the evolutionary process","author":"E Brodie III","year":"2000,","unstructured":"Brodie III E: Why evolutionary genetics does not always add up. Epistasis and the evolutionary process. 2000,, pp. 3\u201319"},{"key":"80_CR31","volume-title":"The Art of Computer Programming 1: Fundamental Algorithms 2: Seminumerical Algorithms 3: Sorting and Searching","author":"D Knuth","year":"1968","unstructured":"Knuth D: The Art of Computer Programming 1: Fundamental Algorithms 2: Seminumerical Algorithms 3: Sorting and Searching. 1968, Addison-Wesley, MA"}],"container-title":["BioData Mining"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/1756-0381-5-16.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1186\/1756-0381-5-16\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1186\/1756-0381-5-16","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1756-0381-5-16.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T22:46:58Z","timestamp":1630536418000},"score":1,"resource":{"primary":{"URL":"https:\/\/biodatamining.biomedcentral.com\/articles\/10.1186\/1756-0381-5-16"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,10,1]]},"references-count":31,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2012,12]]}},"alternative-id":["80"],"URL":"https:\/\/doi.org\/10.1186\/1756-0381-5-16","relation":{},"ISSN":["1756-0381"],"issn-type":[{"value":"1756-0381","type":"electronic"}],"subject":[],"published":{"date-parts":[[2012,10,1]]},"assertion":[{"value":"24 April 2012","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"14 September 2012","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"1 October 2012","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"16"}}