{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,11]],"date-time":"2025-12-11T02:55:07Z","timestamp":1765421707560},"reference-count":55,"publisher":"Springer Science and Business Media LLC","issue":"1","content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2006,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:sec>\n            <jats:title>Background<\/jats:title>\n            <jats:p>Trait heterogeneity, which exists when a trait has been defined with insufficient specificity such that it is actually two or more distinct traits, has been implicated as a confounding factor in traditional statistical genetics of complex human disease. In the absence of detailed phenotypic data collected consistently in combination with genetic data, unsupervised computational methodologies offer the potential for discovering underlying trait heterogeneity. The performance of three such methods \u2013 Bayesian Classification, Hypergraph-Based Clustering, and Fuzzy <jats:italic>k<\/jats:italic>-Modes Clustering \u2013 appropriate for categorical data were compared. Also tested was the ability of these methods to detect trait heterogeneity in the presence of locus heterogeneity and\/or gene-gene interaction, which are two other complicating factors in discovering genetic models of complex human disease. To determine the efficacy of applying the Bayesian Classification method to real data, the reliability of its internal clustering metrics at finding good clusterings was evaluated using permutation testing.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Results<\/jats:title>\n            <jats:p>Bayesian Classification outperformed the other two methods, with the exception that the Fuzzy <jats:italic>k<\/jats:italic>-Modes Clustering performed best on the most complex genetic model. Bayesian Classification achieved excellent recovery for 75% of the datasets simulated under the simplest genetic model, while it achieved moderate recovery for 56% of datasets with a sample size of 500 or more (across all simulated models) and for 86% of datasets with 10 or fewer nonfunctional loci (across all simulated models). Neither Hypergraph Clustering nor Fuzzy <jats:italic>k<\/jats:italic>-Modes Clustering achieved good or excellent cluster recovery for a majority of datasets even under a restricted set of conditions. When using the average log of class strength as the internal clustering metric, the false positive rate was controlled very well, at three percent or less for all three significance levels (0.01, 0.05, 0.10), and the false negative rate was acceptably low (18 percent) for the least stringent significance level of 0.10.<\/jats:p>\n          <\/jats:sec>\n          <jats:sec>\n            <jats:title>Conclusion<\/jats:title>\n            <jats:p>Bayesian Classification shows promise as an unsupervised computational method for dissecting trait heterogeneity in genotypic data. Its control of false positive and false negative rates lends confidence to the validity of its results. Further investigation of how different parameter settings may improve the performance of Bayesian Classification, especially under more complex genetic models, is ongoing.<\/jats:p>\n          <\/jats:sec>","DOI":"10.1186\/1471-2105-7-204","type":"journal-article","created":{"date-parts":[[2006,4,20]],"date-time":"2006-04-20T14:39:23Z","timestamp":1145543963000},"update-policy":"http:\/\/dx.doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":25,"title":["Dissecting trait heterogeneity: a comparison of three clustering methods applied to genotypic data"],"prefix":"10.1186","volume":"7","author":[{"given":"Tricia A","family":"Thornton-Wells","sequence":"first","affiliation":[]},{"given":"Jason H","family":"Moore","sequence":"additional","affiliation":[]},{"given":"Jonathan L","family":"Haines","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2006,4,12]]},"reference":[{"key":"943_CR1","doi-asserted-by":"publisher","first-page":"640","DOI":"10.1016\/j.tig.2004.09.007","volume":"20","author":"TA Thornton-Wells","year":"2004","unstructured":"Thornton-Wells TA, Moore JH, Haines JL: Genetics, statistics and human disease: analytical retooling for complexity. Trends Genet 2004, 20: 640\u2013647.","journal-title":"Trends Genet"},{"key":"943_CR2","doi-asserted-by":"publisher","first-page":"1219","DOI":"10.1093\/hmg\/11.10.1219","volume":"11","author":"C Rivolta","year":"2002","unstructured":"Rivolta C, Sharon D, DeAngelis MM, Dryja TP: Retinitis pigmentosa and allied diseases: numerous diseases, genes, and inheritance patterns. Hum Mol Genet 2002, 11: 1219\u20131227.","journal-title":"Hum Mol Genet"},{"key":"943_CR3","doi-asserted-by":"publisher","first-page":"262","DOI":"10.1002\/ajmg.a.10886","volume":"116A","author":"LL Kulczycki","year":"2003","unstructured":"Kulczycki LL, Kostuch M, Bellanti JA: A clinical perspective of cystic fibrosis and new genetic findings: relationship of CFTR mutations to genotype-phenotype manifestations. Am J Hum Genet 2003, 116A: 262\u2013267.","journal-title":"Am J Hum Genet"},{"key":"943_CR4","doi-asserted-by":"publisher","first-page":"107","DOI":"10.1111\/j.1469-1809.1994.tb01881.x","volume":"58","author":"S Povey","year":"1994","unstructured":"Povey S, Burley MW, Attwood J, Benham F, Hunt D, Jeremiah SJ, Franklin D, Gillett G, Malas S, Robson EB, Tippett P, Edwards JH, Kwiatkowski DJ, Super M, Mueller R, Fryer A, Clarke A, Webb D, Osborne J: Two loci for tuberous sclerosis: one on 9q34 and one on 16p13. Ann Hum Genet 1994, 58: 107\u2013127.","journal-title":"Ann Hum Genet"},{"key":"943_CR5","doi-asserted-by":"publisher","first-page":"313","DOI":"10.1016\/S1357-4310(98)01245-3","volume":"4","author":"J Young","year":"1998","unstructured":"Young J, Povey S: The genetic basis of tuberous sclerosis. Mol Med Today 1998, 4: 313\u2013319.","journal-title":"Mol Med Today"},{"key":"943_CR6","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1093\/brain\/105.1.1","volume":"105","author":"AE Harding","year":"1982","unstructured":"Harding AE: The clinical features and classification of the late onset autosomal dominant cerebellar ataxias: a study of 11 families, including descendants of 'the Drew family of Walworth.'. Brain 1982, 105: 1\u201328.","journal-title":"Brain"},{"key":"943_CR7","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1212\/WNL.45.1.1","volume":"45","author":"RN Rosenberg","year":"1995","unstructured":"Rosenberg RN: Autosomal dominant cerebellar phenotypes: the genotype has settled the issue. Neurology 1995, 45: 1\u20135.","journal-title":"Neurology"},{"key":"943_CR8","doi-asserted-by":"publisher","first-page":"234","DOI":"10.1212\/WNL.56.2.234","volume":"56","author":"D Devos","year":"2001","unstructured":"Devos D, Schraen-Maschke S, Vuillaume I, Dujardin K, Naze P, Willoteaux C, Destee A, Sablonniere B: Clinical features and genetic analysis of a new form of spinocerebellar ataxia. Neurology 2001, 56: 234\u2013238.","journal-title":"Neurology"},{"key":"943_CR9","doi-asserted-by":"publisher","first-page":"303","DOI":"10.1098\/rstb.2002.1198","volume":"358","author":"H Tager-Flusberg","year":"2003","unstructured":"Tager-Flusberg H, Joseph RM: Identifying neurocognitive phenotypes in autism. Philos Trans R Soc Lond B Biol Sci 2003, 358: 303\u2013314.","journal-title":"Philos Trans R Soc Lond B Biol Sci"},{"key":"943_CR10","doi-asserted-by":"publisher","first-page":"539","DOI":"10.1002\/ajmg.1497","volume":"105","author":"Y Bradford","year":"2001","unstructured":"Bradford Y, Haines JL, Hutcheson H, Gardiner M, Braun T, Sheffield V, Cassavant T, Huang W, Wang K, Vieland V, Folstein S, Santangelo S, Piven J: Incorporating language phenotypes strengthens evidence of linkage to autism. Am J Med Genet 2001, 105: 539\u2013547.","journal-title":"Am J Med Genet"},{"key":"943_CR11","doi-asserted-by":"publisher","first-page":"1058","DOI":"10.1086\/339765","volume":"70","author":"Y Shao","year":"2002","unstructured":"Shao Y, Raiford KL, Wolpert CM, Cope HA, Ravan SA, Ashley-Koch AA, Abramson RK, Wright HH, DeLong RG, Gilbert JR, Cuccaro ML, Pericak-Vance MA: Phenotypic homogeneity provides increased support for linkage on chromosome 2 in autistic disorder. Am J Hum Genet 2002, 70: 1058\u20131061.","journal-title":"Am J Hum Genet"},{"key":"943_CR12","doi-asserted-by":"publisher","first-page":"237","DOI":"10.1038\/ng998","volume":"32","author":"MM Carrasquillo","year":"2002","unstructured":"Carrasquillo MM, McCallion AS, Puffenberger EG, Kashuk CS, Nouri N, Chakravarti A: Genome-wide association study and mouse model identify interaction between RET and EDNRB pathways in Hirschsprung disease. Nat Genet 2002, 32: 237\u2013244.","journal-title":"Nat Genet"},{"key":"943_CR13","doi-asserted-by":"publisher","first-page":"974","DOI":"10.1016\/0006-291X(89)92317-6","volume":"163","author":"K Doh-ura","year":"1989","unstructured":"Doh-ura K, Tateishi J, Sasaki H, Kitamoto T, Sakaki Y: Pro-to-leu change at position 102 of prion protein is the most common but not the sole mutation related to Gerstmann-Straussler syndrome. Biochem Biophys Res Comm 1989, 163: 974\u2013979.","journal-title":"Biochem Biophys Res Comm"},{"key":"943_CR14","doi-asserted-by":"publisher","first-page":"3103","DOI":"10.1093\/nar\/18.10.3103","volume":"18","author":"F Owen","year":"1990","unstructured":"Owen F, Poulter M, Collinge J, Crow TJ: A codon 129 polymorphism in the PRIP gene. Nucleic Acids Res 1990, 18: 3103.","journal-title":"Nucleic Acids Res"},{"key":"943_CR15","doi-asserted-by":"publisher","first-page":"1441","DOI":"10.1016\/0140-6736(91)93128-V","volume":"337","author":"J Collinge","year":"1991","unstructured":"Collinge J, Palmer MS, Dryden AJ: Genetic predisposition to iatrogenic Creutzfeldt-Jakob disease. Lancet 1991, 337: 1441\u20131442.","journal-title":"Lancet"},{"key":"943_CR16","doi-asserted-by":"publisher","first-page":"340","DOI":"10.1038\/352340a0","volume":"352","author":"MS Palmer","year":"1991","unstructured":"Palmer MS, Dryden AJ, Hughes JT, Collinge J: Homozygous prion protein genotype predisposes to sporadic Creutzfeldt-Jakob disease. Nature 1991, 352: 340\u2013342.","journal-title":"Nature"},{"key":"943_CR17","doi-asserted-by":"publisher","first-page":"50","DOI":"10.1016\/0304-3940(94)90932-6","volume":"179","author":"R De Silva","year":"1994","unstructured":"De Silva R, Ironside JW, McCardle L, Esmonde T, Bell J, Will R, Windl O, Dempster M, Estibeiro P, Lathe R: Neuropathological phenotype and 'prion protein' genotype correlation in sporadic Creutzfeldt-Jakob disease. Neurosci Lett 1994, 179: 50\u201352.","journal-title":"Neurosci Lett"},{"key":"943_CR18","doi-asserted-by":"publisher","first-page":"801","DOI":"10.1038\/353801b0","volume":"353","author":"K Doh-ura","year":"1991","unstructured":"Doh-ura K, Kitamoto T, Sakaki Y, Tateishi J: CJD discrepancy. Nature 1991, 353: 801\u2013802.","journal-title":"Nature"},{"key":"943_CR19","doi-asserted-by":"publisher","first-page":"274","DOI":"10.1002\/ana.410310308","volume":"31","author":"LG Goldfarb","year":"1992","unstructured":"Goldfarb LG, Brown P, Haltia M, Cathala F, McCombie WR, Kovanen J, Cervenakova L, Goldin L, Nieto A, Godec MS, Asher DM, Gajdusek DC: Creutzfeldt-Jakob disease cosegregates with the codon 178Asn PRNP mutation in families of European origin. Ann Neurol 1992, 31: 274\u2013281.","journal-title":"Ann Neurol"},{"key":"943_CR20","doi-asserted-by":"publisher","first-page":"45","DOI":"10.1097\/00125817-200203000-00002","volume":"4","author":"JN Hirschhorn","year":"2002","unstructured":"Hirschhorn JN, Lohmueller K, Byrne E, Hirschhorn K: A comprehensive review of genetic association studies. Genet Med 2002, 4: 45\u201361.","journal-title":"Genet Med"},{"key":"943_CR21","first-page":"283","volume":"51","author":"J Ott","year":"1992","unstructured":"Ott J: Strategies for characterizing highly polymorphic markers in human gene mapping. Am J Hum Genet 1992, 51: 283\u2013290.","journal-title":"Am J Hum Genet"},{"key":"943_CR22","doi-asserted-by":"publisher","first-page":"175","DOI":"10.1111\/j.1469-1809.1963.tb00210.x","volume":"27","author":"CAB Smith","year":"1963","unstructured":"Smith CAB: Testing for heterogeneity of recombination fraction values in human genetics. Annals of Human Genetics 1963, 27: 175\u2013182.","journal-title":"Annals of Human Genetics"},{"key":"943_CR23","doi-asserted-by":"publisher","first-page":"138","DOI":"10.1086\/321276","volume":"69","author":"MD Ritchie","year":"2001","unstructured":"Ritchie MD, Hahn LW, Roodi N, Bailey LR, Dupont WD, Parl FF, Moore JH: Multifactor-dimensionality reduction reveals high-order interactions among estrogen-metabolism genes in sporadic breast cancer. Am J Hum Genet 2001, 69: 138\u2013147.","journal-title":"Am J Hum Genet"},{"key":"943_CR24","doi-asserted-by":"publisher","first-page":"150","DOI":"10.1002\/gepi.10218","volume":"24","author":"MD Ritchie","year":"2003","unstructured":"Ritchie MD, Hahn LW, Moore JH: Power of multifactor dimensionality reduction for detecting gene-gene interactions in the presence of genotyping error, phenocopy and genetic heterogeneity. Genet Epidemiol 2003, 24: 150\u2013157.","journal-title":"Genet Epidemiol"},{"key":"943_CR25","doi-asserted-by":"publisher","first-page":"73","DOI":"10.1159\/000073735","volume":"56","author":"JH Moore","year":"2003","unstructured":"Moore JH: The ubiquitous nature of epistasis in determining susceptibility to common human diseases. Hum Hered 2003, 56: 73\u201382.","journal-title":"Hum Hered"},{"key":"943_CR26","doi-asserted-by":"publisher","first-page":"808","DOI":"10.1126\/science.1091317","volume":"303","author":"AH Tong","year":"2004","unstructured":"Tong AH, Lesage G, Bader GD, Ding H, Xu H, Xin X, Young J, Berriz GF, Brost RL, Change M, Chen Y, Cheng X, Chua G, Friesen H, Goldberg DS, Haynes J, Humphries C, He G, Hussein S, Ke L, Krogan N, Li Z, Levinson JN, Lu H, Menard P, Munyana C, Parsons AB, Ryan O, Tonikan R, Roberts T, Sdicu A, Shapiro J, Sheikh B, Suter B, Wong SL, Zhang LV, Zhu H, Gurd CG, Numro S, Sander C, Rine J, Greenblatt J, Peter M, Bretscher A, Bell G, Roth FP, Brown GW, Andrews B, Bussey H, Boone C: Global mapping of the yeast genetic interaction network. Science 2004, 303: 808\u2013813.","journal-title":"Science"},{"key":"943_CR27","doi-asserted-by":"publisher","first-page":"13","DOI":"10.1038\/ng0105-13","volume":"37","author":"JH Moore","year":"2005","unstructured":"Moore JH: A global view of epistasis. Nat Genet 2005, 37: 13\u201314.","journal-title":"Nat Genet"},{"key":"943_CR28","doi-asserted-by":"publisher","first-page":"637","DOI":"10.1002\/bies.20236","volume":"27","author":"JH Moore","year":"2005","unstructured":"Moore JH, Williams SM: Traversing the conceptual divide between biological and statistical epistasis: Systems biology and a more modern synthesis. Bioessays 2005, 27: 637\u2013646.","journal-title":"Bioessays"},{"key":"943_CR29","doi-asserted-by":"publisher","first-page":"502","DOI":"10.1038\/ng1033","volume":"32","author":"DK Slonim","year":"2002","unstructured":"Slonim DK: From patterns to pathways: gene expression data analysis comes of age. Nat Genet Suppl 2002, 32: 502\u2013508.","journal-title":"Nat Genet Suppl"},{"key":"943_CR30","doi-asserted-by":"publisher","first-page":"705","DOI":"10.1086\/515510","volume":"61","author":"JL Mountain","year":"1997","unstructured":"Mountain JL, Cavalli-Sforza LL: Multilocus genotypes, a tree of individuals, and human evolutionary history. Am J Hum Genet 1997, 61: 705\u2013718.","journal-title":"Am J Hum Genet"},{"key":"943_CR31","doi-asserted-by":"publisher","first-page":"28","DOI":"10.1186\/1471-2105-4-28","volume":"4","author":"MD Ritchie","year":"2003","unstructured":"Ritchie MD, White B, Parker JS, Hahn LW, Moore JH: Optimization of neural network architecture improves the power to identify gene-gene interaction in common diseases. BMC Bioinformatics 2003, 4: 28.","journal-title":"BMC Bioinformatics"},{"key":"943_CR32","doi-asserted-by":"publisher","first-page":"53","DOI":"10.1002\/gepi.20000","volume":"27","author":"ER Hauser","year":"2004","unstructured":"Hauser ER, Watanabe RM, Duren WL, Bass MP, Langefeld CD, Boehnke M: Ordered subset analysis in genetic linkage mapping of complex traits. Genet Epidemiol 2004, 27: 53\u201363.","journal-title":"Genet Epidemiol"},{"key":"943_CR33","doi-asserted-by":"publisher","first-page":"2115","DOI":"10.1101\/gr.204001","volume":"11","author":"J Hoh","year":"2001","unstructured":"Hoh J, Wille A, Ott J: Trimming, Weighting, and Grouping SNPs in Human Case-Control Association Studies. Genome Res 2001, 11: 2115\u20132119.","journal-title":"Genome Res"},{"key":"943_CR34","doi-asserted-by":"publisher","first-page":"569","DOI":"10.1089\/10665270360688192","volume":"10","author":"J Ott","year":"2003","unstructured":"Ott J, Hoh J: Set association analysis of SNP case-control and microarray data. J Comput Biol 2003, 10: 569\u2013574.","journal-title":"J Comput Biol"},{"key":"943_CR35","doi-asserted-by":"publisher","first-page":"376","DOI":"10.1093\/bioinformatics\/btf869","volume":"19","author":"LW Hahn","year":"2003","unstructured":"Hahn LW, Ritchie MD, Moore JH: Multifactor dimensionality reduction software for detecting gene-gene and gene-environment interactions. Bioinformatics 2003, 19: 376\u2013382.","journal-title":"Bioinformatics"},{"key":"943_CR36","first-page":"183","volume":"4","author":"LW Hahn","year":"2004","unstructured":"Hahn LW, Moore JH: Ideal discrimination of discrete clinical endpoints using multilocus genotypes. In Silico Biol 2004, 4: 183\u201394.","journal-title":"In Silico Biol"},{"key":"943_CR37","doi-asserted-by":"publisher","first-page":"795","DOI":"10.1586\/14737159.4.6.795","volume":"4","author":"JH Moore","year":"2004","unstructured":"Moore JH: Computational analysis of gene-gene interactions using multifactor dimensionality reduction. Expert Rev Mol Diagn 2004, 4: 795\u2013803.","journal-title":"Expert Rev Mol Diagn"},{"key":"943_CR38","volume-title":"Cluster Analysis for Applications","author":"MR Anderberg","year":"1973","unstructured":"Anderberg MR: Cluster Analysis for Applications. New York: Academic Press; 1973."},{"key":"943_CR39","volume-title":"Technical Report # FIA-90-12-7-01","author":"R Hanson","year":"1991","unstructured":"Hanson R, Stutz J, Cheeseman P: Bayesian classification theory. In Technical Report # FIA-90\u201312\u20137-01. Artificial Intelligence Research Branch, NASA Ames Research Center; 1991."},{"key":"943_CR40","first-page":"9","volume-title":"Proceedings of the SIGMOD Workshop on Research Issues on Data Mining and Knowledge Discovery: 1997; Tucson","author":"EH Han","year":"1997","unstructured":"Han EH, Karypis G, Kumar V, Mobasher B: Clustering Based on Association Rule Hypergraphs. Proceedings of the SIGMOD Workshop on Research Issues on Data Mining and Knowledge Discovery: 1997; Tucson 1997, 9\u201313."},{"key":"943_CR41","doi-asserted-by":"publisher","first-page":"446","DOI":"10.1109\/91.784206","volume":"7","author":"Z Huang","year":"1999","unstructured":"Huang Z, Ng MK: A fuzzy k-modes algorithm for clustering categorical data. IEEE Trans Fuzzy Syst 1999, 7: 446\u2013452.","journal-title":"IEEE Trans Fuzzy Syst"},{"key":"943_CR42","doi-asserted-by":"publisher","first-page":"193","DOI":"10.1007\/BF01908075","volume":"2","author":"L Hubert","year":"1985","unstructured":"Hubert L, Arabie P: Comparing partitions. J Classif 1985, 2: 193\u2013218.","journal-title":"J Classif"},{"key":"943_CR43","volume-title":"Technical Report #97-063","author":"EH Han","year":"1997","unstructured":"Han EH, Karypis G, Kumar V, Mobasher B: Clustering in High Dimensional Space Using Hypergraph Models. In Technical Report #97\u2013063. Computer Science and Engineering, University of Minnesota; 1997."},{"key":"943_CR44","volume-title":"Pattern Classification and Scene Analysis","author":"RO Duda","year":"1973","unstructured":"Duda RO, Hart PE: Pattern Classification and Scene Analysis. New York: John Wiley and Sons; 1973."},{"key":"943_CR45","doi-asserted-by":"publisher","first-page":"39","DOI":"10.1038\/sj.ijo.0800541","volume":"22","author":"KM Flegal","year":"1998","unstructured":"Flegal KM, Carroll MD, Kuczmarski RJ: Overweight and obesity in the United States: prevalence and trends, 1960\u20131994. Int J Obe Relat Metab Disord 1998, 22: 39\u201347.","journal-title":"Int J Obe Relat Metab Disord"},{"key":"943_CR46","doi-asserted-by":"publisher","first-page":"99","DOI":"10.1038\/nm0295-99","volume":"1","author":"SA Narod","year":"1995","unstructured":"Narod SA, Dupont A, Cusan L, Diamond P, Gomez J-L, Suburu R, Labrie F: The impact of family history on early detection of prostate cancer. Nat Med 1995, 1: 99\u2013101.","journal-title":"Nat Med"},{"key":"943_CR47","doi-asserted-by":"publisher","first-page":"1425","DOI":"10.1016\/S0140-6736(98)07549-7","volume":"353","author":"S Schultz","year":"1999","unstructured":"Schultz S, Andreasen N: Schizophrenia. Lancet 1999, 353: 1425\u20131430.","journal-title":"Lancet"},{"key":"943_CR48","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1159\/000110240","volume":"10","author":"JF Kurtzke","year":"1991","unstructured":"Kurtzke JF: Multiple sclerosis: changing times. Neuroepidemiology 1991, 10: 1\u20138.","journal-title":"Neuroepidemiology"},{"key":"943_CR49","doi-asserted-by":"publisher","first-page":"334","DOI":"10.1159\/000022939","volume":"50","author":"WT Li","year":"2000","unstructured":"Li WT, Reich J: A complete enumeration and classification of two-locus disease models. Human Heredity 2000, 50: 334\u2013349.","journal-title":"Human Heredity"},{"key":"943_CR50","doi-asserted-by":"publisher","first-page":"371","DOI":"10.1038\/ng1296-371","volume":"14","author":"WN Frankel","year":"1996","unstructured":"Frankel WN, Schork NJ: Who's afraid of epistasis? Nat Genet 1996, 14: 371\u2013373.","journal-title":"Nat Genet"},{"key":"943_CR51","doi-asserted-by":"publisher","DOI":"10.1002\/9780470316801","volume-title":"Finding Groups in Data: An Introduction to Cluster Analysis","author":"L Kaufman","year":"1990","unstructured":"Kaufman L, Rousseeuw PJ: Finding Groups in Data: An Introduction to Cluster Analysis. New York: John Wiley & Sons, Inc; 1990."},{"key":"943_CR52","volume-title":"Advances in Knowledge Discovery and Data Mining","author":"P Cheeseman","year":"1996","unstructured":"Cheeseman P, Stutz J: Bayesian Classification (AutoClass): Theory and Results. In Advances in Knowledge Discovery and Data Mining. Edited by: Fayyad UM, Piatetsky-Shapiro G, Smyth P, Uthurusamy R. Menlo Park: The AAAI Press; 1996."},{"key":"943_CR53","doi-asserted-by":"publisher","first-page":"505","DOI":"10.1109\/ICDM.2001.989558","volume-title":"Proceedings of the IEEE Conference on Data Mining: 2001; IEEE Computer Society","author":"M Seno","year":"2001","unstructured":"Seno M, Karypis G: LPMiner: An Algorithm for Finding Frequent Itemsets Using Length-Decreasing Support Constraint. Proceedings of the IEEE Conference on Data Mining: 2001; IEEE Computer Society 2001, 505\u2013512."},{"key":"943_CR54","doi-asserted-by":"publisher","first-page":"386","DOI":"10.1037\/1082-989X.9.3.386","volume":"9","author":"D Steinley","year":"2004","unstructured":"Steinley D: Properties of the Hubert-Arabie Adjusted Rand Index. Psychol Methods 2004, 9: 386\u2013396.","journal-title":"Psychol Methods"},{"key":"943_CR55","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4757-3235-1","volume-title":"Permutation Tests: A Practical Guide to Resampling Methods for Testing Hypotheses","author":"P Good","year":"2000","unstructured":"Good P: Permutation Tests: A Practical Guide to Resampling Methods for Testing Hypotheses. New York: Springer; 2000."}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/1471-2105-7-204.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,9,1]],"date-time":"2021-09-01T11:00:20Z","timestamp":1630494020000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/1471-2105-7-204"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2006,4,12]]},"references-count":55,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2006,12]]}},"alternative-id":["943"],"URL":"https:\/\/doi.org\/10.1186\/1471-2105-7-204","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2006,4,12]]},"assertion":[{"value":"8 September 2005","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"12 April 2006","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"12 April 2006","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}],"article-number":"204"}}