{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,5]],"date-time":"2025-11-05T14:11:32Z","timestamp":1762351892108,"version":"3.30.2"},"reference-count":19,"publisher":"Oxford University Press (OUP)","issue":"16","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2007,8,15]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: The development of genome-wide capabilities for genotyping has led to the practical problem of identifying the minimum subset of genetic variants relevant to the classification of a phenotype. This challenge is especially difficult in the presence of attribute interactions, noise and small sample size.<\/jats:p>\n               <jats:p>Methods: Analogous to the physical mechanism of evaporation, we introduce an evaporative cooling (EC) feature selection algorithm that seeks to obtain a subset of attributes with the optimum information temperature (i.e. the least noise). EC uses an attribute quality measure analogous to thermodynamic free energy that combines Relief-F and mutual information to evaporate (i.e. remove) noise features, leaving behind a subset of attributes that contain DNA sequence variations associated with a given phenotype.<\/jats:p>\n               <jats:p>Results: EC is able to identify functional sequence variations that involve interactions (epistasis) between other sequence variations that influence their association with the phenotype. This ability is demonstrated on simulated genotypic data with attribute interactions and on real genotypic data from individuals who experienced adverse events following smallpox vaccination. The EC formalism allows us to combine information entropy, energy and temperature into a single information free energy attribute quality measure that balances interaction and main effects.<\/jats:p>\n               <jats:p>Availability: Open source software, written in Java, is freely available upon request.<\/jats:p>\n               <jats:p>Contact: \u00a0brett.mckinney@gmail.com<\/jats:p>","DOI":"10.1093\/bioinformatics\/btm317","type":"journal-article","created":{"date-parts":[[2007,6,23]],"date-time":"2007-06-23T00:39:53Z","timestamp":1182559193000},"page":"2113-2120","source":"Crossref","is-referenced-by-count":38,"title":["Evaporative cooling feature selection for genotypic data involving interactions"],"prefix":"10.1093","volume":"23","author":[{"given":"B.A.","family":"McKinney","sequence":"first","affiliation":[{"name":"1 Department of Genetics, University of Alabama School of Medicine, Birmingham, AL 35294, 2National Center for Computational Toxicology, U.S. Environmental Protection Agency, Research Triangle Park, NC 27711, 3Computational Genetics Laboratory, Department of Genetics, Dartmouth Medical School, Lebanon, NH 03756 and 4Program in Vaccine Sciences, Departments of Microbiology and Immunology, and Pediatrics, Vanderbilt University Medical Center, Nashville, TN 37232, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"D.M.","family":"Reif","sequence":"additional","affiliation":[{"name":"1 Department of Genetics, University of Alabama School of Medicine, Birmingham, AL 35294, 2National Center for Computational Toxicology, U.S. Environmental Protection Agency, Research Triangle Park, NC 27711, 3Computational Genetics Laboratory, Department of Genetics, Dartmouth Medical School, Lebanon, NH 03756 and 4Program in Vaccine Sciences, Departments of Microbiology and Immunology, and Pediatrics, Vanderbilt University Medical Center, Nashville, TN 37232, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"B.C.","family":"White","sequence":"additional","affiliation":[{"name":"1 Department of Genetics, University of Alabama School of Medicine, Birmingham, AL 35294, 2National Center for Computational Toxicology, U.S. Environmental Protection Agency, Research Triangle Park, NC 27711, 3Computational Genetics Laboratory, Department of Genetics, Dartmouth Medical School, Lebanon, NH 03756 and 4Program in Vaccine Sciences, Departments of Microbiology and Immunology, and Pediatrics, Vanderbilt University Medical Center, Nashville, TN 37232, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"suffix":"Jr.","given":"J.E.","family":"Crowe","sequence":"additional","affiliation":[{"name":"1 Department of Genetics, University of Alabama School of Medicine, Birmingham, AL 35294, 2National Center for Computational Toxicology, U.S. Environmental Protection Agency, Research Triangle Park, NC 27711, 3Computational Genetics Laboratory, Department of Genetics, Dartmouth Medical School, Lebanon, NH 03756 and 4Program in Vaccine Sciences, Departments of Microbiology and Immunology, and Pediatrics, Vanderbilt University Medical Center, Nashville, TN 37232, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"J.H.","family":"Moore","sequence":"additional","affiliation":[{"name":"1 Department of Genetics, University of Alabama School of Medicine, Birmingham, AL 35294, 2National Center for Computational Toxicology, U.S. Environmental Protection Agency, Research Triangle Park, NC 27711, 3Computational Genetics Laboratory, Department of Genetics, Dartmouth Medical School, Lebanon, NH 03756 and 4Program in Vaccine Sciences, Departments of Microbiology and Immunology, and Pediatrics, Vanderbilt University Medical Center, Nashville, TN 37232, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2007,6,22]]},"reference":[{"key":"2024121117583892000_B1","doi-asserted-by":"crossref","DOI":"10.1109\/CVPRW.2003.10065","article-title":"Iterative relief","volume-title":"Workshop on Learning in Computer Vision and Pattern Recognition","author":"Draper","year":"2003"},{"key":"2024121117583892000_B2","first-page":"499","article-title":"Data simulation software for whole-genome association and other studies in human genetics","volume":"11","author":"Dudek","year":"2006","journal-title":"Pac. Symp. Biocomput"},{"key":"2024121117583892000_B3","doi-asserted-by":"crossref","first-page":"161","DOI":"10.1016\/0021-9991(90)90201-B","article-title":"Threshold accepting: a general purpose optimization algorithm appearing superior to simulated annealing","volume":"90","author":"Dueck","year":"1990","journal-title":"J. Comput. Phys"},{"key":"2024121117583892000_B4","doi-asserted-by":"crossref","first-page":"371","DOI":"10.1038\/ng1296-371","article-title":"Who's afraid of epistasis?","volume":"14","author":"Frankel","year":"1996","journal-title":"Nat. Genet"},{"key":"2024121117583892000_B5","doi-asserted-by":"crossref","first-page":"3476","DOI":"10.1103\/PhysRevB.34.3476","article-title":"Evaporative cooling of magnetically trapped and compressed spin-polarized hydrogen","volume":"34","author":"Hess","year":"1986","journal-title":"Phys. Rev. B"},{"key":"2024121117583892000_B6","doi-asserted-by":"crossref","first-page":"620","DOI":"10.1103\/PhysRev.106.620","article-title":"Information theory and statistical mechanics","volume":"106","author":"Jaynes","year":"1957","journal-title":"Phys. Rev"},{"key":"2024121117583892000_B7","doi-asserted-by":"crossref","DOI":"10.1016\/B978-1-55860-247-2.50037-1","article-title":"A practical approach to feature selection","volume-title":"Proceedings of the Tenth International Conference on Machine Learning","author":"Kira","year":"1992"},{"key":"2024121117583892000_B8","article-title":"Estimation of attributes: analysis and extensions of relief","volume-title":"European Conference on Machine Learning","author":"Kononenko","year":"1994"},{"key":"2024121117583892000_B9","article-title":"Relieff for estimation and discretization of attributes in classification, regression and ilp problems","volume-title":"Artificial Intelligence: Methodology, Systems, Applications","author":"Kononenko","year":"1996"},{"key":"2024121117583892000_B10","doi-asserted-by":"crossref","first-page":"334","DOI":"10.1159\/000022939","article-title":"A complete enumeration and classification of two-locus disease models","volume":"50","author":"Li","year":"2000","journal-title":"Hum. Hered"},{"key":"2024121117583892000_B11","doi-asserted-by":"crossref","first-page":"77","DOI":"10.2165\/00822942-200605020-00002","article-title":"Machine learning for detecting gene-gene interactions","volume":"5","author":"McKinney","year":"2006","journal-title":"Appl. Bioinformatics"},{"key":"2024121117583892000_B12","doi-asserted-by":"crossref","first-page":"36092","DOI":"10.1086\/505503","article-title":"Cytokine expression patterns associated with systemic adverse events following smallpox immunization","volume":"194","author":"McKinney","year":"2006","journal-title":"J. Infect. Dis"},{"key":"2024121117583892000_B13","doi-asserted-by":"crossref","first-page":"73","DOI":"10.1159\/000073735","article-title":"The ubiquitous nature of epistasis in determining susceptibility to common human diseases","volume":"56","author":"Moore","year":"2003","journal-title":"Hum. Hered"},{"key":"2024121117583892000_B14","article-title":"Genome-wide analysis of epistasis using multifactor dimensionality reduction: feature selection and construction in the domain of human genetics","author":"Moore","year":"2006","journal-title":"In Knowledge Discovery and Data Mining: Challenges and Realities with Real World Data"},{"key":"2024121117583892000_B15","article-title":"Tuning relieff for genome-wide genetic analysis","author":"Moore","year":"2007","journal-title":"Lecture Notes in Computer Science"},{"volume-title":"Elements of informational statistics with applications","year":"1979","author":"Onicescu","key":"2024121117583892000_B16"},{"volume-title":"Numerical Recipes in C: The Art of Scientific Computing","year":"1988","author":"Press","key":"2024121117583892000_B17"},{"key":"2024121117583892000_B18","doi-asserted-by":"crossref","first-page":"640","DOI":"10.1016\/j.tig.2004.09.007","article-title":"Genetics, statistics and human disease: analytical retooling for complexity","volume":"20","author":"Thornton-Wells","year":"2004","journal-title":"Trends Genet"},{"key":"2024121117583892000_B19","volume-title":"Data Mining: Practical machine learning tools and techniques","author":"Witten","year":"2005","edition":"2nd ed"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/16\/2113\/61051537\/bioinformatics_23_16_2113.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/16\/2113\/61051537\/bioinformatics_23_16_2113.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,12,11]],"date-time":"2024-12-11T22:22:07Z","timestamp":1733955727000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/23\/16\/2113\/199758"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2007,6,22]]},"references-count":19,"journal-issue":{"issue":"16","published-print":{"date-parts":[[2007,8,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btm317","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"type":"electronic","value":"1367-4811"},{"type":"print","value":"1367-4803"}],"subject":[],"published-other":{"date-parts":[[2007,8,15]]},"published":{"date-parts":[[2007,6,22]]}}}