{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,13]],"date-time":"2025-11-13T01:51:07Z","timestamp":1762998667146,"version":"3.33.0"},"reference-count":31,"publisher":"Oxford University Press (OUP)","issue":"18","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2007,9,15]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: Human immunodeficiency virus type 1 (HIV-1) evolves in human body, and its exposure to a drug often causes mutations that enhance the resistance against the drug. To design an effective pharmacotherapy for an individual patient, it is important to accurately predict the drug resistance based on genotype data. Notably, the resistance is not just the simple sum of the effects of all mutations. Structural biological studies suggest that the association of mutations is crucial: even if mutations A or B alone do not affect the resistance, a significant change might happen when the two mutations occur together. Linear regression methods cannot take the associations into account, while decision tree methods can reveal only limited associations. Kernel methods and neural networks implicitly use all possible associations for prediction, but cannot select salient associations explicitly.<\/jats:p><jats:p>Results: Our method, itemset boosting, performs linear regression in the complete space of power sets of mutations. It implements a forward feature selection procedure where, in each iteration, one mutation combination is found by an efficient branch-and-bound search. This method uses all possible combinations, and salient associations are explicitly shown. In experiments, our method worked particularly well for predicting the resistance of nucleotide reverse transcriptase inhibitors (NRTIs). Furthermore, it successfully recovered many mutation associations known in biological literature.<\/jats:p><jats:p>Availability: \u00a0http:\/\/www.kyb.mpg.de\/bs\/people\/hiroto\/iboost\/<\/jats:p><jats:p>Contact: \u00a0koji.tsuda@tuebingen.mpg.de<\/jats:p><jats:p>Supplementary information: Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btm353","type":"journal-article","created":{"date-parts":[[2007,8,15]],"date-time":"2007-08-15T00:18:22Z","timestamp":1187137102000},"page":"2455-2462","source":"Crossref","is-referenced-by-count":32,"title":["Mining complex genotypic features for predicting HIV-1 drug resistance"],"prefix":"10.1093","volume":"23","author":[{"given":"Hiroto","family":"Saigo","sequence":"first","affiliation":[{"name":"1 Max Planck Institute for Biological Cybernetics, Spemannstra\u00dfe 38, 72076 T\u00fcbingen, Germany and 2National Institute of Informatics, 2-1-2, Hitotsubashi, Chiyoda-ku, Tokyo, Japan"}]},{"given":"Takeaki","family":"Uno","sequence":"additional","affiliation":[{"name":"1 Max Planck Institute for Biological Cybernetics, Spemannstra\u00dfe 38, 72076 T\u00fcbingen, Germany and 2National Institute of Informatics, 2-1-2, Hitotsubashi, Chiyoda-ku, Tokyo, Japan"}]},{"given":"Koji","family":"Tsuda","sequence":"additional","affiliation":[{"name":"1 Max Planck Institute for Biological Cybernetics, Spemannstra\u00dfe 38, 72076 T\u00fcbingen, Germany and 2National Institute of Informatics, 2-1-2, Hitotsubashi, Chiyoda-ku, Tokyo, Japan"}]}],"member":"286","published-online":{"date-parts":[[2007,8,13]]},"reference":[{"key":"2023041106220475700_","doi-asserted-by":"crossref","first-page":"3850","DOI":"10.1093\/nar\/gkg575","article-title":"Geno2pheno: estimating phenotypic drug resistance from HIV-1 genotypes","volume":"31","author":"Beerenwinkel","year":"2003","journal-title":"Nucleic Acids Res"},{"key":"2023041106220475700_","doi-asserted-by":"crossref","first-page":"8271","DOI":"10.1073\/pnas.112177799","article-title":"Diversity and complexity of HIV-1 drug resistance: a bioinformatics approach to predicting phenotype from genotype","volume":"99","author":"Beerenwinkel","year":"2002","journal-title":"Proc. Nalt Acad. Sci. USA"},{"key":"2023041106220475700_","first-page":"5802","article-title":"Combinatorial search methods for multi-SNP disease association","author":"Brinza","year":"2006"},{"key":"2023041106220475700_","doi-asserted-by":"crossref","first-page":"114","DOI":"10.1109\/TCBB.2006.22","article-title":"Functional census of mutation sequence spaces: the example of p53 cancer rescue mutants","volume":"3","author":"Danziger","year":"2006","journal-title":"IEEE\/ACM Trans. Comput. Biol. Bioinform"},{"key":"2023041106220475700_","doi-asserted-by":"crossref","first-page":"2975","DOI":"10.1093\/bioinformatics\/btl508","article-title":"Analysis of HIV-1 pol sequences using bayesian networks: implications for drug resistance","volume":"22","author":"Deforche","year":"2006","journal-title":"Bioinformatics"},{"key":"2023041106220475700_","doi-asserted-by":"crossref","first-page":"225","DOI":"10.1023\/A:1012470815092","article-title":"Linear programming boosting via column generation","volume":"46","author":"Demiriz","year":"2002","journal-title":"Mach. Learn"},{"key":"2023041106220475700_","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1023\/A:1007607513941","article-title":"An experimental comparison of three methods for constructing ensembles of decision trees: bagging, boosting, and randomization","volume":"40","author":"Dietterich","year":"2000","journal-title":"Mach. Learn"},{"key":"2023041106220475700_","doi-asserted-by":"crossref","first-page":"2785","DOI":"10.1002\/sim.1516","article-title":"Non-parametric methods to predict HIV drug susceptibility tphenotype from genotype","volume":"22","author":"Dirienzo","year":"2003","journal-title":"Stat. Med"},{"key":"2023041106220475700_","doi-asserted-by":"crossref","first-page":"146","DOI":"10.1111\/j.0006-341X.2002.00145.x","article-title":"Characterizing the relationship between HIV-1 genotype and phenotype: prediction-based classification","volume":"58","author":"Foulkes","year":"2002","journal-title":"Biometrics"},{"key":"2023041106220475700_","doi-asserted-by":"crossref","first-page":"119","DOI":"10.1006\/jcss.1997.1504","article-title":"A decision-theoretic generalization of on-line learning and an application to boosting","volume":"55","author":"Freund","year":"1996","journal-title":"J. Comput. Syst. Sci"},{"key":"2023041106220475700_","doi-asserted-by":"crossref","first-page":"1086","DOI":"10.1128\/jvi.70.2.1086-1090.1996","article-title":"Multidrug-resistant human immunodeficiency virus type 1 strains resulting from combination antiretroviral therapy","volume":"70","author":"Iversen","year":"1996","journal-title":"J. Virol"},{"key":"2023041106220475700_","doi-asserted-by":"crossref","first-page":"199","DOI":"10.1089\/108729104323038874","article-title":"Cross-resistance patterns among HIV protease inhibitors","volume":"18","author":"Kozal","year":"2004","journal-title":"AIDS Patient Care and STDs"},{"key":"2023041106220475700_","first-page":"729","article-title":"An application of boosting to graph classification","volume-title":"Advances in Neural Information Processing Systems 17.","author":"Kudo","year":"2005"},{"key":"2023041106220475700_","first-page":"521","article-title":"Simpler knowledge-based support vector machines","author":"Le","year":"2006"},{"key":"2023041106220475700_","first-page":"790","article-title":"Bioinformatics-assisted anti-HIV therapy","volume":"4","author":"Lengauer","year":"2006","journal-title":"Nature"},{"key":"2023041106220475700_","article-title":"Computing optimal hypotheses efficiently for boosting","author":"Morishita","year":"2001","journal-title":"Discovery Science"},{"key":"2023041106220475700_","first-page":"364","article-title":"Mining the quantitative trait loci associated with oral glucose tolerance in the oletf rat","author":"Nakaya","year":"2000"},{"key":"2023041106220475700_","doi-asserted-by":"crossref","first-page":"541","DOI":"10.1093\/bioinformatics\/btk011","article-title":"Accurate prediction of HIV-1 drug response from the reverse transciptase and protease amino acid sequences using sparse models created by convex optimization","volume":"22","author":"Rabinowitz","year":"2006","journal-title":"Bioinformatics"},{"key":"2023041106220475700_","doi-asserted-by":"crossref","first-page":"298","DOI":"10.1093\/nar\/gkg100","article-title":"Human immunodeficiency virus reverse transcriptase and protease sequence database","volume":"31","author":"Rhee","year":"2003","journal-title":"Nucleic Acids. Res"},{"key":"2023041106220475700_","doi-asserted-by":"crossref","first-page":"17355","DOI":"10.1073\/pnas.0607274103","article-title":"Genotypic predictors of human immunodeficiency virus type 1 drug resistance","volume":"103","author":"Rhee","year":"2006","journal-title":"Proc. Natl Acad. Sci. USA"},{"key":"2023041106220475700_","doi-asserted-by":"crossref","first-page":"17526","DOI":"10.1016\/S0021-9258(19)37074-7","article-title":"Functional analysis of HIV-1 reverse transcriptase amino acids involved in resistance to multiple nonnucleoside inhibitors","volume":"267","author":"Sardana","year":"1992","journal-title":"J. Biol. Chem"},{"volume-title":"Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond.","year":"2002","author":"Scholk\u00f6pf","key":"2023041106220475700_"},{"key":"2023041106220475700_","doi-asserted-by":"crossref","first-page":"1297","DOI":"10.7551\/mitpress\/7503.003.0167","article-title":"Mutagenetic tree fisher kernel improves prediction of HIV drug resistance from viral genotype","volume-title":"Advances in Neural Information Processing Systems 19.","author":"Sing","year":"2007"},{"key":"2023041106220475700_","first-page":"285","article-title":"Characterization of novel HIV drug resistance mutations using clustering, multidimensional scaling and SVM-based feature ranking","author":"Sing","year":"2005"},{"key":"2023041106220475700_","doi-asserted-by":"crossref","first-page":"625","DOI":"10.1080\/10556789908805766","article-title":"Using SeDuMi 1.02, a MATLAB toolbox for optimization over symmetric cones","volume":"11\u201312","author":"Sturm","year":"1999","journal-title":"Opt. Meth. Softw"},{"issue":"1","key":"2023041106220475700_","doi-asserted-by":"crossref","first-page":"267","DOI":"10.1111\/j.2517-6161.1996.tb02080.x","article-title":"Regression shrinkage and selection via the LASSO","volume":"58","author":"Tibshrani","year":"1996","journal-title":"J. R. Stat. Soc. Ser. B Stat. Methodol"},{"key":"2023041106220475700_","first-page":"77","article-title":"LCM ver.3: collaboration of array, bitmap and prefix tree for frequent itemset mining","author":"Uno","year":"2005"},{"key":"2023041106220475700_","doi-asserted-by":"crossref","first-page":"520","DOI":"10.1186\/1471-2105-7-520","article-title":"An accurate and interpretable model for siRNA efficacy prediction","volume":"7","author":"Vert","year":"2006","journal-title":"BMC Bioinformatics"},{"key":"2023041106220475700_","doi-asserted-by":"crossref","first-page":"163","DOI":"10.1007\/s00018-005-5367-x","article-title":"Nucleoside and nucleotide inhibitors of HIV-1 replication","volume":"63","author":"Vivet-Boudou","year":"2006","journal-title":"Cell. Mole. Life Sci"},{"key":"2023041106220475700_","doi-asserted-by":"crossref","first-page":"653","DOI":"10.1086\/377453","article-title":"Enhanced prediction of lopinavir resistance from genotype by use of artificial neural networks","volume":"188","author":"Wang","year":"2003","journal-title":"J. Infect. Dis"},{"key":"2023041106220475700_","doi-asserted-by":"crossref","first-page":"845","DOI":"10.1101\/gr.3394405","article-title":"Discovering functional transcription-factor combinations in the human cell cycle","volume":"15","author":"Zhu","year":"2005","journal-title":"Genome Res"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/18\/2455\/49816939\/bioinformatics_23_18_2455.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/23\/18\/2455\/49816939\/bioinformatics_23_18_2455.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,1,20]],"date-time":"2025-01-20T10:09:08Z","timestamp":1737367748000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/23\/18\/2455\/237242"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2007,8,13]]},"references-count":31,"journal-issue":{"issue":"18","published-print":{"date-parts":[[2007,9,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btm353","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"type":"electronic","value":"1367-4811"},{"type":"print","value":"1367-4803"}],"subject":[],"published-other":{"date-parts":[[2007,9,15]]},"published":{"date-parts":[[2007,8,13]]}}}