{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,25]],"date-time":"2026-03-25T01:53:30Z","timestamp":1774403610206,"version":"3.50.1"},"reference-count":42,"publisher":"Oxford University Press (OUP)","issue":"3","license":[{"start":{"date-parts":[[2023,4,25]],"date-time":"2023-04-25T00:00:00Z","timestamp":1682380800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/pages\/standard-publication-reuse-rights"}],"funder":[{"name":"Department of Biostatistics, Columbia University"},{"DOI":"10.13039\/100000002","name":"NIH","doi-asserted-by":"publisher","id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2023,5,19]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>Studies have found that human microbiome is associated with and predictive of human health and diseases. Many statistical methods developed for microbiome data focus on different distance metrics that can capture various information in microbiomes. Prediction models were also developed for microbiome data, including deep learning methods with convolutional neural networks that consider both taxa abundance profiles and taxonomic relationships among microbial taxa from a phylogenetic tree. Studies have also suggested that a health outcome could associate with multiple forms of microbiome profiles. In addition to the abundance of some taxa that are associated with a health outcome, the presence\/absence of some taxa is also associated with and predictive of the same health outcome. Moreover, associated taxa may be close to each other on a phylogenetic tree or spread apart on a phylogenetic tree. No prediction models currently exist that use multiple forms of microbiome-outcome associations. To address this, we propose a multi-kernel machine regression (MKMR) method that is able to capture various types of microbiome signals when doing predictions. MKMR utilizes multiple forms of microbiome signals through multiple kernels being transformed from multiple distance metrics for microbiomes and learn an optimal conic combination of these kernels, with kernel weights helping us understand contributions of individual microbiome signal types. Simulation studies suggest a much-improved prediction performance over competing methods with mixture of microbiome signals. Real data applicants to predict multiple health outcomes using throat and gut microbiome data also suggest a better prediction of MKMR than that of competing methods.<\/jats:p>","DOI":"10.1093\/bib\/bbad158","type":"journal-article","created":{"date-parts":[[2023,4,26]],"date-time":"2023-04-26T14:37:04Z","timestamp":1682519824000},"source":"Crossref","is-referenced-by-count":4,"title":["MKMR: a multi-kernel machine regression model to predict health outcomes using human microbiome data"],"prefix":"10.1093","volume":"24","author":[{"given":"Bing","family":"Li","sequence":"first","affiliation":[{"name":"Department of Biostatistics, School of Public Health, Brown University , Providence, Rhode Island,","place":["U.S.A"]}]},{"given":"Tian","family":"Wang","sequence":"additional","affiliation":[{"name":"Department of Biostatistics, Mailman School of Public Health, Columbia University , 722 West 168th Street, New York, New York, 10032","place":["U.S.A"]}]},{"given":"Min","family":"Qian","sequence":"additional","affiliation":[{"name":"Department of Biostatistics, Mailman School of Public Health, Columbia University , 722 West 168th Street, New York, New York, 10032","place":["U.S.A"]}]},{"given":"Shuang","family":"Wang","sequence":"additional","affiliation":[{"name":"Department of Biostatistics, School of Public Health, Brown University , Providence, Rhode Island,","place":["U.S.A"]}]}],"member":"286","published-online":{"date-parts":[[2023,4,25]]},"reference":[{"issue":"11","key":"2026032301171193600_ref1","doi-asserted-by":"crossref","first-page":"805","DOI":"10.1038\/nrg1709","article-title":"Metagenomics: Dna sequencing of environmental samples","volume":"6","author":"Tringe","year":"2005","journal-title":"Nat Rev Genet"},{"issue":"9","key":"2026032301171193600_ref2","doi-asserted-by":"crossref","first-page":"631","DOI":"10.1038\/nrmicro2857","article-title":"Genomic sequencing of uncultured microorganisms from single cells","volume":"10","author":"Lasken","year":"2012","journal-title":"Nat Rev Microbiol"},{"issue":"2","key":"2026032301171193600_ref3","doi-asserted-by":"crossref","first-page":"100","DOI":"10.1002\/cpmo.29","article-title":"Microbiota analysis using an illumina miseq platform to sequence 16s rrna genes","volume":"7","author":"Rapin","year":"2017","journal-title":"Current Protocol Mouse Biol"},{"issue":"5","key":"2026032301171193600_ref4","doi-asserted-by":"crossref","first-page":"335","DOI":"10.1038\/nmeth.f.303","article-title":"Qiime allows analysis of high-throughput community sequencing data","volume":"7","author":"Gregory Caporaso","year":"2010","journal-title":"Nat Methods"},{"issue":"1","key":"2026032301171193600_ref5","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/npjbiofilms.2016.4","article-title":"A perspective on 16s rrna operational taxonomic unit clustering using sequence similarity","volume":"2","author":"Nguyen","year":"2016","journal-title":"NPJ Biofilms Microbiomes"},{"issue":"6","key":"2026032301171193600_ref6","doi-asserted-by":"crossref","first-page":"1258","DOI":"10.1016\/j.cell.2012.01.035","article-title":"The impact of the gut microbiota on human health: an integrative view","volume":"148","author":"Clemente","year":"2012","journal-title":"Cell"},{"issue":"10","key":"2026032301171193600_ref7","doi-asserted-by":"crossref","first-page":"2435","DOI":"10.1038\/ismej.2016.37","article-title":"Cigarette smoking and the oral microbiome in a large study of american adults","volume":"10","author":"Jing","year":"2016","journal-title":"ISME J"},{"issue":"5519","key":"2026032301171193600_ref8","doi-asserted-by":"crossref","first-page":"1115","DOI":"10.1126\/science.1058709","article-title":"Commensal host-bacterial relationships in the gut","volume":"292","author":"Hooper","year":"2001","journal-title":"Science"},{"issue":"9","key":"2026032301171193600_ref9","doi-asserted-by":"crossref","first-page":"R79","DOI":"10.1186\/gb-2012-13-9-r79","article-title":"Dysfunction of the intestinal microbiome in inflammatory bowel disease and treatment","volume":"13","author":"Morgan","year":"2012","journal-title":"Genome Biol"},{"issue":"1","key":"2026032301171193600_ref10","doi-asserted-by":"crossref","first-page":"118","DOI":"10.1186\/1471-2105-12-118","article-title":"Variance adjusted weighted unifrac: a powerful beta diversity measure for comparing communities based on phylogeny","volume":"12","author":"Chang","year":"2011","journal-title":"BMC Bioinform"},{"issue":"12","key":"2026032301171193600_ref11","doi-asserted-by":"crossref","first-page":"8228","DOI":"10.1128\/AEM.71.12.8228-8235.2005","article-title":"Unifrac: a new phylogenetic method for comparing microbial communities","volume":"71","author":"Lozupone","year":"2005","journal-title":"Appl Environ Microbiol"},{"issue":"5","key":"2026032301171193600_ref12","doi-asserted-by":"crossref","first-page":"1576","DOI":"10.1128\/AEM.01996-06","article-title":"Quantitative and qualitative beta diversity measures lead to different insights into factors that structure microbial communities","volume":"73","author":"Lozupone","year":"2007","journal-title":"Appl Environ Microbiol"},{"issue":"16","key":"2026032301171193600_ref13","doi-asserted-by":"crossref","first-page":"2106","DOI":"10.1093\/bioinformatics\/bts342","article-title":"Associating microbiome composition with environmental covariates using generalized unifrac distances","volume":"28","author":"Chen","year":"2012","journal-title":"Bioinformatics"},{"issue":"4","key":"2026032301171193600_ref14","first-page":"326","article-title":"An ordination of the upland forest communities of southern Wisconsin","volume":"27","author":"Roger Bray","year":"1957","journal-title":"Ecol Monogr"},{"issue":"5","key":"2026032301171193600_ref15","doi-asserted-by":"crossref","first-page":"797","DOI":"10.1016\/j.ajhg.2015.04.003","article-title":"Testing in microbiome-profiling studies with mirkat, the microbiome regression-based kernel association test","volume":"96","author":"Zhao","year":"2015","journal-title":"Am J Hum Genet"},{"issue":"1","key":"2026032301171193600_ref16","doi-asserted-by":"crossref","first-page":"45","DOI":"10.1186\/s40168-017-0262-x","article-title":"A powerful microbiome-based association test and a microbial taxa discovery framework for comprehensive association mapping","volume":"5","author":"Koh","year":"2017","journal-title":"Microbiome"},{"issue":"1","key":"2026032301171193600_ref17","doi-asserted-by":"crossref","first-page":"267","DOI":"10.1111\/j.2517-6161.1996.tb02080.x","article-title":"Regression shrinkage and selection via the lasso","volume":"58","author":"Tibshirani","year":"1996","journal-title":"J R Stat Soc B Methodol"},{"issue":"1","key":"2026032301171193600_ref18","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1023\/A:1010933404324","article-title":"Random forests","volume":"45","author":"Breiman","year":"2001","journal-title":"Mach Learn"},{"issue":"4","key":"2026032301171193600_ref19","doi-asserted-by":"crossref","first-page":"449","DOI":"10.1093\/bioinformatics\/btt700","article-title":"Phylogeny-based classification of microbial communities","volume":"30","author":"Tanaseichuk","year":"2014","journal-title":"Bioinformatics"},{"issue":"24","key":"2026032301171193600_ref20","doi-asserted-by":"crossref","first-page":"3991","DOI":"10.1093\/bioinformatics\/btv497","article-title":"Glmgraph: an r package for variable selection and predictive modeling of structured genomic data","volume":"31","author":"Chen","year":"2015","journal-title":"Bioinformatics"},{"key":"2026032301171193600_ref21","doi-asserted-by":"crossref","first-page":"1391","DOI":"10.3389\/fmicb.2018.01391","article-title":"Predictive modeling of microbiome data using a phylogeny-regularized generalized linear mixed model","volume":"9","author":"Xiao","year":"2018","journal-title":"Front Microbiol"},{"issue":"4","key":"2026032301171193600_ref22","doi-asserted-by":"crossref","first-page":"104081","DOI":"10.1016\/j.isci.2022.104081","article-title":"Human disease prediction from microbiome data by multiple feature fusion and deep learning","volume":"25","author":"Chen","year":"2022","journal-title":"Iscience"},{"issue":"4","key":"2026032301171193600_ref23","doi-asserted-by":"crossref","first-page":"e1010050","DOI":"10.1371\/journal.pcbi.1010050","article-title":"Microbiome-based disease prediction with multimodal variational information bottlenecks","volume":"18","author":"Grazioli","year":"2022","journal-title":"PLoS Comput Biol"},{"issue":"3","key":"2026032301171193600_ref24","doi-asserted-by":"crossref","first-page":"bbaa073","DOI":"10.1093\/bib\/bbaa073","article-title":"A novel deep learning method for predictive modeling of microbiome data","volume":"22","author":"Wang","year":"2021","journal-title":"Brief Bioinform"},{"issue":"17","key":"2026032301171193600_ref25","doi-asserted-by":"crossref","first-page":"4544","DOI":"10.1093\/bioinformatics\/btaa542","article-title":"Taxonn: ensemble of neural networks on stratified microbiome data for disease prediction","volume":"36","author":"Sharma","year":"2020","journal-title":"Bioinformatics"},{"issue":"10","key":"2026032301171193600_ref26","doi-asserted-by":"crossref","first-page":"2993","DOI":"10.1109\/JBHI.2020.2993761","article-title":"Popphy-cnn: a phylogenetic tree embedded architecture for convolutional neural networks to predict host phenotype from metagenomic data","volume":"24","author":"Reiman","year":"2020","journal-title":"IEEE J Biomed Health Inform"},{"issue":"4","key":"2026032301171193600_ref27","doi-asserted-by":"crossref","first-page":"e1010066","DOI":"10.1371\/journal.pcbi.1010066","article-title":"Host phenotype classification from human microbiome data is mainly driven by the presence of microbial taxa","volume":"18","author":"Giliberti","year":"2022","journal-title":"PLoS Comput Biol"},{"issue":"7228","key":"2026032301171193600_ref28","doi-asserted-by":"crossref","first-page":"480","DOI":"10.1038\/nature07540","article-title":"A core gut microbiome in obese and lean twins","volume":"457","author":"Turnbaugh","year":"2009","journal-title":"Nature"},{"issue":"7452","key":"2026032301171193600_ref29","doi-asserted-by":"crossref","first-page":"99","DOI":"10.1038\/nature12198","article-title":"Gut metagenome in european women with normal, impaired and diabetic glucose control","volume":"498","author":"Karlsson","year":"2013","journal-title":"Nature"},{"key":"2026032301171193600_ref30","first-page":"73","article-title":"Computing regularization paths for learning multiple kernels","volume-title":"Advances in neural information processing systems","author":"Bach F","year":"2005"},{"issue":"12","key":"2026032301171193600_ref31","doi-asserted-by":"crossref","first-page":"e15216","DOI":"10.1371\/journal.pone.0015216","article-title":"Disordered microbial communities in the upper respiratory tract of cigarette smokers","volume":"5","author":"Charlson","year":"2010","journal-title":"PloS One"},{"issue":"1","key":"2026032301171193600_ref32","doi-asserted-by":"crossref","first-page":"108","DOI":"10.1186\/s40168-017-0316-0","article-title":"Oxalobacter formigenes-associated host features and microbial community structures examined using the american gut project","volume":"5","author":"Liu","year":"2017","journal-title":"Microbiome"},{"issue":"4","key":"2026032301171193600_ref33","doi-asserted-by":"crossref","first-page":"211","DOI":"10.1002\/gepi.20567","article-title":"Relationship between genomic distance-based regression and kernel machine regression for multi-marker association testing","volume":"35","author":"Pan","year":"2011","journal-title":"Genet Epidemiol"},{"key":"2026032301171193600_ref34","doi-asserted-by":"crossref","first-page":"191","DOI":"10.1007\/978-1-4614-7846-1_16","article-title":"Kernel methods for regression analysis of microbiome compositional data","volume-title":"Topics in Applied Statistics","author":"Chen","year":"2013"},{"key":"2026032301171193600_ref35","doi-asserted-by":"crossref","first-page":"103","DOI":"10.1016\/0024-3795(88)90223-6","article-title":"Computing a nearest symmetric positive semidefinite matrix","volume":"103","author":"Higham","year":"1988","journal-title":"Linear Algebra Appl"},{"key":"2026032301171193600_ref36","doi-asserted-by":"crossref","DOI":"10.1017\/CBO9780511804441","volume-title":"Convex optimization","author":"Boyd","year":"2004"},{"key":"2026032301171193600_ref37","first-page":"6","article-title":"Multiple kernel learning, conic duality, and the smo algorithm","volume-title":"Proceedings of the twenty-first international conference on Machine learning","author":"Bach","year":"2004"},{"key":"2026032301171193600_ref38","doi-asserted-by":"crossref","first-page":"44","DOI":"10.1109\/BIBM.2018.8621382","article-title":"Paam-ml: A novel phylogeny and abundance aware machine learning modelling approach for microbiome classification","volume-title":"2018 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","author":"","year":"2018"},{"issue":"3","key":"2026032301171193600_ref39","doi-asserted-by":"crossref","first-page":"273","DOI":"10.1109\/TNB.2019.2912824","article-title":"Phy-pmrfi: phylogeny-aware prediction of metagenomic functions using random forest feature importance","volume":"18","author":"Wassan","year":"2019","journal-title":"IEEE Trans Nanobiosci"},{"issue":"4","key":"2026032301171193600_ref40","doi-asserted-by":"crossref","first-page":"1079","DOI":"10.1111\/j.1541-0420.2007.00799.x","article-title":"Semiparametric regression of multidimensional genetic pathway data: least-squares kernel machines and linear mixed models","volume":"63","author":"Liu","year":"2007","journal-title":"Biometrics"},{"issue":"1","key":"2026032301171193600_ref41","doi-asserted-by":"crossref","first-page":"292","DOI":"10.1186\/1471-2105-9-292","article-title":"Estimation and testing for the effect of a genetic pathway on a disease outcome using logistic kernel machine regression via logistic mixed models","volume":"9","author":"Liu","year":"2008","journal-title":"BMC Bioinform"},{"key":"2026032301171193600_ref42","author":"MDeep"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/24\/3\/bbad158\/50411029\/bbad158.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/24\/3\/bbad158\/50411029\/bbad158.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,3,23]],"date-time":"2026-03-23T05:17:22Z","timestamp":1774243042000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbad158\/7142722"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,4,25]]},"references-count":42,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2023,5,19]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbad158","relation":{},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2023,5]]},"published":{"date-parts":[[2023,4,25]]},"article-number":"bbad158"}}