{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,18]],"date-time":"2026-03-18T05:33:31Z","timestamp":1773812011264,"version":"3.50.1"},"update-to":[{"DOI":"10.1371\/journal.pcbi.1010714","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2022,12,28]],"date-time":"2022-12-28T00:00:00Z","timestamp":1672185600000}}],"reference-count":128,"publisher":"Public Library of Science (PLoS)","issue":"12","license":[{"start":{"date-parts":[[2022,12,14]],"date-time":"2022-12-14T00:00:00Z","timestamp":1670976000000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100004189","name":"Max-Planck-Gesellschaft","doi-asserted-by":"publisher","id":[{"id":"10.13039\/501100004189","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100009708","name":"Novo Nordisk Fonden","doi-asserted-by":"publisher","award":["0069071"],"award-info":[{"award-number":["0069071"]}],"id":[{"id":"10.13039\/501100009708","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["www.ploscompbiol.org"],"crossmark-restriction":false},"short-container-title":["PLoS Comput Biol"],"abstract":"<jats:p>\n                    Tree ensemble machine learning models are increasingly used in microbiome science as they are compatible with the compositional, high-dimensional, and sparse structure of sequence-based microbiome data. While such models are often good at predicting phenotypes based on microbiome data, they only yield limited insights into how microbial taxa may be associated. We developed endoR, a method to interpret tree ensemble models. First, endoR simplifies the fitted model into a decision ensemble. Then, it extracts information on the importance of individual features and their pairwise interactions, displaying them as an interpretable network. Both the endoR network and importance scores provide insights into how features, and interactions between them, contribute to the predictive performance of the fitted model. Adjustable regularization and bootstrapping help reduce the complexity and ensure that only essential parts of the model are retained. We assessed endoR on both simulated and real metagenomic data. We found endoR to have comparable accuracy to other common approaches while easing and enhancing model interpretation. Using endoR, we also confirmed published results on gut microbiome differences between cirrhotic and healthy individuals. Finally, we utilized endoR to explore associations between human gut methanogens and microbiome components. Indeed, these hydrogen consumers are expected to interact with fermenting bacteria in a complex syntrophic network. Specifically, we analyzed a global metagenome dataset of 2203 individuals and confirmed the previously reported association between\n                    <jats:italic>Methanobacteriaceae<\/jats:italic>\n                    and\n                    <jats:italic>Christensenellales<\/jats:italic>\n                    . Additionally, we observed that\n                    <jats:italic>Methanobacteriaceae<\/jats:italic>\n                    are associated with a network of hydrogen-producing bacteria. Our method accurately captures how tree ensembles use features and interactions between them to predict a response. As demonstrated by our applications, the resultant visualizations and summary outputs facilitate model interpretation and enable the generation of novel hypotheses about complex systems.\n                  <\/jats:p>","DOI":"10.1371\/journal.pcbi.1010714","type":"journal-article","created":{"date-parts":[[2022,12,14]],"date-time":"2022-12-14T13:40:46Z","timestamp":1671025246000},"page":"e1010714","update-policy":"https:\/\/doi.org\/10.1371\/journal.pcbi.corrections_policy","source":"Crossref","is-referenced-by-count":5,"title":["Interpreting tree ensemble machine learning models with endoR"],"prefix":"10.1371","volume":"18","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-5920-1710","authenticated-orcid":true,"given":"Albane","family":"Ruaud","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6203-9777","authenticated-orcid":true,"given":"Niklas","family":"Pfister","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9087-1672","authenticated-orcid":true,"given":"Ruth E.","family":"Ley","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7424-5276","authenticated-orcid":true,"given":"Nicholas D.","family":"Youngblut","sequence":"additional","affiliation":[]}],"member":"340","published-online":{"date-parts":[[2022,12,14]]},"reference":[{"issue":"6086","key":"pcbi.1010714.ref001","doi-asserted-by":"crossref","first-page":"1262","DOI":"10.1126\/science.1223813","article-title":"Host-Gut Microbiota Metabolic Interactions","volume":"336","author":"JK Nicholson","year":"2012","journal-title":"Science (New York, NY)"},{"issue":"1","key":"pcbi.1010714.ref002","doi-asserted-by":"crossref","first-page":"8","DOI":"10.1038\/s41564-017-0072-8","article-title":"Enterotypes in the landscape of gut microbial community composition","volume":"3","author":"PI Costea","year":"2018","journal-title":"Nature microbiology"},{"issue":"1","key":"pcbi.1010714.ref003","doi-asserted-by":"crossref","first-page":"34","DOI":"10.1038\/s42255-018-0017-4","article-title":"Microbial regulation of organismal energy homeostasis","volume":"1","author":"PD Cani","year":"2019","journal-title":"Nature Metabolism"},{"issue":"1","key":"pcbi.1010714.ref004","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s41467-017-01973-8","article-title":"Meta-analysis of gut microbiome studies identifies disease-specific and shared responses","volume":"8","author":"C Duvallet","year":"2017","journal-title":"Nature communications"},{"issue":"5","key":"pcbi.1010714.ref005","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/nmicrobiol.2017.4","article-title":"Dynamics of the human gut microbiome in inflammatory bowel disease","volume":"2","author":"J Halfvarson","year":"2017","journal-title":"Nature microbiology"},{"issue":"11","key":"pcbi.1010714.ref006","doi-asserted-by":"crossref","first-page":"962","DOI":"10.1038\/s41592-018-0176-y","article-title":"Species-level functional profiling of metagenomes and metatranscriptomes","volume":"15","author":"EA Franzosa","year":"2018","journal-title":"Nature methods"},{"issue":"31","key":"pcbi.1010714.ref007","doi-asserted-by":"crossref","first-page":"11070","DOI":"10.1073\/pnas.0504978102","article-title":"Obesity alters gut microbial ecology","volume":"102","author":"RE Ley","year":"2005","journal-title":"Proceedings of the National Academy of Sciences"},{"issue":"7122","key":"pcbi.1010714.ref008","doi-asserted-by":"crossref","first-page":"1027","DOI":"10.1038\/nature05414","article-title":"An obesity-associated gut microbiome with increased capacity for energy harvest","volume":"444","author":"PJ Turnbaugh","year":"2006","journal-title":"Nature"},{"issue":"1","key":"pcbi.1010714.ref009","article-title":"The gut microbiome in atherosclerotic cardiovascular disease","volume":"8","author":"Z Jie","year":"2017","journal-title":"Nature Communications"},{"issue":"10","key":"pcbi.1010714.ref010","doi-asserted-by":"crossref","first-page":"661","DOI":"10.1038\/nrmicro3344","article-title":"The gut microbiota, bacterial metabolites and colorectal cancer","volume":"12","author":"P Louis","year":"2014","journal-title":"Nature reviews microbiology"},{"issue":"11","key":"pcbi.1010714.ref011","doi-asserted-by":"crossref","first-page":"766","DOI":"10.15252\/msb.20145645","article-title":"Potential of fecal microbiota for early-stage detection of colorectal cancer","volume":"10","author":"G Zeller","year":"2014","journal-title":"Molecular Systems Biology"},{"issue":"5","key":"pcbi.1010714.ref012","doi-asserted-by":"crossref","first-page":"878","DOI":"10.1016\/j.cmet.2020.06.005","article-title":"A universal gut-microbiome-derived signature predicts cirrhosis","volume":"32","author":"TG Oh","year":"2020","journal-title":"Cell metabolism"},{"issue":"4","key":"pcbi.1010714.ref013","doi-asserted-by":"crossref","first-page":"667","DOI":"10.1038\/s41591-019-0405-7","article-title":"Metagenomic analysis of colorectal cancer datasets identifies cross-cohort microbial diagnostic signatures and a link with choline degradation","volume":"25","author":"AM Thomas","year":"2019","journal-title":"Nature medicine"},{"issue":"4","key":"pcbi.1010714.ref014","doi-asserted-by":"crossref","first-page":"679","DOI":"10.1038\/s41591-019-0406-6","article-title":"Meta-analysis of fecal metagenomes reveals global microbial signatures that are specific for colorectal cancer","volume":"25","author":"J Wirbel","year":"2019","journal-title":"Nature medicine"},{"issue":"2","key":"pcbi.1010714.ref015","doi-asserted-by":"crossref","first-page":"343","DOI":"10.1111\/j.1574-6976.2010.00251.x","article-title":"Supervised classification of human microbiota","volume":"35","author":"D Knights","year":"2011","journal-title":"FEMS microbiology reviews"},{"issue":"7","key":"pcbi.1010714.ref016","doi-asserted-by":"crossref","first-page":"410","DOI":"10.1038\/s41579-018-0029-9","article-title":"Best practices for analysing microbiomes","volume":"16","author":"R Knight","year":"2018","journal-title":"Nature Reviews Microbiology"},{"issue":"3","key":"pcbi.1010714.ref017","doi-asserted-by":"crossref","DOI":"10.1128\/mBio.00434-20","article-title":"A framework for effective application of machine learning to microbiome-based classification problems","volume":"11","author":"BD Top\u00e7uo\u011flu","year":"2020","journal-title":"Mbio"},{"key":"pcbi.1010714.ref018","doi-asserted-by":"crossref","first-page":"579","DOI":"10.3389\/fgene.2019.00579","article-title":"A review and tutorial of machine learning methods for microbiome host trait prediction","volume":"10","author":"YH Zhou","year":"2019","journal-title":"Frontiers in genetics"},{"issue":"1","key":"pcbi.1010714.ref019","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1023\/A:1010933404324","article-title":"Random forests","volume":"45","author":"L Breiman","year":"2001","journal-title":"Machine learning"},{"key":"pcbi.1010714.ref020","doi-asserted-by":"crossref","DOI":"10.1007\/978-0-387-84858-7","volume-title":"The elements of statistical learning: data mining, inference, and prediction","author":"T Hastie","year":"2009"},{"issue":"12","key":"pcbi.1010714.ref021","doi-asserted-by":"crossref","first-page":"3483","DOI":"10.1016\/j.patcog.2013.05.018","article-title":"Gene selection with guided regularized random forest","volume":"46","author":"H Deng","year":"2013","journal-title":"Pattern Recognition"},{"issue":"1","key":"pcbi.1010714.ref022","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s12859-018-2205-3","article-title":"Taxonomy-aware feature engineering for microbiome classification","volume":"19","author":"M Oudah","year":"2018","journal-title":"BMC bioinformatics"},{"issue":"1","key":"pcbi.1010714.ref023","first-page":"1","article-title":"binomialRF: interpretable combinatoric efficiency of random forests to identify biomarker interactions","volume":"21","author":"SR Zaim","year":"2020","journal-title":"BMC bioinformatics"},{"issue":"11","key":"pcbi.1010714.ref024","doi-asserted-by":"crossref","first-page":"1","DOI":"10.18637\/jss.v036.i11","article-title":"Feature selection with the Boruta package","volume":"36","author":"MB Kursa","year":"2010","journal-title":"J Stat Softw"},{"key":"pcbi.1010714.ref025","unstructured":"Breiman L, Cutler A. Manual on Setting Up, Using, and Understanding Random Forests, ver. 4.0; 2003. Available from: https:\/\/www.stat.berkeley.edu\/~breiman\/RandomForests\/cc_home.htm."},{"issue":"2","key":"pcbi.1010714.ref026","doi-asserted-by":"crossref","first-page":"112","DOI":"10.3390\/genes10020112","article-title":"Using decision tree aggregation with random forest model to identify gut microbes associated with colorectal cancer","volume":"10","author":"D Ai","year":"2019","journal-title":"Genes"},{"issue":"1","key":"pcbi.1010714.ref027","first-page":"629","article-title":"Random intersection trees","volume":"15","author":"RD Shah","year":"2014","journal-title":"The Journal of Machine Learning Research"},{"key":"pcbi.1010714.ref028","unstructured":"Lundberg S, Lee SI. A unified approach to interpreting model predictions. arXiv preprint arXiv:170507874. 2017;."},{"issue":"8","key":"pcbi.1010714.ref029","doi-asserted-by":"crossref","first-page":"1943","DOI":"10.1073\/pnas.1711236115","article-title":"Iterative random forests to discover predictive and stable high-order interactions","volume":"115","author":"S Basu","year":"2018","journal-title":"Proceedings of the National Academy of Sciences"},{"issue":"1","key":"pcbi.1010714.ref030","first-page":"3245","article-title":"DALEX: explainers for complex predictive models in R","volume":"19","author":"P Biecek","year":"2018","journal-title":"The Journal of Machine Learning Research"},{"issue":"2","key":"pcbi.1010714.ref031","doi-asserted-by":"crossref","first-page":"358","DOI":"10.2337\/dc20-1536","article-title":"Interpretable Machine Learning Framework Reveals Robust Gut Microbiome Features Associated With Type 2 Diabetes","volume":"44","author":"W Gou","year":"2021","journal-title":"Diabetes Care"},{"issue":"28","key":"pcbi.1010714.ref032","first-page":"307","article-title":"A value for n-person games","volume":"2","author":"LS Shapley","year":"1953","journal-title":"Contributions to the Theory of Games"},{"key":"pcbi.1010714.ref033","unstructured":"Sundararajan M, Najmi A. The many Shapley values for model explanation. In: International Conference on Machine Learning. PMLR; 2020. p. 9269\u20139278."},{"key":"pcbi.1010714.ref034","unstructured":"Gosiewska A, Biecek P. Do Not Trust Additive Explanations. arXiv. 2019;."},{"issue":"1","key":"pcbi.1010714.ref035","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1038\/s42256-019-0138-9","article-title":"From local explanations to global understanding with explainable AI for trees","volume":"2","author":"SM Lundberg","year":"2020","journal-title":"Nature Machine Intelligence"},{"issue":"8","key":"pcbi.1010714.ref036","doi-asserted-by":"crossref","first-page":"1943","DOI":"10.1073\/pnas.1711236115","article-title":"Iterative random forests to discover predictive and stable high-order interactions","volume":"115","author":"S Basu","year":"2018","journal-title":"Proceedings of the National Academy of Sciences"},{"issue":"3","key":"pcbi.1010714.ref037","doi-asserted-by":"crossref","first-page":"916","DOI":"10.1214\/07-AOAS148","article-title":"Predictive learning via rule ensembles","volume":"2","author":"JH Friedman","year":"2008","journal-title":"Ann Appl Stat"},{"issue":"4","key":"pcbi.1010714.ref038","doi-asserted-by":"crossref","first-page":"277","DOI":"10.1007\/s41060-018-0144-8","article-title":"Interpreting tree ensembles with inTrees","volume":"7","author":"H Deng","year":"2019","journal-title":"International Journal of Data Science and Analytics"},{"key":"pcbi.1010714.ref039","unstructured":"Jiang Y, Biecek P, Paluszy\u0144ska O, agasitko, Kobylinska K. ModelOriented\/randomForestExplainer: CRAN release 0.10.1; 2020. Available from: https:\/\/doi.org\/10.5281\/zenodo.3941250."},{"issue":"11","key":"pcbi.1010714.ref040","doi-asserted-by":"crossref","first-page":"1023","DOI":"10.1038\/nmeth.4468","article-title":"Accessible, curated metagenomic data through ExperimentHub","volume":"14","author":"E Pasolli","year":"2017","journal-title":"Nature methods"},{"issue":"7516","key":"pcbi.1010714.ref041","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1038\/nature13568","article-title":"Alterations of the human gut microbiome in liver cirrhosis","volume":"513","author":"N Qin","year":"2014","journal-title":"Nature"},{"issue":"25","key":"pcbi.1010714.ref042","doi-asserted-by":"crossref","first-page":"4548","DOI":"10.3748\/wjg.v23.i25.4548","article-title":"Distinct gut microbiota profiles in patients with primary sclerosing cholangitis and ulcerative colitis","volume":"23","author":"L Bajer","year":"2017","journal-title":"World journal of gastroenterology"},{"issue":"3","key":"pcbi.1010714.ref043","first-page":"18","article-title":"Classification and regression by randomForest","volume":"2","author":"A Liaw","year":"2002","journal-title":"R news"},{"issue":"1","key":"pcbi.1010714.ref044","doi-asserted-by":"crossref","DOI":"10.18637\/jss.v077.i01","article-title":"ranger: A Fast Implementation of Random Forests for High Dimensional Data in C++ and R","volume":"77","author":"MN Wright","year":"2017","journal-title":"Journal of Statistical Software"},{"key":"pcbi.1010714.ref045","unstructured":"Greenwell B, Boehmke B, Cunningham J, Developers G. gbm: Generalized Boosted Regression Models; 2020. Available from: https:\/\/cran.r-project.org\/package=gbm."},{"key":"pcbi.1010714.ref046","doi-asserted-by":"crossref","unstructured":"Chen T, Guestrin C. Xgboost: A scalable tree boosting system. In: Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining; 2016. p. 785\u2013794.","DOI":"10.1145\/2939672.2939785"},{"issue":"9","key":"pcbi.1010714.ref047","doi-asserted-by":"crossref","first-page":"e1002687","DOI":"10.1371\/journal.pcbi.1002687","article-title":"Inferring correlation networks from genomic survey data","volume":"8","author":"J Friedman","year":"2012","journal-title":"PLoS Comput Biol"},{"key":"pcbi.1010714.ref048","doi-asserted-by":"crossref","unstructured":"Biecek P, Burzykowski T. Explanatory Model Analysis; 2020. https:\/\/pbiecek.github.io\/ema\/preface.html.","DOI":"10.1201\/9780429027192"},{"issue":"5","key":"pcbi.1010714.ref049","doi-asserted-by":"crossref","first-page":"1054","DOI":"10.1016\/j.cmet.2017.04.001","article-title":"Gut Microbiome-Based Metagenomic Signature for Non-invasive Detection of Advanced Fibrosis in Human Nonalcoholic Fatty Liver Disease","volume":"25","author":"R Loomba","year":"2017","journal-title":"Cell Metabolism"},{"issue":"2","key":"pcbi.1010714.ref050","doi-asserted-by":"crossref","first-page":"547","DOI":"10.1099\/ijs.0.02378-0","article-title":"Phylogenetic analysis of some Sporomusa sub-branch members isolated from human clinical specimens: description of Megasphaera micronuciformis sp. nov","volume":"53","author":"H Marchandin","year":"2003","journal-title":"International Journal of Systematic and Evolutionary Microbiology"},{"issue":"19","key":"pcbi.1010714.ref051","doi-asserted-by":"crossref","first-page":"5002","DOI":"10.1128\/JB.00542-10","article-title":"The human oral microbiome","volume":"192","author":"FE Dewhirst","year":"2010","journal-title":"Journal of bacteriology"},{"issue":"6","key":"pcbi.1010714.ref052","doi-asserted-by":"crossref","first-page":"e37919","DOI":"10.1371\/journal.pone.0037919","article-title":"Deep sequencing of the oral microbiome reveals signatures of periodontal disease","volume":"7","author":"B Liu","year":"2012","journal-title":"PloS one"},{"issue":"1","key":"pcbi.1010714.ref053","doi-asserted-by":"crossref","first-page":"267","DOI":"10.1111\/j.2517-6161.1996.tb02080.x","article-title":"Regression shrinkage and selection via the lasso","volume":"58","author":"R Tibshirani","year":"1996","journal-title":"Journal of the Royal Statistical Society: Series B (Methodological)"},{"issue":"9","key":"pcbi.1010714.ref054","doi-asserted-by":"crossref","first-page":"2059","DOI":"10.1038\/ismej.2017.72","article-title":"Genomics and metagenomics of trimethylamine-utilizing Archaea in the human gut microbiome","volume":"11","author":"G Borrel","year":"2017","journal-title":"The ISME Journal"},{"issue":"6","key":"pcbi.1010714.ref055","doi-asserted-by":"crossref","first-page":"e66019","DOI":"10.1371\/journal.pone.0066019","article-title":"Archaea and Fungi of the Human Gut Microbiome: Correlations with Diet and Bacterial Residents","volume":"8","author":"C Hoffmann","year":"2013","journal-title":"PLoS ONE"},{"issue":"3","key":"pcbi.1010714.ref056","doi-asserted-by":"crossref","first-page":"176","DOI":"10.1007\/s002849910035","article-title":"The Effect of a Methanogen, Methanobrevibacter smithii, on the Growth Rate, Organic Acid Production, and Specific ATP Activity of Three Predominant Ruminal Cellulolytic Bacteria","volume":"40","author":"JL Rychlik","year":"2000","journal-title":"Current Microbiology"},{"issue":"1","key":"pcbi.1010714.ref057","doi-asserted-by":"crossref","first-page":"116","DOI":"10.1111\/j.1574-6968.2005.00016.x","article-title":"H2 and acetate transfers during xylan fermentation between a butyrate-producing xylanolytic species and hydrogenotrophic microorganisms from the human gut","volume":"254","author":"C Chassard","year":"2006","journal-title":"FEMS Microbiology Letters"},{"issue":"1","key":"pcbi.1010714.ref058","doi-asserted-by":"crossref","DOI":"10.1128\/mBio.03235-19","article-title":"Syntrophy via Interspecies H2 Transfer between Christensenella and Methanobrevibacter Underlies Their Global Cooccurrence in the Human Gut","volume":"11","author":"A Ruaud","year":"2020","journal-title":"mBio"},{"issue":"1594","key":"pcbi.1010714.ref059","doi-asserted-by":"crossref","first-page":"789","DOI":"10.1016\/j.cell.2014.09.053","article-title":"Human genetics shape the gut microbiome","volume":"6","author":"JK Goodrich","year":"2014","journal-title":"Cell"},{"key":"pcbi.1010714.ref060","article-title":"Pan-genome of the dominant human gut-associated archaeon, Methanobrevibacter smithii, studied in twins","author":"EE Hansen","year":"2011","journal-title":"Proceedings of the National Academy of Sciences"},{"key":"pcbi.1010714.ref061","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/srep28797","article-title":"Impact of dietary resistant starch type 4 on human gut microbiota and immunometabolic functions","volume":"6","author":"B Upadhyaya","year":"2016","journal-title":"Scientific Reports"},{"issue":"5","key":"pcbi.1010714.ref062","doi-asserted-by":"crossref","first-page":"576","DOI":"10.3390\/nu10050576","article-title":"Microbiome Responses to an Uncontrolled Short-Term Diet Intervention in the Frame of the Citizen Science Project","volume":"10","author":"N Klimenko","year":"2018","journal-title":"Nutrients"},{"issue":"1","key":"pcbi.1010714.ref063","doi-asserted-by":"crossref","first-page":"205","DOI":"10.1111\/j.1574-6941.2010.00941.x","article-title":"The cellulose-degrading microbial community of the human gut varies according to the presence or absence of methanogens","volume":"74","author":"C Chassard","year":"2010","journal-title":"FEMS microbiology ecology"},{"issue":"7346","key":"pcbi.1010714.ref064","doi-asserted-by":"crossref","first-page":"174","DOI":"10.1038\/nature09944","article-title":"Enterotypes of the human gut microbiome","volume":"473","author":"M Arumugam","year":"2011","journal-title":"Nature"},{"issue":"6","key":"pcbi.1010714.ref065","doi-asserted-by":"crossref","first-page":"1612","DOI":"10.1007\/s10620-011-1590-5","article-title":"Methane on Breath Testing Is Associated with Constipation: A Systematic Review and Meta-analysis","volume":"56","author":"D Kunkel","year":"2011","journal-title":"Digestive Diseases and Sciences"},{"key":"pcbi.1010714.ref066","doi-asserted-by":"crossref","first-page":"e13056","DOI":"10.1111\/nmo.13056","article-title":"Assessing the colonic microbiome, hydrogenogenic and hydrogenotrophic genes, transit and breath methane in constipation","author":"PG Wolf","year":"2017","journal-title":"Neurogastroenterology & Motility"},{"issue":"1","key":"pcbi.1010714.ref067","doi-asserted-by":"crossref","first-page":"6505","DOI":"10.1038\/ncomms7505","article-title":"Subsistence strategies in traditional societies distinguish gut microbiomes","volume":"6","author":"AJ Obregon-Tito","year":"2015","journal-title":"Nature Communications"},{"issue":"4","key":"pcbi.1010714.ref068","doi-asserted-by":"crossref","first-page":"1379","DOI":"10.1111\/1462-2920.13692","article-title":"Meta-analysis of the human gut microbiome from urbanized and pre-agricultural populations","volume":"19","author":"L Mancabelli","year":"2017","journal-title":"Environmental Microbiology"},{"issue":"1","key":"pcbi.1010714.ref069","doi-asserted-by":"crossref","first-page":"11356","DOI":"10.1038\/s41598-018-29687-x","article-title":"Gut microbiota is associated with obesity and cardiometabolic disease in a population in the midst of Westernization","volume":"8","author":"J de la Cuesta-Zuluaga","year":"2018","journal-title":"Scientific Reports"},{"key":"pcbi.1010714.ref070","doi-asserted-by":"crossref","first-page":"26752","DOI":"10.1038\/srep26752","article-title":"Weight gain in anorexia nervosa does not ameliorate the faecal microbiota, branched chain fatty acid profiles, and gastrointestinal complaints","volume":"6","author":"I Mack","year":"2016","journal-title":"Scientific reports"},{"issue":"9","key":"pcbi.1010714.ref071","doi-asserted-by":"crossref","first-page":"e7125","DOI":"10.1371\/journal.pone.0007125","article-title":"Monitoring bacterial community of human gut microbiota reveals an increase in Lactobacillus in obese patients and Methanogens in anorexic patients","volume":"4","author":"F Armougom","year":"2009","journal-title":"PloS one"},{"issue":"6","key":"pcbi.1010714.ref072","doi-asserted-by":"crossref","first-page":"817","DOI":"10.1038\/ijo.2011.153","article-title":"Obesity-associated gut microbiota is enriched in Lactobacillus reuteri and depleted in Bifidobacterium animalis and Methanobrevibacter smithii","volume":"36","author":"M Million","year":"2012","journal-title":"International Journal of Obesity"},{"issue":"1","key":"pcbi.1010714.ref073","doi-asserted-by":"crossref","first-page":"190","DOI":"10.1038\/oby.2009.167","article-title":"Microbiota and SCFA in lean and overweight healthy subjects","volume":"18","author":"A Schwiertz","year":"2010","journal-title":"Obesity"},{"issue":"3","key":"pcbi.1010714.ref074","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.cmi.2015.10.031","article-title":"Correlation between body mass index and faecal microbiota from children","volume":"22","author":"A Ignacio","year":"2016","journal-title":"Clinical Microbiology and Infection"},{"issue":"1","key":"pcbi.1010714.ref075","doi-asserted-by":"crossref","first-page":"5426","DOI":"10.1038\/s41598-021-84641-8","article-title":"Clinical evidence of the role of Methanobrevibacter smithii in severe acute malnutrition","volume":"11","author":"A Camara","year":"2021","journal-title":"Scientific Reports"},{"issue":"12","key":"pcbi.1010714.ref076","doi-asserted-by":"crossref","first-page":"2508","DOI":"10.1002\/oby.21266","article-title":"Gut colonization with methanobrevibacter smithii is associated with childhood weight development","volume":"23","author":"CA Mbakwa","year":"2015","journal-title":"Obesity"},{"issue":"7","key":"pcbi.1010714.ref077","doi-asserted-by":"crossref","first-page":"2365","DOI":"10.1073\/pnas.0812600106","article-title":"Human gut microbiota in obesity and after gastric bypass","volume":"106","author":"H Zhang","year":"2009","journal-title":"Proceedings of the National Academy of Sciences of the United States of America"},{"key":"pcbi.1010714.ref078","first-page":"457","article-title":"A core gut microbiome in obese and lean twins","author":"PJ Turnbaugh","year":"2009","journal-title":"Nature"},{"issue":"1","key":"pcbi.1010714.ref079","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1079\/PHN19980004","article-title":"The nutrition transition and its health implications in lower-income countries","volume":"1","author":"BM Popkin","year":"1998","journal-title":"Public Health Nutrition"},{"issue":"3","key":"pcbi.1010714.ref080","doi-asserted-by":"crossref","first-page":"649","DOI":"10.1016\/j.cell.2019.01.001","article-title":"Extensive unexplored human microbiome diversity revealed by over 150,000 genomes from metagenomes spanning age, geography, and lifestyle","volume":"176","author":"E Pasolli","year":"2019","journal-title":"Cell"},{"key":"pcbi.1010714.ref081","first-page":"1","article-title":"Navigating the pitfalls of applying machine learning in genomics","author":"S Whalen","year":"2021","journal-title":"Nature Reviews Genetics"},{"key":"pcbi.1010714.ref082","first-page":"291","article-title":"FunGene: the functional gene pipeline and repository","volume":"4","author":"JA Fish","year":"2013","journal-title":"Frontiers in microbiology"},{"key":"pcbi.1010714.ref083","doi-asserted-by":"crossref","DOI":"10.1093\/database\/baz142","article-title":"AcetoBase: a functional gene repository and database for formyltetrahydrofolate synthetase sequences","volume":"2019","author":"A Singh","year":"2019","journal-title":"Database"},{"issue":"1","key":"pcbi.1010714.ref084","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/srep34212","article-title":"HydDB: a web tool for hydrogenase classification and analysis","volume":"6","author":"D S\u00f8ndergaard","year":"2016","journal-title":"Scientific reports"},{"issue":"1","key":"pcbi.1010714.ref085","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13059-019-1843-8","article-title":"DNA extraction for human microbiome studies: the issue of standardization","volume":"20","author":"KL Greathouse","year":"2019","journal-title":"Genome biology"},{"issue":"9","key":"pcbi.1010714.ref086","doi-asserted-by":"crossref","first-page":"e7063","DOI":"10.1371\/journal.pone.0007063","article-title":"High prevalence of Methanobrevibacter smithii and Methanosphaera stadtmanae detected in the human gut using an improved DNA detection protocol","volume":"4","author":"B Dridi","year":"2009","journal-title":"PloS one"},{"issue":"1","key":"pcbi.1010714.ref087","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/1756-0500-6-186","article-title":"A semi-automated protocol for Archaea DNA extraction from stools","volume":"6","author":"S Khelaifia","year":"2013","journal-title":"BMC research notes"},{"key":"pcbi.1010714.ref088","article-title":"Fast gene set enrichment analysis","author":"G Korotkevich","year":"2021","journal-title":"bioRxiv"},{"issue":"6","key":"pcbi.1010714.ref089","doi-asserted-by":"crossref","first-page":"679","DOI":"10.1136\/gut.31.6.679","article-title":"Alternative pathways for hydrogen disposal during fermentation in the human colon","volume":"31","author":"G Gibson","year":"1990","journal-title":"Gut"},{"issue":"3","key":"pcbi.1010714.ref090","doi-asserted-by":"crossref","first-page":"270","DOI":"10.1080\/19490976.2018.1546522","article-title":"Hydrogen cross-feeders of the human gastrointestinal tract","volume":"10","author":"NW Smith","year":"2019","journal-title":"Gut Microbes"},{"issue":"4","key":"pcbi.1010714.ref091","doi-asserted-by":"crossref","first-page":"417","DOI":"10.1111\/j.1467-9868.2010.00740.x","article-title":"Stability selection","volume":"72","author":"N Meinshausen","year":"2010","journal-title":"Journal of the Royal Statistical Society: Series B (Statistical Methodology)"},{"key":"pcbi.1010714.ref092","first-page":"283","article-title":"Analysis of Gut Microbiome Using Explainable Machine Learning Predicts Risk of Diarrhea Associated With Tyrosine Kinase Inhibitor Neratinib: A Pilot Study","volume":"11","author":"CW Wong","year":"2021","journal-title":"Frontiers in Oncology"},{"issue":"1","key":"pcbi.1010714.ref093","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s41598-021-83922-6","article-title":"Explainable AI reveals changes in skin microbiome composition linked to phenotypic differences","volume":"11","author":"AP Carrieri","year":"2021","journal-title":"Scientific reports"},{"key":"pcbi.1010714.ref094","doi-asserted-by":"crossref","first-page":"2050312115601122","DOI":"10.1177\/2050312115601122","article-title":"Periodontal disease and liver cirrhosis: A systematic review","volume":"3","author":"LL Gr\u00f8nkj\u00e6r","year":"2015","journal-title":"SAGE open medicine"},{"issue":"7753","key":"pcbi.1010714.ref095","doi-asserted-by":"crossref","first-page":"505","DOI":"10.1038\/s41586-019-1058-x","article-title":"New insights from uncultivated genomes of the global human gut microbiome","volume":"568","author":"S Nayfach","year":"2019","journal-title":"Nature"},{"issue":"1","key":"pcbi.1010714.ref096","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s12864-020-06807-4","article-title":"Phylogenomics of expanding uncultured environmental Tenericutes provides insights into their pathogenicity and evolutionary relationship with Bacilli","volume":"21","author":"Y Wang","year":"2020","journal-title":"BMC genomics"},{"issue":"1","key":"pcbi.1010714.ref097","doi-asserted-by":"crossref","first-page":"63","DOI":"10.1038\/nchembio.276","article-title":"Relating diffusion along the substrate tunnel and oxygen sensitivity in hydrogenase","volume":"6","author":"PP Liebgott","year":"2010","journal-title":"Nature chemical biology"},{"issue":"8","key":"pcbi.1010714.ref098","doi-asserted-by":"crossref","first-page":"579","DOI":"10.1038\/nrmicro1931","article-title":"Methanogenic archaea: ecologically relevant differences in energy conservation","volume":"6","author":"RK Thauer","year":"2008","journal-title":"Nature Reviews Microbiology"},{"issue":"3","key":"pcbi.1010714.ref099","doi-asserted-by":"crossref","first-page":"761","DOI":"10.1038\/ismej.2015.153","article-title":"Genomic and metagenomic surveys of hydrogenase distribution indicate H 2 is a widely utilised energy source for microbial growth and survival","volume":"10","author":"C Greening","year":"2016","journal-title":"The ISME journal"},{"issue":"7","key":"pcbi.1010714.ref100","doi-asserted-by":"crossref","first-page":"1256","DOI":"10.1038\/ismej.2013.80","article-title":"Dysbiosis in inflammatory bowel diseases: the oxygen hypothesis","volume":"7","author":"L Rigottier-Gois","year":"2013","journal-title":"The ISME journal"},{"issue":"1","key":"pcbi.1010714.ref101","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s12918-017-0522-1","article-title":"Microbiota dysbiosis in inflammatory bowel diseases: in silico investigation of the oxygen hypothesis","volume":"11","author":"MA Henson","year":"2017","journal-title":"BMC systems biology"},{"issue":"6418","key":"pcbi.1010714.ref102","doi-asserted-by":"crossref","DOI":"10.1126\/science.aat9076","article-title":"Colonocyte metabolism shapes the gut microbiota","volume":"362","author":"Y Litvak","year":"2018","journal-title":"Science"},{"key":"pcbi.1010714.ref103","doi-asserted-by":"crossref","first-page":"285","DOI":"10.1016\/j.micpath.2018.01.029","article-title":"Alterations of the human gut Methanobrevibacter smithii as a biomarker for inflammatory bowel diseases","volume":"117","author":"SB Ghavami","year":"2018","journal-title":"Microbial pathogenesis"},{"issue":"1","key":"pcbi.1010714.ref104","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/1471-2180-8-79","article-title":"Human methanogen diversity and incidence in healthy and diseased colonic groups using mcrA gene analysis","volume":"8","author":"PD Scanlan","year":"2008","journal-title":"BMC microbiology"},{"issue":"5","key":"pcbi.1010714.ref105","doi-asserted-by":"crossref","first-page":"813","DOI":"10.1136\/gutjnl-2016-313235","article-title":"A microbial signature for Crohn\u2019s disease","volume":"66","author":"V Pascal","year":"2017","journal-title":"Gut"},{"issue":"3","key":"pcbi.1010714.ref106","doi-asserted-by":"crossref","first-page":"835","DOI":"10.1111\/1462-2920.13658","article-title":"Oscillospira and related bacteria\u2013From metagenomic species to metabolic features","volume":"19","author":"U Gophna","year":"2017","journal-title":"Environmental microbiology"},{"issue":"6","key":"pcbi.1010714.ref107","doi-asserted-by":"crossref","first-page":"512","DOI":"10.1097\/01.mcg.0000165665.94777.bd","article-title":"Breath methane associated with slow colonic transit time in children with chronic constipation","volume":"39","author":"ACF Soares","year":"2005","journal-title":"Journal of clinical gastroenterology"},{"issue":"6","key":"pcbi.1010714.ref108","doi-asserted-by":"crossref","DOI":"10.1038\/ajg.2009.655","article-title":"Methanogenic flora is associated with altered colonic transit but not stool characteristics in constipation without IBS","volume":"105","author":"A Attaluri","year":"2010","journal-title":"The American journal of gastroenterology"},{"issue":"6","key":"pcbi.1010714.ref109","doi-asserted-by":"crossref","first-page":"G1089","DOI":"10.1152\/ajpgi.00574.2004","article-title":"Methane, a gas produced by enteric bacteria, slows intestinal transit and augments small intestinal contractile activity","volume":"290","author":"M Pimentel","year":"2006","journal-title":"American Journal of Physiology-Gastrointestinal and Liver Physiology"},{"issue":"1","key":"pcbi.1010714.ref110","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s41598-018-29687-x","article-title":"Gut microbiota is associated with obesity and cardiometabolic disease in a population in the midst of Westernization","volume":"8","author":"J de la Cuesta-Zuluaga","year":"2018","journal-title":"Scientific reports"},{"issue":"4","key":"pcbi.1010714.ref111","doi-asserted-by":"crossref","first-page":"527","DOI":"10.1016\/j.celrep.2015.03.049","article-title":"The Gut Microbiota of Rural Papua New Guineans: Composition, Diversity Patterns, and Ecological Processes","volume":"11","author":"I Mart\u00ednez","year":"2015","journal-title":"Cell Reports"},{"issue":"3","key":"pcbi.1010714.ref112","doi-asserted-by":"crossref","first-page":"e1500183","DOI":"10.1126\/sciadv.1500183","article-title":"The microbiome of uncontacted Amerindians","volume":"1","author":"JC Clemente","year":"2015","journal-title":"Science Advances"},{"key":"pcbi.1010714.ref113","doi-asserted-by":"crossref","DOI":"10.1002\/9781118625590","volume-title":"Applied regression analysis","author":"NR Draper","year":"1998"},{"issue":"1","key":"pcbi.1010714.ref114","doi-asserted-by":"crossref","first-page":"37","DOI":"10.1177\/001316446002000104","article-title":"A coefficient of agreement for nominal scales","volume":"20","author":"J Cohen","year":"1960","journal-title":"Educational and psychological measurement"},{"key":"pcbi.1010714.ref115","article-title":"Incorporating genome-based phylogeny and trait similarity into diversity assessments helps to resolve a global collection of human gut metagenomes","author":"ND Youngblut","year":"2020","journal-title":"bioRxiv"},{"issue":"1","key":"pcbi.1010714.ref116","doi-asserted-by":"crossref","first-page":"1","DOI":"10.18637\/jss.v033.i01","article-title":"Regularization paths for generalized linear models via coordinate descent","volume":"33","author":"J Friedman","year":"2010","journal-title":"Journal of statistical software"},{"issue":"6","key":"pcbi.1010714.ref117","doi-asserted-by":"crossref","first-page":"e1002358","DOI":"10.1371\/journal.pcbi.1002358","article-title":"Metabolic reconstruction for metagenomic data and its application to the human microbiome","volume":"8","author":"S Abubucker","year":"2012","journal-title":"PLoS computational biology"},{"issue":"D1","key":"pcbi.1010714.ref118","doi-asserted-by":"crossref","first-page":"D445","DOI":"10.1093\/nar\/gkz862","article-title":"The MetaCyc database of metabolic pathways and enzymes-a 2019 update","volume":"48","author":"R Caspi","year":"2020","journal-title":"Nucleic acids research"},{"issue":"1","key":"pcbi.1010714.ref119","first-page":"1","article-title":"Improved metagenomic analysis with Kraken 2","volume":"20","author":"DE Wood","year":"2019","journal-title":"Genome biology"},{"key":"pcbi.1010714.ref120","doi-asserted-by":"crossref","first-page":"e104","DOI":"10.7717\/peerj-cs.104","article-title":"Bracken: estimating species abundance in metagenomics data","volume":"3","author":"J Lu","year":"2017","journal-title":"PeerJ Computer Science"},{"issue":"6","key":"pcbi.1010714.ref121","doi-asserted-by":"crossref","first-page":"1925","DOI":"10.1093\/bioinformatics\/btz848","article-title":"GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database","volume":"36","author":"PA Chaumeil","year":"2019","journal-title":"Bioinformatics"},{"issue":"7","key":"pcbi.1010714.ref122","doi-asserted-by":"crossref","first-page":"2314","DOI":"10.1093\/bioinformatics\/btz899","article-title":"Struo: a pipeline for building custom databases for common metagenome profilers","volume":"36","author":"J de la Cuesta-Zuluaga","year":"2020","journal-title":"Bioinformatics"},{"issue":"3","key":"pcbi.1010714.ref123","doi-asserted-by":"crossref","first-page":"432","DOI":"10.1093\/biostatistics\/kxm045","article-title":"Sparse inverse covariance estimation with the graphical lasso","volume":"9","author":"J Friedman","year":"2008","journal-title":"Biostatistics"},{"issue":"5","key":"pcbi.1010714.ref124","doi-asserted-by":"crossref","first-page":"e1004226","DOI":"10.1371\/journal.pcbi.1004226","article-title":"Sparse and compositionally robust inference of microbial ecological networks","volume":"11","author":"ZD Kurtz","year":"2015","journal-title":"PLoS Comput Biol"},{"issue":"10","key":"pcbi.1010714.ref125","doi-asserted-by":"crossref","first-page":"749","DOI":"10.1038\/s41551-018-0304-0","article-title":"Explainable machine-learning predictions for the prevention of hypoxaemia during surgery","volume":"2","author":"SM Lundberg","year":"2018","journal-title":"Nat Biomed Eng"},{"key":"pcbi.1010714.ref126","doi-asserted-by":"crossref","unstructured":"Liu Y, Just A. SHAPforxgboost: SHAP Plots for\u2019XGBoost\u2019; 2020. Available from: https:\/\/github.com\/liuyanguu\/SHAPforxgboost\/.","DOI":"10.32614\/CRAN.package.SHAPforxgboost"},{"issue":"1","key":"pcbi.1010714.ref127","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1038\/nmeth.3176","article-title":"Fast and sensitive protein alignment using DIAMOND","volume":"12","author":"B Buchfink","year":"2015","journal-title":"Nature methods"},{"key":"pcbi.1010714.ref128","unstructured":"R Core Team. R: A Language and Environment for Statistical Computing; 2020. Available from: https:\/\/www.R-project.org\/."}],"updated-by":[{"DOI":"10.1371\/journal.pcbi.1010714","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2022,12,28]],"date-time":"2022-12-28T00:00:00Z","timestamp":1672185600000}}],"container-title":["PLOS Computational Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1010714","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,10,10]],"date-time":"2024-10-10T06:43:36Z","timestamp":1728542616000},"score":1,"resource":{"primary":{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1010714"}},"subtitle":[],"editor":[{"given":"Luis Pedro","family":"Coelho","sequence":"first","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2022,12,14]]},"references-count":128,"journal-issue":{"issue":"12","published-online":{"date-parts":[[2022,12,14]]}},"URL":"https:\/\/doi.org\/10.1371\/journal.pcbi.1010714","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2022.01.03.474763","asserted-by":"object"}]},"ISSN":["1553-7358"],"issn-type":[{"value":"1553-7358","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,12,14]]}}}