{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,7]],"date-time":"2026-04-07T16:41:39Z","timestamp":1775580099401,"version":"3.50.1"},"update-to":[{"DOI":"10.1371\/journal.pcbi.1010108","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2022,7,22]],"date-time":"2022-07-22T00:00:00Z","timestamp":1658448000000}}],"reference-count":40,"publisher":"Public Library of Science (PLoS)","issue":"7","license":[{"start":{"date-parts":[[2022,7,6]],"date-time":"2022-07-06T00:00:00Z","timestamp":1657065600000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["www.ploscompbiol.org"],"crossmark-restriction":false},"short-container-title":["PLoS Comput Biol"],"abstract":"<jats:p>Determining associations between intestinal bacteria and continuously measured physiological outcomes is important for understanding the bacteria-host relationship but is not straightforward since abundance data (compositional data) are not normally distributed. To address this issue, we developed a fully Bayesian linear regression model (BRACoD;<jats:underline>B<\/jats:underline>ayesian<jats:underline>R<\/jats:underline>egression<jats:underline>A<\/jats:underline>nalysis of<jats:underline>Co<\/jats:underline>mpositional<jats:underline>D<\/jats:underline>ata) with physiological measurements (continuous data) as a function of a matrix of relative bacterial abundances. Bacteria can be classified as operational taxonomic units or by taxonomy (genus, family, etc.). Bacteria associated with the physiological measurement were identified using a Bayesian variable selection method: Stochastic Search Variable Selection. The output is a list of inclusion probabilities (<jats:inline-formula id=\"pcbi.1010108.e001\"><jats:alternatives><jats:graphic xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" id=\"pcbi.1010108.e001g\" mimetype=\"image\" position=\"anchor\" xlink:href=\"info:doi\/10.1371\/journal.pcbi.1010108.e001\" xlink:type=\"simple\"\/><mml:math xmlns:mml=\"http:\/\/www.w3.org\/1998\/Math\/MathML\" display=\"inline\" id=\"M1\"><mml:mover accent=\"true\"><mml:mrow><mml:mi>p<\/mml:mi><\/mml:mrow><mml:mo>^<\/mml:mo><\/mml:mover><\/mml:math><\/jats:alternatives><\/jats:inline-formula>) and coefficients that indicate the strength of the association (<jats:inline-formula id=\"pcbi.1010108.e002\"><jats:alternatives><jats:graphic xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" id=\"pcbi.1010108.e002g\" mimetype=\"image\" position=\"anchor\" xlink:href=\"info:doi\/10.1371\/journal.pcbi.1010108.e002\" xlink:type=\"simple\"\/><mml:math xmlns:mml=\"http:\/\/www.w3.org\/1998\/Math\/MathML\" display=\"inline\" id=\"M2\"><mml:msub><mml:mrow><mml:mover accent=\"true\"><mml:mrow><mml:mi>\u03b2<\/mml:mi><\/mml:mrow><mml:mo>^<\/mml:mo><\/mml:mover><\/mml:mrow><mml:mrow><mml:mi>i<\/mml:mi><mml:mi>n<\/mml:mi><mml:mi>c<\/mml:mi><mml:mi>l<\/mml:mi><mml:mi>u<\/mml:mi><mml:mi>d<\/mml:mi><mml:mi>e<\/mml:mi><mml:mi>d<\/mml:mi><\/mml:mrow><\/mml:msub><\/mml:math><\/jats:alternatives><\/jats:inline-formula>) for each bacterial taxa. Tests with simulated communities showed that adopting a cut point value of<jats:inline-formula id=\"pcbi.1010108.e003\"><jats:alternatives><jats:graphic xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" id=\"pcbi.1010108.e003g\" mimetype=\"image\" position=\"anchor\" xlink:href=\"info:doi\/10.1371\/journal.pcbi.1010108.e003\" xlink:type=\"simple\"\/><mml:math xmlns:mml=\"http:\/\/www.w3.org\/1998\/Math\/MathML\" display=\"inline\" id=\"M3\"><mml:mover accent=\"true\"><mml:mrow><mml:mi>p<\/mml:mi><\/mml:mrow><mml:mo>^<\/mml:mo><\/mml:mover><\/mml:math><\/jats:alternatives><\/jats:inline-formula>\u2265 0.3 for identifying included bacteria optimized the true positive rate (TPR) while maintaining a false positive rate (FPR) of \u2264 5%. At this point, the chances of identifying non-contributing bacteria were low and all well-established contributors were included. Comparison with other methods showed that BRACoD (at<jats:inline-formula id=\"pcbi.1010108.e004\"><jats:alternatives><jats:graphic xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" id=\"pcbi.1010108.e004g\" mimetype=\"image\" position=\"anchor\" xlink:href=\"info:doi\/10.1371\/journal.pcbi.1010108.e004\" xlink:type=\"simple\"\/><mml:math xmlns:mml=\"http:\/\/www.w3.org\/1998\/Math\/MathML\" display=\"inline\" id=\"M4\"><mml:mover accent=\"true\"><mml:mrow><mml:mi>p<\/mml:mi><\/mml:mrow><mml:mo>^<\/mml:mo><\/mml:mover><\/mml:math><\/jats:alternatives><\/jats:inline-formula>\u2265 0.3) had higher precision and a higher TPR than a commonly used center log transformed LASSO procedure (clr-LASSO) as well as higher TPR than an off-the-shelf Spike and Slab method after center log transformation (clr-SS). BRACoD was also less likely to include non-contributing bacteria that merely correlate with contributing bacteria. Analysis of a rat microbiome experiment identified 47 operational taxonomic units that contributed to fecal butyrate levels. Of these, 31 were positively and 16 negatively associated with butyrate. Consistent with their known role in butyrate metabolism, most of these fell within the Lachnospiraceae and Ruminococcaceae. We conclude that BRACoD provides a more precise and accurate method for determining bacteria associated with a continuous physiological outcome compared to clr-LASSO. It is more sensitive than a generalized clr-SS algorithm, although it has a higher FPR. Its ability to distinguish genuine contributors from correlated bacteria makes it better suited to discriminating bacteria that directly contribute to an outcome. The algorithm corrects for the distortions arising from compositional data making it appropriate for analysis of microbiome data.<\/jats:p>","DOI":"10.1371\/journal.pcbi.1010108","type":"journal-article","created":{"date-parts":[[2022,7,6]],"date-time":"2022-07-06T17:47:46Z","timestamp":1657129666000},"page":"e1010108","update-policy":"https:\/\/doi.org\/10.1371\/journal.pcbi.corrections_policy","source":"Crossref","is-referenced-by-count":3,"title":["A Bayesian method for identifying associations between response variables and bacterial community composition"],"prefix":"10.1371","volume":"18","author":[{"given":"Adrian","family":"Verster","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0763-1214","authenticated-orcid":true,"given":"Nicholas","family":"Petronella","sequence":"additional","affiliation":[]},{"given":"Judy","family":"Green","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3594-1380","authenticated-orcid":true,"given":"Fernando","family":"Matias","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1169-6221","authenticated-orcid":true,"given":"Stephen P. J.","family":"Brooks","sequence":"additional","affiliation":[]}],"member":"340","published-online":{"date-parts":[[2022,7,6]]},"reference":[{"issue":"23","key":"pcbi.1010108.ref001","doi-asserted-by":"crossref","first-page":"7537","DOI":"10.1128\/AEM.01541-09","article-title":"Introducing mothur: open-source, platform-independent, community-supported software for describing and comparing microbial communities","volume":"75","author":"PD Schloss","year":"2009","journal-title":"Appl Environ Microbiol"},{"key":"pcbi.1010108.ref002","doi-asserted-by":"crossref","first-page":"D593","DOI":"10.1093\/nar\/gku1201","article-title":"rrnDB: improved tools for interpreting rRNA gene abundance in bacteria and archaea and a new foundation for future development","volume":"43","author":"SF Stoddard","year":"2015","journal-title":"Nucleic Acids Res"},{"key":"pcbi.1010108.ref003","doi-asserted-by":"crossref","first-page":"2224","DOI":"10.3389\/fmicb.2017.02224","article-title":"Microbiome Datasets Are Compositional: And This Is Not Optional.","volume":"8","author":"GB Gloor","year":"2017","journal-title":"Front Microbiol"},{"issue":"19","key":"pcbi.1010108.ref004","doi-asserted-by":"crossref","first-page":"3172","DOI":"10.1093\/bioinformatics\/btv349","article-title":"CCLasso: correlation inference for compositional data through Lasso","volume":"31","author":"H Fang","year":"2015","journal-title":"Bioinformatics"},{"issue":"9","key":"pcbi.1010108.ref005","doi-asserted-by":"crossref","first-page":"e1002687","DOI":"10.1371\/journal.pcbi.1002687","article-title":"Inferring correlation networks from genomic survey data.","volume":"8","author":"J Friedman","year":"2012","journal-title":"PLoS Comput Biol"},{"issue":"11","key":"pcbi.1010108.ref006","doi-asserted-by":"crossref","first-page":"e1005852","DOI":"10.1371\/journal.pcbi.1005852","article-title":"A Bayesian method for detecting pairwise associations in compositional data.","volume":"13","author":"E Schwager","year":"2017","journal-title":"PLoS Comput Biol"},{"key":"pcbi.1010108.ref007","doi-asserted-by":"crossref","first-page":"107315","DOI":"10.1016\/j.csda.2021.107315","article-title":"Robust regression with compositional covariates.","volume":"165","author":"A Mishra","year":"2022","journal-title":"Comput Stat Data Anal"},{"issue":"2","key":"pcbi.1010108.ref008","doi-asserted-by":"crossref","first-page":"217","DOI":"10.1007\/s12561-020-09283-2","article-title":"Regression models for compositional data: General log-contrast formulations, proximal optimization, and microbiome data applications.","volume":"13","author":"PL Combettes","year":"2021","journal-title":"Statistics in Biosciences"},{"key":"pcbi.1010108.ref009","doi-asserted-by":"crossref","first-page":"106","DOI":"10.1016\/j.ab.2018.12.008","article-title":"Evaluation of metabolite-microbe correlation detection methods","volume":"567","author":"Y You","year":"2019","journal-title":"Anal Biochem"},{"key":"pcbi.1010108.ref010","article-title":"High-dimensional log-error-in-variable regression with applications to microbial compositional data analysis","author":"P Shi","year":"2018"},{"issue":"4","key":"pcbi.1010108.ref011","doi-asserted-by":"crossref","first-page":"785","DOI":"10.1093\/biomet\/asu031","article-title":"Variable selection in regression with compositional covariates","volume":"101","author":"W Lin","year":"2014","journal-title":"Biometrika"},{"issue":"7","key":"pcbi.1010108.ref012","doi-asserted-by":"crossref","first-page":"8033","DOI":"10.1096\/fj.201900177R","article-title":"Ingestion of resistant starch by mice markedly increases microbiome-derived metabolites","volume":"33","author":"YC Koay","year":"2019","journal-title":"FASEB J"},{"issue":"7351","key":"pcbi.1010108.ref013","doi-asserted-by":"crossref","first-page":"327","DOI":"10.1038\/nature10213","article-title":"Human nutrition, the gut microbiome and the immune system","volume":"474","author":"AL Kau","year":"2011","journal-title":"Nature"},{"issue":"2","key":"pcbi.1010108.ref014","first-page":"202","article-title":"Gut Microbial Metabolites Fuel Host Antibody Responses. Cell Host Microbe","volume":"20","author":"M Kim","year":"2016"},{"issue":"3","key":"pcbi.1010108.ref015","doi-asserted-by":"crossref","first-page":"930","DOI":"10.3945\/ajcn.117.156265","article-title":"Short-chain fatty acids, prebiotics, synbiotics, and systemic inflammation: a systematic review and meta-analysis","volume":"106","author":"RF McLoughlin","year":"2017","journal-title":"Am J Clin Nutr"},{"issue":"3","key":"pcbi.1010108.ref016","doi-asserted-by":"crossref","first-page":"635","DOI":"10.3945\/ajcn.116.146928","article-title":"Substituting whole grains for refined grains in a 6-wk randomized trial has a modest effect on gut microbiota and immune and inflammatory markers of healthy adults","volume":"105","author":"SM Vanegas","year":"2017","journal-title":"Am J Clin Nutr"},{"issue":"Suppl","key":"pcbi.1010108.ref017","doi-asserted-by":"crossref","first-page":"S120","DOI":"10.1097\/MCG.0b013e31822fecfe","article-title":"Fermentation in the human large intestine: its physiologic consequences and the potential contribution of prebiotics.","volume":"45","author":"GT Macfarlane","year":"2011","journal-title":"J Clin Gastroenterol."},{"issue":"1","key":"pcbi.1010108.ref018","doi-asserted-by":"crossref","first-page":"93","DOI":"10.1136\/gutjnl-2014-307264","article-title":"Diets that differ in their FODMAP content alter the colonic luminal microenvironment","volume":"64","author":"EP Halmos","year":"2015","journal-title":"Gut"},{"issue":"1","key":"pcbi.1010108.ref019","doi-asserted-by":"crossref","first-page":"11789","DOI":"10.1038\/s41598-017-10722-2","article-title":"Fructooligosaccharide (FOS) and Galactooligosaccharide (GOS) Increase Bifidobacterium but Reduce Butyrate Producing Bacteria with Adverse Glycemic Metabolism in healthy young population","volume":"7","author":"F Liu","year":"2017","journal-title":"Sci Rep"},{"issue":"1","key":"pcbi.1010108.ref020","doi-asserted-by":"crossref","first-page":"30","DOI":"10.1111\/1574-6941.12186","article-title":"Prebiotic stimulation of human colonic butyrate-producing bacteria and bifidobacteria, in vitro","volume":"87","author":"KP Scott","year":"2014","journal-title":"FEMS Microbiol Ecol"},{"key":"pcbi.1010108.ref021","first-page":"51","volume-title":"The Large Intestine: Physiology, Pathophysiology, and Disease.","author":"GT Macfarlane","year":"1991"},{"issue":"7","key":"pcbi.1010108.ref022","doi-asserted-by":"crossref","first-page":"2099","DOI":"10.1128\/JB.186.7.2099-2106.2004","article-title":"Restricted distribution of the butyrate kinase pathway among butyrate-producing bacteria from the human colon","volume":"186","author":"P Louis","year":"2004","journal-title":"J Bacteriol"},{"issue":"2","key":"pcbi.1010108.ref023","doi-asserted-by":"crossref","first-page":"304","DOI":"10.1111\/j.1462-2920.2009.02066.x","article-title":"Diversity of human colonic butyrate-producing bacteria revealed by analysis of the butyryl-CoA:acetate CoA-transferase gene","volume":"12","author":"P Louis","year":"2010","journal-title":"Environ Microbiol"},{"issue":"2","key":"pcbi.1010108.ref024","doi-asserted-by":"crossref","first-page":"e00889","DOI":"10.1128\/mBio.00889-14","article-title":"Revealing the bacterial butyrate synthesis pathways by analyzing (meta)genomic data.","volume":"5","author":"M Vital","year":"2014","journal-title":"mBio"},{"issue":"5","key":"pcbi.1010108.ref025","doi-asserted-by":"crossref","first-page":"3593","DOI":"10.1128\/AEM.72.5.3593-3599.2006","article-title":"Two routes of metabolic cross-feeding between Bifidobacterium adolescentis and butyrate-producing anaerobes from the human gut","volume":"72","author":"A Belenguer","year":"2006","journal-title":"Appl Environ Microbiol"},{"issue":"10","key":"pcbi.1010108.ref026","doi-asserted-by":"crossref","first-page":"5810","DOI":"10.1128\/AEM.70.10.5810-5817.2004","article-title":"Lactate-utilizing bacteria, isolated from human feces, that produce butyrate as a major fermentation product","volume":"70","author":"SH Duncan","year":"2004","journal-title":"Appl Environ Microbiol"},{"issue":"2","key":"pcbi.1010108.ref027","doi-asserted-by":"crossref","first-page":"730","DOI":"10.1214\/009053604000001147","article-title":"Spike and slab variable selection: Frequentist and Bayesian strategies","volume":"33","author":"H Ishwaran","year":"2005","journal-title":"The Annals of Statistics"},{"issue":"1","key":"pcbi.1010108.ref028","doi-asserted-by":"crossref","first-page":"267","DOI":"10.1111\/j.2517-6161.1996.tb02080.x","article-title":"Regression Shrinkage and Selection via the Lasso","volume":"58","author":"R. Tibshirani","year":"1996","journal-title":"Journal of the Royal Statistical Society Series B (Methodological)."},{"key":"pcbi.1010108.ref029","doi-asserted-by":"crossref","first-page":"e55","DOI":"10.7717\/peerj-cs.55","article-title":"Probabilistic programming in Python using PyMC3","volume":"2","author":"J Salvatier","year":"2016","journal-title":"PeerJ Computer Science"},{"issue":"1","key":"pcbi.1010108.ref030","first-page":"1593","article-title":"The No-U-Turn sampler: adaptively setting path lengths in Hamiltonian Monte Carlo.","volume":"15","author":"MD Hoffman","year":"2014","journal-title":"J Mach Learn Res"},{"issue":"2","key":"pcbi.1010108.ref031","doi-asserted-by":"crossref","first-page":"163","DOI":"10.1007\/s11222-011-9299-z","article-title":"Sequential Monte Carlo on large binary sampling spaces","volume":"23","author":"C Sch\u00e4fer","year":"2013","journal-title":"Statistics and Computing"},{"issue":"5","key":"pcbi.1010108.ref032","doi-asserted-by":"crossref","first-page":"717","DOI":"10.1093\/jn\/127.5.717","article-title":"Acarbose enhances human colonic butyrate production","volume":"127","author":"GA Weaver","year":"1997","journal-title":"J Nutr"},{"issue":"1","key":"pcbi.1010108.ref033","doi-asserted-by":"crossref","first-page":"65","DOI":"10.1139\/m92-010","article-title":"A general method of isolating high molecular weight DNA from methanogenic archaea (archaebacteria).","volume":"38","author":"KF Jarrell","year":"1992","journal-title":"Can J Microbiol"},{"issue":"17","key":"pcbi.1010108.ref034","doi-asserted-by":"crossref","first-page":"5112","DOI":"10.1128\/AEM.01043-13","article-title":"Development of a dual-index sequencing strategy and curation pipeline for analyzing amplicon sequence data on the MiSeq Illumina sequencing platform","volume":"79","author":"JJ Kozich","year":"2013","journal-title":"Appl Environ Microbiol"},{"issue":"5","key":"pcbi.1010108.ref035","doi-asserted-by":"crossref","first-page":"1516","DOI":"10.1111\/jam.12157","article-title":"Temporal change in the gut community of rats fed high amylose cornstarch is driven by endogenous urea rather than strictly on carbohydrate availability","volume":"114","author":"M Kalmokoff","year":"2013","journal-title":"J Appl Microbiol"},{"issue":"1","key":"pcbi.1010108.ref036","doi-asserted-by":"crossref","first-page":"17","DOI":"10.1186\/s40168-019-0633-6","article-title":"CAMISIM: simulating metagenomes and microbial communities.","volume":"7","author":"A Fritz","year":"2019","journal-title":"Microbiome"},{"issue":"4","key":"pcbi.1010108.ref037","doi-asserted-by":"crossref","first-page":"698","DOI":"10.1093\/biostatistics\/kxy025","article-title":"Zero-inflated generalized Dirichlet multinomial regression model for microbiome compositional data analysis","volume":"20","author":"ZZ Tang","year":"2019","journal-title":"Biostatistics"},{"issue":"1","key":"pcbi.1010108.ref038","doi-asserted-by":"crossref","first-page":"47","DOI":"10.1080\/02664763.2018.1454894","article-title":"Clustering transformed compositional data using K-means, with applications in gene expression and bicycle sharing system data","volume":"46","author":"A Godichon-Baggioni","year":"2018","journal-title":"Journal of Applied Statistics"},{"issue":"10","key":"pcbi.1010108.ref039","doi-asserted-by":"crossref","first-page":"5186","DOI":"10.1128\/AEM.68.10.5186-5190.2002","article-title":"Acetate utilization and butyryl coenzyme A (CoA):acetate-CoA transferase in butyrate-producing bacteria from the human large intestine","volume":"68","author":"SH Duncan","year":"2002","journal-title":"Appl Environ Microbiol"},{"issue":"5","key":"pcbi.1010108.ref040","doi-asserted-by":"crossref","first-page":"1589","DOI":"10.1128\/aem.62.5.1589-1592.1996","article-title":"Pathways of acetate, propionate, and butyrate formation by the human fecal microbial flora","volume":"62","author":"TL Miller","year":"1996","journal-title":"Appl Environ Microbiol"}],"updated-by":[{"DOI":"10.1371\/journal.pcbi.1010108","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2022,7,22]],"date-time":"2022-07-22T00:00:00Z","timestamp":1658448000000}}],"container-title":["PLOS Computational Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1010108","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,11]],"date-time":"2023-02-11T00:47:22Z","timestamp":1676076442000},"score":1,"resource":{"primary":{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1010108"}},"subtitle":[],"editor":[{"given":"Jason A.","family":"Papin","sequence":"first","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2022,7,6]]},"references-count":40,"journal-issue":{"issue":"7","published-online":{"date-parts":[[2022,7,6]]}},"URL":"https:\/\/doi.org\/10.1371\/journal.pcbi.1010108","relation":{"new_version":[{"id-type":"doi","id":"10.1371\/journal.pcbi.1010108","asserted-by":"object"}]},"ISSN":["1553-7358"],"issn-type":[{"value":"1553-7358","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,7,6]]}}}