{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,8]],"date-time":"2026-05-08T02:41:33Z","timestamp":1778208093267,"version":"3.51.4"},"reference-count":32,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2020,7,22]],"date-time":"2020-07-22T00:00:00Z","timestamp":1595376000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2020,7,22]],"date-time":"2020-07-22T00:00:00Z","timestamp":1595376000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2020,12]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Background<\/jats:title><jats:p>Understanding the relation between the human microbiome and modulating factors, such as diet, may help researchers design intervention strategies that promote and maintain healthy microbial communities. Numerous analytical tools are available to help identify these relations, oftentimes via automated variable selection methods. However, available tools frequently ignore evolutionary relations among microbial taxa, potential relations between modulating factors, as well as model selection uncertainty.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>We present MicroBVS, an R package for Dirichlet-tree multinomial models with Bayesian variable selection, for the identification of covariates associated with microbial taxa abundance data. 
The underlying Bayesian model accommodates phylogenetic structure in the abundance data and various parameterizations of covariates\u2019 prior probabilities of inclusion.<\/jats:p><\/jats:sec><jats:sec><jats:title>Conclusion<\/jats:title><jats:p>While developed to study the human microbiome, our software can be employed in various research applications, where the aim is to generate insights into the relations between a set of covariates and compositional data with or without a known tree-like structure.<\/jats:p><\/jats:sec>","DOI":"10.1186\/s12859-020-03640-0","type":"journal-article","created":{"date-parts":[[2020,7,13]],"date-time":"2020-07-13T12:06:02Z","timestamp":1594641962000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":6,"title":["MicroBVS: Dirichlet-tree multinomial regression models with Bayesian variable selection - an R package"],"prefix":"10.1186","volume":"21","author":[{"given":"Matthew D.","family":"Koslovsky","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Marina","family":"Vannucci","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2020,7,22]]},"reference":[{"issue":"1-2","key":"3640_CR1","doi-asserted-by":"publisher","first-page":"236","DOI":"10.1038\/pr.2014.170","volume":"77","author":"Y Sanz","year":"2014","unstructured":"Sanz Y, Olivares M, Moya-P\u00e9rez \u00c1, Agostoni C. Understanding the role of gut microbiome in metabolic disease risk. Pediatr Res. 2014; 77(1-2):236\u201344.","journal-title":"Pediatr Res"},{"key":"3640_CR2","doi-asserted-by":"publisher","first-page":"73","DOI":"10.1146\/annurev-statistics-010814-020351","volume":"2","author":"H Li","year":"2015","unstructured":"Li H. Microbiome, metagenomics, and high-dimensional compositional data analysis. Annu Rev Stat Appl. 
2015; 2:73\u201394.","journal-title":"Annu Rev Stat Appl"},{"issue":"1","key":"3640_CR3","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1080\/10618600.2016.1154063","volume":"26","author":"Y Zhang","year":"2017","unstructured":"Zhang Y, Zhou H, Zhou J, Sun W. Regression models for multivariate count data. J Comput Graph Stat. 2017; 26(1):1\u201313.","journal-title":"J Comput Graph Stat"},{"issue":"12","key":"3640_CR4","doi-asserted-by":"publisher","first-page":"52078","DOI":"10.1371\/journal.pone.0052078","volume":"7","author":"PS La Rosa","year":"2012","unstructured":"La Rosa PS, Brooks JP, Deych E, Boone EL, Edwards DJ, Wang Q, Sodergren E, Weinstock G, Shannon WD. Hypothesis testing and power calculations for taxonomic-based human microbiome data. PloS ONE. 2012; 7(12):52078.","journal-title":"PloS ONE"},{"issue":"1","key":"3640_CR5","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s12859-016-1414-x","volume":"18","author":"WD Wadsworth","year":"2017","unstructured":"Wadsworth WD, Argiento R, Guindani M, Galloway-Pena J, Shelburne SA, Vannucci M. An integrative Bayesian Dirichlet-multinomial regression model for the analysis of taxonomic abundances in microbiome data. BMC Bioinformatics. 2017; 18(1):1\u201312.","journal-title":"BMC Bioinformatics"},{"issue":"1","key":"3640_CR6","doi-asserted-by":"publisher","first-page":"418","DOI":"10.1214\/12-AOAS592","volume":"7","author":"J Chen","year":"2013","unstructured":"Chen J, Li H. Variable selection for sparse Dirichlet-multinomial regression with an application to microbiome data analysis. Ann Appl Stat. 2013; 7(1):418\u201342.","journal-title":"Ann Appl Stat"},{"issue":"12","key":"3640_CR7","doi-asserted-by":"publisher","first-page":"4069","DOI":"10.1080\/03610929108830757","volume":"20","author":"SY Dennis III","year":"1991","unstructured":"Dennis III SY. On the hyper-Dirichlet type 1 and hyper-Liouville distributions. Commun Stat-Theory Methods. 
1991; 20(12):4069\u201381.","journal-title":"Commun Stat-Theory Methods"},{"key":"3640_CR8","unstructured":"Minka T. The Dirichlet-tree distribution. 1999. https:\/\/www.microsoft.com\/en-us\/research\/publication\/dirichlet-tree-distribution\/."},{"issue":"3","key":"3640_CR9","doi-asserted-by":"publisher","first-page":"792","DOI":"10.1111\/biom.12654","volume":"73","author":"T Wang","year":"2017","unstructured":"Wang T, Zhao H. A Dirichlet-tree multinomial regression model for associating dietary nutrients with gut microorganisms. Biometrics. 2017; 73(3):792\u2013801.","journal-title":"Biometrics"},{"issue":"1","key":"3640_CR10","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1214\/17-AOAS1086","volume":"12","author":"Y Tang","year":"2018","unstructured":"Tang Y, Ma L, Nicolae DL, et al. A phylogenetic scan test on a Dirichlet-tree multinomial model for microbiome data. Ann Appl Stat. 2018; 12(1):1\u201326.","journal-title":"Ann Appl Stat"},{"issue":"9","key":"3640_CR11","doi-asserted-by":"crossref","first-page":"1278","DOI":"10.1093\/bioinformatics\/btw804","volume":"33","author":"Z-Z Tang","year":"2017","unstructured":"Tang Z-Z, Chen G, Alekseyenko AV, Li H. A general framework for association analysis of microbial communities on a taxonomic tree. Bioinformatics. 2017; 33(9):1278\u201385.","journal-title":"Bioinformatics"},{"issue":"17","key":"3640_CR12","doi-asserted-by":"publisher","first-page":"2611","DOI":"10.1093\/bioinformatics\/btw308","volume":"32","author":"EZ Chen","year":"2016","unstructured":"Chen EZ, Li H. A two-part mixed-effects model for analyzing longitudinal microbiome compositional data. Bioinformatics. 2016; 32(17):2611\u20137.","journal-title":"Bioinformatics"},{"issue":"4","key":"3640_CR13","doi-asserted-by":"publisher","first-page":"1003531","DOI":"10.1371\/journal.pcbi.1003531","volume":"10","author":"PJ McMurdie","year":"2014","unstructured":"McMurdie PJ, Holmes S. 
Waste not, want not: Why rarefying microbiome data is inadmissible. PLoS Comput Biol. 2014; 10(4):1003531.","journal-title":"PLoS Comput Biol"},{"issue":"6","key":"3640_CR14","doi-asserted-by":"publisher","first-page":"831","DOI":"10.1093\/bioinformatics\/btt608","volume":"30","author":"TP Garcia","year":"2013","unstructured":"Garcia TP, M\u00fcller S, Carroll RJ, Walzem RL. Identification of important regressor groups, subgroups and individuals via regularization methods: application to gut microbiome data. Bioinformatics. 2013; 30(6):831\u20137.","journal-title":"Bioinformatics"},{"issue":"6052","key":"3640_CR15","doi-asserted-by":"publisher","first-page":"105","DOI":"10.1126\/science.1208344","volume":"334","author":"GD Wu","year":"2011","unstructured":"Wu GD, Chen J, Hoffmann C, Bittinger K, Chen Y-Y, Keilbaugh SA, Bewtra M, Knights D, Walters WA, Knight R, et al. Linking long-term dietary patterns with gut microbial enterotypes. Science. 2011; 334(6052):105\u20138.","journal-title":"Science"},{"key":"3640_CR16","doi-asserted-by":"crossref","unstructured":"Koslovsky MD, Hoffman KL, Daniel CR, Vannucci M. A Bayesian model of microbiome data for simultaneous identification of covariate associations and prediction of phenotypic outcomes. Ann Appl Stat. 2020. In press.","DOI":"10.1214\/20-AOAS1354"},{"key":"3640_CR17","doi-asserted-by":"crossref","unstructured":"Brown PJ, Vannucci M, Fearn T. J R Stat Soc Ser B Stat Methodol. 1998; 60(3):627\u201341.","DOI":"10.1111\/1467-9868.00144"},{"issue":"2","key":"3640_CR18","first-page":"339","volume":"7","author":"EI George","year":"1997","unstructured":"George EI, McCulloch RE. Approaches for Bayesian variable selection. Stat Sin. 1997; 7(2):339\u201373. JSTOR.","journal-title":"Stat Sin"},{"key":"3640_CR19","doi-asserted-by":"publisher","first-page":"1054","DOI":"10.1016\/j.csda.2013.02.005","volume":"71","author":"D Eddelbuettel","year":"2014","unstructured":"Eddelbuettel D, Sanderson C. 
RcppArmadillo: Accelerating R with high-performance C++ linear algebra. Comput Stat Data Anal. 2014; 71:1054\u201363.","journal-title":"Comput Stat Data Anal"},{"issue":"8","key":"3640_CR20","doi-asserted-by":"publisher","first-page":"1","DOI":"10.18637\/jss.v040.i08","volume":"40","author":"D Eddelbuettel","year":"2011","unstructured":"Eddelbuettel D, Fran\u00e7ois R, Allaire J, Ushey K, Kou Q, Russel N, Chambers J, Bates D. Rcpp: Seamless R and C++ integration. J Stat Softw. 2011; 40(8):1\u201318.","journal-title":"J Stat Softw"},{"issue":"2","key":"3640_CR21","doi-asserted-by":"publisher","first-page":"289","DOI":"10.1093\/bioinformatics\/btg412","volume":"20","author":"E Paradis","year":"2004","unstructured":"Paradis E, Claude J, Strimmer K. APE: analyses of phylogenetics and evolution in R language. Bioinformatics. 2004; 20(2):289\u201390.","journal-title":"Bioinformatics"},{"issue":"3","key":"3640_CR22","doi-asserted-by":"publisher","first-page":"870","DOI":"10.1214\/009053604000000238","volume":"32","author":"MM Barbieri","year":"2004","unstructured":"Barbieri MM, Berger JO, et al. Optimal predictive model selection. Ann Stat. 2004; 32(3):870\u201397.","journal-title":"Ann Stat"},{"issue":"1","key":"3640_CR23","doi-asserted-by":"publisher","first-page":"13","DOI":"10.1128\/mSystems.00013-15","volume":"1","author":"C Noecker","year":"2016","unstructured":"Noecker C, Eng A, Srinivasan S, Theriot CM, Young VB, Jansson JK, Fredricks DN, Borenstein E. Metabolic model-based integration of microbiome taxonomic and metabolomic profiles elucidates mechanistic links between ecological and metabolic variation. MSystems. 2016; 1(1):13\u20135.","journal-title":"MSystems"},{"issue":"1","key":"3640_CR24","first-page":"7","volume":"6","author":"M Plummer","year":"2006","unstructured":"Plummer M, Best N, Cowles K, Vines K. CODA: convergence diagnosis and output analysis for MCMC. R News. 
2006; 6(1):7\u201311.","journal-title":"R News"},{"key":"3640_CR25","unstructured":"Akaike H. Information theory and an extension of maximum likelihood principle. In: Proc 2nd Int Symp on Information Theory: 1973. p. 267\u201381."},{"issue":"7","key":"3640_CR26","doi-asserted-by":"publisher","first-page":"1017","DOI":"10.1002\/sim.6792","volume":"35","author":"CB Peterson","year":"2016","unstructured":"Peterson CB, Stingo FC, Vannucci M. Joint Bayesian variable and graph selection for regression models with network-structured predictors. Stat Med. 2016; 35(7):1017\u201331.","journal-title":"Stat Med"},{"issue":"3","key":"3640_CR27","doi-asserted-by":"publisher","first-page":"1978","DOI":"10.1214\/11-AOAS463","volume":"5","author":"FC Stingo","year":"2011","unstructured":"Stingo FC, Chen YA, Tadesse MG, Vannucci M. Incorporating biological information into linear models: A Bayesian approach to the selection of pathways and genes. Ann Appl Stat. 2011; 5(3):1978\u20132002.","journal-title":"Ann Appl Stat"},{"issue":"7346","key":"3640_CR28","doi-asserted-by":"publisher","first-page":"174","DOI":"10.1038\/nature09944","volume":"473","author":"M Arumugam","year":"2011","unstructured":"Arumugam M, Raes J, Pelletier E, Le Paslier D, Yamada T, Mende DR, Fernandes GR, Tap J, Bruls T, Batto J-M, et al. Enterotypes of the human gut microbiome. Nature. 2011; 473(7346):174\u201380.","journal-title":"Nature"},{"issue":"2","key":"3640_CR29","doi-asserted-by":"publisher","first-page":"369","DOI":"10.1214\/10-BA607","volume":"5","author":"M Kyung","year":"2010","unstructured":"Kyung M, Gill J, Ghosh M, Casella G, et al. Penalized regression, standard errors, and Bayesian lassos. Bayesian Anal. 2010; 5(2):369\u2013411.","journal-title":"Bayesian Anal"},{"issue":"3","key":"3640_CR30","doi-asserted-by":"publisher","first-page":"1780","DOI":"10.1214\/11-AOAS455","volume":"5","author":"Y Guan","year":"2011","unstructured":"Guan Y, Stephens M. 
Bayesian variable selection regression for genome-wide association studies and other large-scale problems. Ann Appl Stat. 2011; 5(3):1780\u2013815.","journal-title":"Ann Appl Stat"},{"issue":"1","key":"3640_CR31","doi-asserted-by":"publisher","first-page":"17","DOI":"10.2307\/3315687","volume":"24","author":"H Chipman","year":"1996","unstructured":"Chipman H. Bayesian variable selection with related predictors. Can J Stat. 1996; 24(1):17\u201336.","journal-title":"Can J Stat"},{"key":"3640_CR32","doi-asserted-by":"crossref","unstructured":"Miao Y, Kook J, Lu Y, Guindani M, Vannucci M. Scalable Bayesian variable selection regression models for count data In: Yanan F, Smith M, Nott D, Dortet-Bernadet J, editors. Flexible Bayesian Regression Modelling. Elsevier: 2020. p. 187\u2013219.","DOI":"10.1016\/B978-0-12-815862-3.00015-9"}],"updated-by":[{"DOI":"10.1186\/s12859-020-03912-9","type":"correction","label":"Correction","source":"publisher","updated":{"date-parts":[[2020,12,28]],"date-time":"2020-12-28T00:00:00Z","timestamp":1609113600000}}],"container-title":["BMC 
Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-020-03640-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-020-03640-0\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-020-03640-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,10,4]],"date-time":"2023-10-04T05:10:11Z","timestamp":1696396211000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-020-03640-0"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,7,22]]},"references-count":32,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2020,12]]}},"alternative-id":["3640"],"URL":"https:\/\/doi.org\/10.1186\/s12859-020-03640-0","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,7,22]]},"assertion":[{"value":"2 January 2019","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"2 July 2020","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"22 July 2020","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"28 December 2020","order":4,"name":"change_date","label":"Change Date","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"Correction","order":5,"name":"change_type","label":"Change Type","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"An amendment to this paper has been published and can be 
accessed via the original article.","order":6,"name":"change_details","label":"Change Details","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"Not applicable.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"301"}}