{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,22]],"date-time":"2025-02-22T00:44:55Z","timestamp":1740185095477,"version":"3.37.3"},"reference-count":28,"publisher":"Oxford University Press (OUP)","issue":"24","license":[{"start":{"date-parts":[[2021,7,24]],"date-time":"2021-07-24T00:00:00Z","timestamp":1627084800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/academic.oup.com\/journals\/pages\/open_access\/funder_policies\/chorus\/standard_publication_model"}],"funder":[{"DOI":"10.13039\/501100012166","name":"National Key R&D Program of China","doi-asserted-by":"crossref","award":["2018YFC0910500"],"award-info":[{"award-number":["2018YFC0910500"]}],"id":[{"id":"10.13039\/501100012166","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["11971017"],"award-info":[{"award-number":["11971017"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Shanghai Municipal Science and Technology Major Project","award":["2017SHZDZX01"],"award-info":[{"award-number":["2017SHZDZX01"]}]},{"name":"Multidisciplinary Cross Research Foundation of Shanghai Jiao Tong University","award":["YG2019QNA26","YG2019QNA37","YG2021QN06"],"award-info":[{"award-number":["YG2019QNA26","YG2019QNA37","YG2021QN06"]}]},{"name":"Neil Shen\u2019s SJTU Medical Research Fund"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,12,11]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Microbiome data have proven extremely useful for understanding microbial communities and their impacts in health and disease. Although microbiome analysis methods and standards are evolving rapidly, obtaining meaningful and interpretable results from microbiome studies still requires careful statistical treatment. In particular, many existing and emerging methods for differential abundance (DA) analysis fail to account for the fact that microbiome data are high-dimensional and sparse, compositional, negatively and positively correlated and phylogenetically structured. To better describe microbiome data and improve the power of DA testing, there is still a great need for the continued development of appropriate statistical methodology.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>In this article, we propose a model-based approach for microbiome data transformation, and a phylogenetically informed procedure for DA testing based on the transformed data. First, we extend the Dirichlet-tree multinomial (DTM) to zero-inflated DTM for multivariate modeling of microbial counts, addressing data sparsity and correlation and phylogeny among bacterial taxa. Then, within this framework and using a Bayesian formulation, we introduce posterior mean transformation to convert raw counts into non-zero relative abundances that sum to one, accounting for the compositionality nature of microbiome data. Second, using the transformed data, we propose adaptive analysis of composition of microbiomes (adaANCOM) for DA testing by constructing log-ratios adaptively on the tree for each taxon, greatly reducing the computational complexity of ANCOM in high dimensions. Finally, we present extensive simulation studies, an analysis of HMP data across 18 body sites and 2 visits, and an application to a gut microbiome and malnutrition study, to investigate the performance of posterior mean transformation and adaANCOM. Comparisons with ANCOM and other DA testing procedures show that adaANCOM controls the false discovery rate well, allows for easy interpretation of the results, and is computationally efficient for high-dimensional problems.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>The developed R package is available at https:\/\/github.com\/ZRChao\/adaANCOM. For replicability purposes, scripts for our simulations and data analysis are available at https:\/\/github.com\/ZRChao\/Papers_supplementary.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Supplementary information<\/jats:title>\n                  <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btab543","type":"journal-article","created":{"date-parts":[[2021,7,22]],"date-time":"2021-07-22T11:10:07Z","timestamp":1626952207000},"page":"4652-4660","source":"Crossref","is-referenced-by-count":9,"title":["Transformation and differential abundance analysis of microbiome data incorporating phylogeny"],"prefix":"10.1093","volume":"37","author":[{"given":"Chao","family":"Zhou","sequence":"first","affiliation":[{"name":"Department of Bioinformatics and Biostatistics, Shanghai Jiao Tong University , 200240 Shanghai, China"},{"name":"SJTU-Yale Joint Center for Biostatistics and Data Science, Shanghai Jiao Tong University , 200240 Shanghai, China"}]},{"given":"Hongyu","family":"Zhao","sequence":"additional","affiliation":[{"name":"SJTU-Yale Joint Center for Biostatistics and Data Science, Shanghai Jiao Tong University , 200240 Shanghai, China"},{"name":"Department of Biostatistics, Yale University , New Haven, CT 06511, USA"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1218-4017","authenticated-orcid":false,"given":"Tao","family":"Wang","sequence":"additional","affiliation":[{"name":"Department of Bioinformatics and Biostatistics, Shanghai Jiao Tong University , 200240 Shanghai, China"},{"name":"SJTU-Yale Joint Center for Biostatistics and Data Science, Shanghai Jiao Tong University , 200240 Shanghai, China"},{"name":"MoE Key Lab of Artificial Intelligence, Shanghai Jiao Tong University , 200240 Shanghai, China"}]}],"member":"286","published-online":{"date-parts":[[2021,7,24]]},"reference":[{"key":"2023051607133126200_btab543-B1","doi-asserted-by":"crossref","first-page":"aad3311","DOI":"10.1126\/science.aad3311","article-title":"Gut bacteria that prevent growth impairments transmitted by microbiota from malnourished children","volume":"351","author":"Blanton","year":"2016","journal-title":"Science"},{"key":"2023051607133126200_btab543-B2","doi-asserted-by":"crossref","first-page":"852","DOI":"10.1038\/s41587-019-0209-9","article-title":"Reproducible, interactive, scalable and extensible microbiome data science using QIIME 2","volume":"37","author":"Bolyen","year":"2019","journal-title":"Nat. Biotechnol"},{"key":"2023051607133126200_btab543-B3","doi-asserted-by":"crossref","first-page":"279","DOI":"10.1023\/A:1023818214614","article-title":"Isometric logratio transformations for compositional data analysis","volume":"35","author":"Egozcue","year":"2003","journal-title":"Math. Geol"},{"year":"2011","author":"Ehrlich","first-page":"307","key":"2023051607133126200_btab543-B4"},{"key":"2023051607133126200_btab543-B5","doi-asserted-by":"crossref","first-page":"207","DOI":"10.1038\/nature11234","article-title":"Structure, function and diversity of the healthy human microbiome","volume":"486","author":"Huttenhower","year":"2012","journal-title":"Nature"},{"key":"2023051607133126200_btab543-B6","doi-asserted-by":"crossref","first-page":"1000","DOI":"10.1093\/bioinformatics\/btz686","article-title":"Phylogenetic tree-based microbiome association test","volume":"36","author":"Kim","year":"2020","journal-title":"Bioinformatics"},{"key":"2023051607133126200_btab543-B7","doi-asserted-by":"crossref","first-page":"410","DOI":"10.1038\/s41579-018-0029-9","article-title":"Best practices for analysing microbiomes","volume":"16","author":"Knight","year":"2018","journal-title":"Nat. Rev. Microbiol"},{"key":"2023051607133126200_btab543-B8","doi-asserted-by":"crossref","first-page":"45","DOI":"10.1186\/s40168-017-0262-x","article-title":"A powerful microbiome-based association test and a microbial taxa discovery framework for comprehensive association mapping","volume":"5","author":"Koh","year":"2017","journal-title":"Microbiome"},{"key":"2023051607133126200_btab543-B9","doi-asserted-by":"crossref","first-page":"799","DOI":"10.1186\/s12864-018-5160-5","article-title":"Analysis and correction of compositional bias in sparse sequencing count data","volume":"19","author":"Kumar","year":"2018","journal-title":"BMC Genomics"},{"key":"2023051607133126200_btab543-B10","doi-asserted-by":"crossref","first-page":"e52078","DOI":"10.1371\/journal.pone.0052078","article-title":"Hypothesis testing and power calculations for taxonomic-based human microbiome data","volume":"7","author":"La Rosa","year":"2012","journal-title":"PLoS One"},{"key":"2023051607133126200_btab543-B11","doi-asserted-by":"crossref","first-page":"3514","DOI":"10.1038\/s41467-020-17041-7","article-title":"Analysis of compositions of microbiomes with bias correction","volume":"11","author":"Lin","year":"2020","journal-title":"Nat. Commun"},{"key":"2023051607133126200_btab543-B12","doi-asserted-by":"crossref","first-page":"225","DOI":"10.1186\/s12859-020-03552-z","article-title":"An empirical Bayes approach to normalization and differential abundance testing for microbiome data","volume":"21","author":"Liu","year":"2020","journal-title":"BMC Bioinformatics"},{"key":"2023051607133126200_btab543-B13","doi-asserted-by":"crossref","first-page":"61","DOI":"10.1038\/nature23889","article-title":"Strains, functions and dynamics in the expanded Human Microbiome Project","volume":"550","author":"Lloyd-Price","year":"2017","journal-title":"Nature"},{"key":"2023051607133126200_btab543-B14","doi-asserted-by":"crossref","first-page":"550","DOI":"10.1186\/s13059-014-0550-8","article-title":"Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2","volume":"15","author":"Love","year":"2014","journal-title":"Genome Biol"},{"key":"2023051607133126200_btab543-B15","doi-asserted-by":"crossref","first-page":"169","DOI":"10.1038\/ismej.2010.133","article-title":"UniFrac: an effective distance metric for microbial community comparison","volume":"5","author":"Lozupone","year":"2011","journal-title":"ISME J"},{"key":"2023051607133126200_btab543-B16","first-page":"27663","article-title":"Analysis of composition of microbiomes: a novel method for studying microbial composition","volume":"26","author":"Mandal","year":"2015","journal-title":"Microb. Ecol. Health Dis"},{"key":"2023051607133126200_btab543-B17","doi-asserted-by":"crossref","first-page":"134","DOI":"10.1177\/1471082X14535524","article-title":"Bayesian-multiplicative treatment of count zeros in compositional data sets","volume":"15","author":"Mart\u00edn-Fern\u00e1ndez","year":"2015","journal-title":"Stat. Modell"},{"key":"2023051607133126200_btab543-B18","doi-asserted-by":"crossref","first-page":"127","DOI":"10.1016\/j.micpath.2016.02.003","article-title":"Gut microbiota and malnutrition","volume":"106","author":"Million","year":"2017","journal-title":"Microb. Pathog"},{"key":"2023051607133126200_btab543-B19","doi-asserted-by":"crossref","first-page":"2304","DOI":"10.1016\/j.clnu.2018.10.004","article-title":"Altered host-gut microbes symbiosis in severely malnourished anorexia nervosa (AN) patients undergoing enteral nutrition: an explicative factor of functional intestinal disorders?","volume":"38","author":"Mouna","year":"2019","journal-title":"Clin. Nutrition"},{"key":"2023051607133126200_btab543-B21","doi-asserted-by":"crossref","first-page":"1200","DOI":"10.1038\/nmeth.2658","article-title":"Differential abundance analysis for microbial marker\u2013gene surveys","volume":"10","author":"Paulson","year":"2013","journal-title":"Nat. Methods"},{"key":"2023051607133126200_btab543-B22","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1093\/bioinformatics\/btp616","article-title":"edgeR: a Bioconductor package for differential expression analysis of digital gene expression data","volume":"26","author":"Robinson","year":"2010","journal-title":"Bioinformatics"},{"key":"2023051607133126200_btab543-B23","doi-asserted-by":"crossref","first-page":"417","DOI":"10.1038\/nature13421","article-title":"Persistent gut microbiota immaturity in malnourished Bangladeshi children","volume":"510","author":"Subramanian","year":"2014","journal-title":"Nature"},{"key":"2023051607133126200_btab543-B24","doi-asserted-by":"crossref","first-page":"698","DOI":"10.1093\/biostatistics\/kxy025","article-title":"Zero-inflated generalized Dirichlet multinomial regression model for microbiome compositional data analysis","volume":"20","author":"Tang","year":"2019","journal-title":"Biostatistics"},{"key":"2023051607133126200_btab543-B25","doi-asserted-by":"crossref","first-page":"792","DOI":"10.1111\/biom.12654","article-title":"A Dirichlet-tree multinomial regression model for associating dietary nutrients with gut microorganisms","volume":"73","author":"Wang","year":"2017","journal-title":"Biometrics"},{"key":"2023051607133126200_btab543-B26","doi-asserted-by":"crossref","first-page":"652","DOI":"10.1038\/s41564-018-0156-0","article-title":"Methods for phylogenetic analysis of microbiome data","volume":"3","author":"Washburne","year":"2018","journal-title":"Nat. Microbiol"},{"key":"2023051607133126200_btab543-B27","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1186\/s40168-017-0237-y","article-title":"Normalization and microbial differential abundance strategies depend upon data characteristics","volume":"5","author":"Weiss","year":"2017","journal-title":"Microbiome"},{"key":"2023051607133126200_btab543-B28","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1080\/10618600.2016.1154063","article-title":"Regression models for multivariate count data","volume":"26","author":"Zhang","year":"2017","journal-title":"J. Comput. Graph. Stat"},{"key":"2023051607133126200_btab543-B29","doi-asserted-by":"crossref","first-page":"797","DOI":"10.1016\/j.ajhg.2015.04.003","article-title":"Testing in microbiome-profiling studies with MiRKAT, the microbiome regression-based kernel association test","volume":"96","author":"Zhao","year":"2015","journal-title":"Am. J. Hum. Genet"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btab543\/39737941\/btab543.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/24\/4652\/50334771\/btab543.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/24\/4652\/50334771\/btab543.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,5,16]],"date-time":"2023-05-16T07:41:44Z","timestamp":1684222904000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/37\/24\/4652\/6327682"}},"subtitle":[],"editor":[{"given":"Peter","family":"Robinson","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2021,7,24]]},"references-count":28,"journal-issue":{"issue":"24","published-print":{"date-parts":[[2021,12,11]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btab543","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"type":"print","value":"1367-4803"},{"type":"electronic","value":"1367-4811"}],"subject":[],"published-other":{"date-parts":[[2021,12,15]]},"published":{"date-parts":[[2021,7,24]]}}}