{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,18]],"date-time":"2026-02-18T22:53:31Z","timestamp":1771455211176,"version":"3.50.1"},"update-to":[{"DOI":"10.1371\/journal.pcbi.1013124","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2025,11,3]],"date-time":"2025-11-03T00:00:00Z","timestamp":1762128000000}}],"reference-count":43,"publisher":"Public Library of Science (PLoS)","issue":"10","license":[{"start":{"date-parts":[[2025,10,24]],"date-time":"2025-10-24T00:00:00Z","timestamp":1761264000000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"National Institute of Health","award":["R01GM144351"],"award-info":[{"award-number":["R01GM144351"]}]},{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["DMS-1830392, DMS-2113359, DMS-1811747"],"award-info":[{"award-number":["DMS-1830392, DMS-2113359, DMS-1811747"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["DMS-2113360"],"award-info":[{"award-number":["DMS-2113360"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100014535","name":"Center for Individualized Medicine, Mayo Clinic","doi-asserted-by":"publisher","award":["N\/A"],"award-info":[{"award-number":["N\/A"]}],"id":[{"id":"10.13039\/100014535","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["www.ploscompbiol.org"],"crossmark-restriction":false},"short-container-title":["PLoS Comput Biol"],"abstract":"<jats:p>Microbiome sequencing data are inherently sparse and compositional, with excessive zeros arising from biological absence or insufficient sampling. These zeros pose significant challenges for downstream analyses, particularly those that require log-transformation. We introduce BMDD (BiModal Dirichlet Distribution), a novel probabilistic modeling framework for accurate imputation of microbiome sequencing data. Unlike existing imputation approaches that assume unimodal abundance, BMDD captures the bimodal abundance distribution of the taxa via a mixture of Dirichlet priors. It uses variational inference and a scalable expectation-maximization algorithm for efficient imputation. Through simulations and real microbiome datasets, we demonstrate that BMDD outperforms competing methods in reconstructing true abundances and improves the performance of differential abundance analysis. Through multiple posterior samples, BMDD enables robust inference by accounting for uncertainty in zero imputation. Our method offers a principled and computationally efficient solution for analyzing high-dimensional, zero-inflated microbiome sequencing data and is broadly applicable in microbial biomarker discovery and host-microbiome interaction studies.<\/jats:p>","DOI":"10.1371\/journal.pcbi.1013124","type":"journal-article","created":{"date-parts":[[2025,10,24]],"date-time":"2025-10-24T17:54:54Z","timestamp":1761328494000},"page":"e1013124","update-policy":"https:\/\/doi.org\/10.1371\/journal.pcbi.corrections_policy","source":"Crossref","is-referenced-by-count":2,"title":["BMDD: A probabilistic framework for accurate imputation of zero-inflated microbiome sequencing data"],"prefix":"10.1371","volume":"21","author":[{"given":"Huijuan","family":"Zhou","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1273-5624","authenticated-orcid":true,"given":"Jun","family":"Chen","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xianyang","family":"Zhang","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"340","published-online":{"date-parts":[[2025,10,24]]},"reference":[{"issue":"12","key":"pcbi.1013124.ref001","doi-asserted-by":"crossref","first-page":"1855","DOI":"10.1016\/j.mayocp.2017.10.004","article-title":"Microbiome at the frontier of personalized medicine","volume":"92","author":"PC Kashyap","year":"2017","journal-title":"Mayo Clin Proc."},{"issue":"3","key":"pcbi.1013124.ref002","doi-asserted-by":"crossref","first-page":"382","DOI":"10.1016\/j.chom.2014.02.005","article-title":"The treatment-naive microbiome in new-onset Crohn\u2019s disease","volume":"15","author":"D Gevers","year":"2014","journal-title":"Cell Host Microbe."},{"issue":"7422","key":"pcbi.1013124.ref003","doi-asserted-by":"crossref","first-page":"119","DOI":"10.1038\/nature11582","article-title":"Host-microbe interactions have shaped the genetic architecture of inflammatory bowel disease","volume":"491","author":"L Jostins","year":"2012","journal-title":"Nature."},{"issue":"2","key":"pcbi.1013124.ref004","doi-asserted-by":"crossref","first-page":"292","DOI":"10.1101\/gr.126573.111","article-title":"Genomic analysis identifies association of Fusobacterium with colorectal carcinoma","volume":"22","author":"AD Kostic","year":"2012","journal-title":"Genome Res."},{"issue":"7418","key":"pcbi.1013124.ref005","doi-asserted-by":"crossref","first-page":"55","DOI":"10.1038\/nature11450","article-title":"A metagenome-wide association study of gut microbiota in type 2 diabetes","volume":"490","author":"J Qin","year":"2012","journal-title":"Nature."},{"key":"pcbi.1013124.ref006","doi-asserted-by":"crossref","DOI":"10.7554\/eLife.01202","article-title":"Expansion of intestinal Prevotella copri correlates with enhanced susceptibility to arthritis","volume":"2","author":"JU Scher","year":"2013","journal-title":"Elife."},{"issue":"7516","key":"pcbi.1013124.ref007","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1038\/nature13568","article-title":"Alterations of the human gut microbiome in liver cirrhosis","volume":"513","author":"N Qin","year":"2014","journal-title":"Nature."},{"issue":"6","key":"pcbi.1013124.ref008","doi-asserted-by":"crossref","first-page":"815","DOI":"10.1016\/j.immuni.2014.05.012","article-title":"Mining the human gut microbiota for effector strains that shape the immune system","volume":"40","author":"PP Ahern","year":"2014","journal-title":"Immunity."},{"issue":"5","key":"pcbi.1013124.ref009","doi-asserted-by":"crossref","DOI":"10.1016\/j.cell.2017.01.022","article-title":"Mining the human gut microbiota for immunomodulatory organisms","volume":"168","author":"N Geva-Zatorsky","year":"2017","journal-title":"Cell."},{"issue":"4","key":"pcbi.1013124.ref010","doi-asserted-by":"crossref","DOI":"10.1016\/j.cell.2016.10.020","article-title":"Linking the human gut microbiome to inflammatory cytokine production capacity","volume":"167","author":"M Schirmer","year":"2016","journal-title":"Cell."},{"issue":"7612","key":"pcbi.1013124.ref011","doi-asserted-by":"crossref","first-page":"376","DOI":"10.1038\/nature18646","article-title":"Human gut microbes impact host serum metabolome and insulin sensitivity","volume":"535","author":"HK Pedersen","year":"2016","journal-title":"Nature."},{"issue":"1","key":"pcbi.1013124.ref012","doi-asserted-by":"crossref","first-page":"105","DOI":"10.1038\/s41467-017-02651-5","article-title":"Microbiota derived short chain fatty acids promote histone crotonylation in the colon through histone deacetylases","volume":"9","author":"R Fellows","year":"2018","journal-title":"Nat Commun."},{"key":"pcbi.1013124.ref013","article-title":"The role of the microbiome in human health and disease: an introduction for clinicians","volume":"356","author":"VB Young","year":"2017","journal-title":"BMJ."},{"issue":"1","key":"pcbi.1013124.ref014","doi-asserted-by":"crossref","first-page":"47","DOI":"10.1038\/nrg3129","article-title":"Experimental and analytical tools for studying the human microbiome","volume":"13","author":"J Kuczynski","year":"2011","journal-title":"Nat Rev Genet."},{"issue":"1","key":"pcbi.1013124.ref015","doi-asserted-by":"crossref","first-page":"130","DOI":"10.1186\/s40168-022-01320-0","article-title":"A comprehensive evaluation of microbial differential abundance analysis methods: current status and potential solutions","volume":"10","author":"L Yang","year":"2022","journal-title":"Microbiome."},{"issue":"4","key":"pcbi.1013124.ref016","doi-asserted-by":"crossref","DOI":"10.1214\/22-AOAS1607","article-title":"Testing for differential abundance in compositional counts data, with application to microbiome studies","volume":"16","author":"B Brill","year":"2022","journal-title":"Ann Appl Stat."},{"issue":"7","key":"pcbi.1013124.ref017","doi-asserted-by":"crossref","first-page":"539","DOI":"10.1038\/s41592-018-0033-z","article-title":"SAVER: gene expression recovery for single-cell RNA sequencing","volume":"15","author":"M Huang","year":"2018","journal-title":"Nat Methods."},{"issue":"1","key":"pcbi.1013124.ref018","doi-asserted-by":"crossref","first-page":"94","DOI":"10.1186\/s13059-022-02657-3","article-title":"mbDenoise: microbiome data denoising using zero-inflated probabilistic principal components analysis","volume":"23","author":"Y Zeng","year":"2022","journal-title":"Genome Biol."},{"issue":"1","key":"pcbi.1013124.ref019","doi-asserted-by":"crossref","first-page":"192","DOI":"10.1186\/s13059-021-02400-4","article-title":"mbImpute: an accurate and robust imputation method for microbiome data","volume":"22","author":"R Jiang","year":"2021","journal-title":"Genome Biol."},{"issue":"1","key":"pcbi.1013124.ref020","doi-asserted-by":"crossref","first-page":"997","DOI":"10.1038\/s41467-018-03405-7","article-title":"An accurate and robust imputation method scImpute for single-cell RNA-seq data","volume":"9","author":"WV Li","year":"2018","journal-title":"Nat Commun."},{"issue":"1","key":"pcbi.1013124.ref021","doi-asserted-by":"crossref","first-page":"192","DOI":"10.1038\/s41467-021-27729-z","article-title":"Zero-preserving imputation of single-cell RNA-seq data","volume":"13","author":"GC Linderman","year":"2022","journal-title":"Nat Commun."},{"key":"pcbi.1013124.ref022","doi-asserted-by":"crossref","first-page":"4344","DOI":"10.1038\/ncomms5344","article-title":"Tipping elements in the human intestinal ecosystem","volume":"5","author":"L Lahti","year":"2014","journal-title":"Nat Commun."},{"issue":"3","key":"pcbi.1013124.ref023","doi-asserted-by":"crossref","DOI":"10.1128\/mSystems.00031-18","article-title":"American gut: an open platform for citizen science microbiome research","volume":"3","author":"D McDonald","year":"2018","journal-title":"mSystems."},{"issue":"5","key":"pcbi.1013124.ref024","doi-asserted-by":"crossref","first-page":"807","DOI":"10.1093\/bioinformatics\/bty729","article-title":"Batch effects correction for microbiome data with Dirichlet-multinomial regression","volume":"35","author":"Z Dai","year":"2019","journal-title":"Bioinformatics."},{"key":"pcbi.1013124.ref025","volume-title":"Multiple imputation for nonresponse in surveys","author":"DB Rubin","year":"2004"},{"key":"pcbi.1013124.ref026","doi-asserted-by":"crossref","first-page":"412","DOI":"10.1016\/j.jmva.2012.07.007","article-title":"A generalization of the Dirichlet distribution","volume":"114","author":"A Ongaro","year":"2013","journal-title":"Journal of Multivariate Analysis."},{"issue":"2","key":"pcbi.1013124.ref027","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pone.0030126","article-title":"Dirichlet multinomial mixtures: generative models for microbial metagenomics","volume":"7","author":"I Holmes","year":"2012","journal-title":"PLoS One."},{"issue":"1","key":"pcbi.1013124.ref028","doi-asserted-by":"crossref","first-page":"95","DOI":"10.1186\/s13059-022-02655-5","article-title":"LinDA: linear models for differential abundance analysis of microbiome compositional data","volume":"23","author":"H Zhou","year":"2022","journal-title":"Genome Biol."},{"issue":"6052","key":"pcbi.1013124.ref029","doi-asserted-by":"crossref","first-page":"105","DOI":"10.1126\/science.1208344","article-title":"Linking long-term dietary patterns with gut microbial enterotypes","volume":"334","author":"GD Wu","year":"2011","journal-title":"Science."},{"issue":"1","key":"pcbi.1013124.ref030","doi-asserted-by":"crossref","DOI":"10.1093\/bib\/bbac607","article-title":"Benchmarking differential abundance analysis methods for correlated microbiome sequencing data","volume":"24","author":"L Yang","year":"2023","journal-title":"Brief Bioinform."},{"issue":"1","key":"pcbi.1013124.ref031","doi-asserted-by":"crossref","first-page":"3514","DOI":"10.1038\/s41467-020-17041-7","article-title":"Analysis of compositions of microbiomes with bias correction","volume":"11","author":"H Lin","year":"2020","journal-title":"Nat Commun."},{"issue":"11","key":"pcbi.1013124.ref032","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pcbi.1009442","article-title":"Multivariable association discovery in population-scale meta-omics studies","volume":"17","author":"H Mallick","year":"2021","journal-title":"PLoS Comput Biol."},{"issue":"1","key":"pcbi.1013124.ref033","doi-asserted-by":"crossref","first-page":"83","DOI":"10.1038\/s41592-023-02092-7","article-title":"Multigroup analysis of compositions of microbiomes with covariate adjustments and repeated measures","volume":"21","author":"H Lin","year":"2024","journal-title":"Nat Methods."},{"issue":"1","key":"pcbi.1013124.ref034","doi-asserted-by":"crossref","first-page":"1784","DOI":"10.1038\/s41467-017-01973-8","article-title":"Meta-analysis of gut microbiome studies identifies disease-specific and shared responses","volume":"8","author":"C Duvallet","year":"2017","journal-title":"Nat Commun."},{"issue":"4","key":"pcbi.1013124.ref035","doi-asserted-by":"crossref","first-page":"643","DOI":"10.1093\/bioinformatics\/btx650","article-title":"An omnibus test for differential distribution analysis of microbiome sequencing data","volume":"34","author":"J Chen","year":"2018","journal-title":"Bioinformatics."},{"issue":"7461","key":"pcbi.1013124.ref036","doi-asserted-by":"crossref","first-page":"232","DOI":"10.1038\/nature12331","article-title":"Treg induction by a rationally selected mixture of Clostridia strains from the human microbiota","volume":"500","author":"K Atarashi","year":"2013","journal-title":"Nature."},{"issue":"30","key":"pcbi.1013124.ref037","article-title":"LOCOM: a logistic regression model for testing differential abundance in compositional microbiome data with false discovery rate control","volume":"119","author":"Y Hu","year":"2022","journal-title":"Proc Natl Acad Sci U S A."},{"key":"pcbi.1013124.ref038","doi-asserted-by":"crossref","first-page":"749573","DOI":"10.3389\/fgene.2021.749573","article-title":"RFtest: a robust and flexible community-level test for microbiome data powerfully detects phylogenetically clustered signals","volume":"12","author":"L Zhang","year":"2022","journal-title":"Front Genet."},{"issue":"1","key":"pcbi.1013124.ref039","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1186\/s40168-021-01199-3","article-title":"Performance determinants of unsupervised clustering methods for microbiome data","volume":"10","author":"Y Shi","year":"2022","journal-title":"Microbiome."},{"issue":"2","key":"pcbi.1013124.ref040","article-title":"Regression analysis for microbiome compositional data","volume":"10","author":"P Shi","year":"2016","journal-title":"Ann Appl Stat."},{"key":"pcbi.1013124.ref041","doi-asserted-by":"crossref","DOI":"10.7554\/eLife.11888","article-title":"Recurring patterns in bacterioplankton dynamics during coastal spring algae blooms","volume":"5","author":"H Teeling","year":"2016","journal-title":"Elife."},{"issue":"9","key":"pcbi.1013124.ref042","doi-asserted-by":"crossref","first-page":"567","DOI":"10.1038\/s41579-018-0024-1","article-title":"Keystone taxa as drivers of microbiome structure and functioning","volume":"16","author":"S Banerjee","year":"2018","journal-title":"Nat Rev Microbiol."},{"key":"pcbi.1013124.ref043","unstructured":"Bishop CM. Pattern recognition and machine learning. Springer; 2006."}],"updated-by":[{"DOI":"10.1371\/journal.pcbi.1013124","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2025,11,3]],"date-time":"2025-11-03T00:00:00Z","timestamp":1762128000000}},{"DOI":"10.1371\/journal.pcbi.1013974","type":"correction","label":"Correction","source":"publisher","updated":{"date-parts":[[2026,2,17]],"date-time":"2026-02-17T00:00:00Z","timestamp":1771286400000}}],"container-title":["PLOS Computational Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1013124","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,11,3]],"date-time":"2025-11-03T18:42:28Z","timestamp":1762195348000},"score":1,"resource":{"primary":{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1013124"}},"subtitle":[],"editor":[{"given":"Alberto J. M.","family":"Martin","sequence":"first","affiliation":[],"role":[{"role":"editor","vocabulary":"crossref"}]}],"short-title":[],"issued":{"date-parts":[[2025,10,24]]},"references-count":43,"journal-issue":{"issue":"10","published-online":{"date-parts":[[2025,10,24]]}},"URL":"https:\/\/doi.org\/10.1371\/journal.pcbi.1013124","relation":{},"ISSN":["1553-7358"],"issn-type":[{"value":"1553-7358","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,10,24]]}}}