{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,27]],"date-time":"2026-02-27T06:13:02Z","timestamp":1772172782557,"version":"3.50.1"},"update-to":[{"DOI":"10.1371\/journal.pcbi.1009089","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2021,6,30]],"date-time":"2021-06-30T00:00:00Z","timestamp":1625011200000}}],"reference-count":41,"publisher":"Public Library of Science (PLoS)","issue":"6","license":[{"start":{"date-parts":[[2021,6,18]],"date-time":"2021-06-18T00:00:00Z","timestamp":1623974400000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"CEA\u2019s High Commissioner office","award":["Th\u00e8se Phare"],"award-info":[{"award-number":["Th\u00e8se Phare"]}]}],"content-domain":{"domain":["www.ploscompbiol.org"],"crossmark-restriction":false},"short-container-title":["PLoS Comput Biol"],"abstract":"<jats:p>The advent of high-throughput metagenomic sequencing has prompted the development of efficient taxonomic profiling methods allowing to measure the presence, abundance and phylogeny of organisms in a wide range of environmental samples. Multivariate sequence-derived abundance data further has the potential to enable inference of ecological associations between microbial populations, but several technical issues need to be accounted for, like the compositional nature of the data, its extreme sparsity and overdispersion, as well as the frequent need to operate in under-determined regimes.<\/jats:p>\n                  <jats:p>The ecological network reconstruction problem is frequently cast into the paradigm of Gaussian Graphical Models (GGMs) for which efficient structure inference algorithms are available, like the graphical lasso and neighborhood selection. 
Unfortunately, GGMs or variants thereof can not properly account for the extremely sparse patterns occurring in real-world metagenomic taxonomic profiles. In particular, structural zeros (as opposed to sampling zeros) corresponding to true absences of biological signals fail to be properly handled by most statistical methods.<\/jats:p>\n                  <jats:p>\n                    We present here a zero-inflated log-normal graphical model (available at\n                    <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" ext-link-type=\"uri\" xlink:href=\"https:\/\/github.com\/vincentprost\/Zi-LN\" xlink:type=\"simple\">https:\/\/github.com\/vincentprost\/Zi-LN<\/jats:ext-link>\n                    ) specifically aimed at handling such \u201cbiological\u201d zeros, and demonstrate significant performance gains over state-of-the-art statistical methods for the inference of microbial association networks, with most notable gains obtained when analyzing taxonomic profiles displaying sparsity levels on par with real-world metagenomic datasets.\n                  <\/jats:p>","DOI":"10.1371\/journal.pcbi.1009089","type":"journal-article","created":{"date-parts":[[2021,6,18]],"date-time":"2021-06-18T13:39:35Z","timestamp":1624023575000},"page":"e1009089","update-policy":"https:\/\/doi.org\/10.1371\/journal.pcbi.corrections_policy","source":"Crossref","is-referenced-by-count":19,"title":["A zero inflated log-normal model for inference of sparse microbial association 
networks"],"prefix":"10.1371","volume":"17","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-7986-3156","authenticated-orcid":true,"given":"Vincent","family":"Prost","sequence":"first","affiliation":[]},{"given":"St\u00e9phane","family":"Gazut","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3525-8979","authenticated-orcid":true,"given":"Thomas","family":"Br\u00fcls","sequence":"additional","affiliation":[]}],"member":"340","published-online":{"date-parts":[[2021,6,18]]},"reference":[{"issue":"5879","key":"pcbi.1009089.ref001","doi-asserted-by":"crossref","first-page":"1034","DOI":"10.1126\/science.1153213","article-title":"The microbial engines that drive Earth\u2019s biogeochemical cycles","volume":"320","author":"PG Falkowski","year":"2008","journal-title":"science"},{"issue":"3","key":"pcbi.1009089.ref002","doi-asserted-by":"crossref","first-page":"218","DOI":"10.1016\/S0958-1669(02)00315-4","article-title":"Bacterial community composition and function in sewage treatment systems","volume":"13","author":"M Wagner","year":"2002","journal-title":"Current opinion in biotechnology"},{"issue":"4","key":"pcbi.1009089.ref003","doi-asserted-by":"crossref","first-page":"481","DOI":"10.1016\/S0168-6445(03)00039-1","article-title":"New concepts of microbial treatment processes for the nitrogen removal in wastewater","volume":"27","author":"I Schmidt","year":"2003","journal-title":"FEMS microbiology reviews"},{"issue":"1","key":"pcbi.1009089.ref004","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s41467-020-18127-y","article-title":"Faecal microbiota transplantation for the treatment of diarrhoea induced by tyrosine-kinase inhibitors in patients with metastatic renal cell carcinoma","volume":"11","author":"G Ianiro","year":"2020","journal-title":"Nature communications"},{"issue":"8","key":"pcbi.1009089.ref005","doi-asserted-by":"crossref","first-page":"475","DOI":"10.2307\/1307540","article-title":"A clarification of 
interactions in ecological systems","volume":"29","author":"WZ Lidicker","year":"1979","journal-title":"Bioscience"},{"key":"pcbi.1009089.ref006","doi-asserted-by":"crossref","first-page":"217","DOI":"10.1016\/j.mib.2016.04.020","article-title":"The bright side of microbial dark matter: lessons learned from the uncultivated majority","volume":"31","author":"L Solden","year":"2016","journal-title":"Current opinion in microbiology"},{"issue":"96","key":"pcbi.1009089.ref007","doi-asserted-by":"crossref","first-page":"20140065","DOI":"10.1098\/rsif.2014.0065","article-title":"Co-culture systems and technologies: taking synthetic biology to the next level","volume":"11","author":"L Goers","year":"2014","journal-title":"Journal of The Royal Society Interface"},{"issue":"8","key":"pcbi.1009089.ref008","doi-asserted-by":"crossref","first-page":"538","DOI":"10.1038\/nrmicro2832","article-title":"Microbial interactions: from networks to models","volume":"10","author":"K Faust","year":"2012","journal-title":"Nature Reviews Microbiology"},{"issue":"7","key":"pcbi.1009089.ref009","doi-asserted-by":"crossref","first-page":"1669","DOI":"10.1038\/ismej.2015.235","article-title":"Correlation detection strategies in microbial data sets vary widely in sensitivity and precision","volume":"10","author":"S Weiss","year":"2016","journal-title":"The ISME journal"},{"issue":"3","key":"pcbi.1009089.ref010","doi-asserted-by":"crossref","first-page":"286","DOI":"10.1016\/j.cels.2019.08.002","article-title":"Rapid inference of direct interactions in large-scale ecological networks from heterogeneous microbial sequencing data","volume":"9","author":"J Tackmann","year":"2019","journal-title":"Cell systems"},{"key":"pcbi.1009089.ref011","doi-asserted-by":"crossref","DOI":"10.1093\/oso\/9780198522195.001.0001","volume-title":"Graphical Models","author":"SL 
Lauritzen","year":"1996"},{"key":"pcbi.1009089.ref012","doi-asserted-by":"crossref","first-page":"432","DOI":"10.1093\/biostatistics\/kxm045","article-title":"Sparse inverse covariance estimation with the graphical LASSO","volume":"9","author":"J Friedman","year":"2008","journal-title":"Biostatistics (Oxford, England)"},{"issue":"3","key":"pcbi.1009089.ref013","doi-asserted-by":"crossref","first-page":"1436","DOI":"10.1214\/009053606000000281","article-title":"High-dimensional graphs and variable selection with the Lasso","volume":"34","author":"N Meinshausen","year":"2006","journal-title":"Ann Statist"},{"key":"pcbi.1009089.ref014","article-title":"Bayesian variable selection for multivariate zero-inflated models: Application to microbiome count data","author":"KH Lee","year":"2018","journal-title":"Biostatistics"},{"key":"pcbi.1009089.ref015","article-title":"Naught all zeros in sequence count data are the same","author":"JD Silverman","year":"2018","journal-title":"bioRxiv"},{"key":"pcbi.1009089.ref016","doi-asserted-by":"crossref","DOI":"10.1186\/s12863-017-0561-z","article-title":"Network analysis for count data with excess zeros","volume":"18","author":"H Choi","year":"2017","journal-title":"BMC Genetics"},{"key":"pcbi.1009089.ref017","unstructured":"Chiquet J, Robin S, Mariadassou M. Variational Inference for sparse network reconstruction from count data. In: Chaudhuri K, Salakhutdinov R, editors. Proceedings of the 36th International Conference on Machine Learning. vol. 97 of Proceedings of Machine Learning Research. Long Beach, California, USA: PMLR; 2019. p. 1162\u20131171. 
Available from: http:\/\/proceedings.mlr.press\/v97\/chiquet19a.html."},{"issue":"16","key":"pcbi.1009089.ref018","doi-asserted-by":"crossref","first-page":"3105","DOI":"10.1080\/00949655.2019.1657116","article-title":"Sparse inverse covariance estimation for high-throughput microRNA sequencing data in the Poisson log-normal graphical model","volume":"89","author":"D Sinclair","year":"2019","journal-title":"Journal of Statistical Computation and Simulation"},{"key":"pcbi.1009089.ref019","doi-asserted-by":"crossref","first-page":"526","DOI":"10.1089\/cmb.2016.0061","article-title":"Learning Microbial Interaction Networks from Metagenomic Count Data","volume":"23","author":"S Biswas","year":"2016","journal-title":"Journal of Computational Biology"},{"key":"pcbi.1009089.ref020","author":"H Wu","year":"2016","journal-title":"Sparse Estimation of Multivariate Poisson Log-Normal Models from Count Data"},{"key":"pcbi.1009089.ref021","doi-asserted-by":"crossref","DOI":"10.1089\/cmb.2017.0054","article-title":"gCoda: Conditional Dependence Network Inference for Compositional Data","volume":"24","author":"H Fang","year":"2017","journal-title":"Journal of Computational Biology"},{"key":"pcbi.1009089.ref022","doi-asserted-by":"crossref","first-page":"e1005852","DOI":"10.1371\/journal.pcbi.1005852","article-title":"A Bayesian method for detecting pairwise associations in compositional data","volume":"13","author":"E Schwager","year":"2017","journal-title":"PLOS Computational Biology"},{"issue":"5","key":"pcbi.1009089.ref023","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1371\/journal.pcbi.1004226","article-title":"Sparse and Compositionally Robust Inference of Microbial Ecological Networks","volume":"11","author":"ZD Kurtz","year":"2015","journal-title":"PLOS Computational Biology"},{"key":"pcbi.1009089.ref024","article-title":"MAGMA: inference of sparse microbial association networks","author":"A 
Cougoul","year":"2019","journal-title":"bioRxiv"},{"key":"pcbi.1009089.ref025","doi-asserted-by":"crossref","first-page":"e77503","DOI":"10.1371\/journal.pone.0077503","article-title":"A Hierarchical Poisson Log-Normal Model for Network Inference from RNA Sequencing Data","volume":"8","author":"M Gallopin","year":"2013","journal-title":"PloS one"},{"key":"pcbi.1009089.ref026","doi-asserted-by":"crossref","unstructured":"Allen GI, Liu Z. A log-linear graphical model for inferring genetic networks from high-throughput sequencing data. In: 2012 IEEE International Conference on Bioinformatics and Biomedicine. IEEE; 2012. p. 1\u20136.","DOI":"10.1109\/BIBM.2012.6392619"},{"key":"pcbi.1009089.ref027","unstructured":"Yang E, Allen G, Liu Z, Ravikumar PK. Graphical models via generalized linear models. In: Advances in Neural Information Processing Systems; 2012. p. 1358\u20131366."},{"key":"pcbi.1009089.ref028","doi-asserted-by":"crossref","DOI":"10.1186\/s12859-019-2882-6","article-title":"metaSPARSim: a 16S rRNA gene sequencing count data simulator","volume":"20","author":"I Patuzzi","year":"2019","journal-title":"BMC Bioinformatics"},{"issue":"4","key":"pcbi.1009089.ref029","doi-asserted-by":"crossref","first-page":"643","DOI":"10.1093\/biomet\/76.4.643","article-title":"The multivariate Poisson-log normal distribution","volume":"76","author":"J Aitchison","year":"1989","journal-title":"Biometrika"},{"key":"pcbi.1009089.ref030","doi-asserted-by":"crossref","DOI":"10.1186\/s40168-017-0237-y","article-title":"Normalization and microbial differential abundance strategies depend upon data characteristics","volume":"5","author":"S Weiss","year":"2017","journal-title":"Microbiome"},{"issue":"4","key":"pcbi.1009089.ref031","doi-asserted-by":"crossref","first-page":"e1003531","DOI":"10.1371\/journal.pcbi.1003531","article-title":"Waste not, want not: why rarefying microbiome data is inadmissible","volume":"10","author":"PJ McMurdie","year":"2014","journal-title":"PLoS Comput 
Biol"},{"key":"pcbi.1009089.ref032","author":"A G\u00e9gout-Petit","year":"2019","journal-title":"Graph estimation for Gaussian data zero-inflated by double truncation"},{"issue":"6285","key":"pcbi.1009089.ref033","doi-asserted-by":"crossref","first-page":"565","DOI":"10.1126\/science.aad3369","article-title":"Population-based metagenomics analysis reveals markers for gut microbiome composition and diversity","volume":"352","author":"A Zhernakova","year":"2016","journal-title":"Science"},{"issue":"3","key":"pcbi.1009089.ref034","doi-asserted-by":"crossref","first-page":"e00020","DOI":"10.1128\/mSystems.00020-16","article-title":"MetaPalette: A K-mer painting approach for metagenomic taxonomic profiling and quantification of novel strain variation","volume":"1","author":"D Koslicki","year":"2016","journal-title":"MSystems"},{"key":"pcbi.1009089.ref035","volume-title":"An introduction to copulas","author":"RB Nelsen","year":"2007"},{"issue":"9","key":"pcbi.1009089.ref036","doi-asserted-by":"crossref","first-page":"e1002687","DOI":"10.1371\/journal.pcbi.1002687","article-title":"Inferring correlation networks from genomic survey data","volume":"8","author":"J Friedman","year":"2012","journal-title":"PLoS Comput Biol"},{"issue":"3","key":"pcbi.1009089.ref037","doi-asserted-by":"crossref","first-page":"286","DOI":"10.1016\/j.cels.2019.08.002","article-title":"Rapid inference of direct interactions in large-scale ecological networks from heterogeneous microbial sequencing data","volume":"9","author":"J Tackmann","year":"2019","journal-title":"Cell systems"},{"issue":"1","key":"pcbi.1009089.ref038","article-title":"Local causal and markov blanket induction for causal discovery and feature selection for classification part i: Algorithms and empirical evaluation","volume":"11","author":"CF Aliferis","year":"2010","journal-title":"Journal of Machine Learning 
Research"},{"issue":"7","key":"pcbi.1009089.ref039","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pcbi.1002606","article-title":"Microbial co-occurrence relationships in the human microbiome","volume":"8","author":"K Faust","year":"2012","journal-title":"PLoS computational biology"},{"key":"pcbi.1009089.ref040","doi-asserted-by":"crossref","first-page":"380","DOI":"10.1016\/j.isci.2019.11.032","article-title":"Co-existence of network architectures supporting the human gut microbiome","volume":"22","author":"CV Hall","year":"2019","journal-title":"iScience"},{"key":"pcbi.1009089.ref041","article-title":"The igraph software package for complex network research","author":"G Csardi","year":"2006","journal-title":"InterJournal"}],"updated-by":[{"DOI":"10.1371\/journal.pcbi.1009089","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2021,6,30]],"date-time":"2021-06-30T00:00:00Z","timestamp":1625011200000}}],"container-title":["PLOS Computational Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1009089","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,4]],"date-time":"2023-11-04T23:14:09Z","timestamp":1699139649000},"score":1,"resource":{"primary":{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1009089"}},"subtitle":[],"editor":[{"given":"Niranjan","family":"Nagarajan","sequence":"first","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2021,6,18]]},"references-count":41,"journal-issue":{"issue":"6","published-online":{"date-parts":[[2021,6,18]]}},"URL":"https:\/\/doi.org\/10.1371\/journal.pcbi.1009089","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2020.11.13.381384","asserted-by":"object"}]},"ISSN":["1553-7358"],"issn-type":[{"value":"1553-7358","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,6,18]]}}}