{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,11]],"date-time":"2026-04-11T06:14:55Z","timestamp":1775888095508,"version":"3.50.1"},"reference-count":40,"publisher":"Oxford University Press (OUP)","issue":"7","license":[{"start":{"date-parts":[[2023,6,28]],"date-time":"2023-06-28T00:00:00Z","timestamp":1687910400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","award":["GM123056"],"award-info":[{"award-number":["GM123056"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","award":["GM129781"],"award-info":[{"award-number":["GM129781"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2023,7,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Motivation<\/jats:title><jats:p>Quantification of microbial covariations from 16S rRNA and metagenomic sequencing data is difficult due to their sparse nature. In this article, we propose using copula models with mixed zero-beta margins for the estimation of taxon\u2013taxon covariations using data of normalized microbial relative abundances. Copulas allow for separate modeling of the dependence structure from the margins, marginal covariate adjustment, and uncertainty measurement.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>Our method shows that a two-stage maximum-likelihood approach provides accurate estimation of model parameters. A corresponding two-stage likelihood ratio test for the dependence parameter is derived and is used for constructing covariation networks. Simulation studies show that the test is valid, robust, and more powerful than tests based upon Pearson\u2019s and rank correlations. Furthermore, we demonstrate that our method can be used to build biologically meaningful microbial networks based on a dataset from the American Gut Project.<\/jats:p><\/jats:sec><jats:sec><jats:title>Availability and implementation<\/jats:title><jats:p>R package for implementation is available at https:\/\/github.com\/rebeccadeek\/CoMiCoN.<\/jats:p><\/jats:sec>","DOI":"10.1093\/bioinformatics\/btad413","type":"journal-article","created":{"date-parts":[[2023,6,28]],"date-time":"2023-06-28T16:20:56Z","timestamp":1687969256000},"source":"Crossref","is-referenced-by-count":6,"title":["Inference of microbial covariation networks using copula models with mixture margins"],"prefix":"10.1093","volume":"39","author":[{"given":"Rebecca A","family":"Deek","sequence":"first","affiliation":[{"name":"Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania , Philadelphia, PA 19104, United States"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3662-3907","authenticated-orcid":false,"given":"Hongzhe","family":"Li","sequence":"additional","affiliation":[{"name":"Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania , Philadelphia, PA 19104, United States"}]}],"member":"286","published-online":{"date-parts":[[2023,6,28]]},"reference":[{"key":"2023071202311943600_btad413-B1","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1111\/j.2517-6161.1982.tb01195.x","article-title":"The statistical analysis of compositional data","volume":"44","author":"Aitchison","year":"1982","journal-title":"J R Stat Soc B"},{"key":"2023071202311943600_btad413-B2","doi-asserted-by":"crossref","first-page":"343","DOI":"10.1038\/ismej.2011.119","article-title":"Using network analysis to explore co-occurrence patterns in soil microbial communities","volume":"6","author":"Barber\u00e1n","year":"2012","journal-title":"ISME J"},{"key":"2023071202311943600_btad413-B3","doi-asserted-by":"crossref","first-page":"1165","DOI":"10.1214\/aos\/1013699998","article-title":"The control of the false discovery rate in multiple testing under dependency","volume":"29","author":"Benjamini","year":"2001","journal-title":"Ann Stat"},{"key":"2023071202311943600_btad413-B4","volume-title":"Fungi in Biological Control Systems","author":"Burge","year":"1988"},{"key":"2023071202311943600_btad413-B5","doi-asserted-by":"crossref","first-page":"759","DOI":"10.1080\/01621459.2018.1442340","article-title":"Large covariance estimation for compositional data via composition-adjusted thresholding","volume":"114","author":"Cao","year":"2019","journal-title":"J Am Stat Assoc"},{"key":"2023071202311943600_btad413-B6","doi-asserted-by":"crossref","first-page":"2611","DOI":"10.1093\/bioinformatics\/btw308","article-title":"A two-part mixed-effects model for analyzing longitudinal microbiome compositional data","volume":"32","author":"Chen","year":"2016","journal-title":"Bioinformatics"},{"key":"2023071202311943600_btad413-B7","doi-asserted-by":"crossref","first-page":"538","DOI":"10.1038\/nrmicro2832","article-title":"Microbial interactions: from networks to models","volume":"10","author":"Faust","year":"2012","journal-title":"Nat Rev Microbiol"},{"key":"2023071202311943600_btad413-B8","doi-asserted-by":"crossref","first-page":"e1002606","DOI":"10.1371\/journal.pcbi.1002606","article-title":"Microbial co-occurrence relationships in the human microbiome","volume":"8","author":"Faust","year":"2012","journal-title":"PLoS Comput Biol"},{"key":"2023071202311943600_btad413-B9","doi-asserted-by":"crossref","first-page":"e1002687","DOI":"10.1371\/journal.pcbi.1002687","article-title":"Inferring correlation networks from genomic survey data","volume":"8","author":"Friedman","year":"2012","journal-title":"PLoS Comput Biol"},{"key":"2023071202311943600_btad413-B10","doi-asserted-by":"crossref","first-page":"543","DOI":"10.1093\/biomet\/82.3.543","article-title":"A semiparametric estimation procedure of dependence parameters in multivariate families of distributions","volume":"82","author":"Genest","year":"1995","journal-title":"Biometrika"},{"key":"2023071202311943600_btad413-B11","doi-asserted-by":"crossref","first-page":"4131","DOI":"10.1016\/j.febslet.2014.02.037","article-title":"The dynamic microbiome","volume":"588","author":"Gerber","year":"2014","journal-title":"FEBS Lett"},{"key":"2023071202311943600_btad413-B12","doi-asserted-by":"crossref","first-page":"594","DOI":"10.1073\/pnas.1116053109","article-title":"Metagenomic systems biology of the human gut microbiome reveals topological shifts associated with obesity and inflammatory bowel disease","volume":"109","author":"Greenblum","year":"2012","journal-title":"Proc Natl Acad Sci USA"},{"key":"2023071202311943600_btad413-B13","doi-asserted-by":"crossref","first-page":"137","DOI":"10.1080\/07350015.2018.1469998","article-title":"Mixed marginal copula modeling","volume":"38","author":"Gunawan","year":"2020","journal-title":"J Bus Econ Stat"},{"key":"2023071202311943600_btad413-B14","doi-asserted-by":"crossref","first-page":"188","DOI":"10.1186\/s12859-019-2744-2","article-title":"metamicrobiomeR: an R package for analysis of microbiome relative abundance data using zero-inflated beta GAMLSS and meta-analysis across studies using random effects models","volume":"20","author":"Ho","year":"2019","journal-title":"BMC Bioinformatics"},{"key":"2023071202311943600_btad413-B15","volume-title":"Multivariate Models and Multivariate Dependence Concepts","author":"Joe","year":"1997"},{"key":"2023071202311943600_btad413-B16","doi-asserted-by":"crossref","first-page":"401","DOI":"10.1016\/j.jmva.2004.06.003","article-title":"Asymptotic efficiency of the two-stage estimation method for copula-based models","volume":"94","author":"Joe","year":"2005","journal-title":"J Multivar Anal"},{"key":"2023071202311943600_btad413-B17","author":"Joe","year":"1996"},{"key":"2023071202311943600_btad413-B18","doi-asserted-by":"crossref","first-page":"e1004226","DOI":"10.1371\/journal.pcbi.1004226","article-title":"Sparse and compositionally robust inference of microbial ecological networks","volume":"11","author":"Kurtz","year":"2015","journal-title":"PLoS Comput Biol"},{"key":"2023071202311943600_btad413-B19","first-page":"8","article-title":"\u2018Ome sweet \u2018omics - a genealogical treasury of words","volume":"15","author":"Lederberg","year":"2001","journal-title":"Scientist"},{"key":"2023071202311943600_btad413-B20","doi-asserted-by":"crossref","first-page":"73","DOI":"10.1146\/annurev-statistics-010814-020351","article-title":"Microbiome, metagenomics, and high-dimensional compositional data analysis","volume":"2","author":"Li","year":"2015","journal-title":"Annu Rev Stat Appl"},{"key":"2023071202311943600_btad413-B21","doi-asserted-by":"crossref","first-page":"785","DOI":"10.1111\/j.2517-6161.1996.tb02116.x","article-title":"On the asymptotic behaviour of the pseudolikelihood ratio test statistic","volume":"58","author":"Liang","year":"1996","journal-title":"J R Stat Soc B"},{"key":"2023071202311943600_btad413-B22","doi-asserted-by":"crossref","first-page":"253","DOI":"10.1214\/18-STS681","article-title":"Statistical analysis of zero-inflated nonnegative continuous data: a review","volume":"34","author":"Liu","year":"2019","journal-title":"Stat Sci"},{"key":"2023071202311943600_btad413-B23","doi-asserted-by":"crossref","DOI":"10.1128\/mSystems.00031-18","article-title":"American gut: an open platform for citizen science microbiome research","volume":"3","author":"McDonald","year":"2018","journal-title":"mSystems"},{"key":"2023071202311943600_btad413-B24","doi-asserted-by":"crossref","first-page":"e1003531","DOI":"10.1371\/journal.pcbi.1003531","article-title":"Waste not, want not: why rarefying microbiome data is inadmissible","volume":"10","author":"McMurdie","year":"2014","journal-title":"PLoS Comput Biol"},{"key":"2023071202311943600_btad413-B25","doi-asserted-by":"crossref","first-page":"5070","DOI":"10.1002\/sim.7050","article-title":"Modeling zero-modified count and semicontinuous data in health services research part 1: background and overview","volume":"35","author":"Neelon","year":"2016","journal-title":"Stat Med"},{"key":"2023071202311943600_btad413-B26","doi-asserted-by":"crossref","first-page":"1609","DOI":"10.1016\/j.csda.2011.10.005","article-title":"A general class of zero-or-one inflated beta regression models","volume":"56","author":"Ospina","year":"2012","journal-title":"Comput Stat Data Anal"},{"key":"2023071202311943600_btad413-B27","doi-asserted-by":"crossref","first-page":"1200","DOI":"10.1038\/nmeth.2658","article-title":"Differential abundance analysis for microbial marker-gene surveys","volume":"10","author":"Paulson","year":"2013","journal-title":"Nat Methods"},{"key":"2023071202311943600_btad413-B28","doi-asserted-by":"crossref","first-page":"102","DOI":"10.1089\/cmb.2015.0157","article-title":"Zero-Inflated beta regression for differential abundance analysis with metagenomics data","volume":"23","author":"Peng","year":"2016","journal-title":"J Comput Biol"},{"key":"2023071202311943600_btad413-B29","doi-asserted-by":"crossref","first-page":"470","DOI":"10.1214\/aoms\/1177729394","article-title":"Remarks on a multivariate transformation","volume":"23","author":"Rosenblatt","year":"1952","journal-title":"Ann Math Stat"},{"key":"2023071202311943600_btad413-B30","doi-asserted-by":"crossref","first-page":"351","DOI":"10.1111\/j.1467-9868.2010.00766.x","article-title":"Regression for compositional data by using distributions defined on the hypersphere","volume":"73","author":"Scealy","year":"2011","journal-title":"J R Stat Soc B"},{"key":"2023071202311943600_btad413-B31","doi-asserted-by":"crossref","first-page":"569","DOI":"10.1038\/nrrheum.2011.121","article-title":"The microbiome and rheumatoid arthritis","volume":"7","author":"Scher","year":"2011","journal-title":"Nat Rev Rheumatol"},{"key":"2023071202311943600_btad413-B32","doi-asserted-by":"crossref","first-page":"1384","DOI":"10.2307\/2533269","article-title":"Inferences on the association parameter in copula models for bivariate survival data","volume":"51","author":"Shih","year":"1995","journal-title":"Biometrics"},{"key":"2023071202311943600_btad413-B33","first-page":"229","article-title":"Fonctions de repartition an dimensions et leurs marges","volume":"8","author":"Sklar","year":"1959","journal-title":"Publ Inst Stat Univ Paris"},{"key":"2023071202311943600_btad413-B34","doi-asserted-by":"crossref","first-page":"4244","DOI":"10.1016\/j.febslet.2014.05.034","article-title":"Arthritis susceptibility and the gut microbiome","volume":"588","author":"Taneja","year":"2014","journal-title":"FEBS Lett"},{"key":"2023071202311943600_btad413-B35","doi-asserted-by":"crossref","first-page":"804","DOI":"10.1038\/nature06244","article-title":"The human microbiome project","volume":"449","author":"Turnbaugh","year":"2007","journal-title":"Nature"},{"key":"2023071202311943600_btad413-B36","doi-asserted-by":"crossref","first-page":"e1000352","DOI":"10.1371\/journal.pcbi.1000352","article-title":"Statistical methods for detecting differentially abundant features in clinical metagenomic samples","volume":"5","author":"White","year":"2009","journal-title":"PLoS Comput Biol"},{"key":"2023071202311943600_btad413-B37","doi-asserted-by":"crossref","first-page":"12799","DOI":"10.1073\/pnas.1411723111","article-title":"Fluvial network organization imprints on microbial co-occurrence networks","volume":"111","author":"Widder","year":"2014","journal-title":"Proc Natl Acad Sci USA"},{"key":"2023071202311943600_btad413-B38","doi-asserted-by":"crossref","DOI":"10.3389\/fmicb.2014.00358","article-title":"Demonstrating microbial co-occurrence pattern analyses within and between ecosystems","volume":"5","author":"Williams","year":"2014","journal-title":"Front Microbiol"},{"key":"2023071202311943600_btad413-B39","doi-asserted-by":"crossref","DOI":"10.3389\/fgene.2019.00516","article-title":"Microbial networks in SPRING - semi-parametric rank-based correlation and partial correlation estimation for quantitative microbiome data","volume":"10","author":"Yoon","year":"2019","journal-title":"Front Genet"},{"key":"2023071202311943600_btad413-B40","doi-asserted-by":"crossref","DOI":"10.2202\/1544-6115.1128","article-title":"A general framework for weighted gene co-expression network analysis","volume":"4","author":"Zhang","year":"2005","journal-title":"Stat Appl Genet Mol Biol"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btad413\/50731762\/btad413.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/39\/7\/btad413\/50856779\/btad413.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/39\/7\/btad413\/50856779\/btad413.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,10,23]],"date-time":"2024-10-23T04:49:30Z","timestamp":1729658970000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btad413\/7209520"}},"subtitle":[],"editor":[{"given":"Pier Luigi","family":"Martelli","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2023,6,28]]},"references-count":40,"journal-issue":{"issue":"7","published-print":{"date-parts":[[2023,7,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btad413","relation":{},"ISSN":["1367-4811"],"issn-type":[{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2023,7,1]]},"published":{"date-parts":[[2023,6,28]]},"article-number":"btad413"}}