{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,9]],"date-time":"2026-06-09T09:16:51Z","timestamp":1780996611933,"version":"3.54.1"},"reference-count":49,"publisher":"Oxford University Press (OUP)","issue":"5","license":[{"start":{"date-parts":[[2023,8,10]],"date-time":"2023-08-10T00:00:00Z","timestamp":1691625600000},"content-version":"vor","delay-in-days":1,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","award":["R03-DE028983"],"award-info":[{"award-number":["R03-DE028983"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000072","name":"National Institute of Dental and Craniofacial Research","doi-asserted-by":"publisher","award":["U01-DE025046"],"award-info":[{"award-number":["U01-DE025046"]}],"id":[{"id":"10.13039\/100000072","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2023,9,20]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>Understanding the function of the human microbiome is important but the development of statistical methods specifically for the microbial gene expression (i.e. metatranscriptomics) is in its infancy. Many currently employed differential expression analysis methods have been designed for different data types and have not been evaluated in metatranscriptomics settings. To address this gap, we undertook a comprehensive evaluation and benchmarking of 10 differential analysis methods for metatranscriptomics data. We used a combination of real and simulated data to evaluate performance (i.e. type I error, false discovery rate and sensitivity) of the following methods: log-normal (LN), logistic-beta (LB), MAST, DESeq2, metagenomeSeq, ANCOM-BC, LEfSe, ALDEx2, Kruskal\u2013Wallis and two-part Kruskal\u2013Wallis. The simulation was informed by supragingival biofilm microbiome data from 300 preschool-age children enrolled in a study of childhood dental disease (early childhood caries, ECC), whereas validations were sought in two additional datasets from the ECC study and an inflammatory bowel disease study. The LB test showed the highest sensitivity in both small and large samples and reasonably controlled type I error. Contrarily, MAST was hampered by inflated type I error. Upon application of the LN and LB tests in the ECC study, we found that genes C8PHV7 and C8PEV7, harbored by the lactate-producing Campylobacter gracilis, had the strongest association with childhood dental disease. This comprehensive model evaluation offers practical guidance for selection of appropriate methods for rigorous analyses of differential expression in metatranscriptomics. Selection of an optimal method increases the possibility of detecting true signals while minimizing the chance of claiming false ones.<\/jats:p>","DOI":"10.1093\/bib\/bbad279","type":"journal-article","created":{"date-parts":[[2023,7,21]],"date-time":"2023-07-21T18:39:28Z","timestamp":1689964768000},"source":"Crossref","is-referenced-by-count":21,"title":["Comprehensive evaluation of methods for differential expression analysis of metatranscriptomics data"],"prefix":"10.1093","volume":"24","author":[{"given":"Hunyong","family":"Cho","sequence":"first","affiliation":[{"name":"Department of Biostatistics, University of North Carolina , Chapel Hill, NC , United States"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Yixiang","family":"Qu","sequence":"additional","affiliation":[{"name":"Department of Biostatistics, University of North Carolina , Chapel Hill, NC , United States"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Chuwen","family":"Liu","sequence":"additional","affiliation":[{"name":"Department of Biostatistics, University of North Carolina , Chapel Hill, NC , United States"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Boyang","family":"Tang","sequence":"additional","affiliation":[{"name":"Department of Statistics, University of Connecticut , Storrs, CT , United States"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Ruiqi","family":"Lyu","sequence":"additional","affiliation":[{"name":"School of Computer Science, Carnegie Mellon University , Pittsburgh, Pennsylvania , United States"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Bridget M","family":"Lin","sequence":"additional","affiliation":[{"name":"Department of Biostatistics, University of North Carolina , Chapel Hill, NC , United States"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jeffrey","family":"Roach","sequence":"additional","affiliation":[{"name":"Research Computing, University of North Carolina , Chapel Hill, NC , United States"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"M Andrea","family":"Azcarate-Peril","sequence":"additional","affiliation":[{"name":"Department of Medicine and Nutrition, University of North Carolina , Chapel Hill, NC , United States"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Apoena","family":"Aguiar Ribeiro","sequence":"additional","affiliation":[{"name":"Division of Diagnostic Sciences, University of North Carolina , Chapel Hill, NC , United States"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Michael I","family":"Love","sequence":"additional","affiliation":[{"name":"Department of Biostatistics, University of North Carolina , Chapel Hill, NC , United States"},{"name":"Department of Genetics, University of North Carolina , Chapel Hill, NC , United States"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Kimon","family":"Divaris","sequence":"additional","affiliation":[{"name":"Division of Pediatric and Public Health, University of North Carolina , Chapel Hill, NC , United States"},{"name":"Department of Epidemiology, University of North Carolina , Chapel Hill, NC , United States"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Di","family":"Wu","sequence":"additional","affiliation":[{"name":"Department of Biostatistics, University of North Carolina , Chapel Hill, NC , United States"},{"name":"Division of Oral and Craniofacial Health Sciences, Adam School of Dentistry, University of North Carolina , Chapel Hill, NC , United States"},{"name":"Lineberger Comprehensive Cancer Center, University of North Carolina , Chapel Hill, NC , United States"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"286","published-online":{"date-parts":[[2023,8,9]]},"reference":[{"issue":"10","key":"2023092216512370800_ref1","doi-asserted-by":"crossref","first-page":"3258","DOI":"10.1128\/JCM.01396-12","article-title":"Microbial dysbiosis in pediatric patients with crohn\u2019s disease","volume":"50","author":"Kaakoush","year":"2012","journal-title":"J Clin Microbiol"},{"issue":"6","key":"2023092216512370800_ref2","doi-asserted-by":"crossref","first-page":"2126","DOI":"10.1172\/JCI58109","article-title":"Gut microbiome, obesity, and metabolic dysfunction","volume":"121","author":"Tilg","year":"2011","journal-title":"J Clin Invest"},{"issue":"10","key":"2023092216512370800_ref3","doi-asserted-by":"crossref","first-page":"657","DOI":"10.1038\/sj.bdj.2016.865","article-title":"The oral microbiome\u2013an update for oral healthcare professionals","volume":"221","author":"Mogens Kilian","year":"2016","journal-title":"Br Dent J"},{"issue":"4","key":"2023092216512370800_ref4","doi-asserted-by":"crossref","first-page":"570","DOI":"10.1016\/j.ccell.2018.03.015","article-title":"The influence of the gut microbiome on cancer, immunity, and cancer immunotherapy","volume":"33","author":"Gopalakrishnan","year":"2018","journal-title":"Cancer Cell"},{"issue":"1","key":"2023092216512370800_ref5","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s41467-019-12476-z","article-title":"Interplay between the human gut microbiome and host metabolism","volume":"10","author":"Visconti","year":"2019","journal-title":"Nat Commun"},{"issue":"1","key":"2023092216512370800_ref6","doi-asserted-by":"crossref","first-page":"1586422","DOI":"10.1080\/20002297.2019.1586422","article-title":"Can oral bacteria affect the microbiome of the gut?","volume":"11","author":"Olsen","year":"2019","journal-title":"J Oral Microbiol"},{"issue":"7758","key":"2023092216512370800_ref7","doi-asserted-by":"crossref","first-page":"655","DOI":"10.1038\/s41586-019-1237-9","article-title":"Multi-omics of the gut microbial ecosystem in inflammatory bowel diseases","volume":"569","author":"Lloyd-Price","year":"2019","journal-title":"Nature"},{"key":"2023092216512370800_ref8","doi-asserted-by":"crossref","first-page":"108","DOI":"10.3389\/fcimb.2014.00108","article-title":"Functional expression of dental plaque microbiota","volume":"4","author":"Peterson","year":"2014","journal-title":"Front Cell Infect Microbiol"},{"issue":"8","key":"2023092216512370800_ref9","doi-asserted-by":"crossref","first-page":"1659","DOI":"10.1038\/ismej.2014.23","article-title":"Community-wide transcriptome of the oral microbiome in subjects with and without periodontitis","volume":"8","author":"Duran-Pinedo","year":"2014","journal-title":"ISME J"},{"issue":"1","key":"2023092216512370800_ref10","doi-asserted-by":"crossref","first-page":"28","DOI":"10.1111\/prd.12350","article-title":"Metatranscriptomic analyses of the oral microbiome","volume":"85","author":"Duran-Pinedo","year":"2021","journal-title":"Periodontol 2000"},{"issue":"1","key":"2023092216512370800_ref11","doi-asserted-by":"crossref","first-page":"228","DOI":"10.1186\/s13059-017-1359-z","article-title":"Experimental design and quantitative analysis of microbial community multiomics","volume":"18","author":"Mallick","year":"2017","journal-title":"Genome Biol"},{"issue":"6","key":"2023092216512370800_ref12","first-page":"1415","article-title":"Bioinformatics tools for quantitative and functional metagenome and metatranscriptome data analysis in microbes","volume":"19","author":"Niu","year":"2018","journal-title":"Brief Bioinform"},{"issue":"1","key":"2023092216512370800_ref13","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13059-016-1116-8","article-title":"Imp: a pipeline for reproducible reference-independent integrated metagenomic and metatranscriptomic analyses","volume":"17","author":"Narayanasamy","year":"2016","journal-title":"Genome Biol"},{"issue":"17","key":"2023092216512370800_ref14","doi-asserted-by":"crossref","first-page":"2611","DOI":"10.1093\/bioinformatics\/btw308","article-title":"A two-part mixed-effects model for analyzing longitudinal microbiome compositional data","volume":"32","author":"Chen","year":"2016","journal-title":"Bioinformatics"},{"key":"2023092216512370800_ref15","doi-asserted-by":"crossref","DOI":"10.1186\/s13059-020-02104-1","article-title":"Assessment of statistical methods from single cell, bulk rna-seq and metagenomics applied to microbiome data","volume":"21","author":"Calgaro","year":"2020","journal-title":"Genome Biol"},{"issue":"1","key":"2023092216512370800_ref16","doi-asserted-by":"crossref","first-page":"94","DOI":"10.1214\/19-AOAS1283","article-title":"Modeling microbial abundances and dysbiosis with beta-binomial regression","volume":"14","author":"Martin","year":"2020","journal-title":"Ann Appl Stat"},{"issue":"1","key":"2023092216512370800_ref17","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s12859-018-2189-z","article-title":"Samsa2: a standalone metatranscriptome analysis pipeline","volume":"19","author":"Westreich","year":"2018","journal-title":"BMC Bioinformatics"},{"issue":"9","key":"2023092216512370800_ref18","doi-asserted-by":"crossref","first-page":"367","DOI":"10.3390\/microorganisms7090367","article-title":"Sample preservation and storage significantly impact taxonomic and functional profiles in metaproteomics studies of the human gut microbiome","volume":"7","author":"Hickl","year":"2019","journal-title":"Microorganisms"},{"issue":"1","key":"2023092216512370800_ref19","first-page":"342","article-title":"Microbiome differential abundance methods produce different results across 38 datasets. Nature","volume":"13","author":"Nearing","year":"2022","journal-title":"Communications"},{"issue":"1","key":"2023092216512370800_ref20","doi-asserted-by":"crossref","first-page":"130","DOI":"10.1186\/s40168-022-01320-0","article-title":"A comprehensive evaluation of microbial differential abundance analysis methods: current status and potential solutions. Microbiome","volume":"10","author":"Yang","year":"2022","journal-title":"Microbiome"},{"issue":"1","key":"2023092216512370800_ref21","doi-asserted-by":"crossref","first-page":"60","DOI":"10.1038\/s41522-020-00160-w","article-title":"Analysis of microbial compositions: a review of normalization and differential abundance analysis","volume":"6","author":"Lin","year":"2020","journal-title":"NPJ Biofilms Microbiomes"},{"issue":"9","key":"2023092216512370800_ref22","doi-asserted-by":"crossref","first-page":"e1010467","DOI":"10.1371\/journal.pcbi.1010467","article-title":"Investigating differential abundance methods in microbiome data: a benchmark study","volume":"18","author":"Cappellato","year":"2022","journal-title":"PLoS Comput Biol"},{"issue":"1","key":"2023092216512370800_ref23","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1186\/s40168-017-0237-y","article-title":"Normalization and microbial differential abundance strategies depend upon data characteristics","volume":"5","author":"Weiss","year":"2017","journal-title":"Microbiome"},{"issue":"1","key":"2023092216512370800_ref24","doi-asserted-by":"crossref","first-page":"210","DOI":"10.1093\/bib\/bbx104","article-title":"A broken promise: microbiome differential abundance methods do not control the false discovery rate","volume":"20","author":"Hawinkel","year":"2019","journal-title":"Brief Bioinform"},{"issue":"12","key":"2023092216512370800_ref25","doi-asserted-by":"crossref","first-page":"1200","DOI":"10.1038\/nmeth.2658","article-title":"Differential abundance analysis for microbial marker-gene surveys","volume":"10","author":"Paulson","year":"2013","journal-title":"Nat Methods"},{"issue":"Supplement_1","key":"2023092216512370800_ref26","doi-asserted-by":"crossref","first-page":"i34","DOI":"10.1093\/bioinformatics\/btab327","article-title":"Statistical approaches for differential expression analysis in metatranscriptomics","volume":"37","author":"Zhang","year":"2021","journal-title":"Bioinformatics"},{"key":"2023092216512370800_ref27","volume-title":"Genome Biology"},{"issue":"2","key":"2023092216512370800_ref28","doi-asserted-by":"crossref","first-page":"102","DOI":"10.1089\/cmb.2015.0157","article-title":"Zero-inflated beta regression for differential abundance analysis with metagenomics data","volume":"23","author":"Peng","year":"2016","journal-title":"J Comput Biol"},{"key":"2023092216512370800_ref29","doi-asserted-by":"crossref","first-page":"525","DOI":"10.1007\/978-1-4939-9012-2_40","article-title":"The supragingival biofilm in early childhood caries: clinical and laboratory protocols and bioinformatics pipelines supporting oral metagenomics, metatranscriptomics and metabolomics studies of the oral microbiome","volume":"1922","author":"Divaris","year":"2019","journal-title":"Methods Mol Biol"},{"issue":"21","key":"2023092216512370800_ref30","doi-asserted-by":"crossref","first-page":"8056","DOI":"10.3390\/ijerph17218056","article-title":"Cohort profile: Zoe 2.0\u2014a community-based, genetic epidemiologic study of early childhood oral health","volume":"17","author":"Divaris","year":"2020","journal-title":"Int J Environ Res Public Health"},{"issue":"12","key":"2023092216512370800_ref31","doi-asserted-by":"crossref","first-page":"550","DOI":"10.1186\/s13059-014-0550-8","article-title":"Moderated estimation of fold change and dispersion for rna-seq data with deseq2","volume":"15","author":"Love","year":"2014","journal-title":"Genome Biol"},{"issue":"1","key":"2023092216512370800_ref32","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13059-018-1406-4","article-title":"Observation weights unlock bulk rna-seq tools for zero inflation and single-cell applications","volume":"19","author":"Van den Berge","year":"2018","journal-title":"Genome Biol"},{"issue":"1","key":"2023092216512370800_ref33","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s41467-020-17041-7","article-title":"Analysis of compositions of microbiomes with bias correction","volume":"11","author":"Lin","year":"2020","journal-title":"Nat Commun"},{"issue":"6","key":"2023092216512370800_ref34","doi-asserted-by":"crossref","first-page":"R60","DOI":"10.1186\/gb-2011-12-6-r60","article-title":"Metagenomic Biomarker Discovery and Explanation","volume":"12","author":"Segata","year":"2011","journal-title":"Genome Biol"},{"issue":"1","key":"2023092216512370800_ref35","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/2049-2618-2-15","article-title":"Unifying the analysis of high-throughput sequencing datasets:characterizing rna-seq, 16s rrna gene sequencing and selective growth experiments by compositional data analysis","volume":"2","author":"Fernandes","year":"2014","journal-title":"Microbiome"},{"issue":"5","key":"2023092216512370800_ref36","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pone.0020296","article-title":"Application of two-part statistics for comparison of sequence variant counts","volume":"6","author":"Wagner","year":"2011","journal-title":"PloS one"},{"issue":"1","key":"2023092216512370800_ref37","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s41467-017-02554-5","article-title":"A general and flexible method for signal extraction from single-cell rna-seq data","volume":"9","author":"Risso","year":"2018","journal-title":"Nat Commun"},{"issue":"1","key":"2023092216512370800_ref38","first-page":"27663","article-title":"Analysis of composition of microbiomes: a novel method for studying microbial composition","volume":"26","author":"Mandal","year":"2015","journal-title":"Microb Ecol Health Dis"},{"issue":"2","key":"2023092216512370800_ref39","first-page":"72","article-title":"Early childhood caries: Iapd Bangkok declaration","volume":"86","author":"Pitts","year":"2019","journal-title":"J Dent Child (Chic)"},{"key":"2023092216512370800_ref40","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4939-9012-2_39","article-title":"Measurement of early childhood oral health for research purposes: dental caries experience and developmental defects of the enamel in the primary dentition","volume-title":"Methods Mol. Biol","author":"Ginnis"},{"key":"2023092216512370800_ref41","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13059-019-1891-0","article-title":"Improved metagenomic analysis with kraken 2","volume":"20","author":"Wood","year":"2019","journal-title":"Genome Biol"},{"key":"2023092216512370800_ref42","doi-asserted-by":"crossref","first-page":"e104","DOI":"10.7717\/peerj-cs.104","article-title":"Bracken: estimating species abundance in metagenomics data","volume":"3","author":"Jennifer","year":"2017","journal-title":"PeerJ Comput Sci"},{"issue":"11","key":"2023092216512370800_ref43","doi-asserted-by":"crossref","first-page":"962","DOI":"10.1038\/s41592-018-0176-y","article-title":"Species-level functional profiling of metagenomes and metatranscriptomes","volume":"15","author":"Franzosa","year":"2018","journal-title":"Nat Methods"},{"issue":"10","key":"2023092216512370800_ref44","doi-asserted-by":"crossref","first-page":"902","DOI":"10.1038\/nmeth.3589","article-title":"Metaphlan2 for enhanced metagenomic taxonomic profiling","volume":"12","author":"Truong","year":"2015","journal-title":"Nat Methods"},{"issue":"D1","key":"2023092216512370800_ref45","doi-asserted-by":"crossref","first-page":"D480","DOI":"10.1093\/nar\/gkaa1100","article-title":"Uniprot: the universal protein knowledgebase in 2021","volume":"49","year":"2021","journal-title":"Nucleic Acids Res"},{"issue":"318","key":"2023092216512370800_ref46","doi-asserted-by":"crossref","first-page":"399","DOI":"10.1080\/01621459.1967.10482916","article-title":"On the kolmogorov-smirnov test for normality with mean and variance unknown","volume":"62","author":"Lilliefors","year":"1967","journal-title":"J Am Stat Assoc"},{"issue":"2","key":"2023092216512370800_ref47","doi-asserted-by":"crossref","first-page":"279","DOI":"10.1214\/aoms\/1177730256","article-title":"Table for estimating the goodness of fit of empirical distributions","volume":"19","author":"Smirnov","year":"1948","journal-title":"Ann Math Stat"},{"issue":"9","key":"2023092216512370800_ref48","doi-asserted-by":"crossref","first-page":"2392","DOI":"10.1093\/gbe\/evu195","article-title":"Genomic evidence for the emergence and evolution of pathogenicity and niche preferences in the genus campylobacter","volume":"6","author":"Iraola","year":"2014","journal-title":"Genome Biol Evol"},{"issue":"360a","key":"2023092216512370800_ref49","doi-asserted-by":"crossref","first-page":"851","DOI":"10.1080\/01621459.1977.10479969","article-title":"Wald\u2019s test as applied to hypotheses in logit analysis","volume":"72","author":"Hauck Jr","year":"1977","journal-title":"J Am Stat Assoc"}],"container-title":["Briefings in Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/24\/5\/bbad279\/51711456\/bbad279.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bib\/article-pdf\/24\/5\/bbad279\/51711456\/bbad279.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,9,22]],"date-time":"2023-09-22T13:13:19Z","timestamp":1695388399000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bib\/article\/doi\/10.1093\/bib\/bbad279\/7239897"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,8,9]]},"references-count":49,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2023,9,20]]}},"URL":"https:\/\/doi.org\/10.1093\/bib\/bbad279","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2021.07.14.452374","asserted-by":"object"}]},"ISSN":["1467-5463","1477-4054"],"issn-type":[{"value":"1467-5463","type":"print"},{"value":"1477-4054","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2023,9]]},"published":{"date-parts":[[2023,8,9]]},"article-number":"bbad279"}}