{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,24]],"date-time":"2026-01-24T07:58:47Z","timestamp":1769241527493,"version":"3.49.0"},"reference-count":24,"publisher":"Oxford University Press (OUP)","issue":"14","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2015,7,15]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Motivation: The analysis of differential abundance for features (e.g. species or genes) can provide us with a better understanding of microbial communities, thus increasing our comprehension and understanding of the behaviors of microbial communities. However, it could also mislead us about the characteristics of microbial communities if the abundances or counts of features on different scales are not properly normalized within and between communities, prior to the analysis of differential abundance. Normalization methods used in the differential analysis typically try to adjust counts on different scales to a common scale using the total sum, mean or median of representative features across all samples. These methods often yield undesirable results when the difference in total counts of differentially abundant features (DAFs) across different conditions is large.<\/jats:p><jats:p>Results: We develop a novel method, Ratio Approach for Identifying Differential Abundance (RAIDA), which utilizes the ratio between features in a modified zero-inflated lognormal model. RAIDA removes possible problems associated with counts on different scales within and between conditions. As a result, its performance is not affected by the amount of difference in total abundances of DAFs across different conditions. Through comprehensive simulation studies, the performance of our method is consistently powerful, and under some situations, RAIDA greatly surpasses other existing methods. We also apply RAIDA on real datasets of type II diabetes and find interesting results consistent with previous reports.<\/jats:p><jats:p>Availability and implementation: An R package for RAIDA can be accessed from http:\/\/cals.arizona.edu\/%7Eanling\/sbg\/software.htm.<\/jats:p><jats:p>Contact: \u00a0anling@email.arizona.edu<\/jats:p><jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online.<\/jats:p>","DOI":"10.1093\/bioinformatics\/btv165","type":"journal-article","created":{"date-parts":[[2015,3,20]],"date-time":"2015-03-20T17:01:45Z","timestamp":1426870905000},"page":"2269-2275","source":"Crossref","is-referenced-by-count":55,"title":["A robust approach for identifying differentially abundant features in metagenomic samples"],"prefix":"10.1093","volume":"31","author":[{"given":"Michael B.","family":"Sohn","sequence":"first","affiliation":[{"name":"1 Interdisciplinary Program in Statistics and"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ruofei","family":"Du","sequence":"additional","affiliation":[{"name":"2 Department of Agricultural and Biosystems Engineering, University of Arizona, Tucson, AZ 85721, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Lingling","family":"An","sequence":"additional","affiliation":[{"name":"1 Interdisciplinary Program in Statistics and"},{"name":"2 Department of Agricultural and Biosystems Engineering, University of Arizona, Tucson, AZ 85721, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2015,3,19]]},"reference":[{"key":"2023020202154854200_btv165-B1","first-page":"363","article-title":"The Bhattacharyya metric as an absolute similarity measure for frequency coded data","volume":"34","author":"Aherne","year":"1998","journal-title":"Kybernetika"},{"key":"2023020202154854200_btv165-B2","doi-asserted-by":"crossref","DOI":"10.1007\/978-94-009-4109-0","volume-title":"The Statistical Analysis of Compositional Data","author":"Aitchison","year":"1986","edition":"1st edn"},{"key":"2023020202154854200_btv165-B3","doi-asserted-by":"crossref","first-page":"R106","DOI":"10.1186\/gb-2010-11-10-r106","article-title":"Differential expression analysis for sequence count data","volume":"11","author":"Anders","year":"2010","journal-title":"Genome Biol."},{"key":"2023020202154854200_btv165-B4","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1111\/j.2517-6161.1995.tb02031.x","article-title":"Controlling the false discovery rate: a practical and powerful approach to multiple testing","volume":"57","author":"Benjamini","year":"1995","journal-title":"J. R. Stat. Soc. B"},{"key":"2023020202154854200_btv165-B5","doi-asserted-by":"crossref","first-page":"1075","DOI":"10.1198\/jasa.2011.tm10183","article-title":"Hierarchical clustering with prototypes via minimax linkage","volume":"106","author":"Bien","year":"2011","journal-title":"J. Am. Stat. Assoc."},{"key":"2023020202154854200_btv165-B6","doi-asserted-by":"crossref","first-page":"1190","DOI":"10.1137\/0916069","article-title":"A limited memory algorithm for bound constrained optimization","volume":"16","author":"Byrd","year":"1995","journal-title":"SIAM J. Sci. Comput."},{"key":"2023020202154854200_btv165-B7","doi-asserted-by":"crossref","first-page":"773","DOI":"10.1109\/PROC.1979.11327","article-title":"Image segmentation by clustering","volume":"67","author":"Coleman","year":"1979","journal-title":"Proc IEEE"},{"key":"2023020202154854200_btv165-B8","doi-asserted-by":"crossref","first-page":"671","DOI":"10.1093\/bib\/bbs046","article-title":"A comprehensive evaluation of normalization methods for Illumina high-throughput RNA sequencing data analysis","volume":"14","author":"Dillies","year":"2013","journal-title":"Brief. Bioinform."},{"key":"2023020202154854200_btv165-B9","doi-asserted-by":"crossref","first-page":"4399","DOI":"10.1128\/AEM.67.10.4399-4406.2001","article-title":"Counting the uncountable: statistical approaches to estimating microbial diversity","volume":"67","author":"Hughes","year":"2001","journal-title":"Appl. Environ. Microbiol."},{"key":"2023020202154854200_btv165-B10","doi-asserted-by":"crossref","first-page":"52","DOI":"10.1109\/TCOM.1967.1089532","article-title":"The divergence and Bhattacharyya distance measures in signal selection","volume":"15","author":"Kailath","year":"1967","journal-title":"IEEE Trans. Commun."},{"key":"2023020202154854200_btv165-B11","doi-asserted-by":"crossref","first-page":"2341","DOI":"10.2337\/diacare.27.10.2341","article-title":"The treatment of diabetic gastroparesis with botulinum toxin injection of the pylorus","volume":"27","author":"Lacy","year":"2004","journal-title":"Diabetes Care"},{"key":"2023020202154854200_btv165-B12","doi-asserted-by":"crossref","first-page":"e9085","DOI":"10.1371\/journal.pone.0009085","article-title":"Gut microbiota in human adults with type 2 diabetes differs from non-diabetic adults","volume":"5","author":"Larsen","year":"2010","journal-title":"PLoS One"},{"key":"2023020202154854200_btv165-B13","doi-asserted-by":"crossref","first-page":"1200","DOI":"10.1038\/nmeth.2658","article-title":"Differential abundance analysis for microbial marker\u2013gene surveys","volume":"10","author":"Paulson","year":"2013","journal-title":"Nat. Methods"},{"key":"2023020202154854200_btv165-B14","doi-asserted-by":"crossref","first-page":"55","DOI":"10.1038\/nature11450","article-title":"A metagenome-wide association study of gut microbiota in type 2 diabetes","volume":"490","author":"Qin","year":"2012","journal-title":"Nature"},{"key":"2023020202154854200_btv165-B15","doi-asserted-by":"crossref","first-page":"812","DOI":"10.1016\/j.patcog.2005.12.003","article-title":"The Bhattacharyya space for feature selection and its application to texture segmentation","volume":"39","author":"Reyes-Aldasoroa","year":"2006","journal-title":"Pattern Recognit."},{"key":"2023020202154854200_btv165-B16","doi-asserted-by":"crossref","first-page":"R25","DOI":"10.1186\/gb-2010-11-3-r25","article-title":"A scaling normalization method for differential expression analysis of RNA-seq data","volume":"11","author":"Robinson","year":"2010","journal-title":"Genome Biol."},{"key":"2023020202154854200_btv165-B17","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1093\/bioinformatics\/btp616","article-title":"edgeR: a Bioconductor package for differential expression analysis of digital gene expression data","volume":"26","author":"Robinson","year":"2010","journal-title":"Bioinformatics"},{"key":"2023020202154854200_btv165-B18","doi-asserted-by":"crossref","first-page":"229","DOI":"10.1186\/gb-2005-6-8-229","article-title":"Metagenomics for studying unculturable microorganisms: cutting the Gordian knot","volume":"6","author":"Schloss","year":"2005","journal-title":"Genome Biol."},{"key":"2023020202154854200_btv165-B19","doi-asserted-by":"crossref","first-page":"397","DOI":"10.1007\/0-387-29362-0_23","article-title":"Limma: linear models for microarray data","volume-title":"Bioinformatics and Computational Biology Solutions using R and Bioconductor","author":"Smyth","year":"2005"},{"key":"2023020202154854200_btv165-B20","doi-asserted-by":"crossref","first-page":"242","DOI":"10.1186\/1471-2105-15-242","article-title":"Accurate genome relative abundance estimation for closely related species in a metagenomic sample","volume":"15","author":"Sohn","year":"2014","journal-title":"BMC Bioinformatics"},{"key":"2023020202154854200_btv165-B21","doi-asserted-by":"crossref","first-page":"3","DOI":"10.1186\/2042-5783-2-3","article-title":"Metagenomics - a guide from sampling to data analysis","volume":"2","author":"Thomas","year":"2012","journal-title":"Microb. Inform. Exp."},{"key":"2023020202154854200_btv165-B22","doi-asserted-by":"crossref","first-page":"44","DOI":"10.1016\/j.cell.2011.09.009","article-title":"Metagenomics and Personalized Medicine","volume":"147","author":"Virgin","year":"2011","journal-title":"Cell"},{"key":"2023020202154854200_btv165-B23","doi-asserted-by":"crossref","first-page":"e1000352","DOI":"10.1371\/journal.pcbi.1000352","article-title":"Statistical methods for detecting differentially abundant features in clinical metagenomic samples","volume":"5","author":"White","year":"2009","journal-title":"PLoS Comput. Biol."},{"key":"2023020202154854200_btv165-B24","doi-asserted-by":"crossref","first-page":"1473","DOI":"10.1212\/01.wnl.0000345968.05959.cf","article-title":"Botulinum toxin for diabetic neuropathic pain: A randomized double-blind crossover trial","volume":"72","author":"Yuan","year":"2009","journal-title":"Neurology"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/31\/14\/2269\/49034997\/bioinformatics_31_14_2269.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/31\/14\/2269\/49034997\/bioinformatics_31_14_2269.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,6,7]],"date-time":"2024-06-07T23:19:06Z","timestamp":1717802346000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/31\/14\/2269\/256302"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2015,3,19]]},"references-count":24,"journal-issue":{"issue":"14","published-print":{"date-parts":[[2015,7,15]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btv165","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2015,7,15]]},"published":{"date-parts":[[2015,3,19]]}}}