{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,25]],"date-time":"2026-03-25T20:30:43Z","timestamp":1774470643424,"version":"3.50.1"},"update-to":[{"DOI":"10.1371\/journal.pcbi.1010284","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2022,7,21]],"date-time":"2022-07-21T00:00:00Z","timestamp":1658361600000}}],"reference-count":48,"publisher":"Public Library of Science (PLoS)","issue":"7","license":[{"start":{"date-parts":[[2022,7,11]],"date-time":"2022-07-11T00:00:00Z","timestamp":1657497600000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100000854","name":"Human Frontier Science Program","doi-asserted-by":"publisher","award":["HFSP RGP005"],"award-info":[{"award-number":["HFSP RGP005"]}],"id":[{"id":"10.13039\/501100000854","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100008982","name":"National Science Foundation","doi-asserted-by":"publisher","award":["NSF DMS 17-13012"],"award-info":[{"award-number":["NSF DMS 17-13012"]}],"id":[{"id":"10.13039\/501100008982","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100008982","name":"National Science Foundation","doi-asserted-by":"publisher","award":["NSF BCS 1552848"],"award-info":[{"award-number":["NSF BCS 1552848"]}],"id":[{"id":"10.13039\/501100008982","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100008982","name":"National Science Foundation","doi-asserted-by":"publisher","award":["NSF DBI 1661386"],"award-info":[{"award-number":["NSF DBI 1661386"]}],"id":[{"id":"10.13039\/501100008982","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100008982","name":"National Science Foundation","doi-asserted-by":"publisher","award":["NSF IIS 15-46331"],"award-info":[{"award-number":["NSF IIS 15-46331"]}],"id":[{"id":"10.13039\/501100008982","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100008982","name":"National Science Foundation","doi-asserted-by":"publisher","award":["NSF DMS 16-13261"],"award-info":[{"award-number":["NSF DMS 16-13261"]}],"id":[{"id":"10.13039\/501100008982","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100005562","name":"North Carolina Biotechnology Center","doi-asserted-by":"crossref","award":["2016-IDG-1013"],"award-info":[{"award-number":["2016-IDG-1013"]}],"id":[{"id":"10.13039\/100005562","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/100005562","name":"North Carolina Biotechnology Center","doi-asserted-by":"crossref","award":["2020-IIG-2109"],"award-info":[{"award-number":["2020-IIG-2109"]}],"id":[{"id":"10.13039\/100005562","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Duke Forge health data science center"}],"content-domain":{"domain":["www.ploscompbiol.org"],"crossmark-restriction":false},"short-container-title":["PLoS Comput Biol"],"abstract":"<jats:p>Concerns have been raised about the use of relative abundance data derived from next generation sequencing as a proxy for absolute abundances. For example, in the differential abundance setting, compositional effects in relative abundance data may give rise to spurious differences (false positives) when considered from the absolute perspective. In practice however, relative abundances are often transformed by renormalization strategies intended to compensate for these effects and the scope of the practical problem remains unclear. We used simulated data to explore the consistency of differential abundance calling on renormalized relative abundances versus absolute abundances and find that, while overall consistency is high, with a median sensitivity (true positive rates) of 0.91 and specificity (1\u2014false positive rates) of 0.89, consistency can be much lower where there is widespread change in the abundance of features across conditions. We confirm these findings on a large number of real data sets drawn from 16S metabarcoding, expression array, bulk RNA-seq, and single-cell RNA-seq experiments, where data sets with the greatest change between experimental conditions are also those with the highest false positive rates. Finally, we evaluate the predictive utility of summary features of relative abundance data themselves. Estimates of sparsity and the prevalence of feature-level change in relative abundance data give reasonable predictions of discrepancy in differential abundance calling in simulated data and can provide useful bounds for worst-case outcomes in real data.<\/jats:p>","DOI":"10.1371\/journal.pcbi.1010284","type":"journal-article","created":{"date-parts":[[2022,7,11]],"date-time":"2022-07-11T13:35:51Z","timestamp":1657546551000},"page":"e1010284","update-policy":"https:\/\/doi.org\/10.1371\/journal.pcbi.corrections_policy","source":"Crossref","is-referenced-by-count":23,"title":["The accuracy of absolute differential abundance analysis from relative count data"],"prefix":"10.1371","volume":"18","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-6541-9393","authenticated-orcid":true,"given":"Kimberly E.","family":"Roche","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6715-3920","authenticated-orcid":true,"given":"Sayan","family":"Mukherjee","sequence":"additional","affiliation":[]}],"member":"340","published-online":{"date-parts":[[2022,7,11]]},"reference":[{"key":"pcbi.1010284.ref001","doi-asserted-by":"crossref","first-page":"57","DOI":"10.3389\/fmicb.2017.02224","article-title":"Microbiome Datasets Are Compositional: And This Is Not Optional","volume":"8","author":"GB Gloor","year":"2017","journal-title":"Front Microbiol"},{"issue":"5","key":"pcbi.1010284.ref002","doi-asserted-by":"crossref","first-page":"776","DOI":"10.1093\/bib\/bbx008","article-title":"Selecting between-sample RNA-Seq normalization methods from the perspective of their assumptions","volume":"19","author":"C Evans","year":"2018","journal-title":"Brief Bioinform"},{"issue":"1","key":"pcbi.1010284.ref003","doi-asserted-by":"crossref","first-page":"4667","DOI":"10.1038\/s41467-019-12266-7","article-title":"A systematic evaluation of single cell RNA-seq analysis pipelines","volume":"10","author":"B Vieth","year":"2019","journal-title":"Nat Commun"},{"key":"pcbi.1010284.ref004","doi-asserted-by":"crossref","first-page":"534","DOI":"10.1093\/gbe\/evq038","article-title":"Quantifying whole transcriptome size, a prerequisite for understanding transcriptome evolution across species: an example from a plant allopolyploid","volume":"2","author":"JE Coate","year":"2010","journal-title":"Genome Biol Evol"},{"issue":"1","key":"pcbi.1010284.ref005","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1007\/s00412-014-0496-3","article-title":"Variation in transcriptome size: are we getting the message?","volume":"124","author":"JE Coate","year":"2015","journal-title":"Chromosoma"},{"issue":"1","key":"pcbi.1010284.ref006","doi-asserted-by":"crossref","first-page":"68","DOI":"10.1016\/j.cell.2012.08.033","article-title":"c-Myc is a universal amplifier of expressed genes in lymphocytes and embryonic stem cells","volume":"151","author":"Z Nie","year":"2012","journal-title":"Cell"},{"issue":"1","key":"pcbi.1010284.ref007","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1016\/j.cell.2012.08.026","article-title":"Transcriptional amplification in tumor cells with elevated c-Myc","volume":"151","author":"CY Lin","year":"2012","journal-title":"Cell"},{"issue":"3","key":"pcbi.1010284.ref008","doi-asserted-by":"crossref","first-page":"476","DOI":"10.1016\/j.cell.2012.10.012","article-title":"Revisiting global gene expression analysis","volume":"151","author":"J Lov\u00e9n","year":"2012","journal-title":"Cell"},{"issue":"3","key":"pcbi.1010284.ref009","doi-asserted-by":"crossref","first-page":"e1006794","DOI":"10.1371\/journal.pcbi.1006794","article-title":"A complete statistical model for calibration of RNA-seq counts using external spike-ins and maximum likelihood theory","volume":"15","author":"R Athanasiadou","year":"2019","journal-title":"PLoS Comput Biol"},{"key":"pcbi.1010284.ref010","doi-asserted-by":"crossref","unstructured":"McGee WA, Pimentel H, Pachter L, Wu JY. Compositional data analysis is necessary for simulating and analyzing RNA-Seq data; 2019.","DOI":"10.1101\/564955"},{"issue":"1","key":"pcbi.1010284.ref011","doi-asserted-by":"crossref","first-page":"41","DOI":"10.1038\/nmeth.2694","article-title":"Quantitative assessment of single-cell RNA-sequencing methods","volume":"11","author":"AR Wu","year":"2014","journal-title":"Nat Methods"},{"issue":"3","key":"pcbi.1010284.ref012","doi-asserted-by":"crossref","first-page":"133","DOI":"10.1038\/nrg3833","article-title":"Computational and analytical challenges in single-cell transcriptomics","volume":"16","author":"O Stegle","year":"2015","journal-title":"Nat Rev Genet"},{"issue":"11","key":"pcbi.1010284.ref013","doi-asserted-by":"crossref","first-page":"1795","DOI":"10.1101\/gr.222877.117","article-title":"Assessing the reliability of spike-in normalization for analyses of single-cell RNA sequencing data","volume":"27","author":"ATL Lun","year":"2017","journal-title":"Genome Res"},{"issue":"4","key":"pcbi.1010284.ref014","doi-asserted-by":"crossref","first-page":"381","DOI":"10.1038\/nmeth.4220","article-title":"Power analysis of single-cell RNA-sequencing experiments","volume":"14","author":"V Svensson","year":"2017","journal-title":"Nat Methods"},{"issue":"4","key":"pcbi.1010284.ref015","doi-asserted-by":"crossref","first-page":"631","DOI":"10.1016\/j.molcel.2017.01.023","article-title":"Comparative Analysis of Single-Cell RNA Sequencing Methods","volume":"65","author":"C Ziegenhain","year":"2017","journal-title":"Mol Cell"},{"issue":"1","key":"pcbi.1010284.ref016","first-page":"1","article-title":"Batch effects and the effective design of single-cell gene expression studies","volume":"7","author":"PY Tung","year":"2017","journal-title":"Sci Rep"},{"issue":"9","key":"pcbi.1010284.ref017","doi-asserted-by":"crossref","first-page":"896","DOI":"10.1038\/nbt.2931","article-title":"Normalization of RNA-seq data using factor analysis of control genes or samples","volume":"32","author":"D Risso","year":"2014","journal-title":"Nat Biotechnol"},{"issue":"16","key":"pcbi.1010284.ref018","doi-asserted-by":"crossref","first-page":"2870","DOI":"10.1093\/bioinformatics\/bty175","article-title":"Understanding sequencing data as compositions: an outlook and review","volume":"34","author":"TP Quinn","year":"2018","journal-title":"Bioinformatics"},{"issue":"1","key":"pcbi.1010284.ref019","doi-asserted-by":"crossref","first-page":"210","DOI":"10.1093\/bib\/bbx104","article-title":"A broken promise: microbiome differential abundance methods do not control the false discovery rate","volume":"20","author":"S Hawinkel","year":"2019","journal-title":"Brief Bioinform"},{"issue":"1","key":"pcbi.1010284.ref020","doi-asserted-by":"crossref","first-page":"191","DOI":"10.1186\/s13059-020-02104-1","article-title":"Assessment of statistical methods from single cell, bulk RNA-seq, and metagenomics applied to microbiome data","volume":"21","author":"M Calgaro","year":"2020","journal-title":"Genome Biol"},{"issue":"7681","key":"pcbi.1010284.ref021","doi-asserted-by":"crossref","first-page":"507","DOI":"10.1038\/nature24460","article-title":"Quantitative microbiome profiling links gut community variation to microbial load","volume":"551","author":"D Vandeputte","year":"2017","journal-title":"Nature"},{"issue":"3","key":"pcbi.1010284.ref022","doi-asserted-by":"crossref","first-page":"279","DOI":"10.1023\/A:1023818214614","article-title":"Isometric Logratio Transformations for Compositional Data Analysis","volume":"35","author":"JJ Egozcue","year":"2003","journal-title":"Math Geol"},{"issue":"7","key":"pcbi.1010284.ref023","doi-asserted-by":"crossref","first-page":"e67019","DOI":"10.1371\/journal.pone.0067019","article-title":"ANOVA-like differential expression (ALDEx) analysis for mixed population RNA-Seq","volume":"8","author":"AD Fernandes","year":"2013","journal-title":"PLoS One"},{"issue":"1","key":"pcbi.1010284.ref024","doi-asserted-by":"crossref","first-page":"3514","DOI":"10.1038\/s41467-020-17041-7","article-title":"Analysis of compositions of microbiomes with bias correction","volume":"11","author":"H Lin","year":"2020","journal-title":"Nat Commun"},{"issue":"12","key":"pcbi.1010284.ref025","doi-asserted-by":"crossref","first-page":"550","DOI":"10.1186\/s13059-014-0550-8","article-title":"Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2","volume":"15","author":"MI Love","year":"2014","journal-title":"Genome Biol"},{"issue":"1","key":"pcbi.1010284.ref026","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1093\/bioinformatics\/btp616","article-title":"edgeR: a Bioconductor package for differential expression analysis of digital gene expression data","volume":"26","author":"MD Robinson","year":"2010","journal-title":"Bioinformatics"},{"key":"pcbi.1010284.ref027","first-page":"2122","article-title":"A step-by-step workflow for low-level analysis of single-cell RNA-seq data with Bioconductor","volume":"5","author":"ATL Lun","year":"2016","journal-title":"F1000Res"},{"issue":"11","key":"pcbi.1010284.ref028","doi-asserted-by":"crossref","first-page":"1826","DOI":"10.1038\/s41564-019-0483-9","article-title":"Quantitative microbiome profiling disentangles inflammation- and bile duct obstruction-associated microbiota alterations across PSC\/IBD diagnoses","volume":"4","author":"S Vieira-Silva","year":"2019","journal-title":"Nat Microbiol"},{"issue":"4","key":"pcbi.1010284.ref029","doi-asserted-by":"crossref","first-page":"385","DOI":"10.1016\/j.cels.2016.09.002","article-title":"A Single-Cell Transcriptome Atlas of the Human Pancreas","volume":"3","author":"MJ Muraro","year":"2016","journal-title":"Cell Syst"},{"issue":"7730","key":"pcbi.1010284.ref030","doi-asserted-by":"crossref","first-page":"197","DOI":"10.1038\/s41586-018-0657-2","article-title":"Gene expression variability across cells and species shapes innate immunity","volume":"563","author":"T Hagai","year":"2018","journal-title":"Nature"},{"key":"pcbi.1010284.ref031","doi-asserted-by":"crossref","first-page":"77","DOI":"10.1186\/s13059-016-0938-8","article-title":"CEL-Seq2: sensitive highly-multiplexed single-cell RNA-Seq","volume":"17","author":"T Hashimshony","year":"2016","journal-title":"Genome Biol"},{"issue":"6","key":"pcbi.1010284.ref032","doi-asserted-by":"crossref","first-page":"637","DOI":"10.1038\/nmeth.2930","article-title":"Validation of noise models for single-cell transcriptomics","volume":"11","author":"D Gr\u00fcn","year":"2014","journal-title":"Nat Methods"},{"issue":"1","key":"pcbi.1010284.ref033","doi-asserted-by":"crossref","first-page":"207","DOI":"10.1186\/s13059-018-1576-0","article-title":"Linking single-cell measurements of mass, growth rate, and gene expression","volume":"19","author":"RJ Kimmerling","year":"2018","journal-title":"Genome Biol"},{"issue":"7","key":"pcbi.1010284.ref034","doi-asserted-by":"crossref","first-page":"2035","DOI":"10.1007\/s00262-020-02840-0","article-title":"Comparative analysis of the tumor immune-microenvironment of primary and brain metastases of non-small-cell lung cancer reveals organ-specific and EGFR mutation-dependent unique immune landscape","volume":"70","author":"SG Song","year":"2021","journal-title":"Cancer Immunol Immunother"},{"issue":"1","key":"pcbi.1010284.ref035","doi-asserted-by":"crossref","first-page":"2590","DOI":"10.1038\/s41467-020-16224-6","article-title":"A quantitative sequencing framework for absolute abundance measurements of mucosal and lumenal microbial communities","volume":"11","author":"JT Barlow","year":"2020","journal-title":"Nat Commun"},{"issue":"6","key":"pcbi.1010284.ref036","doi-asserted-by":"crossref","first-page":"1627","DOI":"10.1016\/j.celrep.2019.01.041","article-title":"RNA-Seq Signatures Normalized by mRNA Abundance Allow Absolute Deconvolution of Human Immune Cell Types","volume":"26","author":"G Monaco","year":"2019","journal-title":"Cell Rep"},{"issue":"1","key":"pcbi.1010284.ref037","first-page":"1","article-title":"A rat RNA-Seq transcriptomic BodyMap across 11 organs and 4 developmental stages","volume":"5","author":"Y Yu","year":"2014","journal-title":"Nat Commun"},{"issue":"5","key":"pcbi.1010284.ref038","doi-asserted-by":"crossref","first-page":"1187","DOI":"10.1016\/j.cell.2015.04.044","article-title":"Droplet Barcoding for Single-Cell Transcriptomics Applied to Embryonic Stem Cells","volume":"161","author":"AM Klein","year":"2015","journal-title":"Cell"},{"issue":"3","key":"pcbi.1010284.ref039","doi-asserted-by":"crossref","first-page":"632","DOI":"10.1016\/j.celrep.2015.12.050","article-title":"Measuring Absolute RNA Copy Numbers at High Temporal Resolution Reveals Transcriptome Kinetics in Development","volume":"14","author":"NDL Owens","year":"2016","journal-title":"Cell Rep"},{"issue":"2","key":"pcbi.1010284.ref040","doi-asserted-by":"crossref","first-page":"339","DOI":"10.1016\/j.molcel.2015.03.005","article-title":"Single mammalian cells compensate for differences in cellular volume and DNA copy number through independent global transcriptional mechanisms","volume":"58","author":"O Padovan-Merhar","year":"2015","journal-title":"Mol Cell"},{"issue":"1","key":"pcbi.1010284.ref041","doi-asserted-by":"crossref","first-page":"154","DOI":"10.1038\/labinvest.3700208","article-title":"Normalization of gene expression measurements in tumor tissues: comparison of 13 endogenous control genes","volume":"85","author":"JB de Kok","year":"2005","journal-title":"Lab Invest"},{"issue":"1","key":"pcbi.1010284.ref042","doi-asserted-by":"crossref","first-page":"8716","DOI":"10.1038\/s41598-018-26707-8","article-title":"Validation of Common Housekeeping Genes as Reference for qPCR Gene Expression Analysis During iPS Reprogramming Process","volume":"8","author":"Y Panina","year":"2018","journal-title":"Sci Rep"},{"issue":"12","key":"pcbi.1010284.ref043","doi-asserted-by":"crossref","first-page":"e0225790","DOI":"10.1371\/journal.pone.0225790","article-title":"Housekeeping gene validation for RT-qPCR studies on synovial fibroblasts derived from healthy and osteoarthritic patients with focus on mechanical loading","volume":"14","author":"U Nazet","year":"2019","journal-title":"PLoS One"},{"issue":"6","key":"pcbi.1010284.ref044","doi-asserted-by":"crossref","first-page":"789","DOI":"10.1016\/j.chom.2019.05.005","article-title":"Daily Sampling Reveals Personalized Diet-Microbiome Associations in Humans","volume":"25","author":"AJ Johnson","year":"2019","journal-title":"Cell Host Microbe"},{"key":"pcbi.1010284.ref045","doi-asserted-by":"crossref","first-page":"2789","DOI":"10.1016\/j.csbj.2020.09.014","article-title":"Naught all zeros in sequence count data are the same","volume":"18","author":"JD Silverman","year":"2020","journal-title":"Comput Struct Biotechnol J"},{"issue":"6","key":"pcbi.1010284.ref046","doi-asserted-by":"crossref","first-page":"770","DOI":"10.1038\/s41588-021-00873-4","article-title":"Separating measurement and expression models clarifies confusion in single-cell RNA sequencing analysis","volume":"53","author":"A Sarkar","year":"2021","journal-title":"Nat Genet"},{"issue":"3","key":"pcbi.1010284.ref047","first-page":"18","article-title":"Classification and Regression by randomForest","volume":"2","author":"A Liaw","year":"2002","journal-title":"R News"},{"key":"pcbi.1010284.ref048","article-title":"Building predictive models in R using the caret package","author":"M Kuhn","year":"2008","journal-title":"J Stat Softw"}],"updated-by":[{"DOI":"10.1371\/journal.pcbi.1010284","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2022,7,21]],"date-time":"2022-07-21T00:00:00Z","timestamp":1658361600000}}],"container-title":["PLOS Computational Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1010284","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,7,21]],"date-time":"2022-07-21T14:07:57Z","timestamp":1658412477000},"score":1,"resource":{"primary":{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1010284"}},"subtitle":[],"editor":[{"given":"Jinyan","family":"Li","sequence":"first","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2022,7,11]]},"references-count":48,"journal-issue":{"issue":"7","published-online":{"date-parts":[[2022,7,11]]}},"URL":"https:\/\/doi.org\/10.1371\/journal.pcbi.1010284","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2021.12.06.471397","asserted-by":"object"}]},"ISSN":["1553-7358"],"issn-type":[{"value":"1553-7358","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,7,11]]}}}