{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,18]],"date-time":"2026-01-18T22:22:01Z","timestamp":1768774921580,"version":"3.49.0"},"reference-count":26,"publisher":"Oxford University Press (OUP)","issue":"1","license":[{"start":{"date-parts":[[2025,11,23]],"date-time":"2025-11-23T00:00:00Z","timestamp":1763856000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2026,1,2]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>In single-cell studies, a common question is whether there is a change in cell composition between conditions. While ideally, one needs absolute cell counts (number of cells per volumetric unit in a sample) to address these questions, current experimentation typically obtains cell counts that only carry relative information. Therefore, one should account for the compositional nature of cell count data in the statistical analysis. While recently developed methods address compositionality using compositional transformations together with a bias correction, they do not account for the uncertainty involved in estimation of the bias term, nor do they accommodate the mean\u2013variance structure of the counts.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>Here, we introduce a statistical method, voomCLR, for assessing differences in cell composition between conditions incorporating both uncertainty on the bias term as well as acknowledging the mean\u2013variance structure of the transformed data, by leveraging developments from the differential gene expression literature. We demonstrate the performances of voomCLR, illustrate the benefit of all components, and compare the methodology to the state-of-the-art on simulated and real single-cell gene expression datasets.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>voomCLR software is available as an open-source R package on GitHub at https:\/\/github.com\/johnsonandjohnson\/voomCLR.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btaf637","type":"journal-article","created":{"date-parts":[[2025,11,21]],"date-time":"2025-11-21T13:19:59Z","timestamp":1763731199000},"source":"Crossref","is-referenced-by-count":0,"title":["Assessing differential cell composition in single-cell studies using voomCLR"],"prefix":"10.1093","volume":"42","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-7773-0621","authenticated-orcid":false,"given":"Alemu","family":"Takele Assefa","sequence":"first","affiliation":[{"name":"Statistics and Decision Sciences, Johnson & Johnson , Beerse 2340,","place":["Belgium"]}]},{"given":"Bie","family":"Verbist","sequence":"additional","affiliation":[{"name":"Statistics and Decision Sciences, Johnson & Johnson , Beerse 2340,","place":["Belgium"]}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1833-8478","authenticated-orcid":false,"given":"Koen","family":"Van den Berge","sequence":"additional","affiliation":[{"name":"Statistics and Decision Sciences, Johnson & Johnson , Beerse 2340,","place":["Belgium"]}]}],"member":"286","published-online":{"date-parts":[[2025,11,23]]},"reference":[{"key":"2026011804562249100_btaf637-B1","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1111\/j.2517-6161.1982.tb01195.x","article-title":"The statistical analysis of compositional data","volume":"44","author":"Aitchison","year":"1982","journal-title":"J R Stat Soc Ser B (Methodol)"},{"key":"2026011804562249100_btaf637-B2","volume-title":"The Statistical Analysis of Compositional Data. Monographs on Statistics and Applied Probability (MSAP)","author":"Aitchison"},{"key":"2026011804562249100_btaf637-B3","doi-asserted-by":"publisher","first-page":"137","DOI":"10.1038\/s41592-019-0654-x","article-title":"Orchestrating single-cell analysis with Bioconductor","volume":"17","author":"Amezquita","year":"2020","journal-title":"Nat Methods"},{"key":"2026011804562249100_btaf637-B4","doi-asserted-by":"crossref","first-page":"378","DOI":"10.32614\/RJ-2017-066","article-title":"glmmTMB balances speed and flexibility among packages for zero-inflated generalized linear mixed modeling","volume":"9","author":"Brooks","year":"2017","journal-title":"R J"},{"key":"2026011804562249100_btaf637-B5","doi-asserted-by":"publisher","first-page":"6876","DOI":"10.1038\/s41467-021-27150-6","article-title":"scCODA is a Bayesian model for compositional single-cell data analysis","volume":"12","author":"B\u00fcttner","year":"2021","journal-title":"Nat Commun"},{"key":"2026011804562249100_btaf637-B6","doi-asserted-by":"publisher","first-page":"245","DOI":"10.1038\/s41587-021-01033-z","article-title":"Differential abundance testing on single-cell data using k-nearest neighbor graphs","volume":"40","author":"Dann","year":"2022","journal-title":"Nat Biotechnol"},{"key":"2026011804562249100_btaf637-B7","doi-asserted-by":"publisher","first-page":"440","DOI":"10.1186\/s12859-023-05569-6","article-title":"A cell abundance analysis based on efficient PAM clustering for a better understanding of the dynamics of endometrial remodelling","volume":"24","author":"Domingo","year":"2023","journal-title":"BMC Bioinformatics"},{"key":"2026011804562249100_btaf637-B8","doi-asserted-by":"publisher","first-page":"160","DOI":"10.1214\/aoms\/1177732594","article-title":"The limiting distributions of certain statistics","volume":"6","author":"Doob","year":"1935","journal-title":"Ann Math Statist"},{"key":"2026011804562249100_btaf637-B9","first-page":"129","article-title":"A note on the delta-method for finding variance formulae","volume":"1","author":"Dorfman","year":"1938","journal-title":"Biometric Bulletin"},{"key":"2026011804562249100_btaf637-B10","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1214\/aos\/1176344552","article-title":"Bootstrap methods: another look at the jackknife","volume":"7","author":"Efron","year":"1979","journal-title":"Ann Statist"},{"key":"2026011804562249100_btaf637-B11","doi-asserted-by":"publisher","first-page":"327","DOI":"10.1038\/s41592-019-0355-5","article-title":"Cell composition analysis of bulk genomics using single-cell data","volume":"16","author":"Frishberg","year":"2019","journal-title":"Nat Methods"},{"key":"2026011804562249100_btaf637-B12","doi-asserted-by":"publisher","first-page":"btad668","DOI":"10.1093\/bioinformatics\/btad668","article-title":"Compositional analysis of microbiome data using the linear decomposition model (LDM)","volume":"39","author":"Hu","year":"2023","journal-title":"Bioinformatics"},{"key":"2026011804562249100_btaf637-B13","doi-asserted-by":"publisher","first-page":"R29","DOI":"10.1186\/gb-2014-15-2-r29","article-title":"Voom: precision weights unlock linear model analysis tools for RNA-seq read counts","volume":"15","author":"Law","year":"2014","journal-title":"Genome Biol"},{"key":"2026011804562249100_btaf637-B14","doi-asserted-by":"publisher","first-page":"151","DOI":"10.1186\/s13059-023-02980-3","article-title":"DCATS: differential composition analysis for flexible single-cell experimental designs","volume":"24","author":"Lin","year":"2023","journal-title":"Genome Biol"},{"key":"2026011804562249100_btaf637-B15","doi-asserted-by":"publisher","first-page":"550","DOI":"10.1186\/s13059-014-0550-8","article-title":"Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2","volume":"15","author":"Love","year":"2014","journal-title":"Genome Biol"},{"key":"2026011804562249100_btaf637-B16","doi-asserted-by":"publisher","first-page":"3244","DOI":"10.1038\/s41467-023-39017-z","article-title":"Cell-attribute aware community detection improves differential abundance testing from single-cell RNA-Seq data","volume":"14","author":"Maity","year":"2023","journal-title":"Nat Commun"},{"key":"2026011804562249100_btaf637-B17","doi-asserted-by":"publisher","first-page":"e2203828120","DOI":"10.1073\/pnas.2203828120","article-title":"sccomp: robust differential composition and variability analysis for single-cell data","volume":"120","author":"Mangiola","year":"2023","journal-title":"Proc Natl Acad Sci USA"},{"key":"2026011804562249100_btaf637-B18","doi-asserted-by":"publisher","first-page":"eabf1970","DOI":"10.1126\/science.abf1970","article-title":"Single-cell RNA-seq reveals cell type-specific molecular and genetic associations to lupus","volume":"376","author":"Perez","year":"2022","journal-title":"Science (New York, N.Y.)"},{"key":"2026011804562249100_btaf637-B19","doi-asserted-by":"publisher","first-page":"4720","DOI":"10.1093\/bioinformatics\/btac582","article-title":"propeller: testing for differences in cell type proportions in single cell data","volume":"38","author":"Phipson","year":"2022","journal-title":"Bioinformatics"},{"key":"2026011804562249100_btaf637-B20","doi-asserted-by":"publisher","first-page":"2870","DOI":"10.1093\/bioinformatics\/bty175","article-title":"Understanding sequencing data as compositions: an outlook and review","volume":"34","author":"Quinn","year":"2018","journal-title":"Bioinformatics"},{"key":"2026011804562249100_btaf637-B21","doi-asserted-by":"publisher","first-page":"139","DOI":"10.1093\/bioinformatics\/btp616","article-title":"edgeR: a Bioconductor package for differential expression analysis of digital gene expression data","volume":"26","author":"Robinson","year":"2010","journal-title":"Bioinformatics"},{"key":"2026011804562249100_btaf637-B22","doi-asserted-by":"publisher","first-page":"e21887","DOI":"10.7554\/eLife.21887","article-title":"A phylogenetic transform enhances analysis of compositional microbiota data","volume":"6","author":"Silverman","year":"2017","journal-title":"eLife"},{"key":"2026011804562249100_btaf637-B23","doi-asserted-by":"publisher","first-page":"Article3","DOI":"10.2202\/1544-6115.1027","article-title":"Linear models and empirical Bayes methods for assessing differential expression in microarray experiments","volume":"3","author":"Smyth","year":"2004","journal-title":"Stat Appl Genet Mol Biol"},{"key":"2026011804562249100_btaf637-B24","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s42003-019-0415-5","article-title":"diffcyt: differential discovery in high-dimensional cytometry via high-resolution clustering","volume":"2","author":"Weber","year":"2019","journal-title":"Commun Biol"},{"key":"2026011804562249100_btaf637-B25","doi-asserted-by":"publisher","first-page":"e2100293118","DOI":"10.1073\/pnas.2100293118","article-title":"Detection of differentially abundant cell subpopulations in scRNA-seq data","volume":"118","author":"Zhao","year":"2021","journal-title":"Proc Natl Acad Sci USA"},{"key":"2026011804562249100_btaf637-B26","doi-asserted-by":"publisher","first-page":"95","DOI":"10.1186\/s13059-022-02655-5","article-title":"LinDA: linear models for differential abundance analysis of microbiome compositional data","volume":"23","author":"Zhou","year":"2022","journal-title":"Genome Biol"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btaf637\/65484132\/btaf637.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/42\/1\/btaf637\/65484132\/btaf637.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/42\/1\/btaf637\/65484132\/btaf637.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,1,18]],"date-time":"2026-01-18T09:56:29Z","timestamp":1768730189000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btaf637\/8340593"}},"subtitle":[],"editor":[{"given":"Pier Luigi","family":"Martelli","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2025,11,23]]},"references-count":26,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2026,1,2]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btaf637","relation":{},"ISSN":["1367-4811"],"issn-type":[{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2026,1]]},"published":{"date-parts":[[2025,11,23]]},"article-number":"btaf637"}}