{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,12]],"date-time":"2026-04-12T06:44:52Z","timestamp":1775976292591,"version":"3.50.1"},"reference-count":32,"publisher":"Oxford University Press (OUP)","issue":"19","license":[{"start":{"date-parts":[[2021,4,1]],"date-time":"2021-04-01T00:00:00Z","timestamp":1617235200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001656","name":"Helmholtz Association","doi-asserted-by":"publisher","award":["VH-NG-1010"],"award-info":[{"award-number":["VH-NG-1010"]}],"id":[{"id":"10.13039\/501100001656","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,10,11]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:sec><jats:title>Motivation<\/jats:title><jats:p>Single-cell gene expression distributions measured by single-cell RNA-sequencing (scRNA-seq) often display complex differences between samples. These differences are biologically meaningful but cannot be identified using standard methods for differential expression.<\/jats:p><\/jats:sec><jats:sec><jats:title>Results<\/jats:title><jats:p>Here, we derive and implement a flexible and fast differential distribution testing procedure based on the 2-Wasserstein distance. Our method is able to detect any type of difference in distribution between conditions. To interpret distributional differences, we decompose the 2-Wasserstein distance into terms that capture the relative contribution of changes in mean, variance and shape to the overall difference. Finally, we derive mathematical generalizations that allow our method to be used in a broad range of disciplines other than scRNA-seq or bioinformatics.<\/jats:p><\/jats:sec><jats:sec><jats:title>Availability and implementation<\/jats:title><jats:p>Our methods are implemented in the R\/Bioconductor package waddR, which is freely available at https:\/\/github.com\/goncalves-lab\/waddR, along with documentation and examples.<\/jats:p><\/jats:sec><jats:sec><jats:title>Supplementary information<\/jats:title><jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p><\/jats:sec>","DOI":"10.1093\/bioinformatics\/btab226","type":"journal-article","created":{"date-parts":[[2021,3,31]],"date-time":"2021-03-31T11:45:12Z","timestamp":1617191112000},"page":"3204-3211","source":"Crossref","is-referenced-by-count":28,"title":["Fast identification of differential distributions in single-cell RNA-sequencing data with waddR"],"prefix":"10.1093","volume":"37","author":[{"given":"Roman","family":"Schefzik","sequence":"first","affiliation":[{"name":"German Cancer Research Center (DKFZ), Somatic Evolution and Early Detection Group, 69120 Heidelberg, Germany"}]},{"given":"Julian","family":"Flesch","sequence":"additional","affiliation":[{"name":"German Cancer Research Center (DKFZ), Somatic Evolution and Early Detection Group, 69120 Heidelberg, Germany"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3890-7658","authenticated-orcid":false,"given":"Angela","family":"Goncalves","sequence":"additional","affiliation":[{"name":"German Cancer Research Center (DKFZ), Somatic Evolution and Early Detection Group, 69120 Heidelberg, Germany"}]}],"member":"286","published-online":{"date-parts":[[2021,4,1]]},"reference":[{"key":"2023051608264628000_btab226-B1","doi-asserted-by":"crossref","first-page":"137","DOI":"10.1038\/s41592-019-0654-x","article-title":"Orchestrating single-cell analysis with Bioconductor","volume":"17","author":"Amezquita","year":"2020","journal-title":"Nat. Methods"},{"key":"2023051608264628000_btab226-B2","doi-asserted-by":"crossref","first-page":"63","DOI":"10.1186\/s13059-016-0927-y","article-title":"Design and computational analysis of single-cell RNA-sequencing experiments","volume":"17","author":"Bacher","year":"2016","journal-title":"Genome Biol"},{"key":"2023051608264628000_btab226-B3","doi-asserted-by":"crossref","first-page":"289","DOI":"10.1111\/j.2517-6161.1995.tb02031.x","article-title":"Controlling the false discovery rate: a practical and powerful approach to multiple testing","volume":"57","author":"Benjamini","year":"1995","journal-title":"J. R. Stat. Soc. Ser. B"},{"key":"2023051608264628000_btab226-B4","volume-title":"Statistical Postprocessing of Ensemble Forecasts","author":"Buizza","year":"2018"},{"key":"2023051608264628000_btab226-B5","doi-asserted-by":"crossref","first-page":"411","DOI":"10.1038\/nbt.4096","article-title":"Integrating single-cell transcriptomic data across different conditions, technologies, and species","volume":"36","author":"Butler","year":"2018","journal-title":"Nat. Biotechnol"},{"key":"2023051608264628000_btab226-B6","doi-asserted-by":"crossref","first-page":"315","DOI":"10.1016\/j.cels.2019.03.010","article-title":"Performance assessment and selection of normalization procedures for single-cell RNA-seq","volume":"8","author":"Cole","year":"2019","journal-title":"Cell Syst"},{"key":"2023051608264628000_btab226-B7","doi-asserted-by":"crossref","first-page":"62","DOI":"10.3389\/fgene.2017.00062","article-title":"Single-cell RNA-sequencing: assessment of differential expression analysis methods","volume":"8","author":"Dal Molin","year":"2017","journal-title":"Front. Genet"},{"key":"2023051608264628000_btab226-B8","doi-asserted-by":"crossref","first-page":"278","DOI":"10.1186\/s13059-015-0844-5","article-title":"MAST: a flexible statistical framework for assessing transcriptional changes and characterizing heterogeneity in single-cell RNA sequencing data","volume":"16","author":"Finak","year":"2015","journal-title":"Genome Biol"},{"key":"2023051608264628000_btab226-B9","doi-asserted-by":"crossref","first-page":"143","DOI":"10.1007\/s11634-014-0176-4","article-title":"Basic statistics for distributional symbolic variables: a new metric-based approach","volume":"9","author":"Irpino","year":"2015","journal-title":"Adv. Data Anal. Classif"},{"key":"2023051608264628000_btab226-B10","first-page":"735","article-title":"Comparison of methods to detect differentially expressed genes between single-cell populations","volume":"18","author":"Jaakkola","year":"2017","journal-title":"Brief. Bioinf"},{"key":"2023051608264628000_btab226-B11","doi-asserted-by":"crossref","first-page":"1397","DOI":"10.3389\/fimmu.2019.01397","article-title":"Features of human decidual NK cells in healthy pregnancy and during viral infection","volume":"10","author":"Jabrane-Ferrat","year":"2019","journal-title":"Front. Immunol"},{"key":"2023051608264628000_btab226-B12","doi-asserted-by":"crossref","first-page":"i161","DOI":"10.1093\/bioinformatics\/btp211","article-title":"Fewer permutations, more accurate p-values","volume":"25","author":"Knijnenburg","year":"2009","journal-title":"Bioinformatics"},{"key":"2023051608264628000_btab226-B13","doi-asserted-by":"crossref","first-page":"471","DOI":"10.1016\/j.stem.2015.09.011","article-title":"Single cell RNA-sequencing of pluripotent states unlocks modular transcriptional variation","volume":"17","author":"Kolodziejczyk","year":"2015","journal-title":"Cell Stem Cell"},{"key":"2023051608264628000_btab226-B14","doi-asserted-by":"crossref","first-page":"1201","DOI":"10.1084\/jem.20030305","article-title":"Human decidual natural killer cells are a unique NK cell subset with immunomodulatory potential","volume":"198","author":"Koopman","year":"2003","journal-title":"J. Exp. Med"},{"key":"2023051608264628000_btab226-B15","doi-asserted-by":"crossref","first-page":"222","DOI":"10.1186\/s13059-016-1077-y","article-title":"A statistical approach for identifying differential distributions in single-cell RNA-seq experiments","volume":"17","author":"Korthauer","year":"2016","journal-title":"Genome Biol"},{"key":"2023051608264628000_btab226-B16","doi-asserted-by":"crossref","first-page":"550","DOI":"10.1186\/s13059-014-0550-8","article-title":"Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2","volume":"15","author":"Love","year":"2014","journal-title":"Genome Biol"},{"key":"2023051608264628000_btab226-B17","doi-asserted-by":"crossref","first-page":"451","DOI":"10.1093\/biostatistics\/kxw055","article-title":"Overcoming confounding plate effects in differential expression analyses of single-cell RNA-seq data","volume":"18","author":"Lun","year":"2017","journal-title":"Biostatistics"},{"key":"2023051608264628000_btab226-B18","doi-asserted-by":"crossref","first-page":"75","DOI":"10.1186\/s13059-016-0947-7","article-title":"Pooling across cells to normalize single-cell RNA sequencing data with many zero counts","volume":"17","author":"Lun","year":"2016","journal-title":"Genome Biol"},{"key":"2023051608264628000_btab226-B19","doi-asserted-by":"crossref","first-page":"496","DOI":"10.1101\/gr.161034.113","article-title":"From single-cell to cell-pool transcriptomes: stochasticity in gene expression and RNA splicing","volume":"24","author":"Marinov","year":"2014","journal-title":"Genome Res"},{"key":"2023051608264628000_btab226-B20","doi-asserted-by":"crossref","first-page":"1433","DOI":"10.1126\/science.aah4115","article-title":"Aging increases cell-to-cell transcriptional variability upon immune stimulation","volume":"355","author":"Martinez-Jimenez","year":"2017","journal-title":"Science"},{"key":"2023051608264628000_btab226-B21","doi-asserted-by":"crossref","first-page":"2248","DOI":"10.1093\/bioinformatics\/btw138","article-title":"D3M: detection of differential distributions of methylation levels","volume":"32","author":"Matsui","year":"2016","journal-title":"Bioinformatics"},{"key":"2023051608264628000_btab226-B22","doi-asserted-by":"crossref","first-page":"243","DOI":"10.1007\/s40484-016-0089-7","article-title":"Differential expression analyses for single-cell RNA-Seq: old questions on new data","volume":"4","author":"Miao","year":"2016","journal-title":"Quant. Biol"},{"key":"2023051608264628000_btab226-B23","doi-asserted-by":"crossref","first-page":"3223","DOI":"10.1093\/bioinformatics\/bty332","article-title":"DEsingle for detecting three types of differential expression in single-cell RNA-seq data","volume":"34","author":"Miao","year":"2018","journal-title":"Bioinformatics"},{"key":"2023051608264628000_btab226-B24","doi-asserted-by":"crossref","first-page":"405","DOI":"10.1146\/annurev-statistics-030718-104938","article-title":"Statistical aspects of Wasserstein distances","volume":"6","author":"Panaretos","year":"2019","journal-title":"Annu. Rev. Stat. Appl"},{"key":"2023051608264628000_btab226-B25","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1093\/bioinformatics\/btp616","article-title":"edgeR: a Bioconductor package for differential expression analysis of digital gene expression data","volume":"26","author":"Robinson","year":"2010","journal-title":"Bioinformatics"},{"key":"2023051608264628000_btab226-B26","volume-title":"Encyclopaedia of Mathematics","author":"R\u00fcschendorf","year":"2001"},{"key":"2023051608264628000_btab226-B27","doi-asserted-by":"crossref","first-page":"255","DOI":"10.1038\/nmeth.4612","article-title":"Bias, robustness and scalability in single-cell differential expression analysis","volume":"15","author":"Soneson","year":"2018","journal-title":"Nat. Methods"},{"key":"2023051608264628000_btab226-B28","doi-asserted-by":"crossref","first-page":"39921","DOI":"10.1038\/srep39921","article-title":"Batch effects and the effective design of single-cell gene expression studies","volume":"7","author":"Tung","year":"2017","journal-title":"Sci. Rep"},{"key":"2023051608264628000_btab226-B29","doi-asserted-by":"crossref","first-page":"70","DOI":"10.1186\/s13059-016-0930-3","article-title":"Beyond comparisons of means: understanding changes in gene expression at the single-cell level","volume":"17","author":"Vallejos","year":"2016","journal-title":"Genome Biol"},{"key":"2023051608264628000_btab226-B30","doi-asserted-by":"crossref","first-page":"347","DOI":"10.1038\/s41586-018-0698-6","article-title":"Reconstructing the human first trimester fetal-maternal interface using single cell transcriptomics","volume":"563","author":"Vento-Tormo","year":"2018","journal-title":"Nature"},{"key":"2023051608264628000_btab226-B31","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1016\/j.ymeth.2018.04.017","article-title":"SigEMD: a powerful method for differential gene expression analysis in single-cell RNA sequencing data","volume":"145","author":"Wang","year":"2018","journal-title":"Methods"},{"key":"2023051608264628000_btab226-B32","doi-asserted-by":"crossref","first-page":"40","DOI":"10.1186\/s12859-019-2599-6","article-title":"Comparative analysis of differential gene expression analysis tools for single-cell RNA sequencing data","volume":"20","author":"Wang","year":"2019","journal-title":"BMC Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btab226\/38865503\/btab226.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/19\/3204\/50338109\/btab226.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/37\/19\/3204\/50338109\/btab226.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,8,27]],"date-time":"2024-08-27T06:17:54Z","timestamp":1724739474000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/37\/19\/3204\/6207964"}},"subtitle":[],"editor":[{"given":"Anthony","family":"Mathelier","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2021,4,1]]},"references-count":32,"journal-issue":{"issue":"19","published-print":{"date-parts":[[2021,10,11]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btab226","relation":{},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2021,10,1]]},"published":{"date-parts":[[2021,4,1]]}}}