{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,2]],"date-time":"2026-04-02T11:31:28Z","timestamp":1775129488880,"version":"3.50.1"},"reference-count":58,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2021,9,6]],"date-time":"2021-09-06T00:00:00Z","timestamp":1630886400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2021,9,6]],"date-time":"2021-09-06T00:00:00Z","timestamp":1630886400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100001659","name":"Deutsche Forschungsgemeinschaft","doi-asserted-by":"publisher","award":["BE5601\/4-1"],"award-info":[{"award-number":["BE5601\/4-1"]}],"id":[{"id":"10.13039\/501100001659","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001659","name":"Deutsche Forschungsgemeinschaft","doi-asserted-by":"publisher","award":["BE5601\/6-1"],"award-info":[{"award-number":["BE5601\/6-1"]}],"id":[{"id":"10.13039\/501100001659","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001659","name":"Deutsche Forschungsgemeinschaft","doi-asserted-by":"publisher","award":["EXC 2064 ML 390727645"],"award-info":[{"award-number":["EXC 2064 ML 390727645"]}],"id":[{"id":"10.13039\/501100001659","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100002347","name":"Bundesministerium f?r Bildung und Forschung","doi-asserted-by":"publisher","award":["01GQ1601"],"award-info":[{"award-number":["01GQ1601"]}],"id":[{"id":"10.13039\/501100002347","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100002347","name":"Bundesministerium f?r Bildung und Forschung","doi-asserted-by":"publisher","award":["01IS18039A"],"award-info":[{"award-number":["01IS18039A"]}],"id":[{"id":"10.13039\/501100002347","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000025","name":"National Institute of Mental Health","doi-asserted-by":"publisher","award":["1U19MH114830-01"],"award-info":[{"award-number":["1U19MH114830-01"]}],"id":[{"id":"10.13039\/100000025","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Genome Biol"],"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Background<\/jats:title>\n                    <jats:p>Standard preprocessing of single-cell RNA-seq UMI data includes normalization by sequencing depth to remove this technical variability, and nonlinear transformation to stabilize the variance across genes with different expression levels. Instead, two recent papers propose to use statistical count models for these tasks: Hafemeister and Satija (Genome Biol 20:296, 2019) recommend using Pearson residuals from negative binomial regression, while Townes et al. (Genome Biol 20:295, 2019) recommend fitting a generalized PCA model. Here, we investigate the connection between these approaches theoretically and empirically, and compare their effects on downstream processing.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>We show that the model of Hafemeister and Satija produces noisy parameter estimates because it is overspecified, which is why the original paper employs post hoc smoothing. When specified more parsimoniously, it has a simple analytic solution equivalent to the rank-one Poisson GLM-PCA of Townes et al. Further, our analysis indicates that per-gene overdispersion estimates in Hafemeister and Satija are biased, and that the data are in fact consistent with the overdispersion parameter being independent of gene expression. We then use negative control data without biological variability to estimate the technical overdispersion of UMI counts, and find that across several different experimental protocols, the data are close to Poisson and suggest very moderate overdispersion. Finally, we perform a benchmark to compare the performance of Pearson residuals, variance-stabilizing transformations, and GLM-PCA on scRNA-seq datasets with known ground truth.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Conclusions<\/jats:title>\n                    <jats:p>We demonstrate that analytic Pearson residuals strongly outperform other methods for identifying biologically variable genes, and capture more of the biologically meaningful variation when used for dimensionality reduction.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1186\/s13059-021-02451-7","type":"journal-article","created":{"date-parts":[[2021,9,6]],"date-time":"2021-09-06T04:04:39Z","timestamp":1630901079000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":146,"title":["Analytic Pearson residuals for normalization of single-cell RNA-seq UMI data"],"prefix":"10.1186","volume":"22","author":[{"given":"Jan","family":"Lause","sequence":"first","affiliation":[]},{"given":"Philipp","family":"Berens","sequence":"additional","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5639-7209","authenticated-orcid":false,"given":"Dmitry","family":"Kobak","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2021,9,6]]},"reference":[{"issue":"6","key":"2451_CR1","doi-asserted-by":"publisher","first-page":"e8746","DOI":"10.15252\/msb.20188746","volume":"15","author":"MD Luecken","year":"2019","unstructured":"Luecken MD, C Theis FJ. Current best practices in single-cell RNA-seq analysis: a tutorial. Mol Syst Biol. 2019; 15(6):e8746. https:\/\/doi.org\/10.15252\/msb.20188746.","journal-title":"Mol Syst Biol"},{"issue":"2","key":"2451_CR2","doi-asserted-by":"publisher","first-page":"137","DOI":"10.1038\/s41592-019-0654-x","volume":"17","author":"RA Amezquita","year":"2020","unstructured":"Amezquita RA, Lun AT, Becht E, Carey VJ, Carpp LN, Geistlinger L, Marini F, Rue-Albrecht K, Risso D, Soneson C, et al.Orchestrating single-cell analysis with Bioconductor. Nat Methods. 2020; 17(2):137\u201345. https:\/\/doi.org\/10.1038\/s41592-019-0654-x.","journal-title":"Nat Methods"},{"key":"2451_CR3","doi-asserted-by":"publisher","first-page":"296","DOI":"10.1186\/s13059-019-1874-1","volume":"20","author":"C Hafemeister","year":"2019","unstructured":"Hafemeister C, Satija R. Normalization and variance stabilization of single-cell RNA-seq data using regularized negative binomial regression. Genome Biol. 2019; 20:296. https:\/\/doi.org\/10.1186\/s13059-019-1874-1.","journal-title":"Genome Biol"},{"key":"2451_CR4","doi-asserted-by":"publisher","first-page":"295","DOI":"10.1186\/s13059-019-1861-6","volume":"20","author":"FW Townes","year":"2019","unstructured":"Townes FW, Hicks SC, Aryee MJ, Irizarry RA. Feature selection and dimension reduction for single-cell RNA-seq based on a multinomial model. Genome Biol. 2019; 20:295. https:\/\/doi.org\/10.1186\/s13059-019-1861-6.","journal-title":"Genome Biol"},{"issue":"1","key":"2451_CR5","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13059-017-1382-0","volume":"19","author":"FA Wolf","year":"2018","unstructured":"Wolf FA, Angerer P, Theis FJ. SCANPY: large-scale single-cell gene expression data analysis. Genome Biol. 2018; 19(1):1\u20135. https:\/\/doi.org\/10.1186\/s13059-017-1382-0.","journal-title":"Genome Biol"},{"issue":"12","key":"2451_CR6","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13059-014-0550-8","volume":"15","author":"MI Love","year":"2014","unstructured":"Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014; 15(12):1\u201321. https:\/\/doi.org\/10.1186\/s13059-014-0550-8.","journal-title":"Genome Biol"},{"issue":"3","key":"2451_CR7","doi-asserted-by":"publisher","first-page":"284","DOI":"10.1016\/j.cels.2018.06.011","volume":"7","author":"N Eling","year":"2018","unstructured":"Eling N, Richard AC, Richardson S, Marioni JC, Vallejos CA. Correcting the mean-variance dependency for differential variability testing using single-cell RNA sequencing data. Cell Syst. 2018; 7(3):284\u201394. https:\/\/doi.org\/10.1016\/j.cels.2018.06.011.","journal-title":"Cell Syst"},{"issue":"12","key":"2451_CR8","doi-asserted-by":"publisher","first-page":"1053","DOI":"10.1038\/s41592-018-0229-2","volume":"15","author":"R Lopez","year":"2018","unstructured":"Lopez R, Regier J, Cole MB, Jordan MI, Yosef N. Deep generative modeling for single-cell transcriptomics. Nat Methods. 2018; 15(12):1053\u20138. https:\/\/doi.org\/10.1038\/s41592-018-0229-2.","journal-title":"Nat Methods"},{"issue":"11","key":"2451_CR9","doi-asserted-by":"publisher","first-page":"3418","DOI":"10.1093\/bioinformatics\/btaa169","volume":"36","author":"V Svensson","year":"2020","unstructured":"Svensson V, Gayoso A, Yosef N, Pachter L. Interpretable factor models of single-cell RNA-seq via variational autoencoders. Bioinformatics. 2020; 36(11):3418\u20133421. https:\/\/doi.org\/10.1093\/bioinformatics\/btaa169.","journal-title":"Bioinformatics"},{"issue":"6","key":"2451_CR10","doi-asserted-by":"publisher","first-page":"770","DOI":"10.1038\/s41588-021-00873-4","volume":"53","author":"A Sarkar","year":"2021","unstructured":"Sarkar A, Stephens M. Separating measurement and expression models clarifies confusion in single-cell RNA sequencing analysis. Nat Genet. 2021; 53(6):770\u20137. https:\/\/doi.org\/10.1038\/s41588-021-00873-4.","journal-title":"Nat Genet"},{"issue":"6","key":"2451_CR11","doi-asserted-by":"publisher","first-page":"637","DOI":"10.1038\/nmeth.2930","volume":"11","author":"D Gr\u00fcn","year":"2014","unstructured":"Gr\u00fcn D, Kester L, Van Oudenaarden A. Validation of noise models for single-cell transcriptomics. Nat Methods. 2014; 11(6):637\u201340. https:\/\/doi.org\/10.1038\/nmeth.2930.","journal-title":"Nat Methods"},{"issue":"2","key":"2451_CR12","doi-asserted-by":"publisher","first-page":"147","DOI":"10.1038\/s41587-019-0379-5","volume":"38","author":"V Svensson","year":"2020","unstructured":"Svensson V. Droplet scRNA-seq is not zero-inflated. Nat Biotechnol. 2020; 38(2):147\u201350. https:\/\/doi.org\/10.1038\/s41587-019-0379-5.","journal-title":"Nat Biotechnol"},{"key":"2451_CR13","volume-title":"Foundations of linear and generalized linear models","author":"A Agresti","year":"2015","unstructured":"Agresti A. Foundations of linear and generalized linear models. Hoboken: Wiley; 2015."},{"key":"2451_CR14","unstructured":"Culhane A. Correspondence analysis in R [Internet]. [place unknown]: GitHub; 2021 [updated 2021 Jul 15; cited 2021 Jul 31]. https:\/\/aedin.github.io\/PCAworkshop\/articles\/c_COA.html."},{"issue":"3","key":"2451_CR15","doi-asserted-by":"publisher","first-page":"340","DOI":"10.2307\/2347127","volume":"23","author":"MO Hill","year":"1974","unstructured":"Hill MO. Correspondence analysis: a neglected multivariate method. J R Stat Soc Ser C (Appl Stat). 1974; 23(3):340\u201354. https:\/\/doi.org\/10.2307\/2347127.","journal-title":"J R Stat Soc Ser C (Appl Stat)"},{"issue":"398","key":"2451_CR16","doi-asserted-by":"publisher","first-page":"437","DOI":"10.1080\/01621459.1987.10478446","volume":"82","author":"M Greenacre","year":"1987","unstructured":"Greenacre M, Hastie T. The geometric interpretation of correspondence analysis. J Am Stat Assoc. 1987; 82(398):437\u201347. https:\/\/doi.org\/10.1080\/01621459.1987.10478446.","journal-title":"J Am Stat Assoc"},{"key":"2451_CR17","doi-asserted-by":"publisher","unstructured":"Greenacre M. Correspondence analysis in practice: Chapman and Hall\/CRC; 2007. https:\/\/doi.org\/10.1201\/9781420011234.","DOI":"10.1201\/9781420011234"},{"key":"2451_CR18","doi-asserted-by":"publisher","unstructured":"Holmes S. Multivariate data analysis: the French way. In: Probability and statistics: essays in honor of David A. Freedman. Institute of Mathematical Statistics: 2008. p. 219\u201333. https:\/\/doi.org\/10.1214\/193940307000000455.","DOI":"10.1214\/193940307000000455"},{"key":"2451_CR19","doi-asserted-by":"publisher","first-page":"520","DOI":"10.1017\/S0305004100013517","volume":"31","author":"HO Hirschfeld","year":"1935","unstructured":"Hirschfeld HO. A connection between correlation and contingency. Math Proc Camb Philos Soc. 1935; 31:520\u20134. https:\/\/doi.org\/10.1017\/s0305004100013517.","journal-title":"Math Proc Camb Philos Soc"},{"issue":"1","key":"2451_CR20","doi-asserted-by":"publisher","first-page":"349","DOI":"10.1007\/BF01894768","volume":"33","author":"L Willson","year":"1986","unstructured":"Willson L, Folks J, Young J. Complete sufficiency and maximum likelihood estimation for the two-parameter negative binomial distribution. Metrika. 1986; 33(1):349\u201362. https:\/\/doi.org\/10.1007\/BF01894768.","journal-title":"Metrika"},{"issue":"1","key":"2451_CR21","doi-asserted-by":"publisher","first-page":"306","DOI":"10.2307\/2532055","volume":"45","author":"SJ Clark","year":"1989","unstructured":"Clark SJ, Perry JN. Estimation of the negative binomial parameter \u03ba by maximum quasi-likelihood. Biometrics. 1989; 45(1):306\u201316. https:\/\/doi.org\/10.2307\/2532055.","journal-title":"Biometrics"},{"issue":"4","key":"2451_CR22","doi-asserted-by":"publisher","first-page":"751","DOI":"10.1016\/j.aap.2006.02.001","volume":"38","author":"D Lord","year":"2006","unstructured":"Lord D. Modeling motor vehicle crashes using Poisson-gamma models: Examining the effects of low sample mean values and small sample size on the estimation of the fixed dispersion parameter. Accid Anal Prev. 2006; 38(4):751\u201366. https:\/\/doi.org\/10.1016\/j.aap.2006.02.001.","journal-title":"Accid Anal Prev"},{"issue":"5","key":"2451_CR23","doi-asserted-by":"publisher","first-page":"751","DOI":"10.1016\/j.ssci.2007.03.005","volume":"46","author":"D Lord","year":"2008","unstructured":"Lord D, Miranda-Moreno LF. Effects of low sample mean values and small sample size on the estimation of the fixed dispersion parameter of Poisson-gamma models for modeling motor vehicle crashes: a bayesian perspective. Saf Sci. 2008; 46(5):751\u201370. https:\/\/doi.org\/10.1016\/j.ssci.2007.03.005.","journal-title":"Saf Sci"},{"issue":"1","key":"2451_CR24","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/ncomms9687","volume":"6","author":"JK Kim","year":"2015","unstructured":"Kim JK, Kolodziejczyk AA, Ilicic T, Teichmann SA, Marioni JC. Characterizing noise structure in single-cell RNA-seq distinguishes genuine from technical stochastic allelic expression. Nat Commun. 2015; 6(1):1\u20139. https:\/\/doi.org\/10.1038\/ncomms9687.","journal-title":"Nat Commun"},{"issue":"28","key":"2451_CR25","doi-asserted-by":"publisher","first-page":"6437","DOI":"10.1073\/pnas.1721085115","volume":"115","author":"J Wang","year":"2018","unstructured":"Wang J, Huang M, Torre E, Dueck H, Shaffer S, Murray J, Raj A, Li M, Zhang NR. Gene expression distribution deconvolution in single-cell RNA sequencing. Proc Natl Acad Sci. 2018; 115(28):6437\u201346. https:\/\/doi.org\/10.1073\/pnas.1721085115.","journal-title":"Proc Natl Acad Sci"},{"key":"2451_CR26","doi-asserted-by":"publisher","unstructured":"Lopez-Delisle L, Delisle J-B. baredsc: Bayesian approach to retrieve expression distribution of single-cell. bioRxiv. 2021. https:\/\/doi.org\/10.1101\/2021.05.26.445740.","DOI":"10.1101\/2021.05.26.445740"},{"issue":"4","key":"2451_CR27","doi-asserted-by":"publisher","first-page":"803","DOI":"10.1093\/biomet\/75.4.803","volume":"75","author":"SK Bar-Lev","year":"1988","unstructured":"Bar-Lev SK, Enis P. On the classical choice of variance stabilizing transformations and an application for a Poisson variate. Biometrika. 1988; 75(4):803\u20134. https:\/\/doi.org\/10.1093\/biomet\/75.4.803.","journal-title":"Biometrika"},{"issue":"3\/4","key":"2451_CR28","doi-asserted-by":"publisher","first-page":"246","DOI":"10.2307\/2332343","volume":"35","author":"FJ Anscombe","year":"1948","unstructured":"Anscombe FJ. The transformation of Poisson, binomial and negative-binomial data. Biometrika. 1948; 35(3\/4):246\u201354. https:\/\/doi.org\/10.2307\/2332343.","journal-title":"Biometrika"},{"issue":"4","key":"2451_CR29","doi-asserted-by":"publisher","first-page":"607","DOI":"10.1214\/aoms\/1177729756","volume":"21","author":"MF Freeman","year":"1950","unstructured":"Freeman MF, Tukey JW. Transformations related to the angular and the square root. Ann Math Stat. 1950; 21(4):607\u201311. https:\/\/doi.org\/10.1214\/aoms\/1177729756.","journal-title":"Ann Math Stat"},{"key":"2451_CR30","doi-asserted-by":"publisher","unstructured":"Wagner F. Straightforward clustering of single-cell RNA-Seq data with t-SNE and DBSCAN. BioRxiv. 2019. https:\/\/doi.org\/10.1101\/770388.","DOI":"10.1101\/770388"},{"key":"2451_CR31","doi-asserted-by":"publisher","unstructured":"Wagner F. Monet: an open-source Python package for analyzing and integrating scRNA-Seq data using PCA-based latent spaces. bioRxiv. 2020. https:\/\/doi.org\/10.1101\/2020.06.08.140673.","DOI":"10.1101\/2020.06.08.140673"},{"issue":"1","key":"2451_CR32","doi-asserted-by":"publisher","first-page":"362","DOI":"10.1111\/biom.12728","volume":"74","author":"DI Warton","year":"2018","unstructured":"Warton DI. Why you cannot transform your way out of trouble for small counts. Biometrics. 2018; 74(1):362\u20138. https:\/\/doi.org\/10.1111\/biom.12728.","journal-title":"Biometrics"},{"issue":"5","key":"2451_CR33","doi-asserted-by":"publisher","first-page":"1202","DOI":"10.1016\/j.cell.2015.05.002","volume":"161","author":"EZ Macosko","year":"2015","unstructured":"Macosko EZ, Basu A, Satija R, Nemesh J, Shekhar K, Goldman M, Tirosh I, Bialas AR, Kamitaki N, Martersteck EM, et al.Highly parallel genome-wide expression profiling of individual cells using nanoliter droplets. Cell. 2015; 161(5):1202\u201314. https:\/\/doi.org\/10.1016\/j.cell.2015.05.002.","journal-title":"Cell"},{"issue":"5","key":"2451_CR34","doi-asserted-by":"publisher","first-page":"495","DOI":"10.1038\/nbt.3192","volume":"33","author":"R Satija","year":"2015","unstructured":"Satija R, Farrell JA, Gennert D, Schier AF, Regev A. Spatial reconstruction of single-cell gene expression data. Nat Biotechnol. 2015; 33(5):495\u2013502. https:\/\/doi.org\/10.1038\/nbt.3192.","journal-title":"Nat Biotechnol"},{"issue":"7","key":"2451_CR35","doi-asserted-by":"publisher","first-page":"1888","DOI":"10.1016\/j.cell.2019.05.031","volume":"177","author":"T Stuart","year":"2019","unstructured":"Stuart T, Butler A, Hoffman P, Hafemeister C, Papalexi E, Mauck III WM, Hao Y, Stoeckius M, Smibert P, Satija R. Comprehensive integration of single-cell data. Cell. 2019; 177(7):1888\u2013902. https:\/\/doi.org\/10.1016\/j.cell.2019.05.031.","journal-title":"Cell"},{"issue":"1","key":"2451_CR36","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/s41467-019-13056-x","volume":"10","author":"D Kobak","year":"2019","unstructured":"Kobak D, Berens P. The art of using t-SNE for single-cell transcriptomics. Nat Commun. 2019; 10(1):1\u201314. https:\/\/doi.org\/10.1038\/s41467-019-13056-x.","journal-title":"Nat Commun"},{"issue":"5","key":"2451_CR37","doi-asserted-by":"publisher","first-page":"1308","DOI":"10.1016\/j.cell.2016.07.054","volume":"166","author":"K Shekhar","year":"2016","unstructured":"Shekhar K, Lapan SW, Whitney IE, Tran NM, Macosko EZ, Kowalczyk M, Adiconis X, Levin JZ, Nemesh J, Goldman M, et al.Comprehensive classification of retinal bipolar neurons by single-cell transcriptomics. Cell. 2016; 166(5):1308\u201323. https:\/\/doi.org\/10.1016\/j.cell.2016.07.054.","journal-title":"Cell"},{"issue":"6","key":"2451_CR38","doi-asserted-by":"publisher","first-page":"1039","DOI":"10.1016\/j.neuron.2019.11.006","volume":"104","author":"NM Tran","year":"2019","unstructured":"Tran NM, Shekhar K, Whitney IE, Jacobi A, Benhar I, Hong G, Yan W, Adiconis X, Arnold ME, Lee JM, et al.Single-cell profiles of retinal ganglion cells differing in resilience to injury reveal neuroprotective genes. Neuron. 2019; 104(6):1039\u201355. https:\/\/doi.org\/10.1016%2Fj.neuron.2019.11.006.","journal-title":"Neuron"},{"issue":"7745","key":"2451_CR39","doi-asserted-by":"publisher","first-page":"496","DOI":"10.1038\/s41586-019-0969-x","volume":"566","author":"J Cao","year":"2019","unstructured":"Cao J, Spielmann M, Qiu X, Huang X, Ibrahim DM, Hill AJ, Zhang F, Mundlos S, Christiansen L, Steemers FJ, et al.The single-cell transcriptional landscape of mammalian organogenesis. Nature. 2019; 566(7745):496\u2013502. https:\/\/doi.org\/10.1038\/s41586-019-0969-x.","journal-title":"Nature"},{"key":"2451_CR40","unstructured":"B\u00f6hm JN, Berens P, Kobak D. A Unifying Perspective on Neighbor Embeddings along the Attraction-Repulsion Spectrum. arXiv. 2020. https:\/\/arxiv.org\/abs\/2007.08902."},{"key":"2451_CR41","doi-asserted-by":"publisher","unstructured":"Lun A. Overcoming systematic errors caused by log-transformation of normalized single-cell RNA sequencing data. BioRxiv. 2018. https:\/\/doi.org\/10.1101\/404962.","DOI":"10.1101\/404962"},{"key":"2451_CR42","doi-asserted-by":"publisher","first-page":"1141","DOI":"10.12688\/f1000research.15666.2","volume":"7","author":"A Du\u00f2","year":"2018","unstructured":"Du\u00f2 A, Robinson MD, Soneson C. A systematic performance evaluation of clustering methods for single-cell RNA-seq data. F1000Research. 2018; 7:1141. https:\/\/doi.org\/10.12688\/f1000research.15666.2.","journal-title":"F1000Research"},{"issue":"1","key":"2451_CR43","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1038\/ncomms14049","volume":"8","author":"GX Zheng","year":"2017","unstructured":"Zheng GX, Terry JM, Belgrader P, Ryvkin P, Bent ZW, Wilson R, Ziraldo SB, Wheeler TD, McDermott GP, Zhu J, et al.Massively parallel digital transcriptional profiling of single cells. Nat Commun. 2017; 8(1):1\u201312. https:\/\/doi.org\/10.1038\/ncomms14049.","journal-title":"Nat Commun"},{"issue":"1","key":"2451_CR44","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13059-016-0947-7","volume":"17","author":"AT Lun","year":"2016","unstructured":"Lun AT, Bach K, Marioni JC. Pooling across cells to normalize single-cell RNA sequencing data with many zero counts. Genome Biol. 2016; 17(1):1\u201314. https:\/\/doi.org\/10.1186\/s13059-016-0947-7.","journal-title":"Genome Biol"},{"key":"2451_CR45","unstructured":"Irizarry R. smallcount: R package with methods for small counts stored in a sparse matrix.[Internet]. [place unknown]: GitHub; 2021 [updated 2021 Jul 13; cited 2021 Jul 31]. https:\/\/github.com\/rafalab\/smallcount."},{"key":"2451_CR46","doi-asserted-by":"publisher","unstructured":"Choudhary S, Satija R. Comparison and evaluation of statistical error models for scRNA-seq. bioRxiv. 2021. https:\/\/doi.org\/10.1101\/2021.07.07.451498.","DOI":"10.1101\/2021.07.07.451498"},{"issue":"24","key":"2451_CR47","doi-asserted-by":"publisher","first-page":"5701","DOI":"10.1093\/bioinformatics\/btaa1009","volume":"36","author":"C Ahlmann-Eltze","year":"2020","unstructured":"Ahlmann-Eltze C, Huber W. glmGamPoi: fitting Gamma-Poisson generalized linear models on single cell count data. Bioinformatics. 2020; 36(24):5701\u20132. https:\/\/doi.org\/10.1093\/bioinformatics\/btaa1009.","journal-title":"Bioinformatics"},{"issue":"10","key":"2451_CR48","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/gb-2010-11-10-r106","volume":"11","author":"S Anders","year":"2010","unstructured":"Anders S, Huber W. Differential expression analysis for sequence count data. Genome Biol. 2010; 11(10):1\u201312. https:\/\/doi.org\/10.1038\/npre.2010.4282.1.","journal-title":"Genome Biol"},{"issue":"2","key":"2451_CR49","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/gb-2014-15-2-r29","volume":"15","author":"CW Law","year":"2014","unstructured":"Law CW, Chen Y, Shi W, Smyth GK. voom: Precision weights unlock linear model analysis tools for RNA-seq read counts. Genome Biol. 2014; 15(2):1\u201317. https:\/\/doi.org\/10.1186\/gb-2014-15-2-r29.","journal-title":"Genome Biol"},{"key":"2451_CR50","doi-asserted-by":"publisher","first-page":"1008","DOI":"10.1038\/s41587-021-00875-x","volume":"39","author":"J Breda","year":"2021","unstructured":"Breda J, Zavolan M, van Nimwegen E. Bayesian inference of gene expression states from single-cell RNA-seq data. Nat Biotechnol. 2021; 39:1008\u201316. https:\/\/doi.org\/10.1038\/s41587-021-00875-x.","journal-title":"Nat Biotechnol"},{"key":"2451_CR51","doi-asserted-by":"publisher","unstructured":"Ahlmann-Eltze C, Huber W. Transformation and preprocessing of single-cell RNA-seq data. bioRxiv. 2021. https:\/\/doi.org\/10.1101\/2021.06.24.449781.","DOI":"10.1101\/2021.06.24.449781"},{"issue":"4","key":"2451_CR52","doi-asserted-by":"publisher","first-page":"381","DOI":"10.1038\/nmeth.4220","volume":"14","author":"V Svensson","year":"2017","unstructured":"Svensson V, Natarajan KN, Ly L-H, Miragaia RJ, Labalette C, Macaulay IC, Cvejic A, Teichmann SA. Power analysis of single-cell rna-sequencing experiments. Nat Methods. 2017; 14(4):381\u20137. https:\/\/doi.org\/10.1038\/nmeth.4220.","journal-title":"Nat Methods"},{"issue":"5","key":"2451_CR53","doi-asserted-by":"publisher","first-page":"1187","DOI":"10.1016\/j.cell.2015.04.044","volume":"161","author":"AM Klein","year":"2015","unstructured":"Klein AM, Mazutis L, Akartuna I, Tallapragada N, Veres A, Li V, Peshkin L, Weitz DA, Kirschner MW. Droplet barcoding for single-cell transcriptomics applied to embryonic stem cells. Cell. 2015; 161(5):1187\u2013201. https:\/\/doi.org\/10.1016\/j.cell.2015.04.044.","journal-title":"Cell"},{"issue":"5","key":"2451_CR54","doi-asserted-by":"publisher","first-page":"1091","DOI":"10.1016\/j.cell.2018.02.001","volume":"172","author":"X Han","year":"2018","unstructured":"Han X, Wang R, Zhou Y, Fei L, Sun H, Lai S, Saadatpour A, Zhou Z, Chen H, Ye F, et al.Mapping the mouse cell atlas by microwell-seq. Cell. 2018; 172(5):1091\u2013107. https:\/\/doi.org\/10.1016\/j.cell.2018.02.001.","journal-title":"Cell"},{"key":"2451_CR55","first-page":"2825","volume":"12","author":"F Pedregosa","year":"2011","unstructured":"Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, et al.Scikit-learn: machine learning in Python. J Mach Learn Res. 2011; 12:2825\u201330. http:\/\/jmlr.org\/papers\/v12\/pedregosa11a.html.","journal-title":"J Mach Learn Res"},{"issue":"3","key":"2451_CR56","doi-asserted-by":"publisher","first-page":"243","DOI":"10.1038\/s41592-018-0308-4","volume":"16","author":"GC Linderman","year":"2019","unstructured":"Linderman GC, Rachh M, Hoskins JG, Steinerberger S, Kluger Y. Fast interpolation-based t-SNE for improved visualization of single-cell RNA-seq data. Nat Methods. 2019; 16(3):243\u20135. https:\/\/doi.org\/10.1038\/s41592-018-0308-4.","journal-title":"Nat Methods"},{"key":"2451_CR57","doi-asserted-by":"crossref","unstructured":"Lause J. Analytic Pearson residuals for normalization of single-cell RNA-seq UMI data. [Internet]. [place unknown]: GitHub; 2021 [updated 2021 Jul 31; cited 2021 Jul 31]. https:\/\/github.com\/berenslab\/umi-normalization.","DOI":"10.1101\/2020.12.01.405886"},{"key":"2451_CR58","doi-asserted-by":"publisher","unstructured":"Lause J. Analytic Pearson residuals for normalization of single-cell RNA-seq UMI data. Zenodo. 2021. https:\/\/doi.org\/10.5281\/zenodo.5150534.","DOI":"10.5281\/zenodo.5150534"}],"container-title":["Genome Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13059-021-02451-7.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13059-021-02451-7\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13059-021-02451-7.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,2,6]],"date-time":"2023-02-06T00:12:06Z","timestamp":1675642326000},"score":1,"resource":{"primary":{"URL":"https:\/\/genomebiology.biomedcentral.com\/articles\/10.1186\/s13059-021-02451-7"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,9,6]]},"references-count":58,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2021,12]]}},"alternative-id":["2451"],"URL":"https:\/\/doi.org\/10.1186\/s13059-021-02451-7","relation":{"has-review":[{"id-type":"doi","id":"10.3410\/f.740762795.793590567","asserted-by":"object"}],"has-preprint":[{"id-type":"doi","id":"10.1101\/2020.12.01.405886","asserted-by":"object"}]},"ISSN":["1474-760X"],"issn-type":[{"value":"1474-760X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,9,6]]},"assertion":[{"value":"1 December 2020","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"2 August 2021","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"6 September 2021","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"Not applicable.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"The authors declare that they have no competing interests.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"258"}}