{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,24]],"date-time":"2026-01-24T07:46:40Z","timestamp":1769240800320,"version":"3.49.0"},"reference-count":61,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2024,10,5]],"date-time":"2024-10-05T00:00:00Z","timestamp":1728086400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,10,5]],"date-time":"2024-10-05T00:00:00Z","timestamp":1728086400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"abstract":"<jats:title>Abstract<\/jats:title><jats:p>In the past two decades, genomics has advanced significantly, with single-cell RNA-sequencing (scRNA-seq) marking a pivotal milestone. ScRNA-seq provides unparalleled insights into cellular diversity and has spurred diverse studies across multiple conditions and samples, resulting in an influx of complex multidimensional genomics data. This highlights the need for robust methodologies capable of handling the complexity and multidimensionality of such genomics data. Furthermore, single-cell data grapples with sparsity due to issues like low capture efficiency and dropout effects. Tensor factorizations (TF) have emerged as powerful tools to unravel the complex patterns from multi-dimensional genomics data. Classic TF methods, based on maximum likelihood estimation, struggle with zero-inflated count data, while the inherent stochasticity in TFs further complicates result interpretation and reproducibility. Our paper introduces Zero Inflated Poisson Tensor Factorization (ZIPTF), a novel method for high-dimensional zero-inflated count data factorization. We also present Consensus-ZIPTF (C-ZIPTF), merging ZIPTF with a consensus-based approach to address stochasticity. We evaluate our proposed methods on synthetic zero-inflated count data, simulated scRNA-seq data, and real multi-sample multi-condition scRNA-seq datasets. ZIPTF consistently outperforms baseline matrix and tensor factorization methods, displaying enhanced reconstruction accuracy for zero-inflated data. When dealing with high probabilities of excess zeros, ZIPTF achieves up to <jats:inline-formula><jats:alternatives><jats:tex-math>$$2.4\\times$$<\/jats:tex-math><mml:math xmlns:mml=\"http:\/\/www.w3.org\/1998\/Math\/MathML\">\n                  <mml:mrow>\n                    <mml:mn>2.4<\/mml:mn>\n                    <mml:mo>\u00d7<\/mml:mo>\n                  <\/mml:mrow>\n                <\/mml:math><\/jats:alternatives><\/jats:inline-formula> better accuracy. Moreover, C-ZIPTF notably enhances the factorization\u2019s consistency. When tested on synthetic and real scRNA-seq data, ZIPTF and C-ZIPTF consistently uncover known and biologically meaningful gene expression programs. Access our data and code at: <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" ext-link-type=\"uri\" xlink:href=\"https:\/\/github.com\/klarman-cell-observatory\/scBTF\">https:\/\/github.com\/klarman-cell-observatory\/scBTF<\/jats:ext-link> and <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" ext-link-type=\"uri\" xlink:href=\"https:\/\/github.com\/klarman-cell-observatory\/scbtf_experiments\">https:\/\/github.com\/klarman-cell-observatory\/scbtf_experiments<\/jats:ext-link>.<\/jats:p>","DOI":"10.1186\/s12859-024-05886-4","type":"journal-article","created":{"date-parts":[[2024,10,5]],"date-time":"2024-10-05T12:01:55Z","timestamp":1728129715000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":6,"title":["C-ziptf: stable tensor factorization for zero-inflated multi-dimensional genomics data"],"prefix":"10.1186","volume":"25","author":[{"given":"Daniel","family":"Chafamo","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Vignesh","family":"Shanmugam","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Neriman","family":"Tokcan","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2024,10,5]]},"reference":[{"issue":"1","key":"5886_CR1","doi-asserted-by":"publisher","first-page":"36","DOI":"10.1038\/s41368-021-00146-0","volume":"13","author":"X Li","year":"2021","unstructured":"Li X, Wang C-Y. From bulk, single-cell to spatial RNA sequencing. Int J Oral Sci. 2021;13(1):36. https:\/\/doi.org\/10.1038\/s41368-021-00146-0.","journal-title":"Int J Oral Sci"},{"issue":"1","key":"5886_CR2","doi-asserted-by":"publisher","first-page":"4307","DOI":"10.1038\/s41467-020-18158-5","volume":"11","author":"S Aldridge","year":"2020","unstructured":"Aldridge S, Teichmann SA. Single cell transcriptomics comes of age. Nat Commun. 2020;11(1):4307. https:\/\/doi.org\/10.1038\/s41467-020-18158-5.","journal-title":"Nat Commun"},{"issue":"1","key":"5886_CR3","doi-asserted-by":"publisher","first-page":"4272","DOI":"10.1038\/s41467-023-39923-2","volume":"14","author":"Y Lin","year":"2023","unstructured":"Lin Y, Cao Y, Willie E, Patrick E, Yang JY. Atlas-scale single-cell multi-sample multi-condition data integration using scMerge2. Nat Commun. 2023;14(1):4272. https:\/\/doi.org\/10.1038\/s41467-023-39923-2.","journal-title":"Nat Commun"},{"issue":"1","key":"5886_CR4","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13059-020-1926-6","volume":"21","author":"D L\u00e4hnemann","year":"2020","unstructured":"L\u00e4hnemann D, K\u00f6ster J, Szczurek E, McCarthy DJ, Hicks SC, Robinson MD, Vallejos CA, Campbell KR, Beerenwinkel N, Mahfouz A, et al. Eleven grand challenges in single-cell data science. Genome Biol. 2020;21(1):1\u201335. https:\/\/doi.org\/10.1186\/s13059-020-1926-6.","journal-title":"Genome Biol"},{"issue":"1","key":"5886_CR5","doi-asserted-by":"publisher","first-page":"6077","DOI":"10.1038\/s41467-020-19894-4","volume":"11","author":"HL Crowell","year":"2020","unstructured":"Crowell HL, Soneson C, Germain P-L, Calini D, Collin L, Raposo C, Malhotra D, Robinson MD. Muscat detects subpopulation-specific state transitions from multi-sample multi-condition single-cell transcriptomics data. Nat Commun. 2020;11(1):6077. https:\/\/doi.org\/10.1038\/s41467-020-19894-4.","journal-title":"Nat Commun"},{"issue":"1","key":"5886_CR6","doi-asserted-by":"publisher","first-page":"5692","DOI":"10.1038\/s41467-021-25960-2","volume":"12","author":"JW Squair","year":"2021","unstructured":"Squair JW, Gautier M, Kathe C, Anderson MA, James ND, Hutson TH, Hudelle R, Qaiser T, Matson KJE, Barraud Q, Levine AJ, La Manno G, Skinnider MA, Courtine G. Confronting false discoveries in single-cell differential expression. Nat Commun. 2021;12(1):5692. https:\/\/doi.org\/10.1038\/s41467-021-25960-2.","journal-title":"Nat Commun"},{"key":"5886_CR7","doi-asserted-by":"publisher","DOI":"10.3389\/fgene.2021.682841","volume":"12","author":"I Jung","year":"2021","unstructured":"Jung I, Kim M, Rhee S, Lim S, Kim S. Monti: a multi-omics non-negative tensor decomposition framework for gene-level integrative analysis. Front Genet. 2021;12: 682841. https:\/\/doi.org\/10.3389\/fgene.2021.682841.","journal-title":"Front Genet"},{"key":"5886_CR8","doi-asserted-by":"publisher","unstructured":"Diaz D, Bollig-Fischer A, Kotov A. Tensor decomposition for sub-typing of complex diseases based on clinical and genomic data. In: 2019 IEEE International Conference on Bioinformatics and Biomedicine (BIBM). 2019; pp. 647\u2013651. https:\/\/doi.org\/10.1109\/BIBM47256.2019.8983014 . IEEE.","DOI":"10.1109\/BIBM47256.2019.8983014"},{"issue":"9","key":"5886_CR9","doi-asserted-by":"publisher","first-page":"1602","DOI":"10.1109\/JPROC.2015.2438719","volume":"103","author":"E Acar","year":"2015","unstructured":"Acar E, Bro R, Smilde AK. Data fusion in metabolomics using coupled matrix and tensor factorizations. Proc IEEE. 2015;103(9):1602\u201320. https:\/\/doi.org\/10.1109\/JPROC.2015.2438719.","journal-title":"Proc IEEE"},{"issue":"3","key":"5886_CR10","doi-asserted-by":"publisher","first-page":"455","DOI":"10.1137\/07070111X","volume":"51","author":"TG Kolda","year":"2009","unstructured":"Kolda TG, Bader BW. Tensor decompositions and applications. SIAM Review. 2009;51(3):455\u2013500. https:\/\/doi.org\/10.1137\/07070111X.","journal-title":"SIAM Review"},{"issue":"4","key":"5886_CR11","doi-asserted-by":"publisher","first-page":"1272","DOI":"10.1137\/110859063","volume":"33","author":"EC Chi","year":"2012","unstructured":"Chi EC, Kolda TG. On tensors, sparsity, and nonnegative factorizations. SIAM J Matrix Anal Appl. 2012;33(4):1272\u201399. https:\/\/doi.org\/10.1137\/110859063.","journal-title":"SIAM J Matrix Anal Appl"},{"issue":"2","key":"5886_CR12","doi-asserted-by":"publisher","DOI":"10.1088\/2632-2153\/ab8241","volume":"1","author":"JL Hinrich","year":"2020","unstructured":"Hinrich JL, Madsen KH, M\u00f8rup M. The probabilistic tensor decomposition toolbox. Mach Learn: Sci Technol. 2020;1(2): 025011. https:\/\/doi.org\/10.1088\/2632-2153\/ab8241.","journal-title":"Mach Learn: Sci Technol"},{"key":"5886_CR13","doi-asserted-by":"publisher","unstructured":"Schein A, Paisley J, Blei DM, Wallach H. Bayesian poisson tensor factorization for inferring multilateral relations from sparse dyadic event counts. In: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2015; pp. 1045\u20131054. https:\/\/doi.org\/10.1145\/2783258.2783414","DOI":"10.1145\/2783258.2783414"},{"key":"5886_CR14","doi-asserted-by":"publisher","unstructured":"Hu C, Rai P, Chen C, Harding M, Carin L. Scalable bayesian non-negative tensor factorization for massive count data. In: Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2015, Porto, Portugal, September 7-11, 2015, Proceedings, Part II 15. 2015; pp. 53\u201370. https:\/\/doi.org\/10.1007\/978-3-319-23525-7_4 . Springer.","DOI":"10.1007\/978-3-319-23525-7_4"},{"issue":"3","key":"5886_CR15","doi-asserted-by":"publisher","first-page":"303","DOI":"10.1002\/env.830","volume":"18","author":"M Chiogna","year":"2007","unstructured":"Chiogna M, Gaetan C. Semiparametric zero-inflated poisson models with application to animal abundance studies. Environmetrics. 2007;18(3):303\u201314. https:\/\/doi.org\/10.1002\/env.830.","journal-title":"Environmetrics"},{"key":"5886_CR16","unstructured":"Simchowitz M. Zero-inflated poisson factorization for recommendation systems. Junior Independent Work (advised by D. Blei), Princeton University, Department of Mathematics. 2013."},{"issue":"1","key":"5886_CR17","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1080\/00401706.1992.10485228","volume":"34","author":"D Lambert","year":"1992","unstructured":"Lambert D. Zero-inflated poisson regression, with an application to defects in manufacturing. Technometrics. 1992;34(1):1\u201314. https:\/\/doi.org\/10.1080\/00401706.1992.10485228.","journal-title":"Technometrics"},{"issue":"4","key":"5886_CR18","doi-asserted-by":"publisher","first-page":"1360","DOI":"10.1016\/j.jspi.2004.10.008","volume":"136","author":"SK Ghosh","year":"2006","unstructured":"Ghosh SK, Mukhopadhyay P, Lu J-CJ. Bayesian analysis of zero-inflated regression models. J Stat Plan Inference. 2006;136(4):1360\u201375. https:\/\/doi.org\/10.1016\/j.jspi.2004.10.008.","journal-title":"J Stat Plan Inference"},{"issue":"7","key":"5886_CR19","doi-asserted-by":"publisher","first-page":"1700","DOI":"10.1016\/j.csda.2004.11.013","volume":"50","author":"G Tomasi","year":"2006","unstructured":"Tomasi G, Bro R. A comparison of algorithms for fitting the PARAFAC model. Comput Stat Data Anal. 2006;50(7):1700\u201334. https:\/\/doi.org\/10.1016\/j.csda.2004.11.013.","journal-title":"Comput Stat Data Anal"},{"key":"5886_CR20","doi-asserted-by":"publisher","first-page":"43803","DOI":"10.7554\/eLife.43803","volume":"8","author":"D Kotliar","year":"2019","unstructured":"Kotliar D, Veres A, Nagy MA, Tabrizi S, Hodis E, Melton DA, Sabeti PC. Identifying gene expression programs of cell-type identity and cellular activity with single-cell RNA-Seq. Elife. 2019;8:43803. https:\/\/doi.org\/10.7554\/eLife.43803.","journal-title":"Elife"},{"key":"5886_CR21","doi-asserted-by":"publisher","unstructured":"Acar E, Kolda TG, Dunlavy DM. All-at-once optimization for coupled matrix and tensor factorizations. 2011. arXiv preprint arXiv:1105.3422. https:\/\/doi.org\/10.48550\/arXiv.1105.3422","DOI":"10.48550\/arXiv.1105.3422"},{"key":"5886_CR22","doi-asserted-by":"publisher","unstructured":"Y\u0131lmaz YK, Cemgil AT. Probabilistic latent tensor factorization. In: International Conference on Latent Variable Analysis and Signal Separation. 2010; pp. 346\u2013353. https:\/\/doi.org\/10.1007\/978-3-642-15995-4_43 . Springer.","DOI":"10.1007\/978-3-642-15995-4_43"},{"key":"5886_CR23","doi-asserted-by":"publisher","unstructured":"Cemgil AT. Bayesian inference for nonnegative matrix factorisation models. Computational intelligence and neuroscience. 2009; 2009. https:\/\/doi.org\/10.1155\/2009\/785152","DOI":"10.1155\/2009\/785152"},{"key":"5886_CR24","unstructured":"Zhou M, Hannah L, Dunson D, Carin L. Beta-negative binomial process and poisson factor analysis. In: Artificial Intelligence and Statistics. 2012; pp. 1462\u20131471. PMLR."},{"issue":"518","key":"5886_CR25","doi-asserted-by":"publisher","first-page":"859","DOI":"10.1080\/01621459.2017.1285773","volume":"112","author":"DM Blei","year":"2017","unstructured":"Blei DM, Kucukelbir A, McAuliffe JD. Variational inference: a review for statisticians. J Am Stat Assoc. 2017;112(518):859\u201377. https:\/\/doi.org\/10.1080\/01621459.2017.1285773.","journal-title":"J Am Stat Assoc"},{"key":"5886_CR26","volume-title":"Pattern Recognition and Machine Learning","author":"CM Bishop","year":"2006","unstructured":"Bishop CM, Nasrabadi NM. Pattern Recognition and Machine Learning, vol. 4. Berlin: Springer; 2006."},{"key":"5886_CR27","doi-asserted-by":"publisher","unstructured":"Prem G, Hofman\u00a0Jake M, Blei\u00a0David M. Scalable recommendation with poisson factorization. 2013. arXiv: 1311.1704. https:\/\/doi.org\/10.48550\/arXiv.1311.1704","DOI":"10.48550\/arXiv.1311.1704"},{"key":"5886_CR28","unstructured":"Paisley JW, Blei DM, Jordan MI. Bayesian nonnegative matrix factorization with stochastic variational inference. In: Handbook of Mixed Membership Models and Their Applications. Chapman and Hall\/CRC, 2014."},{"key":"5886_CR29","unstructured":"Hoffman MD, Blei DM, Wang C, Paisley J. Stochastic variational inference. J Mach Learn Res. 2013."},{"key":"5886_CR30","unstructured":"Ranganath R, Gerrish S, Blei D. Black box variational inference. In: Artificial Intelligence and Statistics. 2014; pp. 814\u2013822. PMLR."},{"key":"5886_CR31","doi-asserted-by":"crossref","unstructured":"Robbins H, Monro S. A stochastic approximation method. The Annals of Mathematical Statistics. 1951;400\u2013407. https:\/\/www.jstor.org\/stable\/2236626","DOI":"10.1214\/aoms\/1177729586"},{"issue":"12","key":"5886_CR32","doi-asserted-by":"publisher","first-page":"4164","DOI":"10.1073\/pnas.030853110","volume":"101","author":"J-P Brunet","year":"2004","unstructured":"Brunet J-P, Tamayo P, Golub TR, Mesirov JP. Metagenes and molecular pattern discovery using matrix factorization. Proc Natl Acad Sci. 2004;101(12):4164\u20139. https:\/\/doi.org\/10.1073\/pnas.030853110.","journal-title":"Proc Natl Acad Sci"},{"key":"5886_CR33","unstructured":"MacQueen J, et al. Some methods for classification and analysis of multivariate observations. In: Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability. 1967; vol. 1, pp. 281\u2013297. Oakland, CA, USA."},{"key":"5886_CR34","doi-asserted-by":"publisher","unstructured":"Breunig MM, Kriegel H-P, Ng RT, Sander J. Lof: identifying density-based local outliers. In: Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data. 2000; pp. 93\u2013104. https:\/\/doi.org\/10.1145\/342009.335388","DOI":"10.1145\/342009.335388"},{"key":"5886_CR35","doi-asserted-by":"publisher","first-page":"53","DOI":"10.1016\/0377-0427(87)90125-7","volume":"20","author":"PJ Rousseeuw","year":"1987","unstructured":"Rousseeuw PJ. Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J Comput Appl Math. 1987;20:53\u201365. https:\/\/doi.org\/10.1016\/0377-0427(87)90125-7.","journal-title":"J Comput Appl Math"},{"issue":"1","key":"5886_CR36","first-page":"973","volume":"20","author":"E Bingham","year":"2019","unstructured":"Bingham E, Chen JP, Jankowiak M, Obermeyer F, Pradhan N, Karaletsos T, Singh R, Szerlip P, Horsfall P, Goodman ND. Pyro: Deep universal probabilistic programming. J Mach Learn Res. 2019;20(1):973\u20138.","journal-title":"J Mach Learn Res"},{"issue":"6755","key":"5886_CR37","doi-asserted-by":"publisher","first-page":"788","DOI":"10.1038\/44565","volume":"401","author":"DD Lee","year":"1999","unstructured":"Lee DD, Seung HS. Learning the parts of objects by non-negative matrix factorization. Nature. 1999;401(6755):788\u201391.","journal-title":"Nature"},{"key":"5886_CR38","volume-title":"Principal Component Analysis for Special Types of Data","author":"IT Jolliffe","year":"2002","unstructured":"Jolliffe IT. Principal Component Analysis for Special Types of Data. Berlin: Springer; 2002."},{"issue":"7505","key":"5886_CR39","doi-asserted-by":"publisher","first-page":"363","DOI":"10.1038\/nature13437","volume":"510","author":"AK Shalek","year":"2014","unstructured":"Shalek AK, Satija R, Shuga J, Trombetta JJ, Gennert D, Lu D, Chen P, Gertner RS, Gaublomme JT, Yosef N, et al. Single-cell RNA-Seq reveals dynamic paracrine control of cellular variation. Nature. 2014;510(7505):363\u20139.","journal-title":"Nature"},{"issue":"7","key":"5886_CR40","doi-asserted-by":"publisher","first-page":"1611","DOI":"10.1016\/j.cell.2017.10.044","volume":"171","author":"SV Puram","year":"2017","unstructured":"Puram SV, Tirosh I, Parikh AS, Patel AP, Yizhak K, Gillespie S, Rodman C, Luo CL, Mroz EA, Emerick KS, et al. Single-cell transcriptomic analysis of primary and metastatic tumor ecosystems in head and neck cancer. Cell. 2017;171(7):1611\u201324.","journal-title":"Cell"},{"issue":"4","key":"5886_CR41","doi-asserted-by":"publisher","first-page":"1015","DOI":"10.1016\/j.cell.2018.07.028","volume":"174","author":"A Saunders","year":"2018","unstructured":"Saunders A, Macosko EZ, Wysoker A, Goldman M, Krienen FM, Rivera H, Bien E, Baum M, Bortolin L, Wang S, et al. Molecular diversity and specializations among the cells of the adult mouse brain. Cell. 2018;174(4):1015\u201330.","journal-title":"Cell"},{"issue":"4","key":"5886_CR42","doi-asserted-by":"publisher","first-page":"199","DOI":"10.1093\/bib\/bbad199","volume":"24","author":"Q Yang","year":"2023","unstructured":"Yang Q, Xu Z, Zhou W, Wang P, Jiang Q, Juan L. An interpretable single-cell RNA sequencing data clustering method based on latent Dirichlet allocation. Brief Bioinform. 2023;24(4):199.","journal-title":"Brief Bioinform"},{"issue":"Jan","key":"5886_CR43","first-page":"993","volume":"3","author":"DM Blei","year":"2003","unstructured":"Blei DM, Ng AY, Jordan MI. Latent Dirichlet allocation. J Mach Learn Res. 2003;3(Jan):993\u20131022.","journal-title":"J Mach Learn Res"},{"issue":"19","key":"5886_CR44","doi-asserted-by":"publisher","first-page":"5052","DOI":"10.1109\/TSP.2016.2576427","volume":"64","author":"K Huang","year":"2016","unstructured":"Huang K, Sidiropoulos ND, Liavas AP. A flexible and efficient algorithmic framework for constrained matrix and tensor factorization. IEEE Trans Signal Process. 2016;64(19):5052\u201365.","journal-title":"IEEE Trans Signal Process"},{"issue":"1","key":"5886_CR45","doi-asserted-by":"publisher","first-page":"174","DOI":"10.1186\/s13059-017-1305-0","volume":"18","author":"L Zappia","year":"2017","unstructured":"Zappia L, Phipson B, Oshlack A. Splatter: simulation of single-cell RNA sequencing data. Genome Biol. 2017;18(1):174. https:\/\/doi.org\/10.1186\/s13059-017-1305-0.","journal-title":"Genome Biol"},{"key":"5886_CR46","doi-asserted-by":"publisher","unstructured":"Cohen I, Huang Y, Chen J, Benesty J, Benesty J, Chen J, Huang Y, Cohen I. Pearson correlation coefficient. Noise Reduction in Speech Processing. 2009;1\u20134. https:\/\/doi.org\/10.1007\/978-3-642-00296-0_5.","DOI":"10.1007\/978-3-642-00296-0_5"},{"issue":"2","key":"5886_CR47","doi-asserted-by":"publisher","first-page":"247","DOI":"10.1038\/s41587-023-01772-1","volume":"42","author":"D Song","year":"2024","unstructured":"Song D, Wang Q, Yan G, Liu T, Sun T, Li JJ. scdesign3 generates realistic in silico data for multimodal single-cell and spatial omics. Nat Biotechnol. 2024;42(2):247\u201352.","journal-title":"Nat Biotechnol"},{"issue":"1","key":"5886_CR48","doi-asserted-by":"publisher","first-page":"89","DOI":"10.1038\/nbt.4042","volume":"36","author":"HM Kang","year":"2018","unstructured":"Kang HM, Subramaniam M, Targ S, Nguyen M, Maliskova L, McCarthy E, Wan E, Wong S, Byrnes L, Lanata CM, et al. Multiplexed droplet single-cell RNA-sequencing using natural genetic variation. Nat Biotechnol. 2018;36(1):89\u201394. https:\/\/doi.org\/10.1038\/nbt.4042.","journal-title":"Nat Biotechnol"},{"issue":"D1","key":"5886_CR49","doi-asserted-by":"publisher","first-page":"1003","DOI":"10.1093\/nar\/gkac888","volume":"51","author":"RL Seal","year":"2023","unstructured":"Seal RL, Braschi B, Gray K, Jones TE, Tweedie S, Haim-Vilmovsky L, Bruford EA. Genenames.org: the HGNC resources in 2023. Nucleic Acids Res. 2023;51(D1):1003\u20139. https:\/\/doi.org\/10.1093\/nar\/gkac888.","journal-title":"Nucleic Acids Res"},{"issue":"6594","key":"5886_CR50","doi-asserted-by":"publisher","first-page":"5197","DOI":"10.1126\/science.abl5197","volume":"376","author":"C Dom\u00ednguez Conde","year":"2022","unstructured":"Dom\u00ednguez Conde C, Xu C, Jarvis L, Rainbow D, Wells S, Gomes T, Howlett S, Suchanek O, Polanski K, King H, et al. Cross-tissue immune cell analysis reveals tissue-specific features in humans. Science. 2022;376(6594):5197. https:\/\/doi.org\/10.1126\/science.abl5197.","journal-title":"Science"},{"issue":"43","key":"5886_CR51","doi-asserted-by":"publisher","first-page":"15545","DOI":"10.1073\/pnas.0506580102","volume":"102","author":"A Subramanian","year":"2005","unstructured":"Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci. 2005;102(43):15545\u201350. https:\/\/doi.org\/10.1073\/pnas.0506580102.","journal-title":"Proc Natl Acad Sci"},{"issue":"1","key":"5886_CR52","doi-asserted-by":"publisher","first-page":"757","DOI":"10.1093\/bioinformatics\/btac757","volume":"39","author":"Z Fang","year":"2023","unstructured":"Fang Z, Liu X, Peltz G. Gseapy: a comprehensive package for performing gene set enrichment analysis in python. Bioinformatics. 2023;39(1):757. https:\/\/doi.org\/10.1093\/bioinformatics\/btac757.","journal-title":"Bioinformatics"},{"issue":"2","key":"5886_CR53","doi-asserted-by":"publisher","first-page":"337","DOI":"10.1038\/s41556-022-01072-x","volume":"25","author":"M Lotfollahi","year":"2023","unstructured":"Lotfollahi M, Rybakov S, Hrovatin K, Hediyeh-Zadeh S, Talavera-L\u00f3pez C, Misharin AV, Theis FJ. Biologically informed deep learning to query gene programs in single-cell atlases. Nat Cell Biol. 2023;25(2):337\u201350. https:\/\/doi.org\/10.1038\/s41556-022-01072-x.","journal-title":"Nat Cell Biol"},{"issue":"6589","key":"5886_CR54","doi-asserted-by":"publisher","first-page":"1970","DOI":"10.1126\/science.abf1970","volume":"376","author":"RK Perez","year":"2022","unstructured":"Perez RK, Gordon MG, Subramaniam M, Kim MC, Hartoularos GC, Targ S, Sun Y, Ogorodnikov A, Bueno R, Lu A, et al. Single-cell RNA-Seq reveals cell type-specific molecular and genetic associations to lupus. Science. 2022;376(6589):1970.","journal-title":"Science"},{"issue":"6594","key":"5886_CR55","doi-asserted-by":"publisher","first-page":"5197","DOI":"10.1126\/science.abl5197","volume":"376","author":"C Dom\u00ednguez Conde","year":"2022","unstructured":"Dom\u00ednguez Conde C, Xu C, Jarvis L, Rainbow D, Wells S, Gomes T, Howlett S, Suchanek O, Polanski K, King H, et al. Cross-tissue immune cell analysis reveals tissue-specific features in humans. Science. 2022;376(6594):5197.","journal-title":"Science"},{"issue":"6","key":"5886_CR56","doi-asserted-by":"publisher","first-page":"711","DOI":"10.1084\/jem.20021553","volume":"197","author":"L Bennett","year":"2003","unstructured":"Bennett L, Palucka AK, Arce E, Cantrell V, Borvak J, Banchereau J, Pascual V. Interferon and granulopoiesis signatures in systemic lupus erythematosus blood. J Exp Med. 2003;197(6):711\u201323.","journal-title":"J Exp Med"},{"issue":"3","key":"5886_CR57","doi-asserted-by":"publisher","first-page":"541","DOI":"10.1038\/s41588-024-01659-0","volume":"56","author":"H Jin","year":"2024","unstructured":"Jin H, Gulhan DC, Geiger B, Ben-Isvy D, Geng D, Ljungstr\u00f6m V, Park PJ. Accurate and sensitive mutational signature analysis with musical. Nat Genet. 2024;56(3):541\u201352.","journal-title":"Nat Genet"},{"key":"5886_CR58","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13073-021-00988-7","volume":"13","author":"I Sason","year":"2021","unstructured":"Sason I, Chen Y, Leiserson MD, Sharan R. A mixture model for signature discovery from sparse mutation data. Genome Med. 2021;13:1\u201312.","journal-title":"Genome Med"},{"issue":"2","key":"5886_CR59","doi-asserted-by":"publisher","first-page":"458","DOI":"10.1111\/biom.12779","volume":"74","author":"IIM Gauran","year":"2018","unstructured":"Gauran IIM, Park J, Lim J, Park D, Zylstra J, Peterson T, Kann M, Spouge JL. Empirical null estimation using zero-inflated discrete mixture distributions and its application to protein domain data. Biometrics. 2018;74(2):458\u201371.","journal-title":"Biometrics"},{"issue":"4","key":"5886_CR60","doi-asserted-by":"publisher","first-page":"562","DOI":"10.1093\/biostatistics\/kxx053","volume":"19","author":"SC Hicks","year":"2018","unstructured":"Hicks SC, Townes FW, Teng M, Irizarry RA. Missing data and technical variability in single-cell RNA-sequencing experiments. Biostatistics. 2018;19(4):562\u201378. https:\/\/doi.org\/10.1093\/biostatistics\/kxx053.","journal-title":"Biostatistics"},{"key":"5886_CR61","doi-asserted-by":"publisher","unstructured":"Kingma DP, Ba J. Adam: A method for stochastic optimization. 2014. arXiv preprint arXiv:1412.6980. https:\/\/doi.org\/10.48550\/arXiv.1412.6980.","DOI":"10.48550\/arXiv.1412.6980"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-024-05886-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-024-05886-4\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-024-05886-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,10,6]],"date-time":"2024-10-06T04:01:44Z","timestamp":1728187304000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-024-05886-4"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,10,5]]},"references-count":61,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2024,12]]}},"alternative-id":["5886"],"URL":"https:\/\/doi.org\/10.1186\/s12859-024-05886-4","relation":{},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,10,5]]},"assertion":[{"value":"27 December 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"30 July 2024","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"5 October 2024","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare no Conflict of interest.","order":4,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"323"}}