{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,2]],"date-time":"2026-03-02T16:52:50Z","timestamp":1772470370092,"version":"3.50.1"},"reference-count":42,"publisher":"Oxford University Press (OUP)","issue":"7","license":[{"start":{"date-parts":[[2023,7,14]],"date-time":"2023-07-14T00:00:00Z","timestamp":1689292800000},"content-version":"vor","delay-in-days":13,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Australian Government Research Training Program"},{"DOI":"10.13039\/501100000923","name":"Australian Research Council","doi-asserted-by":"publisher","award":["DP220100985"],"award-info":[{"award-number":["DP220100985"]}],"id":[{"id":"10.13039\/501100000923","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2023,7,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:sec>\n                  <jats:title>Motivation<\/jats:title>\n                  <jats:p>Identification of cell types using single-cell RNA-seq is revolutionizing the study of multicellular organisms. However, typical single-cell RNA-seq analysis often involves post hoc manual curation to ensure clusters are transcriptionally distinct, which is time-consuming, error-prone, and irreproducible.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Results<\/jats:title>\n                  <jats:p>To overcome these obstacles, we developed Cytocipher, a bioinformatics method and scverse compatible software package that statistically determines significant clusters. Application of Cytocipher to normal tissue, development, disease, and large-scale atlas data reveals the broad applicability and power of Cytocipher to generate biological insights in numerous contexts. This included the identification of cell types not previously described in the datasets analysed, such as CD8+ T cell subtypes in human peripheral blood mononuclear cells; cell lineage intermediate states during mouse pancreas development; and subpopulations of luminal epithelial cells over-represented in prostate cancer. Cytocipher also scales to large datasets with high-test performance, as shown by application to the Tabula Sapiens Atlas representing &amp;gt;480\u2009000 cells. Cytocipher is a novel and generalizable method that statistically determines transcriptionally distinct and programmatically reproducible clusters from single-cell data.<\/jats:p>\n               <\/jats:sec>\n               <jats:sec>\n                  <jats:title>Availability and implementation<\/jats:title>\n                  <jats:p>The software version used for this manuscript has been deposited on Zenodo (https:\/\/doi.org\/10.5281\/zenodo.8089546), and is also available via github (https:\/\/github.com\/BradBalderson\/Cytocipher).<\/jats:p>\n               <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btad435","type":"journal-article","created":{"date-parts":[[2023,7,14]],"date-time":"2023-07-14T13:37:37Z","timestamp":1689341857000},"source":"Crossref","is-referenced-by-count":4,"title":["<i>Cytocipher<\/i> determines significantly different populations of cells in single-cell RNA-seq data"],"prefix":"10.1093","volume":"39","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-5153-6601","authenticated-orcid":false,"given":"Brad","family":"Balderson","sequence":"first","affiliation":[{"name":"School of Chemistry and Molecular Biosciences, University of Queensland , Brisbane, QLD 4072, Australia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6759-2560","authenticated-orcid":false,"given":"Michael","family":"Piper","sequence":"additional","affiliation":[{"name":"School of Biomedical Sciences, University of Queensland , Brisbane, QLD 4072, Australia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5095-541X","authenticated-orcid":false,"given":"Stefan","family":"Thor","sequence":"additional","affiliation":[{"name":"School of Biomedical Sciences, University of Queensland , Brisbane, QLD 4072, Australia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3548-268X","authenticated-orcid":false,"given":"Mikael","family":"Bod\u00e9n","sequence":"additional","affiliation":[{"name":"School of Chemistry and Molecular Biosciences, University of Queensland , Brisbane, QLD 4072, Australia"}]}],"member":"286","published-online":{"date-parts":[[2023,7,14]]},"reference":[{"key":"2023072608070252500_btad435-B1","doi-asserted-by":"crossref","first-page":"257","DOI":"10.1038\/385257a0","article-title":"Independent requirement for ISL1 in formation of pancreatic mesenchyme and islet cells","volume":"385","author":"Ahlgren","year":"1997","journal-title":"Nature"},{"key":"2023072608070252500_btad435-B2","doi-asserted-by":"crossref","first-page":"1083","DOI":"10.1038\/nmeth.4463","article-title":"SCENIC: single-cell regulatory network inference and clustering","volume":"14","author":"Aibar","year":"2017","journal-title":"Nat Methods"},{"key":"2023072608070252500_btad435-B3","doi-asserted-by":"crossref","first-page":"15","DOI":"10.1186\/s13059-017-1382-0","article-title":"SCANPY: large-scale single-cell gene expression data analysis","volume":"19","author":"Alexander Wolf","year":"2018","journal-title":"Genome Biol"},{"key":"2023072608070252500_btad435-B4","doi-asserted-by":"crossref","first-page":"dev173849","DOI":"10.1242\/dev.173849","article-title":"Comprehensive single cell mRNA profiling reveals a detailed roadmap for pancreatic endocrinogenesis","volume":"146","author":"Bastidas-Ponce","year":"2019","journal-title":"Development"},{"key":"2023072608070252500_btad435-B5","doi-asserted-by":"crossref","first-page":"1408","DOI":"10.1038\/s41587-020-0591-3","article-title":"Generalizing RNA velocity to transient cell states through dynamical modeling","volume":"38","author":"Bergen","year":"2020","journal-title":"Nat Biotechnol"},{"key":"2023072608070252500_btad435-B6","doi-asserted-by":"crossref","first-page":"428","DOI":"10.1186\/s12859-020-03774-1","article-title":"Hypercluster: a flexible tool for parallelized unsupervised clustering optimization","volume":"21","author":"Blumenberg","year":"2020","journal-title":"BMC Bioinformatics"},{"key":"2023072608070252500_btad435-B7","doi-asserted-by":"crossref","first-page":"1125","DOI":"10.1158\/1541-7786.MCR-17-0230","article-title":"miR-34a regulates expression of the stathmin-1 oncoprotein and prostate cancer progression","volume":"16","author":"Chakravarthi","year":"2018","journal-title":"Mol Cancer Res"},{"key":"2023072608070252500_btad435-B8","doi-asserted-by":"crossref","first-page":"87","DOI":"10.1038\/s41556-020-00613-6","article-title":"Single-cell analysis reveals transcriptomic remodellings in distinct cell types that contribute to human prostate cancer progression","volume":"23","author":"Chen","year":"2021","journal-title":"Nat Cell Biol"},{"key":"2023072608070252500_btad435-B9","doi-asserted-by":"crossref","first-page":"245","DOI":"10.1038\/s41587-021-01033-z","article-title":"Differential abundance testing on single-cell data using k-nearest neighbor graphs","volume":"40","author":"Dann","year":"2022","journal-title":"Nat Biotechnol"},{"key":"2023072608070252500_btad435-B10","doi-asserted-by":"crossref","first-page":"78","DOI":"10.1186\/s13059-021-02286-2","article-title":"Giotto: a toolbox for integrative analysis and visualization of spatial expression data","volume":"22","author":"Dries","year":"2021","journal-title":"Genome Biol"},{"key":"2023072608070252500_btad435-B11","doi-asserted-by":"crossref","first-page":"17562872221103988","DOI":"10.1177\/17562872221103988","article-title":"Biomarkers for prostate cancer detection and risk stratification","volume":"14","author":"Farha","year":"2022","journal-title":"Ther Adv Urol"},{"key":"2023072608070252500_btad435-B12","doi-asserted-by":"crossref","DOI":"10.1038\/s41592-023-01933-9","article-title":"Significance analysis for clustering with single-cell RNA-sequencing data","author":"Grabski","year":"2023"},{"key":"2023072608070252500_btad435-B13","doi-asserted-by":"crossref","first-page":"3573","DOI":"10.1016\/j.cell.2021.04.048","article-title":"Integrated analysis of multimodal single-cell data","volume":"184","author":"Hao","year":"2021","journal-title":"Cell"},{"key":"2023072608070252500_btad435-B14","doi-asserted-by":"crossref","first-page":"e43803","DOI":"10.7554\/eLife.43803","article-title":"Identifying gene expression programs of cell-type identity and cellular activity with single-cell RNA-Seq","volume":"8","author":"Kotliar","year":"2019","journal-title":"Elife"},{"key":"2023072608070252500_btad435-B15","doi-asserted-by":"crossref","first-page":"775","DOI":"10.1016\/j.cell.2018.11.043","article-title":"Dysfunctional CD8 T cells form a proliferative, dynamically regulated compartment within human melanoma","volume":"176","author":"Li","year":"2019","journal-title":"Cell"},{"key":"2023072608070252500_btad435-B16","doi-asserted-by":"crossref","first-page":"232","DOI":"10.1186\/s13059-021-02445-5","article-title":"MultiK: an automated tool to determine optimal cluster numbers in single-cell RNA sequencing data","volume":"22","author":"Liu","year":"2021","journal-title":"Genome Biol"},{"key":"2023072608070252500_btad435-B17","doi-asserted-by":"crossref","first-page":"451","DOI":"10.1093\/biostatistics\/kxw055","article-title":"Overcoming confounding plate effects in differential expression analyses of single-cell RNA-seq data","volume":"18","author":"Lun","year":"2017","journal-title":"Biostatistics"},{"key":"2023072608070252500_btad435-B18","doi-asserted-by":"crossref","first-page":"99","DOI":"10.1016\/j.immuni.2020.11.005","article-title":"Comprehensive profiling of an aging immune system reveals clonal GZMK+ CD8+ T cells as conserved hallmark of inflammaging","volume":"54","author":"Mogilenko","year":"2021","journal-title":"Immunity"},{"key":"2023072608070252500_btad435-B19","doi-asserted-by":"crossref","first-page":"107","DOI":"10.1016\/j.semcdb.2015.08.013","article-title":"Pax4 acts as a key player in pancreas development and plasticity","volume":"44","author":"Napolitano","year":"2015","journal-title":"Semin Cell Dev Biol"},{"key":"2023072608070252500_btad435-B20","doi-asserted-by":"crossref","first-page":"39","DOI":"10.1186\/s12859-021-03957-4","article-title":"Selecting single cell clustering parameter values using subsampling-based robustness metrics","volume":"22","author":"Patterson-Cross","year":"2021","journal-title":"BMC Bioinformatics"},{"key":"2023072608070252500_btad435-B21","author":"Pedregosa"},{"key":"2023072608070252500_btad435-B22","doi-asserted-by":"crossref","first-page":"1925","DOI":"10.1038\/s41593-019-0483-3","article-title":"A repeated molecular architecture across thalamic pathways","volume":"22","author":"Phillips","year":"2019","journal-title":"Nat Neurosci"},{"key":"2023072608070252500_btad435-B23","doi-asserted-by":"crossref","first-page":"e1006378","DOI":"10.1371\/journal.pcbi.1006378","article-title":"clusterExperiment and RSEC: a bioconductor package and framework for clustering of single-cell and other large gene expression datasets","volume":"14","author":"Risso","year":"2018","journal-title":"PLoS Comput Biol"},{"key":"2023072608070252500_btad435-B24","doi-asserted-by":"crossref","first-page":"495","DOI":"10.1038\/nbt.3192","article-title":"Spatial reconstruction of single-cell gene expression data","volume":"33","author":"Satija","year":"2015","journal-title":"Nat Biotechnol"},{"key":"2023072608070252500_btad435-B25","doi-asserted-by":"crossref","first-page":"3356","DOI":"10.1038\/s41467-018-05740-1","article-title":"Endocrine lineage biases arise in temporally distinct endocrine progenitors during pancreatic morphogenesis","volume":"9","author":"Scavuzzo","year":"2018","journal-title":"Nat Commun"},{"key":"2023072608070252500_btad435-B26","doi-asserted-by":"crossref","first-page":"giz087","DOI":"10.1093\/gigascience\/giz087","article-title":"ascend: R package for analysis of single-cell RNA-seq data","volume":"8","author":"Senabouth","year":"2019","journal-title":"Gigascience"},{"key":"2023072608070252500_btad435-B27","doi-asserted-by":"crossref","first-page":"31753","DOI":"10.18632\/oncotarget.25878","article-title":"A novel fatty acid-binding protein 5-estrogen-related receptor signaling pathway promotes cell growth and energy metabolism in prostate cancer cells","volume":"9","author":"Senga","year":"2018","journal-title":"Oncotarget"},{"key":"2023072608070252500_btad435-B28","doi-asserted-by":"crossref","DOI":"10.1101\/2022.01.31.478592","article-title":"ClustAssess: tools for assessing the robustness of single-cell clustering","author":"Shahsavari","year":"2022"},{"key":"2023072608070252500_btad435-B29","doi-asserted-by":"crossref","first-page":"5692","DOI":"10.1038\/s41467-021-25960-2","article-title":"Confronting false discoveries in single-cell differential expression","volume":"12","author":"Squair","year":"2021","journal-title":"Nat Commun"},{"key":"2023072608070252500_btad435-B30","doi-asserted-by":"crossref","first-page":"eabl4896","DOI":"10.1126\/science.abl4896","article-title":"The Tabula Sapiens: a multiple-organ, single-cell transcriptomic atlas of humans","volume":"376","author":"Tabula Sapiens Consortium","year":"2022","journal-title":"Science"},{"key":"2023072608070252500_btad435-B31","doi-asserted-by":"crossref","first-page":"5233","DOI":"10.1038\/s41598-019-41695-z","article-title":"From Louvain to Leiden: guaranteeing well-connected communities","volume":"9","author":"Traag","year":"2019","journal-title":"Sci Rep"},{"key":"2023072608070252500_btad435-B32","doi-asserted-by":"crossref","first-page":"381","DOI":"10.1038\/nbt.2859","article-title":"Pseudo-temporal ordering of individual cells reveals dynamics and regulators of cell fate decisions","volume":"32","author":"Trapnell","year":"2014","journal-title":"Nat Biotechnol"},{"key":"2023072608070252500_btad435-B33","doi-asserted-by":"crossref","first-page":"110132","DOI":"10.1016\/j.celrep.2021.110132","article-title":"Resolving the immune landscape of human prostate at a single-cell level in health and cancer","volume":"37","author":"Tuong","year":"2021","journal-title":"Cell Rep"},{"key":"2023072608070252500_btad435-B34","doi-asserted-by":"crossref","DOI":"10.1101\/2020.12.08.417105","article-title":"constclust: consistent clusters for scRNA-seq","author":"Virshup","year":"2020"},{"key":"2023072608070252500_btad435-B35","doi-asserted-by":"crossref","first-page":"3490","DOI":"10.1038\/s41598-020-60384-w","article-title":"Spatial modeling of prostate cancer metabolic gene expression reveals extensive heterogeneity and selective vulnerabilities","volume":"10","author":"Wang","year":"2020","journal-title":"Sci Rep"},{"key":"2023072608070252500_btad435-B36","doi-asserted-by":"crossref","first-page":"431","DOI":"10.1002\/iid3.85","article-title":"Functional profile of S100A4-deficient T cells","volume":"3","author":"Weatherly","year":"2015","journal-title":"Immun Inflamm Dis"},{"key":"2023072608070252500_btad435-B37","doi-asserted-by":"crossref","first-page":"7090545","DOI":"10.1155\/2019\/7090545","article-title":"Prostatic acid phosphatase (PAP) predicts prostate cancer progress in a population-based study: the renewal of PAP?","volume":"2019","author":"Xu","year":"2019","journal-title":"Dis Markers"},{"key":"2023072608070252500_btad435-B38","doi-asserted-by":"crossref","first-page":"dev200076","DOI":"10.1242\/dev.200076","article-title":"Selective requirement for polycomb repressor complex 2 in the generation of specific hypothalamic neuronal subtypes","volume":"149","author":"Yaghmaeian Salmani","year":"2022","journal-title":"Development"},{"key":"2023072608070252500_btad435-B39","doi-asserted-by":"crossref","first-page":"57","DOI":"10.1186\/s13059-020-1950-6","article-title":"Decontamination of ambient RNA in single-cell RNA-seq with DecontX","volume":"21","author":"Yang","year":"2020","journal-title":"Genome Biol"},{"key":"2023072608070252500_btad435-B40","doi-asserted-by":"crossref","first-page":"giy083","DOI":"10.1093\/gigascience\/giy083","article-title":"Clustering trees: a visualization for evaluating clusterings at multiple resolutions","volume":"7","author":"Zappia","year":"2018","journal-title":"Gigascience"},{"key":"2023072608070252500_btad435-B41","doi-asserted-by":"crossref","first-page":"174","DOI":"10.1186\/s13059-017-1305-0","article-title":"Splatter: simulation of single-cell RNA sequencing data","volume":"18","author":"Zappia","year":"2017","journal-title":"Genome Biol"},{"key":"2023072608070252500_btad435-B42","doi-asserted-by":"crossref","first-page":"829","DOI":"10.1101\/gad.235499.113","article-title":"The diabetes gene Hhex maintains \u03b4-cell differentiation and islet function","volume":"28","author":"Zhang","year":"2014","journal-title":"Genes Dev"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btad435\/50883193\/btad435.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/39\/7\/btad435\/50962393\/btad435.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/39\/7\/btad435\/50962393\/btad435.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,7,26]],"date-time":"2023-07-26T09:13:52Z","timestamp":1690362832000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/doi\/10.1093\/bioinformatics\/btad435\/7224247"}},"subtitle":[],"editor":[{"given":"Anthony","family":"Mathelier","sequence":"additional","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2023,7,1]]},"references-count":42,"journal-issue":{"issue":"7","published-print":{"date-parts":[[2023,7,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btad435","relation":{},"ISSN":["1367-4811"],"issn-type":[{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2023,7,1]]},"published":{"date-parts":[[2023,7,1]]},"article-number":"btad435"}}