{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,20]],"date-time":"2026-04-20T19:33:04Z","timestamp":1776713584446,"version":"3.51.2"},"reference-count":47,"publisher":"Oxford University Press (OUP)","issue":"12","license":[{"start":{"date-parts":[[2020,3,24]],"date-time":"2020-03-24T00:00:00Z","timestamp":1585008000000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Cincinnati Children\u2019s Hospital Research Foundation"},{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","award":["R01CA226802"],"award-info":[{"award-number":["R01CA226802"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100000002","name":"National Institutes of Health","doi-asserted-by":"publisher","award":["R21AI35595"],"award-info":[{"award-number":["R21AI35595"]}],"id":[{"id":"10.13039\/100000002","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2020,6,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Motivation<\/jats:title>\n                    <jats:p>The rapid proliferation of single-cell RNA-sequencing (scRNA-Seq) technologies has spurred the development of diverse computational approaches to detect transcriptionally coherent populations. While the complexity of the algorithms for detecting heterogeneity has increased, most require significant user-tuning, are heavily reliant on dimension reduction techniques and are not scalable to ultra-large datasets. We previously described a multi-step algorithm, Iterative Clustering and Guide-gene Selection (ICGS), which applies intra-gene correlation and hybrid clustering to uniquely resolve novel transcriptionally coherent cell populations from an intuitive graphical user interface.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>We describe a new iteration of ICGS that outperforms state-of-the-art scRNA-Seq detection workflows when applied to well-established benchmarks. This approach combines multiple complementary subtype detection methods (HOPACH, sparse non-negative matrix factorization, cluster \u2018fitness\u2019, support vector machine) to resolve rare and common cell-states, while minimizing differences due to donor or batch effects. Using data from multiple cell atlases, we show that the PageRank algorithm effectively downsamples ultra-large scRNA-Seq datasets, without losing extremely rare or transcriptionally similar yet distinct cell types and while recovering novel transcriptionally distinct cell populations. We believe this new approach holds tremendous promise in reproducibly resolving hidden cell populations in complex datasets.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Availability and implementation<\/jats:title>\n                    <jats:p>ICGS2 is implemented in Python. The source code and documentation are available at http:\/\/altanalyze.org.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Supplementary information<\/jats:title>\n                    <jats:p>Supplementary data are available at Bioinformatics online.<\/jats:p>\n                  <\/jats:sec>","DOI":"10.1093\/bioinformatics\/btaa201","type":"journal-article","created":{"date-parts":[[2020,3,19]],"date-time":"2020-03-19T16:12:25Z","timestamp":1584634345000},"page":"3773-3780","source":"Crossref","is-referenced-by-count":58,"title":["Resolving single-cell heterogeneity from hundreds of thousands of cells through sequential hybrid clustering and NMF"],"prefix":"10.1093","volume":"36","author":[{"given":"Meenakshi","family":"Venkatasubramanian","sequence":"first","affiliation":[{"name":"Department of Electrical Engineering and Computer Science , University of Cincinnati"},{"name":"Division of Biomedical Informatics , Cincinnati Children\u2019s Hospital Medical Center"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Kashish","family":"Chetal","sequence":"additional","affiliation":[{"name":"Division of Biomedical Informatics , Cincinnati Children\u2019s Hospital Medical Center"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Daniel J","family":"Schnell","sequence":"additional","affiliation":[{"name":"Division of Biomedical Informatics , Cincinnati Children\u2019s Hospital Medical Center"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Gowtham","family":"Atluri","sequence":"additional","affiliation":[{"name":"Department of Electrical Engineering and Computer Science , University of Cincinnati"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9689-2469","authenticated-orcid":false,"given":"Nathan","family":"Salomonis","sequence":"additional","affiliation":[{"name":"Division of Biomedical Informatics , Cincinnati Children\u2019s Hospital Medical Center"},{"name":"Department of Biomedical Informatics , University of Cincinnati, Cincinnati, OH 45267, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"286","published-online":{"date-parts":[[2020,3,24]]},"reference":[{"key":"2023063011074042800_btaa201-B1","doi-asserted-by":"crossref","first-page":"194","DOI":"10.1186\/s13059-019-1795-z","article-title":"A comparison of automatic cell identification methods for single-cell RNA sequencing data","volume":"20","author":"Abdelaal","year":"2019","journal-title":"Genome Biol"},{"key":"2023063011074042800_btaa201-B2","doi-asserted-by":"crossref","first-page":"114","DOI":"10.1016\/j.mam.2017.07.002","article-title":"Identifying cell populations with scRNASeq","volume":"59","author":"Andrews","year":"2018","journal-title":"Mol. Aspects Med"},{"key":"2023063011074042800_btaa201-B3","doi-asserted-by":"publisher","first-page":"101374","DOI":"10.1016\/j.is.2019.02.006","article-title":"ANN-Benchmarks: A benchmarking tool for approximate nearest neighbor algorithms","volume":"87","author":"Aum\u00fcller","year":"2020","journal-title":"Information Systems"},{"key":"2023063011074042800_btaa201-B4","doi-asserted-by":"crossref","first-page":"1293","DOI":"10.1016\/j.cell.2018.05.060","article-title":"Single-cell map of diverse immune phenotypes in the breast tumor microenvironment","volume":"174","author":"Azizi","year":"2018","journal-title":"Cell"},{"key":"2023063011074042800_btaa201-B5","doi-asserted-by":"crossref","first-page":"1350","DOI":"10.1016\/j.patcog.2007.09.010","article-title":"SVD based initialization: a head start for nonnegative matrix factorization","volume":"41","author":"Boutsidis","year":"2008","journal-title":"Pattern Recognit"},{"key":"2023063011074042800_btaa201-B6","doi-asserted-by":"crossref","first-page":"411","DOI":"10.1038\/nbt.4096","article-title":"Integrating single-cell transcriptomic data across different conditions, technologies, and species","volume":"36","author":"Butler","year":"2018","journal-title":"Nat. Biotechnol"},{"key":"2023063011074042800_btaa201-B7","doi-asserted-by":"crossref","first-page":"496","DOI":"10.1038\/s41586-019-0969-x","article-title":"The single-cell transcriptional landscape of mammalian organogenesis","volume":"566","author":"Cao","year":"2019","journal-title":"Nature"},{"key":"2023063011074042800_btaa201-B8","doi-asserted-by":"crossref","first-page":"1399","DOI":"10.1084\/jem.20052454","article-title":"Plasmacytoid dendritic cell-specific receptor ILT7-Fc epsilonRI gamma inhibits Toll-like receptor-induced interferon production","volume":"203","author":"Cao","year":"2006","journal-title":"J. Exp. Med"},{"key":"2023063011074042800_btaa201-B9","doi-asserted-by":"crossref","first-page":"4906","DOI":"10.1038\/s41467-018-07333-4","article-title":"Defining human cardiac transcription factor hierarchies using integrated single-cell heterogeneity analysis","volume":"9","author":"Churko","year":"2018","journal-title":"Nat. Commun"},{"key":"2023063011074042800_btaa201-B10","doi-asserted-by":"crossref","first-page":"273","DOI":"10.1007\/BF00994018","article-title":"Support-vector networks","volume":"20","author":"Cortes","year":"1995","journal-title":"Mach. Learn"},{"key":"2023063011074042800_btaa201-B11","doi-asserted-by":"crossref","first-page":"e138","DOI":"10.1093\/nar\/gkz789","article-title":"cellHarmony: cell-level matching and holistic comparison of single-cell transcriptomes","volume":"47","author":"DePasquale","year":"2019","journal-title":"Nucleic Acids Res"},{"key":"2023063011074042800_btaa201-B12","doi-asserted-by":"crossref","first-page":"28","DOI":"10.1186\/s40425-018-0339-5","article-title":"An innovative immunotherapeutic strategy for ovarian cancer: CLEC10A and glycomimetic peptides","volume":"6","author":"Eggink","year":"2018","journal-title":"J. Immunother. Cancer"},{"key":"2023063011074042800_btaa201-B13","doi-asserted-by":"crossref","first-page":"W755","DOI":"10.1093\/nar\/gkq405","article-title":"AltAnalyze and DomainGraph: analyzing and visualizing exon expression data","volume":"38","author":"Emig","year":"2010","journal-title":"Nucleic Acids Res"},{"key":"2023063011074042800_btaa201-B14","doi-asserted-by":"crossref","first-page":"e1004575","DOI":"10.1371\/journal.pcbi.1004575","article-title":"SINCERA: a pipeline for single-cell RNA-Seq profiling analysis","volume":"11","author":"Guo","year":"2015","journal-title":"PLoS Comput. Biol"},{"key":"2023063011074042800_btaa201-B15","doi-asserted-by":"crossref","first-page":"51","DOI":"10.1016\/j.exphem.2018.09.004","article-title":"The Human Cell Atlas bone marrow single-cell interactive web portal","volume":"68","author":"Hay","year":"2018","journal-title":"Exp. Hematol"},{"key":"2023063011074042800_btaa201-B16","doi-asserted-by":"crossref","first-page":"744","DOI":"10.3389\/fimmu.2018.00744","article-title":"CLEC10A is a specific marker for human CD1c+ dendritic cells and enhances their toll-like receptor 7\/8-induced cytokine secretion","volume":"9","author":"Heger","year":"2018","journal-title":"Front. Immunol"},{"key":"2023063011074042800_btaa201-B17","doi-asserted-by":"crossref","first-page":"685","DOI":"10.1038\/s41587-019-0113-3","article-title":"Efficient integration of heterogeneous single-cell transcriptomes using Scanorama","volume":"37","author":"Hie","year":"2019","journal-title":"Nat. Biotechnol"},{"key":"2023063011074042800_btaa201-B18","doi-asserted-by":"crossref","first-page":"193","DOI":"10.1007\/BF01908075","article-title":"Comparing partitions","volume":"2","author":"Hubert","year":"1985","journal-title":"J. Class"},{"key":"2023063011074042800_btaa201-B19","doi-asserted-by":"crossref","first-page":"dev173047","DOI":"10.1242\/dev.173047","article-title":"Maturation of heart valve cell populations during postnatal remodeling","volume":"146","author":"Hulin","year":"2019","journal-title":"Development"},{"key":"2023063011074042800_btaa201-B20","doi-asserted-by":"crossref","first-page":"878","DOI":"10.1101\/gr.230771.117","article-title":"bigSCale: an analytical framework for big-scale single-cell data","volume":"28","author":"Iacono","year":"2018","journal-title":"Genome Res"},{"key":"2023063011074042800_btaa201-B21","doi-asserted-by":"crossref","first-page":"1495","DOI":"10.1093\/bioinformatics\/btm134","article-title":"Sparse non-negative matrix factorizations via alternating non-negativity-constrained least squares for microarray data analysis","volume":"23","author":"Kim","year":"2007","journal-title":"Bioinformatics"},{"key":"2023063011074042800_btaa201-B22","doi-asserted-by":"crossref","first-page":"483","DOI":"10.1038\/nmeth.4236","article-title":"SC3: consensus clustering of single-cell RNA-seq data","volume":"14","author":"Kiselev","year":"2017","journal-title":"Nat. Methods"},{"key":"2023063011074042800_btaa201-B23","doi-asserted-by":"crossref","first-page":"1289","DOI":"10.1038\/s41592-019-0619-0","article-title":"Fast, sensitive, and accurate integration of single cell data with Harmony","volume":"16","author":"Korsunsky","year":"2019","journal-title":"Nat. Methods"},{"key":"2023063011074042800_btaa201-B24","first-page":"631","volume-title":"Sampling from Large Graphs","author":"Leskovec","year":"2006"},{"key":"2023063011074042800_btaa201-B25","doi-asserted-by":"crossref","first-page":"9775","DOI":"10.1073\/pnas.1820006116","article-title":"scMerge leverages factor analysis, stable expression, and pseudoreplication to merge multiple single-cell RNA-seq datasets","volume":"116","author":"Lin","year":"2019","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023063011074042800_btaa201-B26","doi-asserted-by":"crossref","first-page":"1053","DOI":"10.1038\/s41592-018-0229-2","article-title":"Deep generative modeling for single-cell transcriptomics","volume":"15","author":"Lopez","year":"2018","journal-title":"Nat. Methods"},{"key":"2023063011074042800_btaa201-B27","doi-asserted-by":"crossref","first-page":"2083","DOI":"10.1016\/j.celrep.2018.10.084","article-title":"The molecular signature of megakaryocyte-erythroid progenitors reveals a role for the cell cycle in fate specification","volume":"25","author":"Lu","year":"2018","journal-title":"Cell Rep"},{"key":"2023063011074042800_btaa201-B28","first-page":"2579","article-title":"Visualizing data using t-SNE","volume":"9","author":"Maaten","year":"2008","journal-title":"J. Mach. Learn. Res"},{"key":"2023063011074042800_btaa201-B29","doi-asserted-by":"crossref","first-page":"36","DOI":"10.1016\/j.ydbio.2017.11.006","article-title":"Cross-platform single cell analysis of kidney development shows stromal cells express Gdnf","volume":"434","author":"Magella","year":"2018","journal-title":"Dev. Biol"},{"key":"2023063011074042800_btaa201-B30","article-title":"UMAP: uniform manifold approximation and projection for dimension reduction","author":"McInnes","year":"2018"},{"key":"2023063011074042800_btaa201-B31","doi-asserted-by":"crossref","first-page":"43","DOI":"10.1186\/s12859-015-0485-4","article-title":"NMF-mGPU: non-negative matrix factorization on multi-GPU systems","volume":"16","author":"Mej\u00eda-Roa","year":"2015","journal-title":"BMC Bioinformatics"},{"key":"2023063011074042800_btaa201-B32","doi-asserted-by":"crossref","first-page":"698","DOI":"10.1038\/nature19348","article-title":"Single-cell analysis of mixed-lineage states leading to a binary cell fate choice","volume":"537","author":"Olsson","year":"2016","journal-title":"Nature"},{"key":"2023063011074042800_btaa201-B33","doi-asserted-by":"crossref","first-page":"2846","DOI":"10.1182\/blood-2005-10-4077","article-title":"Toward the identification of a tolerogenic signature in IDO-competent dendritic cells","volume":"107","author":"Orabona","year":"2006","journal-title":"Blood"},{"key":"2023063011074042800_btaa201-B34","volume-title":"The PageRank Citation Ranking: Bringing Order to the Web","author":"Page","year":"1999"},{"key":"2023063011074042800_btaa201-B35","doi-asserted-by":"crossref","first-page":"1053","DOI":"10.1038\/nbt.2967","article-title":"Low-coverage single-cell mRNA sequencing reveals cellular heterogeneity and activated signaling pathways in developing cerebral cortex","volume":"32","author":"Pollen","year":"2014","journal-title":"Nat. Biotechnol"},{"key":"2023063011074042800_btaa201-B36","doi-asserted-by":"crossref","first-page":"365","DOI":"10.1038\/s41586-019-1652-y","article-title":"Decoding human fetal liver haematopoiesis","volume":"574","author":"Popescu","year":"2019","journal-title":"Nature"},{"key":"2023063011074042800_btaa201-B37","doi-asserted-by":"crossref","first-page":"367","DOI":"10.1038\/s41586-018-0590-4","article-title":"Single-cell transcriptomics of 20 mouse organs creates a Tabula Muris","volume":"562","year":"2018","journal-title":"Nature"},{"key":"2023063011074042800_btaa201-B38","doi-asserted-by":"crossref","first-page":"371","DOI":"10.1038\/nature13173","article-title":"Reconstructing lineage hierarchies of the distal lung epithelium using single-cell RNA-seq","volume":"509","author":"Treutlein","year":"2014","journal-title":"Nature"},{"key":"2023063011074042800_btaa201-B39","doi-asserted-by":"crossref","first-page":"145","DOI":"10.1038\/nn.3881","article-title":"Unbiased classification of sensory neuron types by large-scale single-cell RNA sequencing","volume":"18","author":"Usoskin","year":"2015","journal-title":"Nat. Neurosci"},{"key":"2023063011074042800_btaa201-B40","doi-asserted-by":"crossref","first-page":"275","DOI":"10.1016\/S0378-3758(02)00388-9","article-title":"A new algorithm for hybrid hierarchical clustering with visualization and the bootstrap","volume":"117","author":"van der Laan","year":"2003","journal-title":"J. Stat. Plann. Inference"},{"key":"2023063011074042800_btaa201-B41","doi-asserted-by":"crossref","first-page":"1873","DOI":"10.1016\/j.cell.2019.05.006","article-title":"Single-cell multi-omic integration compares and contrasts features of brain cell identity","volume":"177","author":"Welch","year":"2019","journal-title":"Cell"},{"key":"2023063011074042800_btaa201-B42","doi-asserted-by":"crossref","first-page":"15","DOI":"10.1186\/s13059-017-1382-0","article-title":"SCANPY: large-scale single-cell gene expression data analysis","volume":"19","author":"Wolf","year":"2018","journal-title":"Genome Biol"},{"key":"2023063011074042800_btaa201-B43","doi-asserted-by":"crossref","first-page":"1974","DOI":"10.1093\/bioinformatics\/btv088","article-title":"Identification of cell types from single-cell transcriptomes using a novel clustering method","volume":"31","author":"Xu","year":"2015","journal-title":"Bioinformatics"},{"key":"2023063011074042800_btaa201-B44","doi-asserted-by":"crossref","first-page":"40437","DOI":"10.18632\/oncotarget.9624","article-title":"A novel peptide targeting Clec9a on dendritic cell for cancer immunotherapy","volume":"7","author":"Yan","year":"2016","journal-title":"Oncotarget"},{"key":"2023063011074042800_btaa201-B45","doi-asserted-by":"crossref","first-page":"890","DOI":"10.1016\/j.immuni.2017.10.021","article-title":"Granulocyte-monocyte progenitors and monocyte-dendritic cell progenitors independently produce functionally distinct monocytes","volume":"47","author":"Y\u00e1\u00f1ez","year":"2017","journal-title":"Immunity"},{"key":"2023063011074042800_btaa201-B46","doi-asserted-by":"crossref","first-page":"2209","DOI":"10.1093\/bioinformatics\/bts366","article-title":"GO-Elite: a flexible solution for pathway and ontology over-representation","volume":"28","author":"Zambon","year":"2012","journal-title":"Bioinformatics"},{"key":"2023063011074042800_btaa201-B47","doi-asserted-by":"crossref","first-page":"1138","DOI":"10.1126\/science.aaa1934","article-title":"Brain structure. Cell types in the mouse cortex and hippocampus revealed by single-cell RNA-seq","volume":"347","author":"Zeisel","year":"2015","journal-title":"Science"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/academic.oup.com\/bioinformatics\/advance-article-pdf\/doi\/10.1093\/bioinformatics\/btaa201\/33164579\/btaa201.pdf","content-type":"application\/pdf","content-version":"am","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/12\/3773\/50749919\/bioinformatics_36_12_3773.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/36\/12\/3773\/50749919\/bioinformatics_36_12_3773.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,6,30]],"date-time":"2023-06-30T07:09:09Z","timestamp":1688108949000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/36\/12\/3773\/5811229"}},"subtitle":[],"editor":[{"given":"Yann","family":"Ponty","sequence":"additional","affiliation":[],"role":[{"role":"editor","vocabulary":"crossref"}]}],"short-title":[],"issued":{"date-parts":[[2020,3,24]]},"references-count":47,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2020,6,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/btaa201","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/608869","asserted-by":"object"}]},"ISSN":["1367-4803","1367-4811"],"issn-type":[{"value":"1367-4803","type":"print"},{"value":"1367-4811","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2020,6,15]]},"published":{"date-parts":[[2020,3,24]]}}}