{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,13]],"date-time":"2026-04-13T19:53:50Z","timestamp":1776110030540,"version":"3.50.1"},"reference-count":23,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2020,4,25]],"date-time":"2020-04-25T00:00:00Z","timestamp":1587772800000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"},{"start":{"date-parts":[[2020,4,25]],"date-time":"2020-04-25T00:00:00Z","timestamp":1587772800000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100000052","name":"NIH Office of the Director","doi-asserted-by":"publisher","award":["UG3HL145609"],"award-info":[{"award-number":["UG3HL145609"]}],"id":[{"id":"10.13039\/100000052","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["BMC Bioinformatics"],"published-print":{"date-parts":[[2020,12]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:sec>\n                    <jats:title>Background<\/jats:title>\n                    <jats:p>With the rapid development of single-cell RNA sequencing technology, it is possible to dissect cell-type composition at high resolution. A number of methods have been developed with the purpose to identify rare cell types. However, existing methods are still not scalable to large datasets, limiting their utility. To overcome this limitation, we present a new software package, called GiniClust3, which is an extension of GiniClust2 and significantly faster and memory-efficient than previous versions.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Results<\/jats:title>\n                    <jats:p>Using GiniClust3, it only takes about 7\u2009h to identify both common and rare cell clusters from a dataset that contains more than one million cells. Cell type mapping and perturbation analyses show that GiniClust3 could robustly identify cell clusters.<\/jats:p>\n                  <\/jats:sec>\n                  <jats:sec>\n                    <jats:title>Conclusions<\/jats:title>\n                    <jats:p>\n                      Taken together, these results suggest that GiniClust3 is a powerful tool to identify both common and rare cell population and can handle large dataset. GiniCluster3 is implemented in the open-source python package and available at\n                      <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" ext-link-type=\"uri\" xlink:href=\"https:\/\/github.com\/rdong08\/GiniClust3\">https:\/\/github.com\/rdong08\/GiniClust3<\/jats:ext-link>\n                      .\n                    <\/jats:p>\n                  <\/jats:sec>","DOI":"10.1186\/s12859-020-3482-1","type":"journal-article","created":{"date-parts":[[2020,4,25]],"date-time":"2020-04-25T10:02:48Z","timestamp":1587808968000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":48,"title":["GiniClust3: a fast and memory-efficient tool for rare cell type identification"],"prefix":"10.1186","volume":"21","author":[{"given":"Rui","family":"Dong","sequence":"first","affiliation":[]},{"given":"Guo-Cheng","family":"Yuan","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2020,4,25]]},"reference":[{"issue":"3","key":"3482_CR1","doi-asserted-by":"publisher","first-page":"133","DOI":"10.1038\/nrg3833","volume":"16","author":"O Stegle","year":"2015","unstructured":"Stegle O, Teichmann SA, Marioni JC. Computational and analytical challenges in single-cell transcriptomics. Nat Rev Genet. 2015;16(3):133\u201345.","journal-title":"Nat Rev Genet"},{"issue":"4","key":"3482_CR2","doi-asserted-by":"publisher","first-page":"610","DOI":"10.1016\/j.molcel.2015.04.005","volume":"58","author":"AA Kolodziejczyk","year":"2015","unstructured":"Kolodziejczyk AA, Kim JK, Svensson V, Marioni JC, Teichmann SA. The technology and biology of single-cell RNA sequencing. Mol Cell. 2015;58(4):610\u201320.","journal-title":"Mol Cell"},{"issue":"1","key":"3482_CR3","doi-asserted-by":"publisher","first-page":"84","DOI":"10.1186\/s13059-017-1218-y","volume":"18","author":"GC Yuan","year":"2017","unstructured":"Yuan GC, Cai L, Elowitz M, Enver T, Fan G, Guo G, Irizarry R, Kharchenko P, Kim J, Orkin S, et al. Challenges and emerging directions in single-cell analysis. Genome Biol. 2017;18(1):84.","journal-title":"Genome Biol"},{"issue":"1","key":"3482_CR4","doi-asserted-by":"publisher","first-page":"35","DOI":"10.1038\/nri.2017.76","volume":"18","author":"E Papalexi","year":"2018","unstructured":"Papalexi E, Satija R. Single-cell RNA sequencing to explore immune cell heterogeneity. Nat Rev Immunol. 2018;18(1):35\u201345.","journal-title":"Nat Rev Immunol"},{"key":"3482_CR5","doi-asserted-by":"publisher","first-page":"14049","DOI":"10.1038\/ncomms14049","volume":"8","author":"GX Zheng","year":"2017","unstructured":"Zheng GX, Terry JM, Belgrader P, Ryvkin P, Bent ZW, Wilson R, Ziraldo SB, Wheeler TD, McDermott GP, Zhu J, et al. Massively parallel digital transcriptional profiling of single cells. Nat Commun. 2017;8:14049.","journal-title":"Nat Commun"},{"issue":"5","key":"3482_CR6","doi-asserted-by":"publisher","first-page":"411","DOI":"10.1038\/nbt.4096","volume":"36","author":"A Butler","year":"2018","unstructured":"Butler A, Hoffman P, Smibert P, Papalexi E, Satija R. Integrating single-cell transcriptomic data across different conditions, technologies, and species. Nat Biotechnol. 2018;36(5):411\u201320.","journal-title":"Nat Biotechnol"},{"issue":"1","key":"3482_CR7","doi-asserted-by":"publisher","first-page":"15","DOI":"10.1186\/s13059-017-1382-0","volume":"19","author":"FA Wolf","year":"2018","unstructured":"Wolf FA, Angerer P, Theis FJ. SCANPY: large-scale single-cell gene expression data analysis. Genome Biol. 2018;19(1):15.","journal-title":"Genome Biol"},{"issue":"1","key":"3482_CR8","doi-asserted-by":"publisher","first-page":"4719","DOI":"10.1038\/s41467-018-07234-6","volume":"9","author":"A Jindal","year":"2018","unstructured":"Jindal A, Gupta P, Jayadeva, Sengupta D. Discovery of rare cells from voluminous single cell expression data. Nat Commun. 2018;9(1):4719.","journal-title":"Nat Commun"},{"issue":"7568","key":"3482_CR9","doi-asserted-by":"publisher","first-page":"251","DOI":"10.1038\/nature14966","volume":"525","author":"D Grun","year":"2015","unstructured":"Grun D, Lyubimova A, Kester L, Wiebrands K, Basak O, Sasaki N, Clevers H, van Oudenaarden A. Single-cell messenger RNA sequencing reveals rare intestinal cell types. Nature. 2015;525(7568):251\u20135.","journal-title":"Nature"},{"issue":"2","key":"3482_CR10","doi-asserted-by":"publisher","first-page":"266","DOI":"10.1016\/j.stem.2016.05.010","volume":"19","author":"D Grun","year":"2016","unstructured":"Grun D, Muraro MJ, Boisset JC, Wiebrands K, Lyubimova A, Dharmadhikari G, van den Born M, van Es J, Jansen E, Clevers H, et al. De novo prediction of stem cell identity using single-cell Transcriptome data. Cell Stem Cell. 2016;19(2):266\u201377.","journal-title":"Cell Stem Cell"},{"issue":"1","key":"3482_CR11","doi-asserted-by":"publisher","first-page":"144","DOI":"10.1186\/s13059-016-1010-4","volume":"17","author":"L Jiang","year":"2016","unstructured":"Jiang L, Chen H, Pinello L, Yuan GC. GiniClust: detecting rare cell types from single-cell gene expression data with Gini index. Genome Biol. 2016;17(1):144.","journal-title":"Genome Biol"},{"issue":"1","key":"3482_CR12","doi-asserted-by":"publisher","first-page":"58","DOI":"10.1186\/s13059-018-1431-3","volume":"19","author":"D Tsoucas","year":"2018","unstructured":"Tsoucas D, Yuan GC. GiniClust2: a cluster-aware, weighted ensemble clustering method for cell-type detection. Genome Biol. 2018;19(1):58.","journal-title":"Genome Biol"},{"issue":"6","key":"3482_CR13","doi-asserted-by":"publisher","first-page":"483","DOI":"10.1016\/j.cels.2019.05.003","volume":"8","author":"B Hie","year":"2019","unstructured":"Hie B, Cho H, DeMeo B, Bryson B, Berger B. Geometric sketching compactly summarizes the single-cell Transcriptomic landscape. Cell Syst. 2019;8(6):483\u201393 e487.","journal-title":"Cell Syst"},{"issue":"5","key":"3482_CR14","doi-asserted-by":"publisher","first-page":"1307","DOI":"10.1016\/j.cell.2018.05.012","volume":"173","author":"X Han","year":"2018","unstructured":"Han X, Wang R, Zhou Y, Fei L, Sun H, Lai S, Saadatpour A, Zhou Z, Chen H, Ye F, et al. Mapping the mouse cell atlas by microwell-Seq. Cell. 2018;173(5):1307.","journal-title":"Cell"},{"issue":"4","key":"3482_CR15","doi-asserted-by":"publisher","first-page":"999","DOI":"10.1016\/j.cell.2018.06.021","volume":"174","author":"A Zeisel","year":"2018","unstructured":"Zeisel A, Hochgerner H, Lonnerberg P, Johnsson A, Memic F, van der Zwan J, Haring M, Braun E, Borm LE, La Manno G, et al. Molecular architecture of the mouse nervous system. Cell. 2018;174(4):999\u20131014 e1022.","journal-title":"Cell"},{"issue":"7677","key":"3482_CR16","doi-asserted-by":"publisher","first-page":"451","DOI":"10.1038\/550451a","volume":"550","author":"O Rozenblatt-Rosen","year":"2017","unstructured":"Rozenblatt-Rosen O, Stubbington MJT, Regev A, Teichmann SA. The human cell atlas: from vision to reality. Nature. 2017;550(7677):451\u20133.","journal-title":"Nature"},{"issue":"1","key":"3482_CR17","doi-asserted-by":"publisher","first-page":"5233","DOI":"10.1038\/s41598-019-41695-z","volume":"9","author":"VA Traag","year":"2019","unstructured":"Traag VA, Waltman L, van Eck NJ. From Louvain to Leiden: guaranteeing well-connected communities. Sci Rep. 2019;9(1):5233.","journal-title":"Sci Rep"},{"issue":"10","key":"3482_CR18","doi-asserted-by":"publisher","first-page":"P10008","DOI":"10.1088\/1742-5468\/2008\/10\/P10008","volume":"2008","author":"VD Blondel","year":"2008","unstructured":"Blondel VD, Guillaume J-L, Lambiotte R, Lefebvre E. Fast unfolding of communities in large networks. J Stat Mech. 2008;2008(10):P10008.","journal-title":"J Stat Mech"},{"issue":"4","key":"3482_CR19","doi-asserted-by":"publisher","first-page":"281","DOI":"10.1016\/j.cels.2018.11.005","volume":"8","author":"SL Wolock","year":"2019","unstructured":"Wolock SL, Lopez R, Klein AM. Scrublet: computational identification of cell doublets in single-cell Transcriptomic data. Cell Syst. 2019;8(4):281\u201391 e289.","journal-title":"Cell Syst"},{"key":"3482_CR20","first-page":"91","volume":"2019","author":"H Sun","year":"1935","unstructured":"Sun H, Zhou Y, Fei L, Chen H, Guo G. scMCA: a tool to define mouse cell types based on single-cell digital expression. Methods Mol Biol. 1935;2019:91\u20136.","journal-title":"Methods Mol Biol"},{"issue":"4S Suppl","key":"3482_CR21","doi-asserted-by":"publisher","first-page":"1007S","DOI":"10.1093\/jn\/130.4.1007S","volume":"130","author":"BS Meldrum","year":"2000","unstructured":"Meldrum BS. Glutamate as a neurotransmitter in the brain: review of physiology and pathology. J Nutr. 2000;130(4S Suppl):1007S\u201315S.","journal-title":"J Nutr"},{"issue":"8","key":"3482_CR22","doi-asserted-by":"publisher","first-page":"799","DOI":"10.1007\/s00702-014-1180-8","volume":"121","author":"Y Zhou","year":"2014","unstructured":"Zhou Y, Danbolt NC. Glutamate as a neurotransmitter in the healthy brain. J Neural Transm (Vienna). 2014;121(8):799\u2013817.","journal-title":"J Neural Transm (Vienna)"},{"issue":"4","key":"3482_CR23","doi-asserted-by":"publisher","first-page":"599","DOI":"10.1038\/nprot.2017.149","volume":"13","author":"V Svensson","year":"2018","unstructured":"Svensson V, Vento-Tormo R, Teichmann SA. Exponential scaling of single-cell RNA-seq in the past decade. Nat Protoc. 2018;13(4):599\u2013604.","journal-title":"Nat Protoc"}],"container-title":["BMC Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-020-3482-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s12859-020-3482-1\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s12859-020-3482-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,4,24]],"date-time":"2021-04-24T19:06:33Z","timestamp":1619291193000},"score":1,"resource":{"primary":{"URL":"https:\/\/bmcbioinformatics.biomedcentral.com\/articles\/10.1186\/s12859-020-3482-1"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,4,25]]},"references-count":23,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2020,12]]}},"alternative-id":["3482"],"URL":"https:\/\/doi.org\/10.1186\/s12859-020-3482-1","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/788554","asserted-by":"object"}]},"ISSN":["1471-2105"],"issn-type":[{"value":"1471-2105","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,4,25]]},"assertion":[{"value":"6 November 2019","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"1 April 2020","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"25 April 2020","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"Not applicable.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethics approval and consent to participate"}},{"value":"Not applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Consent for publication"}},{"value":"The authors declare that they have no competing interests.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"158"}}