{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,31]],"date-time":"2026-03-31T23:13:42Z","timestamp":1774998822244,"version":"3.50.1"},"reference-count":24,"publisher":"Oxford University Press (OUP)","issue":"15","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2012,8,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n               <jats:p>Motivation: For flow cytometry data, there are two common approaches to the unsupervised clustering problem: one is based on the finite mixture model and the other on spatial exploration of the histograms. The former is computationally slow and has difficulty to identify clusters of irregular shapes. The latter approach cannot be applied directly to high-dimensional data as the computational time and memory become unmanageable and the estimated histogram is unreliable. An algorithm without these two problems would be very useful.<\/jats:p>\n               <jats:p>Results: In this article, we combine ideas from the finite mixture model and histogram spatial exploration. This new algorithm, which we call flowPeaks, can be applied directly to high-dimensional data and identify irregular shape clusters. The algorithm first uses K-means algorithm with a large K to partition the cell population into many small clusters. These partitioned data allow the generation of a smoothed density function using the finite mixture model. All local peaks are exhaustively searched by exploring the density function and the cells are clustered by the associated local peak. The algorithm flowPeaks is automatic, fast and reliable and robust to cluster shape and outliers. This algorithm has been applied to flow cytometry data and it has been compared with state of the art algorithms, including Misty Mountain, FLOCK, flowMeans, flowMerge and FLAME.<\/jats:p>\n               <jats:p>Availability: The R package flowPeaks is available at https:\/\/github.com\/yongchao\/flowPeaks.<\/jats:p>\n               <jats:p>Contact: \u00a0yongchao.ge@mssm.edu<\/jats:p>\n               <jats:p>Supplementary information: \u00a0Supplementary data are available at Bioinformatics online<\/jats:p>","DOI":"10.1093\/bioinformatics\/bts300","type":"journal-article","created":{"date-parts":[[2012,5,18]],"date-time":"2012-05-18T01:07:57Z","timestamp":1337303277000},"page":"2052-2058","source":"Crossref","is-referenced-by-count":130,"title":["flowPeaks: a fast unsupervised clustering for flow cytometry data via <i>K<\/i>-means and density peak finding"],"prefix":"10.1093","volume":"28","author":[{"given":"Yongchao","family":"Ge","sequence":"first","affiliation":[{"name":"Department of Neurology and Center of Translational System Biology, Mount Sinai School of Medicine, New York, NY 10029, USA&rcub;"}]},{"given":"Stuart C.","family":"Sealfon","sequence":"additional","affiliation":[{"name":"Department of Neurology and Center of Translational System Biology, Mount Sinai School of Medicine, New York, NY 10029, USA&rcub;"}]}],"member":"286","published-online":{"date-parts":[[2012,5,17]]},"reference":[{"key":"2023012512452510600_B1","doi-asserted-by":"crossref","first-page":"6","DOI":"10.1002\/cyto.a.21007","article-title":"Rapid cell population identification in flow cytometry data","volume":"79","author":"Aghaeepour","year":"2011","journal-title":"Cytometry A"},{"key":"2023012512452510600_B2","first-page":"1027","article-title":"k-means++: the advantages of careful seeding","volume-title":"Proceedings of the Eighteenth Annual ACM\u2212SIAM Symposium on Discrete Algorithms","author":"Arthur","year":"2007"},{"key":"2023012512452510600_B3","doi-asserted-by":"crossref","first-page":"693","DOI":"10.1002\/cyto.a.20583","article-title":"Statistical mixture modeling for cell subtype identification in flow cytometry","volume":"73","author":"Chan","year":"2008","journal-title":"Cytometry A"},{"key":"2023012512452510600_B4","doi-asserted-by":"crossref","first-page":"247646","DOI":"10.1155\/2009\/247646","article-title":"Merging mixture components for cell population identification in flow cytometry","volume":"2009","author":"Finak","year":"2009","journal-title":"Adv. Bioinformatics"},{"key":"2023012512452510600_B5","doi-asserted-by":"crossref","first-page":"453","DOI":"10.1007\/BF01025868","article-title":"On the histogram as a density estimator: L2 theory","volume":"57","author":"Freedman","year":"1981","journal-title":"Zeitschrift fur Wahrscheinlichkeitstheorie und verwandte Gebiete"},{"key":"2023012512452510600_B6","first-page":"59","article-title":"Hierarchical document clustering using frequent itemsets","volume-title":"Proceedings of the Third SIAM International Conference on Data Mining (SDM)","author":"Fung","year":"2003"},{"key":"2023012512452510600_B7","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1016\/j.jim.2004.06.003","article-title":"Identification of compounds that enhance the anti-lymphoma activity of rituximab using flow cytometric high-content screening","volume":"292","author":"Gasparetto","year":"2004","journal-title":"J. Immunol. Methods"},{"key":"2023012512452510600_B8","doi-asserted-by":"crossref","first-page":"R80","DOI":"10.1186\/gb-2004-5-10-r80","article-title":"Bioconductor: open software development for computational biology and bioinformatics","volume":"5","author":"Gentleman","year":"2004","journal-title":"Genome Biol."},{"key":"2023012512452510600_B9","doi-asserted-by":"crossref","first-page":"100","DOI":"10.2307\/2346830","article-title":"A K-means clustering algorithm","volume":"28","author":"Hartigan","year":"1979","journal-title":"Appl. Stat."},{"key":"2023012512452510600_B10","doi-asserted-by":"crossref","first-page":"193","DOI":"10.1007\/BF01908075","article-title":"Comparing partitions","volume":"2","author":"Hubert","year":"1983","journal-title":"J. Classif."},{"key":"2023012512452510600_B11","first-page":"299","article-title":"R: a language for data analysis and graphics","volume":"5","author":"Ihaka","year":"1996","journal-title":"J. Comput. Graph. Stat."},{"key":"2023012512452510600_B12","doi-asserted-by":"crossref","first-page":"651","DOI":"10.1016\/j.patrec.2009.09.011","article-title":"Data clustering: 50 years beyond K-means","volume":"31","author":"Jain","year":"2010","journal-title":"Pattern Recogn. Lett."},{"key":"2023012512452510600_B13","doi-asserted-by":"crossref","first-page":"881","DOI":"10.1109\/TPAMI.2002.1017616","article-title":"An efficient k-means clustering algorithm: analysis and implementation","volume":"24","author":"Kanungo","year":"2002","journal-title":"IEEE Trans. Pattern Anal."},{"key":"2023012512452510600_B14","doi-asserted-by":"crossref","first-page":"361","DOI":"10.1038\/nmeth872","article-title":"Fluorescent cell barcoding in flow cytometry allows high-throughput drug screening and signaling profiling","volume":"3","author":"Krutzik","year":"2006","journal-title":"Nat. Methods"},{"key":"2023012512452510600_B15","doi-asserted-by":"crossref","first-page":"129","DOI":"10.1109\/TIT.1982.1056489","article-title":"Least squares quantization in PCM","volume":"IT-28","author":"Lloyd","year":"1982","journal-title":"IEEE Trans. Inform. Theory"},{"key":"2023012512452510600_B16","doi-asserted-by":"crossref","first-page":"321","DOI":"10.1002\/cyto.a.20531","article-title":"Automated gating of flow cytometry data via robust model-based clustering","volume":"73","author":"Lo","year":"2008","journal-title":"Cytometry A"},{"key":"2023012512452510600_B17","doi-asserted-by":"crossref","first-page":"145","DOI":"10.1186\/1471-2105-10-145","article-title":"flowClust: a Bioconductor package for automated gating of flow cytometry data","volume":"14","author":"Lo","year":"2009","journal-title":"BMC Bioinformatics"},{"key":"2023012512452510600_B18","doi-asserted-by":"crossref","first-page":"302","DOI":"10.1002\/cyto.990060405","article-title":"Automated identification of subpopulations in flow cytometric list mode data using cluster analysis","volume":"6","author":"Murphy","year":"1985","journal-title":"Cytometry"},{"key":"2023012512452510600_B19","doi-asserted-by":"crossref","first-page":"44","DOI":"10.1186\/1471-2105-11-44","article-title":"The curvHDR method for gating flow cytometry samples","volume":"11","author":"Naumann","year":"2010","journal-title":"BMC Bioinformatics"},{"key":"2023012512452510600_B20","doi-asserted-by":"crossref","first-page":"8519","DOI":"10.1073\/pnas.0903028106","article-title":"Automated high-dimensional flow cytometric data analysis","volume":"106","author":"Pyne","year":"2009","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"2023012512452510600_B21","doi-asserted-by":"crossref","first-page":"S69","DOI":"10.1002\/cyto.b.20554","article-title":"Elucidation of seventeen human peripheral blood B-cell subsets and quantification of the tetanus response using a density-based method for the automated identification of cell populations in multidimensional flow cytometry data","volume":"78","author":"Qian","year":"2010","journal-title":"Cytometry B"},{"key":"2023012512452510600_B22","doi-asserted-by":"crossref","first-page":"846","DOI":"10.1080\/01621459.1971.10482356","article-title":"Objective criteria for the evaluation of clustering methods","volume":"66","author":"Rand","year":"1971","journal-title":"J. Am. Stat. Assoc."},{"key":"2023012512452510600_B23","first-page":"410","article-title":"V-Mmeasure: a conditional entropy-based external cluster evaluation measure","volume-title":"Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning","author":"Rosenberg","year":"2007"},{"key":"2023012512452510600_B24","doi-asserted-by":"crossref","first-page":"502","DOI":"10.1186\/1471-2105-11-502","article-title":"Misty Mountain clustering: application to fast unsupervised flow cytometry gating","volume":"11","author":"Sugar","year":"2010","journal-title":"BMC Bioinformatics"}],"container-title":["Bioinformatics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/28\/15\/2052\/48872746\/bioinformatics_28_15_2052.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article-pdf\/28\/15\/2052\/48872746\/bioinformatics_28_15_2052.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,1,25]],"date-time":"2023-01-25T17:19:39Z","timestamp":1674667179000},"score":1,"resource":{"primary":{"URL":"https:\/\/academic.oup.com\/bioinformatics\/article\/28\/15\/2052\/236255"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,5,17]]},"references-count":24,"journal-issue":{"issue":"15","published-print":{"date-parts":[[2012,8,1]]}},"URL":"https:\/\/doi.org\/10.1093\/bioinformatics\/bts300","relation":{},"ISSN":["1367-4811","1367-4803"],"issn-type":[{"value":"1367-4811","type":"electronic"},{"value":"1367-4803","type":"print"}],"subject":[],"published-other":{"date-parts":[[2012,8,1]]},"published":{"date-parts":[[2012,5,17]]}}}