{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,20]],"date-time":"2025-09-20T20:10:41Z","timestamp":1758399041700},"reference-count":45,"publisher":"World Scientific Pub Co Pte Ltd","issue":"07","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int. J. Patt. Recogn. Artif. Intell."],"published-print":{"date-parts":[[2012,11]]},"abstract":"<jats:p>K-means is undoubtedly the most widely used partitional clustering algorithm. Unfortunately, due to its gradient descent nature, this algorithm is highly sensitive to the initial placement of the cluster centers. Numerous initialization methods have been proposed to address this problem. Many of these methods, however, have superlinear complexity in the number of data points, making them impractical for large data sets. On the other hand, linear methods are often random and\/or order-sensitive, which renders their results unrepeatable. Recently, Su and Dy proposed two highly successful hierarchical initialization methods named Var-Part and PCA-Part that are not only linear, but also deterministic (nonrandom) and order-invariant. In this paper, we propose a discriminant analysis based approach that addresses a common deficiency of these two methods. Experiments on a large and diverse collection of data sets from the UCI machine learning repository demonstrate that Var-Part and PCA-Part are highly competitive with one of the best random initialization methods to date, i.e. k-means++, and that the proposed approach significantly improves the performance of both hierarchical methods.<\/jats:p>","DOI":"10.1142\/s0218001412500188","type":"journal-article","created":{"date-parts":[[2012,11,8]],"date-time":"2012-11-08T01:32:16Z","timestamp":1352338336000},"page":"1250018","source":"Crossref","is-referenced-by-count":46,"title":["DETERMINISTIC INITIALIZATION OF THE K-MEANS ALGORITHM USING HIERARCHICAL CLUSTERING"],"prefix":"10.1142","volume":"26","author":[{"given":"M. EMRE","family":"CELEBI","sequence":"first","affiliation":[{"name":"Department of Computer Science, Louisiana State University, Shreveport, LA, USA"}]},{"given":"HASSAN A.","family":"KINGRAVI","sequence":"additional","affiliation":[{"name":"School of Electrical and Computer Engineering, Georgia Institute of Technology, Atlanta, GA, USA"}]}],"member":"219","published-online":{"date-parts":[[2013,2,17]]},"reference":[{"key":"rf1","doi-asserted-by":"publisher","DOI":"10.1145\/331499.331504"},{"key":"rf2","doi-asserted-by":"publisher","DOI":"10.1016\/j.patrec.2009.09.011"},{"key":"rf3","doi-asserted-by":"publisher","DOI":"10.1002\/9780470316801"},{"key":"rf4","doi-asserted-by":"publisher","DOI":"10.1007\/s10994-009-5103-0"},{"key":"rf5","doi-asserted-by":"publisher","DOI":"10.1016\/j.tcs.2010.05.034"},{"key":"rf6","doi-asserted-by":"publisher","DOI":"10.1016\/S0031-3203(03)00190-0"},{"key":"rf7","doi-asserted-by":"publisher","DOI":"10.1109\/TIT.1982.1056489"},{"key":"rf10","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2010.88"},{"key":"rf11","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2002.1017616"},{"key":"rf13","doi-asserted-by":"publisher","DOI":"10.1109\/TVLSI.2009.2017543"},{"key":"rf14","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2004.25"},{"key":"rf15","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.1984.4767478"},{"key":"rf16","unstructured":"L.\u00a0Bottou and Y.\u00a0Bengio, Advances in Neural Information Processing Systems 7, eds. G.\u00a0Tesauro, D. S.\u00a0Touretzky and T. K.\u00a0Leen (MIT Press, 1995)\u00a0pp. 585\u2013592."},{"key":"rf17","doi-asserted-by":"crossref","first-page":"209","DOI":"10.1002\/sam.10080","volume":"3","author":"Vendramin L.","year":"2010","journal-title":"Stat. Anal. Data Mining"},{"key":"rf18","doi-asserted-by":"publisher","DOI":"10.1109\/72.478389"},{"key":"rf19","doi-asserted-by":"publisher","DOI":"10.1109\/TSMCB.2003.816993"},{"key":"rf20","doi-asserted-by":"publisher","DOI":"10.1016\/j.eswa.2012.07.021"},{"key":"rf21","doi-asserted-by":"publisher","DOI":"10.1016\/j.imavis.2010.10.002"},{"key":"rf22","doi-asserted-by":"publisher","DOI":"10.1016\/S0167-8655(99)00069-0"},{"key":"rf24","doi-asserted-by":"publisher","DOI":"10.1093\/comjnl\/10.3.271"},{"key":"rf26","doi-asserted-by":"publisher","DOI":"10.2307\/2346830"},{"key":"rf27","doi-asserted-by":"publisher","DOI":"10.1016\/S0031-3203(02)00060-2"},{"key":"rf29","doi-asserted-by":"publisher","DOI":"10.1016\/j.patrec.2007.01.001"},{"key":"rf30","doi-asserted-by":"publisher","DOI":"10.1016\/j.patrec.2009.04.013"},{"key":"rf31","doi-asserted-by":"publisher","DOI":"10.1016\/j.camwa.2009.04.017"},{"key":"rf33","first-page":"768","volume":"21","author":"Forgy E.","year":"1965","journal-title":"Biometrics"},{"key":"rf34","doi-asserted-by":"publisher","DOI":"10.1071\/BT9660127"},{"key":"rf36","doi-asserted-by":"publisher","DOI":"10.1002\/bs.3830120210"},{"key":"rf37","volume-title":"Pattern Recognition Principles","author":"Tou J. T.","year":"1974"},{"key":"rf38","doi-asserted-by":"publisher","DOI":"10.1016\/S0377-2217(77)81005-9"},{"key":"rf40","doi-asserted-by":"crossref","first-page":"319","DOI":"10.3233\/IDA-2007-11402","volume":"11","author":"Su T.","year":"2007","journal-title":"Intell. Data Anal."},{"key":"rf41","volume-title":"Cluster Analysis for Applications","author":"Anderberg M. R.","year":"1973"},{"key":"rf42","volume-title":"IBM SPSS Statistics 19 Statistical Procedures Companion","author":"Noru\u0161is M. J.","year":"2011"},{"key":"rf43","doi-asserted-by":"publisher","DOI":"10.1016\/0304-3975(85)90224-5"},{"key":"rf44","doi-asserted-by":"publisher","DOI":"10.1007\/BF02287921"},{"key":"rf45","doi-asserted-by":"publisher","DOI":"10.1016\/S0020-0255(70)80056-1"},{"key":"rf46","doi-asserted-by":"publisher","DOI":"10.1117\/1.1631315"},{"key":"rf47","doi-asserted-by":"publisher","DOI":"10.1109\/TSMC.1979.4310076"},{"key":"rf48","doi-asserted-by":"publisher","DOI":"10.1016\/0734-189X(88)90022-9"},{"key":"rf49","doi-asserted-by":"publisher","DOI":"10.1016\/0734-189X(90)90053-X"},{"key":"rf50","doi-asserted-by":"publisher","DOI":"10.1109\/34.368197"},{"key":"rf51","doi-asserted-by":"publisher","DOI":"10.1016\/j.patrec.2004.11.024"},{"key":"rf54","first-page":"51","volume":"4","author":"Onoda T.","year":"2012","journal-title":"J. Emerging Technol. Web Intell."},{"key":"rf55","doi-asserted-by":"publisher","DOI":"10.1007\/BF01897163"},{"key":"rf56","doi-asserted-by":"publisher","DOI":"10.1145\/272991.272995"}],"container-title":["International Journal of Pattern Recognition and Artificial Intelligence"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0218001412500188","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,1,31]],"date-time":"2022-01-31T00:49:34Z","timestamp":1643590174000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/abs\/10.1142\/S0218001412500188"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,11]]},"references-count":45,"journal-issue":{"issue":"07","published-online":{"date-parts":[[2013,2,17]]},"published-print":{"date-parts":[[2012,11]]}},"alternative-id":["10.1142\/S0218001412500188"],"URL":"https:\/\/doi.org\/10.1142\/s0218001412500188","relation":{},"ISSN":["0218-0014","1793-6381"],"issn-type":[{"value":"0218-0014","type":"print"},{"value":"1793-6381","type":"electronic"}],"subject":[],"published":{"date-parts":[[2012,11]]}}}