{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,27]],"date-time":"2026-03-27T17:13:11Z","timestamp":1774631591104,"version":"3.50.1"},"reference-count":97,"publisher":"Association for Computing Machinery (ACM)","issue":"5","license":[{"start":{"date-parts":[[2024,2,27]],"date-time":"2024-02-27T00:00:00Z","timestamp":1708992000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc-sa\/4.0\/"}],"funder":[{"name":"European Union\u2019s funded Projects MobiSpaces","award":["101070279"],"award-info":[{"award-number":["101070279"]}]},{"name":"Green.DAT.AI","award":["101070416"],"award-info":[{"award-number":["101070416"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Knowl. Discov. Data"],"published-print":{"date-parts":[[2024,6,30]]},"abstract":"<jats:p>Automated Machine Learning (AutoML) aims to identify the best-performing machine learning algorithm along with its input parameters for a given dataset and a specific machine learning task. This is a challenging problem, as the process of finding the best model and tuning it for a particular problem at hand is both time-consuming for a data scientist and computationally expensive. In this survey, we focus on unsupervised learning, and we turn our attention on AutoML methods for clustering. We present a systematic review that includes many recent research works for automated clustering. Furthermore, we provide a taxonomy for the classification of existing works, and we perform a qualitative comparison. As a result, this survey provides a comprehensive overview of the field of AutoML for clustering. Moreover, we identify open challenges for future research in this field.<\/jats:p>","DOI":"10.1145\/3643564","type":"journal-article","created":{"date-parts":[[2024,1,26]],"date-time":"2024-01-26T10:03:15Z","timestamp":1706263395000},"page":"1-30","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":11,"title":["A Survey on AutoML Methods and Systems for Clustering"],"prefix":"10.1145","volume":"18","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-9336-5700","authenticated-orcid":false,"given":"Yannis","family":"Poulakis","sequence":"first","affiliation":[{"name":"Department of Digital Systems, University of Piraeus, Piraeus, Greece"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3219-0510","authenticated-orcid":false,"given":"Christos","family":"Doulkeridis","sequence":"additional","affiliation":[{"name":"Department of Digital Systems, University of Piraeus, Piraeus, Greece"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7019-7214","authenticated-orcid":false,"given":"Dimosthenis","family":"Kyriazis","sequence":"additional","affiliation":[{"name":"Department of Digital Systems, University of Piraeus, Piraeus, Greece"}]}],"member":"320","published-online":{"date-parts":[[2024,2,27]]},"reference":[{"issue":"1","key":"e_1_3_1_2_2","doi-asserted-by":"crossref","first-page":"67","DOI":"10.1109\/4235.585893","article-title":"No free lunch theorems for optimization","volume":"1","author":"Wolpert David H.","year":"1997","unstructured":"David H. Wolpert and William G. Macready. 1997. No free lunch theorems for optimization. IEEE Trans. Evol. Comput. 1, 1 (1997), 67\u201382.","journal-title":"IEEE Trans. Evol. Comput."},{"issue":"2","key":"e_1_3_1_3_2","doi-asserted-by":"crossref","first-page":"107","DOI":"10.1023\/A:1012801612483","article-title":"On clustering validation techniques","volume":"17","author":"Halkidi Maria","year":"2001","unstructured":"Maria Halkidi, Yannis Batistakis, and Michalis Vazirgiannis. 2001. On clustering validation techniques. J. Intell. Inf. Syst. 17, 2-3 (2001), 107\u2013145.","journal-title":"J. Intell. Inf. Syst."},{"issue":"9","key":"e_1_3_1_4_2","doi-asserted-by":"crossref","first-page":"1464","DOI":"10.1109\/5.58325","article-title":"The self-organizing map","volume":"78","author":"Kohonen Teuvo","year":"1990","unstructured":"Teuvo Kohonen. 1990. The self-organizing map. Proc. IEEE 78, 9 (1990), 1464\u20131480.","journal-title":"Proc. IEEE"},{"issue":"3","key":"e_1_3_1_5_2","doi-asserted-by":"crossref","first-page":"586","DOI":"10.1109\/72.846731","article-title":"Clustering of the self-organizing map","volume":"11","author":"Vesanto Juha","year":"2000","unstructured":"Juha Vesanto and Esa Alhoniemi. 2000. Clustering of the self-organizing map. IEEE Trans. Neural Netw. Learn. Syst. 11, 3 (2000), 586\u2013600.","journal-title":"IEEE Trans. Neural Netw. Learn. Syst."},{"issue":"1","key":"e_1_3_1_6_2","first-page":"7","article-title":"Automatic recommendation of a distance measure for clustering algorithms","volume":"15","author":"Zhu Xiaoyan","year":"2020","unstructured":"Xiaoyan Zhu, Yingbin Li, Jiayin Wang, Tian Zheng, and Jingwen Fu. 2020. Automatic recommendation of a distance measure for clustering algorithms. ACM Trans. Knowl. Discov. Data 15, 1, Article 7 (Dec.2020), 22 pages.","journal-title":"ACM Trans. Knowl. Discov. Data"},{"issue":"6","key":"e_1_3_1_7_2","doi-asserted-by":"crossref","first-page":"778","DOI":"10.1109\/3477.809032","article-title":"A survey of fuzzy clustering algorithms for pattern recognition. I","volume":"29","author":"Baraldi Andrea","year":"1999","unstructured":"Andrea Baraldi and Palma Blonda. 1999. A survey of fuzzy clustering algorithms for pattern recognition. I. IEEE Trans. Syst. Man Cybern. Part B 29, 6 (1999), 778\u2013785.","journal-title":"IEEE Trans. Syst. Man Cybern. Part B"},{"key":"e_1_3_1_8_2","first-page":"331","volume-title":"6th International Conference of Soft Computing and Pattern Recognition (SoCPaR\u201914)","author":"Ayed Abdelkarim Ben","year":"2014","unstructured":"Abdelkarim Ben Ayed, Mohamed Ben Halima, and Adel M. Alimi. 2014. Survey on clustering methods: Towards fuzzy clustering for big data. In 6th International Conference of Soft Computing and Pattern Recognition (SoCPaR\u201914). IEEE, 331\u2013336."},{"issue":"1","key":"e_1_3_1_9_2","doi-asserted-by":"crossref","first-page":"243","DOI":"10.1016\/j.patcog.2012.07.021","article-title":"An extensive comparative study of cluster validity indices","volume":"46","author":"Arbelaitz Olatz","year":"2013","unstructured":"Olatz Arbelaitz, Ibai Gurrutxaga, Javier Muguerza, Jes\u00fas M. P\u00e9rez, and I\u00f1igo Perona. 2013. An extensive comparative study of cluster validity indices. Pattern Recognit. 46, 1 (2013), 243\u2013256.","journal-title":"Pattern Recognit."},{"key":"e_1_3_1_10_2","doi-asserted-by":"crossref","first-page":"65","DOI":"10.1016\/S0065-2458(08)60520-3","article-title":"The algorithm selection problem","volume":"15","author":"Rice John R.","year":"1976","unstructured":"John R. Rice. 1976. The algorithm selection problem. Adv. Comput. 15 (1976), 65\u2013118.","journal-title":"Adv. Comput."},{"key":"e_1_3_1_11_2","doi-asserted-by":"crossref","first-page":"35","DOI":"10.1007\/978-3-030-05318-5_2","volume-title":"Automated Machine Learning - Methods, Systems, Challenges","author":"Vanschoren Joaquin","year":"2019","unstructured":"Joaquin Vanschoren. 2019. Meta-learning. In Automated Machine Learning - Methods, Systems, Challenges, Frank Hutter, Lars Kotthoff, and Joaquin Vanschoren (Eds.). Springer, 35\u201361."},{"key":"e_1_3_1_12_2","first-page":"2825","article-title":"Scikit-learn: Machine learning in Python","volume":"12","author":"Pedregosa Fabian","year":"2011","unstructured":"Fabian Pedregosa, Ga\u00ebl Varoquaux, Alexandre Gramfort, Vincent Michel, Bertrand Thirion, Olivier Grisel, Mathieu Blondel, Peter Prettenhofer, Ron Weiss, Vincent Dubourg, Jake VanderPlas, Alexandre Passos, David Cournapeau, Matthieu Brucher, Matthieu Perrot, and Edouard Duchesnay. 2011. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res. 12 (2011), 2825\u20132830.","journal-title":"J. Mach. Learn. Res."},{"key":"e_1_3_1_13_2","doi-asserted-by":"crossref","first-page":"847","DOI":"10.1145\/2487575.2487629","volume-title":"19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD\u201913)","author":"Thornton Chris","year":"2013","unstructured":"Chris Thornton, Frank Hutter, Holger H. Hoos, and Kevin Leyton-Brown. 2013. Auto-WEKA: Combined selection and hyperparameter optimization of classification algorithms. In 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD\u201913), Inderjit S. Dhillon, Yehuda Koren, Rayid Ghani, Ted E. Senator, Paul Bradley, Rajesh Parekh, Jingrui He, Robert L. Grossman, and Ramasamy Uthurusamy (Eds.). ACM, 847\u2013855."},{"issue":"1","key":"e_1_3_1_14_2","doi-asserted-by":"crossref","first-page":"148","DOI":"10.1109\/JPROC.2015.2494218","article-title":"Taking the human out of the loop: A review of Bayesian optimization","volume":"104","author":"Shahriari Bobak","year":"2016","unstructured":"Bobak Shahriari, Kevin Swersky, Ziyu Wang, Ryan P. Adams, and Nando de Freitas. 2016. Taking the human out of the loop: A review of Bayesian optimization. Proc. IEEE 104, 1 (2016), 148\u2013175.","journal-title":"Proc. IEEE"},{"key":"e_1_3_1_15_2","article-title":"A tutorial on Bayesian optimization of expensive cost functions, with application to active user modeling and hierarchical reinforcement learning","volume":"1012","author":"Brochu Eric","year":"2010","unstructured":"Eric Brochu, Vlad M. Cora, and Nando de Freitas. 2010. A tutorial on Bayesian optimization of expensive cost functions, with application to active user modeling and hierarchical reinforcement learning. CoRR abs\/1012.2599 (2010).","journal-title":"CoRR"},{"key":"e_1_3_1_16_2","article-title":"Recent advances in Bayesian optimization","volume":"2206","author":"Wang Xilu","year":"2022","unstructured":"Xilu Wang, Yaochu Jin, Sebastian Schmitt, and Markus Olhofer. 2022. Recent advances in Bayesian optimization. CoRR abs\/2206.03301 (2022).","journal-title":"CoRR"},{"key":"e_1_3_1_17_2","volume-title":"32nd Conference on Uncertainty in Artificial Intelligence (UAI\u201916)","author":"McIntire Mitchell","year":"2016","unstructured":"Mitchell McIntire, Daniel Ratner, and Stefano Ermon. 2016. Sparse Gaussian processes for Bayesian optimization. In 32nd Conference on Uncertainty in Artificial Intelligence (UAI\u201916), Alexander Ihler and Dominik Janzing (Eds.). AUAI Press."},{"key":"e_1_3_1_18_2","first-page":"2546","volume-title":"25th Annual Conference on Neural Information Processing Systems","author":"Bergstra James","year":"2011","unstructured":"James Bergstra, R\u00e9mi Bardenet, Yoshua Bengio, and Bal\u00e1zs K\u00e9gl. 2011. Algorithms for hyper-parameter optimization. In 25th Annual Conference on Neural Information Processing Systems, John Shawe-Taylor, Richard S. Zemel, Peter L. Bartlett, Fernando C. N. Pereira, and Kilian Q. Weinberger (Eds.). ACM, 2546\u20132554."},{"key":"e_1_3_1_19_2","first-page":"507","volume-title":"5th International Conference on Learning and Intelligent Optimization (LION\u201911) (Lecture Notes in Computer Science)","volume":"6683","author":"Hutter Frank","year":"2011","unstructured":"Frank Hutter, Holger H. Hoos, and Kevin Leyton-Brown. 2011. Sequential model-based optimization for general algorithm configuration. In 5th International Conference on Learning and Intelligent Optimization (LION\u201911) (Lecture Notes in Computer Science), Carlos A. Coello Coello (Ed.), Vol. 6683. Springer, 507\u2013523."},{"issue":"4","key":"e_1_3_1_20_2","doi-asserted-by":"crossref","first-page":"455","DOI":"10.1023\/A:1008306431147","article-title":"Efficient global optimization of expensive black-box functions","volume":"13","author":"Jones Donald R.","year":"1998","unstructured":"Donald R. Jones, Matthias Schonlau, and William J. Welch. 1998. Efficient global optimization of expensive black-box functions. J. Glob. Optim. 13, 4 (1998), 455\u2013492.","journal-title":"J. Glob. Optim."},{"issue":"3","key":"e_1_3_1_21_2","doi-asserted-by":"crossref","first-page":"507","DOI":"10.1007\/s10898-020-00923-x","article-title":"Expected improvement for expensive optimization: A review","volume":"78","author":"Zhan Dawei","year":"2020","unstructured":"Dawei Zhan and Huanlai Xing. 2020. Expected improvement for expensive optimization: A review. J. Glob. Optim. 78, 3 (2020), 507\u2013544.","journal-title":"J. Glob. Optim."},{"key":"e_1_3_1_22_2","first-page":"1119","volume-title":"IEEE International Conference on Big Data (Big Data\u201921)","author":"Shawi Radwa El","year":"2021","unstructured":"Radwa El Shawi, Hudson Lekunze, and Sherif Sakr. 2021. cSmartML: A meta learning-based framework for automated selection and hyperparameter tuning for clustering. In IEEE International Conference on Big Data (Big Data\u201921), Yixin Chen, Heiko Ludwig, Yicheng Tu, Usama M. Fayyad, Xingquan Zhu, Xiaohua Hu, Suren Byna, Xiong Liu, Jianping Zhang, Shirui Pan, Vagelis Papalexakis, Jianwu Wang, Alfredo Cuzzocrea, and Carlos Ordonez (Eds.). IEEE, 1119\u20131126."},{"key":"e_1_3_1_23_2","doi-asserted-by":"crossref","DOI":"10.1007\/978-0-387-74759-0","volume-title":"Encyclopedia of Optimization, Second Edition","author":"Floudas Christodoulos A.","year":"2009","unstructured":"Christodoulos A. Floudas and Panos M. Pardalos (Eds.). 2009. Encyclopedia of Optimization, Second Edition. Springer."},{"issue":"5","key":"e_1_3_1_24_2","doi-asserted-by":"crossref","first-page":"8091","DOI":"10.1007\/s11042-020-10139-6","article-title":"A review on genetic algorithm: Past, present, and future","volume":"80","author":"Katoch Sourabh","year":"2021","unstructured":"Sourabh Katoch, Sumit Singh Chauhan, and Vijay Kumar. 2021. A review on genetic algorithm: Past, present, and future. Multim. Tools Appl. 80, 5 (2021), 8091\u20138126.","journal-title":"Multim. Tools Appl."},{"issue":"8","key":"e_1_3_1_25_2","article-title":"Evolutionary large-scale multi-objective optimization: A survey","volume":"54","author":"Tian Ye","year":"2022","unstructured":"Ye Tian, Langchun Si, Xingyi Zhang, Ran Cheng, Cheng He, Kay Chen Tan, and Yaochu Jin. 2022. Evolutionary large-scale multi-objective optimization: A survey. ACM Comput. Surv. 54, 8 (2022), 174:1\u2013174:34.","journal-title":"ACM Comput. Surv."},{"key":"e_1_3_1_26_2","first-page":"437","volume-title":"16th European Conference on Machine Learning (ECML\u201905) (Lecture Notes in Computer Science)","volume":"3720","author":"Vermorel Joann\u00e8s","year":"2005","unstructured":"Joann\u00e8s Vermorel and Mehryar Mohri. 2005. Multi-armed bandit algorithms and empirical evaluation. In 16th European Conference on Machine Learning (ECML\u201905) (Lecture Notes in Computer Science), Jo\u00e3o Gama, Rui Camacho, Pavel Brazdil, Al\u00edpio Jorge, and Lu\u00eds Torgo (Eds.), Vol. 3720. Springer, 437\u2013448."},{"key":"e_1_3_1_27_2","first-page":"240","volume-title":"19th International Conference on Artificial Intelligence and Statistics (AISTATS\u201916) (JMLR Workshop and Conference Proceedings)","volume":"51","author":"Jamieson Kevin G.","year":"2016","unstructured":"Kevin G. Jamieson and Ameet Talwalkar. 2016. Non-stochastic best arm identification and hyperparameter optimization. In 19th International Conference on Artificial Intelligence and Statistics (AISTATS\u201916) (JMLR Workshop and Conference Proceedings), Arthur Gretton and Christian C. Robert (Eds.), Vol. 51. JMLR.org, 240\u2013248."},{"issue":"1","key":"e_1_3_1_28_2","article-title":"Hyperband: A novel bandit-based approach to hyperparameter optimization","volume":"18","author":"Li Lisha","year":"2017","unstructured":"Lisha Li, Kevin G. Jamieson, Giulia DeSalvo, Afshin Rostamizadeh, and Ameet Talwalkar. 2017. Hyperband: A novel bandit-based approach to hyperparameter optimization. J. Mach. Learn. Res. 18, 1 (2017), 185:1\u2013185:52.","journal-title":"J. Mach. Learn. Res."},{"key":"e_1_3_1_29_2","first-page":"743","volume-title":"17th International Conference on Machine Learning (ICML\u201900)","author":"Pfahringer Bernhard","year":"2000","unstructured":"Bernhard Pfahringer, Hilan Bensusan, and Christophe G. Giraud-Carrier. 2000. Meta-learning by landmarking various learning algorithms. In 17th International Conference on Machine Learning (ICML\u201900), Pat Langley (Ed.). Morgan Kaufmann, 743\u2013750."},{"key":"e_1_3_1_30_2","first-page":"3729","volume-title":"International Joint Conference on Neural Networks (IJCNN\u201908)","author":"Souto Marc\u00edlio Carlos Pereira de","year":"2008","unstructured":"Marc\u00edlio Carlos Pereira de Souto, Ricardo Bastos Cavalcante Prud\u00eancio, Rodrigo G. F. Soares, Daniel S. A. de Araujo, Ivan G. Costa, Teresa Bernarda Ludermir, and Alexander Schliep. 2008. Ranking and selecting clustering algorithms using a meta-learning approach. In International Joint Conference on Neural Networks (IJCNN\u201908). IEEE, 3729\u20133735."},{"issue":"10","key":"e_1_3_1_31_2","first-page":"281","article-title":"Random search for hyper-parameter optimization","volume":"13","author":"Bergstra James","year":"2012","unstructured":"James Bergstra and Yoshua Bengio. 2012. Random search for hyper-parameter optimization. J. Mach. Learn. Res. 13, 10 (2012), 281\u2013305.","journal-title":"J. Mach. Learn. Res."},{"key":"e_1_3_1_32_2","doi-asserted-by":"crossref","first-page":"901","DOI":"10.1145\/2851613.2851885","volume-title":"31st Annual ACM Symposium on Applied Computing","author":"Tomasini Caroline","year":"2016","unstructured":"Caroline Tomasini, Leonardo R. Emmendorfer, Eduardo Nunes Borges, and Karina S. Machado. 2016. A methodology for selecting the most suitable cluster validation internal indices. In 31st Annual ACM Symposium on Applied Computing, Sascha Ossowski (Ed.). ACM, 901\u2013903."},{"issue":"13","key":"e_1_3_1_33_2","doi-asserted-by":"crossref","first-page":"1607","DOI":"10.1093\/bioinformatics\/btm158","article-title":"Weighted rank aggregation of cluster validation measures: A Monte Carlo cross-entropy approach","volume":"23","author":"Pihur Vasyl","year":"2007","unstructured":"Vasyl Pihur, Susmita Datta, and Somnath Datta. 2007. Weighted rank aggregation of cluster validation measures: A Monte Carlo cross-entropy approach. Bioinformatics 23, 13 (2007), 1607\u20131615.","journal-title":"Bioinformatics"},{"issue":"3","key":"e_1_3_1_34_2","doi-asserted-by":"crossref","first-page":"487","DOI":"10.1007\/s00357-022-09413-z","article-title":"Understanding the adjusted rand index and other partition comparison indices based on counting object pairs","volume":"39","author":"Warrens Matthijs J.","year":"2022","unstructured":"Matthijs J. Warrens and Hanneke van der Hoef. 2022. Understanding the adjusted rand index and other partition comparison indices based on counting object pairs. J. Classif. 39, 3 (2022), 487\u2013509.","journal-title":"J. Classif."},{"key":"e_1_3_1_35_2","first-page":"986","volume-title":"OTM Confederated International Conferences, (CoopIS, DOA, and ODBASE (\u201903) (Lecture Notes in Computer Science)","volume":"2888","author":"Guo Gongde","year":"2003","unstructured":"Gongde Guo, Hui Wang, David A. Bell, Yaxin Bi, and Kieran Greer. 2003. KNN model-based approach in classification. In OTM Confederated International Conferences, (CoopIS, DOA, and ODBASE (\u201903) (Lecture Notes in Computer Science), Robert Meersman, Zahir Tari, and Douglas C. Schmidt (Eds.), Vol. 2888. Springer, 986\u2013996."},{"key":"e_1_3_1_36_2","doi-asserted-by":"crossref","first-page":"785","DOI":"10.1145\/2939672.2939785","volume-title":"22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD\u201916)","author":"Chen Tianqi","year":"2016","unstructured":"Tianqi Chen and Carlos Guestrin. 2016. XGBoost: A scalable tree boosting system. In 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD\u201916), Balaji Krishnapuram, Mohak Shah, Alexander J. Smola, Charu C. Aggarwal, Dou Shen, and Rajeev Rastogi (Eds.). ACM, 785\u2013794."},{"key":"e_1_3_1_37_2","first-page":"3","volume-title":"International Workshop on Meta-learning and Algorithm Selection co-located with 21st European Conference on Artificial Intelligence (MetaSel@ECAI\u201914) (CEUR Workshop Proceedings)","volume":"1201","author":"Feurer Matthias","year":"2014","unstructured":"Matthias Feurer, Jost Tobias Springenberg, and Frank Hutter. 2014. Using meta-learning to initialize Bayesian optimization of hyperparameters. In International Workshop on Meta-learning and Algorithm Selection co-located with 21st European Conference on Artificial Intelligence (MetaSel@ECAI\u201914) (CEUR Workshop Proceedings), Joaquin Vanschoren, Pavel Brazdil, Carlos Soares, and Lars Kotthoff (Eds.), Vol. 1201. CEUR-WS.org, 3\u201310."},{"key":"e_1_3_1_38_2","first-page":"20","volume-title":"19th International Conference on Artificial Neural Networks (ICANN\u201909) (Lecture Notes in Computer Science)","volume":"5769","author":"Nascimento Andr\u00e9 C. A.","year":"2009","unstructured":"Andr\u00e9 C. A. Nascimento, Ricardo Bastos Cavalcante Prud\u00eancio, Marc\u00edlio Carlos Pereira de Souto, and Ivan G. Costa. 2009. Mining rules for the automatic selection process of clustering methods applied to cancer gene expression data. In 19th International Conference on Artificial Neural Networks (ICANN\u201909) (Lecture Notes in Computer Science), Cesare Alippi, Marios M. Polycarpou, Christos G. Panayiotou, and Georgios Ellinas (Eds.), Vol. 5769. Springer, 20\u201329."},{"key":"e_1_3_1_39_2","doi-asserted-by":"crossref","first-page":"181","DOI":"10.1016\/j.ins.2014.12.044","article-title":"Clustering algorithm selection by meta-learning systems: A new distance-based problem characterization and ranking combination methods","volume":"301","author":"Ferrari Daniel Gomes","year":"2015","unstructured":"Daniel Gomes Ferrari and Leandro Nunes de Castro. 2015. Clustering algorithm selection by meta-learning systems: A new distance-based problem characterization and ranking combination methods. Inf. Sci. 301 (2015), 181\u2013194.","journal-title":"Inf. Sci."},{"issue":"2","key":"e_1_3_1_40_2","doi-asserted-by":"crossref","first-page":"101","DOI":"10.1504\/IJDMB.2016.074682","article-title":"Extending meta-learning framework for clustering gene expression data with component-based algorithm design and internal evaluation measures","volume":"14","author":"Vukicevic Milan","year":"2016","unstructured":"Milan Vukicevic, Sandro Radovanovic, Boris Delibasic, and Milija Suknovic. 2016. Extending meta-learning framework for clustering gene expression data with component-based algorithm design and internal evaluation measures. Int. J. Data Min. Bioinform. 14, 2 (2016), 101\u2013119.","journal-title":"Int. J. Data Min. Bioinform."},{"key":"e_1_3_1_41_2","doi-asserted-by":"crossref","first-page":"203","DOI":"10.1016\/j.ins.2018.10.043","article-title":"A new data characterization for selecting clustering algorithms using meta-learning","volume":"477","author":"Pimentel Bruno Almeida","year":"2019","unstructured":"Bruno Almeida Pimentel and Andr\u00e9 C. P. L. F. de Carvalho. 2019. A new data characterization for selecting clustering algorithms using meta-learning. Inf. Sci. 477 (2019), 203\u2013219.","journal-title":"Inf. Sci."},{"key":"e_1_3_1_42_2","first-page":"1220","volume-title":"20th IEEE International Conference on Data Mining (ICDM\u201920)","author":"Poulakis Yannis","year":"2020","unstructured":"Yannis Poulakis, Christos Doulkeridis, and Dimosthenis Kyriazis. 2020. AutoClust: A framework for automated clustering based on cluster validity indices. In 20th IEEE International Conference on Data Mining (ICDM\u201920), Claudia Plant, Haixun Wang, Alfredo Cuzzocrea, Carlo Zaniolo, and Xindong Wu (Eds.). IEEE, 1220\u20131225."},{"key":"e_1_3_1_43_2","doi-asserted-by":"crossref","first-page":"824","DOI":"10.1016\/j.ins.2021.08.028","article-title":"Automatic selection of clustering algorithms using supervised graph embedding","volume":"577","author":"Cohen-Shapira Noy","year":"2021","unstructured":"Noy Cohen-Shapira and Lior Rokach. 2021. Automatic selection of clustering algorithms using supervised graph embedding. Inf. Sci. 577 (2021), 824\u2013851.","journal-title":"Inf. Sci."},{"key":"e_1_3_1_44_2","first-page":"246","volume-title":"25th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining (PAKDD\u201921) (Lecture Notes in Computer Science)","volume":"12714","author":"Liu Yue","year":"2021","unstructured":"Yue Liu, Shuang Li, and Wenjie Tian. 2021. AutoCluster: Meta-learning based ensemble method for automated unsupervised clustering. In 25th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining (PAKDD\u201921) (Lecture Notes in Computer Science), Kamal Karlapalem, Hong Cheng, Naren Ramakrishnan, R. K. Agrawal, P. Krishna Reddy, Jaideep Srivastava, and Tanmoy Chakraborty (Eds.), Vol. 12714. Springer, 246\u2013258."},{"key":"e_1_3_1_45_2","first-page":"343","volume-title":"24th International Conference on Extending Database Technology","author":"Tschechlov Dennis","year":"2021","unstructured":"Dennis Tschechlov, Manuel Fritz, and Holger Schwarz. 2021. AutoML4Clust: Efficient autoML for clustering analyses. In 24th International Conference on Extending Database Technology, Yannis Velegrakis, Demetris Zeinalipour-Yazti, Panos K. Chrysanthis, and Francesco Guerra (Eds.). OpenProceedings.org, 343\u2013348."},{"key":"e_1_3_1_46_2","first-page":"1144","volume-title":"IEEE International Conference on Data Mining Workshops (ICDM\u201922)","author":"Shawi Radwa El","year":"2022","unstructured":"Radwa El Shawi and Sherif Sakr. 2022. TPE-AutoClust: A tree-based pipeline ensemble framework for automated clustering. In IEEE International Conference on Data Mining Workshops (ICDM\u201922), K. Sel\u00e7uk Candan, Thang N. Dinh, My T. Thai, and Takashi Washio (Eds.). IEEE, 1144\u20131153."},{"key":"e_1_3_1_47_2","first-page":"47","volume-title":"IEEE International Conference on Data Mining Workshops (ICDM\u201922)","author":"Shawi Radwa El","year":"2022","unstructured":"Radwa El Shawi and Sherif Sakr. 2022. cSmartML-Glassbox: Increasing transparency and controllability in automated clustering. In IEEE International Conference on Data Mining Workshops (ICDM\u201922), K. Sel\u00e7uk Candan, Thang N. Dinh, My T. Thai, and Takashi Washio (Eds.). IEEE, 47\u201354."},{"key":"e_1_3_1_48_2","volume-title":"Applied Multivariate Statistical Analysis","author":"Johnson R. A.","year":"2007","unstructured":"R. A. Johnson and D. W. Wichern. 2007. Applied Multivariate Statistical Analysis. Pearson Prentice Hall. 2006050673"},{"issue":"1","key":"e_1_3_1_49_2","doi-asserted-by":"crossref","first-page":"10","DOI":"10.1145\/1656274.1656278","article-title":"The WEKA data mining software: An update","volume":"11","author":"Hall Mark A.","year":"2009","unstructured":"Mark A. Hall, Eibe Frank, Geoffrey Holmes, Bernhard Pfahringer, Peter Reutemann, and Ian H. Witten. 2009. The WEKA data mining software: An update. SIGKDD Explor. 11, 1 (2009), 10\u201318.","journal-title":"SIGKDD Explor."},{"key":"e_1_3_1_50_2","first-page":"224","volume-title":"25th International Conference on Machine Learning (ICML\u201908) (ACM International Conference Proceeding Series)","volume":"307","author":"Dembczynski Krzysztof","year":"2008","unstructured":"Krzysztof Dembczynski, Wojciech Kotlowski, and Roman Slowinski. 2008. Maximum likelihood rule ensembles. In 25th International Conference on Machine Learning (ICML\u201908) (ACM International Conference Proceeding Series), William W. Cohen, Andrew McCallum, and Sam T. Roweis (Eds.), Vol. 307. ACM, 224\u2013231."},{"key":"e_1_3_1_51_2","first-page":"574","volume-title":"IEEE International Conference on Bioinformatics and Biomedicine (BIBM\u201911)","author":"Vukicevic Milan","year":"2011","unstructured":"Milan Vukicevic, Boris Delibasic, Milos Jovanovic, Milija Suknovic, and Zoran Obradovic. 2011. Internal evaluation measures as proxies for external indices in clustering gene expression data. In IEEE International Conference on Bioinformatics and Biomedicine (BIBM\u201911), Fang-Xiang Wu, Mohammed Javeed Zaki, Shinichi Morishita, Yi Pan, Stephen Wong, Anastasia Christianson, and Xiaohua Hu (Eds.). IEEE Computer Society, 574\u2013577."},{"key":"e_1_3_1_52_2","first-page":"252","volume-title":"IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB\u201912)","author":"Vukicevic Milan","year":"2012","unstructured":"Milan Vukicevic, Boris Delibasic, Milos Jovanovic, Milija Suknovic, and Zoran Obradovic. 2012. A method for design of data-tailored partitioning algorithms for optimizing the number of clusters in microarray analysis. In IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB\u201912). IEEE, 252\u2013259."},{"key":"e_1_3_1_53_2","doi-asserted-by":"crossref","first-page":"78","DOI":"10.1016\/j.datak.2012.03.005","article-title":"An architecture for component-based design of representative-based clustering algorithms","volume":"75","author":"Delibasic Boris","year":"2012","unstructured":"Boris Delibasic, Milan Vukicevic, Milos Jovanovic, Kathrin Kirchner, Johannes Ruhland, and Milija Suknovic. 2012. An architecture for component-based design of representative-based clustering algorithms. Data Knowl. Eng. 75 (2012), 78\u201398.","journal-title":"Data Knowl. Eng."},{"key":"e_1_3_1_54_2","volume-title":"Machine Learning, Neural and Statistical Classification","author":"Michie Donald","year":"1994","unstructured":"Donald Michie, David J. Spiegelhalter, and Charles C. Taylor. 1994. Machine Learning, Neural and Statistical Classification. Ellis Horwood."},{"key":"e_1_3_1_55_2","volume-title":"Algorithm Selection via Meta-learning","author":"Kalousis Alexandros","year":"2002","unstructured":"Alexandros Kalousis. 2002. Algorithm Selection via Meta-learning. Ph.D. Dissertation. University of Geneva."},{"key":"e_1_3_1_56_2","doi-asserted-by":"crossref","first-page":"701","DOI":"10.1145\/2623330.2623732","volume-title":"20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD\u201914)","author":"Perozzi Bryan","year":"2014","unstructured":"Bryan Perozzi, Rami Al-Rfou, and Steven Skiena. 2014. DeepWalk: Online learning of social representations. In 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD\u201914), Sofus A. Macskassy, Claudia Perlich, Jure Leskovec, Wei Wang, and Rayid Ghani (Eds.). ACM, 701\u2013710."},{"key":"e_1_3_1_57_2","first-page":"115","volume-title":"30th International Conference on Machine Learning (ICML\u201913) (JMLR Workshop and Conference Proceedings)","volume":"28","author":"Bergstra James","year":"2013","unstructured":"James Bergstra, Daniel Yamins, and David D. Cox. 2013. Making a science of model search: Hyperparameter optimization in hundreds of dimensions for vision architectures. In 30th International Conference on Machine Learning (ICML\u201913) (JMLR Workshop and Conference Proceedings), Vol. 28. JMLR.org, 115\u2013123."},{"key":"e_1_3_1_58_2","first-page":"1436","volume-title":"35th International Conference on Machine Learning (ICML\u201918) (Proceedings of Machine Learning Research)","volume":"80","author":"Falkner Stefan","year":"2018","unstructured":"Stefan Falkner, Aaron Klein, and Frank Hutter. 2018. BOHB: Robust and efficient hyperparameter optimization at scale. In 35th International Conference on Machine Learning (ICML\u201918) (Proceedings of Machine Learning Research), Jennifer G. Dy and Andreas Krause (Eds.), Vol. 80. PMLR, 1436\u20131445."},{"issue":"5","key":"e_1_3_1_59_2","doi-asserted-by":"crossref","first-page":"603","DOI":"10.1109\/34.1000236","article-title":"Mean shift: A robust approach toward feature space analysis","volume":"24","author":"Comaniciu Dorin","year":"2002","unstructured":"Dorin Comaniciu and Peter Meer. 2002. Mean shift: A robust approach toward feature space analysis. IEEE Trans. Pattern Anal. Mach. Intell. 24, 5 (2002), 603\u2013619.","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"issue":"2","key":"e_1_3_1_60_2","doi-asserted-by":"crossref","first-page":"193","DOI":"10.1023\/A:1011508532477","article-title":"Book review: Genetic programming\u2014An introduction: On the automatic evolution of computer programs and its applications","volume":"2","author":"Punch William F.","year":"2001","unstructured":"William F. Punch. 2001. Book review: Genetic programming\u2014An introduction: On the automatic evolution of computer programs and its applications. Genet. Program. Evolv. Mach. 2, 2 (2001), 193\u2013195.","journal-title":"Genet. Program. Evolv. Mach."},{"key":"e_1_3_1_61_2","first-page":"2171","article-title":"DEAP: Evolutionary algorithms made easy","volume":"13","author":"Fortin F\u00e9lix-Antoine","year":"2012","unstructured":"F\u00e9lix-Antoine Fortin, Fran\u00e7ois-Michel De Rainville, Marc-Andr\u00e9 Gardner, Marc Parizeau, and Christian Gagn\u00e9. 2012. DEAP: Evolutionary algorithms made easy. J. Mach. Learn. Res. 13 (2012), 2171\u20132175.","journal-title":"J. Mach. Learn. Res."},{"issue":"2","key":"e_1_3_1_62_2","doi-asserted-by":"crossref","first-page":"182","DOI":"10.1109\/4235.996017","article-title":"A fast and elitist multiobjective genetic algorithm: NSGA-II","volume":"6","author":"Deb Kalyanmoy","year":"2002","unstructured":"Kalyanmoy Deb, Samir Agrawal, Amrit Pratap, and T. Meyarivan. 2002. A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Trans. Evol. Comput. 6, 2 (2002), 182\u2013197.","journal-title":"IEEE Trans. Evol. Comput."},{"key":"e_1_3_1_63_2","first-page":"149","volume-title":"IEEE International Conference on Fuzzy Systems (FUZZ\u201904)","author":"Banerjee Amit","year":"2004","unstructured":"Amit Banerjee and Rajesh N. Dav\u00e9. 2004. Validating clusters using the Hopkins statistic. In IEEE International Conference on Fuzzy Systems (FUZZ\u201904). IEEE, 149\u2013153."},{"key":"e_1_3_1_64_2","first-page":"4237","volume-title":"33rd AAAI Conference on Artificial Intelligence (AAAI\u201919), 31st Innovative Applications of Artificial Intelligence Conference (IAAI\u201919), 9th AAAI Symposium on Educational Advances in Artificial Intelligence (EAAI\u201919)","author":"Li Yu-Feng","year":"2019","unstructured":"Yu-Feng Li, Hai Wang, Tong Wei, and Wei-Wei Tu. 2019. Towards automated semi-supervised learning. In 33rd AAAI Conference on Artificial Intelligence (AAAI\u201919), 31st Innovative Applications of Artificial Intelligence Conference (IAAI\u201919), 9th AAAI Symposium on Educational Advances in Artificial Intelligence (EAAI\u201919). AAAI Press, 4237\u20134244."},{"key":"e_1_3_1_65_2","article-title":"OpenEnsembles: A Python resource for ensemble clustering","volume":"19","author":"Ronan Tom","year":"2018","unstructured":"Tom Ronan, Shawn Anastasio, Zhijie Qi, Pedro Henrique S. Vieira Tavares, Roman Sloutsky, and Kristen M. Naegle. 2018. OpenEnsembles: A Python resource for ensemble clustering. J. Mach. Learn. Res. 19 (2018), 26:1\u201326:6.","journal-title":"J. Mach. Learn. Res."},{"key":"e_1_3_1_66_2","doi-asserted-by":"crossref","first-page":"439","DOI":"10.1016\/j.eswa.2018.08.046","article-title":"A new similarity\/distance measure between intuitionistic fuzzy sets based on the transformed isosceles triangles and its applications to pattern recognition","volume":"116","author":"Jiang Qian","year":"2019","unstructured":"Qian Jiang, Xin Jin, Shin-Jye Lee, and Shaowen Yao. 2019. A new similarity\/distance measure between intuitionistic fuzzy sets based on the transformed isosceles triangles and its applications to pattern recognition. Expert Syst. Applic. 116 (2019), 439\u2013453.","journal-title":"Expert Syst. Applic."},{"issue":"5","key":"e_1_3_1_67_2","doi-asserted-by":"crossref","first-page":"1774","DOI":"10.1109\/TNNLS.2017.2673241","article-title":"Efficient kNN classification with different numbers of nearest neighbors","volume":"29","author":"Zhang Shichao","year":"2018","unstructured":"Shichao Zhang, Xuelong Li, Ming Zong, Xiaofeng Zhu, and Ruili Wang. 2018. Efficient kNN classification with different numbers of nearest neighbors. IEEE Trans. Neural Netw. Learn. Syst. 29, 5 (2018), 1774\u20131785.","journal-title":"IEEE Trans. Neural Netw. Learn. Syst."},{"issue":"07","key":"e_1_3_1_68_2","first-page":"7382","article-title":"Reachable distance function for KNN classification","volume":"35","author":"Zhang S.","year":"2023","unstructured":"S. Zhang, J. Li, and Y. Li. 2023. Reachable distance function for KNN classification. IEEE Trans. Knowl. Data Eng. 35, 07 (July2023), 7382\u20137396.","journal-title":"IEEE Trans. Knowl. Data Eng."},{"issue":"3","key":"e_1_3_1_69_2","first-page":"2711","article-title":"KNN classification with one-step computation","volume":"35","author":"Zhang Shichao","year":"2023","unstructured":"Shichao Zhang and Jiaye Li. 2023. KNN classification with one-step computation. IEEE Trans. Knowl. Data Eng. 35, 3 (2023), 2711\u20132723.","journal-title":"IEEE Trans. Knowl. Data Eng."},{"key":"e_1_3_1_70_2","doi-asserted-by":"crossref","first-page":"200","DOI":"10.1016\/j.neucom.2022.06.082","article-title":"Hyper-class representation of data","volume":"503","author":"Zhang Shichao","year":"2022","unstructured":"Shichao Zhang, Jiaye Li, Wenzhen Zhang, and Yongsong Qin. 2022. Hyper-class representation of data. Neurocomputing 503 (2022), 200\u2013218.","journal-title":"Neurocomputing"},{"key":"e_1_3_1_71_2","first-page":"911","volume-title":"10th IEEE International Conference on Data Mining (ICDM\u201910)","author":"Liu Yanchi","year":"2010","unstructured":"Yanchi Liu, Zhongmou Li, Hui Xiong, Xuedong Gao, and Junjie Wu. 2010. Understanding of internal clustering validation measures. In 10th IEEE International Conference on Data Mining (ICDM\u201910), Geoffrey I. Webb, Bing Liu, Chengqi Zhang, Dimitrios Gunopulos, and Xindong Wu (Eds.). IEEE Computer Society, 911\u2013916."},{"issue":"3","key":"e_1_3_1_72_2","doi-asserted-by":"crossref","first-page":"301","DOI":"10.1109\/3477.678624","article-title":"Some new indexes of cluster validity","volume":"28","author":"Bezdek James C.","year":"1998","unstructured":"James C. Bezdek and Nikhil R. Pal. 1998. Some new indexes of cluster validity. IEEE Trans. Syst. Man Cybern. Part B 28, 3 (1998), 301\u2013315.","journal-title":"IEEE Trans. Syst. Man Cybern. Part B"},{"issue":"3","key":"e_1_3_1_73_2","doi-asserted-by":"crossref","first-page":"803","DOI":"10.2307\/2532201","article-title":"Model-based Gaussian and non-Gaussian clustering","volume":"49","author":"Banfield Jeffrey D.","year":"1993","unstructured":"Jeffrey D. Banfield and Adrian E. Raftery. 1993. Model-based Gaussian and non-Gaussian clustering. Biometrics 49, 3 (1993), 803\u2013821.","journal-title":"Biometrics"},{"issue":"6","key":"e_1_3_1_74_2","doi-asserted-by":"crossref","first-page":"1072","DOI":"10.1037\/0033-2909.83.6.1072","article-title":"A general statistical framework for assessing categorical clustering in free recall.","volume":"83","author":"Hubert Lawrence J.","year":"1976","unstructured":"Lawrence J. Hubert and Joel R. Levin. 1976. A general statistical framework for assessing categorical clustering in free recall. Psychol. Bull. 83, 6 (1976), 1072\u20131080.","journal-title":"Psychol. Bull."},{"issue":"1","key":"e_1_3_1_75_2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1080\/03610927408827101","article-title":"A dendrite method for cluster analysis","volume":"3","author":"Cali\u0144ski Tadeusz","year":"1974","unstructured":"Tadeusz Cali\u0144ski and Jerzy Harabasz. 1974. A dendrite method for cluster analysis. Commun. Statist.-Theor. Meth. 3, 1 (1974), 1\u201327.","journal-title":"Commun. Statist.-Theor. Meth."},{"issue":"6","key":"e_1_3_1_76_2","doi-asserted-by":"crossref","first-page":"773","DOI":"10.1016\/j.patrec.2007.12.011","article-title":"A density-based cluster validity approach using multi-representatives","volume":"29","author":"Halkidi Maria","year":"2008","unstructured":"Maria Halkidi and Michalis Vazirgiannis. 2008. A density-based cluster validity approach using multi-representatives. Pattern Recognit. Lett. 29, 6 (2008), 773\u2013786.","journal-title":"Pattern Recognit. Lett."},{"issue":"2","key":"e_1_3_1_77_2","doi-asserted-by":"crossref","first-page":"224","DOI":"10.1109\/TPAMI.1979.4766909","article-title":"A cluster separation measure","volume":"1","author":"Davies David L.","year":"1979","unstructured":"David L. Davies and Donald W. Bouldin. 1979. A cluster separation measure. IEEE Trans. Pattern Anal. Mach. Intell. 1, 2 (1979), 224\u2013227.","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"issue":"3","key":"e_1_3_1_78_2","doi-asserted-by":"crossref","first-page":"32","DOI":"10.1080\/01969727308546046","article-title":"A fuzzy relative of the ISODATA process and its use in detecting compact well-separated clusters","volume":"3","author":"Dunn J. C.","year":"1973","unstructured":"J. C. Dunn. 1973. A fuzzy relative of the ISODATA process and its use in detecting compact well-separated clusters. J. Cybern. 3, 3 (1973), 32\u201357.","journal-title":"J. Cybern."},{"issue":"4","key":"e_1_3_1_79_2","first-page":"697","article-title":"Multivariate generalizations of the Wald-Wolfowitz and Smirnov two-sample tests","volume":"7","author":"Friedman Jerome H.","year":"1979","unstructured":"Jerome H. Friedman and Lawrence C. Rafsky. 1979. Multivariate generalizations of the Wald-Wolfowitz and Smirnov two-sample tests. Ann. Statist. 7, 4 (1979), 697\u2013717.","journal-title":"Ann. Statist."},{"issue":"15","key":"e_1_3_1_80_2","doi-asserted-by":"crossref","first-page":"3201","DOI":"10.1093\/bioinformatics\/bti517","article-title":"Computational cluster validation in post-genomic data analysis","volume":"21","author":"Handl Julia","year":"2005","unstructured":"Julia Handl, Joshua D. Knowles, and Douglas B. Kell. 2005. Computational cluster validation in post-genomic data analysis. Bioinformatics 21, 15 (2005), 3201\u20133212.","journal-title":"Bioinformatics"},{"issue":"1","key":"e_1_3_1_81_2","doi-asserted-by":"crossref","first-page":"193","DOI":"10.1007\/BF01908075","article-title":"Comparing partitions","volume":"2","author":"Hubert Lawrence","year":"1985","unstructured":"Lawrence Hubert and Phipps Arabie. 1985. Comparing partitions. J. Classif. 2, 1 (1985), 193\u2013218.","journal-title":"J. Classif."},{"issue":"12","key":"e_1_3_1_82_2","doi-asserted-by":"crossref","first-page":"1650","DOI":"10.1109\/TPAMI.2002.1114856","article-title":"Performance evaluation of some clustering algorithms and validity indices","volume":"24","author":"Maulik Ujjwal","year":"2002","unstructured":"Ujjwal Maulik and Sanghamitra Bandyopadhyay. 2002. Performance evaluation of some clustering algorithms and validity indices. IEEE Trans. Pattern Anal. Mach. Intell. 24, 12 (2002), 1650\u20131654.","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"issue":"1","key":"e_1_3_1_83_2","doi-asserted-by":"crossref","first-page":"53","DOI":"10.1016\/0377-0427(87)90125-7","article-title":"Silhouettes: A graphical aid to the interpretation and validation of cluster analysis","volume":"20","author":"Rousseeuw Peter","year":"1987","unstructured":"Peter Rousseeuw. 1987. Silhouettes: A graphical aid to the interpretation and validation of cluster analysis. J. Comput. Appl. Math. 20, 1 (1987), 53\u201365.","journal-title":"J. Comput. Appl. Math."},{"issue":"349","key":"e_1_3_1_84_2","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1080\/01621459.1975.10480256","article-title":"Measuring the power of hierarchical cluster analysis","volume":"70","author":"Baker Frank B.","year":"1975","unstructured":"Frank B. Baker and Lawrence J. Hubert. 1975. Measuring the power of hierarchical cluster analysis. J. Amer. Statist. Assoc. 70, 349 (1975), 31\u201338.","journal-title":"J. Amer. Statist. Assoc."},{"issue":"4","key":"e_1_3_1_85_2","first-page":"456","article-title":"CLUSTISZ: A program to test for the quality of clustering of a set of objects","volume":"12","author":"McClain John O.","year":"1975","unstructured":"John O. McClain and Vithala R. Rao. 1975. CLUSTISZ: A program to test for the quality of clustering of a set of objects. J. Market. Res. 12, 4 (1975), 456\u2013460.","journal-title":"J. Market. Res."},{"issue":"2","key":"e_1_3_1_86_2","doi-asserted-by":"crossref","first-page":"159","DOI":"10.1007\/BF02294245","article-title":"An examination of procedures for determining the number of clusters in a data set","volume":"50","author":"Milligan Glen W.","year":"1985","unstructured":"Glen W. Milligan and Martha C. Cooper. 1985. An examination of procedures for determining the number of clusters in a data set. Psychometrika 50, 2 (1985), 159\u2013179.","journal-title":"Psychometrika"},{"issue":"3","key":"e_1_3_1_87_2","doi-asserted-by":"crossref","first-page":"487","DOI":"10.1016\/j.patcog.2003.06.005","article-title":"Validity index for crisp and fuzzy clusters","volume":"37","author":"Pakhira Malay Kumar","year":"2004","unstructured":"Malay Kumar Pakhira, Sanghamitra Bandyopadhyay, and Ujjwal Maulik. 2004. Validity index for crisp and fuzzy clusters. Pattern Recognit. 37, 3 (2004), 487\u2013501.","journal-title":"Pattern Recognit."},{"issue":"4","key":"e_1_3_1_88_2","first-page":"115","article-title":"A criterion for determining the number of groups in a classification","volume":"10","author":"Ratkowsky David A.","year":"1978","unstructured":"David A. Ratkowsky and Geoffrey N. Lance. 1978. A criterion for determining the number of groups in a classification. Austral. Comput. J. 10, 4 (1978), 115\u2013117.","journal-title":"Austral. Comput. J."},{"key":"e_1_3_1_89_2","first-page":"137","volume-title":"4th International Conference on Advances in Pattern Recognition and Digital Techniques","author":"Ray S.","year":"1999","unstructured":"S. Ray and R. H. Turi. 1999. Determination of number of clusters in k-means clustering and application in colour image segmentation. In 4th International Conference on Advances in Pattern Recognition and Digital Techniques. Narosa Publishing House, 137\u2013143."},{"issue":"2","key":"e_1_3_1_90_2","doi-asserted-by":"crossref","first-page":"387","DOI":"10.2307\/2529003","article-title":"Clustering methods based on likelihood ratio criteria","volume":"27","author":"Scott A. J.","year":"1971","unstructured":"A. J. Scott and M. J. Symons. 1971. Clustering methods based on likelihood ratio criteria. Biometrics 27, 2 (1971), 387\u2013397.","journal-title":"Biometrics"},{"issue":"268","key":"e_1_3_1_91_2","first-page":"732","article-title":"Measures of association for cross classifications","volume":"49","author":"Goodman Leo A.","year":"1954","unstructured":"Leo A. Goodman and William H. Kruskal. 1954. Measures of association for cross classifications. J. Amer. Statist. Assoc. 49, 268 (1954), 732\u2013764.","journal-title":"J. Amer. Statist. Assoc."},{"issue":"8","key":"e_1_3_1_92_2","doi-asserted-by":"crossref","first-page":"841","DOI":"10.1109\/34.85677","article-title":"A validity measure for fuzzy clustering","volume":"13","author":"Xie Xuanli Lisa","year":"1991","unstructured":"Xuanli Lisa Xie and Gerardo Beni. 1991. A validity measure for fuzzy clustering. IEEE Trans. Pattern Anal. Mach. Intell. 13, 8 (1991), 841\u2013847.","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"e_1_3_1_93_2","first-page":"81","volume-title":"IEEE International Conference on Data Mining (ICDM\u201921)","author":"Cohen-Shapira Noy","year":"2021","unstructured":"Noy Cohen-Shapira and Lior Rokach. 2021. TRIO: Task-agnostic dataset representation optimized for automatic algorithm selection. In IEEE International Conference on Data Mining (ICDM\u201921), James Bailey, Pauli Miettinen, Yun Sing Koh, Dacheng Tao, and Xindong Wu (Eds.). IEEE, 81\u201390."},{"issue":"3","key":"e_1_3_1_94_2","doi-asserted-by":"crossref","first-page":"964","DOI":"10.1007\/s10618-021-00737-9","article-title":"Dataset2Vec: Learning dataset meta-features","volume":"35","author":"Jomaa Hadi S.","year":"2021","unstructured":"Hadi S. Jomaa, Lars Schmidt-Thieme, and Josif Grabocka. 2021. Dataset2Vec: Learning dataset meta-features. Data Min. Knowl. Discov. 35, 3 (2021), 964\u2013985.","journal-title":"Data Min. Knowl. Discov."},{"issue":"4","key":"e_1_3_1_95_2","doi-asserted-by":"crossref","first-page":"772","DOI":"10.14778\/3574245.3574261","article-title":"SubStrat: A subset-based optimization strategy for faster AutoML","volume":"16","author":"Lazebnik Teddy","year":"2022","unstructured":"Teddy Lazebnik, Amit Somech, and Abraham Itzhak Weinberg. 2022. SubStrat: A subset-based optimization strategy for faster AutoML. Proc. VLDB Endow. 16, 4 (2022), 772\u2013780.","journal-title":"Proc. VLDB Endow."},{"issue":"1","key":"e_1_3_1_96_2","doi-asserted-by":"crossref","first-page":"90","DOI":"10.1145\/1007730.1007731","article-title":"Subspace clustering for high dimensional data: A review","volume":"6","author":"Parsons Lance","year":"2004","unstructured":"Lance Parsons, Ehtesham Haque, and Huan Liu. 2004. Subspace clustering for high dimensional data: A review. SIGKDD Explor. 6, 1 (2004), 90\u2013105.","journal-title":"SIGKDD Explor."},{"key":"e_1_3_1_97_2","doi-asserted-by":"crossref","first-page":"148","DOI":"10.1016\/j.neucom.2020.02.104","article-title":"An overview of recent multi-view clustering","volume":"402","author":"Fu Lele","year":"2020","unstructured":"Lele Fu, Pengfei Lin, Athanasios V. Vasilakos, and Shiping Wang. 2020. An overview of recent multi-view clustering. Neurocomputing 402 (2020), 148\u2013161.","journal-title":"Neurocomputing"},{"issue":"2","key":"e_1_3_1_98_2","doi-asserted-by":"crossref","first-page":"83","DOI":"10.26599\/BDMA.2018.9020003","article-title":"Multi-view clustering: A survey","volume":"1","author":"Yang Yan","year":"2018","unstructured":"Yan Yang and Hao Wang. 2018. Multi-view clustering: A survey. Big Data Min. Anal. 1, 2 (2018), 83\u2013107.","journal-title":"Big Data Min. Anal."}],"container-title":["ACM Transactions on Knowledge Discovery from Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3643564","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3643564","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T22:50:28Z","timestamp":1750287028000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3643564"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,2,27]]},"references-count":97,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2024,6,30]]}},"alternative-id":["10.1145\/3643564"],"URL":"https:\/\/doi.org\/10.1145\/3643564","relation":{},"ISSN":["1556-4681","1556-472X"],"issn-type":[{"value":"1556-4681","type":"print"},{"value":"1556-472X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,2,27]]},"assertion":[{"value":"2023-04-17","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-01-19","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2024-02-27","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}