{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,12]],"date-time":"2025-10-12T04:46:55Z","timestamp":1760244415568,"version":"build-2065373602"},"reference-count":52,"publisher":"MDPI AG","issue":"1","license":[{"start":{"date-parts":[[2022,12,21]],"date-time":"2022-12-21T00:00:00Z","timestamp":1671580800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100012190","name":"Ministry of Science and Higher Education of the Russian Federation","doi-asserted-by":"publisher","award":["FEFE-2020-0013"],"award-info":[{"award-number":["FEFE-2020-0013"]}],"id":[{"id":"10.13039\/501100012190","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Algorithms"],"abstract":"<jats:p>Finding the cluster structure is essential for analyzing self-organized networking structures, such as social networks. In such problems, a wide variety of distance measures can be used. Common clustering methods often require the number of clusters to be explicitly indicated before starting the process of clustering. A preliminary step to clustering is deciding, firstly, whether the data contain any clusters and, secondly, how many clusters the dataset contains. To highlight the internal structure of data, several methods for visual assessment of clustering tendency (VAT family of methods) have been developed. The vast majority of these methods use the Euclidean distance or cosine similarity measure. In our study, we modified the VAT and iVAT algorithms for visual assessment of the clustering tendency with a wide variety of distance measures. We compared the results of our algorithms obtained from both samples from repositories and data from applied problems.<\/jats:p>","DOI":"10.3390\/a16010005","type":"journal-article","created":{"date-parts":[[2022,12,22]],"date-time":"2022-12-22T02:31:11Z","timestamp":1671676271000},"page":"5","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Visual Assessment of Cluster Tendency with Variations of Distance Measures"],"prefix":"10.3390","volume":"16","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-8257-7329","authenticated-orcid":false,"given":"Guzel","family":"Shkaberina","sequence":"first","affiliation":[{"name":"Institute of Informatics and Telecommunications, Reshetnev Siberian State University of Science and Technology, 31 Krasnoyarsky Rabochy Av., 660037 Krasnoyarsk, Russia"},{"name":"Laboratory \u201cHybrid Methods of Modeling and Optimization in Complex Systems\u201d, Siberian Federal University, Svobodny Av., 660041 Krasnoyarsk, Russia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1149-3299","authenticated-orcid":false,"given":"Natalia","family":"Rezova","sequence":"additional","affiliation":[{"name":"Institute of Informatics and Telecommunications, Reshetnev Siberian State University of Science and Technology, 31 Krasnoyarsky Rabochy Av., 660037 Krasnoyarsk, Russia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8500-2050","authenticated-orcid":false,"given":"Elena","family":"Tovbis","sequence":"additional","affiliation":[{"name":"Institute of Informatics and Telecommunications, Reshetnev Siberian State University of Science and Technology, 31 Krasnoyarsky Rabochy Av., 660037 Krasnoyarsk, Russia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0667-4001","authenticated-orcid":false,"given":"Lev","family":"Kazakovtsev","sequence":"additional","affiliation":[{"name":"Institute of Informatics and Telecommunications, Reshetnev Siberian State University of Science and Technology, 31 Krasnoyarsky Rabochy Av., 660037 Krasnoyarsk, Russia"},{"name":"Laboratory \u201cHybrid Methods of Modeling and Optimization in Complex Systems\u201d, Siberian Federal University, Svobodny Av., 660041 Krasnoyarsk, Russia"}]}],"member":"1968","published-online":{"date-parts":[[2022,12,21]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"613","DOI":"10.1007\/s10791-009-9106-z","article-title":"A comparison of extrinsic clustering evaluation metrics based on formal constraints","volume":"12","author":"Gonzalo","year":"2009","journal-title":"Inf. Retr."},{"key":"ref_2","first-page":"1","article-title":"A dendrite method for cluster analysis","volume":"3","author":"Calinski","year":"1974","journal-title":"Commun. Stat."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"224","DOI":"10.1109\/TPAMI.1979.4766909","article-title":"A cluster separation measure","volume":"1","author":"Davies","year":"1979","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Kaufman, L., and Rousseeuw, P.J. (1990). Finding Groups in Data: An Introduction to Cluster Analysis, Wiley.","DOI":"10.1002\/9780470316801"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"53","DOI":"10.1016\/0377-0427(87)90125-7","article-title":"Silhouettes: A graphical aid to the interpretation and validation of cluster analysis","volume":"20","author":"Rousseeuw","year":"1987","journal-title":"J. Comput. Appl. Math."},{"key":"ref_6","unstructured":"Bezdek, C., and Hathaway, R.J. (2002, January 12\u201317). Vat: A tool for visual assessment of (cluster) tendency. Proceedings of the \u0132CNN, Honolulu, HI, USA."},{"key":"ref_7","unstructured":"Jain, A.K., and Dubes, R.C. (1988). Algorithms for Clustering Data, Prentice Hall College Div."},{"key":"ref_8","unstructured":"Everitt, B. (1978). Graphical Techniques for Multivariate Data, North-Holland Press."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1016\/S0003-2670(01)01033-9","article-title":"New index for clustering tendency","volume":"446","author":"Forina","year":"2001","journal-title":"Anal. Chim. Acta"},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"213","DOI":"10.1093\/oxfordjournals.aob.a083391","article-title":"A New Method for determining the Type of Distribution of Plant Individuals","volume":"18","author":"Hopkins","year":"1954","journal-title":"Ann. Bot."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"137","DOI":"10.1021\/ci00066a007","article-title":"Cluster analysis of acrylates to guide sampling for toxicity testing","volume":"30","author":"Lawson","year":"1990","journal-title":"J. Chem. Inf. Comput. Sci."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"13","DOI":"10.1016\/S0003-2670(99)00879-X","article-title":"Improved algorithm for clustering tendency","volume":"408","author":"Massart","year":"2000","journal-title":"Anal. Chim. Acta"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"1389","DOI":"10.1002\/j.1538-7305.1957.tb01515.x","article-title":"Shortest Connection Networks and some Generalizations","volume":"36","author":"Prim","year":"1957","journal-title":"Bell Syst. Tech. J."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"48","DOI":"10.1090\/S0002-9939-1956-0078686-7","article-title":"On the Shortest Spanning Subtree of a Graph and the Traveling Salesman Problem","volume":"7","author":"Kruskal","year":"1956","journal-title":"Proc. Am. Math. Soc."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"841","DOI":"10.1109\/34.85677","article-title":"A Validity Measure for Fuzzy Clustering","volume":"13","author":"Xie","year":"1991","journal-title":"IEEE Trans. Pattern Anal. Mach. Intel."},{"key":"ref_16","first-page":"10","article-title":"Visual approaches for exploratory data analysis: A survey of the visual assessment of clustering tendency (VAT) family of algorithms","volume":"6","author":"Kumar","year":"2020","journal-title":"IEEE Trans. Syst. Man Cybern."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Wang, L., Nguyen, U.T., Bezdek, J.C., Leckie, C.A., and Ramamohanarao, K. (2010, January 21\u201324). iVAT and aVAT: Enhanced visual analysis for cluster tendency assessment. Proceedings of the Pacific-Asia Conference on Knowledge Discovery and Data Mining, Hyderabad, India.","DOI":"10.1007\/978-3-642-13657-3_5"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Havens, T.C., Bezdek, J.C., Leckie, C., and Palaniswami, M. (2013, January 7\u201310). Extension of iVAT to asymmetric matrices. Proceedings of the Fuzzy Systems (FUZZ), 2013 IEEE International Conference, Hyderabad, India.","DOI":"10.1109\/FUZZ-IEEE.2013.6622300"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"48","DOI":"10.1016\/j.patrec.2015.03.007","article-title":"Visual hierarchical cluster structure: A refined coassociation matrix based visual assessment of cluster tendency","volume":"59","author":"Zhong","year":"2015","journal-title":"Pattern Recognit. Lett."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Huband, J.M., Bezdek, J.C., and Hathaway, R.J. (2004, January 27\u201330). Revised visual assessment of (cluster) tendency (reVAT). Proceedings of the North American Fuzzy Information Processing Society (NAFIPS), Banff, AB, Canada.","DOI":"10.1109\/NAFIPS.2004.1336257"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"1875","DOI":"10.1016\/j.patcog.2005.03.018","article-title":"BigVAT: Visual assessment of cluster tendency for large data sets","volume":"38","author":"Huband","year":"2005","journal-title":"Pattern Recognit."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"1315","DOI":"10.1016\/j.patcog.2006.02.011","article-title":"Scalable visual assessment of cluster tendency for large data sets","volume":"39","author":"Hathaway","year":"2006","journal-title":"Pattern Recognit."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"890","DOI":"10.1109\/TFUZZ.2006.889956","article-title":"Visual assessment of clustering tendency for rectangular dissimilarity matrices","volume":"15","author":"Bezdek","year":"2007","journal-title":"IEEE Trans. Fuzzy Syst."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Sledge, I., Huband, J., and Bezdek, J.C. (2008, January 1820). (Automatic) cluster count extraction from unlabeled datasets. Proceedings of the Joint International Conference on Natural Computation and International Conference on Fuzzy Systems and Knowledge Discovery, Jinan, China.","DOI":"10.1109\/FSKD.2008.552"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"335","DOI":"10.1109\/TKDE.2008.158","article-title":"Automatically determining the number of clusters in unlabeled data sets","volume":"21","author":"Wang","year":"2009","journal-title":"IEEE Trans. Knowl. Data Eng."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"504","DOI":"10.1002\/int.20344","article-title":"Clustering in ordered dissimilarity data","volume":"24","author":"Havens","year":"2009","journal-title":"Int. J. Intell. Syst."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"58","DOI":"10.1109\/4235.985692","article-title":"The particle swarm\u2014Explosion, stability, and convergence in a multi-dimensional complex space","volume":"6","author":"Clerc","year":"2002","journal-title":"IEEE Trans. Evolut. Comput."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"213","DOI":"10.1016\/j.neucom.2018.04.022","article-title":"A new cluster tendency assessment method for fuzzy co-clustering in hyperspectral image analysis","volume":"307","author":"Pham","year":"2018","journal-title":"Neurocomputing"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Kumar, D., and Bezdek, J.C. (2020, January 19\u201324). Clustering tendency assessment for datasets having inter-cluster density variations. Proceedings of the 2020 International Conference on Signal Processing and Communications (SPCOM), Bangalore, India.","DOI":"10.1109\/SPCOM50965.2020.9179608"},{"key":"ref_30","first-page":"1601","article-title":"Self-tuning spectral clustering","volume":"Volume 17","author":"Perona","year":"2004","journal-title":"Advances in Neural Information Processing Systems"},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"655","DOI":"10.1007\/BFb0055696","article-title":"A factorization approach to grouping","volume":"Volume 1406","author":"Perona","year":"1998","journal-title":"Proceedings of the Computer Vision\u2014ECCV\u201998"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Campello, R.J.G.B., Moulavi, D., and Sander, J. (2013). Density-based clustering based on hierarchical density estimates. Advances in Knowledge Discovery and Data Mining, Springer.","DOI":"10.1007\/978-3-642-37456-2_14"},{"key":"ref_33","unstructured":"Rathore, P., Bezdek, J.C., Santi, P., and Ratti, C. (2020). ConiVAT: Cluster Tendency Assessment and Clustering with Partial Background Knowledge. arXiv."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Lesot, M.J., and Marsala, C. (2021). Fast Cluster Tendency Assessment for Big, High-Dimensional Data. Fuzzy Approaches for Soft Computing and Approximate Reasoning: Theories and Applications, Springer. Studies in Fuzziness and Soft Computing.","DOI":"10.1007\/978-3-030-54341-9"},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"8013","DOI":"10.1007\/s11227-021-03618-6","article-title":"Sampling-based visual assessment computing techniques for an efficient social data clustering","volume":"8","author":"Basha","year":"2021","journal-title":"J. Supercomput."},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"104625","DOI":"10.1016\/j.micpro.2022.104625","article-title":"A novel data visualization method for the effective assessment of cluster tendency through the dark blocks image pattern analysis","volume":"93","author":"Prasad","year":"2022","journal-title":"Microprocess. Microsyst."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Datta, S., Karmakar, C., Rathore, P., and Palaniswami, M. (2022, January 11\u201315). Scalable Cluster Tendency Assessment for Streaming Activity Data using Recurring Shapelets. Proceedings of the 44th Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), Glasgow, UK.","DOI":"10.1109\/EMBC48229.2022.9871534"},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"1401","DOI":"10.1109\/TKDE.2009.192","article-title":"Enhanced visual analysis for cluster tendency assessment and data partitioning","volume":"22","author":"Wang","year":"2010","journal-title":"IEEE Trans. Knowl. Data Eng."},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Shirkhorshidi, S., Aghabozorgi, S., and Wah, T. (2015). A Comparison Study on Similarity and Dissimilarity Measures in Clustering Continuous Data. PLoS ONE, 10.","DOI":"10.1371\/journal.pone.0144059"},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"221","DOI":"10.1089\/big.2018.0175","article-title":"Effects of Distance Measure Choice on K-Nearest Neighbor Classifier Performance: A Review","volume":"7","author":"Alfeilat","year":"2019","journal-title":"Big Data"},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"70","DOI":"10.1109\/COMST.2014.2336610","article-title":"A Survey of Distance and Similarity Measures Used Within Network Intrusion Anomaly Detection","volume":"17","author":"Borghetti","year":"2015","journal-title":"IEEE Commun. Surv. Tutor."},{"key":"ref_42","unstructured":"(2022, October 14). Canberra Distance. Available online: https:\/\/academic.oup.com\/comjnl\/article\/9\/1\/60\/348137?login=false."},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"20","DOI":"10.1007\/BF02834632","article-title":"Mahalanobis Distance","volume":"4","author":"McLachlan","year":"1999","journal-title":"Resonance"},{"key":"ref_44","unstructured":"(2022, October 14). Distance Correlation. Available online: https:\/\/arxiv.org\/abs\/0803.4101."},{"key":"ref_45","unstructured":"Han, J., Kamber, M., and Pei, J. (2012). Data mining: Concepts and Techniques, Morgan Kaufmann."},{"key":"ref_46","unstructured":"(2022, October 14). Bray\u2013Curtis Dissimilarity. Available online: https:\/\/esajournals.onlinelibrary.wiley.com\/doi\/10.2307\/1942268."},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"813","DOI":"10.1109\/TKDE.2011.33","article-title":"An efficient formulation of the improved visual assessment of cluster tendency (iVAT) algorithm","volume":"24","author":"Havens","year":"2012","journal-title":"IEEE Trans. Knowl. Data Eng."},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"237","DOI":"10.1007\/s10472-009-9157-2","article-title":"Is VAT really single linkage in disguise?","volume":"55","author":"Havens","year":"2009","journal-title":"Ann. Math. Artif. Intell."},{"key":"ref_49","unstructured":"(2022, October 14). Artificial Clustering Datasets. Available online: https:\/\/github.com\/milaan9\/Clustering-Datasets."},{"key":"ref_50","first-page":"421","article-title":"On the Optimization Models for Automatic Grouping of Industrial Products by Homogeneous Production Batches","volume":"1275","author":"Shkaberina","year":"2020","journal-title":"Commun. Comput. Inf. Sci."},{"key":"ref_51","doi-asserted-by":"crossref","first-page":"012015","DOI":"10.1088\/1757-899X\/94\/1\/012015","article-title":"Fast deterministic algorithm for EEE components classification","volume":"94","author":"Kazakovtsev","year":"2015","journal-title":"IOP Conf. Ser. Mater. Sci. Eng."},{"key":"ref_52","doi-asserted-by":"crossref","unstructured":"Rezova, N., Kazakovtsev, L., Shkaberina, G., Demidko, D., and Goroshko, A. (2022, January 15\u201316). Data pre-processing for ecosystem behaviour analysis. Proceedings of the 2022 IEEE International Conference on Information Technologies, Varna, Bulgaria. in press.","DOI":"10.1109\/InfoTech55606.2022.9897105"}],"container-title":["Algorithms"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1999-4893\/16\/1\/5\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T01:47:07Z","timestamp":1760147227000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1999-4893\/16\/1\/5"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,12,21]]},"references-count":52,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2023,1]]}},"alternative-id":["a16010005"],"URL":"https:\/\/doi.org\/10.3390\/a16010005","relation":{},"ISSN":["1999-4893"],"issn-type":[{"type":"electronic","value":"1999-4893"}],"subject":[],"published":{"date-parts":[[2022,12,21]]}}}