{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,7]],"date-time":"2026-05-07T07:04:09Z","timestamp":1778137449470,"version":"3.51.4"},"reference-count":77,"publisher":"Springer Science and Business Media LLC","issue":"3","license":[{"start":{"date-parts":[[2022,7,22]],"date-time":"2022-07-22T00:00:00Z","timestamp":1658448000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2022,7,22]],"date-time":"2022-07-22T00:00:00Z","timestamp":1658448000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Classif"],"published-print":{"date-parts":[[2022,11]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>In unsupervised machine learning, agreement between partitions is commonly assessed with so-called external validity indices. Researchers tend to use and report indices that quantify agreement between two partitions for all clusters simultaneously. Commonly used examples are the Rand index and the adjusted Rand index. Since these overall measures give a general notion of what is going on, their values are usually hard to interpret. The goal of this study is to provide a thorough understanding of the adjusted Rand index as well as many other partition comparison indices based on counting object pairs. It is shown that many overall indices based on the pair-counting approach can be decomposed into indices that reflect the degree of agreement on the level of individual clusters. The decompositions (1) show that the overall indices can be interpreted as summary statistics of the agreement on the cluster level, (2) specify how these overall indices are related to the indices for individual clusters, and (3) show that the overall indices are affected by cluster size imbalance: if cluster sizes are unbalanced these overall measures will primarily reflect the degree of agreement between the partitions on the large clusters, and will provide much less information on the agreement on smaller clusters. Furthermore, the value of Rand-like indices is determined to a large extent by the number of pairs of objects that are not joined in either of the partitions.<\/jats:p>","DOI":"10.1007\/s00357-022-09413-z","type":"journal-article","created":{"date-parts":[[2022,7,22]],"date-time":"2022-07-22T13:03:43Z","timestamp":1658495023000},"page":"487-509","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":103,"title":["Understanding the Adjusted Rand Index and Other Partition Comparison Indices Based on Counting Object Pairs"],"prefix":"10.1007","volume":"39","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-7302-640X","authenticated-orcid":false,"given":"Matthijs J.","family":"Warrens","sequence":"first","affiliation":[]},{"given":"Hanneke","family":"van der Hoef","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2022,7,22]]},"reference":[{"issue":"3","key":"9413_CR1","doi-asserted-by":"publisher","first-page":"179","DOI":"10.1007\/s11634-011-0090-y","volume":"5","author":"AN Albatineh","year":"2011","unstructured":"Albatineh, A.N., & Niewiadomska-Bugaj, M. (2011a). Correcting Jaccard and other similarity indices for chance agreement in cluster analysis. Advances in Data Analysis and Classification, 5(3), 179\u2013200.","journal-title":"Advances in Data Analysis and Classification"},{"key":"9413_CR2","doi-asserted-by":"publisher","first-page":"184","DOI":"10.1007\/s00357-010-9069-1","volume":"28","author":"AN Albatineh","year":"2011","unstructured":"Albatineh, A.N., & Niewiadomska-Bugaj, M. (2011b). MCS: A method for finding the number of clusters. Journal of Classification, 28, 184\u2013209.","journal-title":"Journal of Classification"},{"issue":"2","key":"9413_CR3","doi-asserted-by":"publisher","first-page":"301","DOI":"10.1007\/s00357-006-0017-z","volume":"23","author":"AN Albatineh","year":"2006","unstructured":"Albatineh, A.N., Niewiadomska-Bugaj, M., & Mihalko, D. (2006). On similarity indices and correction for chance agreement. Journal of Classification, 23 (2), 301\u2013313.","journal-title":"Journal of Classification"},{"key":"9413_CR4","first-page":"494","volume":"6","author":"AK Alok","year":"2014","unstructured":"Alok, A.K., Saha, S., & Ekbal, A. (2014). Development of an external cluster validity index using probabilistic approach and min-max distance. International Journal of Computer Information Systems and Industrial Management Applications, 6, 494\u2013504.","journal-title":"International Journal of Computer Information Systems and Industrial Management Applications"},{"key":"9413_CR5","doi-asserted-by":"publisher","first-page":"906","DOI":"10.1109\/TFUZZ.2010.2052258","volume":"18","author":"DT Anderson","year":"2010","unstructured":"Anderson, D.T., Bezdek, J.C., Popescu, M., & Keller, J.M. (2010). Comparing fuzzy, probabilistic, and possibilistic partitions. IEEE Transactions on Fuzzy Systems, 18, 906\u2013917.","journal-title":"IEEE Transactions on Fuzzy Systems"},{"issue":"1","key":"9413_CR6","doi-asserted-by":"publisher","first-page":"233","DOI":"10.1007\/BF01908601","volume":"6","author":"FB Baulieu","year":"1989","unstructured":"Baulieu, F.B. (1989). A classification of presence\/absence based dissimilarity coefficients. Journal of Classification, 6(1), 233\u2013246.","journal-title":"Journal of Classification"},{"key":"9413_CR7","volume-title":"Plant sociology: The study of plant communities","author":"J Braun-Blanquet","year":"1932","unstructured":"Braun-Blanquet, J. (1932). Plant sociology: The study of plant communities. New York: Authorized English translation of Panzensoziologie. McGraw-Hill."},{"key":"9413_CR8","doi-asserted-by":"publisher","first-page":"807","DOI":"10.1016\/j.patcog.2006.06.026","volume":"40","author":"M Brun","year":"2007","unstructured":"Brun, M., Sima, C., Hua, J., Lowey, J., Carroll, B., Suh, E., & Dougherty, E.R. (2007). Model-based evaluation of clustering validation measures. Pattern Recognition, 40, 807\u2013824.","journal-title":"Pattern Recognition"},{"key":"9413_CR9","doi-asserted-by":"crossref","unstructured":"Chac\u00f3n, J.E. (2019). A close-up comparison of the misclassification error distance and the adjusted Rand index for external clustering evaluation. arXiv:1907.11505.","DOI":"10.1111\/bmsp.12212"},{"key":"9413_CR10","unstructured":"Chac\u00f3n, J. E., & Rastrojo, A.I. (2020). Minimum adjusted Rand index for two clusterings of a given size. arXiv:2002.03677."},{"key":"9413_CR11","first-page":"1130","volume":"43","author":"AH Cheetham","year":"1969","unstructured":"Cheetham, A.H., & Hazel, J.E. (1969). Binary (presence-absence) similarity coefficients. Journal of Paleontology, 43, 1130\u20131136.","journal-title":"Journal of Paleontology"},{"key":"9413_CR12","doi-asserted-by":"publisher","first-page":"37","DOI":"10.1177\/001316446002000104","volume":"20","author":"J Cohen","year":"1960","unstructured":"Cohen, J. (1960). A coefficient of agreement for nominal scales. Educational and Psychological Measurement, 20, 37\u201346.","journal-title":"Educational and Psychological Measurement"},{"key":"9413_CR13","doi-asserted-by":"crossref","unstructured":"De Souto, M.C.P., Coelho, A.L.V., Faceli, K., Sakata, T.C., Bonadia, V., & Costa, I.G. (2012). A comparison of external clustering evaluation indices in the context of imbalanced data sets. Brazilian Symposium on Neural Networks, pp. 49\u201354.","DOI":"10.1109\/SBRN.2012.25"},{"key":"9413_CR14","doi-asserted-by":"publisher","first-page":"297","DOI":"10.2307\/1932409","volume":"26","author":"LR Dice","year":"1945","unstructured":"Dice, L.R. (1945). Measures of the amount of ecologic association between species. Ecology, 26, 297\u2013302.","journal-title":"Ecology"},{"key":"9413_CR15","first-page":"122","volume":"7","author":"MH Doolittle","year":"1885","unstructured":"Doolittle, M.H. (1885). The verification of predictions. Bulletin of the Philosophical Society of Washington, 7, 122\u2013127.","journal-title":"Bulletin of the Philosophical Society of Washington"},{"key":"9413_CR16","first-page":"211","volume":"31","author":"HE Driver","year":"1932","unstructured":"Driver, H.E., & Kroeber, A.L. (1932). Quantitative expression of cultural relationship. The University of California Publications in American Archaeology and Ethnology, 31, 211\u2013256.","journal-title":"The University of California Publications in American Archaeology and Ethnology"},{"key":"9413_CR17","doi-asserted-by":"publisher","first-page":"95","DOI":"10.1080\/01969727408546059","volume":"4","author":"J Dunn","year":"1974","unstructured":"Dunn, J. (1974). Well separated clusters and optimal fuzzy partitions. Cybernetics, 4, 95\u2013104.","journal-title":"Cybernetics"},{"key":"9413_CR18","doi-asserted-by":"publisher","first-page":"651","DOI":"10.2307\/2529549","volume":"31","author":"JL Fleiss","year":"1975","unstructured":"Fleiss, J.L. (1975). Measuring agreement between two judges on the presence or absence of a trait. Biometrics, 31, 651\u2013659.","journal-title":"Biometrics"},{"key":"9413_CR19","doi-asserted-by":"publisher","first-page":"553","DOI":"10.1080\/01621459.1983.10478008","volume":"78","author":"EB Fowlkes","year":"1983","unstructured":"Fowlkes, E.B., & Mallows, C.L. (1983). A method for comparing two hierarchical clusterings. Journal of the American Statistical Association, 78, 553\u2013569.","journal-title":"Journal of the American Statistical Association"},{"key":"9413_CR20","doi-asserted-by":"publisher","first-page":"3034","DOI":"10.1016\/j.patcog.2014.03.017","volume":"47","author":"P Fr\u00e4nti","year":"2014","unstructured":"Fr\u00e4nti, P., Rezaei, M., & Zhao, Q. (2014). Centroid index: Cluster level similarity measure. Pattern Recognition, 47, 3034\u20133045.","journal-title":"Pattern Recognition"},{"key":"9413_CR21","doi-asserted-by":"publisher","first-page":"21","DOI":"10.2307\/2480223","volume":"47","author":"HA Gleason","year":"1920","unstructured":"Gleason, H.A. (1920). Some applications of the quadrat method. Bulletin of the Torrey Botanical Club, 47, 21\u201333.","journal-title":"Bulletin of the Torrey Botanical Club"},{"key":"9413_CR22","doi-asserted-by":"publisher","first-page":"40","DOI":"10.1145\/565117.565124","volume":"31","author":"M Halkidi","year":"2002","unstructured":"Halkidi, M., & Batiskis, Y. (2002). Cluster validity methods: Part I. SIGMOD Record, 31, 40\u201345.","journal-title":"SIGMOD Record"},{"key":"9413_CR23","first-page":"639","volume":"2","author":"U Hamann","year":"1961","unstructured":"Hamann, U. (1961). Merkmalsbestand und Verwandtschaftsbeziehungen der Farinose. Ein Betrag zum System der Monokotyledonen. Willdenowia, 2, 639\u2013768.","journal-title":"Willdenowia"},{"key":"9413_CR24","doi-asserted-by":"crossref","unstructured":"Heiser, W.J., & Warrens, M.J. (2010). Families of relational statistics for 2\u00d72 tables. In H. Kaul H. Mulder (Eds.) Advances in interdisciplinary applied discrete mathematics (pp. 25\u201352). Singapore: World Scientific.","DOI":"10.1142\/9789814299152_0003"},{"key":"9413_CR25","doi-asserted-by":"publisher","DOI":"10.1201\/b19706","volume-title":"Handbook of cluster analysis","author":"C Hennig","year":"2015","unstructured":"Hennig, C., Meil\u0103, M., Murtagh, F., & Rocci, R. (2015). Handbook of cluster analysis. New York: Chapman and Hall\/CRC."},{"key":"9413_CR26","unstructured":"Horton, P., & Nakai, K. (1996). A probablistic classification system for predicting the cellular localization sites of proteins. Intelligent Systems in Molecular Biology, pp. 109\u2013115."},{"key":"9413_CR27","doi-asserted-by":"publisher","first-page":"669","DOI":"10.1111\/j.1469-185X.1982.tb00376.x","volume":"57","author":"Z Hub\u00e1lek","year":"1982","unstructured":"Hub\u00e1lek, Z. (1982). Coefficients of association and similarity based on binary (presence absence) data: An evaluation. Biological Reviews, 57, 669\u2013689.","journal-title":"Biological Reviews"},{"key":"9413_CR28","doi-asserted-by":"publisher","first-page":"98","DOI":"10.1111\/j.2044-8317.1977.tb00728.x","volume":"30","author":"LJ Hubert","year":"1977","unstructured":"Hubert, L.J. (1977). Nominal scale response agreement as a generalized correlation. British Journal of Mathematical and Statistical Psychology, 30, 98\u2013103.","journal-title":"British Journal of Mathematical and Statistical Psychology"},{"issue":"1","key":"9413_CR29","doi-asserted-by":"publisher","first-page":"193","DOI":"10.1007\/BF01908075","volume":"2","author":"LJ Hubert","year":"1985","unstructured":"Hubert, L.J., & Arabie, P. (1985). Comparing partitions. Journal of Classifications, 2(1), 193\u2013218.","journal-title":"Journal of Classifications"},{"key":"9413_CR30","doi-asserted-by":"publisher","first-page":"27","DOI":"10.1080\/01621459.2015.1086354","volume":"111","author":"Z Huo","year":"2016","unstructured":"Huo, Z., Ding, Y., Liu, S., Oesterreich, S., & Tseng, G. (2016). Meta-analytic framework for sparse K-means to identify disease subtypes in multiple transcriptomic studies. Journal of the American Statistical Association, 111, 27\u201352.","journal-title":"Journal of the American Statistical Association"},{"key":"9413_CR31","doi-asserted-by":"publisher","first-page":"37","DOI":"10.1111\/j.1469-8137.1912.tb05611.x","volume":"11","author":"P Jaccard","year":"1912","unstructured":"Jaccard, P. (1912). The distribution of the ora in the Alpine zone. The New Phytologist, 11, 37\u201350.","journal-title":"The New Phytologist"},{"issue":"8","key":"9413_CR32","doi-asserted-by":"publisher","first-page":"651","DOI":"10.1016\/j.patrec.2009.09.011","volume":"31","author":"AK Jain","year":"2010","unstructured":"Jain, A.K. (2010). Data clustering: 50 years beyond K-means. Pattern Recognition Letters, 31(8), 651\u2013666.","journal-title":"Pattern Recognition Letters"},{"key":"9413_CR33","doi-asserted-by":"publisher","first-page":"241","DOI":"10.1007\/BF02289588","volume":"32","author":"SC Johnson","year":"1967","unstructured":"Johnson, S.C. (1967). Hierarchical clustering schemes. Psychometrika, 32, 241\u2013254.","journal-title":"Psychometrika"},{"key":"9413_CR34","doi-asserted-by":"publisher","first-page":"260","DOI":"10.1186\/1471-2105-10-260","volume":"10","author":"E-Y Kim","year":"2009","unstructured":"Kim, E.-Y., Kim, S.-Y., Ashlock, D., & Nam, D. (2009). MULTI-K: Accurate classification of microarray subtypes using ensemble k-means clustering. BMC Bioinformatics, 10, 260.","journal-title":"BMC Bioinformatics"},{"key":"9413_CR35","first-page":"57","volume":"2","author":"S Kulczy\u0144ski","year":"1927","unstructured":"Kulczy\u0144ski, S. (1927). Die P anzenassociationen der Pienenen. Bulletin Interna- tional de l\u2019acad\u00e9mie Polonaise des Sciences et des Letters, Classe des Sciences Mathematiques et Naturelles, Serie B, Suppl\u00e9ment II, 2, 57\u2013203.","journal-title":"Bulletin Interna- tional de l\u2019acad\u00e9mie Polonaise des Sciences et des Letters, Classe des Sciences Mathematiques et Naturelles, Serie B, Suppl\u00e9ment II"},{"issue":"3","key":"9413_CR36","doi-asserted-by":"publisher","first-page":"519","DOI":"10.1109\/TSMC.1987.4309069","volume":"17","author":"TO Kvalseth","year":"1987","unstructured":"Kvalseth, T.O. (1987). Entropy and correlation: Some comments. IEEE Transactions on Systems, Man and Cybernetics, 17(3), 519\u2013519.","journal-title":"IEEE Transactions on Systems, Man and Cybernetics"},{"key":"9413_CR37","volume-title":"Numerical ecology","author":"P Legendre","year":"1998","unstructured":"Legendre, P., & Legendre, L. (1998). Numerical ecology. Amsterdam: Elsevier."},{"issue":"4","key":"9413_CR38","doi-asserted-by":"publisher","first-page":"1013","DOI":"10.1109\/TFUZZ.2016.2584644","volume":"25","author":"Y Lei","year":"2016","unstructured":"Lei, Y., Bezdek, J.C., Chan, J., Vinh, N., Romano, S., & Bailey, J. (2016). Extending information-theoretic validity indices for fuzzy clustering. IEEE Transactions on Fuzzy Systems, 25(4), 1013\u20131018.","journal-title":"IEEE Transactions on Fuzzy Systems"},{"key":"9413_CR39","unstructured":"Lichman, M. (2013). UCI Machine Learning Repository. Retrieved from http:\/\/archive.ics.uci.edu\/ml."},{"key":"9413_CR40","doi-asserted-by":"crossref","unstructured":"Loevinger, J.A. (1947). A systematic approach to the construction and evaluation of tests of ability. Psychometrika, Monograph No. 4.","DOI":"10.1037\/h0093565"},{"key":"9413_CR41","unstructured":"McConnaughey, B.H. (1964). The determination and analysis of plankton communities. Marine Research, Special No, Indonesia, pp. 1\u201340."},{"issue":"5","key":"9413_CR42","doi-asserted-by":"publisher","first-page":"873","DOI":"10.1016\/j.jmva.2006.11.013","volume":"98","author":"M Meil\u0103","year":"2007","unstructured":"Meil\u0103, M. (2007). Comparing clusterings. an information based distance. Journal of Multivariate Analysis, 98(5), 873\u2013895.","journal-title":"Journal of Multivariate Analysis"},{"key":"9413_CR43","unstructured":"Meil\u0103, M. (2016). Criteria for comparing clusterings. In C. Hennig, M. Meil\u0103, F. Murtagh, & R. Rocci (Eds.) Handbook of cluster analysis (pp. 619\u2013636). New York: Chapman and Hall\/CRC."},{"key":"9413_CR44","doi-asserted-by":"crossref","unstructured":"Milligan, G.W. (1996). Clustering validation: Results and implications for applied analyses. In P. Arabie, L. Hubert, & G. De Soete (Eds.) (pp. 341\u2013375). River Edge: World Scientific.","DOI":"10.1142\/9789812832153_0010"},{"key":"9413_CR45","doi-asserted-by":"publisher","first-page":"441","DOI":"10.1207\/s15327906mbr2104_5","volume":"21","author":"GW Milligan","year":"1986","unstructured":"Milligan, G.W., & Cooper, M.C. (1986). A study of the comparability of external criteria for hierarchical cluster analysis. Multivariate Behavioral Research, 21, 441\u2013458.","journal-title":"Multivariate Behavioral Research"},{"key":"9413_CR46","doi-asserted-by":"publisher","first-page":"526","DOI":"10.2331\/suisan.22.526","volume":"22","author":"A Ochiai","year":"1957","unstructured":"Ochiai, A. (1957). Zoogeographic studies on the soleoid fishes found in Japan and its neighboring regions. Bulletin of the Japanese Society for Fish Science, 22, 526\u2013530.","journal-title":"Bulletin of the Japanese Society for Fish Science"},{"key":"9413_CR47","doi-asserted-by":"publisher","first-page":"361","DOI":"10.1007\/s10115-008-0150-6","volume":"19","author":"D Pfitzner","year":"2009","unstructured":"Pfitzner, D., Leibbrandt, R., & Powers, D. (2009). Characterization and evaluation of similarity measures for pairs of clusterings. Knowledge and Information Systems, 19, 361\u2013394.","journal-title":"Knowledge and Information Systems"},{"issue":"3","key":"9413_CR48","doi-asserted-by":"publisher","first-page":"846","DOI":"10.1080\/01621459.1971.10482356","volume":"66","author":"WM Rand","year":"1971","unstructured":"Rand, W.M. (1971). Objective criteria for the evaluation of clustering methods. Journal of the American Statistical Association, 66(3), 846\u2013850.","journal-title":"Journal of the American Statistical Association"},{"key":"9413_CR49","doi-asserted-by":"publisher","first-page":"1115","DOI":"10.1126\/science.132.3434.1115","volume":"132","author":"DJ Rogers","year":"1960","unstructured":"Rogers, D.J., & Tanimoto, T.T. (1960). A computer program for classifying plants. Science, 132, 1115\u20131118.","journal-title":"Science"},{"key":"9413_CR50","doi-asserted-by":"publisher","first-page":"991","DOI":"10.1016\/0021-9681(66)90032-4","volume":"19","author":"E Rogot","year":"1966","unstructured":"Rogot, E., & Goldberg, I.D. (1966). A proposed index for measuring agreement in test-retest studies. Journal of Chronic Disease, 19, 991\u201310.","journal-title":"Journal of Chronic Disease"},{"key":"9413_CR51","doi-asserted-by":"publisher","first-page":"3997","DOI":"10.1128\/JCM.00624-11","volume":"49","author":"A Severiano","year":"2011","unstructured":"Severiano, A., Pinto, F.R., Ramirez, M., & Carri\u00e7o, J.A. (2011). Adjusted Wallace coefficient as a measure of congruence between typing methods. Journal of Clinical Microbiology, 49, 3997\u20134000.","journal-title":"Journal of Clinical Microbiology"},{"issue":"3","key":"9413_CR52","doi-asserted-by":"publisher","first-page":"623","DOI":"10.1002\/j.1538-7305.1948.tb00917.x","volume":"27","author":"CE Shannon","year":"1948","unstructured":"Shannon, C.E. (1948). A mathematical theory of communication. The Bell System Technical Journal, 27(3), 623\u2013656.","journal-title":"The Bell System Technical Journal"},{"key":"9413_CR53","doi-asserted-by":"publisher","first-page":"1","DOI":"10.2475\/ajs.241.1.1","volume":"241","author":"GG Simpson","year":"1943","unstructured":"Simpson, G.G. (1943). Mammals and the nature of continents. American Journal of Science, 241, 1\u201331.","journal-title":"American Journal of Science"},{"key":"9413_CR54","first-page":"1409","volume":"38","author":"RR Sokal","year":"1958","unstructured":"Sokal, R.R., & Michener, C.D. (1958). A statistical method for evaluating systematic relationships. University of Kansas Science Bulletin, 38, 1409\u20131438.","journal-title":"University of Kansas Science Bulletin"},{"key":"9413_CR55","volume-title":"Principles of numerical taxonomy","author":"RR Sokal","year":"1963","unstructured":"Sokal, R.R., & Sneath, P.H. (1963). Principles of numerical taxonomy. San Francisco: W. H. Freeman and Company."},{"key":"9413_CR56","first-page":"1","volume":"5","author":"T S\u00f8renson","year":"1948","unstructured":"S\u00f8renson, T. (1948). A method of stabilizing groups of equivalent amplitude in plant sociology based on the similarity of species content and its application to analyses of the vegetation on Danish commons. Kongelige Danske Videnskabernes Selskab Biologiske Skrifter, 5, 1\u201334.","journal-title":"Kongelige Danske Videnskabernes Selskab Biologiske Skrifter"},{"key":"9413_CR57","doi-asserted-by":"crossref","unstructured":"Sorgenfrei, T. (1958). Molluscan Assemblages From the Marine Middle Miocene of South Jutland and Their Environments. Copenhagen: Reitzel.","DOI":"10.34194\/raekke2.v79.6869"},{"issue":"3","key":"9413_CR58","doi-asserted-by":"publisher","first-page":"386","DOI":"10.1037\/1082-989X.9.3.386","volume":"9","author":"D Steinley","year":"2004","unstructured":"Steinley, D. (2004). Properties of the Hubert-Arabie adjusted Rand index. Psychological Methods, 9(3), 386\u2013396.","journal-title":"Psychological Methods"},{"key":"9413_CR59","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1348\/000711005X48266","volume":"59","author":"D Steinley","year":"2006","unstructured":"Steinley, D. (2006). K-means clustering: A half-century synthesis. British Journal of Mathematical and Statistical Psychology, 59, 1\u201334.","journal-title":"British Journal of Mathematical and Statistical Psychology"},{"issue":"2","key":"9413_CR60","doi-asserted-by":"publisher","first-page":"261","DOI":"10.1037\/met0000049","volume":"21","author":"D Steinley","year":"2016","unstructured":"Steinley, D., Brusco, M.J., & Hubert, L.J. (2016). The variance of the adjusted Rand index. Psychological Methods, 21(2), 261\u2013272.","journal-title":"Psychological Methods"},{"key":"9413_CR61","doi-asserted-by":"publisher","first-page":"114","DOI":"10.1007\/s00357-015-9169-z","volume":"32","author":"D Steinley","year":"2015","unstructured":"Steinley, D., Hendrickson, G., & Brusco, M.J. (2015). A note on maximizing the agreement between partitions: A stepwise optimal algorithm and some properties. Journal of Classification, 32, 114\u2013126.","journal-title":"Journal of Classification"},{"key":"9413_CR62","unstructured":"Ting, K.M. (2011). Sensitivity and specificity. In C. Sammut G. Webb (Eds.) Encyclopedia of machine learning. Boston: Springer."},{"key":"9413_CR63","doi-asserted-by":"publisher","first-page":"353","DOI":"10.1007\/s41237-018-0075-7","volume":"46","author":"H Van der Hoef","year":"2019","unstructured":"Van der Hoef, H., & Warrens, M.J. (2019). Understanding information theoretic measures for comparing clusterings. Behaviormetrika, 46, 353\u2013370.","journal-title":"Behaviormetrika"},{"key":"9413_CR64","doi-asserted-by":"crossref","unstructured":"Van der Hoef, H., & Warrens, M.J. (2020). Understanding Malvestuto\u2019s normalized mutual information. In T. Imaizumi, A. Okada, S. Miyamoto, F. Sakaori, Y. Yamamoto, & M. Vichi (Eds.) Advanced Studies in Classification and Data Science (pp. 289\u2013299). Springer.","DOI":"10.1007\/978-981-15-3311-2_23"},{"key":"9413_CR65","first-page":"21","volume":"19","author":"E Van der Maarel","year":"1969","unstructured":"Van der Maarel, E. (1969). On the use of ordination models in phytosociology. Vegetatio, 19, 21\u201346.","journal-title":"Vegetatio"},{"key":"9413_CR66","doi-asserted-by":"crossref","unstructured":"Vinh, N.X., Epps, J., & Bailey, J. (2009). Information theoretic measures for clusterings comparison: Is a correction for chance necessary?. In Icml \u201909 proceedings of the 26th international conference on machine learning (pp. 1073\u20131080). New York: ACM.","DOI":"10.1145\/1553374.1553511"},{"key":"9413_CR67","first-page":"2837","volume":"11","author":"NX Vinh","year":"2010","unstructured":"Vinh, N.X., Epps, J., & Bailey, J. (2010). Information theoretic measures for clustering comparison: Variants, properties, normalization and correction for chance. Journal of Machine Learning Research, 11, 2837\u20132854.","journal-title":"Journal of Machine Learning Research"},{"key":"9413_CR68","first-page":"569","volume":"78","author":"D Wallace","year":"1983","unstructured":"Wallace, D. (1983). Comment on a method for comparing two hierarchical clusterings. Journal of the American Statistical Association, 78, 569\u2013576.","journal-title":"Journal of the American Statistical Association"},{"key":"9413_CR69","doi-asserted-by":"publisher","first-page":"195","DOI":"10.1007\/s00357-008-9024-6","volume":"25","author":"MJ Warrens","year":"2008","unstructured":"Warrens, M.J. (2008a). Bounds of resemblance measures for binary (presence\/absence) variables. Journal of Classification, 25, 195\u2013208.","journal-title":"Journal of Classification"},{"issue":"3","key":"9413_CR70","doi-asserted-by":"publisher","first-page":"487","DOI":"10.1007\/s11336-008-9059-y","volume":"73","author":"MJ Warrens","year":"2008","unstructured":"Warrens, M.J. (2008b). On similarity coefficients for 2\u00d72 tables and correction for chance. Psychometrika, 73(3), 487\u2013502.","journal-title":"Psychometrika"},{"issue":"2","key":"9413_CR71","doi-asserted-by":"publisher","first-page":"177","DOI":"10.1007\/s00357-008-9023-7","volume":"25","author":"MJ Warrens","year":"2008","unstructured":"Warrens, M.J. (2008c). On the equivalence of Cohen\u2019s kappa and the Hubert-Arabie adjusted Rand index. Journal of Classification, 25(2), 177\u2013183.","journal-title":"Journal of Classification"},{"key":"9413_CR72","doi-asserted-by":"publisher","first-page":"125","DOI":"10.1007\/s00357-008-9006-8","volume":"25","author":"MJ Warrens","year":"2008","unstructured":"Warrens, M.J. (2008d). On the indeterminacy of resemblance measures for binary (presence\/absence) data. Journal of Classification, 25, 125\u2013136.","journal-title":"Journal of Classification"},{"key":"9413_CR73","volume-title":"Similarity coefficients for binary data: Properties of coefficients, coefficient matrices multi-way metrics and multivariate coefficients (Unpublished doctoral dissertation)","author":"MJ Warrens","year":"2008","unstructured":"Warrens, M.J. (2008e). Similarity coefficients for binary data: Properties of coefficients, coefficient matrices multi-way metrics and multivariate coefficients (Unpublished doctoral dissertation). Leiden: Leiden University."},{"key":"9413_CR74","doi-asserted-by":"publisher","first-page":"3005","DOI":"10.3233\/JIFS-172291","volume":"36","author":"MJ Warrens","year":"2019","unstructured":"Warrens, M.J. (2019). Similarity measures for 2\u00d72 tables. Journal of Intelligent and Fuzzy Systems, 36, 3005\u20133018.","journal-title":"Journal of Intelligent and Fuzzy Systems"},{"key":"9413_CR75","doi-asserted-by":"crossref","unstructured":"Warrens, M.J., & Van der Hoef, H. (2020). Understanding the Rand index. In T. Imaizumi, A. Okada, S. Miyamoto, F. Sakaori, Y. Yamamoto, & M. Vichi (Eds.) Advanced Studies in Classification and Data Science (pp. 301\u2013313). Springer.","DOI":"10.1007\/978-981-15-3311-2_24"},{"key":"9413_CR76","doi-asserted-by":"publisher","first-page":"81","DOI":"10.1016\/j.ins.2012.02.019","volume":"198","author":"Z Yu","year":"2012","unstructured":"Yu, Z., You, J., Wong, H.-S., & Han, G. (2012). From cluster ensemble to structure ensemble. Information Sciences, 198, 81\u201399.","journal-title":"Information Sciences"},{"key":"9413_CR77","doi-asserted-by":"publisher","first-page":"579","DOI":"10.2307\/2340126","volume":"75","author":"GU Yule","year":"1912","unstructured":"Yule, G.U. (1912). On the methods of measuring the association between two attributes. Journal of the Royal Statistical Society, 75, 579\u2013652.","journal-title":"Journal of the Royal Statistical Society"}],"container-title":["Journal of Classification"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s00357-022-09413-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s00357-022-09413-z\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s00357-022-09413-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,12,15]],"date-time":"2022-12-15T14:14:29Z","timestamp":1671113669000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s00357-022-09413-z"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,7,22]]},"references-count":77,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2022,11]]}},"alternative-id":["9413"],"URL":"https:\/\/doi.org\/10.1007\/s00357-022-09413-z","relation":{},"ISSN":["0176-4268","1432-1343"],"issn-type":[{"value":"0176-4268","type":"print"},{"value":"1432-1343","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,7,22]]},"assertion":[{"value":"23 March 2022","order":1,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"22 July 2022","order":2,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"The authors declare no competing interests. Furthermore, the research study did not involve human participants and\/or animals.","order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}}]}}