{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,20]],"date-time":"2025-11-20T12:24:04Z","timestamp":1763641444003},"reference-count":96,"publisher":"Springer Science and Business Media LLC","issue":"1-2","license":[{"start":{"date-parts":[[2011,3,18]],"date-time":"2011-03-18T00:00:00Z","timestamp":1300406400000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Mach Learn"],"published-print":{"date-parts":[[2011,10]]},"DOI":"10.1007\/s10994-011-5236-9","type":"journal-article","created":{"date-parts":[[2011,3,17]],"date-time":"2011-03-17T18:20:58Z","timestamp":1300386058000},"page":"209-248","source":"Crossref","is-referenced-by-count":9,"title":["Resampling approach for cluster model selection"],"prefix":"10.1007","volume":"85","author":[{"given":"Z.","family":"Volkovich","sequence":"first","affiliation":[]},{"given":"Z.","family":"Barzily","sequence":"additional","affiliation":[]},{"given":"G.-W.","family":"Weber","sequence":"additional","affiliation":[]},{"given":"D.","family":"Toledano-Kitai","sequence":"additional","affiliation":[]},{"given":"R.","family":"Avros","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2011,3,18]]},"reference":[{"key":"5236_CR1","first-page":"821","volume":"25","author":"M. A. Aizerman","year":"1964","unstructured":"Aizerman, M. A., Braverman, E. M., & Rozono, L. I. (1964). Theoretical foundations of the potential function method in pattern recognition learning. Automation and Remote Control, 25, 821\u2013837.","journal-title":"Automation and Remote Control"},{"issue":"1","key":"5236_CR2","doi-asserted-by":"crossref","first-page":"41","DOI":"10.1006\/jmva.1994.1033","volume":"50","author":"N. H. Anderson","year":"1994","unstructured":"Anderson, N. H., Hall, P., & Titterington, M. (1994). Two-sample test statistics for measuring discrepancies between two multivariate probability density functions using kernel-based density estimates. Journal of Multivariate Analysis, 50(1), 41\u201354.","journal-title":"Journal of Multivariate Analysis"},{"key":"5236_CR3","doi-asserted-by":"crossref","unstructured":"Aronszajn, N. (1950). Theory of reproducing kernels. Transactions of the American Mathematical Society, 68.","DOI":"10.1090\/S0002-9947-1950-0051437-7"},{"issue":"1","key":"5236_CR4","doi-asserted-by":"crossref","first-page":"190","DOI":"10.1016\/S0047-259X(03)00079-4","volume":"88","author":"L. Baringhaus","year":"2004","unstructured":"Baringhaus, L., & Franz, C. (2004). On a new multivariate two-sample test. Journal of Multivariate Analysis, 88(1), 190\u2013206.","journal-title":"Journal of Multivariate Analysis"},{"issue":"2","key":"5236_CR5","doi-asserted-by":"crossref","first-page":"187","DOI":"10.15388\/Informatica.2009.245","volume":"20","author":"Z. Barzily","year":"2009","unstructured":"Barzily, Z., Volkovich, Z., Akteke-Ozturk, B., & Weber, G.-W. (2009). On a minimal spanning tree approach in the cluster validation problem. Informatica, 20(2), 187\u2013202.","journal-title":"Informatica"},{"issue":"1","key":"5236_CR6","doi-asserted-by":"crossref","first-page":"1682","DOI":"10.1007\/s10958-005-0128-9","volume":"127","author":"Ya. Belopolskaya","year":"2005","unstructured":"Belopolskaya, Ya., Klebanov, L., & Volkovich, V. (2005). Characterization of elliptic distributions. Journal of Mathematical Sciences, 127(1), 1682\u20131686.","journal-title":"Journal of Mathematical Sciences"},{"key":"5236_CR7","first-page":"159","volume-title":"Methods in molecular biology","author":"A. Ben-Hur","year":"2003","unstructured":"Ben-Hur, A., & Guyon, I. (2003). Detecting stable clusters using principal component analysis. In M.\u00a0J.\u00a0Brownstein & A. Khodursky (Eds.), Methods in molecular biology (pp. 159\u2013182). Clifton: Humana Press."},{"key":"5236_CR8","first-page":"125","volume":"2","author":"A. Ben-Hur","year":"2001","unstructured":"Ben-Hur, A., Horn, D., Siegelmann, H. T., & Vapnik, V. (2001). Support vector clustering. Journal of Machine Learning Research, 2, 125\u2013137.","journal-title":"Journal of Machine Learning Research"},{"key":"5236_CR9","first-page":"6","volume-title":"Pacific symposium on biocomputing","author":"A. Ben-Hur","year":"2002","unstructured":"Ben-Hur, A., Elisseeff, A., & Guyon, I. (2002). A stability based method for discovering structure in clustered data. In Pacific symposium on biocomputing (pp. 6\u201317)."},{"key":"5236_CR10","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4612-1128-0","volume-title":"Harmonic analysis on semigroups","author":"C. Berg","year":"1984","unstructured":"Berg, C., Christensen, J. P. R., & Ressel, P. (1984). Harmonic analysis on semigroups. Berlin: Springer."},{"key":"5236_CR11","doi-asserted-by":"crossref","first-page":"147","DOI":"10.1207\/s15327906mbr2402_1","volume":"24","author":"J. Breckenridge","year":"1989","unstructured":"Breckenridge, J. (1989). Replicating cluster analysis: method, consistency and validity. Multivariate Behavioral Research, 24, 147\u2013161.","journal-title":"Multivariate Behavioral Research"},{"key":"5236_CR12","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1080\/03610927408827101","volume":"3","author":"R. Calinski","year":"1974","unstructured":"Calinski, R., & Harabasz, J. (1974). A dendrite method for cluster analysis. Communications in Statistics, 3, 1\u201327.","journal-title":"Communications in Statistics"},{"issue":"3","key":"5236_CR13","doi-asserted-by":"crossref","first-page":"315","DOI":"10.1016\/0167-9473(92)90042-E","volume":"14","author":"G. Celeux","year":"1992","unstructured":"Celeux, G., & Govaert, G. (1992). A classification EM algorithm for clustering and two stochastic versions. Computational Statistics & Data Analysis, 14(3), 15, 315\u2013332.","journal-title":"Computational Statistics & Data Analysis"},{"issue":"5","key":"5236_CR14","doi-asserted-by":"crossref","first-page":"1250","DOI":"10.1109\/72.536318","volume":"7","author":"S. V. Chakravarthy","year":"1996","unstructured":"Chakravarthy, S. V., & Ghosh, J. (1996). Scale-based clustering using the radial basis function network. IEEE Transactions on Neural Networks, 7(5), 1250\u20131261.","journal-title":"IEEE Transactions on Neural Networks"},{"key":"5236_CR15","doi-asserted-by":"crossref","first-page":"315","DOI":"10.1007\/BF01246105","volume":"13","author":"R. Cheng","year":"1996","unstructured":"Cheng, R., & Milligan, G. W. (1996). Measuring the influence of individual data points in a cluster analysis. Journal of Classification, 13, 315\u2013335.","journal-title":"Journal of Classification"},{"key":"5236_CR16","doi-asserted-by":"crossref","first-page":"351","DOI":"10.2307\/1268225","volume":"23","author":"W. J. Conover","year":"1981","unstructured":"Conover, W. J., Johnson, M. E., & Johnson, M. M. (1981). Comparative study of tests of homogeneity of variances, with applications to the outer continental shelf bidding data. Technometrics, 23, 351\u2013361.","journal-title":"Technometrics"},{"key":"5236_CR17","doi-asserted-by":"crossref","unstructured":"Cortes, C., & Vapnik, V. (1995). Support-vector networks. Machine Learning, 273\u2013297.","DOI":"10.1007\/BF00994018"},{"issue":"2","key":"5236_CR18","doi-asserted-by":"crossref","first-page":"367","DOI":"10.2307\/3315985","volume":"28","author":"A. Cuevas","year":"2000","unstructured":"Cuevas, A., Febrero, M., & Fraiman, R. (2000). Estimating the number of clusters. Canadian Journal of Statistics, 28(2), 367\u2013382.","journal-title":"Canadian Journal of Statistics"},{"key":"5236_CR19","doi-asserted-by":"crossref","first-page":"441","DOI":"10.1016\/S0167-9473(00)00052-9","volume":"28","author":"A. Cuevas","year":"2001","unstructured":"Cuevas, A., Febrero, M., & Fraiman, R. (2001). Cluster analysis: a further approach based on density estimation. Computational Statistics & Data Analysis, 28, 441\u2013459.","journal-title":"Computational Statistics & Data Analysis"},{"issue":"1","key":"5236_CR20","doi-asserted-by":"crossref","first-page":"143","DOI":"10.1023\/A:1007612920971","volume":"42","author":"I. S. Dhillon","year":"2001","unstructured":"Dhillon, I. S., & Modha, D. S. (2001). Concept decompositions for large sparse text data using clustering. Machine Learning, 42(1), 143\u2013175. Also appears as IBM Research Report RJ 10147, July 1999.","journal-title":"Machine Learning"},{"key":"5236_CR21","first-page":"73","volume-title":"A comprehensive survey of text mining","author":"I. Dhillon","year":"2003","unstructured":"Dhillon, I., Kogan, J., & Nicholas, C. (2003). Feature selection and document clustering. In A comprehensive survey of text mining (pp. 73\u2013100). Berlin: Springer."},{"issue":"7","key":"5236_CR22","doi-asserted-by":"crossref","DOI":"10.1186\/gb-2002-3-7-research0036","volume":"3","author":"S. Dudoit","year":"2002","unstructured":"Dudoit, S., & Fridlyand, J. (2002). A prediction-based resampling method for estimating the number of clusters in a dataset. Genome Biology, 3(7), 0036.","journal-title":"Genome Biology"},{"key":"5236_CR23","doi-asserted-by":"crossref","first-page":"95","DOI":"10.1080\/01969727408546059","volume":"4","author":"J. C. Dunn","year":"1974","unstructured":"Dunn, J. C. (1974). Well separated clusters and optimal fuzzy partitions. Journal of Cybernetics, 4, 95\u2013104.","journal-title":"Journal of Cybernetics"},{"key":"5236_CR24","doi-asserted-by":"crossref","first-page":"1287","DOI":"10.1080\/03610927608827443","volume":"5","author":"B. S. Duran","year":"1976","unstructured":"Duran, B. S. (1976). A survey of nonparametric tests for scale. Communications in Statistics. Theory and Methods, 5, 1287\u20131312.","journal-title":"Communications in Statistics. Theory and Methods"},{"key":"5236_CR25","volume-title":"Proceedings of the twentieth annual conference on neural information processing systems (NIPS)","author":"Y. Feng","year":"2006","unstructured":"Feng, Y., & Hamerly, G. (2006). PG-means: learning the number of clusters in data. In Proceedings of the twentieth annual conference on neural information processing systems (NIPS)."},{"issue":"1","key":"5236_CR26","doi-asserted-by":"crossref","first-page":"176","DOI":"10.1016\/j.patcog.2007.05.018","volume":"41","author":"M. Filippone","year":"2008","unstructured":"Filippone, M., Camastra, F., Masulli, F., & Rovetta, S. (2008). A survey of kernel and spectral methods for clustering. Pattern Recognition, 41(1), 176\u2013190.","journal-title":"Pattern Recognition"},{"key":"5236_CR27","doi-asserted-by":"crossref","first-page":"697","DOI":"10.1214\/aos\/1176344722","volume":"7","author":"J. H. Friedman","year":"1979","unstructured":"Friedman, J. H., & Rafsky, L. C. (1979). Multivariate generalizations of the Wolfowitz and Smirnov two-sample tests. Annals of Statistics, 7, 697\u2013717.","journal-title":"Annals of Statistics"},{"issue":"3","key":"5236_CR28","doi-asserted-by":"crossref","first-page":"780","DOI":"10.1109\/TNN.2002.1000150","volume":"13","author":"M. Girolami","year":"2002","unstructured":"Girolami, M. (2002). Mercer kernel-based clustering in feature space. IEEE Transactions on Neural Networks, 13(3), 780\u2013784.","journal-title":"IEEE Transactions on Neural Networks"},{"issue":"2","key":"5236_CR29","doi-asserted-by":"crossref","first-page":"158","DOI":"10.1109\/34.982897","volume":"24","author":"E. Gokcay","year":"2002","unstructured":"Gokcay, E., & Principe, J. C. (2002). Information theoretic clustering. IEEE Transactions on Pattern Analysis and Machine Intelligence, 24(2), 158\u2013171.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"5236_CR30","doi-asserted-by":"crossref","first-page":"561","DOI":"10.1016\/0167-9473(94)90085-X","volume":"18","author":"A. D. Gordon","year":"1994","unstructured":"Gordon, A. D. (1994). Identifying genuine clusters in a classification. Computational Statistics & Data Analysis, 18, 561\u2013581.","journal-title":"Computational Statistics & Data Analysis"},{"key":"5236_CR31","doi-asserted-by":"crossref","DOI":"10.1201\/9780367805302","volume-title":"Classification","author":"A. D. Gordon","year":"1999","unstructured":"Gordon, A. D. (1999). Classification. Boca Raton: Chapman and Hall\/CRC."},{"key":"5236_CR32","first-page":"513","volume-title":"Advances in neural information processing systems","author":"A. Gretton","year":"2007","unstructured":"Gretton, A., Borgwardt, K., Rasch, M., Sch\u00f6lkopf, B., & Smola, A. (2007a). A kernel method for the two-sample-problem. In Advances in neural information processing systems (Vol.\u00a019, pp.\u00a0513\u2013520). Cambridge: MIT Press."},{"key":"5236_CR33","first-page":"1637","volume-title":"Proceedings of the 22nd conference on artificial intelligence (AAAI-07)","author":"A. Gretton","year":"2007","unstructured":"Gretton, A., Borgwardt, K., Rasch, M., Sch\u00f6lkopf, B., & Smola, A. (2007b). A kernel approach to comparing distributions. In Proceedings of the 22nd conference on artificial intelligence (AAAI-07) (pp. 1637\u20131641)."},{"key":"5236_CR34","doi-asserted-by":"crossref","unstructured":"Gretton, A., Borgwardt, K. M., Rasch, M. J., Sch\u00f6lkopf, B., & Smola, A. J. (2008a). A kernel method for the two-sample problem. CoRR. arXiv:0805.2368 . DBLP, http:\/\/dblp.uni-trier.de .","DOI":"10.7551\/mitpress\/7503.003.0069"},{"key":"5236_CR35","first-page":"1","volume":"4","author":"A. Gretton","year":"2008","unstructured":"Gretton, A., Borgwardt, K. M., Rasch, M. J., Sch\u00f6lkopf, B., & Smola, A. (2008b). A kernel method for the two-sample problem. Journal of Machine Learning Research, 4, 1\u201310.","journal-title":"Journal of Machine Learning Research"},{"issue":"2","key":"5236_CR36","doi-asserted-by":"crossref","first-page":"359","DOI":"10.1093\/biomet\/89.2.359","volume":"89","author":"P. Hall","year":"2002","unstructured":"Hall, P., & Tajvidi, N. (2002). Permutation tests for equality of distributions in high-dimensional settings. Biometrika, 89(2), 359\u2013374.","journal-title":"Biometrika"},{"key":"5236_CR37","first-page":"281","volume-title":"Proceedings of the seventeenth annual conference on neural information processing systems (NIPS)","author":"G. Hamerly","year":"2003","unstructured":"Hamerly, G., & Elkan, Ch. (2003). Learning the k in k-means. In Proceedings of the seventeenth annual conference on neural information processing systems (NIPS) (pp. 281\u2013288)."},{"key":"5236_CR38","volume-title":"Clustering algorithms","author":"J. A. Hartigan","year":"1975","unstructured":"Hartigan, J. A. (1975). Clustering algorithms. New York: Wiley."},{"key":"5236_CR39","doi-asserted-by":"crossref","first-page":"388","DOI":"10.2307\/2287840","volume":"76","author":"J. A. Hartigan","year":"1981","unstructured":"Hartigan, J. A. (1981). Consistency of single linkage for high-density clusters. Journal of the American Statistical Association, 76, 388\u2013394.","journal-title":"Journal of the American Statistical Association"},{"key":"5236_CR40","doi-asserted-by":"crossref","first-page":"63","DOI":"10.1007\/BF01908064","volume":"2","author":"J. A. Hartigan","year":"1985","unstructured":"Hartigan, J. A. (1985). Statistical theory in clustering. Journal of Classification, 2, 63\u201376.","journal-title":"Journal of Classification"},{"key":"5236_CR41","unstructured":"Haussler, D. (1999). Convolution kernels on discrete structures (UCSC-CRL-9910). Department of Computer Science University of California at Santa Cruz."},{"key":"5236_CR42","doi-asserted-by":"crossref","first-page":"772","DOI":"10.1214\/aos\/1176350835","volume":"16","author":"N. Henze","year":"1988","unstructured":"Henze, N. (1988). A multivariate two-sample test based on the number of nearest neighbor type coincidences. Annals of Statistics, 16, 772\u2013783.","journal-title":"Annals of Statistics"},{"key":"5236_CR43","doi-asserted-by":"crossref","first-page":"190","DOI":"10.1111\/j.2044-8317.1976.tb00714.x","volume":"76","author":"L. Hubert","year":"1976","unstructured":"Hubert, L., & Schultz, J. (1976). Quadratic assignment as a general data-analysis strategy. British Journal of Mathematical & Statistical Psychology, 76, 190\u2013241.","journal-title":"British Journal of Mathematical & Statistical Psychology"},{"key":"5236_CR44","volume-title":"Algorithms for clustering data","author":"A. Jain","year":"1988","unstructured":"Jain, A., & Dubes, R. (1988). Algorithms for clustering data. New Jersey: Englewood Cliffs\/Prentice-Hall."},{"issue":"5","key":"5236_CR45","doi-asserted-by":"crossref","first-page":"547","DOI":"10.1016\/0031-3203(87)90081-1","volume":"20","author":"A. K. Jain","year":"1987","unstructured":"Jain, A. K., & Moreau, J. V. (1987). Bootstrap technique in cluster analysis. Pattern Recognition, 20(5), 547\u2013568.","journal-title":"Pattern Recognition"},{"key":"5236_CR46","doi-asserted-by":"crossref","first-page":"928","DOI":"10.2307\/2291327","volume":"90","author":"R. E. Kass","year":"1995","unstructured":"Kass, R. E. (1995). A reference Bayesian test for nested hypotheses and its relationship to the Schwarz criterion. Journal of the American Statistical Association, 90, 928\u2013934.","journal-title":"Journal of the American Statistical Association"},{"key":"5236_CR47","doi-asserted-by":"crossref","DOI":"10.1002\/9780470316801","volume-title":"Finding groups in data","author":"L. Kaufman","year":"1990","unstructured":"Kaufman, L., & Rousseeuw, P. J. (1990). Finding groups in data. New York: Wiley."},{"key":"5236_CR48","unstructured":"Klebanov, L. (2003). One class of distribution free multivariate tests. SPb. Math. Society, Preprint, 03."},{"key":"5236_CR49","volume-title":"N-distances and their applications","author":"L. B. Klebanov","year":"2005","unstructured":"Klebanov, L. B. (2005). N-distances and their applications. Charsel University in Prague, The Karolinum Press."},{"key":"5236_CR50","doi-asserted-by":"crossref","first-page":"241","DOI":"10.1016\/S0167-7152(01)00011-6","volume":"53","author":"L. Klebanov","year":"2001","unstructured":"Klebanov, L., Kozubowskii, T., Rachev, S., & Volkovich, V. (2001). Characterization of distributions symmetric with respect to a group of transformations and testing of corresponding statistical hypothesis. Statistics & Probability Letters, 53, 241\u2013247.","journal-title":"Statistics & Probability Letters"},{"key":"5236_CR51","doi-asserted-by":"crossref","unstructured":"Kogan, J., Nicholas, C., & Volkovich, V. (2003a). Text mining with information\u2013theoretical clustering. Computing in Science and Engineering, 52\u201359.","DOI":"10.1109\/MCISE.2003.1238704"},{"key":"5236_CR52","first-page":"5","volume-title":"Proceedings of the workshop on text mining","author":"J. Kogan","year":"2003","unstructured":"Kogan, J., Nicholas, C., & Volkovich, V. (2003b). Text mining with hybrid clustering schemes. In M. W. Berry & W. M. Pottenger (Eds.), Proceedings of the workshop on text mining (pp.\u00a05\u201316). Held in conjunction with the third SIAM international conference on data mining."},{"key":"5236_CR53","volume-title":"Proceedings of the workshop on clustering high dimensional data and its applications","author":"J. Kogan","year":"2003","unstructured":"Kogan, J., Teboulle, M., & Nicholas, C. (2003c). Optimization approach to generating families of k-means like algorithms. In Proceedings of the workshop on clustering high dimensional data and its applications. Held in conjunction with the third SIAM international conference on data mining."},{"key":"5236_CR54","doi-asserted-by":"crossref","first-page":"23","DOI":"10.2307\/2531893","volume":"44","author":"W. Krzanowski","year":"1985","unstructured":"Krzanowski, W., & Lai, Y. (1985). A criterion for determining the number of groups in a dataset using sum of squares clustering. Biometrics, 44, 23\u201334.","journal-title":"Biometrics"},{"key":"5236_CR55","doi-asserted-by":"crossref","first-page":"83","DOI":"10.1002\/nav.3800020109","volume":"2","author":"H. Kuhn","year":"1955","unstructured":"Kuhn, H. (1955). The Hungarian method for the assignment problem. Naval Research Logistics Quarterly, 2, 83\u201397.","journal-title":"Naval Research Logistics Quarterly"},{"key":"5236_CR56","unstructured":"Lange, T., Braun, M., Roth, V., & Buhmann, J. M. (2003). Stability-based model selection. Advances in Neural Information Processing Systems, 15. http:\/\/citeseer.ist.psu.edu\/700728.html ."},{"issue":"6","key":"5236_CR57","doi-asserted-by":"crossref","first-page":"1299","DOI":"10.1162\/089976604773717621","volume":"15","author":"T. Lange","year":"2004","unstructured":"Lange, T., Roth, V., Braun, M., & Buhmann, J. M. (2004). Stability-based validation of clustering solutions. Neural Computation, 15(6), 1299\u20131323.","journal-title":"Neural Computation"},{"key":"5236_CR58","doi-asserted-by":"crossref","first-page":"2573","DOI":"10.1162\/089976601753196030","volume":"13","author":"E. Levine","year":"2001","unstructured":"Levine, E., & Domany, E. (2001). Resampling method for unsupervised estimation of cluster validity. Neural Computation, 13, 2573\u20132593.","journal-title":"Neural Computation"},{"key":"5236_CR59","volume-title":"Characteristic functions","author":"E. Lukacs","year":"1970","unstructured":"Lukacs, E. (1970). Characteristic functions. Duxbury: Griffin."},{"key":"5236_CR60","doi-asserted-by":"crossref","first-page":"159","DOI":"10.1007\/BF02294245","volume":"50","author":"G. Milligan","year":"1985","unstructured":"Milligan, G., & Cooper, M. (1985). An examination of procedures for determining the number of clusters in a data set. Psychometrika, 50, 159\u2013179.","journal-title":"Psychometrika"},{"key":"5236_CR61","first-page":"404","volume-title":"Proceedings of ASMDA 2005","author":"G. B. Mufti","year":"2005","unstructured":"Mufti, G. B., Bertrand, P., & El Moubarki, L. (2005). Determining the number of groups from measures of cluster validity. In Proceedings of ASMDA 2005 (pp. 404\u2013414)."},{"key":"5236_CR62","doi-asserted-by":"crossref","first-page":"1065","DOI":"10.1214\/aoms\/1177704472","volume":"32","author":"E. Parzen","year":"1962","unstructured":"Parzen, E. (1962). On the estimation of a probability density function and the mode. Annals of Mathematical Statistics, 32, 1065\u20131076.","journal-title":"Annals of Mathematical Statistics"},{"key":"5236_CR63","doi-asserted-by":"crossref","first-page":"454","DOI":"10.1016\/j.patrec.2009.07.009","volume":"31","author":"D. Pascual","year":"2010","unstructured":"Pascual, D., Pla, F., & Sanche, J. S. (2010). Cluster validation using information stability measures. Pattern Recognition Letters, 31, 454\u2013461.","journal-title":"Pattern Recognition Letters"},{"key":"5236_CR64","first-page":"727","volume-title":"Proceedings of the 17th international conf. on machine learning","author":"D. Pelleg","year":"2000","unstructured":"Pelleg, D., & Moore, A. (2000). X-means: Extending K-means with efficient estimation of the number of clusters. In Proceedings of the 17th international conf. on machine learning (pp. 727\u2013734). San Francisco: Morgan Kaufmann."},{"key":"5236_CR65","first-page":"265","volume-title":"Unsupervised adaptive filtering, I","author":"J. Principe","year":"2002","unstructured":"Principe, J., Xu, D., & Fisher, J. (2002). Information theoretic learning. In Unsupervised adaptive filtering, I (pp. 265\u2013319). New York: Wiley."},{"key":"5236_CR66","series-title":"Wiley series in probability and mathematical statistics","volume-title":"Probability metrics and the stability of stochastic models","author":"S. T. Rachev","year":"1991","unstructured":"Rachev, S. T. (1991). Wiley series in probability and mathematical statistics. Probability metrics and the stability of stochastic models. Chichester: Wiley."},{"issue":"8","key":"5236_CR67","doi-asserted-by":"crossref","first-page":"945","DOI":"10.1103\/PhysRevLett.65.945","volume":"65","author":"K. Rose","year":"1990","unstructured":"Rose, K., Gurewitz, E., & Fox, G. C. (1990). Statistical mechanics and phase transitions in clustering. Physical Review Letters, 65(8), 945\u2013948.","journal-title":"Physical Review Letters"},{"issue":"8","key":"5236_CR68","doi-asserted-by":"crossref","first-page":"785","DOI":"10.1109\/34.236251","volume":"15","author":"K. Rose","year":"1993","unstructured":"Rose, K., Gurewitz, E., & Fox, G. C. (1993). Constrained clustering as an optimization method. IEEE Transactions on Pattern Analysis and Machine Intelligence, 15(8), 785\u2013794.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"issue":"1\u20133","key":"5236_CR69","first-page":"23","volume":"72","author":"J. Robert","year":"2008","unstructured":"Robert, J., & Torbjorn, E. (2008). A new information theoretic analysis of sum-of-squared-error kernel clustering. Neurocomputing, 72(1\u20133), 23\u201331.","journal-title":"Neurocomputing"},{"issue":"4","key":"5236_CR70","doi-asserted-by":"crossref","first-page":"515","DOI":"10.1111\/j.1467-9868.2005.00513.x","volume":"67","author":"P. Rosenbaum","year":"2005","unstructured":"Rosenbaum, P. (2005). An exact distribution-free test comparing two multivariate distributions based on adjacency. Journal of the Royal Statistical Society. Series B, Statistical Methodology, 67(4), 515\u2013530.","journal-title":"Journal of the Royal Statistical Society. Series B, Statistical Methodology"},{"key":"5236_CR71","unstructured":"Roth, V., Lange, T., Braun, M., & Buhmann, J. (2002). A resampling approach to cluster validation. COMPSTAT, available at http:\/\/www.cs.uni-bonn.De\/braunm ."},{"key":"5236_CR72","volume-title":"Levy processes and infinitely divisible distributions","author":"K. Sato","year":"1999","unstructured":"Sato, K. (1999). Levy processes and infinitely divisible distributions. Cambridge: Cambridge University Press."},{"issue":"3","key":"5236_CR73","doi-asserted-by":"crossref","first-page":"522","DOI":"10.1090\/S0002-9947-1938-1501980-0","volume":"44","author":"I. J. Schoenberg","year":"1938","unstructured":"Schoenberg, I. J. (1938). Metric spaces and positive definite functions. Transactions of the American Mathematical Society, 44(3), 522\u2013536.","journal-title":"Transactions of the American Mathematical Society"},{"key":"5236_CR74","first-page":"301","volume-title":"NIPS","author":"B. Sch\u00f6lkopf","year":"2000","unstructured":"Sch\u00f6lkopf, B. (2000). The kernel trick for distances. In NIPS (pp. 301\u2013307)."},{"key":"5236_CR75","volume-title":"Learning with kernels","author":"B. Sch\u00f6lkopf","year":"2002","unstructured":"Sch\u00f6lkopf, B., & Smola, A. J. (2002). Learning with kernels. New York: MIT Press."},{"issue":"5","key":"5236_CR76","doi-asserted-by":"crossref","first-page":"1299","DOI":"10.1162\/089976698300017467","volume":"10","author":"B. Sch\u00f6lkopf","year":"1998","unstructured":"Sch\u00f6lkopf, B., Smola, A. J., & Muller, K.-R. (1998). Nonlinear component analysis as a kernel eigenvalue problem. Neural Computation, 10(5), 1299\u20131319.","journal-title":"Neural Computation"},{"key":"5236_CR77","first-page":"67","volume":"2","author":"I. Steinwart","year":"2001","unstructured":"Steinwart, I. (2001). On the influence of the kernel on the consistency of support vector machines. Journal of Machine Learning Research, 2, 67\u201393.","journal-title":"Journal of Machine Learning Research"},{"issue":"12","key":"5236_CR78","doi-asserted-by":"crossref","first-page":"2483","DOI":"10.1162\/0899766042321751","volume":"16","author":"S. Still","year":"2004","unstructured":"Still, S., & Bialek, W. (2004). How many clusters? An information-theoretic perspective. Neural Computation, 16(12), 2483\u20132506.","journal-title":"Neural Computation"},{"key":"5236_CR79","first-page":"583","volume":"3","author":"A. Strehl","year":"2002","unstructured":"Strehl, A., & Ghosh, J. (2002). Cluster ensembles\u2014a knowledge reuse framework for combining multiple partitions. Journal of Machine Learning Research, 3, 583\u2013617.","journal-title":"Journal of Machine Learning Research"},{"issue":"5","key":"5236_CR80","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1007\/s00357-003-0004-6","volume":"20","author":"W. Stuetzle","year":"2003","unstructured":"Stuetzle, W. (2003). Estimating the cluster tree of a density by analyzing the minimal spanning tree of a sample. Journal of Classification, 20(5), 25\u201347.","journal-title":"Journal of Classification"},{"key":"5236_CR81","doi-asserted-by":"crossref","first-page":"750","DOI":"10.1198\/016214503000000666","volume":"98","author":"C. Sugar","year":"2003","unstructured":"Sugar, C., & James, G. (2003). Finding the number of clusters in a data set: an information theoretic approach. Journal of the American Statistical Association, 98, 750\u2013763.","journal-title":"Journal of the American Statistical Association"},{"issue":"3","key":"5236_CR82","doi-asserted-by":"crossref","first-page":"511","DOI":"10.1198\/106186005X59243","volume":"14","author":"R. Tibshirani","year":"2005","unstructured":"Tibshirani, R., & Walther, G. (2005). Cluster validation by prediction strength. Journal of Computational and Graphical Statistics, 14(3), 511\u2013528.","journal-title":"Journal of Computational and Graphical Statistics"},{"issue":"2","key":"5236_CR83","doi-asserted-by":"crossref","first-page":"411","DOI":"10.1111\/1467-9868.00293","volume":"63","author":"R. Tibshirani","year":"2001","unstructured":"Tibshirani, R., Walther, G., & Hastie, T. (2001). Estimating the number of clusters via the gap statistic. Journal of the Royal Statistical Society. Series B, Statistical Methodology, 63(2), 411\u2013423.","journal-title":"Journal of the Royal Statistical Society. Series B, Statistical Methodology"},{"key":"5236_CR84","first-page":"368","volume-title":"Proceedings of the 37th annual Allerton conference on communication, control and computing","author":"N. Tishby","year":"2000","unstructured":"Tishby, N., Pereira, F. C., & Bialek, W. (2000). The information bottleneck method. In B. Hajek & R. S. Sreenivas (Eds.), Proceedings of the 37th annual Allerton conference on communication, control and computing (pp. 368\u2013377)."},{"key":"5236_CR85","first-page":"5","volume-title":"1st European conference on data mining (ECDM\u201907)","author":"Z. Volkovich","year":"2007","unstructured":"Volkovich, Z., & Barzily, Z. (2007). On application of probability metrics in the cluster stability problem. In 1st European conference on data mining (ECDM\u201907) (pp. 5\u20137). Lisbon, Portugal."},{"key":"5236_CR86","first-page":"17","volume-title":"Proceedings of the workshop on clustering high dimensional data and its applications","author":"V. Volkovich","year":"2004","unstructured":"Volkovich, V., Kogan, J., & Nicholas, C. (2004). k-means initialization by sampling large datasets. In I.\u00a0Dhillon & J. Kogan (Eds.), Proceedings of the workshop on clustering high dimensional data and its applications (pp. 17\u201322). Held in conjunction with SDM 2004."},{"issue":"1","key":"5236_CR87","first-page":"103","volume":"25","author":"Z. Volkovich","year":"2005","unstructured":"Volkovich, Z., Barzily, Z., & Sureanu, P. (2005). The Levy-Khinchine representations and functional algebras of test functions. Journal of Pure and Applied Mathematics, 25(1), 103\u2013121.","journal-title":"Journal of Pure and Applied Mathematics"},{"issue":"7","key":"5236_CR88","doi-asserted-by":"crossref","first-page":"2174","DOI":"10.1016\/j.patcog.2008.01.008","volume":"41","author":"Z. Volkovich","year":"2008","unstructured":"Volkovich, Z., Barzily, Z., & Morozensky, L. (2008). A statistical model of cluster stability. Pattern Recognition, 41(7), 2174\u20132188.","journal-title":"Pattern Recognition"},{"key":"5236_CR89","volume-title":"Proceeding of the XIII international conference applied stochastic models and data analysis (ASMDA 2009)","author":"Z. Volkovich","year":"2009","unstructured":"Volkovich, Z., Barzily, Z., Avros, R., & Toledano-Kitai, D. (2009a). On application of the K-nearest neighbors approach for cluster validation. In Proceeding of the XIII international conference applied stochastic models and data analysis (ASMDA 2009). Vilnius, Lithuania"},{"key":"5236_CR90","volume-title":"The second global conference on power and optimization (PCO 2009)","author":"Z. Volkovich","year":"2009","unstructured":"Volkovich, Z., Barzily, Z., Weber, G.-W., & Toledano-Kitai, D. (2009b). Cluster stability estimation based on a minimal spanning trees approach. In The second global conference on power and optimization (PCO 2009). Bali, Indonesia."},{"key":"5236_CR91","volume-title":"The third global conference on power control and optimization (PCO 2010)","author":"Z. Volkovich","year":"2010","unstructured":"Volkovich, Z., Weber, G.-W., & Avros, R. (2010). On an adjacency cluster merit. In The third global conference on power control and optimization (PCO 2010). Gold Coast, Australia."},{"key":"5236_CR92","doi-asserted-by":"crossref","DOI":"10.1007\/978-1-4899-4493-1","volume-title":"Kernel smoothing","author":"M. P. Wand","year":"1995","unstructured":"Wand, M. P., & Jones, M. C. (1995). Kernel smoothing. London: Chapman and Hall."},{"key":"5236_CR93","first-page":"282","volume-title":"Numerical taxonomy","author":"D. Wishart","year":"1969","unstructured":"Wishart, D. (1969). Mode analysis: a generalization of nearest neighbor which reduces chaining effects. In A. J. Cole (Ed.), Numerical taxonomy (Vol.\u00a076, pp. 282\u2013311). London: Academic Press."},{"issue":"2","key":"5236_CR94","doi-asserted-by":"crossref","first-page":"109","DOI":"10.1080\/00949650410001661440","volume":"75","author":"G. Zech","year":"2005","unstructured":"Zech, G., & Aslan, B. (2005). New test for the multivariate two-sample problem based on the concept of minimum energy. Journal of Statistical Computation and Simulation, 75(2), 109\u2013119.","journal-title":"Journal of Statistical Computation and Simulation"},{"key":"5236_CR95","first-page":"47","volume":"VNIISI","author":"A. A. Zinger","year":"1989","unstructured":"Zinger, A. A., Kakosyan, A. V., & Klebanov, L. B. (1989). Characterization of distributions by means of the mean values of statistics in connection with some probability metrics. Stability Problems for Stochastic Models, VNIISI, 47\u201355.","journal-title":"Stability Problems for Stochastic Models"},{"key":"5236_CR96","doi-asserted-by":"crossref","DOI":"10.1515\/9783110936537","volume-title":"Modern theory of summation of random variable","author":"V. M. Zolotarev","year":"1997","unstructured":"Zolotarev, V. M. (1997). Modern theory of summation of random variable. Leiden: Brill Academic."}],"container-title":["Machine Learning"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s10994-011-5236-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1007\/s10994-011-5236-9\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s10994-011-5236-9","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,4,5]],"date-time":"2024-04-05T08:12:00Z","timestamp":1712304720000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/s10994-011-5236-9"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,3,18]]},"references-count":96,"journal-issue":{"issue":"1-2","published-print":{"date-parts":[[2011,10]]}},"alternative-id":["5236"],"URL":"https:\/\/doi.org\/10.1007\/s10994-011-5236-9","relation":{},"ISSN":["0885-6125","1573-0565"],"issn-type":[{"value":"0885-6125","type":"print"},{"value":"1573-0565","type":"electronic"}],"subject":[],"published":{"date-parts":[[2011,3,18]]}}}