{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,12]],"date-time":"2026-05-12T15:41:11Z","timestamp":1778600471767,"version":"3.51.4"},"reference-count":81,"publisher":"Springer Science and Business Media LLC","issue":"2-3","license":[{"start":{"date-parts":[[2015,3,28]],"date-time":"2015-03-28T00:00:00Z","timestamp":1427500800000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Mach Learn"],"published-print":{"date-parts":[[2015,9]]},"DOI":"10.1007\/s10994-015-5493-0","type":"journal-article","created":{"date-parts":[[2015,3,29]],"date-time":"2015-03-29T02:55:43Z","timestamp":1427597743000},"page":"333-378","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":30,"title":["Minimum message length estimation of mixtures of multivariate Gaussian and von Mises-Fisher distributions"],"prefix":"10.1007","volume":"100","author":[{"given":"Parthan","family":"Kasarapu","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Lloyd","family":"Allison","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2015,3,28]]},"reference":[{"key":"5493_CR1","doi-asserted-by":"crossref","unstructured":"Agusta, Y., & Dowe, D. L. (2003). Unsupervised learning of correlated multivariate Gaussian mixture models using MML. In AI 2003: advances in artificial intelligence (pp. 477\u2013489). Berlin: Springer.","DOI":"10.1007\/978-3-540-24581-0_40"},{"issue":"6","key":"5493_CR2","doi-asserted-by":"crossref","first-page":"716","DOI":"10.1109\/TAC.1974.1100705","volume":"19","author":"H Akaike","year":"1974","unstructured":"Akaike, H. (1974). A new look at the statistical model identification. IEEE Transactions on Automatic Control, 19(6), 716\u2013723.","journal-title":"IEEE Transactions on Automatic Control"},{"key":"5493_CR3","first-page":"2","volume":"59","author":"E Anderson","year":"1935","unstructured":"Anderson, E. (1935). The Irises of the Gasp\u00e9 Peninsula. Bulletin of the American Iris Society, 59, 2\u20135.","journal-title":"Bulletin of the American Iris Society"},{"key":"5493_CR4","doi-asserted-by":"crossref","unstructured":"Banerjee, A., Dhillon, I., Ghosh, J., & Sra, S. (2003). Generative model-based clustering of directional data. Proceedings of the 9th international conference on knowledge discovery and data mining (pp. 19\u201328). New York: ACM.","DOI":"10.1145\/956750.956757"},{"key":"5493_CR5","first-page":"1345","volume":"6","author":"A Banerjee","year":"2005","unstructured":"Banerjee, A., Dhillon, I. S., Ghosh, J., & Sra, S. (2005). Clustering on the unit hypersphere using von Mises-Fisher distributions. Journal of Machine Learning Research, 6, 1345\u20131382.","journal-title":"Journal of Machine Learning Research"},{"issue":"1\u20132","key":"5493_CR6","doi-asserted-by":"crossref","first-page":"227","DOI":"10.1093\/biomet\/48.1-2.227","volume":"48","author":"DE Barton","year":"1961","unstructured":"Barton, D. E. (1961). Unbiased estimation of a set of probabilities. Biometrika, 48(1\u20132), 227\u2013229.","journal-title":"Biometrika"},{"issue":"2","key":"5493_CR7","doi-asserted-by":"crossref","first-page":"215","DOI":"10.1080\/00401706.1964.10490165","volume":"6","author":"AP Basu","year":"1964","unstructured":"Basu, A. P. (1964). Estimates of reliability for some distributions useful in life testing. Technometrics, 6(2), 215\u2013219.","journal-title":"Technometrics"},{"issue":"5","key":"5493_CR8","doi-asserted-by":"crossref","first-page":"493","DOI":"10.1080\/03610918108812225","volume":"10","author":"D Best","year":"1981","unstructured":"Best, D., & Fisher, N. (1981). The bias of the maximum likelihood estimators of the von Mises-Fisher concentration parameters. Communications in Statistics-Simulation and Computation, 10(5), 493\u2013502.","journal-title":"Communications in Statistics-Simulation and Computation"},{"issue":"7","key":"5493_CR9","doi-asserted-by":"crossref","first-page":"719","DOI":"10.1109\/34.865189","volume":"22","author":"C Biernacki","year":"2000","unstructured":"Biernacki, C., Celeux, G., & Govaert, G. (2000). Assessing a mixture model for clustering with the integrated completed likelihood. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(7), 719\u2013725.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"5493_CR10","volume-title":"Pattern recognition and machine learning","author":"CM Bishop","year":"2006","unstructured":"Bishop, C. M. (2006). Pattern recognition and machine learning (Vol. 1). New York: Springer."},{"key":"5493_CR11","doi-asserted-by":"crossref","first-page":"269","DOI":"10.1016\/0022-5193(69)90041-1","volume":"23","author":"D Boulton","year":"1969","unstructured":"Boulton, D., & Wallace, C. (1969). The information content of a multistate distribution. Journal of Theoretical Biology, 23, 269\u2013278.","journal-title":"Journal of Theoretical Biology"},{"issue":"1","key":"5493_CR12","doi-asserted-by":"crossref","first-page":"221","DOI":"10.1080\/03610929008830199","volume":"19","author":"H Bozdogan","year":"1990","unstructured":"Bozdogan, H. (1990). On the information-based measure of covariance complexity and its application to the evaluation of multivariate linear models. Communications in Statistics-Theory and Methods, 19(1), 221\u2013278.","journal-title":"Communications in Statistics-Theory and Methods"},{"key":"5493_CR13","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-642-50974-2_5","volume-title":"Choosing the number of component clusters in the mixture-model using a new informational complexity criterion of the inverse-Fisher information matrix","author":"H Bozdogan","year":"1993","unstructured":"Bozdogan, H. (1993). Choosing the number of component clusters in the mixture-model using a new informational complexity criterion of the inverse-Fisher information matrix. Berlin: Springer."},{"issue":"17","key":"5493_CR14","doi-asserted-by":"crossref","first-page":"i512","DOI":"10.1093\/bioinformatics\/btu460","volume":"30","author":"JH Collier","year":"2014","unstructured":"Collier, J. H., Allison, L., Lesk, A. M., de la Banda, M. G., & Konagurthu, A. S. (2014). A new statistical framework to assess structural alignment quality using information compression. Bioinformatics, 30(17), i512\u2013i518.","journal-title":"Bioinformatics"},{"key":"5493_CR15","doi-asserted-by":"crossref","first-page":"294","DOI":"10.1137\/0605031","volume":"5","author":"JH Conway","year":"1984","unstructured":"Conway, J. H., & Sloane, N. J. A. (1984). On the Voronoi regions of certain lattices. SIAM Journal on Algebraic and Discrete Methods, 5, 294\u2013305.","journal-title":"SIAM Journal on Algebraic and Discrete Methods"},{"issue":"2","key":"5493_CR16","doi-asserted-by":"crossref","first-page":"189","DOI":"10.1111\/1467-842X.00073","volume":"41","author":"GM Cordeiro","year":"1999","unstructured":"Cordeiro, G. M., & Vasconcellos, K. L. (1999). Theory & Methods: Second-order biases of the maximum likelihood estimates in von Mises regression models. Australian & New Zealand Journal of Statistics, 41(2), 189\u2013198.","journal-title":"Australian & New Zealand Journal of Statistics"},{"issue":"1","key":"5493_CR17","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1111\/j.2517-6161.1977.tb01600.x","volume":"39","author":"AP Dempster","year":"1977","unstructured":"Dempster, A. P., Laird, N. M., & Rubin, D. B. (1977). Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society: Series B (Methodological), 39(1), 1\u201338.","journal-title":"Journal of the Royal Statistical Society: Series B (Methodological)"},{"key":"5493_CR18","unstructured":"Dowe, D. L., Allison, L., Dix, T. I., Hunter, L., Wallace, C. S., & Edgoose, T. (1996a). Circular clustering of protein dihedral angles by minimum message length. In Pacific symposium on biocomputing, Vol. 96, pp. 242\u2013255."},{"key":"5493_CR19","doi-asserted-by":"crossref","unstructured":"Dowe, D. L., Oliver, J. J., Baxter, R. A., & Wallace, C. S. (1996b). Bayesian estimation of the von Mises concentration parameter. In Maximum entropy and Bayesian methods pp. 51\u201360. The Netherlands: Springer.","DOI":"10.1007\/978-94-011-5430-7_6"},{"key":"5493_CR20","doi-asserted-by":"crossref","unstructured":"Dowe, D. L., Oliver, J. J., & Wallace, C. S. (1996c). MML estimation of the parameters of the spherical Fisher distribution. In Algorithmic learning theory (pp. 213\u2013227). Berlin: Springer.","DOI":"10.1007\/3-540-61863-5_48"},{"issue":"318","key":"5493_CR21","doi-asserted-by":"crossref","first-page":"607","DOI":"10.1080\/01621459.1967.10482934","volume":"62","author":"PS Dwyer","year":"1967","unstructured":"Dwyer, P. S. (1967). Some applications of matrix derivatives in multivariate analysis. Journal of the American Statistical Association, 62(318), 607\u2013625.","journal-title":"Journal of the American Statistical Association"},{"issue":"5","key":"5493_CR22","doi-asserted-by":"crossref","first-page":"1708","DOI":"10.1214\/aoms\/1177696815","volume":"41","author":"ML Eaton","year":"1970","unstructured":"Eaton, M. L., & Morris, C. N. (1970). The application of invariance to unbiased estimation. The Annals of Mathematical Statistics, 41(5), 1708\u20131716.","journal-title":"The Annals of Mathematical Statistics"},{"issue":"3","key":"5493_CR23","doi-asserted-by":"crossref","first-page":"381","DOI":"10.1109\/34.990138","volume":"24","author":"MA Figueiredo","year":"2002","unstructured":"Figueiredo, M. A., & Jain, A. K. (2002). Unsupervised learning of finite mixture models. IEEE Transactions on Pattern Analysis and Machine Intelligence, 24(3), 381\u2013396.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"5493_CR24","volume-title":"Statistical analysis of spherical data","author":"NI Fisher","year":"1993","unstructured":"Fisher, N. I. (1993). Statistical analysis of spherical data. Cambridge: Cambridge University Press."},{"issue":"1130","key":"5493_CR25","doi-asserted-by":"crossref","first-page":"295","DOI":"10.1098\/rspa.1953.0064","volume":"217","author":"R Fisher","year":"1953","unstructured":"Fisher, R. (1953). Dispersion on a sphere. Proceedings of the Royal Society of London: Series A (Mathematical and Physical Sciences), 217(1130), 295\u2013305.","journal-title":"Proceedings of the Royal Society of London: Series A (Mathematical and Physical Sciences)"},{"issue":"2","key":"5493_CR26","doi-asserted-by":"crossref","first-page":"179","DOI":"10.1111\/j.1469-1809.1936.tb02137.x","volume":"7","author":"RA Fisher","year":"1936","unstructured":"Fisher, R. A. (1936). The use of multiple measurements in taxonomic problems. Annals of Eugenics, 7(2), 179\u2013188.","journal-title":"Annals of Eugenics"},{"issue":"2","key":"5493_CR27","doi-asserted-by":"crossref","first-page":"291","DOI":"10.1109\/89.279278","volume":"2","author":"J Gauvain","year":"1994","unstructured":"Gauvain, J., & Lee, C. H. (1994). Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains. IEEE Transactions on Speech and Audio Processing, 2(2), 291\u2013298.","journal-title":"IEEE Transactions on Speech and Audio Processing"},{"issue":"2","key":"5493_CR28","doi-asserted-by":"crossref","first-page":"457","DOI":"10.2307\/2533388","volume":"50","author":"G Gray","year":"1994","unstructured":"Gray, G. (1994). Bias in misspecified mixtures. Biometrics, 50(2), 457\u2013470.","journal-title":"Biometrics"},{"issue":"1007","key":"5493_CR29","doi-asserted-by":"crossref","first-page":"453","DOI":"10.1098\/rspa.1946.0056","volume":"186","author":"H Jeffreys","year":"1946","unstructured":"Jeffreys, H. (1946). An invariant form for the prior probability in estimation problems. Proceedings of the Royal Society of London: Series A (Mathematical and Physical Sciences), 186(1007), 453\u2013461.","journal-title":"Proceedings of the Royal Society of London: Series A (Mathematical and Physical Sciences)"},{"issue":"2","key":"5493_CR30","doi-asserted-by":"crossref","first-page":"271","DOI":"10.1080\/757582839","volume":"17","author":"P Jones","year":"1990","unstructured":"Jones, P., & McLachlan, G. (1990). Laplace-normal mixtures fitted to wind shear data. Journal of Applied Statistics, 17(2), 271\u2013276.","journal-title":"Journal of Applied Statistics"},{"issue":"5","key":"5493_CR31","doi-asserted-by":"crossref","first-page":"571","DOI":"10.1093\/comjnl\/bxm121","volume":"51","author":"MA Jorgensen","year":"2008","unstructured":"Jorgensen, M. A., & McLachlan, G. J. (2008). Wallace\u2019s approach to unsupervised learning: The Snob program. The Computer Journal, 51(5), 571\u2013578.","journal-title":"The Computer Journal"},{"key":"5493_CR32","unstructured":"Kasarapu, P., & Allison, L. (2015). Minimum message length estimation of mixtures of multivariate gaussian and von Mises-Fisher distributions. arXiv:1502.07813 [cs.LG]."},{"issue":"1","key":"5493_CR33","doi-asserted-by":"crossref","first-page":"71","DOI":"10.1111\/j.2517-6161.1982.tb01189.x","volume":"44","author":"JT Kent","year":"1982","unstructured":"Kent, J. T. (1982). The Fisher\u2013Bingham distribution on the sphere. Journal of the Royal Statistical Society: Series B (Methodological), 44(1), 71\u201380.","journal-title":"Journal of the Royal Statistical Society: Series B (Methodological)"},{"issue":"12","key":"5493_CR34","doi-asserted-by":"crossref","first-page":"i97","DOI":"10.1093\/bioinformatics\/bts223","volume":"28","author":"AS Konagurthu","year":"2012","unstructured":"Konagurthu, A. S., Lesk, A. M., & Allison, L. (2012). Minimum message length inference of secondary structure from protein coordinate data. Bioinformatics, 28(12), i97\u2013i105.","journal-title":"Bioinformatics"},{"key":"5493_CR35","doi-asserted-by":"crossref","unstructured":"Konagurthu, A. S., Allison, L., Abramson, D., Stuckey, P. J., & Lesk, A. M. (2013). Statistical inference of protein \u201cLEGO bricks\u201d. In 2013 IEEE 13th international conference on data mining (ICDM), IEEE, (pp 1091\u20131096).","DOI":"10.1109\/ICDM.2013.73"},{"key":"5493_CR36","volume-title":"The EM algorithm and extensions","author":"T Krishnan","year":"1997","unstructured":"Krishnan, T., & McLachlan, G. (1997). The EM algorithm and extensions. New York: Wiley."},{"key":"5493_CR37","doi-asserted-by":"crossref","unstructured":"Kullback, S., & Leibler, R. A. (1951). On information and sufficiency. The Annals of Mathematical Statistics, 22(1), 79\u201386.","DOI":"10.1214\/aoms\/1177729694"},{"key":"5493_CR38","volume-title":"Bayesian statistics: An introduction","author":"P Lee","year":"1997","unstructured":"Lee, P. (1997). Bayesian statistics: An introduction. London: Arnold."},{"issue":"9","key":"5493_CR39","doi-asserted-by":"crossref","first-page":"2739","DOI":"10.1016\/j.csda.2011.04.007","volume":"55","author":"Y Lo","year":"2011","unstructured":"Lo, Y. (2011). Bias from misspecification of the component variances in a normal mixture. Computational Statistics and Data Anaysis, 55(9), 2739\u20132747.","journal-title":"Computational Statistics and Data Anaysis"},{"key":"5493_CR40","volume-title":"Matrix differential calculus with applications in statistics and econometrics","author":"JR Magnus","year":"1988","unstructured":"Magnus, J. R., & Neudecker, H. (1988). Matrix differential calculus with applications in statistics and econometrics. New York: Wiley."},{"key":"5493_CR41","volume-title":"Directional statistics","author":"K Mardia","year":"2000","unstructured":"Mardia, K., & Jupp, P. (2000). Directional statistics. Hoboken, NJ: Wiley."},{"issue":"1","key":"5493_CR42","doi-asserted-by":"crossref","first-page":"72","DOI":"10.1111\/j.2517-6161.1984.tb01278.x","volume":"46","author":"K Mardia","year":"1984","unstructured":"Mardia, K., Holmes, D., & Kent, J. (1984). A goodness-of-fit test for the von Mises-Fisher distribution. Journal of the Royal Statistical Society: Series B (Methodological), 46(1), 72\u201378.","journal-title":"Journal of the Royal Statistical Society: Series B (Methodological)"},{"key":"5493_CR43","volume-title":"Multivariate analysis","author":"KV Mardia","year":"1979","unstructured":"Mardia, K. V., Kent, J. T., & Bibby, J. M. (1979). Multivariate analysis. London: Academic Press."},{"issue":"2","key":"5493_CR44","doi-asserted-by":"crossref","first-page":"505","DOI":"10.1111\/j.1541-0420.2006.00682.x","volume":"63","author":"KV Mardia","year":"2007","unstructured":"Mardia, K. V., Taylor, C. C., & Subramaniam, G. K. (2007). Protein bioinformatics and mixtures of bivariate von Mises distributions for angular data. Biometrics, 63(2), 505\u2013512.","journal-title":"Biometrics"},{"key":"5493_CR45","first-page":"779","volume":"59","author":"G McLachlan","year":"1997","unstructured":"McLachlan, G., & Peel, D. (1997). Contribution to the discussion of paper by S. Richardson and P.J. Green. Journal of the Royal Statistical Society B, 59, 779\u2013780.","journal-title":"Journal of the Royal Statistical Society B"},{"key":"5493_CR46","doi-asserted-by":"crossref","DOI":"10.1002\/0471721182","volume-title":"Finite mixture models","author":"G McLachlan","year":"2000","unstructured":"McLachlan, G., & Peel, D. (2000). Finite mixture models. New York: Wiley."},{"key":"5493_CR47","volume-title":"Mixture models: Inference and applications to clustering (Statistics: Textbooks and Monographs)","author":"GJ McLachlan","year":"1988","unstructured":"McLachlan, G. J., & Basford, K. E. (1988). Mixture models: Inference and applications to clustering (Statistics: Textbooks and Monographs). New York: Dekker."},{"issue":"4","key":"5493_CR48","first-page":"536","volume":"247","author":"A Murzin","year":"1995","unstructured":"Murzin, A., Brenner, S., Hubbard, T., Chothia, C., et al. (1995). SCOP: A structural classification of proteins database for the investigation of sequences and structures. Journal of Molecular Biology, 247(4), 536\u2013540.","journal-title":"Journal of Molecular Biology"},{"key":"5493_CR49","unstructured":"Oliver, J., & Baxter, R. (1994). MML and Bayesianism: Similarities and differences. Dept Comput Sci Monash Univ, Clayton, Victoria, Australia, Tech Rep 206."},{"key":"5493_CR50","unstructured":"Oliver, J. J., Baxter, R. A., & Wallace, C. S. (1996). Unsupervised learning using MML. In Machine learning: Proceedings of the 13th international conference, (pp. 364\u2013372)."},{"issue":"3","key":"5493_CR51","doi-asserted-by":"crossref","first-page":"225","DOI":"10.1016\/S0167-7152(99)00062-0","volume":"45","author":"K Patra","year":"1999","unstructured":"Patra, K., & Dey, D. K. (1999). A multivariate mixture of Weibull distributions in reliability modeling. Statistics & Probability letters, 45(3), 225\u2013235.","journal-title":"Statistics & Probability letters"},{"issue":"4","key":"5493_CR52","doi-asserted-by":"crossref","first-page":"339","DOI":"10.1023\/A:1008981510081","volume":"10","author":"D Peel","year":"2000","unstructured":"Peel, D., & McLachlan, G. J. (2000). Robust mixture modelling using the t-distribution. Statistics and Computing, 10(4), 339\u2013348.","journal-title":"Statistics and Computing"},{"issue":"453","key":"5493_CR53","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1198\/016214501750332974","volume":"96","author":"D Peel","year":"2001","unstructured":"Peel, D., Whiten, W. J., & McLachlan, G. J. (2001). Fitting mixtures of Kent distributions to aid in joint set identification. Journal of the American Statistical Association, 96(453), 56\u201363.","journal-title":"Journal of the American Statistical Association"},{"issue":"4","key":"5493_CR54","doi-asserted-by":"crossref","first-page":"731","DOI":"10.1111\/1467-9868.00095","volume":"59","author":"S Richardson","year":"1997","unstructured":"Richardson, S., & Green, P. J. (1997). On Bayesian analysis of mixtures with an unknown number of components. Journal of the Royal Statistical Society: Series B (Methodological), 59(4), 731\u2013792.","journal-title":"Journal of the Royal Statistical Society: Series B (Methodological)"},{"issue":"5","key":"5493_CR55","doi-asserted-by":"crossref","first-page":"465","DOI":"10.1016\/0005-1098(78)90005-5","volume":"14","author":"J Rissanen","year":"1978","unstructured":"Rissanen, J. (1978). Modeling by shortest data description. Automatica, 14(5), 465\u2013471.","journal-title":"Automatica"},{"key":"5493_CR56","volume-title":"Stochastic complexity in statistical inquiry theory","author":"J Rissanen","year":"1989","unstructured":"Rissanen, J. (1989). Stochastic complexity in statistical inquiry theory. River Edge, NJ: World Scientific Publishing Co. Inc."},{"issue":"11","key":"5493_CR57","doi-asserted-by":"crossref","first-page":"1133","DOI":"10.1109\/34.730550","volume":"20","author":"S Roberts","year":"1998","unstructured":"Roberts, S., Husmeier, D., Rezek, I., & Penny, W. (1998). Bayesian approaches to Gaussian mixture modeling. IEEE Transactions on Pattern Analysis and Machine Intelligence, 20(11), 1133\u20131142.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"5493_CR58","volume-title":"The probabilistic relevance framework: BM25 and beyond","author":"S Robertson","year":"2009","unstructured":"Robertson, S., & Zaragoza, H. (2009). The probabilistic relevance framework: BM25 and beyond. Hanover, MA: Now Publishers Inc."},{"issue":"5","key":"5493_CR59","doi-asserted-by":"crossref","first-page":"513","DOI":"10.1016\/0306-4573(88)90021-0","volume":"24","author":"G Salton","year":"1988","unstructured":"Salton, G., & Buckley, C. (1988). Term-weighting approaches in automatic text retrieval. Information Processing & Management, 24(5), 513\u2013523.","journal-title":"Information Processing & Management"},{"key":"5493_CR60","volume-title":"Introduction to modern information retrieval","author":"G Salton","year":"1986","unstructured":"Salton, G., & McGill, M. J. (1986). Introduction to modern information retrieval. New York, NY: McGraw-Hill Inc."},{"issue":"2","key":"5493_CR61","doi-asserted-by":"crossref","first-page":"369","DOI":"10.1093\/biomet\/65.2.369","volume":"65","author":"G Schou","year":"1978","unstructured":"Schou, G. (1978). Estimation of the concentration parameter in von Mises-Fisher distributions. Biometrika, 65(2), 369\u2013377.","journal-title":"Biometrika"},{"issue":"2","key":"5493_CR62","doi-asserted-by":"crossref","first-page":"461","DOI":"10.1214\/aos\/1176344136","volume":"6","author":"G Schwarz","year":"1978","unstructured":"Schwarz, G., et al. (1978). Estimating the dimension of a model. The Annals of Statistics, 6(2), 461\u2013464.","journal-title":"The Annals of Statistics"},{"issue":"3","key":"5493_CR63","doi-asserted-by":"crossref","first-page":"481","DOI":"10.1023\/A:1004117419204","volume":"52","author":"W Seidel","year":"2000","unstructured":"Seidel, W., Mosler, K., & Alker, M. (2000). A cautionary note on likelihood ratio tests in mixture models. Annals of the Institute of Statistical Mathematics, 52(3), 481\u2013487.","journal-title":"Annals of the Institute of Statistical Mathematics"},{"key":"5493_CR64","doi-asserted-by":"crossref","first-page":"379","DOI":"10.1002\/j.1538-7305.1948.tb01338.x","volume":"27","author":"CE Shannon","year":"1948","unstructured":"Shannon, C. E. (1948). A mathematical theory of communication. The Bell System Technical Journal, 27, 379\u2013423.","journal-title":"The Bell System Technical Journal"},{"issue":"24","key":"5493_CR65","doi-asserted-by":"crossref","first-page":"11,880","DOI":"10.1016\/j.amc.2012.05.050","volume":"218","author":"H Song","year":"2012","unstructured":"Song, H., Liu, J., & Wang, G. (2012). High-order parameter approximation for von Mises-Fisher distributions. Applied Mathematics and Computation, 218(24), 11,880\u201311,890.","journal-title":"Applied Mathematics and Computation"},{"issue":"1","key":"5493_CR66","doi-asserted-by":"crossref","first-page":"177","DOI":"10.1007\/s00180-011-0232-x","volume":"27","author":"S Sra","year":"2012","unstructured":"Sra, S. (2012). A short note on parameter approximation for von Mises-Fisher distributions: And a fast implementation of $$I_s(x)$$ I s ( x ) . Computational Statistics, 27(1), 177\u2013190.","journal-title":"Computational Statistics"},{"key":"5493_CR67","unstructured":"Strehl, A., Ghosh, J., & Mooney, R. (2000). Impact of similarity measures on web-page clustering. In Workshop on artificial intelligence for web search (AAAI 2000), (pp. 58\u201364)."},{"issue":"1","key":"5493_CR68","doi-asserted-by":"crossref","first-page":"145","DOI":"10.1007\/s00180-007-0030-7","volume":"22","author":"A Tanabe","year":"2007","unstructured":"Tanabe, A., Fukumizu, K., Oba, S., Takenouchi, T., & Ishii, S. (2007). Parameter estimation for von Mises-Fisher distributions. Computational Statistics, 22(1), 145\u2013157.","journal-title":"Computational Statistics"},{"key":"5493_CR69","volume-title":"Statistical analysis of finite mixture distributions","author":"DM Titterington","year":"1985","unstructured":"Titterington, D. M., Smith, A. F., Makov, U. E., et al. (1985). Statistical analysis of finite mixture distributions. New York: Wiley."},{"key":"5493_CR70","unstructured":"Wallace, C. (1986). An improved program for classification. In Proceedings of the 9th Australian computer science conference, (pp. 357\u2013366)."},{"key":"5493_CR71","unstructured":"Wallace, C., & Dowe, D. (1994). Estimation of the von Mises concentration parameter using minimum message length. In Proceedings of the 12th Australian statistical society conference, Monash University, Australia."},{"key":"5493_CR72","volume-title":"Statistical and inductive inference using minimum message length. Information Science and Statistics","author":"CS Wallace","year":"2005","unstructured":"Wallace, C. S. (2005). Statistical and inductive inference using minimum message length. Information Science and Statistics. Secaucus, NJ: Springer."},{"issue":"2","key":"5493_CR73","doi-asserted-by":"crossref","first-page":"185","DOI":"10.1093\/comjnl\/11.2.185","volume":"11","author":"CS Wallace","year":"1968","unstructured":"Wallace, C. S., & Boulton, D. M. (1968). An information measure for classification. Computer Journal, 11(2), 185\u2013194.","journal-title":"Computer Journal"},{"key":"5493_CR74","doi-asserted-by":"crossref","first-page":"270","DOI":"10.1093\/comjnl\/42.4.270","volume":"42","author":"CS Wallace","year":"1999","unstructured":"Wallace, C. S., & Dowe, D. L. (1999). Minimum message length and Kolmogorov complexity. Computer Journal, 42, 270\u2013283.","journal-title":"Computer Journal"},{"issue":"3","key":"5493_CR75","doi-asserted-by":"crossref","first-page":"240","DOI":"10.1111\/j.2517-6161.1987.tb01695.x","volume":"49","author":"CS Wallace","year":"1987","unstructured":"Wallace, C. S., & Freeman, P. R. (1987). Estimation and inference by compact coding. Journal of the Royal Statistical Society: Series B (Methodological), 49(3), 240\u2013265.","journal-title":"Journal of the Royal Statistical Society: Series B (Methodological)"},{"issue":"2","key":"5493_CR76","doi-asserted-by":"crossref","first-page":"381","DOI":"10.2307\/2532881","volume":"52","author":"P Wang","year":"1996","unstructured":"Wang, P., Puterman, M. L., Cockburn, I., & Le, N. (1996). Mixed Poisson regression models with covariate dependent rates. Biometrics, 52(2), 381\u2013400.","journal-title":"Biometrics"},{"issue":"3\u20134","key":"5493_CR77","doi-asserted-by":"crossref","first-page":"344","DOI":"10.1093\/biomet\/43.3-4.344","volume":"43","author":"G Watson","year":"1956","unstructured":"Watson, G., & Williams, E. (1956). On the construction of significance tests on the circle and the sphere. Biometrika, 43(3\u20134), 344\u2013352.","journal-title":"Biometrika"},{"issue":"1","key":"5493_CR78","doi-asserted-by":"crossref","first-page":"1","DOI":"10.2307\/1912526","volume":"50","author":"H White","year":"1982","unstructured":"White, H. (1982). Maximum likelihood estimation of misspecified models. Econometrica, 50(1), 1\u201325.","journal-title":"Econometrica"},{"issue":"1","key":"5493_CR79","doi-asserted-by":"crossref","first-page":"157","DOI":"10.1080\/03610919408813161","volume":"23","author":"AT Wood","year":"1994","unstructured":"Wood, A. T. (1994). Simulation of the von Mises Fisher distribution. Communications in Statistics-Simulation and Computation, 23(1), 157\u2013164.","journal-title":"Communications in Statistics-Simulation and Computation"},{"issue":"1","key":"5493_CR80","doi-asserted-by":"crossref","first-page":"129","DOI":"10.1162\/neco.1996.8.1.129","volume":"8","author":"L Xu","year":"1996","unstructured":"Xu, L., & Jordan, M. I. (1996). On convergence properties of the EM algorithm for Gaussian mixtures. Neural Computation, 8(1), 129\u2013151.","journal-title":"Neural Computation"},{"key":"5493_CR81","unstructured":"Zhong, S., & Ghosh, J. (2003). A comparative study of generative models for document clustering. In Proceedings of the workshop on clustering high dimensional data and its applications in SIAM data mining conference."}],"container-title":["Machine Learning"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s10994-015-5493-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1007\/s10994-015-5493-0\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s10994-015-5493-0","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,5,21]],"date-time":"2025-05-21T18:25:44Z","timestamp":1747851944000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/s10994-015-5493-0"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2015,3,28]]},"references-count":81,"journal-issue":{"issue":"2-3","published-print":{"date-parts":[[2015,9]]}},"alternative-id":["5493"],"URL":"https:\/\/doi.org\/10.1007\/s10994-015-5493-0","relation":{},"ISSN":["0885-6125","1573-0565"],"issn-type":[{"value":"0885-6125","type":"print"},{"value":"1573-0565","type":"electronic"}],"subject":[],"published":{"date-parts":[[2015,3,28]]}}}