{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,11]],"date-time":"2025-12-11T07:40:58Z","timestamp":1765438858768,"version":"3.37.3"},"reference-count":35,"publisher":"Springer Science and Business Media LLC","issue":"2","license":[{"start":{"date-parts":[[2023,4,4]],"date-time":"2023-04-04T00:00:00Z","timestamp":1680566400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,4,4]],"date-time":"2023-04-04T00:00:00Z","timestamp":1680566400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100004505","name":"Universit\u00e0 degli Studi di Catania","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100004505","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Classif"],"published-print":{"date-parts":[[2023,7]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>In generalized linear models (GLMs), measures of lack of fit are typically defined as the deviance between two nested models, and a deviance-based <jats:italic>R<\/jats:italic><jats:sup>2<\/jats:sup> is commonly used to evaluate the fit. In this paper, we extend deviance measures to mixtures of GLMs, whose parameters are estimated by maximum likelihood (ML) via the EM algorithm. Such measures are defined both locally, i.e., at cluster-level, and globally, i.e., with reference to the whole sample. At the cluster-level, we propose a normalized two-term decomposition of the local deviance into explained, and unexplained local deviances. At the sample-level, we introduce an additive normalized decomposition of the total deviance into three terms, where each evaluates a different aspect of the fitted model: (1) the cluster separation on the dependent variable, (2) the proportion of the total deviance explained by the fitted model, and (3) the proportion of the total deviance which remains unexplained. We use both local and global decompositions to define, respectively, local and overall deviance <jats:italic>R<\/jats:italic><jats:sup>2<\/jats:sup> measures for mixtures of GLMs, which we illustrate\u2014for Gaussian, Poisson and binomial responses\u2014by means of a simulation study. The proposed fit measures are then used to assess, and interpret clusters of COVID-19 spread in Italy in two time points.<\/jats:p>","DOI":"10.1007\/s00357-023-09432-4","type":"journal-article","created":{"date-parts":[[2023,4,4]],"date-time":"2023-04-04T12:19:23Z","timestamp":1680610763000},"page":"233-266","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":6,"title":["Local and Overall Deviance R-Squared Measures for Mixtures of Generalized Linear Models"],"prefix":"10.1007","volume":"40","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-5498-009X","authenticated-orcid":false,"given":"Roberto","family":"Di Mari","sequence":"first","affiliation":[]},{"given":"Salvatore","family":"Ingrassia","sequence":"additional","affiliation":[]},{"given":"Antonio","family":"Punzo","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2023,4,4]]},"reference":[{"issue":"3-4","key":"9432_CR1","doi-asserted-by":"publisher","first-page":"561","DOI":"10.1016\/S0167-9473(02)00163-9","volume":"41","author":"C Biernacki","year":"2003","unstructured":"Biernacki, C., Celeux, G., & Govaert, G. (2003). Choosing starting values for the EM algorithm for getting the highest likelihood in multivariate Gaussian mixture models. Computational Statistics & Data Analysis, 41(3-4), 561\u2013575.","journal-title":"Computational Statistics & Data Analysis"},{"issue":"2","key":"9432_CR2","first-page":"209","volume":"14","author":"AC Cameron","year":"1996","unstructured":"Cameron, A. C., & Windmeijer, F. A. G. (1996). R-squared measures for count data regression models with applications to health-care utilization. Journal of Business & Economic Statistics, 14(2), 209\u2013220.","journal-title":"Journal of Business & Economic Statistics"},{"issue":"2","key":"9432_CR3","doi-asserted-by":"publisher","first-page":"329","DOI":"10.1016\/S0304-4076(96)01818-0","volume":"77","author":"AC Cameron","year":"1997","unstructured":"Cameron, A. C., & Windmeijer, F. A. G. (1997). An R-squared measure of goodness of fit for some common nonlinear regression models. Journal of Econometrics, 77(2), 329\u2013342.","journal-title":"Journal of Econometrics"},{"issue":"3","key":"9432_CR4","doi-asserted-by":"publisher","first-page":"315","DOI":"10.1016\/0167-9473(92)90042-E","volume":"14","author":"G Celeux","year":"1992","unstructured":"Celeux, G., & Govaert, G. (1992). A classification EM algorithm for clustering and two stochastic versions. Computational Statistics & Data Analysis, 14(3), 315\u2013332.","journal-title":"Computational Statistics & Data Analysis"},{"issue":"1","key":"9432_CR5","doi-asserted-by":"publisher","first-page":"7","DOI":"10.1007\/s00357-012-9098-z","volume":"29","author":"JO Cerdeira","year":"2012","unstructured":"Cerdeira, J. O., Martins, M. J., & Silva, P. C. (2012). A combinatorial approach to assess the separability of clusters. Journal of Classification, 29(1), 7\u201322.","journal-title":"Journal of Classification"},{"key":"9432_CR6","doi-asserted-by":"crossref","unstructured":"Cohen, J., Cohen, P., West, S. G., & Aiken, L. S. (2013). Applied multiple regression\/correlation analysis for the behavioral sciences. Taylor & Francis.","DOI":"10.4324\/9780203774441"},{"key":"9432_CR7","doi-asserted-by":"crossref","unstructured":"Crawley, M. J. (2012). The R Book. Wiley.","DOI":"10.1002\/9781118448908"},{"issue":"1","key":"9432_CR8","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1111\/j.2517-6161.1977.tb01600.x","volume":"39","author":"AP Dempster","year":"1977","unstructured":"Dempster, A. P., Laird, N. M., & Rubin, D. B. (1977). Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society. Series B (Methodological), 39(1), 1\u201338.","journal-title":"Journal of the Royal Statistical Society. Series B (Methodological)"},{"key":"9432_CR9","unstructured":"Diebolt, J., & Ip, E. H. S. (1996). Stochastic EM: Method and application. In Markov Chain Monte Carlo in practice, pp. 259\u2013273. Springer."},{"issue":"3","key":"9432_CR10","first-page":"768","volume":"21","author":"EW Forgy","year":"1965","unstructured":"Forgy, E. W. (1965). Cluster analysis of multivariate data: Efficiency versus interpretability of classifications. Biometrics, 21(3), 768\u2013780.","journal-title":"Biometrics"},{"key":"9432_CR11","volume-title":"Finite Mixture and Markov switching models","author":"S Fr\u00fchwirth-Schnatter","year":"2006","unstructured":"Fr\u00fchwirth-Schnatter, S. (2006). Finite Mixture and Markov switching models. New York: Springer."},{"key":"9432_CR12","doi-asserted-by":"crossref","unstructured":"Gr\u00fcn, B., & Leisch, F. (2008a). Finite mixtures of generalized linear regression models. In C. Heumann (Ed.) Recent Advances in Linear Models and Related Areas - Essays in Honour of Helge Toutenburg Shalabh, pp. 205\u2013230. Springer Physica Verlag, Heidelberg.","DOI":"10.1007\/978-3-7908-2064-5_11"},{"key":"9432_CR13","doi-asserted-by":"crossref","unstructured":"Gr\u00fcn, B., & Leisch, F. (2008b). Flexmix version 2: Finite mixtures with concomitant variables and varying and constant parameters. Journal of Statistical Software, 28(4), 1\u201335.","DOI":"10.18637\/jss.v028.i04"},{"issue":"2","key":"9432_CR14","doi-asserted-by":"publisher","first-page":"147","DOI":"10.1016\/S0304-3800(00)00354-9","volume":"135","author":"A Guisan","year":"2000","unstructured":"Guisan, A., & Zimmermann, N. E. (2000). Predictive habitat distribution models in ecology. Ecological Modelling, 135(2), 147\u2013186.","journal-title":"Ecological Modelling"},{"key":"9432_CR15","unstructured":"Gujarati, D. N., & Porter, D. C. (2009). Basic econometrics. Economics series. McGraw-Hill Irwin."},{"issue":"1","key":"9432_CR16","doi-asserted-by":"publisher","first-page":"193","DOI":"10.1007\/BF01908075","volume":"2","author":"L Hubert","year":"1985","unstructured":"Hubert, L., & Arabie, P. (1985). Comparing partitions. Journal of Classification, 2(1), 193\u2013218.","journal-title":"Journal of Classification"},{"issue":"2","key":"9432_CR17","doi-asserted-by":"publisher","first-page":"526","DOI":"10.1007\/s00357-019-09326-4","volume":"37","author":"S Ingrassia","year":"2020","unstructured":"Ingrassia, S., & Punzo, A. (2020). Cluster validation for mixtures of regressions via the total sum of squares decomposition. Journal of Classification, 37 (2), 526\u2013547.","journal-title":"Journal of Classification"},{"issue":"1","key":"9432_CR18","doi-asserted-by":"publisher","first-page":"85","DOI":"10.1007\/s00357-015-9175-1","volume":"32","author":"S Ingrassia","year":"2015","unstructured":"Ingrassia, S., Punzo, A., Vittadini, G., & Minotti, S. C. (2015). The generalized linear mixed cluster-weighted model. Journal of Classification, 32(1), 85\u2013113.","journal-title":"Journal of Classification"},{"key":"9432_CR19","unstructured":"Kassambara, A. (2017). Practical guide to cluster analysis in R: Unsupervised machine learning, vol. 1 of multivariate analysis. STHDA."},{"key":"9432_CR20","doi-asserted-by":"crossref","unstructured":"Kaufman, L., & Rousseeuw, P. J. R. (1990). Finding groups in data: An introduction to cluster analysis. A Wiley-Interscience publication. Wiley.","DOI":"10.1002\/9780470316801"},{"issue":"8","key":"9432_CR21","doi-asserted-by":"publisher","first-page":"1","DOI":"10.18637\/jss.v011.i08","volume":"11","author":"F Leisch","year":"2004","unstructured":"Leisch, F. (2004). Flexmix: A general framework for finite mixture models and latent class regression in R. Journal of Statistical Software, 11(8), 1\u201318.","journal-title":"Journal of Statistical Software"},{"key":"9432_CR22","unstructured":"MacQueen, J. (1967). Some methods for classification and analysis of multivariate observations. In Proceedings of the fifth Berkeley symposium on mathematical statistics and probability, vol. 1, pp. 281-297, Oakland, CA, USA."},{"key":"9432_CR23","unstructured":"Maechler, M., Rousseeuw, P., Struyf, A., & Hubert, M. (2019). Cluster: Finding groups in data: Cluster analysis extended Rousseeuw et al. Version 2.1.0 (2019-06-19)."},{"issue":"2","key":"9432_CR24","doi-asserted-by":"publisher","first-page":"1","DOI":"10.18637\/jss.v086.i02","volume":"86","author":"A Mazza","year":"2018","unstructured":"Mazza, A., Punzo, A., & Ingrassia, S. (2018). flexCWM: A flexible framework for cluster-weighted models. Journal of Statistical Software, 86(2), 1\u201330.","journal-title":"Journal of Statistical Software"},{"key":"9432_CR25","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4899-3242-6","volume-title":"Generalized linear models, 2nd edn.","author":"P McCullagh","year":"1989","unstructured":"McCullagh, P., & Nelder, J. A. (1989). Generalized linear models, 2nd edn. Boca Raton: Chapman & Hall."},{"key":"9432_CR26","doi-asserted-by":"publisher","DOI":"10.1002\/0471721182","volume-title":"Finite mixture models","author":"GJ McLachlan","year":"2000","unstructured":"McLachlan, G. J., & Peel, D. (2000). Finite mixture models. New York: John Wiley & Sons."},{"key":"9432_CR27","doi-asserted-by":"crossref","unstructured":"Menard, S. (2002). Applied logistic regression analysis, vol. 106 of applied logistic regression analysis. SAGE Publications.","DOI":"10.4135\/9781412983433"},{"key":"9432_CR28","unstructured":"Omerovic, S. (2019). Fitting mixtures of generalized nonlinear models. Ph.D. thesis, Institute of Statistics, Graz University of Technology, Austria. Available at, https:\/\/diglib.tugraz.at\/fitting-mixtures-of-generalized-nonlinear-mode, ls-2019."},{"issue":"2","key":"9432_CR29","doi-asserted-by":"publisher","first-page":"212","DOI":"10.1007\/s00357-015-9182-2","volume":"32","author":"C Panagiotakis","year":"2015","unstructured":"Panagiotakis, C. (2015). Point clustering via voting maximization. Journal of Classification, 32(2), 212\u2013240.","journal-title":"Journal of Classification"},{"key":"9432_CR30","doi-asserted-by":"crossref","unstructured":"Punzo, A., & Ingrassia, S. (2015). Parsimonious generalized linear Gaussian cluster-weighted models. In I. Morlini, T. Minerva, & M. Vichi (Eds.) Advances in Statistical Models for Data Analysis, Studies in Classification, Data Analysis and Knowledge Organization, pp. 201\u2013209, Cham. Springer.","DOI":"10.1007\/978-3-319-17377-1_21"},{"key":"9432_CR31","unstructured":"R Core Team. (2020). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria."},{"issue":"10231","key":"9432_CR32","doi-asserted-by":"publisher","first-page":"1225","DOI":"10.1016\/S0140-6736(20)30627-9","volume":"395","author":"A Remuzzi","year":"2020","unstructured":"Remuzzi, A., & Remuzzi, G. (2020). Covid-19 and Italy: what next? The Lancet, 395(10231), 1225\u20131228.","journal-title":"The Lancet"},{"issue":"1","key":"9432_CR33","doi-asserted-by":"publisher","first-page":"205","DOI":"10.32614\/RJ-2016-021","volume":"8","author":"L Scrucca","year":"2016","unstructured":"Scrucca, L., Fop, M., Murphy, T. B., & Raftery, A. E. (2016). mclust 5: Clustering, classification and density estimation using Gaussian finite mixture models. The R Journal, 8(1), 205\u2013233.","journal-title":"The R Journal"},{"issue":"1","key":"9432_CR34","doi-asserted-by":"publisher","first-page":"21","DOI":"10.1007\/BF01202266","volume":"12","author":"M Wedel","year":"1995","unstructured":"Wedel, M., & De Sarbo, W. S. (1995). A mixture likelihood approach for generalized linear models. Journal of Classification, 12(1), 21\u201355.","journal-title":"Journal of Classification"},{"key":"9432_CR35","doi-asserted-by":"publisher","DOI":"10.1007\/978-1-4615-4651-1","volume-title":"Market segmentation: Conceptual and methodological foundations","author":"M Wedel","year":"2000","unstructured":"Wedel, M., & Kamakura, W. A. (2000). Market segmentation: Conceptual and methodological foundations, 2nd Edition. USA: Kluwer Academic Publishers, Boston, MA.","edition":"2nd Edition"}],"container-title":["Journal of Classification"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s00357-023-09432-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s00357-023-09432-4\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s00357-023-09432-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,7,26]],"date-time":"2023-07-26T12:03:54Z","timestamp":1690373034000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s00357-023-09432-4"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,4,4]]},"references-count":35,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2023,7]]}},"alternative-id":["9432"],"URL":"https:\/\/doi.org\/10.1007\/s00357-023-09432-4","relation":{},"ISSN":["0176-4268","1432-1343"],"issn-type":[{"type":"print","value":"0176-4268"},{"type":"electronic","value":"1432-1343"}],"subject":[],"published":{"date-parts":[[2023,4,4]]},"assertion":[{"value":"8 February 2023","order":1,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"4 April 2023","order":2,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The manuscript does not contain any studies involving human or animal participants performed by any of the authors.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"<!--Emphasis Type='Bold' removed-->Consent to Participate"}},{"value":"The authors declare no competing interests.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"<!--Emphasis Type='Bold' removed-->Conflict of Interest"}}]}}