{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,5]],"date-time":"2025-06-05T00:40:31Z","timestamp":1749084031508,"version":"3.37.3"},"reference-count":43,"publisher":"Springer Science and Business Media LLC","issue":"2","license":[{"start":{"date-parts":[[2023,4,9]],"date-time":"2023-04-09T00:00:00Z","timestamp":1680998400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,4,9]],"date-time":"2023-04-09T00:00:00Z","timestamp":1680998400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"name":"Universit\u00e0 degli Studi di Roma La Sapienza"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Adv Data Anal Classif"],"published-print":{"date-parts":[[2024,6]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>In this paper, we propose twelve parsimonious models for clustering mixed-type (ordinal and continuous) data. The dependence among the different types of variables is modeled by assuming that ordinal and continuous data follow a multivariate finite mixture of Gaussians, where the ordinal variables are a discretization of some continuous variates of the mixture. The general class of parsimonious models is based on a factor decomposition of the component-specific covariance matrices. Parameter estimation is carried out using a EM-type algorithm based on composite likelihood. The proposal is evaluated through a simulation study and an application to real data.<\/jats:p>","DOI":"10.1007\/s11634-023-00539-5","type":"journal-article","created":{"date-parts":[[2023,4,9]],"date-time":"2023-04-09T10:02:29Z","timestamp":1681034549000},"page":"381-407","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":2,"title":["Composite likelihood methods for parsimonious model-based clustering of mixed-type data"],"prefix":"10.1007","volume":"18","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-7193-8803","authenticated-orcid":false,"given":"Monia","family":"Ranalli","sequence":"first","affiliation":[]},{"given":"Roberto","family":"Rocci","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2023,4,9]]},"reference":[{"issue":"3","key":"539_CR1","doi-asserted-by":"publisher","first-page":"283","DOI":"10.1007\/BF02310791","volume":"35","author":"JD Carroll","year":"1970","unstructured":"Carroll JD, Chang JJ (1970) Analysis of individual differences in multidimensional scaling via an n-way generalization of \u201ceckart-young\u2019\u2019 decomposition. Psychometrika 35(3):283\u2013319","journal-title":"Psychometrika"},{"issue":"4","key":"539_CR2","doi-asserted-by":"publisher","first-page":"267","DOI":"10.1007\/BF02288739","volume":"9","author":"RB Cattell","year":"1944","unstructured":"Cattell RB (1944) \u201cParallel proportional profiles\u2019\u2019 and other principles for determining the choice of factors by rotation. Psychometrika 9(4):267\u2013283","journal-title":"Psychometrika"},{"issue":"484","key":"539_CR3","doi-asserted-by":"publisher","first-page":"1674","DOI":"10.1198\/016214508000001075","volume":"103","author":"J Chen","year":"2008","unstructured":"Chen J, Khalili A (2008) Order selection in finite mixture models with a nonsmooth penalty. J Am Stat Assoc 103(484):1674\u20131683","journal-title":"J Am Stat Assoc"},{"issue":"3","key":"539_CR4","doi-asserted-by":"publisher","first-page":"327","DOI":"10.1207\/S15328007SEM0903_2","volume":"9","author":"C DiStefano","year":"2002","unstructured":"DiStefano C (2002) The impact of categorization with confirmatory factor analysis. Struct Equ Model 9(3):327\u2013346","journal-title":"Struct Equ Model"},{"issue":"2","key":"539_CR5","doi-asserted-by":"publisher","first-page":"309","DOI":"10.1111\/j.2044-8317.1994.tb01039.x","volume":"47","author":"CV Dolan","year":"1994","unstructured":"Dolan CV (1994) Factor analysis of variables with 2, 3, 5 and 7 response categories: a comparison of categorical variable estimators using simulated data. Br J Math Stat Psychol 47(2):309\u2013326","journal-title":"Br J Math Stat Psychol"},{"issue":"5","key":"539_CR6","doi-asserted-by":"publisher","first-page":"305","DOI":"10.1016\/0167-7152(88)90004-1","volume":"6","author":"B Everitt","year":"1988","unstructured":"Everitt B (1988) A finite mixture model for the clustering of mixed-mode data. Stat Probab Lett 6(5):305\u2013309","journal-title":"Stat Probab Lett"},{"key":"539_CR7","doi-asserted-by":"publisher","first-page":"79","DOI":"10.1504\/IJDATS.2022.121506","volume":"14","author":"R Farooq","year":"2022","unstructured":"Farooq R (2022) Heywood cases: possible causes and solutions. Int J Data Anal Tech Strateg 14:79\u201388","journal-title":"Int J Data Anal Tech Strateg"},{"issue":"492","key":"539_CR8","doi-asserted-by":"publisher","first-page":"1531","DOI":"10.1198\/jasa.2010.tm09414","volume":"105","author":"X Gao","year":"2010","unstructured":"Gao X, Song PXK (2010) Composite likelihood Bayesian information criteria for model selection in high-dimensional data. J Am Stat Assoc 105(492):1531\u20131540","journal-title":"J Am Stat Assoc"},{"key":"539_CR9","unstructured":"Ghahramani Z, Hinton GE (1996) The em algorithm for mixtures of factor analyzers"},{"issue":"3","key":"539_CR10","doi-asserted-by":"publisher","first-page":"555","DOI":"10.1007\/s11336-020-09715-4","volume":"85","author":"P Giordani","year":"2020","unstructured":"Giordani P, Rocci R, Bove G (2020) Factor uniqueness of the structural parafac model. Psychometrika 85(3):555\u2013574","journal-title":"Psychometrika"},{"key":"539_CR11","unstructured":"Harshman RA, et\u00a0al (1970) Foundations of the parafac procedure: Models and conditions for an\u201d explanatory\u201d multimodal factor analysis"},{"key":"539_CR12","doi-asserted-by":"crossref","unstructured":"Hennig C, Meila M, Murtagh F, Rocci R (2015) Handbook of cluster analysis. CRC Press","DOI":"10.1201\/b19706"},{"issue":"1","key":"539_CR13","doi-asserted-by":"publisher","first-page":"193","DOI":"10.1007\/BF01908075","volume":"2","author":"L Hubert","year":"1985","unstructured":"Hubert L, Arabie P (1985) Comparing partitions. J Classif 2(1):193\u2013218","journal-title":"J Classif"},{"key":"539_CR14","doi-asserted-by":"publisher","first-page":"85","DOI":"10.1007\/s00357-015-9175-1","volume":"32","author":"S Ingrassia","year":"2015","unstructured":"Ingrassia S, Punzo A, Vittadini G, Minotti S (2015) The generalized linear mixed cluster-weighted model. J Classif 32:85\u2013113","journal-title":"J Classif"},{"issue":"4","key":"539_CR15","doi-asserted-by":"publisher","first-page":"387","DOI":"10.1007\/BF00152012","volume":"24","author":"KG J\u00f6reskog","year":"1990","unstructured":"J\u00f6reskog KG (1990) New developments in lisrel: analysis of ordinal variables using polychoric correlations and weighted least squares. Qual Quant 24(4):387\u2013404","journal-title":"Qual Quant"},{"key":"539_CR16","unstructured":"J\u00f6reskog K\u00a0G, S\u00f6rbom D (1996) LISREL 8: User\u2019s Reference Guide. Scientific Software"},{"issue":"479","key":"539_CR17","doi-asserted-by":"publisher","first-page":"1025","DOI":"10.1198\/016214507000000590","volume":"102","author":"A Khalili","year":"2007","unstructured":"Khalili A, Chen J (2007) Variable selection in finite mixture of regression models. J Am Stat Assoc 102(479):1025\u20131038","journal-title":"J Am Stat Assoc"},{"issue":"2","key":"539_CR18","doi-asserted-by":"publisher","first-page":"95","DOI":"10.1016\/0024-3795(77)90069-6","volume":"18","author":"JB Kruskal","year":"1977","unstructured":"Kruskal JB (1977) Three-way arrays: rank and uniqueness of trilinear decompositions, with application to arithmetic complexity and statistics. Linear Algebra Appl 18(2):95\u2013138","journal-title":"Linear Algebra Appl"},{"issue":"3","key":"539_CR19","first-page":"209","volume":"12","author":"DN Lawley","year":"1962","unstructured":"Lawley DN, Maxwell AE (1962) Factor analysis as a statistical method. J R Stat Soc Ser D (Stat) 12(3):209\u2013229","journal-title":"J R Stat Soc Ser D (Stat)"},{"key":"539_CR20","doi-asserted-by":"publisher","first-page":"85","DOI":"10.1007\/BF02288062","volume":"2","author":"W Ledermann","year":"1937","unstructured":"Ledermann W (1937) On the rank of the reduced correlation matrix in multiple-factor analysis. Psychometrika 2:85\u201393","journal-title":"Psychometrika"},{"issue":"1","key":"539_CR21","doi-asserted-by":"publisher","first-page":"91","DOI":"10.1016\/0167-7152(90)90100-L","volume":"9","author":"SY Lee","year":"1990","unstructured":"Lee SY, Poon WY, Bentler P (1990) Full maximum likelihood analysis of structural equation models with polytomous variables. Stat Probab Lett 9(1):91\u201397","journal-title":"Stat Probab Lett"},{"key":"539_CR22","doi-asserted-by":"publisher","first-page":"221","DOI":"10.1090\/conm\/080\/999014","volume":"80","author":"B Lindsay","year":"1988","unstructured":"Lindsay B (1988) Composite likelihood methods. Contemp Math 80:221\u2013239","journal-title":"Contemp Math"},{"issue":"4","key":"539_CR23","doi-asserted-by":"publisher","first-page":"975","DOI":"10.1093\/biomet\/asp056","volume":"96","author":"KV Mardia","year":"2009","unstructured":"Mardia KV, Kent JT, Hughes G, Taylor CC (2009) Maximum likelihood estimation using composite likelihoods for closed exponential families. Biometrika 96(4):975\u2013982","journal-title":"Biometrika"},{"key":"539_CR24","unstructured":"MATLAB. User\u2019s Guide, R2013b. MathWorks (2013)"},{"issue":"3","key":"539_CR25","doi-asserted-by":"publisher","first-page":"379","DOI":"10.1016\/S0167-9473(02)00183-4","volume":"41","author":"G McLachlan","year":"2003","unstructured":"McLachlan G, Peel D, Bean R (2003) Modelling high-dimensional data by mixtures of factor analyzers. Comput Stat Data Anal 41(3):379\u2013388","journal-title":"Comput Stat Data Anal"},{"issue":"3","key":"539_CR26","doi-asserted-by":"publisher","first-page":"285","DOI":"10.1007\/s11222-008-9056-0","volume":"18","author":"P McNicholas","year":"2008","unstructured":"McNicholas P, Murphy T (2008) Parsimonious gaussian mixture models. Stat Comput 18(3):285\u2013296","journal-title":"Stat Comput"},{"issue":"21","key":"539_CR27","doi-asserted-by":"publisher","first-page":"2705","DOI":"10.1093\/bioinformatics\/btq498","volume":"26","author":"PD McNicholas","year":"2010","unstructured":"McNicholas PD, Murphy TB (2010) Model-based clustering of microarray expression data via latent Gaussian mixture models. Bioinformatics 26(21):2705\u20132712","journal-title":"Bioinformatics"},{"key":"539_CR28","first-page":"1","volume":"10","author":"D Mcparland","year":"2015","unstructured":"Mcparland D, Gormley I (2015) Model based clustering for mixed data: clustmd. Adv Data Anal Classif 10:1","journal-title":"Adv Data Anal Classif"},{"key":"539_CR29","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1214\/14-AOAS726","volume":"8","author":"D Mcparland","year":"2014","unstructured":"Mcparland D, Gormley I, McCormick T, Clark S, Kabudula C, Collinson M (2014) Clustering south African households based on their asset status using latent variable models. Ann Appl Stat 8:1","journal-title":"Ann Appl Stat"},{"key":"539_CR30","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1002\/sim.7371","volume":"36","author":"D Mcparland","year":"2017","unstructured":"Mcparland D, Phillips C, Brennan L, Roche H, Gormley I (2017) Clustering high-dimensional mixed data to uncover sub-phenotypes: joint analysis of phenotypic and genotypic data. Stat Med 36:1","journal-title":"Stat Med"},{"issue":"3","key":"539_CR31","doi-asserted-by":"publisher","first-page":"479","DOI":"10.1207\/S15327906MBR3903_4","volume":"39","author":"RE Millsap","year":"2004","unstructured":"Millsap RE, Yun-Tein J (2004) Assessing factorial invariance in ordered-categorical measures. Multivar Behav Res 39(3):479\u2013515","journal-title":"Multivar Behav Res"},{"key":"539_CR32","unstructured":"Molenberghs G, Verbeke G (2005) Models for discrete longitudinal data. Springer Series in Statistics Series. Springer, New York"},{"key":"539_CR33","doi-asserted-by":"publisher","first-page":"293","DOI":"10.1007\/s11634-019-00373-8","volume":"14","author":"K Murphy","year":"2020","unstructured":"Murphy K, Murphy TB (2020) Gaussian parsimonious clustering models with covariates and a noise component. Adv Data Anal Classif 14:293\u2013325","journal-title":"Adv Data Anal Classif"},{"issue":"1","key":"539_CR34","doi-asserted-by":"publisher","first-page":"115","DOI":"10.1007\/BF02294210","volume":"49","author":"B Muth\u00e9n","year":"1984","unstructured":"Muth\u00e9n B (1984) A general structural equation model with dichotomous, ordered categorical, and continuous latent variable indicators. Psychometrika 49(1):115\u2013132","journal-title":"Psychometrika"},{"key":"539_CR35","first-page":"1","volume":"1","author":"M Ranalli","year":"2016","unstructured":"Ranalli M, Rocci R (2016) Mixture models for ordinal data: a pairwise likelihood approach. Stat Comput 1:1\u201319","journal-title":"Stat Comput"},{"issue":"C","key":"539_CR36","doi-asserted-by":"publisher","first-page":"87","DOI":"10.1016\/j.csda.2016.12.016","volume":"110","author":"M Ranalli","year":"2017","unstructured":"Ranalli M, Rocci R (2017) Mixture models for mixed-type data through a composite likelihood approach. Comput Stat Data Anal 110(C):87\u2013102","journal-title":"Comput Stat Data Anal"},{"issue":"5","key":"539_CR37","doi-asserted-by":"publisher","first-page":"781","DOI":"10.1080\/10705511.2019.1668276","volume":"27","author":"J Revuelta","year":"2019","unstructured":"Revuelta J, Maydeu Olivares A, Xim\u00e9nez C (2019) Factor analysis for nominal (first choice) data. Struct Equ Modeling 27(5):781\u2013797","journal-title":"Struct Equ Modeling"},{"issue":"3","key":"539_CR38","doi-asserted-by":"publisher","first-page":"354","DOI":"10.1037\/a0029315","volume":"17","author":"M Rhemtulla","year":"2012","unstructured":"Rhemtulla M, Brosseau-Liard P\u00c9, Savalei V (2012) When can categorical variables be treated as continuous? a comparison of robust continuous and categorical sem estimation methods under suboptimal conditions. Psychol Methods 17(3):354","journal-title":"Psychol Methods"},{"key":"539_CR39","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1016\/0024-3795(85)90038-2","volume":"70","author":"A Shapiro","year":"1985","unstructured":"Shapiro A (1985) Identifiability of factor analysis: some results and open problems. Linear Algebra Appl 70:1\u20137","journal-title":"Linear Algebra Appl"},{"key":"539_CR40","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1111\/bmsp.12061","volume":"69","author":"A Stegeman","year":"2016","unstructured":"Stegeman A, Lam TTT (2016) Multi-set factor analysis by means of parafac2. Br J Math Stat Psychol 69:1\u201319","journal-title":"Br J Math Stat Psychol"},{"key":"539_CR41","unstructured":"Tipping M, House SG, Street G, Nh CC (1999) Probabilistic visualisation of high-dimensional binary data"},{"issue":"1","key":"539_CR42","first-page":"1","volume":"21","author":"C Varin","year":"2011","unstructured":"Varin C, Reid N, Firth D (2011) An overview of composite likelihood methods. Stat Sin 21(1):1\u201341","journal-title":"Stat Sin"},{"issue":"9","key":"539_CR43","doi-asserted-by":"publisher","first-page":"3047","DOI":"10.1016\/j.jspi.2011.03.026","volume":"141","author":"X Xu","year":"2011","unstructured":"Xu X, Reid N (2011) On the robustness of maximum composite likelihood estimate. J Stat Plan Inference 141(9):3047\u20133054","journal-title":"J Stat Plan Inference"}],"container-title":["Advances in Data Analysis and Classification"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11634-023-00539-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s11634-023-00539-5\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11634-023-00539-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,6,19]],"date-time":"2024-06-19T08:19:15Z","timestamp":1718785155000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s11634-023-00539-5"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,4,9]]},"references-count":43,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2024,6]]}},"alternative-id":["539"],"URL":"https:\/\/doi.org\/10.1007\/s11634-023-00539-5","relation":{},"ISSN":["1862-5347","1862-5355"],"issn-type":[{"type":"print","value":"1862-5347"},{"type":"electronic","value":"1862-5355"}],"subject":[],"published":{"date-parts":[[2023,4,9]]},"assertion":[{"value":"17 February 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"10 March 2023","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"13 March 2023","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"9 April 2023","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}