{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,12]],"date-time":"2026-06-12T19:13:29Z","timestamp":1781291609213,"version":"3.54.1"},"reference-count":28,"publisher":"MDPI AG","issue":"9","license":[{"start":{"date-parts":[[2020,9,14]],"date-time":"2020-09-14T00:00:00Z","timestamp":1600041600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100002573","name":"Yonsei University","doi-asserted-by":"publisher","award":["Research Grant of 2020"],"award-info":[{"award-number":["Research Grant of 2020"]}],"id":[{"id":"10.13039\/501100002573","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Symmetry"],"abstract":"<jats:p>Fuzzy clustering has been broadly applied to classify data into K clusters by assigning membership probabilities of each data point close to K centroids. Such a function has been applied into characterizing the clusters associated with a statistical model such as structural equation modeling. The characteristics identified by the statistical model further define the clusters as heterogeneous groups selected from a population. Recently, such statistical model has been formulated as fuzzy clusterwise generalized structured component analysis (fuzzy clusterwise GSCA). The same as in fuzzy clustering, the clusters are enumerated to infer the population and its parameters within the fuzzy clusterwise GSCA. However, the identification of clusters in fuzzy clustering is a difficult task because of the data-dependence of classification indexes, which is known as a cluster validity problem. We examined the cluster validity problem within the fuzzy clusterwise GSCA framework and proposed a new criterion for selecting the most optimal number of clusters using both fit indexes of the GSCA and the fuzzy validity indexes in fuzzy clustering. The criterion, named the FIT-FHV method combining a fit index, FIT, from GSCA and a cluster validation measure, FHV, from fuzzy clustering, performed better than any other indices used in fuzzy clusterwise GSCA.<\/jats:p>","DOI":"10.3390\/sym12091514","type":"journal-article","created":{"date-parts":[[2020,9,14]],"date-time":"2020-09-14T09:04:53Z","timestamp":1600074293000},"page":"1514","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":7,"title":["Efficiency of Cluster Validity Indexes in Fuzzy Clusterwise Generalized Structured Component Analysis"],"prefix":"10.3390","volume":"12","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-7546-6911","authenticated-orcid":false,"given":"Ji Hoon","family":"Ryoo","sequence":"first","affiliation":[{"name":"Department of Education, College of Educational Sciences, Yonsei University, Seoul 03722, Korea"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Seohee","family":"Park","sequence":"additional","affiliation":[{"name":"Department of Educational Measurement and Statistics, College of Education, University of Iowa, Iowa, IA 52242, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Seongeun","family":"Kim","sequence":"additional","affiliation":[{"name":"Department of Educational Research Methodology, School of Education, University of North Carolina at Greensboro, Greensboro, NC 27412, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Hyun Suk","family":"Ryoo","sequence":"additional","affiliation":[{"name":"Department of Computer Science, College of Arts and Science, University of Virginia, Charlottesville, VA 22904, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2020,9,14]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Marcoulides, G.A., and Schumaker, R.E. (2001). Latent variable mixture modeling. New Developments and Techniques in Structural Equation Modeling, Erlbaum.","DOI":"10.4324\/9781410601858"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Hwang, H., and Takane, Y. (2014). Generalized Structured Component Analysis: A Component-Based Approach to Structural Equation Modeling, CRC Press.","DOI":"10.1201\/b17872"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"181","DOI":"10.1007\/s11336-005-1314-x","article-title":"Fuzzy clusterwise generalized structured component analysis","volume":"72","author":"Hwang","year":"2007","journal-title":"Psychometrika"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"291","DOI":"10.1007\/s41237-019-00084-6","article-title":"Categorical latent variable modeling utilizing fuzzy clustering generalized structured component analysis as an alternative to latent class analysis","volume":"47","author":"Ryoo","year":"2020","journal-title":"Behaviormetrika"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"294","DOI":"10.1016\/0377-2217(82)90228-4","article-title":"Fuzzy clustering algorithms and their cluster validity","volume":"10","author":"Roubens","year":"1982","journal-title":"Eur. J. Oper. Res."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"2095","DOI":"10.1016\/j.fss.2007.03.004","article-title":"On fuzzy cluster validity indices","volume":"158","author":"Wang","year":"2007","journal-title":"Fuzzy Sets Syst."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"57","DOI":"10.1007\/BF02339490","article-title":"Numerical taxonomy with fuzzy sets","volume":"1","author":"Bezdek","year":"1974","journal-title":"J. Math. Biol."},{"key":"ref_8","unstructured":"Goldberger, A.S., and Duncan, O.D. (1973). A general method for estimating a linear structural equation system. Structural Equation Models in the Social Sciences, Seminar Press."},{"key":"ref_9","first-page":"1","article-title":"Nonlinear generalized structured component analysis","volume":"37","author":"Hwang","year":"2010","journal-title":"Psychometrika"},{"key":"ref_10","unstructured":"McDonald, R.P. (1999). Test Theory: A Unified Treatment, Lawrence Erlbaum Associates."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"471","DOI":"10.1007\/BF02296972","article-title":"Additive structure in qualitative data: An alternating least squares method with optimal scaling features","volume":"41","author":"Young","year":"1976","journal-title":"Psychometrika"},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"347","DOI":"10.1007\/BF02293796","article-title":"Quantitative analysis of qualitative data","volume":"46","author":"Young","year":"1981","journal-title":"Psychometrika"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Bezdek, J.C. (1981). Pattern Recognition with Fuzzy Objective Function Algorithms, Plenum Press.","DOI":"10.1007\/978-1-4757-0450-1"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"2137","DOI":"10.3389\/fpsyg.2017.02137","article-title":"Generalized structured component analysis with uniqueness terms for accommodating measurement error","volume":"8","author":"Hwang","year":"2017","journal-title":"Front. Psychol."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"613","DOI":"10.1016\/0167-8655(96)00026-8","article-title":"Validating fuzzy partitions obtained through c-shells clustering","volume":"17","author":"Dave","year":"1996","journal-title":"Pattern Recognit. Lett."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"173","DOI":"10.1080\/01621459.1988.10478584","article-title":"Concomitant-variable latent-class models","volume":"83","author":"Dayton","year":"1988","journal-title":"J. Am. Stat. Assoc."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"707","DOI":"10.1007\/BF02296405","article-title":"A simulated annealing methodology for clusterwise linear regression","volume":"54","author":"DeSarbo","year":"1989","journal-title":"Psychometrika"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"215","DOI":"10.2307\/1165269","article-title":"Estimating the concomitant-variable latent-class model with the EM algorithm","volume":"21","author":"Dessens","year":"1996","journal-title":"J. Educ. Behav. Stat."},{"key":"ref_19","unstructured":"Bezdek, J.C. (1975). Mathematical models for systematics and taxonomy. Proceedings of the 8th International Conference on Numerical Taxonomy, Freeman."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"243","DOI":"10.1016\/S0165-0114(03)00160-X","article-title":"Rule-base self-generation and simplication for data-driven fuzzy models","volume":"142","author":"Chen","year":"2004","journal-title":"Fuzzy Sets Syst."},{"key":"ref_21","unstructured":"Fukiyama, Y., and Sugeno, M. (, January June). A new method of choosing the number of clusters for the fuzzy c-means method. Proceedings of the Fifth Fuzzy Systems Symposium, Kobe, Japan. Available online: https:\/\/jglobal.jst.go.jp\/en\/detail?JGLOBAL_ID=200902072543924485."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"773","DOI":"10.1109\/34.192473","article-title":"Unsupervised optimal fuzzy clustering","volume":"11","author":"Gath","year":"1989","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"563","DOI":"10.1037\/met0000095","article-title":"A comparison of latent class, K-means, and K-median methods for clustering dichotomous data","volume":"22","author":"Brusco","year":"2017","journal-title":"Psychol. Methods"},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"137","DOI":"10.1007\/BF02294713","article-title":"An examination of indices for determining the number of clusters in binary data sets","volume":"67","author":"Dimitriadou","year":"2002","journal-title":"Psychometrika"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Ryoo, J., Park, S., Kim, S., and Hwang, H. (2020, June 08). gscaLCA: Generalized Structure Component Analysis\u2014Latent Class Analysis & Latent Class Regression. R Package Version 0.0.5. Available online: https:\/\/CRAN.R-project.org\/package=gscaLCA.","DOI":"10.32614\/CRAN.package.gscaLCA"},{"key":"ref_26","unstructured":"Harris, K.M. The National Longitudinal Study of Adolescent to Adult Health (Add Health), Waves I & II, 1994\u20131996; Wave III, 2001\u20132002; Wave IV, 2007\u20132009 (Machine-Readable Data File and Documentation), Carolina Population Center, University of North Carolina at Chapel Hill. Available online: https:\/\/www.icpsr.umich.edu\/web\/DSDR\/studies\/21600\/versions\/V21."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Zhang, Y., Martinez-Garcia, M., and Latimer, A. (2017, January 5\u20138). Estimating gas turbine compressor discharge temperature using Bayesian neuro-fuzzy modelling. Proceedings of the 2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC), Banff, AB, Canada.","DOI":"10.1109\/SMC.2017.8123194"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Zhang, Y., Mart\u00ednez-Garc\u00eda, M., and Latimer, A. (2018, January 11\u201315). Selecting Optimal Features for Cross-Fleet Analysis and Fault Diagnosis of Industrial Gas Turbines. Proceedings of the ASME Turbo Expo 2018: Turbomachinery Technical Conference and Exposition, Oslo, Norway.","DOI":"10.1115\/GT2018-75286"}],"container-title":["Symmetry"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2073-8994\/12\/9\/1514\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T10:09:54Z","timestamp":1760177394000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2073-8994\/12\/9\/1514"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,9,14]]},"references-count":28,"journal-issue":{"issue":"9","published-online":{"date-parts":[[2020,9]]}},"alternative-id":["sym12091514"],"URL":"https:\/\/doi.org\/10.3390\/sym12091514","relation":{},"ISSN":["2073-8994"],"issn-type":[{"value":"2073-8994","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,9,14]]}}}