{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,20]],"date-time":"2026-03-20T00:50:39Z","timestamp":1773967839484,"version":"3.50.1"},"reference-count":35,"publisher":"MDPI AG","issue":"6","license":[{"start":{"date-parts":[[2025,5,26]],"date-time":"2025-05-26T00:00:00Z","timestamp":1748217600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Information"],"abstract":"<jats:p>In today\u2019s data-driven business landscape, effective customer segmentation is crucial for enhancing engagement, loyalty, and profitability. Traditional clustering methods often struggle with datasets containing both numerical and categorical variables, leading to suboptimal segmentation. This study addresses this limitation by introducing a novel application of Factor Analysis of Mixed Data (FAMD) for dimensionality reduction, integrated with K-means and Agglomerative Clustering for robust customer segmentation. While FAMD is not new in data analytics, its potential in customer segmentation has been underexplored. This research bridges that gap by demonstrating how FAMD can harmonize mixed data types, preserving structural relationships that conventional methods overlook. The proposed methodology was tested on a Kaggle-sourced retail dataset comprising 3900 customers, with preprocessing steps including correlation ratio filtering (\u03b7 \u2265 0.03), standardization, and encoding. FAMD reduced the feature space to three principal components, capturing 81.46% of the variance, which facilitated clearer segmentation. Comparative clustering analysis showed that Agglomerative Clustering (Silhouette Score: 0.52) outperformed K-means (0.51) at k = 4, revealing distinct customer segments such as seasonal shoppers and high spenders. Practical implications include the development of targeted marketing strategies, validated through heatmap visualizations and cluster profiling. This study not only underscores the suitability of FAMD for customer segmentation but also sets the stage for more nuanced marketing analytics driven by mixed-data methodologies.<\/jats:p>","DOI":"10.3390\/info16060441","type":"journal-article","created":{"date-parts":[[2025,5,27]],"date-time":"2025-05-27T11:12:57Z","timestamp":1748344377000},"page":"441","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":8,"title":["Enhancing Customer Segmentation Through Factor Analysis of Mixed Data (FAMD)-Based Approach Using K-Means and Hierarchical Clustering Algorithms"],"prefix":"10.3390","volume":"16","author":[{"ORCID":"https:\/\/orcid.org\/0009-0002-1704-2330","authenticated-orcid":false,"given":"Chukwutem Pinic","family":"Ufeli","sequence":"first","affiliation":[{"name":"College of Science and Engineering, University of Derby, Kedleston Road, Derby DE22 1GB, UK"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1457-3021","authenticated-orcid":false,"given":"Mian Usman","family":"Sattar","sequence":"additional","affiliation":[{"name":"College of Science and Engineering, University of Derby, Kedleston Road, Derby DE22 1GB, UK"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8089-837X","authenticated-orcid":false,"given":"Raza","family":"Hasan","sequence":"additional","affiliation":[{"name":"Department of Science and Engineering, Solent University, Southampton SO14 0YN, UK"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2860-4095","authenticated-orcid":false,"given":"Salman","family":"Mahmood","sequence":"additional","affiliation":[{"name":"Department of Computer Science, Nazeer Hussain University, ST-2, Near Karimabad, Karachi 75950, Pakistan"}]}],"member":"1968","published-online":{"date-parts":[[2025,5,26]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"527","DOI":"10.1007\/s10257-023-00640-4","article-title":"A review on customer segmentation methods for personalized customer targeting in e-commerce use cases","volume":"21","author":"Meisen","year":"2023","journal-title":"Inf. Syst. E-Bus. Manag."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Tabianan, K., Velu, S., and Ravi, V. (2022). K-Means Clustering Approach for Intelligent Customer Segmentation Using Customer Purchase Behavior Data. Sustainability, 14.","DOI":"10.3390\/su14127243"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"69227","DOI":"10.1109\/ACCESS.2023.3292516","article-title":"Knowledge Extraction from PV Power Generation with Deep Learning Autoencoder and Clustering-Based Algorithms","volume":"11","author":"Miraftabzadeh","year":"2023","journal-title":"IEEE Access"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Miraftabzadeh, S.M., Longo, M., Foiadelli, F., Pasetti, M., and Igual, R. (2021). Advances in the Application of Machine Learning Techniques for Power System Analytics: A Survey. Energies, 14.","DOI":"10.3390\/en14164776"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"117704","DOI":"10.1016\/j.apenergy.2021.117704","article-title":"A temporal distributed hybrid deep learning model for day-ahead distributed PV power forecasting","volume":"304","author":"Qu","year":"2021","journal-title":"Appl. Energy"},{"key":"ref_6","first-page":"37","article-title":"K-means clustering algorithm: A brief review","volume":"4","author":"Chong","year":"2021","journal-title":"Acad. J. Comput. Inf. Sci."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Ahmed, M., Seraj, R., and Islam, S.M.S. (2020). The k-means Algorithm: A Comprehensive Survey and Performance Evaluation. Electronics, 9.","DOI":"10.3390\/electronics9081295"},{"key":"ref_8","first-page":"122","article-title":"Analysis of Unsupervised Machine Learning Techniques for an Efficient Customer Segmentation using Clustering Ensemble and Spectral Clustering","volume":"13","author":"Hicham","year":"2022","journal-title":"Int. J. Adv. Comput. Sci. Appl."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"1865598","DOI":"10.1080\/23311916.2020.1865598","article-title":"Customer behaviour analysis based on buying-data sparsity for multi-category products in pork industry: A hybrid approach","volume":"8","author":"Apichottanakul","year":"2021","journal-title":"Cogent Eng."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Ashabi, A., Sahibuddin, S.B., and Salkhordeh Haghighi, M. (2020, January 18\u201320). The Systematic Review of K-Means Clustering Algorithm. Proceedings of the 2020 The 9th International Conference on Networks, Communication and Computing, Tokyo, Japan.","DOI":"10.1145\/3447654.3447657"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1007\/s40092-018-0285-3","article-title":"Customer behavior mining framework (cbmf) using clustering and classification techniques","volume":"15","author":"Abdi","year":"2019","journal-title":"J. Ind. Eng. Int."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"809","DOI":"10.3390\/analytics2040042","article-title":"An Exploration of Clustering Algorithms for Customer Segmentation in the UK Retail Market","volume":"2","author":"John","year":"2023","journal-title":"Analytics"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"121449","DOI":"10.1016\/j.eswa.2023.121449","article-title":"RFM model customer segmentation based on hierarchical approach using FCA","volume":"237","author":"Rungruang","year":"2024","journal-title":"Expert Syst. Appl."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"1447","DOI":"10.1007\/s43615-023-00336-4","article-title":"Examination of the Criticality of Customer Segmentation Using Unsupervised Learning Methods","volume":"4","author":"Saxena","year":"2024","journal-title":"Circ. Econ. Sustain."},{"key":"ref_15","first-page":"119","article-title":"Market Segmentation in Pakistan: A Mona Lisa Smile or a Big Fat Smile?","volume":"5","author":"Rehman","year":"2024","journal-title":"Qlantic J. Soc. Sci."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"51","DOI":"10.3390\/businesses1010005","article-title":"Geo-Marketing Segmentation with Deep Learning","volume":"1","author":"Ansari","year":"2021","journal-title":"Businesses"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"1251","DOI":"10.1016\/j.jksuci.2018.09.004","article-title":"RFM ranking\u2014An effective approach to customer segmentation","volume":"33","author":"Christy","year":"2021","journal-title":"J. King Saud. Univ. Comput. Inf. Sci."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"122310","DOI":"10.1016\/j.eswa.2023.122310","article-title":"Multiple criteria decision support system for customer segmentation using a sorting outranking method","volume":"238","author":"Barrera","year":"2024","journal-title":"Expert Syst. Appl."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1109\/TKDE.2023.3290371","article-title":"A Comprehensive Survey on Multi-view Clustering","volume":"35","author":"Fang","year":"2023","journal-title":"IEEE Trans. Knowl. Data Eng."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"47","DOI":"10.1007\/978-3-030-29407-6_5","article-title":"Machine Learning: A Review of the Algorithms and Its Applications","volume":"597","author":"Dhall","year":"2019","journal-title":"Proc. ICRIC 2019"},{"key":"ref_21","unstructured":"Cormen, T.H., Leiserson, C.E., Rivest, R.L., and Stein, C. (2022). Introduction to Algorithms, The MIT Press. [4th ed.]."},{"key":"ref_22","first-page":"1","article-title":"A K-means clustering model for analyzing the Bitcoin extreme value returns","volume":"6","author":"Das","year":"2023","journal-title":"Decis. Anal. J."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Sreekala, K., Sridivya, R., Rao, N.K.K., Mandal, R.K., Moses, G.J., and Lakshmanarao, A. (2024, January 1\u20133). A hybrid Kmeans and ML Classification Approach for Credit Card Fraud Detection. Proceedings of the 2024 3rd International Conference for Innovation in Technology (INOCON), Bangalore, India.","DOI":"10.1109\/INOCON60754.2024.10511603"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Rajput, L., and Singh, S.N. (2023, January 19\u201320). Customer Segmentation of E-commerce data using K-means Clustering Algorithm. Proceedings of the 2023 13th International Conference on Cloud Computing, Data Science & Engineering (Confluence), Noida, India.","DOI":"10.1109\/Confluence56041.2023.10048834"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"768","DOI":"10.59141\/jist.v5i3.935","article-title":"Customer Segmentation With K-Means Clustering Suzuki Mobil Bandung Customer Case Study","volume":"5","author":"Kadarsah","year":"2024","journal-title":"J. Indones. Sos. Teknol."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"26","DOI":"10.2478\/bsrj-2023-0002","article-title":"An Extended RFM Model for Customer Behaviour and Demographic Analysis in Retail Industry","volume":"14","author":"Ho","year":"2023","journal-title":"Bus. Syst. Res."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"111262","DOI":"10.1016\/j.knosys.2023.111262","article-title":"Speeding up k-means clustering in high dimensions by pruning unnecessary distance computations","volume":"284","author":"Zhang","year":"2024","journal-title":"Knowl. Based Syst."},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Afzal, A., Khan, L., Hussain, M.Z., Zulkifl Hasan, M., Mustafa, M., Khalid, A., Awan, R., Ashraf, F., Khan, Z.A., and Javaid, A. (2024, January 5\u20137). Customer Segmentation Using Hierarchical Clustering. Proceedings of the 2024 IEEE 9th International Conference for Convergence in Technology (I2CT), Pune, India.","DOI":"10.1109\/I2CT61223.2024.10543349"},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"84","DOI":"10.1177\/1938965519857539","article-title":"I Earn It, But They Just Get It: Loyalty Program Customer Reactions to Unearned Preferential Treatment in the Social Servicescape","volume":"61","author":"Kim","year":"2020","journal-title":"Cornell Hosp. Q."},{"key":"ref_30","first-page":"1","article-title":"Self-Organizing Maps, theory and applications","volume":"39","author":"Cottrell","year":"2018","journal-title":"Investig. Oper."},{"key":"ref_31","first-page":"539","article-title":"Optimizing Customer Segmentation in Online Retail Transactions through the Implementation of the K-Means Clustering Algorithm","volume":"11","author":"Awaliyah","year":"2024","journal-title":"Sci. J. Inform."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Narayana, V.L., Sirisha, S., Divya, G., Pooja, N.L.S., and Nouf, S.A. (2022, January 16\u201318). Mall Customer Segmentation Using Machine Learning. Proceedings of the 2022 International Conference on Electronics and Renewable Systems (ICEARS), Tuticorin, India.","DOI":"10.1109\/ICEARS53579.2022.9752447"},{"key":"ref_33","first-page":"139","article-title":"Customer Segmentation: Transformation from Data to Marketing Strategy","volume":"4","author":"Abednego","year":"2023","journal-title":"Conf. Ser."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Ullah, A., Mohmand, M.I., Hussain, H., Johar, S., Khan, I., Ahmad, S., Mahmoud, H.A., and Huda, S. (2023). Customer Analysis Using Machine Learning-Based Classification Algorithms for Effective Segmentation Using Recency, Frequency, Monetary, and Time. Sensors, 23.","DOI":"10.3390\/s23063180"},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"89","DOI":"10.30871\/jaic.v7i1.4947","article-title":"Analysis of Elbow, Silhouette, Davies-Bouldin, Calinski-Harabasz, and Rand-Index Evaluation on K-Means Algorithm for Classifying Flood-Affected Areas in Jakarta","volume":"7","author":"Ashari","year":"2023","journal-title":"J. Appl. Inform. Comput."}],"container-title":["Information"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2078-2489\/16\/6\/441\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,9]],"date-time":"2025-10-09T17:40:51Z","timestamp":1760031651000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2078-2489\/16\/6\/441"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,5,26]]},"references-count":35,"journal-issue":{"issue":"6","published-online":{"date-parts":[[2025,6]]}},"alternative-id":["info16060441"],"URL":"https:\/\/doi.org\/10.3390\/info16060441","relation":{},"ISSN":["2078-2489"],"issn-type":[{"value":"2078-2489","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,5,26]]}}}