{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,24]],"date-time":"2026-04-24T15:20:17Z","timestamp":1777044017481,"version":"3.51.4"},"reference-count":56,"publisher":"MDPI AG","issue":"11","license":[{"start":{"date-parts":[[2020,10,27]],"date-time":"2020-10-27T00:00:00Z","timestamp":1603756800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Symmetry"],"abstract":"<jats:p>The recommender system is the most profound research area for e-commerce product recommendations. Currently, many e-commerce platforms use a text-based product search, which has limitations to fetch the most similar products. An image-based similarity search for recommendations had considerable gains in popularity for many areas, especially for the e-commerce platforms giving a better visual search experience by the users. In our research work, we proposed a machine-learning-based approach for a similar image-based recommender system. We applied a dimensionality reduction technique using Principal Component Analysis (PCA) through Singular Value Decomposition (SVD) for transforming the extracted features into lower-dimensional space. Further, we applied the K-Means++ clustering approach for the possible cluster identification for a similar group of products. Later, we computed the Manhattan distance measure for the input image to the target clusters set for fetching the top-N similar products with low distance measure. We compared our approach with five different unsupervised clustering algorithms, namely Minibatch, K-Mediod, Agglomerative, Brich, and the Gaussian Mixture Model (GMM), and used the 40,000 fashion product image dataset from the Kaggle web platform for the product recommendation process. We computed various cluster performance metrics on K-means++ and achieved a Silhouette Coefficient (SC) of 0.1414, a Calinski-Harabasz (CH) index score of 669.4, and a Davies\u2013Bouldin (DB) index score of 1.8538. Finally, our proposed PCA-SVD transformed K-mean++ approach showed superior performance compared to the other five clustering approaches for similar image product recommendations.<\/jats:p>","DOI":"10.3390\/sym12111783","type":"journal-article","created":{"date-parts":[[2020,10,29]],"date-time":"2020-10-29T23:06:12Z","timestamp":1604012772000},"page":"1783","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":43,"title":["Probabilistic Unsupervised Machine Learning Approach for a Similar Image Recommender System for E-Commerce"],"prefix":"10.3390","volume":"12","author":[{"given":"Ssvr Kumar","family":"Addagarla","sequence":"first","affiliation":[{"name":"School of Computer Science and Engineering, Vellore Institute of Technology, Vellore 632014, Tamil Nadu, India"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Anthoniraj","family":"Amalanathan","sequence":"additional","affiliation":[{"name":"School of Computer Science and Engineering, Vellore Institute of Technology, Vellore 632014, Tamil Nadu, India"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2020,10,27]]},"reference":[{"key":"ref_1","first-page":"12","article-title":"Asian E-Commerce Engages Global Trade Openness: The Role of Information and Communications Technology, Social, and Security Indicators","volume":"11","author":"Wulansaria","year":"2020","journal-title":"Int. J. Innov. Creat. Chang."},{"key":"ref_2","unstructured":"(2020, February 02). eCommerce\u2014Asia | Statista Market Forecast. Available online: https:\/\/www.statista.com\/outlook\/243\/101\/ecommerce\/asia."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Salau, A.O., and Jain, S. (2019, January 7\u20139). Feature Extraction: A Survey of the Types, Techniques, Applications. Proceedings of the 2019 International Conference on Signal Processing and Communication (ICSC), Noida, India.","DOI":"10.1109\/ICSC45622.2019.8938371"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"14","DOI":"10.1504\/IJCVR.2019.098004","article-title":"Content-Based Image Retrieval: A Deep Look at Features Prospectus","volume":"9","author":"Haji","year":"2019","journal-title":"Int. J. Comput. Vis. Robot."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Kumari, M. (2019, May 14). Content Based Image Retrieval. Available online: https:\/\/papers.ssrn.com\/sol3\/papers.cfm?abstract_id=3371777.","DOI":"10.2139\/ssrn.3371777"},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"6163","DOI":"10.1007\/s11042-018-6192-1","article-title":"Image Retrieval Based on Effective Feature Extraction and Diffusion Process","volume":"78","author":"Zhou","year":"2019","journal-title":"Multimed. Tools Appl."},{"key":"ref_7","first-page":"1323","article-title":"Image Retrieval: A Literature Review","volume":"2","author":"Pal","year":"2013","journal-title":"Int. J. Adv. Res. Comput. Eng. Technol."},{"key":"ref_8","unstructured":"(2020, October 15). Limitations of Text Based Image Retrieval Psychology Essay. Available online: https:\/\/www.ukessays.com\/essays\/psychology\/limitations-of-text-based-image-retrieval-psychology-essay.php."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"37","DOI":"10.1016\/0169-7439(87)80084-9","article-title":"Principal Component Analysis","volume":"2","author":"Wold","year":"1987","journal-title":"Chemom. Intell. Lab. Syst."},{"key":"ref_10","unstructured":"Roweis, S.T. (2020, October 22). EM Algorithms for PCA and SPCA. Advances in Neural Information Processing Systems, Available online: http:\/\/papers.neurips.cc\/paper\/1398-em-algorithms-for-pca-and-spca."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"169","DOI":"10.3233\/AIC-170729","article-title":"Linear Discriminant Analysis: A Detailed Tutorial","volume":"30","author":"Tharwat","year":"2017","journal-title":"AI Commun."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"611","DOI":"10.1111\/1467-9868.00196","article-title":"Probabilistic Principal Component Analysis","volume":"61","author":"Tipping","year":"1999","journal-title":"J. R. Stat. Soc. Ser. B Stat. Methodol."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"19","DOI":"10.1016\/j.cmpb.2016.11.011","article-title":"PCA Based Clustering for Brain Tumor Segmentation of T1w MRI Images","volume":"140","author":"Kaya","year":"2017","journal-title":"Comput. Methods Programs Biomed."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Geng, X., Zhang, H., Bian, J., and Chua, T.-S. (2015, January 7). Learning Image and User Features for Recommendation in Social Networks. Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.486"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Tyagi, V. (2017). Similarity Measures and Performance Evaluation. Content-Based Image Retrieval, Springer.","DOI":"10.1007\/978-981-10-6759-4"},{"key":"ref_16","unstructured":"(2020, February 02). Clustering Scikit-Learn 0.23.2 documentation. Available online: https:\/\/scikit-learn.org\/stable\/modules\/clustering.html#clustering-performance-evaluation."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"261","DOI":"10.1037\/met0000049","article-title":"Properties of the Hubert-Arable Adjusted Rand Index","volume":"21","author":"Steinley","year":"2004","journal-title":"Psychol. Methods"},{"key":"ref_18","unstructured":"Rosenberg, A., and Hirschberg, J. (2007, January 28\u201330). V-Measure: A Conditional Entropy-Based External Cluster Evaluation Measure. Proceedings of the 2007 joint conference on empirical methods in natural language processing and computational natural language learning (EMNLP-CoNLL), Prague, Czech Republic."},{"key":"ref_19","unstructured":"Hirschberg, J.B., and Rosenberg, A. (2020, August 14). V-Measure: A Conditional Entropy-Based External Cluster Evaluation. Available online: http:\/\/www.aclweb.org\/anthologyy\/D07-103."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"97","DOI":"10.22436\/jmcs.06.02.02","article-title":"A New Method for Clustering in Credit Scoring Problems","volume":"6","author":"Gholamian","year":"2013","journal-title":"J. Math. Comput. Sci."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"e109","DOI":"10.2196\/jmir.7087","article-title":"Understanding Health Care Social Media Use from Different Stakeholder Perspectives: A Content Analysis of an Online Health Community","volume":"19","author":"Lu","year":"2017","journal-title":"J. Med. Internet Res."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Hancer, E., Ozturk, C., and Karaboga, D. (2012, January 10\u201315). Artificial Bee Colony Based Image Clustering Method. Proceedings of the 2012 IEEE Congress on Evolutionary Computation, Brisbane, Australia.","DOI":"10.1109\/CEC.2012.6252919"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"573","DOI":"10.1109\/TIP.2012.2219547","article-title":"Fuzzy C-Means Clustering with Local Information and Kernel Metric for Image Segmentation","volume":"22","author":"Gong","year":"2012","journal-title":"IEEE Trans. Image Process."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"959","DOI":"10.1016\/j.asoc.2012.09.013","article-title":"Probability Based Document Clustering and Image Clustering Using Content-Based Image Retrieval","volume":"13","author":"Karthikeyan","year":"2013","journal-title":"Appl. Soft Comput."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"209","DOI":"10.1016\/j.neucom.2014.01.043","article-title":"Image Clustering by Hyper-Graph Regularized Non-Negative Matrix Factorization","volume":"138","author":"Zeng","year":"2014","journal-title":"Neurocomputing"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"587","DOI":"10.1007\/s10044-014-0365-y","article-title":"Improved Clustering Criterion for Image Clustering with Artificial Bee Colony Algorithm","volume":"18","author":"Ozturk","year":"2015","journal-title":"Pattern Anal. Appl."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"6211","DOI":"10.1007\/s12517-014-1584-7","article-title":"Content-Based Image Retrieval Using PSO and k-Means Clustering Algorithm","volume":"8","author":"Younus","year":"2015","journal-title":"Arab. J. Geosci."},{"key":"ref_28","first-page":"793","article-title":"Large-Scale Image Clustering Based on Camera Fingerprints","volume":"12","author":"Lin","year":"2016","journal-title":"IEEE Trans. Inf. Forensics Secur."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"1927","DOI":"10.1016\/j.eswa.2014.10.018","article-title":"Smartphone Image Clustering","volume":"42","author":"Villalba","year":"2015","journal-title":"Expert Syst. Appl."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Yang, J., Parikh, D., and Batra, D. (2016, January 27\u201330). Joint Unsupervised Learning of Deep Representations and Image Clusters. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.556"},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"506","DOI":"10.1016\/j.compeleceng.2016.04.003","article-title":"Content-Based Image Retrieval Embedded with Agglomerative Clustering Built on Information Loss","volume":"54","author":"Pandey","year":"2016","journal-title":"Comput. Electr. Eng."},{"key":"ref_32","unstructured":"L i, S., Yu, H., and Yuan, L. (2016, January 20\u201322). A Novel Approach to Remote Sensing Image Retrieval with Multi-Feature vp-Tree Indexing and Online Feature Selection. Proceedings of the 2016 IEEE Second International Conference on Multimedia Big Data (BigMM), Taipei, Taiwan."},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Biradar, M., and Ahmed, M. (2017, January 2\u20135). Visual Based Information Retrieval Using Voronoi Tree. Proceedings of the 5th International Conference on Frontiers in Intelligent Computing: Theory and Applications, Beijing, China.","DOI":"10.1007\/978-981-10-3156-4_64"},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"2033","DOI":"10.1109\/TMM.2017.2703636","article-title":"Graph PCA Hashing for Similarity Search","volume":"19","author":"Zhu","year":"2017","journal-title":"IEEE Trans. Multimed."},{"key":"ref_35","unstructured":"Fachrurrozi, M., Fiqih, A., Saputra, B.R., Algani, R., and Primanita, A. (2017, January 1\u20132). Content Based Image Retrieval for Multi-Objects Fruits Recognition Using k-Means and k-Nearest Neighbor. Proceedings of the 2017 International Conference on Data and Software Engineering (ICoDSE), Palembang, Indonesia."},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"317","DOI":"10.1007\/s11760-016-0939-9","article-title":"Dimensionality Reduction-Based Feature Extraction and Classification on Fleece Fabric Images","volume":"11","author":"Yildiz","year":"2017","journal-title":"Signal Image Video Process."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Cui, Y., and Wong, W.K. (2018). Textile Image Retrieval Using Joint Local PCA-Based Feature Descriptor. Applications of Computer Vision in Fashion and Textiles, Elsevier.","DOI":"10.1016\/B978-0-08-101217-8.00010-5"},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"375","DOI":"10.1016\/j.patcog.2018.05.030","article-title":"A Fast Clustering Algorithm Based on Pruning Unnecessary Distance Computations in DBSCAN for High-Dimensional Data","volume":"83","author":"Chen","year":"2018","journal-title":"Pattern Recognit."},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"14435","DOI":"10.1007\/s11042-017-5036-8","article-title":"Improved Image Retrieval Using Fast Colour-Texture Features with Varying Weighted Similarity Measure and Random Forests","volume":"77","author":"Singh","year":"2018","journal-title":"Multimed. Tools Appl."},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"44","DOI":"10.1007\/s10916-017-0880-7","article-title":"Content Based Image Retrieval by Using Color Descriptor and Discrete Wavelet Transform","volume":"42","author":"Ashraf","year":"2018","journal-title":"J. Med. Syst."},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"29099","DOI":"10.1007\/s11042-018-6122-2","article-title":"Content-Based Image Retrieval via a Hierarchical-Local-Feature Extraction Scheme","volume":"77","author":"Jian","year":"2018","journal-title":"Multimed. Tools Appl."},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.eswa.2019.06.064","article-title":"Combining Hierarchical Clustering Approaches Using the PCA Method","volume":"137","author":"Jafarzadegan","year":"2019","journal-title":"Expert Syst. Appl."},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Mateen, M., Wen, J., Song, S., and Huang, Z. (2019). Fundus Image Classification Using VGG-19 Architecture with PCA and SVD. Symmetry, 11.","DOI":"10.3390\/sym11010001"},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"185","DOI":"10.1007\/s13735-016-0104-9","article-title":"Image Recommendation Based on Keyword Relevance Using Absorbing Markov Chain and Image Features","volume":"5","author":"Sejal","year":"2016","journal-title":"Int. J. Multimed. Inf. Retr."},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Sha, D., Wang, D., Zhou, X., Feng, S., Zhang, Y., and Yu, G. (2016, January 3\u20135). An Approach for Clothing Recommendation Based on Multiple Image Attributes. Proceedings of the International Conference on Web-Age Information Management, Nanchang, China.","DOI":"10.1007\/978-3-319-39937-9_21"},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"Shrivastava, R., and Sisodia, D.S. (2019, January 23\u201325). Product Recommendations Using Textual Similarity Based Learning Models. Proceedings of the 2019 International Conference on Computer Communication and Informatics (ICCCI), Coimbatore, India.","DOI":"10.1109\/ICCCI.2019.8821893"},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"287","DOI":"10.1080\/01621459.2017.1401542","article-title":"Principal Component Analysis of High-Frequency Data","volume":"114","author":"Xiu","year":"2019","journal-title":"J. Am. Stat. Assoc."},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"90","DOI":"10.1145\/3130348.3130358","article-title":"Information Retrieval Using a Singular Value Decomposition Model of Latent Semantic Structure","volume":"51","author":"Furnas","year":"2017","journal-title":"ACM SIGIR Forum"},{"key":"ref_49","doi-asserted-by":"crossref","first-page":"1253","DOI":"10.1137\/S0895479896305696","article-title":"A Multilinear Singular Value Decomposition","volume":"21","author":"Vandewalle","year":"2000","journal-title":"SIAM J. Matrix Anal. Appl."},{"key":"ref_50","doi-asserted-by":"crossref","first-page":"38","DOI":"10.1109\/5992.814656","article-title":"The QR Algorithm","volume":"2","author":"Parlett","year":"2000","journal-title":"Comput. Sci. Eng."},{"key":"ref_51","doi-asserted-by":"crossref","unstructured":"Wall, M.E., Rechtsteiner, A., and Rocha, L.M. (2003). Singular Value Decomposition and Principal Component Analysis. A Practical Approach to Microarray Data Analysis, Springer.","DOI":"10.1007\/0-306-47815-3_5"},{"key":"ref_52","doi-asserted-by":"crossref","unstructured":"Ding, C., and He, X. (2004, January 4\u20138). K-Means Clustering via Principal Component Analysis. Proceedings of the Twenty-First International Conference on Machine Learning, Banff, AB, Canada.","DOI":"10.1145\/1015330.1015408"},{"key":"ref_53","unstructured":"Arthur, D., and Vassilvitskii, S. (2006). K-Means++: The Advantages of Careful Seeding, Stanford University."},{"key":"ref_54","doi-asserted-by":"crossref","unstructured":"Aranganayagi, S., and Thangavel, K. (2007, January 13\u201315). Clustering Categorical Data Using Silhouette Coefficient as a Relocating Measure. Proceedings of the International Conference on Computational Intelligence and Multimedia Applications (ICCIMA 2007), Sivakasi, India.","DOI":"10.1109\/ICCIMA.2007.328"},{"key":"ref_55","unstructured":"(2020, February 02). Fashion Product Images Dataset | Kaggle. Available online: https:\/\/www.kaggle.com\/paramaggarwal\/fashion-product-images-dataset."},{"key":"ref_56","first-page":"2579","article-title":"Visualizing Data Using T-SNE","volume":"9","author":"Hinton","year":"2008","journal-title":"J. Mach. Learn. Res."}],"container-title":["Symmetry"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2073-8994\/12\/11\/1783\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T10:29:27Z","timestamp":1760178567000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2073-8994\/12\/11\/1783"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,10,27]]},"references-count":56,"journal-issue":{"issue":"11","published-online":{"date-parts":[[2020,11]]}},"alternative-id":["sym12111783"],"URL":"https:\/\/doi.org\/10.3390\/sym12111783","relation":{},"ISSN":["2073-8994"],"issn-type":[{"value":"2073-8994","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,10,27]]}}}