{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T02:19:45Z","timestamp":1760149185880,"version":"build-2065373602"},"reference-count":29,"publisher":"MDPI AG","issue":"7","license":[{"start":{"date-parts":[[2023,7,14]],"date-time":"2023-07-14T00:00:00Z","timestamp":1689292800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"NSFC","award":["61771377","61502376","2023-YBGY-206","2023A1515010671"],"award-info":[{"award-number":["61771377","61502376","2023-YBGY-206","2023A1515010671"]}]},{"name":"Key Research and Development Project of Shannxi Province","award":["61771377","61502376","2023-YBGY-206","2023A1515010671"],"award-info":[{"award-number":["61771377","61502376","2023-YBGY-206","2023A1515010671"]}]},{"name":"GuangDong Basic and Applied Basic Research Foundation","award":["61771377","61502376","2023-YBGY-206","2023A1515010671"],"award-info":[{"award-number":["61771377","61502376","2023-YBGY-206","2023A1515010671"]}]},{"name":"Fundamental Research Funds for the Central Universities of China","award":["61771377","61502376","2023-YBGY-206","2023A1515010671"],"award-info":[{"award-number":["61771377","61502376","2023-YBGY-206","2023A1515010671"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Entropy"],"abstract":"<jats:p>In machine learning and data analysis, dimensionality reduction and high-dimensional data visualization can be accomplished by manifold learning using a t-Distributed Stochastic Neighbor Embedding (t-SNE) algorithm. We significantly improve this manifold learning scheme by introducing a preprocessing strategy for the t-SNE algorithm. In our preprocessing, we exploit Laplacian eigenmaps to reduce the high-dimensional data first, which can aggregate each data cluster and reduce the Kullback\u2013Leibler divergence (KLD) remarkably. Moreover, the k-nearest-neighbor (KNN) algorithm is also involved in our preprocessing to enhance the visualization performance and reduce the computation and space complexity. We compare the performance of our strategy with that of the standard t-SNE on the MNIST dataset. The experiment results show that our strategy exhibits a stronger ability to separate different clusters as well as keep data of the same kind much closer to each other. Moreover, the KLD can be reduced by about 30% at the cost of increasing the complexity in terms of runtime by only 1\u20132%.<\/jats:p>","DOI":"10.3390\/e25071065","type":"journal-article","created":{"date-parts":[[2023,7,17]],"date-time":"2023-07-17T00:35:04Z","timestamp":1689554104000},"page":"1065","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":6,"title":["A Preprocessing Manifold Learning Strategy Based on t-Distributed Stochastic Neighbor Embedding"],"prefix":"10.3390","volume":"25","author":[{"given":"Sha","family":"Shi","sequence":"first","affiliation":[{"name":"State Key Laboratory of Integrated Services Network, Xidian University, 2 South TaiBai Road, Xi\u2019an 710071, China"}]},{"ORCID":"https:\/\/orcid.org\/0009-0001-2742-6760","authenticated-orcid":false,"given":"Yefei","family":"Xu","sequence":"additional","affiliation":[{"name":"State Key Laboratory of Integrated Services Network, Xidian University, 2 South TaiBai Road, Xi\u2019an 710071, China"}]},{"given":"Xiaoyang","family":"Xu","sequence":"additional","affiliation":[{"name":"State Key Laboratory of Integrated Services Network, Xidian University, 2 South TaiBai Road, Xi\u2019an 710071, China"}]},{"given":"Xiaofan","family":"Mo","sequence":"additional","affiliation":[{"name":"National Astronomical Observatories, Chinese Academy of Sciences, 20A Datun Road, Chaoyang District, Beijing 100101, China"}]},{"given":"Jun","family":"Ding","sequence":"additional","affiliation":[{"name":"Institute of Information Sensing, Xidian University, 2 South TaiBai Road, Xi\u2019an 710071, China"}]}],"member":"1968","published-online":{"date-parts":[[2023,7,14]]},"reference":[{"key":"ref_1","unstructured":"Keogh, E., and Mueen, A. (2011). Encyclopedia of Machine Learning, Springer."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"100378","DOI":"10.1016\/j.cosrev.2021.100378","article-title":"Conceptual and empirical comparison of dimensionality reduction algorithms (PCA, KPCA, LDA, MDS, SVD, LLE, ISOMAP, LE, ICA, t-SNE)","volume":"40","author":"Anowar","year":"2021","journal-title":"Comput. Sci. Rev."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"3079","DOI":"10.1109\/TNNLS.2021.3049719","article-title":"Efficient Approximation of High-Dimensional Functions with Neural Networks","volume":"33","author":"Cheridito","year":"2022","journal-title":"IEEE Trans. Neural Networks Learn. Syst."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"102844","DOI":"10.1016\/j.ipm.2021.102844","article-title":"Ensemble unsupervised autoencoders and Gaussian mixture model for cyberattack detection","volume":"59","author":"An","year":"2022","journal-title":"Inf. Process. Manag."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Gorban, A.N., K\u00e9gl, B., Wunsch, D.C., and Zinovyev, A.Y. (2008). Principal Manifolds for Data Visualization and Dimension Reduction, Springer.","DOI":"10.1007\/978-3-540-73750-6"},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"6418","DOI":"10.1109\/TIE.2014.2301773","article-title":"A review on basic data-driven approaches for industrial process monitoring","volume":"61","author":"Yin","year":"2014","journal-title":"IEEE Trans. Ind. Electron."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"54776","DOI":"10.1109\/ACCESS.2020.2980942","article-title":"Analysis of Dimensionality Reduction Techniques on Big Data","volume":"8","author":"Reddy","year":"2020","journal-title":"IEEE Access"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/s41524-020-0276-y","article-title":"Deep learning approach based on dimensionality reduction for designing electromagnetic nanostructures","volume":"6","author":"Kiarashinejad","year":"2020","journal-title":"NPJ Comput. Mater."},{"key":"ref_9","first-page":"9129","article-title":"Understanding how dimension reduction tools work: An empirical approach to deciphering t-SNE, UMAP, TriMAP, and PaCMAP for data visualization","volume":"22","author":"Wang","year":"2021","journal-title":"J. Mach. Learn. Res."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Li, X., Li, P., Zhang, H., Zhu, K., and Zhang, R. (2023). Pivotal-Aware Principal Component Analysis. IEEE Trans. Neural Netw. Learn. Syst., 1\u201310.","DOI":"10.1109\/TNNLS.2023.3252602"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Ejaz, M.S., Islam, M.R., Sifatullah, M., and Sarker, A. (2019, January 3\u20135). Implementation of Principal Component Analysis on Masked and Non-masked Face Recognition. Proceedings of the 2019 1st International Conference on Advances in Science, Engineering and Robotics Technology (ICASERT), Dhaka, Bangladesh.","DOI":"10.1109\/ICASERT.2019.8934543"},{"key":"ref_12","first-page":"157","article-title":"On Learning 3D Face Morphable Model from In-the-Wild Images","volume":"43","author":"Tran","year":"2021","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"147","DOI":"10.1016\/j.jare.2020.01.004","article-title":"Multidimensional scaling locus of memristor and fractional order elements","volume":"25","author":"Lopes","year":"2020","journal-title":"J. Adv. Res."},{"key":"ref_14","first-page":"23","article-title":"Uncertainty-Aware Multidimensional Scaling","volume":"29","author":"Krake","year":"2022","journal-title":"IEEE Trans. Vis. Comput. Graph."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"401","DOI":"10.1109\/T-C.1969.222678","article-title":"A nonlinear mapping for data structure analysis","volume":"100","author":"Sammon","year":"1969","journal-title":"IEEE Trans. Comput."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"1085","DOI":"10.1109\/TMM.2021.3139217","article-title":"Robust image hashing with isomap and saliency map for copy detection","volume":"25","author":"Liang","year":"2021","journal-title":"IEEE Trans. Multimed."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"2323","DOI":"10.1126\/science.290.5500.2323","article-title":"Nonlinear dimensionality reduction by locally linear embedding","volume":"290","author":"Roweis","year":"2000","journal-title":"Science"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Zhang, H., Ding, Y., Meng, H., Ma, S., and Long, Z. (2022, January 26\u201328). Component preserving and adaptive Laplacian Eigenmaps for data reconstruction and dimensionality reduction. Proceedings of the 15th International FLINS Conference (FLINS 2022), Tianjin, China.","DOI":"10.1142\/9789811269264_0076"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"108399","DOI":"10.1016\/j.patcog.2021.108399","article-title":"Kernelized Supervised Laplacian Eigenmap for Visualization and Classification of Multi-Label Data","volume":"123","author":"Tai","year":"2022","journal-title":"Pattern Recognit."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Zhang, Y., Li, B., Liu, Y., Wang, H., and Miao, C. (2021, January 14\u201318). Initialization matters: Regularizing manifold-informed initialization for neural recommendation systems. Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining, Virtual Event.","DOI":"10.1145\/3447548.3467338"},{"key":"ref_21","first-page":"2579","article-title":"Visualizing data using t-SNE","volume":"9","author":"Hinton","year":"2008","journal-title":"J. Mach. Learn. Res."},{"key":"ref_22","first-page":"13581","article-title":"Theoretical foundations of t-sne for visualizing high-dimensional clustered data","volume":"23","author":"Cai","year":"2022","journal-title":"J. Mach. Learn. Res."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"721","DOI":"10.1007\/s10994-021-06043-1","article-title":"Embedding to reference t-SNE space addresses batch effects in single-cell classification","volume":"112","author":"Zupan","year":"2023","journal-title":"Mach. Learn."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Wang, Q., Xia, W., Tao, Z., Gao, Q., and Cao, X. (2021, January 20\u201324). Deep self-supervised t-SNE for multi-modal subspace clustering. Proceedings of the 29th ACM International Conference on Multimedia, Virtual Event.","DOI":"10.1145\/3474085.3475319"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"444","DOI":"10.1109\/TMI.2013.2287121","article-title":"Hierarchical manifold learning for regional image analysis","volume":"33","author":"Bhatia","year":"2014","journal-title":"IEEE Trans. Med. Imaging"},{"key":"ref_26","unstructured":"Hinton, G., and Roweis, S. (2002, January 9\u201314). Stochastic neighbor embedding. Proceedings of the NIPS, Vancouver, BC, Canada."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Guo, Y., Guo, H., and Yu, S.X. (2022, January 18\u201324). Co-sne: Dimensionality reduction and visualization for hyperbolic data. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA.","DOI":"10.1109\/CVPR52688.2022.00011"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"1373","DOI":"10.1162\/089976603321780317","article-title":"Laplacian eigenmaps for dimensionality reduction and data representation","volume":"15","author":"Belkin","year":"2003","journal-title":"Neural Comput."},{"key":"ref_29","first-page":"3221","article-title":"Accelerating t-SNE using tree-based algorithms","volume":"15","year":"2014","journal-title":"J. Mach. Learn. Res."}],"container-title":["Entropy"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1099-4300\/25\/7\/1065\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T20:12:23Z","timestamp":1760127143000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1099-4300\/25\/7\/1065"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,7,14]]},"references-count":29,"journal-issue":{"issue":"7","published-online":{"date-parts":[[2023,7]]}},"alternative-id":["e25071065"],"URL":"https:\/\/doi.org\/10.3390\/e25071065","relation":{},"ISSN":["1099-4300"],"issn-type":[{"type":"electronic","value":"1099-4300"}],"subject":[],"published":{"date-parts":[[2023,7,14]]}}}