{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,5]],"date-time":"2026-02-05T06:03:47Z","timestamp":1770271427849,"version":"3.49.0"},"reference-count":41,"publisher":"MDPI AG","issue":"10","license":[{"start":{"date-parts":[[2023,10,4]],"date-time":"2023-10-04T00:00:00Z","timestamp":1696377600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100000266","name":"EPSRC","doi-asserted-by":"publisher","award":["EP\/S021892\/1"],"award-info":[{"award-number":["EP\/S021892\/1"]}],"id":[{"id":"10.13039\/501100000266","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Algorithms"],"abstract":"<jats:p>We present a novel approach to providing greater insight into the characteristics of an unlabelled dataset, increasing the efficiency with which labelled datasets can be created. We leverage dimension-reduction techniques in combination with autoencoders to create an efficient feature representation for image tiles derived from remote sensing satellite imagery. The proposed methodology consists of two main stages. Firstly, an autoencoder network is utilised to reduce the high-dimensional image tile data into a compact and expressive latentfeature representation. Subsequently, features are further reduced to a two-dimensional embedding space using the manifold learning algorithm Uniform Manifold Approximation and Projection (UMAP) and t-distributed Stochastic Neighbour Embedding (t-SNE). This step enables the visualization of the image tile clusters in a 2D plot, providing an intuitive and interactive representation that can be used to aid rapid and geographically distributed image labelling. To facilitate the labelling process, our approach allows users to interact with the 2D visualization and label clusters based on their domain knowledge. In cases where certain classes are not effectively separated, users can re-apply dimension reduction to interactively refine subsets of clusters and achieve better class separation, enabling a comprehensively labelled dataset. We evaluate the proposed approach on real-world remote sensing satellite image datasets and demonstrate its effectiveness in achieving accurate and efficient image tile clustering and labelling. Users actively participate in the labelling process through our interactive approach, leading to enhanced relevance of the labelled data, by allowing domain experts to contribute their expertise and enrich the dataset for improved downstream analysis and applications.<\/jats:p>","DOI":"10.3390\/a16100469","type":"journal-article","created":{"date-parts":[[2023,10,4]],"date-time":"2023-10-04T11:58:57Z","timestamp":1696420737000},"page":"469","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":2,"title":["Manifold Explorer: Satellite Image Labelling and Clustering Tool with Using Deep Convolutional Autoencoders"],"prefix":"10.3390","volume":"16","author":[{"ORCID":"https:\/\/orcid.org\/0009-0004-6699-9707","authenticated-orcid":false,"given":"Tulsi","family":"Patel","sequence":"first","affiliation":[{"name":"Department of Computer Science, Swansea University, Swansea SA2 8PP, UK"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8991-1190","authenticated-orcid":false,"given":"Mark W.","family":"Jones","sequence":"additional","affiliation":[{"name":"Department of Computer Science, Swansea University, Swansea SA2 8PP, UK"}]},{"given":"Thomas","family":"Redfern","sequence":"additional","affiliation":[{"name":"UK Hydrographic Office, Taunton TA1 2DN, UK"}]}],"member":"1968","published-online":{"date-parts":[[2023,10,4]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"47","DOI":"10.1016\/j.future.2014.10.029","article-title":"Remote sensing big data computing: Challenges and opportunities","volume":"51","author":"Ma","year":"2015","journal-title":"Future Gener. Comput. Syst."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"2207","DOI":"10.1109\/JPROC.2016.2598228","article-title":"Big data for remote sensing: Challenges and opportunities","volume":"104","author":"Chi","year":"2016","journal-title":"Proc. IEEE"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"3025","DOI":"10.1080\/01431160600589179","article-title":"Modification of normalised difference water index (NDWI) to enhance open water features in remotely sensed imagery","volume":"27","author":"Xu","year":"2006","journal-title":"Int. J. Remote Sens."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Pettorelli, N. (2013). The Normalized Difference Vegetation Index, Oxford University Press.","DOI":"10.1093\/acprof:osobl\/9780199693160.001.0001"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"2720","DOI":"10.1109\/TGRS.2008.923631","article-title":"An automatic bridge detection technique for multispectral images","volume":"46","author":"Chaudhuri","year":"2008","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"247","DOI":"10.1016\/j.isprsjprs.2010.11.001","article-title":"Support vector machines in remote sensing: A review","volume":"66","author":"Mountrakis","year":"2011","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"533","DOI":"10.1109\/LGRS.2009.2020825","article-title":"Multiresolution Remote Sensing Image Clustering","volume":"6","author":"Wemmert","year":"2009","journal-title":"IEEE Geosci. Remote. Sens. Lett."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Boschetti, M., Nutini, F., Manfron, G., Brivio, P.A., and Nelson, A. (2014). Comparative analysis of normalised difference spectral indices derived from MODIS for detecting surface water in flooded rice cropping systems. PLoS ONE, 9.","DOI":"10.1371\/journal.pone.0088741"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.neucom.2015.11.044","article-title":"Novel segmented stacked autoencoder for effective dimensionality reduction and feature extraction in hyperspectral imaging","volume":"185","author":"Zabalza","year":"2016","journal-title":"Neurocomputing"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Protopapadakis, E., Doulamis, A., Doulamis, N., and Maltezos, E. (2021). Stacked autoencoders driven by semi-supervised learning for building extraction from near infrared remote sensing imagery. Remote Sens., 13.","DOI":"10.3390\/rs13030371"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"166","DOI":"10.1016\/j.isprsjprs.2019.04.015","article-title":"Deep learning in remote sensing applications: A meta-analysis and review","volume":"152","author":"Ma","year":"2019","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Windrim, L., Ramakrishnan, R., Melkumyan, A., Murphy, R.J., and Chlingaryan, A. (2019). Unsupervised feature-learning for hyperspectral data with autoencoders. Remote Sens., 11.","DOI":"10.3390\/rs11070864"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Luppino, L.T., Hansen, M.A., Kampffmeyer, M., Bianchi, F.M., Moser, G., Jenssen, R., and Anfinsen, S.N. (2022). Code-aligned autoencoders for unsupervised change detection in multimodal remote sensing images. IEEE Trans. Neural Netw. Learn. Syst., Early Access.","DOI":"10.1109\/TNNLS.2022.3172183"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"105920","DOI":"10.1016\/j.knosys.2020.105920","article-title":"Remote sensing image captioning via Variational Autoencoder and Reinforcement Learning","volume":"203","author":"Shen","year":"2020","journal-title":"Knowl.-Based Syst."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"202","DOI":"10.1109\/JSTARS.2020.3042760","article-title":"Desertification detection using an improved variational autoencoder-based approach through ETM-landsat satellite data","volume":"14","author":"Zerrouki","year":"2020","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Naik, P., Dalponte, M., and Bruzzone, L. (2021, January 11\u201316). A disentangled variational autoencoder for prediction of above ground biomass from hyperspectral data. Proceedings of the 2021 IEEE International Geoscience and Remote Sensing Symposium IGARSS, Brussels, Belgium.","DOI":"10.1109\/IGARSS47720.2021.9554415"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"94","DOI":"10.1016\/j.inffus.2020.10.008","article-title":"Image retrieval from remote sensing big data: A survey","volume":"67","author":"Li","year":"2021","journal-title":"Inf. Fusion"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Tang, X., Zhang, X., Liu, F., and Jiao, L. (2018). Unsupervised deep feature learning for remote sensing image retrieval. Remote Sens., 10.","DOI":"10.3390\/rs10081243"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"1013","DOI":"10.1007\/s00371-019-01673-y","article-title":"TimeCluster: Dimension reduction applied to temporal data for visual analytics","volume":"35","author":"Ali","year":"2019","journal-title":"Vis. Comput."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"107507","DOI":"10.1016\/j.knosys.2021.107507","article-title":"Concurrent Time-Series Selections Using Deep Learning and Dimension Reduction","volume":"233","author":"Ali","year":"2021","journal-title":"Knowl.-Based Syst."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Alqahtani, A., Ali, M., Xie, X., and Jones, M.W. (2021). Deep Time-Series Clustering: A Review. Electronics, 10.","DOI":"10.3390\/electronics10233001"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Kruskal, J.B., and Wish, M. (1978). Multidimensional Scaling, Sage.","DOI":"10.4135\/9781412985130"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"439","DOI":"10.1002\/wics.1222","article-title":"Introduction to manifold learning","volume":"4","author":"Izenman","year":"2012","journal-title":"Wiley Interdiscip. Rev. Comput. Stat."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"2319","DOI":"10.1126\/science.290.5500.2319","article-title":"A Global Geometric Framework for Nonlinear Dimensionality Reduction","volume":"290","author":"Tenenbaum","year":"2000","journal-title":"Science"},{"key":"ref_25","first-page":"2859","article-title":"Linear dimensionality reduction: Survey, insights, and generalizations","volume":"16","author":"Cunningham","year":"2015","journal-title":"J. Mach. Learn. Res."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"100378","DOI":"10.1016\/j.cosrev.2021.100378","article-title":"Conceptual and empirical comparison of dimensionality reduction algorithms (PCA, KPCA, LDA, MDS, SVD, LLE, ISOMAP, LE, ICA, t-SNE)","volume":"40","author":"Anowar","year":"2021","journal-title":"Comput. Sci. Rev."},{"key":"ref_27","first-page":"119","article-title":"Think globally, fit locally: Unsupervised learning of low dimensional manifolds","volume":"4","author":"Saul","year":"2003","journal-title":"J. Mach. Learn. Res."},{"key":"ref_28","first-page":"9129","article-title":"Understanding how dimension reduction tools work: An empirical approach to deciphering t-SNE, UMAP, TriMAP, and PaCMAP for data visualization","volume":"22","author":"Wang","year":"2021","journal-title":"J. Mach. Learn. Res."},{"key":"ref_29","first-page":"11","article-title":"Visualizing data using t-SNE","volume":"9","author":"Hinton","year":"2008","journal-title":"J. Mach. Learn. Res."},{"key":"ref_30","unstructured":"Kobak, D., Linderman, G., Steinerberger, S., Kluger, Y., and Berens, P. (2019). Joint European Conference on Machine Learning and Knowledge Discovery in Databases, Springer."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Kobak, D., and Linderman, G.C. (2019). UMAP does not preserve global structure any better than t-SNE when using the same initialization. BioRxiv.","DOI":"10.1101\/2019.12.19.877522"},{"key":"ref_32","unstructured":"McInnes, L., Healy, J., and Melville, J. (2020). UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction. arXiv."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"156","DOI":"10.1038\/s41587-020-00809-z","article-title":"Initialization is critical for preserving global data structure in both t-SNE and UMAP","volume":"39","author":"Kobak","year":"2021","journal-title":"Nat. Biotechnol."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"113044","DOI":"10.1016\/j.rse.2022.113044","article-title":"Coastline detection in satellite imagery: A deep learning approach on new benchmark data","volume":"278","author":"Seale","year":"2022","journal-title":"Remote Sens. Environ."},{"key":"ref_35","first-page":"37","article-title":"Sen2Cor for sentinel-2","volume":"Volume 10427","author":"Pflug","year":"2017","journal-title":"Image and Signal Processing for Remote Sensing XXIII"},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"1425","DOI":"10.1080\/01431169608948714","article-title":"The use of the Normalized Difference Water Index (NDWI) in the delineation of open water features","volume":"17","author":"McFeeters","year":"1996","journal-title":"Int. J. Remote. Sens."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"2634","DOI":"10.1109\/TVCG.2013.153","article-title":"Empirical Guidance on Scatterplot and Dimension Reduction Technique Choices","volume":"19","author":"Sedlmair","year":"2013","journal-title":"IEEE Trans. Vis. Comput. Graph."},{"key":"ref_38","unstructured":"van der Maaten, L. (2013). Barnes-Hut-SNE. arXiv."},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"1739","DOI":"10.1109\/TVCG.2016.2570755","article-title":"Approximated and User Steerable tSNE for Progressive Visual Analytics","volume":"23","author":"Pezzotti","year":"2017","journal-title":"IEEE Trans. Vis. Comput. Graph."},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"507","DOI":"10.1109\/TBDATA.2019.2948924","article-title":"Exploiting Deep Features for Remote Sensing Image Retrieval: A Systematic Investigation","volume":"6","author":"Tong","year":"2020","journal-title":"IEEE Trans. Big Data"},{"key":"ref_41","unstructured":"Hoffer, E., and Ailon, N. (2018). Deep metric learning using Triplet network. arXiv."}],"container-title":["Algorithms"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1999-4893\/16\/10\/469\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T21:05:04Z","timestamp":1760130304000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1999-4893\/16\/10\/469"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,10,4]]},"references-count":41,"journal-issue":{"issue":"10","published-online":{"date-parts":[[2023,10]]}},"alternative-id":["a16100469"],"URL":"https:\/\/doi.org\/10.3390\/a16100469","relation":{},"ISSN":["1999-4893"],"issn-type":[{"value":"1999-4893","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,10,4]]}}}