{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,19]],"date-time":"2025-12-19T10:02:10Z","timestamp":1766138530525,"version":"build-2065373602"},"reference-count":45,"publisher":"MDPI AG","issue":"1","license":[{"start":{"date-parts":[[2023,12,25]],"date-time":"2023-12-25T00:00:00Z","timestamp":1703462400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"the National Key Research and Development Program of China","award":["2021YFB3901300"],"award-info":[{"award-number":["2021YFB3901300"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Change detection from heterogeneous satellite and aerial images plays a progressively important role in many fields, including disaster assessment, urban construction, and land use monitoring. Currently, researchers have mainly devoted their attention to change detection using homologous image pairs and achieved many remarkable results. It is sometimes necessary to use heterogeneous images for change detection in practical scenarios due to missing images, emergency situations, and cloud and fog occlusion. However, heterogeneous change detection still faces great challenges, especially using satellite and aerial images. The main challenges in satellite and aerial image change detection are related to the resolution gap and blurred edge. Previous studies used interpolation or shallow feature alignment before traditional homologous change detection methods, which ignored the high-level feature interaction and edge information. Therefore, we propose a new heterogeneous change detection model based on multimodal transformers combined with edge guidance. In order to alleviate the resolution gap between satellite and aerial images, we design an improved spatially aligned transformer (SP-T) with a sub-pixel module to align the satellite features to the same size of the aerial ones supervised by a token loss. Moreover, we introduce an edge detection branch to guide change features using the object edge with an auxiliary edge-change loss. Finally, we conduct considerable experiments to verify the effectiveness and superiority of our proposed model (EGMT-CD) on a new satellite\u2013aerial heterogeneous change dataset, named SACD. The experiments show that our method (EGMT-CD) outperforms many previously superior change detection methods and fully demonstrates its potential in heterogeneous change detection from satellite\u2013aerial images.<\/jats:p>","DOI":"10.3390\/rs16010086","type":"journal-article","created":{"date-parts":[[2023,12,25]],"date-time":"2023-12-25T23:00:12Z","timestamp":1703545212000},"page":"86","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":7,"title":["EGMT-CD: Edge-Guided Multimodal Transformers Change Detection from Satellite and Aerial Images"],"prefix":"10.3390","volume":"16","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-0400-2196","authenticated-orcid":false,"given":"Yunfan","family":"Xiang","sequence":"first","affiliation":[{"name":"Airborne Remote Sensing Center, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, China"},{"name":"University of Chinese Academy of Sciences, Beijing 100049, China"}]},{"given":"Xiangyu","family":"Tian","sequence":"additional","affiliation":[{"name":"Airborne Remote Sensing Center, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, China"},{"name":"University of Chinese Academy of Sciences, Beijing 100049, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6683-2107","authenticated-orcid":false,"given":"Yue","family":"Xu","sequence":"additional","affiliation":[{"name":"State Key Laboratory of Remote Sensing Science, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, China"}]},{"given":"Xiaokun","family":"Guan","sequence":"additional","affiliation":[{"name":"Airborne Remote Sensing Center, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, China"},{"name":"University of Chinese Academy of Sciences, Beijing 100049, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4293-6459","authenticated-orcid":false,"given":"Zhengchao","family":"Chen","sequence":"additional","affiliation":[{"name":"State Key Laboratory of Remote Sensing Science, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, China"}]}],"member":"1968","published-online":{"date-parts":[[2023,12,25]]},"reference":[{"key":"ref_1","unstructured":"J\u00e9r\u00f4me, T. (2022). Springer Handbook of Geographic Information, Springer."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"1913","DOI":"10.1109\/JSTARS.2012.2228469","article-title":"Seasonal Change of Land-Use\/Land-Cover (Lulc) Detection Using Modis Data in Rapid Urbanization Regions: A Case Study of the Pearl River Delta Region (China)","volume":"6","author":"Hu","year":"2013","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Jensen, J.R., and Im, J. (2007). Geo-Spatial Technologies in Urban Environments, Springer.","DOI":"10.1007\/978-3-540-69417-5"},{"key":"ref_4","unstructured":"Zhang, J.-F., Xie, L.-L., and Tao, X.-X. (2003., January 21\u201325). Change Detection of Earthquake-Damaged Buildings on Remote Sensing Image and Its Application in Seismic Disaster Assessment. Proceedings of the IGARSS 2003, 2003 IEEE International Geoscience and Remote Sensing Symposium, Proceedings (IEEE Cat. No. 03CH37477), Toulouse, France."},{"key":"ref_5","unstructured":"Bitelli, G., Camassi, R., Gusella, L., and Mognol, A. (2004, January 12\u201323). Image Change Detection on Urban Area: The Earthquake Case. Proceedings of the Xth ISPRS Congress, Istanbul, Turkey."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"1352","DOI":"10.1109\/LGRS.2018.2843385","article-title":"Log-based transformation feature learning for change detection in heterogeneous images","volume":"15","author":"Zhan","year":"2018","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Shao, R., Du, C., Chen, H., and Li, J. (2021). SUNet: Change detection for heterogeneous remote sensing images from satellite and UAV using a dual-channel fully convolution network. Remote Sens., 13.","DOI":"10.3390\/rs13183750"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"2365","DOI":"10.1080\/0143116031000139863","article-title":"Change detection techniques","volume":"25","author":"Lu","year":"2004","journal-title":"Int. J. Remote Sens."},{"key":"ref_9","first-page":"1183","article-title":"UAV for mapping\u2014Low altitude photogrammetric survey","volume":"37","author":"Zongjian","year":"2008","journal-title":"Int. Arch. Photogram. Remote Sens. Beijing China"},{"key":"ref_10","unstructured":"Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, L., and Polosukhin, I. (2017). Attention is all you need. arXiv."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Bazi, Y., Bashmal, L., Rahhal, M.M.A., Dayil, R.A., and Ajlan, N.A. (2021). Vision transformers for remote sensing image classification. Remote Sens., 13.","DOI":"10.3390\/rs13030516"},{"key":"ref_12","first-page":"1","article-title":"Super-resolution-based change detection network with stacked attention module for images with different resolutions","volume":"60","author":"Liu","year":"2021","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Papadomanolaki, M., Verma, S., Vakalopoulou, M., Gupta, S., and Karantzalos, K. (August, January 28). Detecting urban changes with recurrent neural networks from multitemporal Sentinel-2 data. Proceedings of the IGARSS 2019-2019 IEEE International Geoscience and Remote Sensing Symposium, Yokohama, Japan.","DOI":"10.1109\/IGARSS.2019.8900330"},{"key":"ref_14","unstructured":"Daudt, R.C., Le Saux, B., and Boulch, A. (2018, January 7\u201310). Fully convolutional Siamese networks for change detection. Proceedings of the 2018 25th IEEE International Conference on Image Processing (ICIP 2018), Athens, Greece."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Daudt, R.C., Le Saux, B., Boulch, A., and Gousseau, Y. (2018, January 22\u201327). Urban change detection for multispectral earth observation using convolutional neural networks. Proceedings of the IGARSS 2018-2018 IEEE International Geoscience and Remote Sensing Symposium, Valencia, Spain.","DOI":"10.1109\/IGARSS.2018.8518015"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Chen, H., and Shi, Z. (2020). A spatial-temporal attention-based method and a new dataset for remote sensing image change detection. Remote Sens., 12.","DOI":"10.3390\/rs12101662"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"1339","DOI":"10.1109\/JSTARS.2014.2355832","article-title":"Land cover change detection at subpixel resolution with a hopfield neural network","volume":"8","author":"Wang","year":"2015","journal-title":"IEEE J. Sel.Topics Appl. Earth Observ. Remote Sens."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"3822","DOI":"10.1109\/TGRS.2016.2528583","article-title":"A super resolution land-cover change detection method using remotely sensed images with different spatial resolutions","volume":"54","author":"Li","year":"2016","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Wu, K., Du, Q., Wang, Y., and Yang, Y. (2017). Supervised sub-pixel mapping for change detection from remotely sensed images with different resolutions. Remote Sens., 9.","DOI":"10.3390\/rs9030284"},{"key":"ref_20","first-page":"1","article-title":"Learning token-aligned representations with multimodel transformers for different-resolution change detection","volume":"60","author":"Liu","year":"2022","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_21","first-page":"2500518","article-title":"Dynamic Low-Rank and Sparse Priors Constrained Deep Autoencoders for Hyperspectral Anomaly Detection","volume":"73","author":"Lin","year":"2023","journal-title":"IEEE Trans. Instrum. Meas."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"946","DOI":"10.1109\/JSTARS.2022.3229834","article-title":"Hyperspectral anomaly detection via sparse representation and collaborative representation","volume":"16","author":"Lin","year":"2022","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_23","first-page":"5002016","article-title":"Deep Self-Representation Learning Framework for Hyperspectral Anomaly Detection","volume":"73","author":"Cheng","year":"2023","journal-title":"IEEE Trans. Instrum. Meas."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"262","DOI":"10.1080\/10095020.2022.2085633","article-title":"Deep learning for change detection in remote sensing: A review","volume":"26","author":"Bai","year":"2023","journal-title":"Geo-Spat. Inf. Sci."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"669","DOI":"10.1142\/S0218001493000339","article-title":"Signature verification using a \u201csiamese\u201d time delay neural network","volume":"7","author":"Bromley","year":"1993","journal-title":"Int. J. Pattern Recognit. Artif. Intell."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"MArabi, E.A., Karoui, M.S., and Djerriri, K. (2018, January 22\u201327). Optical remote sensing change detection through deep siamese network. Proceedings of the IGARSS 2018\u20132018 IEEE International Geoscience and Remote Sensing Symposium, Valencia, Spain.","DOI":"10.1109\/IGARSS.2018.8518178"},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"266","DOI":"10.1109\/LGRS.2018.2869608","article-title":"Triplet-based semantic relation learning for aerial remote sensing image change detection","volume":"16","author":"Zhang","year":"2018","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"108717","DOI":"10.1016\/j.patcog.2022.108717","article-title":"HFA-Net: High frequency attention siamese network for building change detection in VHR remote sensing images","volume":"129","author":"Zheng","year":"2022","journal-title":"Pattern Recogn."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"4816","DOI":"10.1109\/JSTARS.2021.3077545","article-title":"AGCDetNet: An Attention-Guided Network for Building Change Detection in High-Resolution Remote Sensing Images","volume":"14","author":"Song","year":"2021","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_30","unstructured":"Wei, Y., Zhao, Z., and Song, J. (2004, January 20\u201324). Urban Building Extraction from High-Resolution Satellite Panchromatic Image Using Clustering and Edge Detection. Proceedings of the IGARSS 2004. 2004 IEEE International Geoscience and Remote Sensing Symposium, Anchorage, AK, USA."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"203","DOI":"10.1016\/j.isprsjprs.2022.07.016","article-title":"EGDE-Net: A building change detection method for high-resolution remote sensing imagery based on edge guidance and differential enhancement","volume":"191","author":"Chen","year":"2022","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_32","first-page":"247","article-title":"CLNet: Cross-layer convolutional neural network for change detection in optical remote sensing imagery. ISPRS J. Photogram","volume":"175","author":"Zheng","year":"2021","journal-title":"Remote Sens."},{"key":"ref_33","first-page":"1","article-title":"BOMSC-Net: Boundary Optimization and Multi-Scale Context Awareness Based Building Extraction from High-Resolution Remote Sensing Imagery","volume":"60","author":"Zhou","year":"2022","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_34","first-page":"1","article-title":"Boundary Enhancement Semantic Segmentation for Building Extraction from Remote Sensed Image","volume":"60","author":"Jung","year":"2021","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_35","first-page":"1","article-title":"AERNet: An attention-guided edge refinement network and a dataset for remote sensing building change detection","volume":"61","author":"Zhang","year":"2023","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep residual learning for image recognition. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Shi, W., Caballero, J., Husz\u00e1r, F., Totz, J., Aitken, A.P., Bishop, R., Rueckert, D., and Wang, Z. (2016, January 27\u201330). Real-time single image and video super-resolution using an efficient sub-pixel convolutional neural network. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.207"},{"key":"ref_38","unstructured":"Hendrycks, D., and Gimpel, K. (2016). Bridging nonlinearities and stochastic regularizers with Gaussian error linear units. arXiv."},{"key":"ref_39","unstructured":"Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G., and Gelly, S. (2020). An image is worth 16 \u00d7 16 words: Transformers for image recognition at scale. arXiv."},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Lin, T.-Y., Goyal, P., Girshick, R., He, K., and Doll\u2019ar, P. (2017, January 22\u201329). Focal loss for dense object detection. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.324"},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"574","DOI":"10.1109\/TGRS.2018.2858817","article-title":"Fully Convolutional Networks for Multisource Building Extraction from an Open Aerial and Satellite Imagery Data Set","volume":"57","author":"Ji","year":"2018","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Ronneberger, O., Fischer, P., and Brox, T. (2015, January 5\u20139). U-net: Convolutional networks for biomedical image segmentation. Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention, Munich, Germany.","DOI":"10.1007\/978-3-319-24574-4_28"},{"key":"ref_43","first-page":"1","article-title":"SNUNet-CD: A Densely Connected Siamese Network for Change Detection of VHR Images","volume":"19","author":"Fang","year":"2021","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_44","unstructured":"Zhou, Z., Rahman Siddiquee, M.M., Tajbakhsh, N., and Liang, J. (2018). Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support, Springer."},{"key":"ref_45","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1109\/TGRS.2020.3034752","article-title":"Remote sensing image change detection with transformers","volume":"60","author":"Chen","year":"2021","journal-title":"IEEE Trans. Geosci. Remote Sens."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/16\/1\/86\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T21:41:37Z","timestamp":1760132497000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/16\/1\/86"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,12,25]]},"references-count":45,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2024,1]]}},"alternative-id":["rs16010086"],"URL":"https:\/\/doi.org\/10.3390\/rs16010086","relation":{},"ISSN":["2072-4292"],"issn-type":[{"type":"electronic","value":"2072-4292"}],"subject":[],"published":{"date-parts":[[2023,12,25]]}}}