{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,20]],"date-time":"2025-11-20T18:53:19Z","timestamp":1763664799744,"version":"build-2065373602"},"reference-count":46,"publisher":"MDPI AG","issue":"7","license":[{"start":{"date-parts":[[2022,3,22]],"date-time":"2022-03-22T00:00:00Z","timestamp":1647907200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Semantic segmentation is a critical problem for many remote sensing (RS) image applications. Benefiting from large-scale pixel-level labeled data and the continuous evolution of deep neural network architectures, the performance of semantic segmentation approaches has been constantly improved. However, deploying a well-trained model on unseen and diverse testing environments remains a major challenge: a large gap between data distributions in train and test domains results in severe performance loss, while manual dense labeling is costly and not scalable. To this end, we proposed an unsupervised domain adaptation framework for RS image semantic segmentation that is both practical and effective. The framework is supported by the consistency principle, including the cycle consistency in the input space and self-supervised consistency in the training stage. Specifically, we introduce cycle-consistent generative adversarial networks to reduce the discrepancy between source and target distributions by translating one into the other. The translated source data then drive a pipeline of supervised semantic segmentation model training. We enforce consistency of model predictions across target image transformations in order to provide self-supervision for the unlabeled target data. Experiments and extensive ablation studies demonstrate the effectiveness of the proposed approach on two challenging benchmarks, on which we achieve up to 9.95% and 7.53% improvements in accuracy over the state-of-the-art methods, respectively.<\/jats:p>","DOI":"10.3390\/rs14071527","type":"journal-article","created":{"date-parts":[[2022,3,22]],"date-time":"2022-03-22T23:30:23Z","timestamp":1647991823000},"page":"1527","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":14,"title":["Cycle and Self-Supervised Consistency Training for Adapting Semantic Segmentation of Aerial Images"],"prefix":"10.3390","volume":"14","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-1633-6652","authenticated-orcid":false,"given":"Han","family":"Gao","sequence":"first","affiliation":[{"name":"Institute of Remote Sensing and Geographic Information System, Peking University, Beijing 100871, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1728-3281","authenticated-orcid":false,"given":"Yang","family":"Zhao","sequence":"additional","affiliation":[{"name":"Institute of Remote Sensing and Geographic Information System, Peking University, Beijing 100871, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3692-0453","authenticated-orcid":false,"given":"Peng","family":"Guo","sequence":"additional","affiliation":[{"name":"Institute of Remote Sensing and Geographic Information System, Peking University, Beijing 100871, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zihao","family":"Sun","sequence":"additional","affiliation":[{"name":"Institute of Remote Sensing and Geographic Information System, Peking University, Beijing 100871, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Xiuwan","family":"Chen","sequence":"additional","affiliation":[{"name":"Institute of Remote Sensing and Geographic Information System, Peking University, Beijing 100871, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2257-8749","authenticated-orcid":false,"given":"Yunwei","family":"Tang","sequence":"additional","affiliation":[{"name":"International Research Center of Big Data for Sustainable Development Goals, Beijing 100094, China"},{"name":"Key Laboratory of Digital Earth Science, Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2022,3,22]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"20","DOI":"10.1016\/j.isprsjprs.2017.11.011","article-title":"Beyond RGB: Very high resolution urban remote sensing with multimodal deep networks","volume":"140","author":"Audebert","year":"2018","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_2","first-page":"155","article-title":"Building segmentation from airborne VHR images using Mask R-CNN","volume":"42","author":"Zhou","year":"2019","journal-title":"ISPRS Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1109\/LGRS.2017.2778181","article-title":"Semantic Segmentation of Aerial Images With Shuffling Convolutional Neural Networks","volume":"15","author":"Chen","year":"2018","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1016\/j.isprsjprs.2019.02.019","article-title":"Automatic building extraction from high-resolution aerial images and LiDAR data using gated residual refinement network","volume":"151","author":"Huang","year":"2019","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"60","DOI":"10.1016\/j.isprsjprs.2018.04.014","article-title":"Algorithms for semantic segmentation of multispectral remote sensing imagery using deep learning","volume":"145","author":"Kemker","year":"2018","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"22","DOI":"10.1109\/MGRS.2016.2540798","article-title":"Advances in Machine Learning for Remote Sensing and Geosciences","volume":"19","author":"Zhang","year":"2016","journal-title":"IEEE Geosci. Remote Sens. Mag."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"8","DOI":"10.1109\/MGRS.2017.2762307","article-title":"Deep Learning in Remote Sensing: A Comprehensive Review and List of Resources","volume":"5","author":"Zhu","year":"2017","journal-title":"IEEE Geosci. Remote Sens. Mag."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Audebert, N., Le Saux, B., and Lef\u00e8vre, S. (2017). Segment-before-Detect: Vehicle Detection and Classification through Semantic Segmentation of Aerial Images. Remote Sens., 9.","DOI":"10.3390\/rs9040368"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"48","DOI":"10.1016\/j.isprsjprs.2017.08.011","article-title":"Contextually guided very-high-resolution imagery classification with semantic segments","volume":"132","author":"Zhao","year":"2017","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Garcia-Garcia, A., Orts-Escolano, S., Oprea, S., Villena-Martinez, V., and Garcia-Rodriguez, J. (2017). A Review on Deep Learning Techniques Applied to Semantic Segmentation. arXiv.","DOI":"10.1016\/j.asoc.2018.05.018"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"41","DOI":"10.1109\/MGRS.2016.2548504","article-title":"Domain Adaptation for the Classification of Remote Sensing Data: An Overview of Recent Advances","volume":"4","author":"Tuia","year":"2016","journal-title":"IEEE Geosci. Remote Sens. Mag."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Wang, M., and Deng, W. (2018). Deep Visual Domain Adaptation: A Survey. arXiv.","DOI":"10.1016\/j.neucom.2018.05.083"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Toldo, M., Maracani, A., Michieli, U., and Zanuttigh, P. (2020). Unsupervised Domain Adaptation in Semantic Segmentation: A Review. Technologies, 8.","DOI":"10.3390\/technologies8020035"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Benjdira, B., Bazi, Y., Koubaa, A., and Ouni, K. (2019). Unsupervised Domain Adaptation Using Generative Adversarial Networks for Semantic Segmentation of Aerial Images. Remote Sens., 11.","DOI":"10.3390\/rs11111369"},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"20","DOI":"10.1016\/j.isprsjprs.2021.02.009","article-title":"Learning deep semantic segmentation network under multiple weakly-supervised constraints for cross-domain remote sensing image semantic segmentation","volume":"175","author":"Li","year":"2021","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"7178","DOI":"10.1109\/TGRS.2020.2980417","article-title":"ColorMapGAN: Unsupervised Domain Adaptation for Semantic Segmentation Using Color Mapping Generative Adversarial Networks","volume":"58","author":"Tasar","year":"2020","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"1067","DOI":"10.1109\/TGRS.2020.3006161","article-title":"DAugNet: Unsupervised, Multisource, Multitarget, and Life-Long Domain Adaptation for Semantic Segmentation of Satellite Images","volume":"59","author":"Tasar","year":"2021","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"3816","DOI":"10.1109\/TGRS.2020.3020804","article-title":"Generative Adversarial Network-Based Full-Space Domain Adaptation for Land Cover Classification From Multiple-Source Remote Sensing Images","volume":"59","author":"Ji","year":"2021","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"82","DOI":"10.1016\/j.isprsjprs.2021.08.004","article-title":"Appearance based deep domain adaptation for the classification of aerial images","volume":"180","author":"Wittich","year":"2021","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"747","DOI":"10.1109\/JSTARS.2020.3031741","article-title":"Cross-Sensor Adversarial Domain Adaptation of Landsat-8 and Proba-V Images for Cloud Detection","volume":"14","author":"Laparra","year":"2021","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"1635","DOI":"10.5194\/isprs-archives-XLIII-B3-2020-1635-2020","article-title":"Domain adaptation with cyclegan for change detection in the Amazon forest","volume":"43","author":"Soto","year":"2020","journal-title":"Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Kou, R., Fang, B., Chen, G., and Wang, L. (2020). Progressive Domain Adaptation for Change Detection Using Season-Varying Remote Sensing Images. Remote Sens., 12.","DOI":"10.3390\/rs12223815"},{"key":"ref_23","unstructured":"Zhu, J.Y., Park, T., Isola, P., and Efros, A.A. (2020). Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks. arXiv."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Yi, Z., Zhang, H., Tan, P., and Gong, M. (2018). DualGAN: Unsupervised Dual Learning for Image-to-Image Translation. arXiv.","DOI":"10.1109\/ICCV.2017.310"},{"key":"ref_25","unstructured":"Mirza, M., and Osindero, S. (2014). Conditional Generative Adversarial Nets. arXiv."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"746","DOI":"10.1109\/LGRS.2020.2982783","article-title":"Unsupervised Domain Adaptation of High-Resolution Aerial Images via Correlation Alignment and Self Training","volume":"18","author":"Zhang","year":"2021","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Zhang, B., Chen, T., and Wang, B. (2021). Curriculum-Style Local-to-Global Adaptation for Cross-Domain Remote Sensing Image Segmentation. IEEE Trans. Geosci. Remote Sens.","DOI":"10.1109\/TGRS.2021.3117851"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Shen, W., Wang, Q., Jiang, H., Li, S., and Yin, J. (2021, January 11\u201316). Unsupervised domain adaptation for semantic segmentation via self-supervision. Proceedings of the 2021 IEEE International Geoscience and Remote Sensing Symposium IGARSS, Brussels, Belgium.","DOI":"10.1109\/IGARSS47720.2021.9553451"},{"key":"ref_29","unstructured":"Chen, Y., Ouyang, X., Zhu, K., and Agam, G. (2020). Domain Adaptation on Semantic Segmentation for Aerial Images. arXiv."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1109\/TGRS.2020.3035561","article-title":"Bispace Domain Adaptation Network for Remotely Sensed Semantic Segmentation","volume":"60","author":"Liu","year":"2020","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_31","unstructured":"Kim, T., Cha, M., Kim, H., Lee, J.K., and Kim, J. (2017). Learning to Discover Cross-Domain Relations with Generative Adversarial Networks. arXiv."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Zhao, Y., Gao, H., Guo, P., and Sun, Z. (2022). ResiDualGAN: Resize-Residual DualGAN for Cross-Domain Remote Sensing Images Semantic Segmentation. arXiv.","DOI":"10.3390\/rs15051428"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2015). Deep Residual Learning for Image Recognition. arXiv.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Melas-Kyriazi, L., and Manrai, A.K. (2021). PixMatch: Unsupervised Domain Adaptation via Pixelwise Consistency Training. arXiv.","DOI":"10.1109\/CVPR46437.2021.01225"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Araslanov, N., and Roth, S. (2021). Self-supervised Augmentation Consistency for Adapting Semantic Segmentation. arXiv.","DOI":"10.1109\/CVPR46437.2021.01513"},{"key":"ref_36","unstructured":"Chen, T., Kornblith, S., Norouzi, M., and Hinton, G. (2020, January 12\u201318). A Simple Framework for Contrastive Learning of Visual Representations. Proceedings of the 37th International Conference on Machine Learning, PMLR, Virtual Event."},{"key":"ref_37","unstructured":"Berthelot, D., Carlini, N., Goodfellow, I., Papernot, N., Oliver, A., and Raffel, C. (2019). MixMatch: A Holistic Approach to Semi-Supervised Learning. arXiv."},{"key":"ref_38","unstructured":"Berthelot, D., Carlini, N., Cubuk, E.D., Kurakin, A., Sohn, K., Zhang, H., and Raffel, C. (2020). ReMixMatch: Semi-Supervised Learning with Distribution Alignment and Augmentation Anchoring. arXiv."},{"key":"ref_39","first-page":"596","article-title":"FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence","volume":"33","author":"Sohn","year":"2020","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_40","first-page":"6256","article-title":"Unsupervised data augmentation for consistency training","volume":"33","author":"Xie","year":"2020","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Yun, S., Han, D., Oh, S.J., Chun, S., Choe, J., and Yoo, Y. (2019, January 27\u201328). Cutmix: Regularization strategy to train strong classifiers with localizable features. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Seoul, Korea.","DOI":"10.1109\/ICCV.2019.00612"},{"key":"ref_42","unstructured":"Gerke, M. (2015). Use of the Stair Vision Library within the ISPRS 2D Semantic Labeling Benchmark (Vaihingen), University of Twente."},{"key":"ref_43","unstructured":"Paszke, A., Gross, S., Massa, F., Lerer, A., Bradbury, J., Chanan, G., Killeen, T., Lin, Z., Gimelshein, N., and Antiga, L. (2019). PyTorch: An Imperative Style, High-Performance Deep Learning Library. Adv. Neural Inf. Process. Syst., 32, Available online: https:\/\/proceedings.neurips.cc\/paper\/2019\/hash\/bdbca288fee7f92f2bfa9f7012727740-Abstract.html."},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Chen, L.C., Zhu, Y., Papandreou, G., Schroff, F., and Adam, H. (2018). Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation. arXiv.","DOI":"10.1007\/978-3-030-01234-2_49"},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Tsai, Y.H., Hung, W.C., Schulter, S., Sohn, K., Yang, M.H., and Chandraker, M. (2018, January 18\u201322). Learning to adapt structured output space for semantic segmentation. Proceedings of the 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00780"},{"key":"ref_46","first-page":"2579","article-title":"Visualizing data using t-SNE","volume":"9","author":"Hinton","year":"2008","journal-title":"J. Mach. Learn. Res."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/14\/7\/1527\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T22:40:48Z","timestamp":1760136048000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/14\/7\/1527"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,3,22]]},"references-count":46,"journal-issue":{"issue":"7","published-online":{"date-parts":[[2022,4]]}},"alternative-id":["rs14071527"],"URL":"https:\/\/doi.org\/10.3390\/rs14071527","relation":{},"ISSN":["2072-4292"],"issn-type":[{"type":"electronic","value":"2072-4292"}],"subject":[],"published":{"date-parts":[[2022,3,22]]}}}