{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,19]],"date-time":"2025-12-19T10:01:10Z","timestamp":1766138470571,"version":"build-2065373602"},"reference-count":52,"publisher":"MDPI AG","issue":"20","license":[{"start":{"date-parts":[[2023,10,12]],"date-time":"2023-10-12T00:00:00Z","timestamp":1697068800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Work Enhancement Based on Visual Scene Perception","award":["GJSD22007"],"award-info":[{"award-number":["GJSD22007"]}]},{"name":"National Key Laboratory Foundation of Human Factors Engineering","award":["GJSD22007"],"award-info":[{"award-number":["GJSD22007"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Few-shot semantic segmentation (FSS) is committed to segmenting new classes with only a few labels. Generally, FSS assumes that base classes and novel classes belong to the same domain, which limits FSS\u2019s application in a wide range of areas. In particular, since annotation is time-consuming, it is not cost-effective to process remote sensing images using FSS. To address this issue, we designed a feature transformation network (FTNet) for learning to few-shot segment remote sensing images from irrelevant data (FSS-RSI). The main idea is to train networks on irrelevant, already labeled data but inference on remote sensing images. In other words, the training and testing data neither belong to the same domain nor category. The FTNet contains two main modules: a feature transformation module (FTM) and a hierarchical transformer module (HTM). Among them, the FTM transforms features into a domain-agnostic high-level anchor, and the HTM hierarchically enhances matching between support and query features. Moreover, to promote the development of FSS-RSI, we established a new benchmark, which other researchers may use. Our experiments demonstrate that our model outperforms the cutting-edge few-shot semantic segmentation method by 25.39% and 21.31% in the one-shot and five-shot settings, respectively.<\/jats:p>","DOI":"10.3390\/rs15204937","type":"journal-article","created":{"date-parts":[[2023,10,12]],"date-time":"2023-10-12T12:46:13Z","timestamp":1697114773000},"page":"4937","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":5,"title":["Learn to Few-Shot Segment Remote Sensing Images from Irrelevant Data"],"prefix":"10.3390","volume":"15","author":[{"given":"Qingwei","family":"Sun","sequence":"first","affiliation":[{"name":"Department of Aerospace Science and Technology, Space Engineering University, Beijing 101416, China"},{"name":"China Astronaut Research and Training Center, Beijing 100094, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0005-2276-7856","authenticated-orcid":false,"given":"Jiangang","family":"Chao","sequence":"additional","affiliation":[{"name":"China Astronaut Research and Training Center, Beijing 100094, China"},{"name":"National Key Laboratory of Human Factors Engineering, China Astronaut Research and Training Center, Beijing 100094, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Wanhong","family":"Lin","sequence":"additional","affiliation":[{"name":"China Astronaut Research and Training Center, Beijing 100094, China"},{"name":"National Key Laboratory of Human Factors Engineering, China Astronaut Research and Training Center, Beijing 100094, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zhenying","family":"Xu","sequence":"additional","affiliation":[{"name":"China Astronaut Research and Training Center, Beijing 100094, China"},{"name":"National Key Laboratory of Human Factors Engineering, China Astronaut Research and Training Center, Beijing 100094, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Wei","family":"Chen","sequence":"additional","affiliation":[{"name":"China Astronaut Research and Training Center, Beijing 100094, China"},{"name":"National Key Laboratory of Human Factors Engineering, China Astronaut Research and Training Center, Beijing 100094, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ning","family":"He","sequence":"additional","affiliation":[{"name":"China Astronaut Research and Training Center, Beijing 100094, China"},{"name":"National Key Laboratory of Human Factors Engineering, China Astronaut Research and Training Center, Beijing 100094, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2023,10,12]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Wang, Z., Wang, B., Zhang, C., Liu, Y., and Guo, J. (2023). Defending against Poisoning Attacks in Aerial Image Semantic Segmentation with Robust Invariant Feature Enhancement. Remote Sens., 15.","DOI":"10.3390\/rs15123157"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"He, Y., Jia, K., and Wei, Z. (2023). Improvements in Forest Segmentation Accuracy Using a New Deep Learning Architecture and Data Augmentation Technique. Remote Sens., 15.","DOI":"10.3390\/rs15092412"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"640","DOI":"10.1109\/TPAMI.2016.2572683","article-title":"Fully Convolutional Networks for Semantic Segmentation","volume":"39","author":"Shelhamer","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Noh, H., Hong, S., and Han, B. (2015, January 7\u201313). Learning Deconvolution Network for Semantic Segmentation. Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV), Piscataway, NJ, USA.","DOI":"10.1109\/ICCV.2015.178"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"2481","DOI":"10.1109\/TPAMI.2016.2644615","article-title":"SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation","volume":"39","author":"Badrinarayanan","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Lin, G., Milan, A., Shen, C., and Reid, I. (2017, January 21\u201326). RefineNet: Multi-path Refinement Networks for High-Resolution Semantic Segmentation. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.549"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Ronneberger, O., Fischer, P., and Brox, T. (2015). U-Net: Convolutional Networks for Biomedical Image Segmentation. arXiv.","DOI":"10.1007\/978-3-319-24574-4_28"},{"key":"ref_8","unstructured":"Yu, F., and Koltun, V. (2016). Multi-Scale Context Aggregation by Dilated Convolutions. arXiv."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"834","DOI":"10.1109\/TPAMI.2017.2699184","article-title":"DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs","volume":"40","author":"Chen","year":"2018","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep Residual Learning for Image Recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_11","unstructured":"Simonyan, K., and Zisserman, A. (2014). Very Deep Convolutional Networks for Large-Scale Image Recognition. arXiv."},{"key":"ref_12","unstructured":"Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G., and Gelly, S. (2020). An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. arXiv."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Zheng, S., Lu, J., Zhao, H., Zhu, X., Luo, Z., Wang, Y., Fu, Y., Feng, J., Xiang, T., and Torr, P.H.S. (2021, January 20\u201325). Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers. Proceedings of the 2021 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.00681"},{"key":"ref_14","unstructured":"Xie, E., Wang, W., Yu, Z., Anandkumar, A., Alvarez, J.M., and Luo, P. (2016). SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers. arXiv."},{"key":"ref_15","unstructured":"Devlin, J., Chang, M.-W., Lee, K., and Toutanova, K. (2016). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. arXiv."},{"key":"ref_16","unstructured":"Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, L., and Polosukhin, I. (2017). Attention Is All You Need. arXiv."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Shaban, A., Bansal, S., Liu, Z., Essa, I., and Boots, B. (2017). One-Shot Learning for Semantic Segmentation. arXiv.","DOI":"10.5244\/C.31.167"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"1050","DOI":"10.1109\/TPAMI.2020.3013717","article-title":"Prior Guided Feature Enrichment Network for Few-Shot Segmentation","volume":"44","author":"Tian","year":"2022","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Lang, C., Cheng, G., Tu, B., and Han, J. (2022, January 18\u201324). Learning What Not to Segment: A New Perspective on Few-Shot Segmentation. Proceedings of the 2022 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, USA.","DOI":"10.1109\/CVPR52688.2022.00789"},{"key":"ref_20","unstructured":"Vinyals, O., Blundell, C., Lillicrap, T., Kavukcuoglu, K., and Wierstra, D. (2016). Matching Networks for One Shot Learning. arXiv."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Wang, K., Liew, J.H., Zou, Y., Zhou, D., and Feng, J. (November, January 27). PANet: Few-Shot Image Semantic Segmentation with Prototype Alignment. Proceedings of the 2019 IEEE\/CVF International Conference on Computer Vision (ICCV), Seoul, Republic of Korea.","DOI":"10.1109\/ICCV.2019.00929"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Zhang, C., Lin, G., Liu, F., Yao, R., and Shen, C. (2019, January 15\u201320). CANet: Class-Agnostic Segmentation Networks With Iterative Refinement and Attentive Few-Shot Learning. Proceedings of the 2019 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00536"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Yang, B., Liu, C., Li, B., Jiao, J., and Ye, Q. (2020). Prototype Mixture Models for Few-shot Semantic Segmentation. arXiv.","DOI":"10.1007\/978-3-030-58598-3_45"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Min, J., Kang, D., and Cho, M. (2021, January 10\u201317). Hypercorrelation Squeeze for Few-Shot Segmenation. Proceedings of the 2021 IEEE\/CVF International Conference on Computer Vision (ICCV), Montreal, QC, Canada.","DOI":"10.1109\/ICCV48922.2021.00686"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Siam, M., and Oreshkin, B. (2019). Adaptive Masked Weight Imprinting for Few-Shot Segmentation. arXiv.","DOI":"10.1109\/ICCV.2019.00535"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Peng, B., Tian, Z., Wu, X., Wang, C., Liu, S., Su, J., and Jia, J. (2023). Hierarchical Dense Correlation Distillation for Few-Shot Segmentation. arXiv.","DOI":"10.1109\/CVPR52729.2023.02264"},{"key":"ref_27","unstructured":"Zhang, G., Kang, G., Yang, Y., and Wei, Y. (2021). Few-Shot Segmentation via Cycle-Consistent Transformer. arXiv."},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Zhang, J., Liu, Y., Wu, P., Shi, Z., and Pan, B. (2022). Mining Cross-Domain Structure Affinity for Refined Building Segmentation in Weakly Supervised Constraints. Remote Sens., 14.","DOI":"10.3390\/rs14051227"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Gao, H., Zhao, Y., Guo, P., Sun, Z., Chen, X., and Tang, Y. (2022). Cycle and Self-Supervised Consistency Training for Adapting Semantic Segmentation of Aerial Images. Remote Sens., 14.","DOI":"10.3390\/rs14071527"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"4045","DOI":"10.1109\/JSTARS.2022.3175191","article-title":"SPANet: Successive Pooling Attention Network for Semantic Segmentation of Remote Sensing Images","volume":"15","author":"Sun","year":"2022","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Chen, Y., Wei, C., Wang, D., Ji, C., and Li, B. (2022). Semi-Supervised Contrastive Learning for Few-Shot Segmentation of Remote Sensing Images. Remote Sens., 14.","DOI":"10.3390\/rs14174254"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Deng, R., Shen, C., Liu, S., Wang, H., and Liu, X. (2018, January 8\u201314). Learning to Predict Crisp Boundaries. Proceedings of the European Conference on Computer Vision, Munich, Germany.","DOI":"10.1007\/978-3-030-01231-1_35"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Demir, I., Koperski, K., Lindenbaum, D., Pang, G., Huang, J., Basu, S., Hughes, F., Tuia, D., and Raskar, R. (2018, January 18\u201322). DeepGlobe 2018: A Challenge to Parse the Earth through Satellite Images. Proceedings of the 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Salt Lake City, UT, USA.","DOI":"10.1109\/CVPRW.2018.00031"},{"key":"ref_34","unstructured":"ISPRS (2023, June 20). Potsdam. Available online: https:\/\/www.isprs.org\/education\/benchmarks\/UrbanSemLab\/2d-sem-label-potsdam.aspx."},{"key":"ref_35","unstructured":"(2023, June 20). ISPRS Vaihingen. Available online: https:\/\/www.isprs.org\/education\/benchmarks\/UrbanSemLab\/2d-sem-labelvaihingen.aspx."},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"6054","DOI":"10.1109\/TGRS.2017.2719738","article-title":"Learning Aerial Image Segmentation from Online Maps","volume":"55","author":"Kaiser","year":"2017","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Deng, J., Dong, W., Socher, R., Li, L.J., Kai, L., and Li, F.-F. (2009, January 20\u201325). ImageNet: A large-scale hierarchical image database. Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA.","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Zhao, H., Shi, J., Qi, X., Wang, X., and Jia, J. (2017, January 21\u201326). Pyramid Scene Parsing Network. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Piscataway, NJ, USA.","DOI":"10.1109\/CVPR.2017.660"},{"key":"ref_39","unstructured":"Ioffe, S., and Szegedy, C. (2015, January 6\u201311). Batch normalization: Accelerating deep network training by reducing internal covariate shift. Proceedings of the International Conference on International Conference on Machine Learning (ICML), Lille, France."},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Yu, C., Wang, J., Peng, C., Gao, C., Yu, G., and Sang, N. (2018, January 8\u201314). BiSeNet: Bilateral Segmentation Network for Real-Time Semantic Segmentation. Proceedings of the European Conference on Computer Vision, Munich, Germany.","DOI":"10.1007\/978-3-030-01261-8_20"},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"3051","DOI":"10.1007\/s11263-021-01515-2","article-title":"BiSeNet V2: Bilateral Network with Guided Aggregation for Real-Time Semantic Segmentation","volume":"129","author":"Yu","year":"2021","journal-title":"Int. J. Comput. Vis."},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Fan, M., Lai, S., Huang, J., Wei, X., Chai, Z., Luo, J., and Wei, X. (2021, January 20\u201325). Rethinking BiSeNet For Real-time Semantic Segmentation. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.00959"},{"key":"ref_43","unstructured":"Seo, J., Park, Y.-H., Yoon, S.W., and Moon, J. (2022). Task-Adaptive Feature Transformer with Semantic Enrichment for Few-Shot Segmentation. arXiv."},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"12","DOI":"10.1109\/MPRV.2008.80","article-title":"OpenStreetMap: User-Generated Street Maps","volume":"7","author":"Haklay","year":"2008","journal-title":"IEEE Pervasive Comput."},{"key":"ref_45","doi-asserted-by":"crossref","first-page":"682","DOI":"10.1068\/b35097","article-title":"How good is volunteered geographical information? A comparative study of OpenStreetMap and Ordnance Survey datasets","volume":"37","author":"Haklay","year":"2010","journal-title":"Environ. Plan. B-Plan. Des."},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"435","DOI":"10.1111\/j.1467-9671.2010.01203.x","article-title":"Quality Assessment of the French OpenStreetMap Dataset","volume":"14","author":"Girres","year":"2010","journal-title":"Trans. GIS"},{"key":"ref_47","unstructured":"(2023, September 20). Google Maps. Available online: https:\/\/support.google.com\/mapcontentpartners\/answer\/144284?hl=en."},{"key":"ref_48","unstructured":"Heusel, M., Ramsauer, H., Unterthiner, T., Nessler, B., and Hochreiter, S. (2017). GANs trained by a two time-scale update rule converge to a local nash equilibrium. arXiv."},{"key":"ref_49","doi-asserted-by":"crossref","first-page":"303","DOI":"10.1007\/s11263-009-0275-4","article-title":"The pascal visual object classes (voc) challenge","volume":"88","author":"Everingham","year":"2010","journal-title":"Int. J. Comput. Vis."},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Hariharan, B., Arbel\u00e1ez, P., Bourdev, L., Maji, S., and Malik, J. (2011, January 6\u201313). Semantic contours from inverse detectors. Proceedings of the 2011 International Conference on Computer Vision, Barcelona, Spain.","DOI":"10.1109\/ICCV.2011.6126343"},{"key":"ref_51","doi-asserted-by":"crossref","first-page":"5613610","DOI":"10.1109\/TGRS.2023.3286183","article-title":"Progressive Parsing and Commonality Distillation for Few-Shot Remote Sensing Segmentation","volume":"61","author":"Lang","year":"2023","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_52","doi-asserted-by":"crossref","unstructured":"Li, R., Li, J., Gou, S., Lu, H., Mao, S., and Guo, Z. (2023). Multi-Scale Similarity Guidance Few-Shot Network for Ship Segmentation in SAR Images. Remote Sens., 15.","DOI":"10.20944\/preprints202305.2088.v1"}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/15\/20\/4937\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T21:05:50Z","timestamp":1760130350000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/15\/20\/4937"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,10,12]]},"references-count":52,"journal-issue":{"issue":"20","published-online":{"date-parts":[[2023,10]]}},"alternative-id":["rs15204937"],"URL":"https:\/\/doi.org\/10.3390\/rs15204937","relation":{},"ISSN":["2072-4292"],"issn-type":[{"type":"electronic","value":"2072-4292"}],"subject":[],"published":{"date-parts":[[2023,10,12]]}}}