{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,12]],"date-time":"2026-05-12T13:00:09Z","timestamp":1778590809893,"version":"3.51.4"},"reference-count":71,"publisher":"MDPI AG","issue":"14","license":[{"start":{"date-parts":[[2024,7,10]],"date-time":"2024-07-10T00:00:00Z","timestamp":1720569600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Research on Intelligent Monitoring and Early Warning Technology for rice pests and diseases of the Sichuan Provincial Department of Science and Technology","award":["2022NSFSC0172"],"award-info":[{"award-number":["2022NSFSC0172"]}]},{"name":"Research on Intelligent Monitoring and Early Warning Technology for rice pests and diseases of the Sichuan Provincial Department of Science and Technology","award":["202210626054"],"award-info":[{"award-number":["202210626054"]}]},{"name":"Sichuan Agricultural University Innovation Training Programme Project Funding","award":["2022NSFSC0172"],"award-info":[{"award-number":["2022NSFSC0172"]}]},{"name":"Sichuan Agricultural University Innovation Training Programme Project Funding","award":["202210626054"],"award-info":[{"award-number":["202210626054"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Utilizing deep learning for semantic segmentation of cropland from remote sensing imagery has become a crucial technique in land surveys. Cropland is highly heterogeneous and fragmented, and existing methods often suffer from inaccurate boundary segmentation. This paper introduces a UNet-like boundary-aware compensation model (BAFormer). Cropland boundaries typically exhibit rapid transformations in pixel values and texture features, often appearing as high-frequency features in remote sensing images. To enhance the recognition of these high-frequency features as represented by cropland boundaries, the proposed BAFormer integrates a Feature Adaptive Mixer (FAM) and develops a Depthwise Large Kernel Multi-Layer Perceptron model (DWLK-MLP) to enrich the global and local cropland boundaries features separately. Specifically, FAM enhances the boundary-aware method by adaptively acquiring high-frequency features through convolution and self-attention advantages, while DWLK-MLP further supplements boundary position information using a large receptive field. The efficacy of BAFormer has been evaluated on datasets including Vaihingen, Potsdam, LoveDA, and Mapcup. It demonstrates high performance, achieving mIoU scores of 84.5%, 87.3%, 53.5%, and 83.1% on these datasets, respectively. Notably, BAFormer-T (lightweight model) surpasses other lightweight models on the Vaihingen dataset with scores of 91.3% F1 and 84.1% mIoU.<\/jats:p>","DOI":"10.3390\/rs16142526","type":"journal-article","created":{"date-parts":[[2024,7,10]],"date-time":"2024-07-10T11:14:41Z","timestamp":1720610081000},"page":"2526","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":11,"title":["BAFormer: A Novel Boundary-Aware Compensation UNet-like Transformer for High-Resolution Cropland Extraction"],"prefix":"10.3390","volume":"16","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-2402-7657","authenticated-orcid":false,"given":"Zhiyong","family":"Li","sequence":"first","affiliation":[{"name":"College of Information Engineering, Sichuan Agricultural University, Ya\u2019an 625014, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0007-3967-6365","authenticated-orcid":false,"given":"Youming","family":"Wang","sequence":"additional","affiliation":[{"name":"College of Information Engineering, Sichuan Agricultural University, Ya\u2019an 625014, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Fa","family":"Tian","sequence":"additional","affiliation":[{"name":"College of Information Engineering, Sichuan Agricultural University, Ya\u2019an 625014, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Junbo","family":"Zhang","sequence":"additional","affiliation":[{"name":"College of Information Engineering, Sichuan Agricultural University, Ya\u2019an 625014, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yijie","family":"Chen","sequence":"additional","affiliation":[{"name":"College of Information Engineering, Sichuan Agricultural University, Ya\u2019an 625014, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Kunhong","family":"Li","sequence":"additional","affiliation":[{"name":"College of Information Engineering, Sichuan Agricultural University, Ya\u2019an 625014, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2024,7,10]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"22","DOI":"10.1016\/j.isprsjprs.2015.10.004","article-title":"Remote Sensing platforms and sensors: A survey","volume":"115","author":"Toth","year":"2016","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"528","DOI":"10.1016\/j.eng.2019.10.015","article-title":"Remote sensing and precision agriculture technologies for crop disease detection and management with a practical application example","volume":"6","author":"Yang","year":"2020","journal-title":"Engineering"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"111912","DOI":"10.1016\/j.rse.2020.111912","article-title":"A generalized approach based on convolutional neural networks for large area cropland mapping at very high resolution","volume":"247","author":"Zhang","year":"2020","journal-title":"Remote Sens. Environ."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"107683","DOI":"10.1016\/j.compag.2023.107683","article-title":"BSNet: Boundary-semantic-fusion Network for Farmland Parcel Mapping in High-Resolution Satellite Images","volume":"206","author":"Shunying","year":"2023","journal-title":"Comput. Electron. Agric."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"24","DOI":"10.1016\/j.isprsjprs.2023.04.019","article-title":"Using a Semantic Edge-Aware Multi-Task Neural Network to Delineate Agricultural Parcels from Remote Sensing Images","volume":"200","author":"Li","year":"2023","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1109\/TGRS.2022.3230043","article-title":"A Deformable Attention Network for High-Resolution Remote Sensing Images Semantic Segmentation","volume":"60","author":"Zuo","year":"2022","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_7","first-page":"1","article-title":"ASNet: Adaptive Semantic Network Based on Transformer\u2013CNN for Salient Object Detection in Optical Remote Sensing Images","volume":"62","author":"Yan","year":"2024","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1109\/TGRS.2022.3230846","article-title":"Swin Transformer Embedding UNet for Remote Sensing Image Semantic Segmentation","volume":"60","author":"He","year":"2022","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_9","first-page":"1","article-title":"Transformer and CNN Hybrid Deep Neural Network for Semantic Segmentation of Very-High-Resolution Remote Sensing Imagery","volume":"60","author":"Zhang","year":"2022","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Xia, L., Luo, J., Sun, Y., and Yang, H. (2018, January 6\u20139). Deep Extraction of Cropland Parcels from Very High-Resolution Remotely Sensed Imagery. Proceedings of the 2018 7th International Conference on Agro-Geoinformatics (Agro-Geoinformatics), Hangzhou, China.","DOI":"10.1109\/Agro-Geoinformatics.2018.8476002"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"3760","DOI":"10.1109\/JSTARS.2023.3253779","article-title":"Edge Detection With Direction Guided Postprocessing for Farmland Parcel Extraction","volume":"16","author":"Xie","year":"2023","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Awad, B., and Erer, I. (2023). FAUNet: Frequency Attention U-Net for Parcel Boundary Delineation in Satellite Images. Remote Sens., 15.","DOI":"10.3390\/rs15215123"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"2349","DOI":"10.1109\/TGRS.2017.2778343","article-title":"Two-Stream Deep Architecture for Hyperspectral Image Classification","volume":"56","author":"Hao","year":"2018","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Doersch, C., Gupta, A., and Efros, A.A. (2015, January 7\u201313). Unsupervised Visual Representation Learning by Context Prediction. Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.167"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Dong, X., Xie, J., Tu, K., Qi, K., Yang, C., and Zhai, H. (2023, January 25\u201328). DSFNet: Dual-Stream-Fusion Network for Farmland Parcel Mapping in High-Resolution Satellite Images. Proceedings of the 2023 11th International Conference on Agro-Geoinformatics (Agro-Geoinformatics), Wuhan, China.","DOI":"10.1109\/Agro-Geoinformatics59224.2023.10233401"},{"key":"ref_16","first-page":"1","article-title":"A Novel Knowledge-Driven Automated Solution for High-Resolution Cropland Extraction by Cross-Scale Sample Transfer","volume":"61","author":"Zhang","year":"2023","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1109\/TGRS.2023.3344670","article-title":"Frequency-based Optimal Style Mix for Domain Generalization in Semantic Segmentation of Remote Sensing Images","volume":"62","author":"Iizuka","year":"2023","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_18","first-page":"1","article-title":"Learn More and Learn Usefully: Truncation Compensation Network for Semantic Segmentation of High-Resolution Remote Sensing Images","volume":"62","author":"Zhang","year":"2024","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Xu, L., Ming, D., Zhou, W., Bao, H., Chen, Y., and Ling, X. (2019). Farmland Extraction from High Spatial Resolution Remote Sensing Images Based on Stratified Scale Pre-Estimation. Remote Sens., 11.","DOI":"10.3390\/rs11020108"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Li, Z., Chen, S., Meng, X., Zhu, R., Lu, J., Cao, L., and Lu, P. (2022). Full Convolution Neural Network Combined with Contextual Feature Representation for Cropland Extraction from High-Resolution Remote Sensing Images. Remote Sens., 14.","DOI":"10.3390\/rs14092157"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Sheng, J., Sun, Y., Huang, H., Xu, W., Pei, H., Zhang, W., and Wu, X. (2022). HBRNet: Boundary Enhancement Segmentation Network for Cropland Extraction in High-Resolution Remote Sensing Images. Agriculture, 12.","DOI":"10.3390\/agriculture12081284"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Luo, W., Zhang, C., Li, Y., and Yan, Y. (2023). MLGNet: Multi-Task Learning Network with Attention-Guided Mechanism for Segmenting Agricultural Fields. Remote Sens., 15.","DOI":"10.3390\/rs15163934"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"3060","DOI":"10.1109\/JSTARS.2023.3255541","article-title":"Statistical Texture Learning Method for Monitoring Abandoned Suburban Cropland Based on High-Resolution Remote Sensing and Deep Learning","volume":"16","author":"Shen","year":"2023","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"108902","DOI":"10.1016\/j.compag.2024.108902","article-title":"TSANet: A Deep Learning Framework for the Delineation of Agricultural Fields Utilizing Satellite Image Time Series","volume":"220","author":"Yan","year":"2024","journal-title":"Comput. Electron. Agric."},{"key":"ref_25","first-page":"1","article-title":"RBP-MTL: Agricultural Parcel Vectorization via Region-Boundary-Parcel Decoupled Multitask Learning","volume":"62","author":"Pan","year":"2024","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_26","first-page":"2397","article-title":"Active Boundary Loss for Semantic Segmentation","volume":"36","author":"Wang","year":"2022","journal-title":"Proc. AAAI Conf. Artif. Intell."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"101851","DOI":"10.1016\/j.media.2020.101851","article-title":"Boundary Loss for Highly Unbalanced Segmentation","volume":"67","author":"Kervadec","year":"2021","journal-title":"Med. Image Anal."},{"key":"ref_28","unstructured":"Yu, F., and Koltun, V. (2015). Multi-scale context aggregation by dilated convolutions. arXiv."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"149","DOI":"10.1016\/j.knosys.2019.04.025","article-title":"DUNet: A deformable network for retinal vessel segmentation","volume":"178","author":"Jin","year":"2019","journal-title":"Knowl. Based Syst."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"2254","DOI":"10.1109\/TMI.2024.3363190","article-title":"ScribFormer: Transformer Makes CNN Work Better for Scribble-based Medical Image Segmentation","volume":"43","author":"Li","year":"2024","journal-title":"IEEE Trans. Med. Imaging"},{"key":"ref_31","unstructured":"Pham, T.H., Li, X., and Nguyen, K.D. (2023). Seunet-trans: A simple yet effective unet-transformer model for medical image segmentation. arXiv."},{"key":"ref_32","first-page":"1","article-title":"MDE-UNet: A Multitask Deformable UNet Combined Enhancement Network for Farmland Boundary Segmentation","volume":"20","author":"Wang","year":"2023","journal-title":"IEEE Geosci. Remote Sensing Lett."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1109\/TGRS.2024.3419794","article-title":"Multiscale Edge-Guided Network for Accurate Cultivated Land Parcel Boundary Extraction From Remote Sensing Images","volume":"62","author":"Xu","year":"2024","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"3717","DOI":"10.1109\/TIP.2023.3290519","article-title":"Conditional Boundary Loss for Semantic Segmentation","volume":"32","author":"Wu","year":"2023","journal-title":"IEEE Trans. Image Process."},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"196","DOI":"10.1016\/j.isprsjprs.2022.06.008","article-title":"UNetFormer: A UNet-like transformer for efficient semantic segmentation of Remote Sensing urban scene imagery","volume":"190","author":"Wang","year":"2022","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"84","DOI":"10.1145\/3065386","article-title":"ImageNet Classification with Deep Convolutional Neural Networks","volume":"60","author":"Krizhevsky","year":"2017","journal-title":"Commun. ACM"},{"key":"ref_37","first-page":"5998","article-title":"Attention is all you need","volume":"30","author":"Vaswani","year":"2017","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_38","unstructured":"Li, J., Xia, X., Li, W., Li, H., Wang, X., Xiao, X., Wang, R., Zheng, M., and Pan, X. (2022). Next-vit: Next generation vision transformer for efficient deployment in realistic industrial scenarios. arXiv."},{"key":"ref_39","unstructured":"Tan, W., Geng, Y., and Xie, X. (2023). FMViT: A multiple-frequency mixing Vision Transformer. arXiv."},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognit, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Liu, Z., Lin, Y., Cao, Y., Hu, H., Wei, Y., Zhang, Z., Lin, S., and Guo, B. (2021, January 11\u201317). Swin transformer: Hierarchical vision transformer using shifted windows. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Montreal, BC, Canada.","DOI":"10.1109\/ICCV48922.2021.00986"},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Li, X., Wang, W., Hu, X., and Yang, J. (2019, January 15\u201320). Selective kernel networks. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognit, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00060"},{"key":"ref_43","unstructured":"Zhang, X., Gong, Y., Li, Z., Gao, X., Jin, D., Li, J., and Liu, H. (2023). SkipcrossNets: Adaptive Skip-cross Fusion for Road Detection. arXiv."},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Guo, S., Liu, L., Gan, Z., Wang, Y., Zhang, W., Wang, C., Jiang, G., Zhang, W., Yi, R., and Ma, L. (2022, January 18\u201324). Isdnet: Integrating shallow and deep networks for efficient ultra-high resolution segmentation. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognit, New Orleans, LA, USA.","DOI":"10.1109\/CVPR52688.2022.00432"},{"key":"ref_45","doi-asserted-by":"crossref","first-page":"733","DOI":"10.1007\/s41095-023-0364-2","article-title":"Visual attention network","volume":"9","author":"Guo","year":"2023","journal-title":"Comput. Vis. Media"},{"key":"ref_46","first-page":"3965","article-title":"Coatnet: Marrying convolution and attention for all data sizes","volume":"34","author":"Dai","year":"2021","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Guo, J., Han, K., Wu, H., Tang, Y., Chen, X., Wang, Y., and Xu, C. (2022, January 18\u201324). Cmt: Convolutional neural networks meet vision transformers. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognit, New Orleans, LA, USA.","DOI":"10.1109\/CVPR52688.2022.01186"},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Shi, D. (2024, January 17\u201321). TransNeXt: Robust Foveal Visual Perception for Vision Transformers. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognit, Seattle, DC, USA.","DOI":"10.1109\/CVPR52733.2024.01683"},{"key":"ref_49","unstructured":"He, W., Li, J., Cao, W., Zhang, L., and Zhang, H. (2023). Building extraction from Remote Sensing images via an uncertainty-aware network. arXiv."},{"key":"ref_50","unstructured":"Wang, J., Zheng, Z., Ma, A., Lu, X., and Zhong, Y. (2021). Loveda: A remote sensing land-cover dataset for domain adaptation semantic segmentation. arXiv."},{"key":"ref_51","doi-asserted-by":"crossref","unstructured":"Sun, Y., Wang, S., Chen, C., and Xiang, T.Z. (2022). Boundary-guided camouflaged object detection. arXiv.","DOI":"10.24963\/ijcai.2022\/186"},{"key":"ref_52","doi-asserted-by":"crossref","unstructured":"Long, J., Shelhamer, E., and Darrell, T. (2015, January 7\u201312). Fully convolutional networks for semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognit, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298965"},{"key":"ref_53","doi-asserted-by":"crossref","first-page":"834","DOI":"10.1109\/TPAMI.2017.2699184","article-title":"Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs","volume":"40","author":"Chen","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_54","first-page":"1","article-title":"Multistage attention ResU-Net for semantic segmentation of fine-resolution Remote Sensing images","volume":"19","author":"Li","year":"2021","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_55","doi-asserted-by":"crossref","first-page":"84","DOI":"10.1016\/j.isprsjprs.2021.09.005","article-title":"ABCNet: Attentive bilateral contextual network for efficient semantic segmentation of Fine-Resolution remotely sensed imagery","volume":"181","author":"Li","year":"2021","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_56","doi-asserted-by":"crossref","unstructured":"Wang, L., Li, R., Wang, D., Duan, C., Wang, T., and Meng, X. (2021). Transformer meets convolution: A bilateral awareness network for semantic segmentation of very fine resolution urban scene images. Remote Sens., 13.","DOI":"10.3390\/rs13163065"},{"key":"ref_57","first-page":"1","article-title":"Multiattention network for semantic segmentation of fine-resolution Remote Sensing images","volume":"60","author":"Li","year":"2021","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_58","first-page":"1","article-title":"A novel transformer based semantic segmentation scheme for fine-resolution Remote Sensing images","volume":"19","author":"Wang","year":"2022","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_59","doi-asserted-by":"crossref","unstructured":"Cheng, B., Misra, I., Schwing, A.G., Kirillov, A., and Girdhar, R. (2022, January 18\u201324). Masked-attention mask transformer for universal image segmentation. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognit, New Orleans, LA, USA.","DOI":"10.1109\/CVPR52688.2022.00135"},{"key":"ref_60","doi-asserted-by":"crossref","unstructured":"Panboonyuen, T., Jitkajornwanich, K., Lawawirojwong, S., Srestasathiern, P., and Vateekul, P. (2021). Transformer-based decoder designs for semantic segmentation on remotely sensed images. Remote Sens., 13.","DOI":"10.3390\/rs13245100"},{"key":"ref_61","doi-asserted-by":"crossref","unstructured":"Kirillov, A., Girshick, R., He, K., and Doll\u00e1r, P. (2019, January 15\u201320). Panoptic feature pyramid networks. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognit, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00656"},{"key":"ref_62","doi-asserted-by":"crossref","unstructured":"Zheng, Z., Zhong, Y., Wang, J., and Ma, A. (2020, January 13\u201319). Foreground-aware relation network for geospatial object segmentation in high spatial resolution Remote Sensing imagery. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognit, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00415"},{"key":"ref_63","doi-asserted-by":"crossref","unstructured":"Chen, K., Zou, Z., and Shi, Z. (2021). Building extraction from Remote Sensing images with sparse token transformers. Remote Sens., 13.","DOI":"10.3390\/rs13214441"},{"key":"ref_64","first-page":"17864","article-title":"Per-pixel classification is not all you need for semantic segmentation","volume":"34","author":"Cheng","year":"2021","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_65","doi-asserted-by":"crossref","first-page":"1131","DOI":"10.1080\/01431161.2022.2030071","article-title":"A2-FPN for semantic segmentation of fine-resolution remotely sensed images","volume":"43","author":"Li","year":"2022","journal-title":"Int. J. Remote Sens."},{"key":"ref_66","doi-asserted-by":"crossref","unstructured":"Yu, C., Wang, J., Peng, C., Gao, C., Yu, G., and Sang, N. (2018, January 8\u201314). Bisenet: Bilateral segmentation network for real-time semantic segmentation. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01261-8_20"},{"key":"ref_67","doi-asserted-by":"crossref","unstructured":"Strudel, R., Garcia, R., Laptev, I., and Schmid, C. (2021, January 11\u201317). Segmenter: Transformer for semantic segmentation. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Montreal, BC, Canada.","DOI":"10.1109\/ICCV48922.2021.00717"},{"key":"ref_68","doi-asserted-by":"crossref","unstructured":"Srinivas, A., Lin, T.Y., Parmar, N., Shlens, J., Abbeel, P., and Vaswani, A. (2021, January 20\u201325). Bottleneck transformers for visual recognition. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognit, Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.01625"},{"key":"ref_69","doi-asserted-by":"crossref","first-page":"263","DOI":"10.1109\/LRA.2020.3039744","article-title":"Real-time semantic segmentation with fast attention","volume":"6","author":"Hu","year":"2020","journal-title":"IEEE Rob. Autom. Lett."},{"key":"ref_70","doi-asserted-by":"crossref","unstructured":"Zhuang, J., Yang, J., Gu, L., and Dvornek, N. (2019, January 27\u201328). ShelfNet for Fast Semantic Segmentation. Proceedings of the 2019 IEEE\/CVF International Conference on Computer Vision Workshop (ICCVW), Seoul, Republic of Korea.","DOI":"10.1109\/ICCVW.2019.00113"},{"key":"ref_71","doi-asserted-by":"crossref","first-page":"107611","DOI":"10.1016\/j.patcog.2020.107611","article-title":"Efficient semantic segmentation with pyramidal fusion","volume":"110","year":"2021","journal-title":"Pattern Recognit."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/16\/14\/2526\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T15:12:36Z","timestamp":1760109156000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/16\/14\/2526"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,7,10]]},"references-count":71,"journal-issue":{"issue":"14","published-online":{"date-parts":[[2024,7]]}},"alternative-id":["rs16142526"],"URL":"https:\/\/doi.org\/10.3390\/rs16142526","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,7,10]]}}}