{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,9]],"date-time":"2026-06-09T06:54:48Z","timestamp":1780988088429,"version":"3.54.1"},"reference-count":50,"publisher":"MDPI AG","issue":"24","license":[{"start":{"date-parts":[[2021,12,10]],"date-time":"2021-12-10T00:00:00Z","timestamp":1639094400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["41971352"],"award-info":[{"award-number":["41971352"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100012166","name":"National Key Research and Development Program of China","doi-asserted-by":"publisher","award":["2018YFB0505003"],"award-info":[{"award-number":["2018YFB0505003"]}],"id":[{"id":"10.13039\/501100012166","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Assigning geospatial objects with specific categories at the pixel level is a fundamental task in remote sensing image analysis. Along with the rapid development of sensor technologies, remotely sensed images can be captured at multiple spatial resolutions (MSR) with information content manifested at different scales. Extracting information from these MSR images represents huge opportunities for enhanced feature representation and characterisation. However, MSR images suffer from two critical issues: (1) increased scale variation of geo-objects and (2) loss of detailed information at coarse spatial resolutions. To bridge these gaps, in this paper, we propose a novel scale-aware neural network (SaNet) for the semantic segmentation of MSR remotely sensed imagery. SaNet deploys a densely connected feature network (DCFFM) module to capture high-quality multi-scale context, such that the scale variation is handled properly and the quality of segmentation is increased for both large and small objects. A spatial feature recalibration (SFRM) module was further incorporated into the network to learn intact semantic content with enhanced spatial relationships, where the negative effects of information loss are removed. The combination of DCFFM and SFRM allows SaNet to learn scale-aware feature representation, which outperforms the existing multi-scale feature representation. Extensive experiments on three semantic segmentation datasets demonstrated the effectiveness of the proposed SaNet in cross-resolution segmentation.<\/jats:p>","DOI":"10.3390\/rs13245015","type":"journal-article","created":{"date-parts":[[2021,12,10]],"date-time":"2021-12-10T02:07:18Z","timestamp":1639102038000},"page":"5015","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":32,"title":["Scale-Aware Neural Network for Semantic Segmentation of Multi-Resolution Remote Sensing Images"],"prefix":"10.3390","volume":"13","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-8096-6531","authenticated-orcid":false,"given":"Libo","family":"Wang","sequence":"first","affiliation":[{"name":"School of Remote Sensing and Information Engineering, Wuhan University, Wuhan 430079, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5100-3584","authenticated-orcid":false,"given":"Ce","family":"Zhang","sequence":"additional","affiliation":[{"name":"Lancaster Environment Centre, Lancaster University, Lancaster LA1 4YQ, UK"},{"name":"UK Centre for Ecology & Hydrology, Library Avenue, Lancaster LA1 4AP, UK"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7858-3160","authenticated-orcid":false,"given":"Rui","family":"Li","sequence":"additional","affiliation":[{"name":"School of Remote Sensing and Information Engineering, Wuhan University, Wuhan 430079, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Chenxi","family":"Duan","sequence":"additional","affiliation":[{"name":"Faculty of Geo-Information Science and Earth Observation (ITC), University of Twente, 57522 Enschede, The Netherlands"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Xiaoliang","family":"Meng","sequence":"additional","affiliation":[{"name":"School of Remote Sensing and Information Engineering, Wuhan University, Wuhan 430079, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5489-6880","authenticated-orcid":false,"given":"Peter M.","family":"Atkinson","sequence":"additional","affiliation":[{"name":"Lancaster Environment Centre, Lancaster University, Lancaster LA1 4YQ, UK"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2021,12,10]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"111593","DOI":"10.1016\/j.rse.2019.111593","article-title":"Scale Sequence Joint Deep Learning (SS-JDL) for land use and land cover classification","volume":"237","author":"Zhang","year":"2020","journal-title":"Remote Sens. Environ."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"2320","DOI":"10.1016\/j.rse.2011.04.032","article-title":"Mapping urbanization dynamics at regional and global scales using multi-temporal DMSP\/OLS nighttime light data","volume":"115","author":"Zhang","year":"2011","journal-title":"Remote Sens. Environ."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"1777","DOI":"10.3390\/rs3081777","article-title":"Segment-Based Land Cover Mapping of a Suburban Area\u2014Comparison of High-Resolution Remotely Sensed Datasets Using Classification Trees and Test Field Points","volume":"3","author":"Matikainen","year":"2011","journal-title":"Remote Sens."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Yang, K., Xia, G.-S., Liu, Z., Du, B., Yang, W., Pelillo, M., and Zhang, L. (2021). Asymmetric Siamese Networks for Semantic Change Detection in Aerial Images. IEEE Trans. Geosci. Remote Sens., 1\u201318.","DOI":"10.1109\/TGRS.2021.3113912"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Wang, L., Li, R., Wang, D., Duan, C., Wang, T., and Meng, X. (2021). Transformer Meets Convolution: A Bilateral Awareness Network for Semantic Segmentation of Very Fine Resolution Urban Scene Images. Remote Sens., 13.","DOI":"10.3390\/rs13163065"},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"84","DOI":"10.1016\/j.isprsjprs.2021.09.005","article-title":"ABCNet: Attentive bilateral contextual network for efficient semantic segmentation of Fine-Resolution remotely sensed imagery","volume":"181","author":"Li","year":"2021","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Li, R., Zheng, S., Zhang, C., Duan, C., Su, J., Wang, L., and Atkinson, P.M. (2021). Multiattention network for semantic segmentation of fine-resolution remote sensing images. IEEE Trans. Geosci. Remote Sens., 1\u201313.","DOI":"10.1109\/TGRS.2021.3093977"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.isprsjprs.2020.04.019","article-title":"HyNet: Hyper-scale object detection network framework for multiple spatial resolution remote sensing imagery","volume":"166","author":"Zheng","year":"2020","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"607","DOI":"10.1111\/0033-0124.00250","article-title":"Spatial Scale Problems and Geostatistical Solutions: A Review","volume":"52","author":"Atkinson","year":"2000","journal-title":"Prof. Geogr."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"102897","DOI":"10.1016\/j.earscirev.2019.102897","article-title":"Principles and methods of scaling geospatial Earth science data","volume":"197","author":"Ge","year":"2019","journal-title":"Earth-Sci. Rev."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"436","DOI":"10.1038\/nature14539","article-title":"Deep learning","volume":"521","author":"Yan","year":"2015","journal-title":"Nature"},{"key":"ref_12","unstructured":"Baatz, M., and Sch\u00e4pe, A. (2000). Multiresolution Segmentation\u2014An Optimization Approach for High Quality Multi-Scale Image Segmentation. Angewandte Geographische Informationsverarbeitung XII. Beitr\u00e4ge zum AGIT-Symposium Salzburg, Wichmann Verlag."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"13","DOI":"10.1016\/j.isprsjprs.2019.08.014","article-title":"Optimizing multiscale segmentation with local spectral heterogeneity measure for high resolution remote sensing images","volume":"157","author":"Shen","year":"2019","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"319","DOI":"10.1016\/j.isprsjprs.2018.12.003","article-title":"Scale-variable region-merging for high resolution remote sensing image segmentation","volume":"147","author":"Su","year":"2019","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"67","DOI":"10.1016\/j.isprsjprs.2014.07.002","article-title":"Comparing supervised and unsupervised multiresolution segmentation approaches for extracting buildings from very high resolution imagery","volume":"96","author":"Belgiu","year":"2014","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"108","DOI":"10.1016\/j.isprsjprs.2013.11.006","article-title":"Optimizing multi-resolution segmentation scale using empirical methods: Exploring the sensitivity of the supervised discrepancy measure Euclidean distance 2 (ED2)","volume":"87","author":"Witharana","year":"2014","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1023\/B:VISI.0000029664.99615.94","article-title":"Distinctive Image Features from Scale-Invariant Keypoints","volume":"60","author":"Lowe","year":"2004","journal-title":"Int. J. Comput. Vis."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"252","DOI":"10.1016\/j.isprsjprs.2018.04.013","article-title":"A scale-invariant change detection method for land use\/cover change research","volume":"141","author":"Xing","year":"2018","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"3036","DOI":"10.1109\/TIP.2018.2808767","article-title":"Effective Sequential Classifier Training for SVM-Based Multitemporal Remote Sensing Image Classification","volume":"27","author":"Guo","year":"2018","journal-title":"IEEE Trans. Image Process."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"8","DOI":"10.1109\/MGRS.2017.2762307","article-title":"Deep learning in remote sensing: A comprehensive review and list of resources","volume":"5","author":"Zhu","year":"2017","journal-title":"IEEE Geosci. Remote Sens. Mag."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"217","DOI":"10.1080\/01431160412331269698","article-title":"Random forest classifier for remote sensing classification","volume":"26","author":"Pal","year":"2005","journal-title":"Int. J. Remote Sens."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"3978","DOI":"10.1109\/TGRS.2007.907109","article-title":"A Multiple Conditional Random Fields Ensemble Model for Urban Area Detection in Remote Sensing Optical Images","volume":"45","author":"Zhong","year":"2007","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Wang, L., Li, R., Duan, C., Zhang, C., Meng, X., and Fang, S. (2021). A Novel Transformer based Semantic Segmentation Scheme for Fine-Resolution Remote Sensing Images. arXiv.","DOI":"10.1109\/LGRS.2022.3143368"},{"key":"ref_24","unstructured":"Wang, L., Fang, S., Zhang, C., Li, R., and Duan, C. (2021). Efficient Hybrid Transformer: Learning Global-local Context for Urban Scene Segmentation. arXiv."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Long, J., Shelhamer, E., and Darrell, T. (2015, January 7\u201312). Fully convolutional networks for semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298965"},{"key":"ref_26","unstructured":"Sherrah, J. (2016). Fully convolutional networks for dense semantic labelling of high-resolution aerial imagery. arXiv."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"173","DOI":"10.1016\/j.rse.2018.11.014","article-title":"Joint Deep Learning for land cover and land use classification","volume":"221","author":"Zhang","year":"2019","journal-title":"Remote Sens. Environ."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"155","DOI":"10.1016\/j.isprsjprs.2016.01.004","article-title":"Learning multiscale and deep representations for classifying remotely sensed imagery","volume":"113","author":"Zhao","year":"2016","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Chen, L.-C., Yang, Y., Wang, J., Xu, W., and Yuille, A.L. (2016, January 27\u201330). Attention to scale: Scale-aware semantic image segmentation. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.396"},{"key":"ref_30","unstructured":"Yu, F., and Koltun, V. (2015). Multi-scale context aggregation by dilated convolutions. arXiv."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"78","DOI":"10.1016\/j.isprsjprs.2017.12.007","article-title":"Semantic labeling in very high resolution images via a self-cascaded convolutional neural network","volume":"145","author":"Liu","year":"2018","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"20","DOI":"10.1016\/j.isprsjprs.2017.11.011","article-title":"Beyond RGB: Very high resolution urban remote sensing with multimodal deep networks","volume":"140","author":"Audebert","year":"2018","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"297","DOI":"10.1016\/j.neucom.2018.11.051","article-title":"Problems of encoder-decoder frameworks for high-resolution remote sensing image segmentation: Structural stereotype and insufficient learning","volume":"330","author":"Sun","year":"2019","journal-title":"Neurocomputing"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Ronneberger, O., Fischer, P., and Brox, T. (2015, January 5\u20139). U-net: Convolutional networks for biomedical image segmentation. Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention, Munich, Germany.","DOI":"10.1007\/978-3-319-24574-4_28"},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.isprsjprs.2019.07.007","article-title":"TreeUNet: Adaptive Tree convolutional neural networks for subdecimeter aerial image segmentation","volume":"156","author":"Yue","year":"2019","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"94","DOI":"10.1016\/j.isprsjprs.2020.01.013","article-title":"Resunet-a: A deep learning framework for semantic segmentation of remotely sensed data","volume":"162","author":"Diakogiannis","year":"2020","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Lin, T.-Y., Doll\u00e1r, P., Girshick, R., He, K., Hariharan, B., and Belongie, S. (2017, January 21\u201326). Feature pyramid networks for object detection. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.106"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Seferbekov, S., Iglovikov, V., Buslaev, A., and Shvets, A. (2018, January 18\u201322). Feature pyramid network for multi-class land segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPRW.2018.00051"},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Zhao, H., Shi, J., Qi, X., Wang, X., and Jia, J. (2017, January 21\u201326). Pyramid scene parsing network. Proceedings of the IEEE conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.660"},{"key":"ref_40","unstructured":"Chen, L.-C., Papandreou, G., Schroff, F., and Adam, H. (2017). Rethinking atrous convolution for semantic image segmentation. arXiv."},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Chen, L.-C., Zhu, Y., Papandreou, G., Schroff, F., and Adam, H. (2018, January 8\u201314). Encoder-decoder with atrous separable convolution for semantic image segmentation. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01234-2_49"},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"834","DOI":"10.1109\/TPAMI.2017.2699184","article-title":"DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs","volume":"40","author":"Chen","year":"2018","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"6309","DOI":"10.1109\/TGRS.2020.2976658","article-title":"Dense dilated convolutions\u2019 merging network for land cover classification","volume":"58","author":"Liu","year":"2020","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"15","DOI":"10.1016\/j.isprsjprs.2020.09.019","article-title":"Parsing very high resolution urban scene images by learning deep ConvNets with edge-aware loss","volume":"170","author":"Zheng","year":"2020","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"Huang, Z., Wang, X., Wei, Y., Huang, L., Shi, H., Liu, W., and Huang, T.S. (2020). CCNet: Criss-Cross Attention for Semantic Segmentation. IEEE Trans. Pattern Anal. Mach. Intell.","DOI":"10.1109\/ICCV.2019.00069"},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"7557","DOI":"10.1109\/TGRS.2020.2979552","article-title":"Relation Matters: Relational Context-Aware Fully Convolutional Network for Semantic Segmentation of High-Resolution Aerial Images","volume":"58","author":"Mou","year":"2020","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Li, R., Duan, C., Zheng, S., Zhang, C., and Atkinson, P.M. (2021). MACU-Net for semantic segmentation of fine-resolution remotely sensed images. IEEE Geosci. Remote Sens. Lett., 1\u20135.","DOI":"10.1109\/LGRS.2021.3052886"},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Li, R., Zheng, S., Duan, C., Su, J., and Zhang, C. (2021). Multistage Attention ResU-Net for Semantic Segmentation of Fine-Resolution Remote Sensing Images. IEEE Geosci. Remote Sens. Lett., 1\u20135.","DOI":"10.1109\/LGRS.2021.3063381"},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Boguszewski, A., Batorski, D., Ziemba-Jankowska, N., Zambrzycka, A., and Dziedzic, T. (2020). LandCover. ai: Dataset for Automatic Mapping of Buildings, Woodlands and Water from Aerial Imagery. arXiv.","DOI":"10.1109\/CVPRW53098.2021.00121"}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/24\/5015\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T07:44:37Z","timestamp":1760168677000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/24\/5015"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,12,10]]},"references-count":50,"journal-issue":{"issue":"24","published-online":{"date-parts":[[2021,12]]}},"alternative-id":["rs13245015"],"URL":"https:\/\/doi.org\/10.3390\/rs13245015","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,12,10]]}}}