{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,3]],"date-time":"2026-02-03T17:37:09Z","timestamp":1770140229668,"version":"3.49.0"},"reference-count":49,"publisher":"MDPI AG","issue":"18","license":[{"start":{"date-parts":[[2023,9,11]],"date-time":"2023-09-11T00:00:00Z","timestamp":1694390400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["32260388"],"award-info":[{"award-number":["32260388"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["2023CB008-22"],"award-info":[{"award-number":["2023CB008-22"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Xinjiang Production and Construction Corps Key Field Science and Technology Tackling Program Project","award":["32260388"],"award-info":[{"award-number":["32260388"]}]},{"name":"Xinjiang Production and Construction Corps Key Field Science and Technology Tackling Program Project","award":["2023CB008-22"],"award-info":[{"award-number":["2023CB008-22"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Accurately extracting buildings is essential for urbanization rate statistics, urban planning, resource allocation, etc. The high-resolution remote sensing images contain rich building information, which provides an important data source for building extraction. However, the extreme abundance of building types with large differences in size, as well as the extreme complexity of the background environment, result in the accurate extraction of spatial details of multi-scale buildings, which remains a difficult problem worth studying. To this end, this study selects the representative Xinjiang Tumxuk urban area as the study area. A building extraction network (SCA-Net) with feature highlighting, multi-scale sensing, and multi-level feature fusion is proposed, which includes Selective kernel spatial Feature Extraction (SFE), Contextual Information Aggregation (CIA), and Attentional Feature Fusion (AFF) modules. First, Selective kernel spatial Feature Extraction modules are used for cascading composition, highlighting information representation of features, and improving the feature extraction capability. Adding a Contextual Information Aggregation module enables the acquisition of multi-scale contextual information. The Attentional Feature Fusion module bridges the semantic gap between high-level and low-level features to achieve effective fusion between cross-level features. The classical U-Net, Segnet, Deeplab v3+, and HRNet v2 semantic segmentation models are compared on the self-built Tmsk and WHU building datasets. The experimental results show that the algorithm proposed in this paper can effectively extract multi-scale buildings in complex backgrounds with IoUs of 85.98% and 89.90% on the two datasets, respectively. SCA-Net is a suitable method for building extraction from high-resolution remote sensing images with good usability and generalization.<\/jats:p>","DOI":"10.3390\/rs15184466","type":"journal-article","created":{"date-parts":[[2023,9,11]],"date-time":"2023-09-11T09:09:21Z","timestamp":1694423361000},"page":"4466","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":16,"title":["SCA-Net: Multiscale Contextual Information Network for Building Extraction Based on High-Resolution Remote Sensing Images"],"prefix":"10.3390","volume":"15","author":[{"given":"Yuanzhi","family":"Wang","sequence":"first","affiliation":[{"name":"College of Information Science and Technology, Shihezi University, Shihezi 832002, China"},{"name":"Geospatial Information Engineering Research Center, Xinjiang Production and Construction Crops, Shihezi 832002, China"},{"name":"Xinjiang Production and Construction Corps Industrial Technology Research Institute, Shihezi 832002, China"}]},{"given":"Qingzhan","family":"Zhao","sequence":"additional","affiliation":[{"name":"College of Information Science and Technology, Shihezi University, Shihezi 832002, China"},{"name":"Geospatial Information Engineering Research Center, Xinjiang Production and Construction Crops, Shihezi 832002, China"},{"name":"Xinjiang Production and Construction Corps Industrial Technology Research Institute, Shihezi 832002, China"}]},{"given":"Yuzhen","family":"Wu","sequence":"additional","affiliation":[{"name":"College of Information Science and Technology, Shihezi University, Shihezi 832002, China"},{"name":"Geospatial Information Engineering Research Center, Xinjiang Production and Construction Crops, Shihezi 832002, China"},{"name":"Xinjiang Production and Construction Corps Industrial Technology Research Institute, Shihezi 832002, China"}]},{"given":"Wenzhong","family":"Tian","sequence":"additional","affiliation":[{"name":"College of Information Science and Technology, Shihezi University, Shihezi 832002, China"},{"name":"Geospatial Information Engineering Research Center, Xinjiang Production and Construction Crops, Shihezi 832002, China"},{"name":"College of Mechanical and Electrical Engineering, Shihezi University, Shihezi 832002, China"}]},{"given":"Guoshun","family":"Zhang","sequence":"additional","affiliation":[{"name":"College of Information Science and Technology, Shihezi University, Shihezi 832002, China"},{"name":"Geospatial Information Engineering Research Center, Xinjiang Production and Construction Crops, Shihezi 832002, China"},{"name":"Xinjiang Production and Construction Corps Industrial Technology Research Institute, Shihezi 832002, China"}]}],"member":"1968","published-online":{"date-parts":[[2023,9,11]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"8945","DOI":"10.1073\/pnas.1606035114","article-title":"Global Scenarios of Urban Density and Its Impacts on Building Energy Use through 2050","volume":"114","author":"Zhou","year":"2017","journal-title":"Proc. Natl. Acad. Sci. USA"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Claassens, J., Koomen, E., and Rouwendal, J. (2020). Urban Density and Spatial Planning: The Unforeseen Impacts of Dutch Devolution. PLoS ONE, 15.","DOI":"10.1371\/journal.pone.0240738"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"107114","DOI":"10.1016\/j.buildenv.2020.107114","article-title":"Identifying Key Determinants for Building Energy Analysis from Urban Building Datasets","volume":"181","author":"Li","year":"2020","journal-title":"Build. Environ."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"1506","DOI":"10.1080\/17538947.2022.2111470","article-title":"A Transformer-Based Siamese Network and an Open Optical Dataset for Semantic Change Detection of Remote Sensing Images","volume":"15","author":"Yuan","year":"2022","journal-title":"Int. J. Digit. Earth"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"102614","DOI":"10.1016\/j.ijdrr.2021.102614","article-title":"Evaluating Urban Flood Risk Using Hybrid Method of TOPSIS and Machine Learning","volume":"66","author":"Azareh","year":"2021","journal-title":"Int. J. Disaster Risk Reduct."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"300","DOI":"10.1126\/science.abh4455","article-title":"A Massive Rock and Ice Avalanche Caused the 2021 Disaster at Chamoli, Indian Himalaya","volume":"373","author":"Shugar","year":"2021","journal-title":"Science"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"520","DOI":"10.1038\/s41561-022-00953-y","article-title":"High Mountain Asia Hydropower Systems Threatened by Climate-Driven Landscape Instability","volume":"15","author":"Li","year":"2022","journal-title":"Nat. Geosci."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"240","DOI":"10.1016\/j.isprsjprs.2021.11.005","article-title":"A coarse-to-fine boundary refinement network for building footprint extraction from remote sensing imagery","volume":"183","author":"Guo","year":"2022","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Yuan, W., Wang, J., and Xu, W. (2022). Shift Pooling PSPNet: Rethinking Pspnet for Building Extraction in Remote Sensing Images from Entire Local Feature Pooling. Remote Sens., 14.","DOI":"10.3390\/rs14194889"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Shao, Z., Tang, P., Wang, Z., Saleem, N., Yam, S., and Sommai, C. (2020). BRRNet: A Fully Convolutional Neural Network for Automatic Building Extraction from High-Resolution Remote Sensing Images. Remote Sens., 12.","DOI":"10.3390\/rs12061050"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Ran, S., Gao, X., Yang, Y., Li, S., Zhang, G., and Wang, P. (2021). Building Multi-Feature Fusion Refined Network for Building Extraction from High-Resolution Remote Sensing Images. Remote Sens., 13.","DOI":"10.3390\/rs13142794"},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"114417","DOI":"10.1016\/j.eswa.2020.114417","article-title":"A Review of Deep Learning Methods for Semantic Segmentation of Remote Sensing Imagery","volume":"169","author":"Yuan","year":"2021","journal-title":"Expert Syst. Appl."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"100379","DOI":"10.1016\/j.cosrev.2021.100379","article-title":"A Survey on Deep Learning and Its Applications","volume":"40","author":"Dong","year":"2021","journal-title":"Comput. Sci. Rev."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"302","DOI":"10.1016\/j.neucom.2019.11.118","article-title":"A Brief Survey on Semantic Segmentation with Deep Learning","volume":"406","author":"Hao","year":"2020","journal-title":"Neurocomputing"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Zuo, T., Feng, J., and Chen, X. (2016, January 20\u201324). HF-FCN: Hierarchically Fused Fully Convolutional Network for Robust Building Extraction. Proceedings of the Computer Vision\u2013ACCV 2016: 13th Asian Conference on Computer Vision, Taipei, Taiwan. Revised Selected Papers, Part I 13.","DOI":"10.1007\/978-3-319-54181-5_19"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Schuegraf, P., and Bittner, K. (2019). Automatic Building Footprint Extraction from Multi-Resolution Remote Sensing Images Using a Hybrid FCN. ISPRS Int. J. Geo-Inf., 8.","DOI":"10.3390\/ijgi8040191"},{"key":"ref_17","unstructured":"Ronneberger, O., Fischer, P., and Brox, T. (2015, January 5\u20139). U-Net: Convolutional Networks for Biomedical Image Segmentation. Proceedings of the Medical Image Computing and Computer-Assisted Intervention\u2013MICCAI 2015: 18th International Conference, Munich, Germany. Proceedings, Part III 18."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Hosseinpoor, H., and Samadzadegan, F. (2020, January 18\u201320). Convolutional Neural Network for Building Extraction from High-Resolution Remote Sensing Images. Proceedings of the 2020 International Conference on Machine Vision and Image Processing (MVIP), Qom, Iran.","DOI":"10.1109\/MVIP49855.2020.9187483"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Si, Z., Zhou, B., Wang, B., Wang, X., and Zhu, L. (2022, January 29\u201331). High-Resolution Remote Sensing Building Extraction Based on Attention Mechanism and DeepLabv3+. Proceedings of the 5th International Conference on Computer Information Science and Application Technology (CISAT 2022), Chongqing, China.","DOI":"10.1117\/12.2656777"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"834","DOI":"10.1109\/TPAMI.2017.2699184","article-title":"DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs","volume":"40","author":"Chen","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Seong, S., and Choi, J. (2021). Semantic Segmentation of Urban Buildings Using a High-Resolution Network (HRNet) with Channel and Spatial Attention Gates. Remote Sens., 13.","DOI":"10.3390\/rs13163087"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Sun, K., Xiao, B., Liu, D., and Wang, J. (2019, January 16\u201320). Deep High-Resolution Representation Learning for Human Pose Estimation. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00584"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"6514405","DOI":"10.1109\/LGRS.2022.3197319","article-title":"CSA-UNet: Channel-Spatial Attention-Based Encoder\u2013Decoder Network for Rural Blue-Roofed Building Extraction From UAV Imagery","volume":"19","author":"Shi","year":"2022","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Aryal, J., and Neupane, B. (2023). Multi-Scale Feature Map Aggregation and Supervised Domain Adaptation of Fully Convolutional Networks for Urban Building Footprint Extraction. Remote Sens., 15.","DOI":"10.3390\/rs15020488"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Xu, X., Zhang, H., Ran, Y., and Tan, Z. (2023). High-Precision Segmentation of Buildings with Small Sample Sizes Based on Transfer Learning and Multi-Scale Fusion. Remote Sens., 15.","DOI":"10.3390\/rs15092436"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Li, M., Rui, J., Yang, S., Liu, Z., Ren, L., Ma, L., Li, Q., Su, X., and Zuo, X. (2023). Method of Building Detection in Optical Remote Sensing Images Based on SegFormer. Sensors, 23.","DOI":"10.3390\/s23031258"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Yuan, W., and Xu, W. (2021). MSST-Net: A Multi-Scale Adaptive Network for Building Extraction from Remote Sensing Images Based on Swin Transformer. Remote Sens., 13.","DOI":"10.3390\/rs13234743"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Chen, K., Zou, Z., and Shi, Z. (2021). Building Extraction from Remote Sensing Images with Sparse Token Transformers. Remote Sens., 13.","DOI":"10.3390\/rs13214441"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Kirillov, A., Mintun, E., Ravi, N., Mao, H., Rolland, C., Gustafson, L., Xiao, T., Whitehead, S., Berg, A.C., and Lo, W.-Y. (2023). Segment Anything. arXiv.","DOI":"10.1109\/ICCV51070.2023.00371"},{"key":"ref_30","unstructured":"Chen, K., Liu, C., Chen, H., Zhang, H., Li, W., Zou, Z., and Shi, Z. (2023). RSPrompter: Learning to Prompt for Remote Sensing Instance Segmentation Based on Visual Foundation Model. arXiv."},{"key":"ref_31","first-page":"2503605","article-title":"Multiscale Feature Learning by Transformer for Building Extraction from Satellite Images","volume":"19","author":"Chen","year":"2022","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"103509","DOI":"10.1016\/j.autcon.2020.103509","article-title":"Automated Building Extraction Using Satellite Remote Sensing Imagery","volume":"123","author":"Hu","year":"2021","journal-title":"Autom. Constr."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"574","DOI":"10.1109\/TGRS.2018.2858817","article-title":"Fully Convolutional Networks for Multisource Building Extraction from an Open Aerial and Satellite Imagery Data Set","volume":"57","author":"Ji","year":"2018","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Maggiori, E., Tarabalka, Y., Charpiat, G., and Alliez, P. (2017, January 23\u201328). Can Semantic Labeling Methods Generalize to Any City? The Inria Aerial Image Labeling Benchmark. Proceedings of the 2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Fort Worth, TX, USA.","DOI":"10.1109\/IGARSS.2017.8127684"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Wang, Y., Zeng, X., Liao, X., and Zhuang, D. (2022). B-FGC-Net: A Building Extraction Network from High Resolution Remote Sensing Imagery. Remote Sens., 14.","DOI":"10.3390\/rs14020269"},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Chen, M., Wu, J., Liu, L., Zhao, W., Tian, F., Shen, Q., Zhao, B., and Du, R. (2021). DR-Net: An Improved Network for Building Extraction from High Resolution Remote Sensing Image. Remote Sens., 13.","DOI":"10.3390\/rs13020294"},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Chen, Z., Li, D., Fan, W., Guan, H., Wang, C., and Li, J. (2021). Self-Attention in Reconstruction Bias U-Net for Semantic Segmentation of Building Rooftops in Optical Remote Sensing Images. Remote Sens., 13.","DOI":"10.3390\/rs13132524"},{"key":"ref_38","first-page":"1929","article-title":"Dropout: A Simple Way to Prevent Neural Networks from Overfitting","volume":"15","author":"Srivastava","year":"2014","journal-title":"J. Mach. Learn. Res."},{"key":"ref_39","unstructured":"Ioffe, S., and Szegedy, C. (2015;, January 6\u201311). Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift. Proceedings of the International Conference on Machine Learning, Lille, France."},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Li, X., Wang, W., Hu, X., and Yang, J. (2019, January 15\u201320). Selective Kernel Networks. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00060"},{"key":"ref_41","unstructured":"Agarap, A.F. (2018). Deep Learning Using Rectified Linear Units (Relu). arXiv."},{"key":"ref_42","unstructured":"Han, J., and Moraga, C. (1995). Proceedings of the International Workshop on Artificial Neural Networks, Springer."},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Wang, P., Chen, P., Yuan, Y., Liu, D., Huang, Z., Hou, X., and Cottrell, G. (2018, January 12\u201315). Understanding Convolution for Semantic Segmentation. Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision (WACV), Lake Tahoe, NV, USA.","DOI":"10.1109\/WACV.2018.00163"},{"key":"ref_44","unstructured":"Kinga, D., and Adam, J.B. (2015, January 7\u20139). A Method for Stochastic Optimization. Proceedings of the International Conference on Learning Representations (ICLR), San Diego, CA, USA."},{"key":"ref_45","doi-asserted-by":"crossref","first-page":"179424","DOI":"10.1109\/ACCESS.2020.3026658","article-title":"VNet: An End-to-End Fully Convolutional Neural Network for Road Extraction from High-Resolution Remote Sensing Data","volume":"8","author":"Abdollahi","year":"2020","journal-title":"IEEE Access"},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"Lin, T.-Y., Goyal, P., Girshick, R., He, K., and Doll\u00e1r, P. (2017, January 22\u201329). Focal Loss for Dense Object Detection. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.324"},{"key":"ref_47","unstructured":"Xie, E., Wang, W., Yu, Z., Anandkumar, A., Alvarez, J.M., and Luo, P. (2021, January 6\u201314). SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers. Proceedings of the Advances in Neural Information Processing Systems, Curran Associates, Inc., virtual."},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"161","DOI":"10.1109\/JSTARS.2011.2168195","article-title":"Morphological Building\/Shadow Index for Building Extraction From High-Resolution Imagery Over Urban Areas","volume":"5","author":"Huang","year":"2012","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_49","doi-asserted-by":"crossref","first-page":"2793","DOI":"10.1109\/TPAMI.2017.2750680","article-title":"Learning Building Extraction in Aerial Scenes with Convolutional Networks","volume":"40","author":"Yuan","year":"2018","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/15\/18\/4466\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T20:48:42Z","timestamp":1760129322000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/15\/18\/4466"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,9,11]]},"references-count":49,"journal-issue":{"issue":"18","published-online":{"date-parts":[[2023,9]]}},"alternative-id":["rs15184466"],"URL":"https:\/\/doi.org\/10.3390\/rs15184466","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,9,11]]}}}