{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,3]],"date-time":"2026-04-03T14:58:42Z","timestamp":1775228322912,"version":"3.50.1"},"reference-count":31,"publisher":"MDPI AG","issue":"3","license":[{"start":{"date-parts":[[2023,1,21]],"date-time":"2023-01-21T00:00:00Z","timestamp":1674259200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>An appropriate detection network is required to extract building information in remote sensing images and to relieve the issue of poor detection effects resulting from the deficiency of detailed features. Firstly, we embed a transposed convolution sampling module fusing multiple normalization activation layers in the decoder based on the SegFormer network. This step alleviates the issue of missing feature semantics by adding holes and fillings, cascading multiple normalizations and activation layers to hold back over-fitting regularization expression and guarantee steady feature parameter classification. Secondly, the atrous spatial pyramid pooling decoding module is fused to explore multi-scale contextual information and to overcome issues such as the loss of detailed information on local buildings and the lack of long-distance information. Ablation experiments and comparison experiments are performed on the remote sensing image AISD, MBD, and WHU dataset. The robustness and validity of the improved mechanism are demonstrated by control groups of ablation experiments. In comparative experiments with the HRnet, PSPNet, U-Net, DeepLabv3+ networks, and the original detection algorithm, the mIoU of the AISD, the MBD, and the WHU dataset is enhanced by 17.68%, 30.44%, and 15.26%, respectively. The results of the experiments show that the method of this paper is superior to comparative methods such as U-Net. Furthermore, it is better for integrity detection of building edges and reduces the number of missing and false detections.<\/jats:p>","DOI":"10.3390\/s23031258","type":"journal-article","created":{"date-parts":[[2023,1,23]],"date-time":"2023-01-23T01:36:26Z","timestamp":1674437786000},"page":"1258","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":29,"title":["Method of Building Detection in Optical Remote Sensing Images Based on SegFormer"],"prefix":"10.3390","volume":"23","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-1687-5077","authenticated-orcid":false,"given":"Meilin","family":"Li","sequence":"first","affiliation":[{"name":"Department of Geographic Information, Information Engineering University, Wutong Street High-Tech District, Zhengzhou 450001, China"}]},{"given":"Jie","family":"Rui","sequence":"additional","affiliation":[{"name":"Department of Geographic Information, Information Engineering University, Wutong Street High-Tech District, Zhengzhou 450001, China"}]},{"given":"Songkun","family":"Yang","sequence":"additional","affiliation":[{"name":"School of Computer Science & Technology, Beijing Institute of Technology, Haidian District, Beijing 100081, China"}]},{"given":"Zhi","family":"Liu","sequence":"additional","affiliation":[{"name":"Department of Geographic Information, Information Engineering University, Wutong Street High-Tech District, Zhengzhou 450001, China"}]},{"given":"Liqiu","family":"Ren","sequence":"additional","affiliation":[{"name":"Department of Geographic Information, Information Engineering University, Wutong Street High-Tech District, Zhengzhou 450001, China"}]},{"given":"Li","family":"Ma","sequence":"additional","affiliation":[{"name":"Department of Geographic Information, Information Engineering University, Wutong Street High-Tech District, Zhengzhou 450001, China"}]},{"given":"Qing","family":"Li","sequence":"additional","affiliation":[{"name":"Department of Geographic Information, Information Engineering University, Wutong Street High-Tech District, Zhengzhou 450001, China"}]},{"given":"Xu","family":"Su","sequence":"additional","affiliation":[{"name":"Department of Geographic Information, Information Engineering University, Wutong Street High-Tech District, Zhengzhou 450001, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8120-8692","authenticated-orcid":false,"given":"Xibing","family":"Zuo","sequence":"additional","affiliation":[{"name":"Department of Geographic Information, Information Engineering University, Wutong Street High-Tech District, Zhengzhou 450001, China"}]}],"member":"1968","published-online":{"date-parts":[[2023,1,21]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Zhou, J., Liu, Y., Nie, G., Cheng, H., Yang, X., Chen, X., and Gross, L. (2022). Building Extraction and Floor Area Estimation at the Village Level in Rural China via a Comprehensive Method Integrating UAV Photogrammetry and the Novel EDSANet. Remote Sens., 14.","DOI":"10.3390\/rs14205175"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Wang, Y., Cui, L., Zhang, C., Chen, W., Xu, Y., and Zhang, Q. (2022). A Two-Stage Seismic Damage Assessment Method for Small, Dense, and Imbalanced Buildings in Remote Sensing Images. Remote Sens., 14.","DOI":"10.3390\/rs14041012"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Degas, A., Islam, M.R., Hurter, C., Barua, S., Rahman, H., Poudel, M., Ruscio, D., Ahmed, M.U., Begum, S., and Rahman, M.A. (2022). A survey on artificial intelligence (ai) and explainable ai in air traffic management: Current trends and development with future research trajectory. Appl. Sci., 12.","DOI":"10.3390\/app12031295"},{"key":"ref_4","first-page":"193","article-title":"Machine Learning-based Classification of Hyperspectral Imagery","volume":"22","author":"Haq","year":"2022","journal-title":"Int. J. Comput. Sci. Netw. Secur."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Jiang, H., Peng, M., Zhong, Y., Xie, H., Hao, Z., Lin, J., Ma, X., and Hu, X. (2022). A Survey on Deep Learning-Based Change Detection from High-Resolution Remote Sensing Images. Remote Sens., 14.","DOI":"10.3390\/rs14071552"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Shafique, A., Cao, G., Khan, Z., Asad, M., and Aslam, M. (2022). Deep learning-based change detection in remote sensing images: A review. Remote Sens., 14.","DOI":"10.3390\/rs14040871"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"104440","DOI":"10.1016\/j.autcon.2022.104440","article-title":"Artificial intelligence and smart vision for building and construction 4.0: Machine and deep learning methods and applications","volume":"141","author":"Baduge","year":"2022","journal-title":"Autom. Constr."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"1847","DOI":"10.1007\/s40747-021-00322-z","article-title":"Remote sensing image building detection method based on Mask R-CNN","volume":"8","author":"Han","year":"2022","journal-title":"Complex Intell. Syst."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"1494","DOI":"10.1109\/JSTARS.2022.3146430","article-title":"Res2-Unet, a New Deep Architecture for Building Detection from High Spatial Resolution Images","volume":"15","author":"Chen","year":"2022","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Yu, M., Chen, X., Zhang, W., and Liu, Y. (2022). AGs-Unet: Building Extraction Model for High Resolution Remote Sensing Images Based on Attention Gates U Network. Sensors, 22.","DOI":"10.3390\/s22082932"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"240","DOI":"10.1016\/j.isprsjprs.2021.11.005","article-title":"A coarse-to-fine boundary refinement network for building footprint extraction from remote sensing imagery","volume":"183","author":"Guo","year":"2022","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"117268","DOI":"10.1016\/j.eswa.2022.117268","article-title":"A novel attention-based deep learning method for post-disaster building damage classification","volume":"202","author":"Liu","year":"2022","journal-title":"Expert Syst. Appl."},{"key":"ref_13","first-page":"1","article-title":"HRNet-and PSPNet-based multiband semantic segmentation of remote sensing images","volume":"35","author":"Sun","year":"2022","journal-title":"Neural Comput. Appl."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Yang, H., Xu, M., Chen, Y., Wu, W., and Dong, W. (2022). A Postprocessing Method Based on Regions and Boundaries Using Convolutional Neural Networks and a New Dataset for Building Extraction. Remote Sens., 14.","DOI":"10.3390\/rs14030647"},{"key":"ref_15","first-page":"102678","article-title":"Automatic generation of land use maps using aerial orthoimages and building floor data with a Conv-Depth Block (CDB) ResU-Net architecture","volume":"107","author":"Yoo","year":"2022","journal-title":"Int. J. Appl. Earth Observ. Geoinf."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"104569","DOI":"10.1016\/j.landurbplan.2022.104569","article-title":"Semantic Riverscapes: Perception and evaluation of linear landscapes from oblique imagery using computer vision","volume":"228","author":"Luo","year":"2022","journal-title":"Landsc. Urban Plan."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Wang, Y., Zeng, X., Liao, X., and Zhuang, D. (2022). B-FGC-Net: A Building Extraction Network from High Resolution Remote Sensing Imagery. Remote Sens., 14.","DOI":"10.3390\/rs14020269"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"100084","DOI":"10.1016\/j.adapen.2022.100084","article-title":"Transfer learning for smart buildings: A critical review of algorithms, applications, and future perspectives","volume":"5","author":"Pinto","year":"2022","journal-title":"Adv. Appl. Energy"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Zhou, Y., Chang, H., Lu, Y., and Lu, X. (2022). CDTNet: Improved image classification method using standard, Dilated and Transposed Convolutions. Appl. Sci., 12.","DOI":"10.3390\/app12125984"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Ahmad, I., Qayyum, A., Gupta, B.B., Alassafi, M.O., and AlGhamdi, R.A. (2022). Ensemble of 2D Residual Neural Networks Integrated with Atrous Spatial Pyramid Pooling Module for Myocardium Segmentation of Left Ventricle Cardiac MRI. Mathematics, 10.","DOI":"10.3390\/math10040627"},{"key":"ref_21","first-page":"12077","article-title":"SegFormer: Simple and efficient design for semantic segmentation with transformers","volume":"34","author":"Xie","year":"2021","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_22","first-page":"1","article-title":"Applications of artificial neural networks in microorganism image analysis: A comprehensive review from conventional multilayer perceptron to popular convolutional neural network and potential visual transformer","volume":"56","author":"Zhang","year":"2022","journal-title":"Artif. Intell. Rev."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3505244","article-title":"Transformers in vision: A survey","volume":"54","author":"Khan","year":"2022","journal-title":"ACM Comput. Surv. (CSUR)"},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"109115","DOI":"10.1016\/j.patcog.2022.109115","article-title":"Batch normalization embeddings for deep domain generalization","volume":"135","author":"Segu","year":"2022","journal-title":"Pattern Recognit."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"48","DOI":"10.1016\/j.neunet.2022.01.001","article-title":"Discovering parametric activation functions","volume":"148","author":"Bingham","year":"2022","journal-title":"Neural Netw."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"26","DOI":"10.1016\/j.neucom.2021.02.091","article-title":"See more than once: Kernel-sharing atrous convolution for semantic segmentation","volume":"443","author":"Huang","year":"2021","journal-title":"Neurocomputing"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Cui, F., and Jiang, J. (2022). Shuffle-CDNet: A Lightweight Network for Change Detection of Bitemporal Remote-Sensing Images. Remote Sens., 14.","DOI":"10.3390\/rs14153548"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Wang, P., Chen, P., Yuan, Y., Liu, D., Huang, Z., Hou, X., and Cottrell, G. (2018, January 12\u201315). Understanding Convolution for Semantic Segmentation. Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision (WACV), Lake Tahoe, NV, USA.","DOI":"10.1109\/WACV.2018.00163"},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"3355","DOI":"10.1080\/10106049.2020.1856199","article-title":"An ensemble architecture of deep convolutional Segnet and Unet networks for building semantic segmentation from high-resolution aerial images","volume":"37","author":"Abdollahi","year":"2022","journal-title":"Geocarto Int."},{"key":"ref_30","first-page":"1","article-title":"Building extraction with vision transformer","volume":"60","author":"Wang","year":"2022","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"71","DOI":"10.1080\/22797254.2021.2018944","article-title":"Building extraction from remote sensing images using deep residual U-Net","volume":"55","author":"Wang","year":"2022","journal-title":"Eur. J. Remote Sens."}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/23\/3\/1258\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T18:13:10Z","timestamp":1760119990000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/23\/3\/1258"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,1,21]]},"references-count":31,"journal-issue":{"issue":"3","published-online":{"date-parts":[[2023,2]]}},"alternative-id":["s23031258"],"URL":"https:\/\/doi.org\/10.3390\/s23031258","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,1,21]]}}}