{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,8]],"date-time":"2026-03-08T21:19:09Z","timestamp":1773004749894,"version":"3.50.1"},"reference-count":50,"publisher":"MDPI AG","issue":"11","license":[{"start":{"date-parts":[[2018,11,8]],"date-time":"2018-11-08T00:00:00Z","timestamp":1541635200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["41501376"],"award-info":[{"award-number":["41501376"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["41571400"],"award-info":[{"award-number":["41571400"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100003995","name":"Natural Science Foundation of Anhui Province","doi-asserted-by":"publisher","award":["1608085MD83"],"award-info":[{"award-number":["1608085MD83"]}],"id":[{"id":"10.13039\/501100003995","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Building extraction from very high resolution (VHR) imagery plays an important role in urban planning, disaster management, navigation, updating geographic databases, and several other geospatial applications. Compared with the traditional building extraction approaches, deep learning networks have recently shown outstanding performance in this task by using both high-level and low-level feature maps. However, it is difficult to utilize different level features rationally with the present deep learning networks. To tackle this problem, a novel network based on DenseNets and the attention mechanism was proposed, called the dense-attention network (DAN). The DAN contains an encoder part and a decoder part which are separately composed of lightweight DenseNets and a spatial attention fusion module. The proposed encoder\u2013decoder architecture can strengthen feature propagation and effectively bring higher-level feature information to suppress the low-level feature and noises. Experimental results based on public international society for photogrammetry and remote sensing (ISPRS) datasets with only red\u2013green\u2013blue (RGB) images demonstrated that the proposed DAN achieved a higher score (96.16% overall accuracy (OA), 92.56% F1 score, 90.56% mean intersection over union (MIOU), less training and response time and higher-quality value) when compared with other deep learning methods.<\/jats:p>","DOI":"10.3390\/rs10111768","type":"journal-article","created":{"date-parts":[[2018,11,9]],"date-time":"2018-11-09T03:08:02Z","timestamp":1541732882000},"page":"1768","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":101,"title":["Building Extraction in Very High Resolution Imagery by Dense-Attention Networks"],"prefix":"10.3390","volume":"10","author":[{"given":"Hui","family":"Yang","sequence":"first","affiliation":[{"name":"School of Resource and Environmental Science, Wuhan University, Wuhan 430079, China"},{"name":"School of Resources and Environmental Engineering, Anhui University, Hefei 230601, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1983-5978","authenticated-orcid":false,"given":"Penghai","family":"Wu","sequence":"additional","affiliation":[{"name":"School of Resources and Environmental Engineering, Anhui University, Hefei 230601, China"},{"name":"Institute of Physical Science and Information Technology, Anhui University, Hefei 230601, China"},{"name":"Anhui Engineering Research Center for Geographical Information Intelligent Technology, Hefei 230601, China"}]},{"given":"Xuedong","family":"Yao","sequence":"additional","affiliation":[{"name":"School of Resources and Environmental Engineering, Anhui University, Hefei 230601, China"}]},{"given":"Yanlan","family":"Wu","sequence":"additional","affiliation":[{"name":"School of Resources and Environmental Engineering, Anhui University, Hefei 230601, China"},{"name":"Institute of Physical Science and Information Technology, Anhui University, Hefei 230601, China"},{"name":"Anhui Engineering Research Center for Geographical Information Intelligent Technology, Hefei 230601, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3594-7953","authenticated-orcid":false,"given":"Biao","family":"Wang","sequence":"additional","affiliation":[{"name":"School of Resources and Environmental Engineering, Anhui University, Hefei 230601, China"},{"name":"Anhui Engineering Research Center for Geographical Information Intelligent Technology, Hefei 230601, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7421-4915","authenticated-orcid":false,"given":"Yongyang","family":"Xu","sequence":"additional","affiliation":[{"name":"Department of Information Engineering, China University of Geosciences, Wuhan 430074, China"}]}],"member":"1968","published-online":{"date-parts":[[2018,11,8]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"5234","DOI":"10.1080\/01431161.2016.1230287","article-title":"Building extraction from high-resolution satellite images in urban areas: Recent methods and strategies against significant challenges","volume":"37","author":"Ghanea","year":"2016","journal-title":"Int. J. Remote Sens."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"21","DOI":"10.1016\/j.isprsjprs.2013.09.004","article-title":"Automated detection of buildings from single vhr multispectral images using shadow information and graph cuts","volume":"86","author":"Ok","year":"2013","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"22","DOI":"10.1109\/MGRS.2016.2540798","article-title":"Deep learning for remote sensing data: A technical tutorial on the state of the art","volume":"4","author":"Zhang","year":"2016","journal-title":"IEEE Geosci. Remote Sens. Mag."},{"key":"ref_4","first-page":"150","article-title":"Automatic urban building boundary extraction from high resolution aerial images using an innovative model of active contours","volume":"12","author":"Ahmadi","year":"2010","journal-title":"Int. J. Appl. Earth Obs. Geoinf."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"721","DOI":"10.14358\/PERS.77.7.721","article-title":"A multidirectional and multiscale morphological index for automatic building extraction from multispectral geoeye-1 imagery","volume":"77","author":"Huang","year":"2011","journal-title":"Photogramm. Eng. Remote Sens."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"745309","DOI":"10.1155\/ASP.2005.2196","article-title":"Automated building extraction from high-resolution satellite imagery in urban areas using structural, contextual, and spectral information","volume":"14","author":"Jin","year":"2005","journal-title":"EURASIP J. Appl. Signal Process."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"5094","DOI":"10.1080\/01431161.2014.933278","article-title":"Automatic building extraction in dense urban areas through geoeye multispectral imagery","volume":"35","author":"Ghanea","year":"2014","journal-title":"Int. J. Remote Sens."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"294","DOI":"10.1016\/j.isprsjprs.2017.06.005","article-title":"Automatic building extraction from LiDAR data fusion of point and grid-based features","volume":"130","author":"Du","year":"2017","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_9","first-page":"5547","article-title":"Processing of extremely high-resolution lidar and rgb data: Outcome of the 2015 ieee grss data fusion contest\u2014Part A: 2-D contest","volume":"9","author":"Gatta","year":"2017","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"5560","DOI":"10.1109\/JSTARS.2016.2581843","article-title":"Processing of extremely high resolution LiDAR and RGB data: Outcome of the 2015 IEEE GRSS data fusion contest\u2014Part B: 3-D contest","volume":"9","author":"Vo","year":"2016","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Gilani, S.A.N., Awrangjeb, M., and Lu, G. (2016). An automatic building extraction and regularisation technique using lidar point cloud data and orthoimage. Remote Sens., 8.","DOI":"10.3390\/rs8030258"},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"152","DOI":"10.1016\/j.isprsjprs.2013.11.001","article-title":"Contextual classification of lidar data and building object detection in urban areas","volume":"87","author":"Niemeyer","year":"2014","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"04017021","DOI":"10.1061\/(ASCE)CP.1943-5487.0000674","article-title":"Urban point cloud mining based on density clustering and MapReduce","volume":"31","author":"Aljumaily","year":"2017","journal-title":"J. Comput. Civ. Eng."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"906","DOI":"10.1109\/JSTARS.2016.2603184","article-title":"Building extraction from remotely sensed images by integrating saliency cue","volume":"10","author":"Li","year":"2017","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Vakalopoulou, M., Karantzalos, K., Komodakis, N., and Paragios, N. (2015, January 26\u201331). In Building detection in very high resolution multispectral data with deep learning features. Proceedings of the Geoscience and Remote Sensing Symposium, Milan, Italy.","DOI":"10.1109\/IGARSS.2015.7326158"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Sun, Y., Zhang, X., Zhao, X., and Xin, Q. (2018). Extracting Building Boundaries from High Resolution Optical Images and LiDAR Data by Integrating the Convolutional Neural Network and the Active Contour Model. Remote Sens., 10.","DOI":"10.3390\/rs10091459"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1016\/j.isprsjprs.2017.05.002","article-title":"Simultaneous extraction of roads and buildings in remote sensing imagery with convolutional neural networks","volume":"130","author":"Alshehhi","year":"2017","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Yang, H.L., Yuan, J., Lunga, D., Laverdiere, M., Rose, A., and Bhaduri, B. (2018). Building extraction at scale using convolutional neural network: Mapping of the united states. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens.","DOI":"10.1109\/JSTARS.2018.2835377"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Saito, S., Yamashita, T., and Aoki, Y. (2016). Multiple object extraction from aerial imagery with convolutional neural networks. Electron. Imaging, 60.","DOI":"10.2352\/J.ImagingSci.Technol.2016.60.1.010402"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Huang, Z., Cheng, G., Wang, H., Li, H., Shi, L., and Pan, C. (2016, January 10\u201315). Building extraction from multi-source remote sensing images via deep deconvolution neural networks. Proceedings of the 2016 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Beijing, China.","DOI":"10.1109\/IGARSS.2016.7729471"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Bittner, K., Cui, S., and Reinartz, P. (2017, January 6\u20139). Building extraction from remote sensing data using fully convolutional networks. Proceedings of the International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, ISPRS Hannover Workshop, Hannover, Germany.","DOI":"10.5194\/isprs-archives-XLII-1-W1-481-2017"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Ronneberger, O., Fischer, P., and Brox, T. (2015, January 5\u20139). U-Net: Convolutional networks for biomedical image segmentation. Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention, Munich, Germany.","DOI":"10.1007\/978-3-319-24574-4_28"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Badrinarayanan, V., Kendall, A., and Cipolla, R. (arXiv, 2016). Segnet: A deep convolutional encoder-decoder architecture for scene segmentation, arXiv.","DOI":"10.1109\/TPAMI.2016.2644615"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Lin, G., Milan, A., Shen, C., and Reid, I. (arXiv, 2016). Refinenet: Multi-path refinement networks for high-resolution semantic segmentation, arXiv.","DOI":"10.1109\/CVPR.2017.549"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Huang, G., Liu, Z., Laurens, V.D.M., and Weinberger, K.Q. (2017, January 21\u201326). Densely connected convolutional networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.243"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"J\u00e9gou, S., Drozdzal, M., Vazquez, D., Romero, A., and Bengio, Y. (2017, January 21\u201326). The one hundred layers tiramisu: Fully convolutional densenets for semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPRW.2017.156"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Li, L., Liang, J., Weng, M., and Zhu, H. (2018). A Multiple-Feature Reuse Network to Extract Buildings from Remote Sensing Imagery. Remote Sens., 10.","DOI":"10.3390\/rs10091350"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Wang, H., Wang, Y., Zhang, Q., Xiang, S., and Pan, C. (2017). Gated convolutional neural network for semantic segmentation in high-resolution images. Remote Sens., 9.","DOI":"10.3390\/rs9050446"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Yang, Y., Zhong, Z., Shen, T., and Lin, Z. (2018, January 18\u201322). Convolutional neural networks with alternately updated clique. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00256"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"3252","DOI":"10.1109\/JSTARS.2018.2860989","article-title":"Semantic segmentation for high spatial resolution remote sensing images based on convolution neural network and pyramid pooling module","volume":"11","author":"Yu","year":"2018","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Kemker, R., Salvaggio, C., and Kanan, C. (2018). Algorithms for semantic segmentation of multispectral remote sensing imagery using deep learning. ISPRS J. Photogramm. Remote Sens.","DOI":"10.1016\/j.isprsjprs.2018.04.014"},{"key":"ref_32","unstructured":"Simonyan, K., and Zisserman, A. (arXiv, 2014). Very deep convolutional networks for large-scale image recognition, arXiv."},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Wang, F., Jiang, M., Qian, C., Yang, S., Li, C., Zhang, H., Wang, X., and Tang, X. (2017, January 21\u201326). Residual attention network for image classification. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.683"},{"key":"ref_34","unstructured":"Chen, L.C., Papandreou, G., Schroff, F., and Adam, H. (arXiv, 2017). Rethinking atrous convolution for semantic image segmentation, arXiv."},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"194","DOI":"10.1038\/35058500","article-title":"Computational modelling of visual attention","volume":"2","author":"Itti","year":"2001","journal-title":"Nat. Rev. Neurosci."},{"key":"ref_36","unstructured":"Xu, K., Ba, J., Kiros, R., Cho, K., Courville, A., Salakhutdinov, R., Zemel, R., and Bengio, Y. (2015). Show, attend and tell: Neural image caption generation with visual attention. Cmputer Sci."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Chen, L., Zhang, H., Xiao, J., Nie, L., Shao, J., Liu, W., and Chua, T.S. (2017, January 21\u201326). Sca-cnn: Spatial and channel-wise attention in convolutional networks for image captioning. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.667"},{"key":"ref_38","unstructured":"Yang, Z., He, X., Gao, J., Deng, L., and Smola, A. (July, January 26). Stacked attention networks for image question answering. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA."},{"key":"ref_39","unstructured":"Chen, K., Wang, J., Chen, L.C., Gao, H., Xu, W., and Nevatia, R. (arXiv, 2015). Abc-cnn: An attention based convolutional neural network for visual question answering, arXiv."},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Fu, J., Zheng, H., and Mei, T. (2017, January 21\u201326). Look closer to see better: Recurrent attention convolutional neural network for fine-grained image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.476"},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Yao, L., Torabi, A., Cho, K., Ballas, N., Pal, C., Larochelle, H., and Courville, A. (2015, January 7\u201313). Describing videos by exploiting temporal structure. Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, Chile.","DOI":"10.1109\/ICCV.2015.512"},{"key":"ref_42","unstructured":"Kuen, J., Wang, Z., and Wang, G. (July, January 26). Recurrent attentional networks for saliency detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA."},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Hu, J., Shen, L., and Sun, G. (arXiv, 2018). Squeeze-and-excitation networks, arXiv.","DOI":"10.1109\/CVPR.2018.00745"},{"key":"ref_44","unstructured":"Li, H., Xiong, P., An, J., and Wang, L. (arXiv, 2018). Pyramid attention network for semantic segmentation, arXiv."},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Yu, C., Wang, J., Peng, C., Gao, C., Yu, G., and Sang, N. (2018, January 18\u201322). Learning a discriminative feature network for semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00199"},{"key":"ref_46","unstructured":"Golnaz, G., and Fowlkes, C.C. (2016, January 8\u201316). Laplacian pyramid reconstruction and refinement for semantic segmentation. Proceedings of the European Conference on Computer Vision (ECCV), Amsterdam, The Netherlands."},{"key":"ref_47","unstructured":"Pleiss, G., Chen, D., Huang, G., Li, T., Laurens, V.D.M., and Weinberger, K.Q. (arXiv, 2017). Memory-efficient implementation of densenets, arXiv."},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"4184","DOI":"10.1109\/JSTARS.2014.2318694","article-title":"An Automatic and Threshold-Free Performance Evaluation System for Building Extraction Techniques from Airborne LIDAR Data","volume":"7","author":"Awrangjeb","year":"2014","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Xu, Y., Wu, L., Xie, Z., and Chen, Z. (2018). Building extraction in very high resolution remote sensing imagery using deep learning and guided filters. Remote Sens., 10.","DOI":"10.3390\/rs10010144"},{"key":"ref_50","doi-asserted-by":"crossref","first-page":"20","DOI":"10.1016\/j.isprsjprs.2017.11.011","article-title":"Beyond RGB: Very high resolution urban remote sensing with multimodal deep networks","volume":"140","author":"Audebert","year":"2018","journal-title":"ISPRS J. Photogramm. Remote Sens."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/10\/11\/1768\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T15:28:41Z","timestamp":1760196521000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/10\/11\/1768"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,11,8]]},"references-count":50,"journal-issue":{"issue":"11","published-online":{"date-parts":[[2018,11]]}},"alternative-id":["rs10111768"],"URL":"https:\/\/doi.org\/10.3390\/rs10111768","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2018,11,8]]}}}