{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,22]],"date-time":"2026-01-22T14:19:48Z","timestamp":1769091588953,"version":"3.49.0"},"reference-count":59,"publisher":"MDPI AG","issue":"9","license":[{"start":{"date-parts":[[2019,5,1]],"date-time":"2019-05-01T00:00:00Z","timestamp":1556668800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["41861062, 41401526, and 41861052"],"award-info":[{"award-number":["41861062, 41401526, and 41861052"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Natural Science Foundation of Jiangxi Province of China","award":["20171BAB213025 and 20181BAB203022"],"award-info":[{"award-number":["20171BAB213025 and 20181BAB203022"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Automatic building extraction using a single data type, either 2D remotely-sensed images or light detection and ranging 3D point clouds, remains insufficient to accurately delineate building outlines for automatic mapping, despite active research in this area and the significant progress which has been achieved in the past decade. This paper presents an effective approach to extracting buildings from Unmanned Aerial Vehicle (UAV) images through the incorporation of superpixel segmentation and semantic recognition. A framework for building extraction is constructed by jointly using an improved Simple Linear Iterative Clustering (SLIC) algorithm and Multiscale Siamese Convolutional Networks (MSCNs). 
The SLIC algorithm, improved by additionally imposing a digital surface model for superpixel segmentation, namely 6D-SLIC, is suited for building boundary detection under building and image backgrounds with similar radiometric signatures. The proposed MSCNs, including a feature learning network and a binary decision network, are used to automatically learn a multiscale hierarchical feature representation and detect building objects under various complex backgrounds. In addition, a gamma-transform green leaf index is proposed to truncate vegetation superpixels for further processing to improve the robustness and efficiency of building detection, and the Douglas\u2013Peucker algorithm and iterative optimization are used to eliminate jagged details generated from small structures as a result of superpixel segmentation. In the experiments, the UAV datasets, including many buildings in urban and rural areas with irregular shapes and different heights and that are obscured by trees, are collected to evaluate the proposed method. The experimental results based on the qualitative and quantitative measures confirm the effectiveness and high accuracy of the proposed framework relative to the digitized results. 
The proposed framework performs better than state-of-the-art building extraction methods, given its higher values of recall, precision, and intersection over Union (IoU).<\/jats:p>","DOI":"10.3390\/rs11091040","type":"journal-article","created":{"date-parts":[[2019,5,2]],"date-time":"2019-05-02T03:15:22Z","timestamp":1556766922000},"page":"1040","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":23,"title":["Building Extraction from UAV Images Jointly Using 6D-SLIC and Multiscale Siamese Convolutional Networks"],"prefix":"10.3390","volume":"11","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-9361-0219","authenticated-orcid":false,"given":"Haiqing","family":"He","sequence":"first","affiliation":[{"name":"School of Geomatics, East China University of Technology, Nanchang 330013, China"},{"name":"Key Laboratory of Watershed Ecology and Geographical Environment Monitoring, National Administration of Surveying, Mapping and Geoinformation, Nanchang 330013, China"}]},{"given":"Junchao","family":"Zhou","sequence":"additional","affiliation":[{"name":"School of Geomatics, East China University of Technology, Nanchang 330013, China"}]},{"given":"Min","family":"Chen","sequence":"additional","affiliation":[{"name":"Faculty of Geosciences and Environmental Engineering, Southwest Jiaotong University, Chengdu 611756, China"}]},{"given":"Ting","family":"Chen","sequence":"additional","affiliation":[{"name":"School of Water Resources &amp; Environmental Engineering, East China University of Technology, Nanchang 330013, China"}]},{"given":"Dajun","family":"Li","sequence":"additional","affiliation":[{"name":"School of Geomatics, East China University of Technology, Nanchang 330013, China"}]},{"given":"Penggen","family":"Cheng","sequence":"additional","affiliation":[{"name":"School of Geomatics, East China University of Technology, Nanchang 330013, 
China"}]}],"member":"1968","published-online":{"date-parts":[[2019,5,1]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Gilani, S.A.N., Awrangjeb, M., and Lu, G. (2016). An automatic building extraction and regularisation technique using LiDAR point cloud data and orthoimage. Remote Sens., 8.","DOI":"10.3390\/rs8030258"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Wu, G., Guo, Z., Shi, X., Chen, Q., Xu, Y., Shibasaki, R., and Shao, X. (2018). A boundary regulated network for accurate roof segmentation and outline extraction. Remote Sens., 10.","DOI":"10.3390\/rs10081195"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Castagno, J., and Atkins, E. (2018). Roof shape classification from LiDAR and satellite image data fusion using supervised learning. Sensors, 18.","DOI":"10.3390\/s18113960"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Du, S., Zhang, Y., Qin, R., Yang, Z., Zou, Z., Tang, Y., and Fan, C. (2016). Building change detection using old aerial images and new LiDAR data. Remote Sens., 8.","DOI":"10.3390\/rs8121030"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"1077","DOI":"10.1080\/17538947.2016.1269841","article-title":"Building segmentation and outline extraction from UAV image-derived point clouds by a line growing algorithm","volume":"10","author":"Dai","year":"2017","journal-title":"Int. J. Digit. Earth"},{"key":"ref_6","first-page":"150","article-title":"Automatic urban building boundary extraction from high resolution aerial images using an innovative model of active contours","volume":"12","author":"Ahmadi","year":"2010","journal-title":"Int. J. Appl. Earth Obs. 
Geoinf."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"721","DOI":"10.14358\/PERS.77.7.721","article-title":"A multidirectional and multiscale morphological index for automatic building extraction from multispectral geoeye-1 imagery","volume":"77","author":"Huang","year":"2011","journal-title":"Photogramm. Eng. Remote Sens."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"5094","DOI":"10.1080\/01431161.2014.933278","article-title":"Automatic building extraction in dense urban areas through GeoEye multispectral imagery","volume":"35","author":"Ghanea","year":"2014","journal-title":"Int. J. Remote Sens."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Chen, R., Li, X., and Li, J. (2018). Object-based features for house detection from RGB high-resolution images. Remote Sens., 10.","DOI":"10.3390\/rs10030451"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Yang, H., Wu, P., Yao, X., Wu, Y., Wang, B., and Xu, Y. (2018). Building extraction in very high resolution imagery by dense-attention networks. Remote Sens., 10.","DOI":"10.3390\/rs10111768"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"1554","DOI":"10.1109\/TGRS.2009.2030180","article-title":"Segmentation and reconstruction of polyhedral building roofs from aerial Lidar point clouds. IEEE Trans","volume":"48","author":"Sampath","year":"2010","journal-title":"Geosci. Remote Sens."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"6497","DOI":"10.1080\/01431161.2012.690083","article-title":"Urban building roof segmentation from airborne lidar point clouds","volume":"33","author":"Chen","year":"2012","journal-title":"Int. J. Remote Sens."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Xu, B., Jiang, W., Shan, J., Zhang, J., and Li, L. (2016). Investigation on the weighted RANSAC approaches for building roof plane segmentation from LiDAR point clouds. 
Remote Sens., 8.","DOI":"10.3390\/rs8010005"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"183","DOI":"10.1016\/j.isprsjprs.2014.04.022","article-title":"A global optimization approach to roof segmentation from airborne lidar point clouds","volume":"94","author":"Yan","year":"2014","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"294","DOI":"10.1016\/j.isprsjprs.2017.06.005","article-title":"Automatic building extraction from LiDAR data fusion of point and grid-based features","volume":"130","author":"Du","year":"2017","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"5135","DOI":"10.1080\/01431161.2012.659355","article-title":"Building detection in an urban area using lidar data and QuickBird imagery","volume":"33","author":"Chen","year":"2012","journal-title":"Int. J. Remote Sens."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.isprsjprs.2013.05.006","article-title":"Automatic extraction of building roofs using LiDAR data and multispectral imagery","volume":"83","author":"Awrangjeb","year":"2013","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"406","DOI":"10.1109\/TGRS.2013.2240692","article-title":"Building change detection based on Satellite stereo imagery and digital surface models","volume":"52","author":"Tian","year":"2014","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Crommelinck, S., Bennett, R., Gerke, M., Nex, F., Yang, M.Y., and Vosselman, G. (2016). Review of automatic feature extraction from high-resolution optical sensor data for UAV-based cadastral mapping. Remote Sens., 8.","DOI":"10.3390\/rs8080689"},{"key":"ref_20","unstructured":"Achanta, R., Shaji, A., Smith, K., Lucchi, A., Fua, P., and S\u00fcsstrunk, S. (2010). 
SLIC Superpixels, School of Computer and Communication Sciences, Ecole Polytechnique Fedrale de Lausanne. EPFL Technical Report No. 149300."},{"key":"ref_21","unstructured":"Krizhevsky, A., Sutskever, I., and Hinton, G.E. (2012, January 3\u20136). ImageNet classification with deep convolutional neural networks. Proceedings of the Conference on Neural Information Processing Systems (NIPS12), Lake Tahoe, NV, USA."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Karpathy, A., Toderici, G., Shetty, S., Leung, T., Sukthankar, R., and Li, F.-F. (2014, January 23\u201328). Large-scale video classification with convolutional neural networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.223"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"1797","DOI":"10.1109\/LGRS.2014.2309695","article-title":"Vehicle detection in satellite images by hybrid deep convolutional neural networks","volume":"11","author":"Chen","year":"2014","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"4806","DOI":"10.1109\/TGRS.2016.2551720","article-title":"Target classification using the deep convolutional networks for SAR images","volume":"54","author":"Chen","year":"2016","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"He, H., Chen, M., Chen, T., and Li, D. (2018). Matching of remote sensing images with complex background variations via Siamese convolutional neural network. Remote Sens., 10.","DOI":"10.3390\/rs10020355"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"516","DOI":"10.1080\/2150704X.2019.1577572","article-title":"Learning to match multitemporal optical satellite images using multi-support-patches Siamese networks","volume":"10","author":"He","year":"2019","journal-title":"Remote Sens. 
Lett."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Long, J., Shelhamer, E., and Darrel, T. (2015, January 7\u201312). Fully convolutional networks for semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298965"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Bittner, K., Cui, S., and Reinartz, P. (2017, January 6\u20139). Building extraction from remote sensing data using fully convolutional networks. Proceedings of the International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, Hannover, Germany.","DOI":"10.5194\/isprs-archives-XLII-1-W1-481-2017"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Ronneberger, O., Fischer, P., and Brox, T. (2015, January 5\u20139). U-Net: Convolutional networks for biomedical image segmentation. Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention, Munich, Germany.","DOI":"10.1007\/978-3-319-24574-4_28"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Xu, Y., Wu, L., Xie, Z., and Chen, Z. (2018). Building extraction in very high resolution remote sensing imagery using deep learning and guided filters. Remote Sens., 10.","DOI":"10.3390\/rs10010144"},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"257","DOI":"10.1016\/0031-3203(85)90051-2","article-title":"A quad-tree approach to image segmentation which combines statistical and spatial information","volume":"18","author":"Spann","year":"1985","journal-title":"Pattern Recogn."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"187","DOI":"10.3233\/FI-2000-411207","article-title":"The watershed transform: Definitions, algorithms and parallelization and strategies","volume":"41","author":"Roerdink","year":"2000","journal-title":"Fundam. Inform."},{"key":"ref_33","unstructured":"Strobl, J., Blaschke, T., and Griesebner, G. 
(2000). Multiresolution segmentation: An optimization approach for high quality multi-scale image segmentation. Angewandte Geographische Informationsverarbeitung XII, Wichmann-Verlag."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"2825","DOI":"10.1080\/01431161003745608","article-title":"Multi-scale GEOBIA with very high spatial resolution digital aerial imagery: Scale, texture and image objects","volume":"32","author":"Kim","year":"2011","journal-title":"Int. J. Remote Sens."},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"5186","DOI":"10.1080\/01431161.2017.1325536","article-title":"Scale computation on high spatial resolution remotely sensed imagery multi-scale segmentation","volume":"38","author":"Liu","year":"2017","journal-title":"Int. J. Remote Sens."},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"67","DOI":"10.1016\/j.isprsjprs.2014.07.002","article-title":"Comparing supervised and unsupervised multiresolution segmentation approaches for extracting buildings from very high resolution imagery","volume":"96","author":"Belgiu","year":"2014","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Csillik, O. (2017). Fast segmentation and classification of very high resolution remote sensing data using SLIC superpixels. Remote Sens., 9.","DOI":"10.3390\/rs9030243"},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"1915","DOI":"10.1109\/TPAMI.2012.231","article-title":"Learning hierarchical features for scene labeling","volume":"35","author":"Farabet","year":"2013","journal-title":"IEEE Trans. Pattern Anal."},{"key":"ref_39","unstructured":"Badrinarayanan, V., Handa, A., and Cipolla, R. (2015). SegNet: A deep convolutional encoder-decoder architecture for robust semantic pixel-wise labeling. 
arXiv."},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"640","DOI":"10.1109\/TPAMI.2016.2572683","article-title":"Fully convolutional networks for semantic segmentation","volume":"39","author":"Shelhamer","year":"2017","journal-title":"IEEE Trans. Pattern Anal."},{"key":"ref_41","unstructured":"Lui, M.Y., Tuzel, O., Ramalingam, S., and Chellappa, R. (2011, January 20\u201325). Entropy rate superpixel segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Colorado Springs, CO, USA."},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Van den Bergh, M., Boix, X., Roig, G., de Capitani, B., and Van Gool, L. (2012, January 7\u201313). SEEDS: Superpixels extracted via energy-driven sampling. Proceedings of the European Conference on Computer Vision, Florence, Italy.","DOI":"10.1007\/978-3-642-33786-4_2"},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Neubert, P., and Protzel, P. (2014, January 24\u201328). Compact watershed and preemptive SLIC: On improving trade-offs of superpixel segmentation algorithms. Proceedings of the International Conference on Pattern Recognition, Stockholm, Sweden.","DOI":"10.1109\/ICPR.2014.181"},{"key":"ref_44","unstructured":"Li, Z., and Chen, J. (2015, January 7\u201312). Superpixel segmentation using linear spectral clustering. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA."},{"key":"ref_45","unstructured":"Neubert, P., and Protzel, P. Superpixel benchmark and comparison. Proceedings of the Forum Bildverarbeitung 2012."},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"127","DOI":"10.1016\/0034-4257(79)90013-0","article-title":"Red and photographic infrared linear combinations for monitoring vegetation","volume":"8","author":"Tucker","year":"1979","journal-title":"Remote Sens. 
Environ."},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"76","DOI":"10.1016\/S0034-4257(01)00289-9","article-title":"Novel algorithms for remote estimation of vegetation fraction","volume":"80","author":"Gitelson","year":"2002","journal-title":"Remote Sens. Environ."},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"179","DOI":"10.2111\/05-069R1.1","article-title":"The accuracy of ground-cover measurements","volume":"59","author":"Booth","year":"2006","journal-title":"Rangel. Ecol. Manag."},{"key":"ref_49","unstructured":"Ok, A.\u00d6. (2008, January 5\u20138). Robust detection of buildings from a single color aerial image. Proceedings of the GEOBIA 2008, Calgary, AB, Canada. Part 4\/C1."},{"key":"ref_50","doi-asserted-by":"crossref","first-page":"282","DOI":"10.1016\/j.compag.2008.03.009","article-title":"Verification of color vegetation indices for automated crop imaging applications","volume":"63","author":"Meyer","year":"2008","journal-title":"Comput. Electron. Agric."},{"key":"ref_51","unstructured":"Ioffe, S., and Szegedy, C. (2015, January 6\u201311). Batch normalization: Accelerating deep network training by reducing internal covariate shift. Proceedings of the 32nd International Conference on Machine Learning (ICML-15), Lille, France."},{"key":"ref_52","doi-asserted-by":"crossref","first-page":"112","DOI":"10.3138\/FM57-6770-U75U-7727","article-title":"Algorithms for the reduction of the number of points required to represent a digitized line or its caricature","volume":"10","author":"Douglas","year":"1973","journal-title":"Can. Cartogr."},{"key":"ref_53","doi-asserted-by":"crossref","first-page":"7","DOI":"10.1559\/152304099782424901","article-title":"Topologically consistent line simplification with the Douglas-Peucker algorithm","volume":"26","author":"Saalfeld","year":"1999","journal-title":"Cartogr. Geogr. Inf. 
Sci."},{"key":"ref_54","doi-asserted-by":"crossref","first-page":"189","DOI":"10.1007\/s11263-007-0107-3","article-title":"Modeling the world from Internet photo collections","volume":"80","author":"Snavely","year":"2008","journal-title":"Int. J. Comput. Vis."},{"key":"ref_55","unstructured":"Rothermel, M., Wenzel, K., Fritsch, D., and Haala, N. (2012, January 4\u20135). SURE: Photogrammetric surface reconstruction from imagery. Proceedings of the LC3D Workshop, Berlin, Germany."},{"key":"ref_56","doi-asserted-by":"crossref","first-page":"4184","DOI":"10.1109\/JSTARS.2014.2318694","article-title":"An automatic and threshold-free performance evaluation system for building extraction techniques from airborne Lidar data","volume":"7","author":"Awrangjeb","year":"2014","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_57","doi-asserted-by":"crossref","unstructured":"Demir, I., Koperski, K., Lindenbaum, D., Pang, G., Huang, J., Basu, S., Hughes, F., Tuia, D., and Raska, R. (2018, January 18\u201322). DeepGlobe 2018: A challenge to parse the earth through satellite images. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Salt Lake City, UT, USA.","DOI":"10.1109\/CVPRW.2018.00031"},{"key":"ref_58","doi-asserted-by":"crossref","first-page":"43","DOI":"10.1109\/TPAMI.2010.54","article-title":"Discriminative learning of local image descriptors","volume":"33","author":"Brown","year":"2011","journal-title":"IEEE Trans. Pattern Anal."},{"key":"ref_59","doi-asserted-by":"crossref","first-page":"443","DOI":"10.1007\/s10489-016-0762-6","article-title":"Rapid building detection using machine learning","volume":"45","author":"Cohen","year":"2016","journal-title":"Appl. 
Intell."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/11\/9\/1040\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T12:48:39Z","timestamp":1760186919000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/11\/9\/1040"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,5,1]]},"references-count":59,"journal-issue":{"issue":"9","published-online":{"date-parts":[[2019,5]]}},"alternative-id":["rs11091040"],"URL":"https:\/\/doi.org\/10.3390\/rs11091040","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,5,1]]}}}