{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,7,17]],"date-time":"2026-07-17T12:22:17Z","timestamp":1784290937935,"version":"3.55.0"},"reference-count":53,"publisher":"MDPI AG","issue":"13","license":[{"start":{"date-parts":[[2022,6,30]],"date-time":"2022-06-30T00:00:00Z","timestamp":1656547200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Strategic Priority Research Program of the Chinese Academy of Sciences (CAS)","award":["XDA19040301"],"award-info":[{"award-number":["XDA19040301"]}]},{"name":"Strategic Priority Research Program of the Chinese Academy of Sciences (CAS)","award":["CAS-WX2021PY-0109"],"award-info":[{"award-number":["CAS-WX2021PY-0109"]}]},{"name":"Strategic Priority Research Program of the Chinese Academy of Sciences (CAS)","award":["E0V00110YZ"],"award-info":[{"award-number":["E0V00110YZ"]}]},{"name":"Informatization Plan of Chinese Academy of Sciences","award":["XDA19040301"],"award-info":[{"award-number":["XDA19040301"]}]},{"name":"Informatization Plan of Chinese Academy of Sciences","award":["CAS-WX2021PY-0109"],"award-info":[{"award-number":["CAS-WX2021PY-0109"]}]},{"name":"Informatization Plan of Chinese Academy of Sciences","award":["E0V00110YZ"],"award-info":[{"award-number":["E0V00110YZ"]}]},{"name":"Institute of Geographic Sciences and Natural Resources Research of CAS","award":["XDA19040301"],"award-info":[{"award-number":["XDA19040301"]}]},{"name":"Institute of Geographic Sciences and Natural Resources Research of CAS","award":["CAS-WX2021PY-0109"],"award-info":[{"award-number":["CAS-WX2021PY-0109"]}]},{"name":"Institute of Geographic Sciences and Natural Resources Research of CAS","award":["E0V00110YZ"],"award-info":[{"award-number":["E0V00110YZ"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Convolutional neural network (CNN)-based very high-resolution (VHR) image segmentation has become a common way of extracting building footprints. Despite publicly available building datasets and pre-trained CNN models, it is still necessary to prepare sufficient labeled image tiles to train CNN models from scratch or update the parameters of pre-trained CNN models to extract buildings accurately in real-world applications, especially the large-scale building extraction, due to differences in landscapes and data sources. Deep active learning is an effective technique for resolving this issue. This study proposes a framework integrating two state-of-the-art (SOTA) models, U-Net and DeeplabV3+, three commonly used active learning strategies, (i.e., margin sampling, entropy, and vote entropy), and landscape characterization to illustrate the performance of active learning in reducing the effort of data annotation, and then understand what kind of image tiles are more advantageous for CNN-based building extraction. The framework enables iteratively selecting the most informative image tiles from the unlabeled dataset for data annotation, training the CNN models, and analyzing the changes in model performance. It also helps us to understand the landscape features of iteratively selected image tiles via active learning by considering building as the focal class and computing the percent, the number of patches, edge density, and landscape shape index of buildings based on labeled tiles in each selection. The proposed method was evaluated on two benchmark building datasets, WHU satellite dataset II and WHU aerial dataset. Models in each iteration were trained from scratch on all labeled tiles. Experimental results based on the two datasets indicate that, for both U-Net and DeeplabV3+, the three active learning strategies can reduce the number of image tiles to be annotated and achieve good model performance with fewer labeled image tiles. Moreover, image tiles with more building patches, larger areas of buildings, longer edges of buildings, and more dispersed building distribution patterns were more effective for model training. The study not only provides a framework to reduce the data annotation efforts in CNN-based building extraction but also summarizes the preliminary suggestions for data annotation, which could facilitate and guide data annotators in real-world applications.<\/jats:p>","DOI":"10.3390\/rs14133147","type":"journal-article","created":{"date-parts":[[2022,7,1]],"date-time":"2022-07-01T01:40:36Z","timestamp":1656639636000},"page":"3147","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":14,"title":["Suggestive Data Annotation for CNN-Based Building Footprint Mapping Based on Deep Active Learning and Landscape Metrics"],"prefix":"10.3390","volume":"14","author":[{"given":"Zhichao","family":"Li","sequence":"first","affiliation":[{"name":"Key Laboratory of Land Surface Pattern and Simulation, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Shuai","family":"Zhang","sequence":"additional","affiliation":[{"name":"Department of Computer Science, The University of Manchester, Manchester M13 9PL, UK"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5687-803X","authenticated-orcid":false,"given":"Jinwei","family":"Dong","sequence":"additional","affiliation":[{"name":"Key Laboratory of Land Surface Pattern and Simulation, Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2022,6,30]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"044003","DOI":"10.1088\/1748-9326\/4\/4\/044003","article-title":"A new map of global urban extent from MODIS satellite data","volume":"4","author":"Schneider","year":"2009","journal-title":"Environ. Res. Lett."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"1161","DOI":"10.1177\/2399808320921208","article-title":"Classifying settlement types from multi-scale spatial patterns of building footprints","volume":"48","author":"Jochem","year":"2021","journal-title":"Environ. Plan. B Urban Anal. City Sci."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"167","DOI":"10.1146\/annurev-environ-100809-125336","article-title":"The New Geography of Contemporary Urbanization and the Environment","volume":"35","author":"Seto","year":"2010","journal-title":"Annu. Rev. Environ. Resour."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"114417","DOI":"10.1016\/j.eswa.2020.114417","article-title":"A review of deep learning methods for semantic segmentation of remote sensing imagery","volume":"169","author":"Yuan","year":"2021","journal-title":"Expert Syst. Appl."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Zhao, F., and Zhang, C. (2020, January 11\u201313). Building Damage Evaluation from Satellite Imagery using Deep Learning. Proceedings of the 2020 IEEE 21st International Conference on Information Reuse and Integration for Data Science (IRI), Las Vegas, NV, USA.","DOI":"10.1109\/IRI49571.2020.00020"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Pan, Z., Xu, J., Guo, Y., Hu, Y., and Wang, G. (2020). Deep Learning Segmentation and Classification for Urban Village Using a Worldview Satellite Image Based on U-Net. Remote Sens., 12.","DOI":"10.3390\/rs12101574"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Wagner, F.H., Dalagnol, R., Tarabalka, Y., Segantine, T.Y., Thom\u00e9, R., and Hirye, M. (2020). U-net-id, an instance segmentation model for building extraction from satellite images\u2014Case study in the Joanopolis City, Brazil. Remote Sens., 12.","DOI":"10.3390\/rs12101544"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"1501","DOI":"10.1080\/10106049.2020.1778100","article-title":"Automatic building footprint extraction from very high-resolution imagery using deep learning techniques","volume":"37","author":"Rastogi","year":"2020","journal-title":"Geocarto Int."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Li, C., Fu, L., Zhu, Q., Zhu, J., Fang, Z., Xie, Y., Guo, Y., and Gong, Y. (2021). Attention Enhanced U-Net for Building Extraction from Farmland Based on Google and WorldView-2 Remote Sensing Images. Remote Sens., 13.","DOI":"10.3390\/rs13214411"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Pasquali, G., Iannelli, G.C., and Dell\u2019Acqua, F. (2019). Building footprint extraction from multispectral, spaceborne earth observation datasets using a structurally optimized U-Net convolutional neural network. Remote Sens., 11.","DOI":"10.3390\/rs11232803"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Touzani, S., and Granderson, J. (2021). Open Data and Deep Semantic Segmentation for Automated Extraction of Building Footprints. Remote Sens., 13.","DOI":"10.3390\/rs13132578"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Yang, N., and Tang, H. (2020). GeoBoost: An Incremental Deep Learning Approach toward Global Mapping of Buildings from VHR Remote Sensing Images. Remote Sens., 12.","DOI":"10.3390\/rs12111794"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"11530","DOI":"10.1109\/JSTARS.2021.3123398","article-title":"A Large-Scale Mapping Scheme for Urban Building From Gaofen-2 Images Using Deep Learning and Hierarchical Approach","volume":"14","author":"Zhou","year":"2021","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"2600","DOI":"10.1109\/JSTARS.2018.2835377","article-title":"Building Extraction at Scale Using Convolutional Neural Network: Mapping of the United States","volume":"11","author":"Yang","year":"2018","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"574","DOI":"10.1109\/TGRS.2018.2858817","article-title":"Fully Convolutional Networks for Multisource Building Extraction fom an Open Aerial and Satellite Imagery Data Set","volume":"57","author":"Ji","year":"2018","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"42","DOI":"10.1016\/j.isprsjprs.2018.11.011","article-title":"TEMPORARY REMOVAL: Aerial imagery for roof segmentation: A large-scale dataset towards automatic mapping of buildings","volume":"147","author":"Chen","year":"2019","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_17","unstructured":"Van Etten, A., Lindenbaum, D., and Bacastow, T.M. (2018). Spacenet: A remote sensing dataset and challenge series. arXiv."},{"key":"ref_18","unstructured":"Mace, E., Manville, K., Barbu-McInnis, M., Laielli, M., Klaric, M.K., and Dooley, S. (2018). Overhead Detection: Beyond 8-bits and RGB. arXiv."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"20118","DOI":"10.1109\/ACCESS.2022.3149052","article-title":"A Survey of Deep Learning-Based Object Detection Methods and Datasets for Overhead Imagery","volume":"10","author":"Kang","year":"2022","journal-title":"IEEE Access"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Li, W., He, C., Fang, J., Zheng, J., Fu, H., and Yu, L. (2019). Semantic Segmentation-Based Building Footprint Extraction Using Very High-Resolution Satellite Images and Multi-Source GIS Data. Remote Sens., 11.","DOI":"10.3390\/rs11040403"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Maggiori, E., Tarabalka, Y., Charpiat, G., and Alliez, P. (2017, January 23\u201328). Can semantic labeling methods generalize to any city? the inria aerial image labeling benchmark. Proceedings of the 2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Fort Worth, TX, USA.","DOI":"10.1109\/IGARSS.2017.8127684"},{"key":"ref_22","unstructured":"Mnih, V. (2013). Machine Learning for Aerial Image Labeling, University of Toronto."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Chen, Q., Zhang, Y., Li, X., and Tao, P. (2022). Extracting Rectified Building Footprints from Traditional Orthophotos: A New Workflow. Sensors, 22.","DOI":"10.3390\/s22010207"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Rahman, A.K.M.M., Zaber, M., Cheng, Q., Nayem, A.B.S., Sarker, A., Paul, O., and Shibasaki, R. (2021). Applying State-of-the-Art Deep-Learning Methods to Classify Urban Cities of the Developing World. Sensors, 21.","DOI":"10.3390\/s21227469"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Gergelova, M.B., Labant, S., Kuzevic, S., Kuzevicova, Z., and Pavolova, H. (2020). Identification of Roof Surfaces from LiDAR Cloud Points by GIS Tools: A Case Study of Lu\u010denec, Slovakia. Sustainability, 12.","DOI":"10.3390\/su12176847"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Li, J., Meng, L., Yang, B., Tao, C., Li, L., and Zhang, W. (2021). LabelRS: An Automated Toolbox to Make Deep Learning Samples from Remote Sensing Images. Remote Sens., 13.","DOI":"10.3390\/rs13112064"},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"15014","DOI":"10.3390\/rs71115014","article-title":"Accurate Annotation of Remote Sensing Images via Active Spectral Clustering with Little Expert Knowledge","volume":"7","author":"Xia","year":"2015","journal-title":"Remote Sens."},{"key":"ref_28","first-page":"1","article-title":"A survey of deep active learning","volume":"54","author":"Ren","year":"2021","journal-title":"ACM Comput. Surv. (CSUR)"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Robinson, C., Ortiz, A., Malkin, K., Elias, B., Peng, A., Morris, D., Dilkina, B., and Jojic, N. (2020, January 7\u201312). Human-machine collaboration for fast land cover mapping. Proceedings of the AAAI Conference on Artificial Intelligence, New York, NY, USA.","DOI":"10.1609\/aaai.v34i03.5633"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"76","DOI":"10.1016\/j.isprsjprs.2020.10.018","article-title":"From local to global: A transfer learning-based approach for mapping poplar plantations at national scale using Sentinel-2","volume":"171","author":"Hamrouni","year":"2021","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"9378","DOI":"10.1109\/TGRS.2019.2926434","article-title":"An active deep learning approach for minimally supervised PolSAR image classification","volume":"57","author":"Bi","year":"2019","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"4057","DOI":"10.1080\/01431161.2020.1714774","article-title":"Using convolutional neural networks incorporating hierarchical active learning for target-searching in large-scale remote sensing images","volume":"41","author":"Xu","year":"2020","journal-title":"Int. J. Remote Sens."},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Yang, L., Zhang, Y., Chen, J., Zhang, S., and Chen, D.Z. (2017, January 11\u201313). Suggestive annotation: A deep active learning framework for biomedical image segmentation. Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention, Quebec City, QC, Canada.","DOI":"10.1007\/978-3-319-66179-7_46"},{"key":"ref_34","unstructured":"McGarigal, K., Cushman, S.A., and Ene, E. (2022, May 01). FRAGSTATS v4: Spatial Pattern Analysis Program for Categorical and Continuous Maps. Available online: http:\/\/www.umass.edu\/landeco\/research\/fragstats\/fragstats.html."},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"63","DOI":"10.1007\/s40823-017-0026-0","article-title":"Landscape metrics: Past progress and future directions","volume":"2","author":"Frazier","year":"2017","journal-title":"Curr. Landsc. Ecol. Rep."},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Li, Z., Roux, E., Dessay, N., Girod, R., Stefani, A., Nacher, M., Moiret, A., and Seyler, F. (2016). Mapping a knowledge-based malaria hazard index related to landscape using remote sensing: Application to the cross-border area between French Guiana and Brazil. Remote Sens., 8.","DOI":"10.3390\/rs8040319"},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Li, Z., Feng, Y., Dessay, N., Delaitre, E., Gurgel, H., and Gong, P. (2019). Continuous monitoring of the spatio-temporal patterns of surface water in response to land use and land cover types in a Mediterranean lagoon complex. Remote Sens., 11.","DOI":"10.20944\/preprints201905.0119.v1"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Yang, H., Xu, M., Chen, Y., Wu, W., and Dong, W. (2022). A Postprocessing Method Based on Regions and Boundaries Using Convolutional Neural Networks and a New Dataset for Building Extraction. Remote Sens., 14.","DOI":"10.3390\/rs14030647"},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Ronneberger, O., Fischer, P., and Brox, T. (2015, January 5\u20139). U-net: Convolutional networks for biomedical image segmentation. Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention, Munich, Germany.","DOI":"10.1007\/978-3-319-24574-4_28"},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"82031","DOI":"10.1109\/ACCESS.2021.3086020","article-title":"U-Net and Its Variants for Medical Image Segmentation: A Review of Theory and Applications","volume":"9","author":"Siddique","year":"2021","journal-title":"IEEE Access"},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Chen, L.-C., Zhu, Y., Papandreou, G., Schroff, F., and Adam, H. (2018, January 8\u201314). Encoder-decoder with atrous separable convolution for semantic image segmentation. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01234-2_49"},{"key":"ref_42","unstructured":"Settles, B. (2009). Active Learning Literature Survey, University of Wisconsin-Madison."},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Bosch, M. (2019). PyLandStats: An open-source Pythonic library to compute landscape metrics. PLoS ONE, 14.","DOI":"10.1101\/715052"},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"634","DOI":"10.1111\/2041-210X.12198","article-title":"Measuring habitat fragmentation: An evaluation of landscape pattern metrics","volume":"5","author":"Wang","year":"2014","journal-title":"Methods Ecol. Evol."},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Long, J., Shelhamer, E., and Darrell, T. (2015, January 7\u201312). Fully convolutional networks for semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298965"},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"1","DOI":"10.12942\/lrlr-2009-1","article-title":"Landscape Metrics and Indices: An Overview of Their Use in Landscape Research","volume":"3","author":"Uuemaa","year":"2009","journal-title":"Living Rev. Landsc. Res."},{"key":"ref_47","first-page":"26","article-title":"Selecting landscape metrics as indicators of spatial heterogeneity\u2014A comparison among Greek landscapes","volume":"26","author":"Plexida","year":"2014","journal-title":"Int. J. Appl. Earth Obs. Geoinf."},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"691","DOI":"10.1016\/j.ecolind.2007.12.002","article-title":"Parsimony in landscape metrics: Strength, universality, and consistency","volume":"8","author":"Cushman","year":"2008","journal-title":"Ecol. Indic."},{"key":"ref_49","unstructured":"Openshaw, S. (1981). The modifiable areal unit problem. Quant. Geogr. A Br. View, 60\u201369. Available online: https:\/\/cir.nii.ac.jp\/crid\/1572824498971908736."},{"key":"ref_50","doi-asserted-by":"crossref","first-page":"1494","DOI":"10.1109\/JSTARS.2022.3146430","article-title":"Res2-Unet, a New Deep Architecture for Building Detection from High Spatial Resolution Images","volume":"15","author":"Chen","year":"2022","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_51","doi-asserted-by":"crossref","unstructured":"Zhao, K., Kang, J., Jung, J., and Sohn, G. (2018, January 18\u201322). Building Extraction from Satellite Images Using Mask R-CNN with Building Boundary Regularization. Proceedings of the 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Salt Lake City, UT, USA.","DOI":"10.1109\/CVPRW.2018.00045"},{"key":"ref_52","doi-asserted-by":"crossref","first-page":"563","DOI":"10.1016\/j.neucom.2020.10.115","article-title":"HAL: Hybrid active learning for efficient labeling in medical domain","volume":"456","author":"Wu","year":"2021","journal-title":"Neurocomputing"},{"key":"ref_53","doi-asserted-by":"crossref","first-page":"108278","DOI":"10.1016\/j.knosys.2022.108278","article-title":"One-shot active learning for image segmentation via contrastive learning and diversity-based sampling","volume":"241","author":"Jin","year":"2022","journal-title":"Knowl. Based Syst."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/14\/13\/3147\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T23:41:13Z","timestamp":1760139673000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/14\/13\/3147"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,6,30]]},"references-count":53,"journal-issue":{"issue":"13","published-online":{"date-parts":[[2022,7]]}},"alternative-id":["rs14133147"],"URL":"https:\/\/doi.org\/10.3390\/rs14133147","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,6,30]]}}}