{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,27]],"date-time":"2026-01-27T22:02:59Z","timestamp":1769551379887,"version":"3.49.0"},"reference-count":38,"publisher":"MDPI AG","issue":"13","license":[{"start":{"date-parts":[[2021,7,1]],"date-time":"2021-07-01T00:00:00Z","timestamp":1625097600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100011682","name":"Building Technologies Office","doi-asserted-by":"publisher","award":["Building Technologies Office : n\/a"],"award-info":[{"award-number":["Building Technologies Office : n\/a"]}],"id":[{"id":"10.13039\/100011682","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Advances in machine learning and computer vision, combined with increased access to unstructured data (e.g., images and text), have created an opportunity for automated extraction of building characteristics cost-effectively and at scale. These characteristics are relevant to a variety of urban and energy applications, yet are time-consuming and costly to acquire with today\u2019s manual methods. Several recent studies have shown that, in comparison to more traditional methods based on feature engineering, an end-to-end learning approach based on deep learning algorithms significantly improves the accuracy of automatic building footprint extraction from remote sensing images. However, these studies used limited benchmark datasets that have been carefully curated and labeled. How well the accuracy of these deep learning-based approaches holds up when using less curated training data has not received enough attention. 
The aim of this work is to leverage openly available data to automatically generate a larger training dataset with more variability in terms of regions and types of cities, which can be used to build more accurate deep learning models. In contrast to most benchmark datasets, the gathered data have not been manually curated. Thus, the training dataset is not perfectly clean in terms of remote sensing images exactly matching the ground truth building footprints. A workflow that includes data pre-processing, deep learning semantic segmentation modeling, and results post-processing is introduced and applied to a dataset that includes remote sensing images from 15 cities and five counties in various regions of the USA, covering 8,607,677 buildings. The accuracy of the proposed approach was measured on an out-of-sample testing dataset corresponding to 364,000 buildings from three USA cities. The results compared favorably to those obtained from Microsoft\u2019s recently released US building footprint dataset.<\/jats:p>","DOI":"10.3390\/rs13132578","type":"journal-article","created":{"date-parts":[[2021,7,1]],"date-time":"2021-07-01T12:03:27Z","timestamp":1625141007000},"page":"2578","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":36,"title":["Open Data and Deep Semantic Segmentation for Automated Extraction of Building Footprints"],"prefix":"10.3390","volume":"13","author":[{"given":"Samir","family":"Touzani","sequence":"first","affiliation":[{"name":"Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA"}]},{"given":"Jessica","family":"Granderson","sequence":"additional","affiliation":[{"name":"Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA"}]}],"member":"1968","published-online":{"date-parts":[[2021,7,1]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Robinson, C., Hohman, F., and Dilkina, B. (2017, January 7\u201310). 
A deep learning approach for population estimation from satellite imagery. Proceedings of the 1st ACM SIGSPATIAL Workshop on Geospatial Humanities, New York, NY, USA.","DOI":"10.1145\/3149858.3149863"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Rodriguez, A.C., and Wegner, J.D. (2018). Counting the uncountable: Deep semantic density estimation from space. German Conference on Pattern Recognition, Springer.","DOI":"10.1007\/978-3-030-12939-2_24"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"252","DOI":"10.1016\/j.enbuild.2018.11.008","article-title":"Development of City Buildings Dataset for Urban Building Energy Modeling","volume":"183","author":"Chen","year":"2019","journal-title":"Energy Build."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Wang, N., Goel, S., and Makhmalbaf, A. (2013). Commercial Building Energy Asset Score Program Overview and Technical Protocol (Version 1.1), Technical Report, PNNL-22045 Rev. 1.1.","DOI":"10.2172\/1108158"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Yu, M., Yang, C., and Li, Y. (2018). Big data in natural disaster management: A review. Geosciences, 8.","DOI":"10.3390\/geosciences8050165"},{"key":"ref_6","first-page":"20","article-title":"Evaluation of Change Detection Techniques using Very High Resolution Optical Satellite Imagery","volume":"2","author":"Cerovecki","year":"2015","journal-title":"Preface"},{"key":"ref_7","first-page":"117340O","article-title":"Semi-supervised learning for improved post-disaster damage assessment from satellite imagery","volume":"Volume 11734","author":"Oludare","year":"2021","journal-title":"Multimodal Image Exploitation and Learning 2021"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"He, H., Yang, D., Wang, S., Wang, S., and Li, Y. (2019). Road extraction by using atrous spatial pyramid pooling integrated encoder-decoder network and structural similarity loss. 
Remote Sens., 11.","DOI":"10.3390\/rs11091015"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Lin, Y., Xu, D., Wang, N., Shi, Z., and Chen, Q. (2020). Road Extraction from Very-High-Resolution Remote Sensing Images via a Nested SE-Deeplab Model. Remote Sens., 12.","DOI":"10.3390\/rs12182985"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Sharifzadeh, S., Tata, J., Sharifzadeh, H., and Tan, B. (2019). Farm Area Segmentation in Satellite Images Using DeepLabv3+ Neural Networks. International Conference on Data Management Technologies and Applications, Springer.","DOI":"10.1007\/978-3-030-54595-6_7"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Shrestha, S., and Vanneschi, L. (2018). Improved fully convolutional network with conditional random fields for building extraction. Remote Sens., 10.","DOI":"10.3390\/rs10071135"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Kang, W., Xiang, Y., Wang, F., and You, H. (2019). EU-Net: An Efficient Fully Convolutional Network for Building Extraction from Optical Remote Sensing Images. Remote Sens., 11.","DOI":"10.3390\/rs11232813"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"128774","DOI":"10.1109\/ACCESS.2019.2940527","article-title":"Automatic building extraction on high-resolution remote sensing imagery using deep convolutional encoder-decoder with spatial pyramid pooling","volume":"7","author":"Liu","year":"2019","journal-title":"IEEE Access."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"3308","DOI":"10.1080\/01431161.2018.1528024","article-title":"A scale robust convolutional neural network for automatic building extraction from aerial and satellite imagery","volume":"40","author":"Ji","year":"2019","journal-title":"Int. J. Remote Sens."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Valentijn, T., Margutti, J., van den Homberg, M., and Laaksonen, J. (2020). 
Multi-Hazard and Spatial Transferability of a CNN for Automated Building Damage Assessment. Remote Sens., 12.","DOI":"10.3390\/rs12172839"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Bai, Y., Hu, J., Su, J., Liu, X., Liu, H., He, X., Meng, S., Mas, E., and Koshimura, S. (2020). Pyramid Pooling Module-Based Semi-Siamese Network: A Benchmark Model for Assessing Building Damage from xBD Satellite Imagery Datasets. Remote Sens., 12.","DOI":"10.3390\/rs12244055"},{"key":"ref_17","unstructured":"Van Etten, A., Lindenbaum, D., and Bacastow, T.M. (2018). Spacenet: A remote sensing dataset and challenge series. arXiv."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Maggiori, E., Tarabalka, Y., Charpiat, G., and Alliez, P. (2017, January 23\u201328). Can semantic labeling methods generalize to any city? The Inria aerial image labeling benchmark. Proceedings of the 2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Fort Worth, TX, USA.","DOI":"10.1109\/IGARSS.2017.8127684"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"630","DOI":"10.1016\/j.buildenv.2018.12.025","article-title":"Automated urban energy system modeling and thermal building simulation based on OpenStreetMap data sets","volume":"149","author":"Schiefelbein","year":"2019","journal-title":"Build. Environ."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Brovelli, M., and Zamboni, G. (2018). A new method for the assessment of spatial accuracy and completeness of OpenStreetMap building footprints. ISPRS Int. J. Geoinf., 7.","DOI":"10.3390\/ijgi7080289"},{"key":"ref_21","unstructured":"Touzani, S., Wudunn, M., Zakhor, A., Pritoni, M., Singh, R., Bergmann, H., and Granderson, J. (2020, January 17\u201321). Machine Learning for Automated Extraction of Building Geometry. Proceedings of the ACEEE Summer Study on Energy Efficiency in Buildings, Pacific Grove, CA, USA."},{"key":"ref_22","unstructured":"(2019, September 25). Mapbox 2019. 
Available online: https:\/\/docs.mapbox.com\/."},{"key":"ref_23","unstructured":"(2020, December 07). Mapbox, 2018. Available online: https:\/\/www.openstreetmap.org\/user\/pratikyadav\/diary\/43954."},{"key":"ref_24","unstructured":"(2020, September 15). OSM Wiki: Slippy Map 2020. Available online: https:\/\/wiki.openstreetmap.org\/wiki\/Slippy_Map."},{"key":"ref_25","unstructured":"(2020, March 10). USBuildingFootprints, 2018. Microsoft. Available online: https:\/\/github.com\/microsoft\/USBuildingFootprints."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Long, J., Shelhamer, E., and Darrell, T. (2015, January 7\u201312). Fully convolutional networks for semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298965"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Ronneberger, O., Fischer, P., and Brox, T. (2015). U-net: Convolutional networks for biomedical image segmentation. International Conference on Medical Image Computing and Computer-Assisted Intervention, Springer.","DOI":"10.1007\/978-3-319-24574-4_28"},{"key":"ref_28","unstructured":"Badrinarayanan, V., Handa, A., and Cipolla, R. (2015). Segnet: A deep convolutional encoder-decoder architecture for robust semantic pixel-wise labelling. arXiv."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Chen, L.C., Zhu, Y., Papandreou, G., Schroff, F., and Adam, H. (2018, January 8\u201314). Encoder-decoder with atrous separable convolution for semantic image segmentation. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01234-2_49"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"211","DOI":"10.1007\/s11263-015-0816-y","article-title":"Imagenet large scale visual recognition challenge","volume":"115","author":"Russakovsky","year":"2015","journal-title":"Int. J. Comput. 
Vis."},{"key":"ref_31","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (July, January 26). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Sudre, C.H., Li, W., Vercauteren, T., Ourselin, S., and Cardoso, M.J. (2017). Generalised dice overlap as a deep learning loss function for highly unbalanced segmentations. Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support, Springer.","DOI":"10.1007\/978-3-319-67558-9_28"},{"key":"ref_33","unstructured":"Kingma, D.P., and Ba, J. (2014). Adam: A method for stochastic optimization. arXiv."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Ng, V., and Hofmann, D. (2018, January 9\u201315). Scalable Feature Extraction with aerial and Satellite Imagery. Proceedings of the 17th Python in Science Conference (SCIPY 2018), Austin, TX, USA.","DOI":"10.25080\/Majora-4af1f417-015"},{"key":"ref_35","first-page":"112","article-title":"Algorithms for the reduction of the number of points required to represent a digitized line or its caricature","volume":"10","author":"Douglas","year":"1973","journal-title":"Cartogr. Int. J. Geogr. Inf. Geovis."},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"457","DOI":"10.1016\/j.isprsjprs.2010.06.001","article-title":"Automatic detection of residential buildings using LIDAR data and multispectral imagery","volume":"65","author":"Awrangjeb","year":"2010","journal-title":"ISPRS J. Photogramm. Remote. 
Sens."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"3716","DOI":"10.3390\/rs6053716","article-title":"Automatic segmentation of raw LiDAR data for extraction of building roofs","volume":"6","author":"Awrangjeb","year":"2014","journal-title":"Remote Sens."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"729","DOI":"10.14358\/PERS.78.7.729","article-title":"Building detection in complex scenes thorough effective separation of buildings from trees","volume":"78","author":"Awrangjeb","year":"2012","journal-title":"Photogramm. Eng. Remote Sens."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/13\/2578\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T06:24:44Z","timestamp":1760163884000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/13\/2578"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,7,1]]},"references-count":38,"journal-issue":{"issue":"13","published-online":{"date-parts":[[2021,7]]}},"alternative-id":["rs13132578"],"URL":"https:\/\/doi.org\/10.3390\/rs13132578","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,7,1]]}}}