{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,25]],"date-time":"2026-04-25T14:27:53Z","timestamp":1777127273844,"version":"3.51.4"},"reference-count":47,"publisher":"MDPI AG","issue":"12","license":[{"start":{"date-parts":[[2021,6,11]],"date-time":"2021-06-11T00:00:00Z","timestamp":1623369600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"the National Key R&D Program of China","award":["2017YFB0504104"],"award-info":[{"award-number":["2017YFB0504104"]}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["41971280"],"award-info":[{"award-number":["41971280"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Satellite mapping of buildings and built-up areas used to be delineated from high spatial resolution (e.g., meters or sub-meters) and middle spatial resolution (e.g., tens of meters or hundreds of meters) satellite images, respectively. To the best of our knowledge, it is important to explore a deep-learning approach to delineate high-resolution semantic maps of buildings from middle-resolution satellite images. The approach is termed as super-resolution semantic segmentation in this paper. Specifically, we design a neural network with integrated low-level image features of super-resolution and high-level semantic features of super-resolution, which is trained with Sentinel-2A images (i.e., 10 m) and higher-resolution semantic maps (i.e., 2.5 m). The network, based on super-resolution semantic segmentation features is called FSRSS-Net. 
In China, the 35 cities are partitioned into three groups, i.e., 19 cities for model training, four cities for quantitative testing and the other 12 cities for qualitative generalization ability analysis of the learned networks. A large-scale sample dataset is created and utilized to train and validate the performance of the FSRSS-Net, which includes 8597 training samples and 766 quantitative accuracy evaluation samples. Quantitative evaluation results show that: (1) based on the 10 m Sentinel-2A image, the FSRSS-Net can achieve super-resolution semantic segmentation and produce 2.5 m building recognition results, and there is little difference between the accuracy of 2.5 m results by FSRSS-Net and 10 m results by U-Net. More importantly, the 2.5 m building recognition results by FSRSS-Net have higher accuracy than the 2.5 m results by U-Net 10 m building recognition results interpolation up-sampling; (2) from the spatial visualization of the results, the building recognition results of 2.5 m are more precise than those of 10 m, and the outline of the building is better depicted. 
Qualitative analysis shows that: (1) the learned FSRSS-Net can be also well generalized to other cities that are far from training regions; (2) the FSRSS-Net can still achieve comparable results to the U-Net 2 m building recognition results, even when the U-Net is directly trained using both 2-meter resolution GF2 satellite images and corresponding semantic labels.<\/jats:p>","DOI":"10.3390\/rs13122290","type":"journal-article","created":{"date-parts":[[2021,6,14]],"date-time":"2021-06-14T22:25:46Z","timestamp":1623709546000},"page":"2290","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":37,"title":["FSRSS-Net: High-Resolution Mapping of Buildings from Middle-Resolution Satellite Images Using a Super-Resolution Semantic Segmentation Network"],"prefix":"10.3390","volume":"13","author":[{"given":"Tao","family":"Zhang","sequence":"first","affiliation":[{"name":"State Key Laboratory of Remote Sensing Science, Faculty of Geographical Science, Beijing Normal University, Beijing 100875, China"},{"name":"Chongqing Geographic Information and Remote Sensing Application Center, Chongqing 401121, China"},{"name":"Beijing Key Laboratory for Remote Sensing of Environment and Digital Cities, Faculty of Geographical Science, Beijing Normal University, Beijing 100875, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4091-0175","authenticated-orcid":false,"given":"Hong","family":"Tang","sequence":"additional","affiliation":[{"name":"State Key Laboratory of Remote Sensing Science, Faculty of Geographical Science, Beijing Normal University, Beijing 100875, China"},{"name":"Beijing Key Laboratory for Remote Sensing of Environment and Digital Cities, Faculty of Geographical Science, Beijing Normal University, Beijing 100875, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yi","family":"Ding","sequence":"additional","affiliation":[{"name":"Chongqing 
Geographic Information and Remote Sensing Application Center, Chongqing 401121, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Penglong","family":"Li","sequence":"additional","affiliation":[{"name":"Chongqing Geographic Information and Remote Sensing Application Center, Chongqing 401121, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Chao","family":"Ji","sequence":"additional","affiliation":[{"name":"State Key Laboratory of Remote Sensing Science, Faculty of Geographical Science, Beijing Normal University, Beijing 100875, China"},{"name":"Beijing Key Laboratory for Remote Sensing of Environment and Digital Cities, Faculty of Geographical Science, Beijing Normal University, Beijing 100875, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Penglei","family":"Xu","sequence":"additional","affiliation":[{"name":"State Key Laboratory of Remote Sensing Science, Faculty of Geographical Science, Beijing Normal University, Beijing 100875, China"},{"name":"Beijing Key Laboratory for Remote Sensing of Environment and Digital Cities, Faculty of Geographical Science, Beijing Normal University, Beijing 100875, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2021,6,11]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"2295","DOI":"10.1007\/s11430-016-5291-y","article-title":"Global mapping of artificial surfaces at 30-m resolution","volume":"59","author":"Chen","year":"2016","journal-title":"Sci. China Earth Sci."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Momeni, R., Aplin, P., and Boyd, D.S. (2016). Mapping Complex Urban Land Cover from Spaceborne Imagery: The Influence of Spatial Resolution, Spectral Band Set and Classification Approach. 
Remote Sens., 8.","DOI":"10.3390\/rs8020088"},{"key":"ref_3","unstructured":"Martino, P., Daniele, E., Stefano, F., Aneta, F., Manuel, C.F.S., Stamatia, H., Maria, J.A., Thomas, K., Pierre, S., and Vasileios, S. (2016). Operating procedure for the production of the Global Human Settlement Layer from Landsat data of the epochs 1975, 1990, 2000, and 2014. JRC Tech. Rep. EUR 27741 EN."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1016\/j.isprsjprs.2017.05.002","article-title":"Simultaneous extraction of roads and buildings in remote sensing imagery with convolutional neural networks","volume":"130","author":"Alshehhi","year":"2017","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Li, W., He, C., Fang, J., and Fu, H. (2018, January 18\u201322). Semantic Segmentation Based Building Extraction Method Using Multi-source GIS Map Datasets and Satellite Imagery. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW 2018), Salt Lake City, UT, USA.","DOI":"10.1109\/CVPRW.2018.00043"},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"227","DOI":"10.1016\/j.rse.2018.02.055","article-title":"High-resolution multi-temporal mapping of global urban land using Landsat images based on the Google Earth Engine Platform","volume":"209","author":"Liu","year":"2018","journal-title":"Remote Sens. Env."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"370","DOI":"10.1016\/j.scib.2019.03.002","article-title":"Stable classification with limited sample: Transferring a 30-m resolution sample set collected in 2015 to mapping 10-m resolution global land cover in 2017","volume":"6","author":"Gong","year":"2019","journal-title":"Sci. Bull."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Pesaresi, M., Ouzounis, G.K., and Gueguen, L. (2012). 
A new compact representation of morphological profiles: Report on first massive VHR image processing at the JRC. Proc. SPIE 8390, Algorithms and Technologies for Multispectral, Hyperspectral, and Ultraspectral Imagery XVIII, SPIE.","DOI":"10.1117\/12.920291"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"2607","DOI":"10.1080\/01431161.2012.748992","article-title":"Finer Resolution Observation and Monitoring of Global Land Cover: First Mapping Results with Landsat TM and ETM+ Data","volume":"34","author":"Gong","year":"2013","journal-title":"Int. J. Remote Sens."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.isprsjprs.2015.01.001","article-title":"Global land cover mapping using earth observation satellite data: Recent progresses and challenges","volume":"103","author":"Ban","year":"2015","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"2317","DOI":"10.1007\/s11430-014-4919-z","article-title":"A multi-resolution global land cover dataset through multisource data aggregation","volume":"57","author":"Yu","year":"2014","journal-title":"Sci. China Earth Sci."},{"key":"ref_12","first-page":"847","article-title":"Global urban area mapping in high resolution using aster satellite images","volume":"38","author":"Miyazaki","year":"2010","journal-title":"Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"1004","DOI":"10.1109\/JSTARS.2012.2226563","article-title":"An automated method for global urban area mapping by integrating aster satellite images and gis data","volume":"6","author":"Miyazaki","year":"2013","journal-title":"Sel. Top. Appl. Earth Obs. Remote Sens. 
IEEE J."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"30","DOI":"10.1016\/j.isprsjprs.2017.10.012","article-title":"Breaking New Ground in Mapping Human Settlements from Space-The Global Urban Footprint","volume":"134","author":"Esch","year":"2017","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"118","DOI":"10.1080\/20964471.2017.1397899","article-title":"Big earth data analytics on sentinel-1 and landsat imagery in support to global human settlements mapping","volume":"1","author":"Corbane","year":"2017","journal-title":"Big Earth Data"},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"140","DOI":"10.1080\/20964471.2019.1625528","article-title":"Automated Global Delineation of Human Settlements from 40 Years of Landsat Satellite Data Archives","volume":"3","author":"Corbane","year":"2019","journal-title":"Big Earth Data"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"1153","DOI":"10.1109\/LGRS.2019.2942131","article-title":"Application of the Symbolic Machine Learning to Copernicus VHR Imagery: The European Settlement Map","volume":"17","author":"Corbane","year":"2019","journal-title":"Geosci. Remote Sens. Lett."},{"key":"ref_18","unstructured":"Bing Maps Team (2021, June 09). Microsoft Releases 125 million Building Footprints in the US as Open Data. Bing Blog, Available online: https:\/\/blogs.bing.com\/maps\/2018-06\/microsoft-releases-125-million-building-footprints-in-the-us-as-open-data."},{"key":"ref_19","unstructured":"Bonafilia, D., Yang, D., Gill, J., and Basu, S. (2021, June 09). 
Building High Resolution Maps for Humanitarian Aid and Development with Weakly- and Semi-Supervised Learning, Available online: https:\/\/research.fb.com\/publications\/building-high-resolution-maps-for-humanitarian-aid-and-development-with-weakly-and-semi-supervised-learning\/."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Pesaresi, M., Syrris, V., and Julea, A. (2016). A New Method for Earth Observation Data Analytics Based on Symbolic Machine Learning. Remote Sens., 8.","DOI":"10.3390\/rs8050399"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"2102","DOI":"10.1109\/JSTARS.2013.2271445","article-title":"A Global Human Settlement Layer From Optical HR\/VHR RS Data: Concept and First Results","volume":"6","author":"Pesaresi","year":"2013","journal-title":"Sel. Top. Appl. Earth Obs. Remote Sens. IEEE J."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"111322","DOI":"10.1016\/j.rse.2019.111322","article-title":"Land-cover classification with high-resolution remote sensing images using transferable deep models","volume":"237","author":"Tong","year":"2020","journal-title":"Remote Sens. Environ."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Hayat, K. (2017). Super-Resolution via Deep Learning. Digit. Signal Process.","DOI":"10.1016\/j.dsp.2018.07.005"},{"key":"ref_24","unstructured":"Wang, Z., Chen, J., and Hoi, S. (2019). Deep Learning for Image Super-resolution: A Survey. arXiv."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Dong, C., Loy, C.C., He, K., and Tang, X. (2014). Learning a Deep Convolutional Network for Image Super-Resolution. European Conference on Computer Vision, Springer.","DOI":"10.1007\/978-3-319-10593-2_13"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Dong, C., Loy, C.C., and Tang, X. (2016). Accelerating the super-resolution convolutional neural network. Comput. 
Sci.-CVPR, 391\u2013407.","DOI":"10.1007\/978-3-319-46475-6_25"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Shi, W., Caballero, J., Husz\u00e1r, F., Totz, J., and Wang, Z. (2016). Real-time single image and video super-resolution using an efficient sub-pixel convolutional neural network. arXiv.","DOI":"10.1109\/CVPR.2016.207"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016). Deep Residual Learning for Image Recognition. IEEE CVPR.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Kim, J., Lee, K.J., and Lee, M.K. (2016). Accurate Image Super-Resolution Using Very Deep Convolutional Networks. 2016 IEEE CVPR.","DOI":"10.1109\/CVPR.2016.182"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Ledig, C., Theis, L., Huszar, F., Caballero, J., Cunningham, A., Acosta, A., Aitken, A., Tejani, A., Totz, J., and Wang, Z.H. (2016). Photo-realistic single image super-resolution using a generative adversarial network. arXiv.","DOI":"10.1109\/CVPR.2017.19"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Pathak, H.N., Li, X., Minaee, S., and Cowan, B. (2018). Efficient Super Resolution for Large-Scale Images Using Attentional GAN. arXiv.","DOI":"10.1109\/BigData.2018.8622477"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Mustafa, A., Khan, S.H., Hayat, M., Shen, J.B., and Shao, L. (2019). Image Super-Resolution as a Defense Against Adversarial Attacks. arXiv.","DOI":"10.1109\/TIP.2019.2940533"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Gargiulo, M. (2019). Advances on CNN-based super-resolution of Sentinel-2 images. arXiv.","DOI":"10.1109\/IGARSS.2019.8899186"},{"key":"ref_34","first-page":"627","article-title":"Land Use Classification in Remote Sensing Images by Convolutional Neural Networks","volume":"28","author":"Castelluccio","year":"2015","journal-title":"Acta Ecol. 
Sin."},{"key":"ref_35","unstructured":"Luc, P., Couprie, C., Chintala, S., and Verbeek, J. (2016). Semantic Segmentation using Adversarial Networks. arXiv."},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"740","DOI":"10.1109\/LGRS.2016.2542358","article-title":"Convolutional neural network based automatic object detection on aerial images","volume":"13","author":"Evo","year":"2016","journal-title":"IEEE Geoscience Remote Sens. Lett."},{"key":"ref_37","first-page":"640","article-title":"Fully convolutional networks for semantic segmentation","volume":"39","author":"Long","year":"2014","journal-title":"IEEE Trans. PAMI"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Noh, H., Hong, S., and Han, B. (2015). Learning Deconvolution Network for Semantic Segmentation. ICCV.","DOI":"10.1109\/ICCV.2015.178"},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"2481","DOI":"10.1109\/TPAMI.2016.2644615","article-title":"Segnet: A deep convolutional encoder-decoder architecture for image segmentation","volume":"39","author":"Badrinarayanan","year":"2017","journal-title":"IEEE Trans. PAMI"},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"834","DOI":"10.1109\/TPAMI.2017.2699184","article-title":"Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs","volume":"40","author":"Chen","year":"2018","journal-title":"IEEE Trans. PAMI"},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Ronneberger, O., Fischer, P., and Brox, T. (2015). U-net: Convolutional networks for biomedical image segmentation. arXiv.","DOI":"10.1007\/978-3-319-24574-4_28"},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Zeiler, M.D., and Fergus, R. (2013). Visualizing and Understanding Convolutional Networks. arXiv.","DOI":"10.1007\/978-3-319-10590-1_53"},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Zeiler, M.D., Krishnan, D., Taylor, G.W., and Fergus, R. 
(2010, January 13\u201318). Deconvolutional networks. Proceedings of the 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Francisco, CA, USA.","DOI":"10.1109\/CVPR.2010.5539957"},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Zeiler, M.D., Taylor, G.W., and Fergus, R. (2011, January 6\u201313). Adaptive deconvolutional networks for mid and high level feature learning. Proceedings of the 2011 International Conference on Computer Vision, Barcelona, Spain.","DOI":"10.1109\/ICCV.2011.6126474"},{"key":"ref_45","unstructured":"Fisher, Y., and Vladlen, K. (2015). Multi-scale context aggregation by dilated convolutions. arXiv."},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"52","DOI":"10.1007\/s11801-020-9032-2","article-title":"Evaluating the generalization ability of convolutional neural networks for built-up area extraction in different cities of china","volume":"16","author":"Zhang","year":"2020","journal-title":"Optoelectron. Lett."},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Zhang, T., and Tang, H. (2019). A Comprehensive Evaluation of Approaches for Built-Up Area Extraction from Landsat OLI Images Using Massive Samples. 
Remote Sens., 11.","DOI":"10.20944\/preprints201812.0067.v1"}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/12\/2290\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T06:13:20Z","timestamp":1760163200000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/12\/2290"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,6,11]]},"references-count":47,"journal-issue":{"issue":"12","published-online":{"date-parts":[[2021,6]]}},"alternative-id":["rs13122290"],"URL":"https:\/\/doi.org\/10.3390\/rs13122290","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,6,11]]}}}