{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,7]],"date-time":"2026-03-07T15:47:51Z","timestamp":1772898471424,"version":"3.50.1"},"reference-count":62,"publisher":"MDPI AG","issue":"12","license":[{"start":{"date-parts":[[2021,6,21]],"date-time":"2021-06-21T00:00:00Z","timestamp":1624233600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Horizon 2020","award":["739578"],"award-info":[{"award-number":["739578"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Estimating the height of buildings and vegetation in single aerial images is a challenging problem. A task-focused Deep Learning (DL) model that combines architectural features from successful DL models (U-NET and Residual Networks) and learns the mapping from a single aerial imagery to a normalized Digital Surface Model (nDSM) was proposed. The model was trained on aerial images whose corresponding DSM and Digital Terrain Models (DTM) were available and was then used to infer the nDSM of images with no elevation information. The model was evaluated with a dataset covering a large area of Manchester, UK, as well as the 2018 IEEE GRSS Data Fusion Contest LiDAR dataset. The results suggest that the proposed DL architecture is suitable for the task and surpasses other state-of-the-art DL approaches by a large margin.<\/jats:p>","DOI":"10.3390\/rs13122417","type":"journal-article","created":{"date-parts":[[2021,6,21]],"date-time":"2021-06-21T04:21:21Z","timestamp":1624249281000},"page":"2417","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":36,"title":["IMG2nDSM: Height Estimation from Single Airborne RGB Images with Deep Learning"],"prefix":"10.3390","volume":"13","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-4034-7709","authenticated-orcid":false,"given":"Savvas","family":"Karatsiolis","sequence":"first","affiliation":[{"name":"CYENS Center of Excellence, Nicosia 1016, Cyprus"}]},{"given":"Andreas","family":"Kamilaris","sequence":"additional","affiliation":[{"name":"CYENS Center of Excellence, Nicosia 1016, Cyprus"},{"name":"Department of Computer Science, University of Twente, 7522 NB Enschede, The Netherlands"}]},{"given":"Ian","family":"Cole","sequence":"additional","affiliation":[{"name":"CYENS Center of Excellence, Nicosia 1016, Cyprus"},{"name":"Department of Computer Science, University of Cyprus, Aglantzia 2109, Cyprus"}]}],"member":"1968","published-online":{"date-parts":[[2021,6,21]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"103921","DOI":"10.1016\/j.landurbplan.2020.103921","article-title":"Remote Sensing in Urban Planning: Contributions towards Ecologically Sound Policies?","volume":"204","author":"Wellmann","year":"2020","journal-title":"Landsc. Urban Plan."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Bechtel, B. (April, January 30). Recent Advances in Thermal Remote Sensing for Urban Planning and Management. Proceedings of the Joint Urban Remote Sensing Event, JURSE 2015, Lausanne, Switzerland.","DOI":"10.1109\/JURSE.2015.7120515"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"164","DOI":"10.1016\/j.rse.2019.04.020","article-title":"Understanding an Urbanizing Planet: Strategic Directions for Remote Sensing","volume":"228","author":"Zhu","year":"2019","journal-title":"Remote Sens. Environ."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/sdata.2018.56","article-title":"Spatial Distribution of Arable and Abandoned Land across Former Soviet Union Countries","volume":"5","author":"Lesiv","year":"2018","journal-title":"Sci. Data"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Ma, L., Li, M., Blaschke, T., Ma, X., Tiede, D., Cheng, L., Chen, Z., and Chen, D. (2016). Object-Based Change Detection in Urban Areas: The Effects of Segmentation Strategy, Scale, and Feature Space on Unsupervised Methods. Remote Sens., 8.","DOI":"10.3390\/rs8090761"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Muro, J., Canty, M., Conradsen, K., H\u00fcttich, C., Nielsen, A.A., Skriver, H., Remy, F., Strauch, A., Thonfeld, F., and Menz, G. (2016). Short-Term Change Detection in Wetlands Using Sentinel-1 Time Series. Remote Sens., 8.","DOI":"10.3390\/rs8100795"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Lyu, H., Lu, H., and Mou, L. (2016). Learning a Transferable Change Rule from a Recurrent Neural Network for Land Cover Change Detection. Remote Sens., 8.","DOI":"10.3390\/rs8060506"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"417","DOI":"10.1016\/j.ijdrr.2018.09.015","article-title":"Satellite Remote Sensing for Disaster Management Support: A Holistic and Staged Approach Based on Case Studies in Sentinel Asia","volume":"33","author":"Kaku","year":"2019","journal-title":"Int. J. Disaster Risk Reduct."},{"key":"ref_9","first-page":"341","article-title":"Eyes in the Sky: Remote Sensing Technology Development Using Small Unmanned Aircraft Systems","volume":"111","author":"Wing","year":"2013","journal-title":"J. For."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"71","DOI":"10.1080\/10106049.2010.537786","article-title":"Remote Sensing Applications of Unmanned Aircraft: Challenges to Flight in United States Airspace","volume":"26","author":"Mulac","year":"2011","journal-title":"Geocarto Int."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"418","DOI":"10.1016\/j.isprsjprs.2020.07.020","article-title":"From LiDAR Point Cloud towards Digital Twin City: Clustering City Objects Based on Gestalt Principles","volume":"167","author":"Xue","year":"2020","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Micha\u0142owska, M., and Rapi\u0144ski, J. (2021). A Review of Tree Species Classification Based on Airborne LiDAR Data and Applied Classifiers. Remote Sens., 13.","DOI":"10.3390\/rs13030353"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Sch\u00f6nberger, J.L., and Frahm, J.-M. (2016, January 27\u201330). Structure-from-Motion Revisited. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.445"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Bosch, M., Foster, K., Christie, G.A., Wang, S., Hager, G.D., and Brown, M.Z. (2019, January 7\u201311). Semantic Stereo for Incidental Satellite Images. Proceedings of the IEEE Winter Conference on Applications of Computer Vision, WACV 2019, Waikoloa Village, HI, USA.","DOI":"10.1109\/WACV.2019.00167"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Voumard, J., Derron, M.-H., Jaboyedoff, M., Bornemann, P., and Malet, J.-P. (2018). Pros and Cons of Structure for Motion Embarked on a Vehicle to Survey Slopes along Transportation Lines Using 3D Georeferenced and Coloured Point Clouds. Remote Sens., 10.","DOI":"10.3390\/rs10111732"},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"31","DOI":"10.1177\/0309133308089496","article-title":"Airborne LiDAR for DEM Generation: Some Critical Issues","volume":"32","author":"Liu","year":"2008","journal-title":"Prog. Phys. Geogr. Earth Environ."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Liu, C.-J., Krylov, V.A., Kane, P., Kavanagh, G., and Dahyot, R. (2020). IM2ELEVATION: Building Height Estimation from Single-View Aerial Imagery. Remote Sens., 12.","DOI":"10.3390\/rs12172719"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"386","DOI":"10.1109\/TPAMI.2018.2844175","article-title":"Mask R-CNN","volume":"42","author":"He","year":"2020","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_19","unstructured":"Chen, L.-C., Papandreou, G., Schroff, F., and Adam, H. (2017). Rethinking Atrous Convolution for Semantic Image Segmentation. arXiv."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"G\u00fcler, R.A., Neverova, N., and Kokkinos, I. (2018, January 18\u201322). DensePose: Dense Human Pose Estimation in the Wild. Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2018, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00762"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J., and Wojna, Z. (2016, January 27\u201330). Rethinking the Inception Architecture for Computer Vision. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.308"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep Residual Learning for Image Recognition. Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR) 2016, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_23","unstructured":"Pereira, F., Burges, C.J.C., Bottou, L., and Weinberger, K.Q. (2012, January 3\u20136). ImageNet Classification with Deep Convolutional Neural Networks. Proceedings of the Advances in Neural Information Processing Systems (NIPS), Lake Tahoe, NV, USA."},{"key":"ref_24","first-page":"270","article-title":"A Survey on Deep Transfer Learning","volume":"Volume 11141","author":"Manolopoulos","year":"2018","journal-title":"Proceedings of the Artificial Neural Networks and Machine Learning\u2014CANN 2018\u201427th International Conference on Artificial Neural Networks"},{"key":"ref_25","unstructured":"Ghahramani, Z., Welling, M., Cortes, C., Lawrence, N.D., and Weinberger, K.Q. (2014, January 8\u201313). Depth Map Prediction from a Single Image Using a Multi-Scale Deep Network. Proceedings of the Advances in Neural Information Processing Systems 27: Annual Conference on Neural Information Processing Systems 2014, Montreal, QC, Canada."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Laina, I., Rupprecht, C., Belagiannis, V., Tombari, F., and Navab, N. (2016). Deeper Depth Prediction with Fully Convolutional Residual Networks. arXiv.","DOI":"10.1109\/3DV.2016.32"},{"key":"ref_27","unstructured":"Alhashim, I., and Wonka, P. (2018). High Quality Monocular Depth Estimation via Transfer Learning. arXiv."},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Huang, G., Liu, Z., and Weinberger, K.Q. (2016). Densely Connected Convolutional Networks. CoRR.","DOI":"10.1109\/CVPR.2017.243"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., and Fei-Fei, L. (2009, January 20\u201325). Imagenet: A Large-Scale Hierarchical Image Database. Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA.","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"ref_30","unstructured":"Bhat, S.F., Alhashim, I., and Wonka, P. (2020). AdaBins: Depth Estimation Using Adaptive Bins. arXiv."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Geiger, A., Lenz, P., and Urtasun, R. (2012, January 16\u201321). Are We Ready for Autonomous Driving? The KITTI Vision Benchmark Suite. Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR), Providence, RI, USA.","DOI":"10.1109\/CVPR.2012.6248074"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Nathan Silberman Derek Hoiem, P.K., and Fergus, R. (2012, January 7\u201313). Indoor Segmentation and Support Inference from RGBD Images. Proceedings of the ECCV, Florence, Italy.","DOI":"10.1007\/978-3-642-33715-4_54"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Mahjourian, R., Wicke, M., and Angelova, A. (2018, January 18\u201322). Unsupervised Learning of Depth and Ego-Motion From Monocular Video Using 3D Geometric Constraints. Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2018, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00594"},{"key":"ref_34","unstructured":"PNVR, K., Zhou, H., and Jacobs, D. (2020, January 13\u201319). SharinGAN: Combining Synthetic and Real Data for Unsupervised Geometry Estimation. Proceedings of the 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2020, Seattle, WA, USA."},{"key":"ref_35","unstructured":"Goodfellow, I., Pouget-Abadie, J., and Mirza, M. (2014). Generative Adversarial Networks. arXiv."},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"155","DOI":"10.1016\/j.isprsjprs.2020.11.011","article-title":"Automatic 3D Building Reconstruction from Multi-View Aerial Images with Deep Learning","volume":"171","author":"Yu","year":"2021","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_37","unstructured":"Mou, L., and Zhu, X.X. (2018). IM2HEIGHT: Height Estimation from Single Monocular Imagery via Fully Residual Convolutional-Deconvolutional Network. arXiv."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"50","DOI":"10.1016\/j.isprsjprs.2019.01.013","article-title":"Height Estimation from Single Aerial Images Using a Deep Convolutional Encoder-Decoder Network","volume":"149","author":"Amirkolaee","year":"2019","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Srivastava, S., Volpi, M., and Tuia, D. (2017, January 23\u201328). Joint Height Estimation and Semantic Labeling of Monocular Aerial Images with CNNS. Proceedings of the 2017 IEEE International Geoscience and Remote Sensing Symposium, IGARSS 2017, Fort Worth, TX, USA.","DOI":"10.1109\/IGARSS.2017.8128167"},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"1391","DOI":"10.1109\/LGRS.2019.2947783","article-title":"Multitask Learning of Height and Semantics From Aerial Images","volume":"17","author":"Carvalho","year":"2020","journal-title":"IEEE Geosci. Remote. Sens. Lett."},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"794","DOI":"10.1109\/LGRS.2018.2806945","article-title":"IMG2DSM: Height Simulation from Single Imagery Using Conditional Generative Adversarial Net","volume":"15","author":"Ghamisi","year":"2018","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Panagiotou, E., Chochlakis, G., Grammatikopoulos, L., and Charou, E. (2020). Generating Elevation Surface from a Single RGB Remotely Sensed Image Using Deep Learning. Remote Sens., 12.","DOI":"10.3390\/rs12122002"},{"key":"ref_43","unstructured":"Goodfellow, I., Bengio, Y., and Courville, A. (2016). Deep Learning, MIT Press."},{"key":"ref_44","unstructured":"Nielsen, M. (2021, March 24). Neural Networks and Deep Learning. Available online: http:\/\/neuralnetworksanddeeplearning.com\/."},{"key":"ref_45","unstructured":"(2021, March 25). Digimap. Available online: https:\/\/digimap.edina.ac.uk\/."},{"key":"ref_46","unstructured":"(2021, March 25). Edina. Available online: https:\/\/edina.ac.uk\/."},{"key":"ref_47","unstructured":"Defra (Department for Environment, Food and Rural Affairs) (2021, March 25). Spatial Data. Available online: https:\/\/environment.data.gov.uk\/DefraDataDownload\/."},{"key":"ref_48","unstructured":"(2021, March 24). 2018 IEEE GRSS Data Fusion Contest. Available online: http:\/\/dase.grss-ieee.org\/index.php."},{"key":"ref_49","unstructured":"IEEE (2021, March 24). France GRSS Chapter. Available online: https:\/\/site.ieee.org\/france-grss\/2018\/01\/16\/data-fusion-contest-2018-contest-open\/."},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Ronneberger, O., Fischer, P., and Brox, T. (2015, January 5\u20139). U-Net: Convolutional Networks for Biomedical Image Segmentation. Proceedings of the International Conference of Medical Image Computing and Computer-Assisted Intervention 18 (MICCAI), Munich, Germany.","DOI":"10.1007\/978-3-319-24574-4_28"},{"key":"ref_51","first-page":"424","article-title":"3D U-Net: Learning Dense Volumetric Segmentation from Sparse Annotation","volume":"Volume 9901","author":"Ourselin","year":"2016","journal-title":"Proceedings of the Medical Image Computing and Computer-Assisted Intervention (MICCAI)"},{"key":"ref_52","unstructured":"Iglovikov, V., and Shvets, A. (2018). TernausNet: U-Net with VGG11 Encoder Pre-Trained on ImageNet for Image Segmentation. arXiv."},{"key":"ref_53","doi-asserted-by":"crossref","unstructured":"Long, J., Shelhamer, E., and Darrell, T. (2015, January 7\u201312). Fully Convolutional Networks for Semantic Segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298965"},{"key":"ref_54","doi-asserted-by":"crossref","unstructured":"Shi, W., Caballero, J., Huszar, F., Totz, J., Aitken, A.P., Bishop, R., Rueckert, D., and Wang, Z. (2016, January 27\u201330). Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.207"},{"key":"ref_55","unstructured":"Ioffe, S., and Szegedy, C. (2015). Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift. arXiv."},{"key":"ref_56","unstructured":"Kingma, D.P., and Ba, J. (2015). Adam: A Method for Stochastic Optimization. ICLR, 1\u201315."},{"key":"ref_57","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2015, January 7\u201313). Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification. Proceedings of the International Conference on Computer Vision (ICCV), Santiago, Chile.","DOI":"10.1109\/ICCV.2015.123"},{"key":"ref_58","unstructured":"Van Dijk, T., and de Croon, G. (November, January 27). How Do Neural Networks See Depth in Single Images?. Proceedings of the International Conference on Computer Vision, Seoul, Korea."},{"key":"ref_59","doi-asserted-by":"crossref","unstructured":"Christie, G.A., Abujder, R.R.R.M., Foster, K., Hagstrom, S., Hager, G.D., and Brown, M.Z. (2020, January 13\u201319). Learning Geocentric Object Pose in Oblique Monocular Images. Proceedings of the Conference on Computer Vision and Pattern Recognition, CVPR 2020, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01452"},{"key":"ref_60","doi-asserted-by":"crossref","first-page":"4148","DOI":"10.1016\/j.rse.2008.01.024","article-title":"Surface Hydrology of Low-Relief Landscapes: Assessing Surface Water Flow Impedance Using LIDAR-Derived Digital Elevation Models","volume":"112","author":"Jones","year":"2008","journal-title":"Remote Sens. Environ."},{"key":"ref_61","doi-asserted-by":"crossref","first-page":"1567","DOI":"10.1109\/JSTARS.2016.2516900","article-title":"Comparison of Pleiades and LiDAR Digital Elevation Models for Terraces Detection in Farmlands","volume":"9","author":"Sofia","year":"2016","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_62","doi-asserted-by":"crossref","unstructured":"Palmer, D., Koumpli, E., Cole, I., Gottschalg, R., and Betts, T. (2018). A GIS-Based Method for Identification of Wide Area Rooftop Suitability for Minimum Size PV Systems Using LiDAR Data and Photogrammetry. Energies, 11.","DOI":"10.20944\/preprints201811.0518.v1"}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/12\/2417\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T06:19:50Z","timestamp":1760163590000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/12\/2417"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,6,21]]},"references-count":62,"journal-issue":{"issue":"12","published-online":{"date-parts":[[2021,6]]}},"alternative-id":["rs13122417"],"URL":"https:\/\/doi.org\/10.3390\/rs13122417","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,6,21]]}}}