{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,12]],"date-time":"2026-06-12T17:05:07Z","timestamp":1781283907525,"version":"3.54.1"},"reference-count":49,"publisher":"MDPI AG","issue":"12","license":[{"start":{"date-parts":[[2018,11,30]],"date-time":"2018-11-30T00:00:00Z","timestamp":1543536000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>A digital surface model (DSM) provides the geometry and structure of an urban environment with buildings being the most prominent objects in it. Built-up areas change with time due to the rapid expansion of cities. New buildings are being built, existing ones are expanded, and old buildings are torn down. As a result, 3D surface models can increase the understanding and explanation of complex urban scenarios. They are very useful in numerous fields of remote sensing applications, in tasks related to 3D reconstruction and city modeling, planning, visualization, disaster management, navigation, and decision-making, among others. DSMs are typically derived from various acquisition techniques, like photogrammetry, laser scanning, or synthetic aperture radar (SAR). The generation of DSMs from very high resolution optical stereo satellite imagery leads to high resolution DSMs which often suffer from mismatches, missing values, or blunders, resulting in coarse building shape representation. To overcome these problems, we propose a method for 3D surface model generation with refined building shapes to level of detail (LoD) 2 from stereo half-meter resolution satellite DSMs using deep learning techniques. Mainly, we train a conditional generative adversarial network (cGAN) with an objective function based on least square residuals to generate an accurate LoD2-like DSM with enhanced 3D object shapes directly from the noisy stereo DSM input. In addition, to achieve close to LoD2 shapes of buildings, we introduce a new approach to generate an artificial DSM with accurate and realistic building geometries from city geography markup language (CityGML) data, on which we later perform a training of the proposed cGAN architecture. The experimental results demonstrate the strong potential to create large-scale remote sensing elevation models where the buildings exhibit better-quality shapes and roof forms than just using the matching process. Moreover, the developed model is successfully applied to a different city that is unseen during the training to show its generalization capacity.<\/jats:p>","DOI":"10.3390\/rs10121926","type":"journal-article","created":{"date-parts":[[2018,11,30]],"date-time":"2018-11-30T12:13:17Z","timestamp":1543579997000},"page":"1926","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":39,"title":["DSM-to-LoD2: Spaceborne Stereo Digital Surface Model Refinement"],"prefix":"10.3390","volume":"10","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-4048-3583","authenticated-orcid":false,"given":"Ksenia","family":"Bittner","sequence":"first","affiliation":[{"name":"German Aerospace Center (DLR), Remote Sensing Technology Institute, M\u00fcnchner Str. 20, 82234 We\u00dfling, Germany"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-8541-3856","authenticated-orcid":false,"given":"Pablo","family":"D\u2019Angelo","sequence":"additional","affiliation":[{"name":"German Aerospace Center (DLR), Remote Sensing Technology Institute, M\u00fcnchner Str. 20, 82234 We\u00dfling, Germany"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-9186-4175","authenticated-orcid":false,"given":"Marco","family":"K\u00f6rner","sequence":"additional","affiliation":[{"name":"Department of Civil, Geo and Environmental Engineering, Technical University of Munich (TUM), Arcisstra\u00dfe 21, 80333 Munich, Germany"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8122-1475","authenticated-orcid":false,"given":"Peter","family":"Reinartz","sequence":"additional","affiliation":[{"name":"German Aerospace Center (DLR), Remote Sensing Technology Institute, M\u00fcnchner Str. 20, 82234 We\u00dfling, Germany"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2018,11,30]]},"reference":[{"key":"ref_1","unstructured":"Jacobsen, K. (2018, August 25). DEM Generation from Satellite Data. Available online: http:\/\/pdfs.semanticscholar.org\/e066\/f8a278f845bf2cb19c9b4e81ec4dde1e1131.pdf."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"597","DOI":"10.5721\/EuJRS20134635","article-title":"Productivity of TerraSAR-X 3D data in urban areas: A case study in Trento","volume":"46","author":"Sefercik","year":"2013","journal-title":"Eur. J. Remote Sens."},{"key":"ref_3","unstructured":"Poli, D., and Caravaggi, I. (2018, August 25). Digital Surface Modelling and 3D Information Extraction From Spaceborne Very High Resolution Stereo Pairs. Available online: http:\/\/publications.jrc.ec.europa.eu\/repository\/bitstream\/JRC68848\/lbna25234enn.pdf."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"53","DOI":"10.1080\/13658810500286943","article-title":"A hybrid interpolation method for the refinement of a regular grid digital elevation model","volume":"20","author":"Shi","year":"2006","journal-title":"Int. J. Geogr. Inf. Sci."},{"key":"ref_5","first-page":"235","article-title":"Improving a digital elevation model by reducing source data errors and optimising interpolation algorithm parameters: An example in the Loess Plateau, China","volume":"9","author":"Yang","year":"2007","journal-title":"Int. J. App. Earth Obs. Geoinf."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"293","DOI":"10.1016\/j.biosystemseng.2008.09.010","article-title":"Improving quality of public domain digital elevation models through data fusion","volume":"101","author":"Karkee","year":"2008","journal-title":"Biosyst. Eng."},{"key":"ref_7","unstructured":"Wang, P. (2018, August 25). Applying Two Dimensional Kalman Filtering for Digital Terrain Modelling. Available online: https:\/\/pdfs.semanticscholar.org\/b75e\/4787d497a6fc5becb500f895ebe7df3c9d66.pdf."},{"key":"ref_8","unstructured":"Zhen, X., Huang, X., and Kwoh, L.K. (2001, January 5\u20139). Extracting DEM from SPOT stereo images. Proceedings of the 22nd Asian Conference of Remote Sensing, Singapore."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Laina, I., Rupprecht, C., Belagiannis, V., Tombari, F., and Navab, N. (2016, January 25\u201328). Deeper depth prediction with fully convolutional residual networks. Proceedings of the 2016 Fourth International Conference on 3D Vision (3DV), Stanford, CA, USA.","DOI":"10.1109\/3DV.2016.32"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Li, J., Klein, R., and Yao, A. (2017, January 22\u201329). A two-streamed network for estimating fine-scaled depth maps from single rgb images. Proceedings of the 2017 IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.365"},{"key":"ref_11","unstructured":"Reed, S., Akata, Z., Yan, X., Logeswaran, L., Schiele, B., and Lee, H. (arXiv, 2016). Generative adversarial text to image synthesis, arXiv."},{"key":"ref_12","unstructured":"Huang, H., Yu, P.S., and Wang, C. (arXiv, 2018). An Introduction to Image Synthesis with Generative Adversarial Nets, arXiv."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Ledig, C., Theis, L., Husz\u00e1r, F., Caballero, J., Cunningham, A., Acosta, A., Aitken, A.P., Tejani, A., Totz, J., and Wang, Z. (2017, January 21\u201326). Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.19"},{"key":"ref_14","unstructured":"Vondrick, C., Pirsiavash, H., and Torralba, A. (2016). Generating videos with scene dynamics. Advances in Neural Information Processing Systems, MIT Press."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Schlegl, T., Seeb\u00f6ck, P., Waldstein, S.M., Schmidt-Erfurth, U., and Langs, G. (2017, January 2\u20137). Unsupervised anomaly detection with generative adversarial networks to guide marker discovery. Proceedings of the International Conference on Information Processing in Medical Imaging, Boone, NC, USA.","DOI":"10.1007\/978-3-319-59050-9_12"},{"key":"ref_16","unstructured":"Wu, J., Zhang, C., Xue, T., Freeman, B., and Tenenbaum, J. (2016, January 5\u201310). Learning a probabilistic latent space of object shapes via 3D generative-adversarial modeling. Proceedings of the Advances in Neural Information Processing Systems, Barcelona, Spain."},{"key":"ref_17","unstructured":"Smith, E., and Meger, D. (arXiv, 2017). Improved adversarial systems for 3D object generation and reconstruction, arXiv."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Yang, B., Wen, H., Wang, S., Clark, R., Markham, A., and Trigoni, N. (arXiv, 2017). 3D object reconstruction from a single depth view with adversarial learning, arXiv.","DOI":"10.1109\/ICCVW.2017.86"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Isola, P., Zhu, J.Y., Zhou, T., and Efros, A.A. (arXiv, 2016). Image-to-image translation with conditional adversarial networks, arXiv.","DOI":"10.1109\/CVPR.2017.632"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"156","DOI":"10.1111\/0031-868X.t01-1-00015","article-title":"A novel method for automating the checking and correction of digital elevation models using orthophotographs","volume":"18","author":"Georgopoulos","year":"2003","journal-title":"Photogramm. Rec."},{"key":"ref_21","unstructured":"Amitabh, A., Vijayvargiya, B., GopalaKrishna, P., and Srivastava, K. (2005, January 20\u201322). Iterative automatic technique for refinement of DEM and orthoimages. Proceedings of the 8th Map India International Conference, Pune, India."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"771","DOI":"10.14358\/PERS.72.7.771","article-title":"A comparative study of Australian cartometric and photogrammetric digital elevation model accuracy","volume":"72","author":"Walker","year":"2006","journal-title":"Photogramm. Eng. Remote Sens."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"29","DOI":"10.1016\/0924-2716(94)90044-2","article-title":"Parametric statistical method for error detection in digital elevation models","volume":"49","year":"1994","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_24","unstructured":"Sirmacek, B., d\u2019Angelo, P., Krauss, T., and Reinartz, P. (2010). Enhancing Urban Digital Elevation Models Using Automated Computer Vision Techniques, ISPRS."},{"key":"ref_25","unstructured":"Sirmacek, B., d\u2019Angelo, P., and Reinartz, P. (2010, January 11\u201313). Detecting complex building shapes in panchromatic satellite images for digital elevation model enhancement. Proceedings of the ISPRS Workshop on Modeling of Optical Airborne and Space Borne Sensors, Citeseer, Istanbul, Turkey."},{"key":"ref_26","first-page":"6","article-title":"Enhancement of dense urban digital surface models from VHR optical satellite stereo data by pre-segmentation and object detection","volume":"38","author":"Reinartz","year":"2010","journal-title":"Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci."},{"key":"ref_27","unstructured":"Eigen, D., Puhrsch, C., and Fergus, R. (2014, January 8\u201313). Depth map prediction from a single image using a multi-scale deep network. Proceedings of the Advances in Neural Information Processing Systems, Montreal, QC, Canada."},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Eigen, D., and Fergus, R. (2015, January 7\u201313). Predicting depth, surface normals and semantic labels with a common multi-scale convolutional architecture. Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.304"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Liu, F., Shen, C., and Lin, G. (2015, January 7\u201312). Deep convolutional neural fields for depth estimation from a single image. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7299152"},{"key":"ref_30","unstructured":"Zhu, J., and Ma, R. (2018, August 25). Real-Time Depth Estimation from 2D Images. Available online: http:\/\/cs231n.stanford.edu\/reports\/2016\/pdfs\/407_Report.pdf."},{"key":"ref_31","unstructured":"Simonyan, K., and Zisserman, A. (arXiv, 2014). Very deep convolutional networks for large-scale image recognition, arXiv."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Jeon, J., and Lee, S. (2018, January 8\u201314). Reconstruction-based Pairwise Depth Dataset for Depth Image Enhancement Using CNN. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01270-0_26"},{"key":"ref_33","unstructured":"Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., Courville, A., and Bengio, Y. (2014, January 8\u201313). Generative adversarial nets. Proceedings of the Advances in Neural Information Processing Systems, Montreal, QC, Canada."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Choy, C.B., Xu, D., Gwak, J., Chen, K., and Savarese, S. (2016, January 8\u201316). 3D-R2N2: A unified approach for single and multi-view 3D object reconstruction. Proceedings of the European Conference on Computer Vision, Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46484-8_38"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Dai, A., Qi, C.R., and Nie\u00dfner, M. (2017, January 21\u201326). Shape completion using 3D-encoder-predictor cnns and shape synthesis. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.693"},{"key":"ref_36","unstructured":"Rezende, D.J., Eslami, S.A., Mohamed, S., Battaglia, P., Jaderberg, M., and Heess, N. (2016, January 5\u201310). Unsupervised learning of 3D structure from images. Proceedings of the Advances in Neural Information Processing Systems, Barcelona, Spain."},{"key":"ref_37","unstructured":"Mirza, M., and Osindero, S. (arXiv, 2014). Conditional generative adversarial nets, arXiv."},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Guo, Y.X., and Tong, X. (arXiv, 2018). View-volume Network for Semantic Scene Completion from a Single Depth Image, arXiv.","DOI":"10.24963\/ijcai.2018\/101"},{"key":"ref_39","first-page":"103","article-title":"Automatic Large-Scale 3D Building Shape Refinement Using Conditional Generative Adversarial Networks","volume":"422","author":"Bittner","year":"2018","journal-title":"ISPRS Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci."},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Bittner, K., and K\u00f6rner, M. (2018, January 18\u201322). Automatic Large-Scale 3D Building Shape Refinement Using Conditional Generative Adversarial Networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, Salt Lake City, Utah.","DOI":"10.1109\/CVPRW.2018.00249"},{"key":"ref_41","unstructured":"Gr\u00f6ger, G., Kolbe, T., Nagel, C., and H\u00e4fele, K. (2018, August 25). OGC City Geography Markup Language (CityGML) Encoding Standard, Version 2.0, OGC doc no. 12-019. Available online: https:\/\/www.google.com\/url?sa=t&rct=j&q=&esrc=s&source=web&cd=8&cad=rja&uact=8&ved=2ahUKEwjuxKi5__reAhUMwLwKHVcHBU0QFjAHegQICBAC&url=https%3A%2F%2Fportal.opengeospatial.org%2Ffiles%2F%3Fartifact_id%3D47842&usg=AOvVaw2wSGg9aGHeIRQc8wQe3Vug."},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Mao, X., Li, Q., Xie, H., Lau, R.Y., Wang, Z., and Smolley, S.P. (2017, January 22\u201329). Least squares generative adversarial networks. Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy.","DOI":"10.1109\/ICCV.2017.304"},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Ronneberger, O., Fischer, P., and Brox, T. (2015, January 5\u20139). U-net: Convolutional networks for biomedical image segmentation. Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention, Munich, Germany.","DOI":"10.1007\/978-3-319-24574-4_28"},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Shewchuk, J.R. (1996). Triangle: Engineering a 2D quality mesh generator and Delaunay triangulator. Applied Computational Geometry Towards Geometric Engineering, Springer.","DOI":"10.1007\/BFb0014497"},{"key":"ref_45","first-page":"793","article-title":"Sur la Sph\u00e8re Vide","volume":"7","author":"Delaunay","year":"1934","journal-title":"Bull. Acad. Sci. USSR"},{"key":"ref_46","first-page":"79","article-title":"Semiglobal matching results on the ISPRS stereo matching benchmark","volume":"38","author":"Reinartz","year":"2011","journal-title":"ISPRS Hann. Works."},{"key":"ref_47","unstructured":"Kingma, D.P., and Ba, J. (arXiv, 2014). Adam: A method for stochastic optimization, arXiv."},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"398","DOI":"10.1016\/j.isprsjprs.2009.02.003","article-title":"Accuracy assessment of digital elevation models by means of robust statistical methods","volume":"64","year":"2009","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Mao, X., Li, Q., Xie, H., Lau, R.Y., Wang, Z., and Smolley, S.P. (arXiv, 2017). On the Effectiveness of Least Squares Generative Adversarial Networks, arXiv.","DOI":"10.1109\/ICCV.2017.304"}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/10\/12\/1926\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T15:33:42Z","timestamp":1760196822000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/10\/12\/1926"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,11,30]]},"references-count":49,"journal-issue":{"issue":"12","published-online":{"date-parts":[[2018,12]]}},"alternative-id":["rs10121926"],"URL":"https:\/\/doi.org\/10.3390\/rs10121926","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2018,11,30]]}}}