{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,1]],"date-time":"2026-05-01T13:55:30Z","timestamp":1777643730793,"version":"3.51.4"},"reference-count":50,"publisher":"MDPI AG","issue":"1","license":[{"start":{"date-parts":[[2021,12,30]],"date-time":"2021-12-30T00:00:00Z","timestamp":1640822400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100004410","name":"Scientific and Technological Research Council of Turkey","doi-asserted-by":"publisher","award":["119Y363"],"award-info":[{"award-number":["119Y363"]}],"id":[{"id":"10.13039\/501100004410","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["IJGI"],"abstract":"<jats:p>In geospatial applications such as urban planning and land use management, automatic detection and classification of earth objects are essential and primary subjects. When the significant semantic segmentation algorithms are considered, DeepLabV3+ stands out as a state-of-the-art CNN. Although the DeepLabV3+ model is capable of extracting multi-scale contextual information, there is still a need for multi-stream architectural approaches and different training approaches of the model that can leverage multi-modal geographic datasets. In this study, a new end-to-end dual-stream architecture that considers geospatial imagery was developed based on the DeepLabV3+ architecture. As a result, the spectral datasets other than RGB provided increments in semantic segmentation accuracies when they were used as additional channels to height information. Furthermore, both the given data augmentation and Tversky loss function which is sensitive to imbalanced data accomplished better overall accuracies. Also, it has been shown that the new dual-stream architecture using Potsdam and Vaihingen datasets produced 88.87% and 87.39% overall semantic segmentation accuracies, respectively. Eventually, it was seen that enhancement of the traditional significant semantic segmentation networks has a great potential to provide higher model performances, whereas the contribution of geospatial data as the second stream to RGB to segmentation was explicitly shown.<\/jats:p>","DOI":"10.3390\/ijgi11010023","type":"journal-article","created":{"date-parts":[[2021,12,30]],"date-time":"2021-12-30T21:41:21Z","timestamp":1640900481000},"page":"23","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":32,"title":["Semantic Segmentation of High-Resolution Airborne Images with Dual-Stream DeepLabV3+"],"prefix":"10.3390","volume":"11","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-0474-7518","authenticated-orcid":false,"given":"Ozgun","family":"Akcay","sequence":"first","affiliation":[{"name":"Department of Geomatics Engineering, Faculty of Engineering, \u00c7anakkale Onsekiz Mart University, Canakkale 17100, Turkey"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8832-5453","authenticated-orcid":false,"given":"Ahmet Cumhur","family":"Kinaci","sequence":"additional","affiliation":[{"name":"Department of Computer Engineering, Faculty of Engineering, \u00c7anakkale Onsekiz Mart University, Canakkale 17100, Turkey"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3804-1209","authenticated-orcid":false,"given":"Emin Ozgur","family":"Avsar","sequence":"additional","affiliation":[{"name":"Department of Geomatics Engineering, Faculty of Engineering, \u00c7anakkale Onsekiz Mart University, Canakkale 17100, Turkey"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3987-6435","authenticated-orcid":false,"given":"Umut","family":"Aydar","sequence":"additional","affiliation":[{"name":"Department of Geomatics Engineering, Faculty of Engineering, \u00c7anakkale Onsekiz Mart University, Canakkale 17100, Turkey"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2021,12,30]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"172","DOI":"10.1016\/j.isprsjprs.2019.07.009","article-title":"Development and evaluation of a deep learning model for real-time ground vehicle semantic segmentation from UAV-based thermal infrared imagery","volume":"155","author":"Masouleh","year":"2019","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"2355","DOI":"10.1007\/s11063-019-10174-x","article-title":"Automatic semantic segmentation with DeepLab dilated learning network for change detection in remote sensing images","volume":"51","author":"Venugopal","year":"2020","journal-title":"Neural Processing Lett."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"3146","DOI":"10.1080\/01431161.2020.1871100","article-title":"A semantic segmentation method with category boundary for Land Use and Land Cover (LULC) mapping of Very-High Resolution (VHR) remote sensing image","volume":"42","author":"Xu","year":"2021","journal-title":"Int. J. Remote Sens."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Touzani, S., and Granderson, J. (2021). Open Data and Deep Semantic Segmentation for Automated Extraction of Building Footprints. Remote Sens., 13.","DOI":"10.3390\/rs13132578"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"105189","DOI":"10.1016\/j.catena.2021.105189","article-title":"Convolutional neural networks applied to semantic segmentation of landslide scars","volume":"201","author":"Bragagnolo","year":"2021","journal-title":"CATENA"},{"key":"ref_6","unstructured":"Kanwal, S., Uzair, M., and Ullah, H. (2021). A Survey of Hand Crafted and Deep Learning Methods for Image Aesthetic Assessment. arXiv."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"273","DOI":"10.1007\/BF00994018","article-title":"Support-vector networks","volume":"20","author":"Cortes","year":"1995","journal-title":"Mach. Learn."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"1007","DOI":"10.1080\/01431160512331314083","article-title":"Support vector machines for classification in remote sensing","volume":"26","author":"Pal","year":"2005","journal-title":"Int. J. Remote Sens."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"5","DOI":"10.1023\/A:1010933404324","article-title":"Random forests","volume":"45","author":"Breiman","year":"2001","journal-title":"Mach. Learn."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"617","DOI":"10.1080\/01431160701352154","article-title":"The application of artificial neural networks to the analysis of remotely sensed data","volume":"29","author":"Mas","year":"2008","journal-title":"Int. J. Remote Sens."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"1233","DOI":"10.1038\/s41592-019-0403-1","article-title":"Deep learning for cellular image analysis","volume":"16","author":"Moen","year":"2019","journal-title":"Nat. Methods"},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"116205","DOI":"10.1016\/j.eswa.2021.116205","article-title":"Score-based mask edge improvement of Mask-RCNN for segmentation of fruit and vegetables","volume":"190","author":"Hameed","year":"2021","journal-title":"Expert Syst. Appl."},{"key":"ref_13","unstructured":"Wei, X.S., Cui, Q., Yang, L., Wang, P., and Liu, L. (2019). RPC: A large-scale retail product checkout dataset. arXiv."},{"key":"ref_14","unstructured":"Hamian, M.H., Beikmohammadi, A., Ahmadi, A., and Nasersharif, B. (2021, January 3\u20134). Semantic Segmentation of Autonomous Driving Images by the combination of Deep Learning and Classical Segmentation. Proceedings of the 2021 26th International Computer Conference, Computer Society of Iran (CSICC), Tehran, Iran."},{"key":"ref_15","first-page":"1","article-title":"Convolutional neural networks for semantic segmentation as a tool for multiclass face analysis in thermal infrared","volume":"40","author":"Ehlen","year":"2021","journal-title":"J. Nondestruct. Eval."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"87","DOI":"10.1007\/s13735-017-0141-z","article-title":"A review of semantic segmentation using deep neural networks","volume":"7","author":"Guo","year":"2018","journal-title":"Int. J. Multimed. Inf. Retr."},{"key":"ref_17","unstructured":"Neubert, M., Herold, H., and Meinel, G. (2006, January 4\u20135). Evaluation of remote sensing image segmentation quality\u2013further results and concepts. Proceedings of the International Conference on Object-Based Image Analysis (ICOIA), Salzburg, Austria."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Akcay, O., Avsar, E., Inalpulat, M., Genc, L., and Cam, A. (2018). Assessment of Segmentation Parameters for Object-Based Land Cover Classification Using Color-Infrared Imagery. ISPRS Int. J. Geo-Inf., 7.","DOI":"10.3390\/ijgi7110424"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Schwartzman, A., Kagan, M., Mackey, L., Nachman, B., and De Oliveira, L. (2016). Image Processing, Computer Vision, and Deep Learning: New Approaches to the Analysis and Physics Interpretation of LHC Events, IOP Publishing.","DOI":"10.1088\/1742-6596\/762\/1\/012035"},{"key":"ref_20","unstructured":"Sherrah, J. (2016). Fully Convolutional Networks for Dense Semantic Labelling of High-Resolution Aerial Imagery. arXiv."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Wang, J., Shen, L., Qiao, W., Dai, Y., and Li, Z. (2019). Deep feature fusion with integration of residual connection and attention model for classification of VHR remote sensing images. Remote Sens., 11.","DOI":"10.3390\/rs11131617"},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"297","DOI":"10.1016\/j.neucom.2018.11.051","article-title":"Problems of encoder-decoder frameworks for high-resolution remote sensing image segmentation: Structural stereotype and insufficient learning","volume":"330","author":"Sun","year":"2019","journal-title":"Neurocomputing"},{"key":"ref_23","unstructured":"Marcu, A., and Leordeanu, M. (2016). Dual Local-Global Contextual Pathways for Recognition in Aerial Imagery. arXiv."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Piramanayagam, S., Saber, E., Schwartzkopf, W., and Koehler, F.W. (2018). Supervised classification of multisensor remotely sensed images using a deep learning framework. Remote Sens., 10.","DOI":"10.3390\/rs10091429"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"158","DOI":"10.1016\/j.isprsjprs.2017.11.009","article-title":"Classification with an edge: Improving semantic image segmentation with boundary detection","volume":"135","author":"Marmanis","year":"2018","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Xie, S., and Tu, Z. (2015, January 7\u201313). Holistically-nested edge detection. Proceedings of the IEEE International Conference on Computer Vision, Washington, DC, USA.","DOI":"10.1109\/ICCV.2015.164"},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"357","DOI":"10.1080\/17538947.2020.1831087","article-title":"Incorporating DeepLabv3+ and object-based image analysis for semantic segmentation of very high resolution remote sensing images","volume":"14","author":"Du","year":"2021","journal-title":"Int. J. Digit. Earth"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Chen, L.C., Zhu, Y., Papandreou, G., Schroff, F., and Adam, H. (2018, January 8\u201314). Encoder-decoder with atrous separable convolution for semantic image segmentation. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01234-2_49"},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"20","DOI":"10.1016\/j.isprsjprs.2017.11.011","article-title":"Beyond RGB: Very high resolution urban remote sensing with multimodal deep networks","volume":"140","author":"Audebert","year":"2018","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Song, A., and Kim, Y. (2020). Semantic Segmentation of Remote-Sensing Imagery Using Heterogeneous Big Data: International Society for Photogrammetry and Remote Sensing Potsdam and Cityscape Datasets. ISPRS Int. J. Geo-Inf., 9.","DOI":"10.3390\/ijgi9100601"},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"114417","DOI":"10.1016\/j.eswa.2020.114417","article-title":"A review of deep learning methods for semantic segmentation of remote sensing imagery","volume":"169","author":"Yuan","year":"2020","journal-title":"Expert Syst. Appl."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Nikparvar, B., and Thill, J.C. (2021). Machine Learning of Spatial Data. ISPRS Int. J. Geo-Inf., 10.","DOI":"10.3390\/ijgi10090600"},{"key":"ref_33","unstructured":"Wu, H., Zhang, J., Huang, K., Liang, K., and Yu, Y. (2019). Fastfcn: Rethinking dilated convolution in the backbone for semantic segmentation. arXiv."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Takikawa, T., Acuna, D., Jampani, V., and Fidler, S. (2019, January 27\u201328). Gated-scnn: Gated shape cnns for semantic segmentation. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Seoul, Korea.","DOI":"10.1109\/ICCV.2019.00533"},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"834","DOI":"10.1109\/TPAMI.2017.2699184","article-title":"Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs","volume":"40","author":"Chen","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"He, K., Gkioxari, G., Doll\u00e1r, P., and Girshick, R. (2017, January 22\u201329). Mask r-cnn. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.322"},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Minaee, S., Boykov, Y.Y., Porikli, F., Plaza, A.J., Kehtarnavaz, N., and Terzopoulos, D. (2020). Image segmentation using deep learning: A survey. arXiv.","DOI":"10.1109\/TPAMI.2021.3059968"},{"key":"ref_38","unstructured":"Chen, L.C., Papandreou, G., Schroff, F., and Adam, H. (2017). Rethinking atrous convolution for semantic image segmentation. arXiv."},{"key":"ref_39","unstructured":"ISPRS (2021, October 05). International Society for Photogrammetry and Remote Sensing. 2D Semantic Labeling Challenge. Available online: http:\/\/www2.isprs.org\/commissions\/comm3\/wg4\/semantic-labeling.html."},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s40537-019-0197-0","article-title":"A survey on image data augmentation for deep learning","volume":"6","author":"Shorten","year":"2019","journal-title":"J. Big Data"},{"key":"ref_41","first-page":"73","article-title":"The DGPF-test on digital airborne camera evaluation overview and test design","volume":"2010","author":"Cramer","year":"2010","journal-title":"PFG Photogramm. Fernerkund. Geoinf."},{"key":"ref_42","first-page":"358","article-title":"Boundary Extraction Based on Dual Stream Deep Learning Model in High Resolution Remote Sensing Images","volume":"7","author":"Akcay","year":"2021","journal-title":"J. Adv. Res. Nat. Appl. Sci."},{"key":"ref_43","unstructured":"Gerke, M. (2015). Use of the Stair Vision Library within the ISPRS 2D Semantic Labeling Benchmark (Vaihingen), University of Twente. Technical Report."},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"312","DOI":"10.1016\/j.cogsys.2019.10.004","article-title":"Inception and ResNet features are (almost) equivalent","volume":"59","author":"Beveridge","year":"2020","journal-title":"Cogn. Syst. Res."},{"key":"ref_45","doi-asserted-by":"crossref","first-page":"302","DOI":"10.1016\/j.neucom.2019.11.118","article-title":"A brief survey on semantic segmentation with deep learning","volume":"406","author":"Hao","year":"2020","journal-title":"Neurocomputing"},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"Azimi, S.M., Henry, C., Sommer, L., Schumann, A., and Vig, E. (2019, January 27\u201328). Skyscapes fine-grained semantic understanding of aerial scenes. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Seoul, Korea.","DOI":"10.1109\/ICCV.2019.00749"},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Boguszewski, A., Batorski, D., Ziemba-Jankowska, N., Zambrzycka, A., and Dziedzic, T. (2020). Landcover. ai: Dataset for automatic mapping of buildings, woodlands and water from aerial imagery. arXiv.","DOI":"10.1109\/CVPRW53098.2021.00121"},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Abraham, N., and Khan, N.M. (2019, January 8\u201311). A novel focal tversky loss function with improved attention u-net for lesion segmentation. Proceedings of the 2019 IEEE 16th International Symposium on Biomedical Imaging (ISBI 2019), Venice, Italy.","DOI":"10.1109\/ISBI.2019.8759329"},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Gragera, A., and Suppakitpaisarn, V. (2016). Semimetric properties of s\u00f8rensen-dice and tversky indexes. International Workshop on Algorithms and Computation, Springer.","DOI":"10.1007\/978-3-319-30139-6_27"},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Cordts, M., Omran, M., Ramos, S., Rehfeld, T., Enzweiler, M., Benenson, R., Franke, U., Roth, S., and Schiele, B. (2016, January 27\u201330). The cityscapes dataset for semantic urban scene understanding. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.350"}],"container-title":["ISPRS International Journal of Geo-Information"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2220-9964\/11\/1\/23\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T07:56:18Z","timestamp":1760169378000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2220-9964\/11\/1\/23"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,12,30]]},"references-count":50,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2022,1]]}},"alternative-id":["ijgi11010023"],"URL":"https:\/\/doi.org\/10.3390\/ijgi11010023","relation":{},"ISSN":["2220-9964"],"issn-type":[{"value":"2220-9964","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,12,30]]}}}