{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,14]],"date-time":"2026-04-14T23:18:09Z","timestamp":1776208689783,"version":"3.50.1"},"reference-count":175,"publisher":"MDPI AG","issue":"4","license":[{"start":{"date-parts":[[2021,2,23]],"date-time":"2021-02-23T00:00:00Z","timestamp":1614038400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100005790","name":"Thammasat University","doi-asserted-by":"publisher","award":["Thammasat University Research Fund under the TU Research Scholar, Contract No. 6\/2562."],"award-info":[{"award-number":["Thammasat University Research Fund under the TU Research Scholar, Contract No. 6\/2562."]}],"id":[{"id":"10.13039\/501100005790","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Availability of very high-resolution remote sensing images and advancement of deep learning methods have shifted the paradigm of image classification from pixel-based and object-based methods to deep learning-based semantic segmentation. This shift demands a structured analysis and revision of the current status on the research domain of deep learning-based semantic segmentation. The focus of this paper is on urban remote sensing images. We review and perform a meta-analysis to juxtapose recent papers in terms of research problems, data source, data preparation methods including pre-processing and augmentation techniques, training details on architectures, backbones, frameworks, optimizers, loss functions and other hyper-parameters and performance comparison. Our detailed review and meta-analysis show that deep learning not only outperforms traditional methods in terms of accuracy, but also addresses several challenges previously faced. Further, we provide future directions of research in this domain.<\/jats:p>","DOI":"10.3390\/rs13040808","type":"journal-article","created":{"date-parts":[[2021,2,23]],"date-time":"2021-02-23T20:19:36Z","timestamp":1614111576000},"page":"808","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":203,"title":["Deep Learning-Based Semantic Segmentation of Urban Features in Satellite Images: A Review and Meta-Analysis"],"prefix":"10.3390","volume":"13","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-5331-9897","authenticated-orcid":false,"given":"Bipul","family":"Neupane","sequence":"first","affiliation":[{"name":"Advanced Geospatial Technology Research Unit, Sirindhorn International Institute of Technology, 131 Moo 5, Tiwanon Road, Bangkadi, Mueang Pathumthani, Pathumthani 12000, Thailand"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3452-8845","authenticated-orcid":false,"given":"Teerayut","family":"Horanont","sequence":"additional","affiliation":[{"name":"School of Information, Computer, and Communication Technology (ICT), Sirindhorn International Institute of Technology, 131 Moo 5, Tiwanon Road, Bangkadi, Mueang Pathumthani, Pathumthani 12000, Thailand"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4875-2127","authenticated-orcid":false,"given":"Jagannath","family":"Aryal","sequence":"additional","affiliation":[{"name":"Department of Infrastructure Engineering, Faculty of Engineering and IT, The University of Melbourne, Melbourne, VIC 3010, Australia"}]}],"member":"1968","published-online":{"date-parts":[[2021,2,23]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1016\/j.rse.2011.11.026","article-title":"Sentinel-2: ESA\u2019s optical high-resolution mission for GMES operational services","volume":"120","author":"Drusch","year":"2012","journal-title":"Remote Sens. Environ."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1016\/j.rse.2011.08.026","article-title":"The next Landsat satellite: The Landsat data continuity mission","volume":"122","author":"Irons","year":"2012","journal-title":"Remote Sens. Environ."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"1145","DOI":"10.1016\/j.rse.2010.12.017","article-title":"Per-pixel vs. object-based classification of urban land cover extraction using high spatial resolution imagery","volume":"115","author":"Myint","year":"2011","journal-title":"Remote Sens. Environ."},{"key":"ref_4","first-page":"1393","article-title":"The design and implementation of an integrated geographic information system for environmental applications","volume":"61","author":"Cowen","year":"1995","journal-title":"Photogramm. Eng. Remote Sens."},{"key":"ref_5","unstructured":"Campbell, J.B., and Wynne, R.H. (2011). Introduction to Remote Sensing, Guilford Press."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"225","DOI":"10.1016\/j.isprsjprs.2003.09.007","article-title":"Object-based classification of remote sensing data for change detection","volume":"58","author":"Walter","year":"2004","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"188","DOI":"10.2747\/1548-1603.45.2.188","article-title":"Identifying mangrove species and their surrounding land use and land cover classes using an object-oriented approach with a lacunarity spatial measure","volume":"45","author":"Myint","year":"2008","journal-title":"GIScience Remote Sens."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Navulur, K. (2006). Multispectral Image Analysis Using the Object-Oriented Paradigm, CRC Press.","DOI":"10.1201\/9781420043075"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"5167","DOI":"10.1080\/01431160701604703","article-title":"Object-based classification of residential land use within Accra, Ghana based on QuickBird satellite data","volume":"28","author":"Stow","year":"2007","journal-title":"Int. J. Remote Sens."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"823","DOI":"10.1080\/01431160600746456","article-title":"A survey of image classification methods and techniques for improving classification performance","volume":"28","author":"Lu","year":"2007","journal-title":"Int. J. Remote Sens."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Neupane, B., Horanont, T., Duy, H.N., Suebvong, S., and Mahattanawutakorn, T. (2019, January 7\u201312). An Open-Source UAV Image Processing Web Service for Crop Health Monitoring. Proceedings of the 2019 8th IEEE International Congress on Advanced Applied Informatics (IIAI-AAI), lToyama, Japan.","DOI":"10.1109\/IIAI-AAI.2019.00014"},{"key":"ref_12","first-page":"1995","article-title":"Convolutional networks for images, speech, and time series","volume":"3361","author":"LeCun","year":"1995","journal-title":"Handb. Brain Theory Neural Netw."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"436","DOI":"10.1038\/nature14539","article-title":"Deep learning","volume":"521","author":"LeCun","year":"2015","journal-title":"Nature"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"1345","DOI":"10.1109\/TKDE.2009.191","article-title":"A survey on transfer learning","volume":"22","author":"Pan","year":"2009","journal-title":"IEEE Trans. Knowl. Data Eng."},{"key":"ref_15","first-page":"467","article-title":"Fine-tuning Deep Convolutional Networks for Plant Recognition","volume":"1391","author":"Reyes","year":"2015","journal-title":"CLEF (Work. Notes)"},{"key":"ref_16","unstructured":"Amara, J., Bouaziz, B., and Algergawy, A. (2017). A Deep Learning-based Approach for Banana Leaf Diseases Classification. Datenbanksysteme f\u00fcr Business, Technologie und Web (BTW 2017)\u2014Workshopband, Gesellschaft f\u00fcr Informatik e.V."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"1419","DOI":"10.3389\/fpls.2016.01419","article-title":"Using deep learning for image-based plant disease detection","volume":"7","author":"Mohanty","year":"2016","journal-title":"Front. Plant Sci."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Huang, H., Deng, J., Lan, Y., Yang, A., Deng, X., and Zhang, L. (2018). A fully convolutional network for weed mapping of unmanned aerial vehicle (UAV) imagery. PLoS ONE, 13.","DOI":"10.1371\/journal.pone.0196302"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"778","DOI":"10.1109\/LGRS.2017.2681128","article-title":"Deep learning classification of land cover and crop types using remote sensing data","volume":"14","author":"Kussul","year":"2017","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_20","unstructured":"Mortensen, A.K., Dyrmann, M., Karstoft, H., J\u00f8rgensen, R.N., and Gislum, R. (2016, January 26\u201329). Semantic segmentation of mixed crops using deep convolutional neural network. Proceedings of the CIGR-AgEng Conference, Abstracts and Full Papers, Aarhus, Denmark."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Neupane, B., Horanont, T., and Hung, N.D. (2019). Deep learning based banana plant detection and counting using high-resolution red-green-blue (RGB) images collected from unmanned aerial vehicle (UAV). PLoS ONE, 14.","DOI":"10.1371\/journal.pone.0223906"},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1038\/srep03188","article-title":"Deep cognitive imaging systems enable estimation of continental-scale fire incidence from climate data","volume":"3","author":"Dutta","year":"2013","journal-title":"Sci. Rep."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Ghorbanzadeh, O., Blaschke, T., Gholamnia, K., Meena, S.R., Tiede, D., and Aryal, J. (2019). Evaluation of different machine learning methods and deep-learning convolutional neural networks for landslide detection. Remote Sens., 11.","DOI":"10.3390\/rs11020196"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Ghorbanzadeh, O., Meena, S.R., Blaschke, T., and Aryal, J. (2019). UAV-based slope failure detection using deep-learning convolutional neural networks. Remote Sens., 11.","DOI":"10.3390\/rs11172046"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"8","DOI":"10.1109\/MGRS.2017.2762307","article-title":"Deep learning in remote sensing: A comprehensive review and list of resources","volume":"5","author":"Zhu","year":"2017","journal-title":"IEEE Geosci. Remote Sens. Mag."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Shadman Roodposhti, M., Aryal, J., Lucieer, A., and Bryan, B.A. (2019). Uncertainty assessment of hyperspectral image classification: Deep learning vs. random forest. Entropy, 21.","DOI":"10.3390\/e21010078"},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"541","DOI":"10.1162\/neco.1989.1.4.541","article-title":"Backpropagation applied to handwritten zip code recognition","volume":"1","author":"LeCun","year":"1989","journal-title":"Neural Comput."},{"key":"ref_28","first-page":"1097","article-title":"Imagenet classification with deep convolutional neural networks","volume":"25","author":"Krizhevsky","year":"2012","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Long, J., Shelhamer, E., and Darrell, T. (2015, January 7\u201312). Fully convolutional networks for semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298965"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Fu, G., Liu, C., Zhou, R., Sun, T., and Zhang, Q. (2017). Classification for high resolution remote sensing imagery using a fully convolutional network. Remote Sens., 9.","DOI":"10.3390\/rs9050498"},{"key":"ref_31","unstructured":"Sherrah, J. (2016). Fully convolutional networks for dense semantic labelling of high-resolution aerial imagery. arXiv."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"303","DOI":"10.1007\/s11263-009-0275-4","article-title":"The pascal visual object classes (voc) challenge","volume":"88","author":"Everingham","year":"2010","journal-title":"Int. J. Comput. Vis."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"e1264","DOI":"10.1002\/widm.1264","article-title":"Deep learning for remote sensing image classification: A survey","volume":"8","author":"Li","year":"2018","journal-title":"Wiley Interdiscip. Rev. Data Min. Knowl. Discov."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"166","DOI":"10.1016\/j.isprsjprs.2019.04.015","article-title":"Deep learning in remote sensing applications: A meta-analysis and review","volume":"152","author":"Ma","year":"2019","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"194","DOI":"10.1109\/36.46698","article-title":"Fuzzy supervised classification of remote sensing images","volume":"28","author":"Wang","year":"1990","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"3713","DOI":"10.1109\/JSTARS.2014.2330830","article-title":"Near-real time detection of beetle infestation in pine forests using MODIS data","volume":"7","author":"Anees","year":"2014","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"1717","DOI":"10.1109\/LGRS.2014.2306712","article-title":"A statistical framework for near-real time detection of beetle infestation in pine forests using MODIS data","volume":"11","author":"Anees","year":"2014","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"3359","DOI":"10.1109\/JSTARS.2015.2428306","article-title":"A relative density ratio-based framework for detection of land cover changes in MODIS NDVI time series","volume":"9","author":"Anees","year":"2015","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"173","DOI":"10.1080\/02693799308901949","article-title":"Artificial neural networks for land-cover classification and mapping","volume":"7","author":"Civco","year":"1993","journal-title":"Int. J. Geogr. Inf. Sci."},{"key":"ref_40","unstructured":"Chen, K., Tzeng, Y., Chen, C., Kao, W., and Ni, C. (1993, January 18\u201321). Classification of multispectral imagery using dynamic learning neural network. Proceedings of the IGARSS\u201993-IEEE International Geoscience and Remote Sensing Symposium, Tokyo, Japan."},{"key":"ref_41","unstructured":"Foody, G.M. (1999). Image classification with a neural network: From completely-crisp to fully-fuzzy situations. Adv. Remote Sens. GIS Anal., 17\u201337."},{"key":"ref_42","unstructured":"Flanagan, M., and Civco, D.L. (2001, January 23\u201327). Subpixel impervious surface mapping. Proceedings of the 2001 ASPRS Annual Convention, Bethesda, MD, USA."},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"1003","DOI":"10.14358\/PERS.69.9.1003","article-title":"Urban land-cover change detection through sub-pixel imperviousness mapping using remotely sensed data","volume":"69","author":"Yang","year":"2003","journal-title":"Photogramm. Eng. Remote Sens."},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"253","DOI":"10.1016\/j.rse.2006.09.005","article-title":"Sub-pixel mapping of urban land cover using multiple endmember spectral mixture analysis: Manaus, Brazil","volume":"106","author":"Powell","year":"2007","journal-title":"Remote Sens. Environ."},{"key":"ref_45","doi-asserted-by":"crossref","first-page":"1213","DOI":"10.14358\/PERS.74.10.1213","article-title":"Subpixel urban land cover estimation","volume":"74","author":"Walton","year":"2008","journal-title":"Photogramm. Eng. Remote Sens."},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"62","DOI":"10.1016\/j.rse.2013.02.005","article-title":"A spatially adaptive spectral mixture analysis for mapping subpixel urban impervious surface distribution","volume":"133","author":"Deng","year":"2013","journal-title":"Remote Sens. Environ."},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"4907","DOI":"10.1080\/0143116031000114851","article-title":"The use of backpropagating artificial neural networks in land cover classification","volume":"24","author":"Kavzoglu","year":"2003","journal-title":"Int. J. Remote Sens."},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"910","DOI":"10.3390\/s8020910","article-title":"Improving distributed runoff prediction in urbanized catchments with remote sensing based estimates of impervious surface cover","volume":"8","author":"Chormanski","year":"2008","journal-title":"Sensors"},{"key":"ref_49","unstructured":"Mohapatra, R.P., and Wu, C. (2008). Subpixel imperviousness estimation with IKONOS imagery: An artificial neural network approach. Remote Sens. Impervious Surfaces, 21\u201337."},{"key":"ref_50","doi-asserted-by":"crossref","first-page":"2397","DOI":"10.1109\/TGRS.2008.917601","article-title":"Medium spatial resolution satellite imagery for estimating and mapping urban impervious surfaces using LSMA and ANN","volume":"46","author":"Weng","year":"2008","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_51","doi-asserted-by":"crossref","first-page":"2089","DOI":"10.1016\/j.rse.2009.05.014","article-title":"Estimating impervious surfaces from medium spatial resolution imagery using the self-organizing map and multi-layer perceptron neural networks","volume":"113","author":"Hu","year":"2009","journal-title":"Remote Sens. Environ."},{"key":"ref_52","first-page":"1451","article-title":"Land-use classification of remotely sensed data using Kohonen self-organizing feature map neural networks","volume":"66","author":"Ji","year":"2000","journal-title":"Photogramm. Eng. Remote Sens."},{"key":"ref_53","doi-asserted-by":"crossref","unstructured":"Li, Z., and Eastman, J.R. (2006, January 28\u201329). Commitment and typicality measurements for fuzzy ARTMAP neural network. Proceedings of the Geoinformatics 2006: Geospatial Information Science, International Society for Optics and Photonics, Wuhan, China.","DOI":"10.1117\/12.712998"},{"key":"ref_54","doi-asserted-by":"crossref","unstructured":"Volpi, M., and Ferrari, V. (2015, January 7\u201312). Semantic segmentation of urban scenes by learning local class interactions. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, Boston, MA, USA.","DOI":"10.1109\/CVPRW.2015.7301377"},{"key":"ref_55","doi-asserted-by":"crossref","first-page":"881","DOI":"10.1109\/TGRS.2016.2616585","article-title":"Dense semantic labeling of subdecimeter resolution images with convolutional neural networks","volume":"55","author":"Volpi","year":"2016","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_56","doi-asserted-by":"crossref","unstructured":"Audebert, N., Le Saux, B., and Lef\u00e8vre, S. (2017). Segment-before-detect: Vehicle detection and classification through semantic segmentation of aerial images. Remote Sens., 9.","DOI":"10.3390\/rs9040368"},{"key":"ref_57","doi-asserted-by":"crossref","first-page":"2274","DOI":"10.1109\/TPAMI.2012.120","article-title":"SLIC superpixels compared to state-of-the-art superpixel methods","volume":"34","author":"Achanta","year":"2012","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_58","doi-asserted-by":"crossref","first-page":"167","DOI":"10.1023\/B:VISI.0000022288.19776.77","article-title":"Efficient graph-based image segmentation","volume":"59","author":"Felzenszwalb","year":"2004","journal-title":"Int. J. Comput. Vis."},{"key":"ref_59","doi-asserted-by":"crossref","first-page":"1915","DOI":"10.1109\/TPAMI.2012.231","article-title":"Learning hierarchical features for scene labeling","volume":"35","author":"Farabet","year":"2012","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_60","unstructured":"Pinherio, R.C.P.H., and Pedro, H. (2014, January 22\u201324). Recurrent convolutional neural networks for scene parsing. Proceedings of the International Conference of Machine Learning, Bejing, China."},{"key":"ref_61","unstructured":"Kampffmeyer, M., Salberg, A.B., and Jenssen, R. (July, January 26). Semantic segmentation of small objects and modeling of uncertainty in urban remote sensing images using deep convolutional neural networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, Las Vegas, NV, USA."},{"key":"ref_62","doi-asserted-by":"crossref","first-page":"749","DOI":"10.1080\/15481603.2018.1564499","article-title":"Semantic segmentation of high spatial resolution images with deep neural networks","volume":"56","author":"Yang","year":"2019","journal-title":"GIScience Remote Sens."},{"key":"ref_63","doi-asserted-by":"crossref","first-page":"1527","DOI":"10.1162\/neco.2006.18.7.1527","article-title":"A fast learning algorithm for deep belief nets","volume":"18","author":"Hinton","year":"2006","journal-title":"Neural Comput."},{"key":"ref_64","doi-asserted-by":"crossref","first-page":"504","DOI":"10.1126\/science.1127647","article-title":"Reducing the dimensionality of data with neural networks","volume":"313","author":"Hinton","year":"2006","journal-title":"Science"},{"key":"ref_65","unstructured":"Salakhutdinov, R., and Hinton, G. (2009, January 16\u201318). Deep boltzmann machines. Proceedings of the Artificial Intelligence and Statistics, Clearwater Beach, FL, USA."},{"key":"ref_66","first-page":"3371","article-title":"Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion","volume":"11","author":"Vincent","year":"2010","journal-title":"J. Mach. Learn. Res."},{"key":"ref_67","doi-asserted-by":"crossref","unstructured":"Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., and Fei-Fei, L. (2009, January 20\u201325). Imagenet: A large-scale hierarchical image database. Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA.","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"ref_68","doi-asserted-by":"crossref","unstructured":"Zeiler, M.D., and Fergus, R. (2014). Visualizing and understanding convolutional networks. European Conference on Computer Vision, Springer.","DOI":"10.1007\/978-3-319-10590-1_53"},{"key":"ref_69","unstructured":"Simonyan, K., and Zisserman, A. (2014). Very deep convolutional networks for large-scale image recognition. arXiv."},{"key":"ref_70","doi-asserted-by":"crossref","unstructured":"Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., and Rabinovich, A. (2015, January 7\u201312). Going deeper with convolutions. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"ref_71","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2015, January 7\u201313). Delving deep into rectifiers: Surpassing human-level performance on imagenet classification. Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.123"},{"key":"ref_72","first-page":"1929","article-title":"Dropout: A simple way to prevent neural networks from overfitting","volume":"15","author":"Srivastava","year":"2014","journal-title":"J. Mach. Learn. Res."},{"key":"ref_73","unstructured":"Ioffe, S., and Szegedy, C. (2015). Batch normalization: Accelerating deep network training by reducing internal covariate shift. arXiv."},{"key":"ref_74","doi-asserted-by":"crossref","unstructured":"Bottou, L. (2010). Large-scale machine learning with stochastic gradient descent. Proceedings of COMPSTAT\u20192010, Springer.","DOI":"10.1007\/978-3-7908-2604-3_16"},{"key":"ref_75","unstructured":"Kingma, D.P., and Ba, J. (2014). Adam: A method for stochastic optimization. arXiv."},{"key":"ref_76","unstructured":"Mou, L., and Zhu, X.X. (2018). RiFCN: Recurrent network in fully convolutional network for semantic segmentation of high resolution remote sensing images. arXiv."},{"key":"ref_77","doi-asserted-by":"crossref","unstructured":"Noh, H., Hong, S., and Han, B. (2015, January 7\u201313). Learning deconvolution network for semantic segmentation. Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.178"},{"key":"ref_78","doi-asserted-by":"crossref","first-page":"834","DOI":"10.1109\/TPAMI.2017.2699184","article-title":"Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs","volume":"40","author":"Chen","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_79","doi-asserted-by":"crossref","first-page":"3308","DOI":"10.1080\/01431161.2018.1528024","article-title":"A scale robust convolutional neural network for automatic building extraction from aerial and satellite imagery","volume":"40","author":"Ji","year":"2019","journal-title":"Int. J. Remote Sens."},{"key":"ref_80","doi-asserted-by":"crossref","unstructured":"Chen, G., Li, C., Wei, W., Jing, W., Wo\u017aniak, M., Bla\u017eauskas, T., and Dama\u0161evi\u010dius, R. (2019). Fully convolutional neural network with augmented atrous spatial pyramid pool and fully connected fusion path for high resolution remote sensing image segmentation. Appl. Sci., 9.","DOI":"10.3390\/app9091816"},{"key":"ref_81","unstructured":"Chen, L.C., Papandreou, G., Schroff, F., and Adam, H. (2017). Rethinking atrous convolution for semantic image segmentation. arXiv."},{"key":"ref_82","doi-asserted-by":"crossref","unstructured":"Zheng, S., Jayasumana, S., Romera-Paredes, B., Vineet, V., Su, Z., Du, D., Huang, C., and Torr, P.H. (2015, January 7\u201313). Conditional random fields as recurrent neural networks. Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.179"},{"key":"ref_83","doi-asserted-by":"crossref","unstructured":"Lin, G., Shen, C., Van Den Hengel, A., and Reid, I. (2016, January 27\u201330). Efficient piecewise training of deep structured models for semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.348"},{"key":"ref_84","doi-asserted-by":"crossref","first-page":"016501","DOI":"10.1117\/1.JRS.13.016501","article-title":"Semantic segmentation of multisensor remote sensing imagery with deep ConvNets and higher-order conditional random fields","volume":"13","author":"Liu","year":"2019","journal-title":"J. Appl. Remote Sens."},{"key":"ref_85","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27-30). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_86","unstructured":"Wu, Z., Shen, C., and Hengel, A.V.D. (2016). High-performance semantic segmentation using very deep fully convolutional networks. arXiv."},{"key":"ref_87","doi-asserted-by":"crossref","unstructured":"Wang, H., Wang, Y., Zhang, Q., Xiang, S., and Pan, C. (2017). Gated convolutional neural network for semantic segmentation in high-resolution images. Remote Sens., 9.","DOI":"10.3390\/rs9050446"},{"key":"ref_88","doi-asserted-by":"crossref","unstructured":"Zhao, H., Shi, J., Qi, X., Wang, X., and Jia, J. (2017, January 21\u201326). Pyramid scene parsing network. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.660"},{"key":"ref_89","doi-asserted-by":"crossref","first-page":"3252","DOI":"10.1109\/JSTARS.2018.2860989","article-title":"Semantic segmentation for high spatial resolution remote sensing images based on convolution neural network and pyramid pooling module","volume":"11","author":"Yu","year":"2018","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_90","doi-asserted-by":"crossref","first-page":"1904","DOI":"10.1109\/TPAMI.2015.2389824","article-title":"Spatial pyramid pooling in deep convolutional networks for visual recognition","volume":"37","author":"He","year":"2015","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_91","doi-asserted-by":"crossref","unstructured":"Chen, Q., Wang, L., Wu, Y., Wu, G., Guo, Z., and Waslander, S.L. (2018). Aerial imagery for roof segmentation: A large-scale dataset towards automatic mapping of buildings. arXiv.","DOI":"10.1016\/j.isprsjprs.2018.11.011"},{"key":"ref_92","unstructured":"Audebert, N., Le Saux, B., and Lef\u00e8vre, S. (2016). Semantic segmentation of earth observation data using multimodal and multi-scale deep networks. Asian Conference on Computer Vision, Springer."},{"key":"ref_93","doi-asserted-by":"crossref","unstructured":"Audebert, N., Boulch, A., Randrianarivo, H., Le Saux, B., Ferecatu, M., Lef\u00e8vre, S., and Marlet, R. (2017, January 6\u20138). Deep learning for urban remote sensing. Proceedings of the 2017 IEEE Joint Urban Remote Sensing Event (JURSE), Dubai, United Arab Emirates.","DOI":"10.1109\/JURSE.2017.7924536"},{"key":"ref_94","doi-asserted-by":"crossref","unstructured":"Liu, Y., Minh Nguyen, D., Deligiannis, N., Ding, W., and Munteanu, A. (2017). Hourglass-shapenetwork based semantic segmentation for high resolution aerial imagery. Remote Sens., 9.","DOI":"10.3390\/rs9060522"},{"key":"ref_95","doi-asserted-by":"crossref","first-page":"29","DOI":"10.5194\/isprs-annals-IV-1-29-2018","article-title":"Semantic Segmentation of Aerial Imagery Via Multi-Scale Shuffling Convolutional Neural Networks with Deep Supervision","volume":"4","author":"Chen","year":"2018","journal-title":"ISPRS Ann. Photogramm. Remote Sens. Spat. Inf. Sci."},{"key":"ref_96","doi-asserted-by":"crossref","unstructured":"Panboonyuen, T., Jitkajornwanich, K., Lawawirojwong, S., Srestasathiern, P., and Vateekul, P. (2019). Semantic segmentation on remotely sensed images using an enhanced global convolutional network with channel attention and domain specific transfer learning. Remote Sens., 11.","DOI":"10.20944\/preprints201812.0090.v3"},{"key":"ref_97","doi-asserted-by":"crossref","unstructured":"Shang, R., Zhang, J., Jiao, L., Li, Y., Marturi, N., and Stolkin, R. (2020). Multi-scale Adaptive Feature Fusion Network for Semantic Segmentation in Remote Sensing Images. Remote Sens., 12.","DOI":"10.3390\/rs12050872"},{"key":"ref_98","doi-asserted-by":"crossref","unstructured":"Zhang, J., Lin, S., Ding, L., and Bruzzone, L. (2020). Multi-scale context aggregation for semantic segmentation of remote sensing images. Remote Sens., 12.","DOI":"10.3390\/rs12040701"},{"key":"ref_99","doi-asserted-by":"crossref","unstructured":"Ronneberger, O., Fischer, P., and Brox, T. (2015). U-net: Convolutional networks for biomedical image segmentation. International Conference on Medical Image Computing and Computer-Assisted Intervention, Springer.","DOI":"10.1007\/978-3-319-24574-4_28"},{"key":"ref_100","doi-asserted-by":"crossref","unstructured":"Guo, Z., Shengoku, H., Wu, G., Chen, Q., Yuan, W., Shi, X., Shao, X., Xu, Y., and Shibasaki, R. (2018, January 22\u201327). Semantic segmentation for urban planning maps based on U-Net. Proceedings of the IGARSS 2018-2018 IEEE International Geoscience and Remote Sensing Symposium, Valencia, Spain.","DOI":"10.1109\/IGARSS.2018.8519049"},{"key":"ref_101","doi-asserted-by":"crossref","unstructured":"Li, W., He, C., Fang, J., Zheng, J., Fu, H., and Yu, L. (2019). Semantic segmentation-based building footprint extraction using very high-resolution satellite images and multi-source GIS data. Remote Sens., 11.","DOI":"10.3390\/rs11040403"},{"key":"ref_102","doi-asserted-by":"crossref","unstructured":"Pan, Z., Xu, J., Guo, Y., Hu, Y., and Wang, G. (2020). Deep Learning Segmentation and Classification for Urban Village Using a Worldview Satellite Image Based on U-Net. Remote Sens., 12.","DOI":"10.3390\/rs12101574"},{"key":"ref_103","doi-asserted-by":"crossref","unstructured":"Yi, Y., Zhang, Z., Zhang, W., Zhang, C., Li, W., and Zhao, T. (2019). Semantic segmentation of urban buildings from vhr remote sensing imagery using a deep convolutional neural network. Remote Sens., 11.","DOI":"10.3390\/rs11151774"},{"key":"ref_104","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/j.isprsjprs.2019.07.007","article-title":"TreeUNet: Adaptive Tree convolutional neural networks for subdecimeter aerial image segmentation","volume":"156","author":"Yue","year":"2019","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_105","doi-asserted-by":"crossref","first-page":"3954","DOI":"10.1109\/JSTARS.2018.2833382","article-title":"Deepunet: A deep fully convolutional network for pixel-level sea-land segmentation","volume":"11","author":"Li","year":"2018","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_106","doi-asserted-by":"crossref","unstructured":"Robinson, Y.H., Vimal, S., Khari, M., Hern\u00e1ndez, F.C.L., and Crespo, R.G. (2020). Tree-based convolutional neural networks for object classification in segmented satellite images. Int. J. High Perform. Comput. Appl.","DOI":"10.1177\/1094342020945026"},{"key":"ref_107","doi-asserted-by":"crossref","unstructured":"Liu, W., Su, F., Jin, X., Li, H., and Qin, R. (2020). Bispace Domain Adaptation Network for Remotely Sensed Semantic Segmentation. IEEE Trans. Geosci. Remote Sens.","DOI":"10.1109\/TGRS.2020.3035561"},{"key":"ref_108","doi-asserted-by":"crossref","first-page":"94","DOI":"10.1016\/j.isprsjprs.2020.01.013","article-title":"Resunet-a: A deep learning framework for semantic segmentation of remotely sensed data","volume":"162","author":"Diakogiannis","year":"2020","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_109","doi-asserted-by":"crossref","unstructured":"Dong, R., Bai, L., and Li, F. (2020). SiameseDenseU-Net-based Semantic Segmentation of Urban Remote Sensing Images. Math. Probl. Eng., 2020.","DOI":"10.1155\/2020\/1515630"},{"key":"ref_110","doi-asserted-by":"crossref","first-page":"65347","DOI":"10.1109\/ACCESS.2019.2917952","article-title":"DenseU-net-based semantic segmentation of small objects in urban remote sensing images","volume":"7","author":"Dong","year":"2019","journal-title":"IEEE Access"},{"key":"ref_111","doi-asserted-by":"crossref","first-page":"2481","DOI":"10.1109\/TPAMI.2016.2644615","article-title":"Segnet: A deep convolutional encoder-decoder architecture for image segmentation","volume":"39","author":"Badrinarayanan","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_112","doi-asserted-by":"crossref","first-page":"158","DOI":"10.1016\/j.isprsjprs.2017.11.009","article-title":"Classification with an edge: Improving semantic image segmentation with boundary detection","volume":"135","author":"Marmanis","year":"2018","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_113","doi-asserted-by":"crossref","unstructured":"Cao, R., and Qiu, G. (2018, January 25\u201327). Urban land use classification based on aerial and ground images. Proceedings of the 2018 IEEE International Conference on Content-Based Multimedia Indexing (CBMI), Bordeaux, France.","DOI":"10.1109\/CBMI.2018.8516552"},{"key":"ref_114","doi-asserted-by":"crossref","unstructured":"Bischke, B., Helber, P., Folz, J., Borth, D., and Dengel, A. (2019, January 22\u201329). Multi-task learning for segmentation of building footprints with deep neural networks. Proceedings of the 2019 IEEE International Conference on Image Processing (ICIP), Taipei, Taiwan.","DOI":"10.1109\/ICIP.2019.8803050"},{"key":"ref_115","doi-asserted-by":"crossref","first-page":"297","DOI":"10.1016\/j.neucom.2018.11.051","article-title":"Problems of encoder-decoder frameworks for high-resolution remote sensing image segmentation: Structural stereotype and insufficient learning","volume":"330","author":"Sun","year":"2019","journal-title":"Neurocomputing"},{"key":"ref_116","doi-asserted-by":"crossref","unstructured":"Abdollahi, A., Pradhan, B., and Alamri, A.M. (2020). An Ensemble Architecture of Deep Convolutional Segnet and Unet Networks for Building Semantic Segmentation from High-resolution Aerial Images. Geocarto Int., 1\u201313.","DOI":"10.1080\/10106049.2020.1856199"},{"key":"ref_117","doi-asserted-by":"crossref","first-page":"138","DOI":"10.26833\/ijeg.645426","article-title":"Feature Extraction from Satellite Images Using Segnet and Fully Convolutional Networks (FCN)","volume":"5","author":"Sariturk","year":"2020","journal-title":"Int. J. Eng. Geosci."},{"key":"ref_118","unstructured":"Hong, S., Noh, H., and Han, B. (2015). Decoupled deep neural network for semi-supervised semantic segmentation. Advances in neural information processing systems. arXiv."},{"key":"ref_119","doi-asserted-by":"crossref","unstructured":"Pinheiro, P.O., Lin, T.Y., Collobert, R., and Doll\u00e1r, P. (2016). Learning to refine object segments. European Conference on Computer Vision, Springer.","DOI":"10.1007\/978-3-319-46448-0_5"},{"key":"ref_120","unstructured":"Pinheiro, P.O., Collobert, R., and Doll\u00e1r, P. (2015). Learning to segment object candidates. Advances in Neural Information Processing Systems. arXiv."},{"key":"ref_121","doi-asserted-by":"crossref","first-page":"60","DOI":"10.1016\/j.isprsjprs.2018.04.014","article-title":"Algorithms for semantic segmentation of multispectral remote sensing imagery using deep learning","volume":"145","author":"Kemker","year":"2018","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_122","doi-asserted-by":"crossref","unstructured":"Lin, G., Milan, A., Shen, C., and Reid, I. (2017, January 21\u201326). Refinenet: Multi-path refinement networks for high-resolution semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.549"},{"key":"ref_123","doi-asserted-by":"crossref","unstructured":"Chen, L.C., Zhu, Y., Papandreou, G., Schroff, F., and Adam, H. (2018, January 8\u201314). Encoder-decoder with atrous separable convolution for semantic image segmentation. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01234-2_49"},{"key":"ref_124","doi-asserted-by":"crossref","unstructured":"Du, S., Du, S., Liu, B., and Zhang, X. (2020). Incorporating DeepLabv3+ and object-based image analysis for semantic segmentation of very high resolution remote sensing images. Int. J. Digit. Earth, 1\u201322.","DOI":"10.1080\/17538947.2020.1831087"},{"key":"ref_125","doi-asserted-by":"crossref","first-page":"1633","DOI":"10.1109\/JSTARS.2018.2810320","article-title":"Symmetrical dense-shortcut deep fully convolutional networks for semantic segmentation of very-high-resolution remote sensing images","volume":"11","author":"Chen","year":"2018","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_126","doi-asserted-by":"crossref","first-page":"381","DOI":"10.1080\/2150704X.2018.1557791","article-title":"A Y-Net deep learning method for road segmentation using high-resolution visible remote sensing images","volume":"10","author":"Li","year":"2019","journal-title":"Remote Sens. Lett."},{"key":"ref_127","doi-asserted-by":"crossref","unstructured":"Benjdira, B., Bazi, Y., Koubaa, A., and Ouni, K. (2019). Unsupervised domain adaptation using generative adversarial networks for semantic segmentation of aerial images. Remote Sens., 11.","DOI":"10.3390\/rs11111369"},{"key":"ref_128","unstructured":"Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., Courville, A., and Bengio, Y. (2014). Generative adversarial nets. Advances in Neural Information Processing Systems, ACM."},{"key":"ref_129","doi-asserted-by":"crossref","first-page":"2092","DOI":"10.1109\/LGRS.2017.2752750","article-title":"MARTA GANs: Unsupervised representation learning for remote sensing image classification","volume":"14","author":"Lin","year":"2017","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_130","doi-asserted-by":"crossref","first-page":"212","DOI":"10.1109\/LGRS.2017.2780890","article-title":"Semisupervised hyperspectral image classification based on generative adversarial networks","volume":"15","author":"Zhan","year":"2017","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_131","doi-asserted-by":"crossref","unstructured":"Torrey, L., and Shavlik, J. (2010). Transfer learning. Handbook of Research on Machine Learning Applications and Trends: Algorithms, Methods, and Techniques, IGI Global.","DOI":"10.4018\/978-1-60566-766-9.ch011"},{"key":"ref_132","doi-asserted-by":"crossref","first-page":"9","DOI":"10.1186\/s40537-016-0043-6","article-title":"A survey of transfer learning","volume":"3","author":"Weiss","year":"2016","journal-title":"J. Big Data"},{"key":"ref_133","doi-asserted-by":"crossref","unstructured":"Tan, C., Sun, F., Kong, T., Zhang, W., Yang, C., and Liu, C. (2018). A survey on deep transfer learning. International Conference on Artificial Neural Networks, Springer.","DOI":"10.1007\/978-3-030-01424-7_27"},{"key":"ref_134","doi-asserted-by":"crossref","unstructured":"Du, Z., Yang, J., Ou, C., and Zhang, T. (2019). Smallholder crop area mapped with a semantic segmentation deep learning method. Remote Sens., 11.","DOI":"10.3390\/rs11070888"},{"key":"ref_135","doi-asserted-by":"crossref","first-page":"59","DOI":"10.1016\/j.isprsjprs.2019.02.006","article-title":"Semantic segmentation of slums in satellite images using transfer learning on fully convolutional neural networks","volume":"150","author":"Wurm","year":"2019","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_136","doi-asserted-by":"crossref","unstructured":"Lv, Q., Dou, Y., Niu, X., Xu, J., Xu, J., and Xia, F. (2015). Urban land use and land cover classification using remotely sensed SAR data through deep belief networks. J. Sens., 2015.","DOI":"10.1155\/2015\/538063"},{"key":"ref_137","doi-asserted-by":"crossref","unstructured":"Nogueira, K., Dalla Mura, M., Chanussot, J., Schwartz, W.R., and dos Santos, J.A. (2016, January 4\u20138). Learning to semantically segment high-resolution remote sensing images. Proceedings of the 2016 23rd IEEE International Conference on Pattern Recognition (ICPR), Cancun, Mexico.","DOI":"10.1109\/ICPR.2016.7900187"},{"key":"ref_138","doi-asserted-by":"crossref","first-page":"155753","DOI":"10.1109\/ACCESS.2020.3012701","article-title":"Mask-R-FCN: A Deep Fusion Network for Semantic Segmentation","volume":"8","author":"Zhang","year":"2020","journal-title":"IEEE Access"},{"key":"ref_139","doi-asserted-by":"crossref","unstructured":"Poomani, M., Sutha, J., and Soundar, K.R. (2020). Wiener filter based deep convolutional network approach for classification of satellite images. J. Ambient. Intell. Humaniz. Comput.","DOI":"10.1007\/s12652-020-02410-3"},{"key":"ref_140","doi-asserted-by":"crossref","unstructured":"Mattyus, G., Wang, S., Fidler, S., and Urtasun, R. (2015, January 7\u201313). Enhancing road maps by parsing aerial images around the world. Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.197"},{"key":"ref_141","doi-asserted-by":"crossref","first-page":"177","DOI":"10.5194\/isprs-annals-III-3-177-2016","article-title":"Fast semantic segmentation of 3D point clouds with strongly varying density","volume":"3","author":"Hackel","year":"2016","journal-title":"ISPRS Ann. Photogramm. Remote Sens. Spat. Inf. Sci."},{"key":"ref_142","doi-asserted-by":"crossref","first-page":"2920","DOI":"10.1109\/TGRS.2018.2878510","article-title":"Aerial LaneNet: Lane-marking semantic segmentation in aerial imagery using wavelet-enhanced cost-sensitive symmetric fully convolutional neural networks","volume":"57","author":"Azimi","year":"2018","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_143","doi-asserted-by":"crossref","first-page":"140","DOI":"10.1016\/j.isprsjprs.2019.11.006","article-title":"Superpixel-enhanced deep neural forest for remote sensing image semantic segmentation","volume":"159","author":"Mi","year":"2020","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_144","doi-asserted-by":"crossref","unstructured":"Paisitkriangkrai, S., Sherrah, J., Janney, P., and Hengel, V.D. (2015, January 7\u201312). Effective semantic pixel labelling with convolutional networks and conditional random fields. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, Boston, MA, USA.","DOI":"10.1109\/CVPRW.2015.7301381"},{"key":"ref_145","doi-asserted-by":"crossref","unstructured":"Maggiori, E., Tarabalka, Y., Charpiat, G., and Alliez, P. (2016). High-resolution semantic labeling with convolutional neural networks. arXiv.","DOI":"10.1109\/IGARSS.2017.8128163"},{"key":"ref_146","doi-asserted-by":"crossref","first-page":"20","DOI":"10.1016\/j.isprsjprs.2017.11.011","article-title":"Beyond RGB: Very high resolution urban remote sensing with multimodal deep networks","volume":"140","author":"Audebert","year":"2018","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_147","doi-asserted-by":"crossref","unstructured":"Piramanayagam, S., Saber, E., Schwartzkopf, W., and Koehler, F.W. (2018). Supervised classification of multisensor remotely sensed images using a deep learning framework. Remote Sens., 10.","DOI":"10.3390\/rs10091429"},{"key":"ref_148","doi-asserted-by":"crossref","unstructured":"Pan, X., Gao, L., Marinoni, A., Zhang, B., Yang, F., and Gamba, P. (2018). Semantic labeling of high resolution aerial imagery and LiDAR data with fine segmentation network. Remote Sens., 10.","DOI":"10.3390\/rs10050743"},{"key":"ref_149","doi-asserted-by":"crossref","first-page":"6361","DOI":"10.1109\/TGRS.2018.2837357","article-title":"Recurrent multiresolution convolutional networks for VHR image classification","volume":"56","author":"Bergado","year":"2018","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_150","doi-asserted-by":"crossref","unstructured":"Liu, Y., Piramanayagam, S., Monteiro, S.T., and Saber, E. (2017, January 21\u201326). Dense semantic labeling of very-high-resolution aerial imagery and lidar with fully-convolutional neural networks and higher-order CRFs. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, Honolulu, HI, USA.","DOI":"10.1109\/CVPRW.2017.200"},{"key":"ref_151","doi-asserted-by":"crossref","first-page":"78","DOI":"10.1016\/j.isprsjprs.2017.12.007","article-title":"Semantic labeling in very high resolution images via a self-cascaded convolutional neural network","volume":"145","author":"Liu","year":"2018","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_152","unstructured":"Marcu, A., and Leordeanu, M. (2016). Dual local-global contextual pathways for recognition in aerial imagery. arXiv."},{"key":"ref_153","unstructured":"Marcu, A.E., and Leordeanu, M. (2017, January 4\u20139). Object contra context: Dual local-global semantic segmentation in aerial images. Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, San Francisco, CA, USA."},{"key":"ref_154","unstructured":"Li, P., Lin, Y., and Schultz-Fellenz, E. (2018). Contextual Hourglass Network for Semantic Segmentation of High Resolution Aerial Imagery. arXiv."},{"key":"ref_155","doi-asserted-by":"crossref","first-page":"7503","DOI":"10.1109\/TGRS.2019.2913861","article-title":"Dynamic multicontext segmentation of remote sensing images based on convolutional networks","volume":"57","author":"Nogueira","year":"2019","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_156","doi-asserted-by":"crossref","first-page":"48","DOI":"10.1016\/j.isprsjprs.2017.08.011","article-title":"Contextually guided very-high-resolution imagery classification with semantic segments","volume":"132","author":"Zhao","year":"2017","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_157","doi-asserted-by":"crossref","first-page":"473","DOI":"10.5194\/isprs-annals-III-3-473-2016","article-title":"Semantic segmentation of aerial images with an ensemble of CNSS","volume":"3","author":"Marmanis","year":"2016","journal-title":"ISPRS Ann. Photogramm. Remote Sens. Spat. Inf. Sci. 2016"},{"key":"ref_158","first-page":"94050K","article-title":"Building and road detection from large aerial imagery","volume":"Volume 9405","author":"Saito","year":"2015","journal-title":"Image Processing: Machine Vision Applications VIII"},{"key":"ref_159","doi-asserted-by":"crossref","first-page":"1","DOI":"10.2352\/ISSN.2470-1173.2016.10.ROBVIS-392","article-title":"Multiple object extraction from aerial imagery with convolutional neural networks","volume":"2016","author":"Saito","year":"2016","journal-title":"Electron. Imaging"},{"key":"ref_160","doi-asserted-by":"crossref","first-page":"70","DOI":"10.1016\/j.isprsjprs.2019.05.013","article-title":"Improving public data for building segmentation from Convolutional Neural Networks (CNNs) for fused airborne lidar and image data using active contours","volume":"154","author":"Griffiths","year":"2019","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_161","doi-asserted-by":"crossref","first-page":"6054","DOI":"10.1109\/TGRS.2017.2719738","article-title":"Learning aerial image segmentation from online maps","volume":"55","author":"Kaiser","year":"2017","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_162","doi-asserted-by":"crossref","unstructured":"Li, J., Ding, W., Li, H., and Liu, C. (2017, January 27\u201329). Semantic segmentation for high-resolution aerial imagery using multi-skip network and Markov random fields. Proceedings of the 2017 IEEE International Conference on Unmanned Systems (ICUS), Beijing, China.","DOI":"10.1109\/ICUS.2017.8278309"},{"key":"ref_163","doi-asserted-by":"crossref","first-page":"48","DOI":"10.1016\/j.isprsjprs.2018.06.007","article-title":"Deep multi-task learning for a geographically-regularized semantic segmentation of aerial images","volume":"144","author":"Volpi","year":"2018","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_164","doi-asserted-by":"crossref","first-page":"184","DOI":"10.1016\/j.isprsjprs.2019.11.004","article-title":"Building segmentation through a gated graph convolutional neural network with deep structured feature embedding","volume":"159","author":"Shi","year":"2020","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_165","doi-asserted-by":"crossref","unstructured":"Tschannen, M., Cavigelli, L., Mentzer, F., Wiatowski, T., and Benini, L. (September, January 28). Deep structured features for semantic segmentation. Proceedings of the 2017 25th European Signal Processing Conference (EUSIPCO), Kos, Greece.","DOI":"10.23919\/EUSIPCO.2017.8081169"},{"key":"ref_166","unstructured":"(2020, December 21). ISPRS Vaihingen 2D Semantic Labeling Dataset. Available online: http:\/\/www2.isprs.org\/commissions\/comm3\/wg4\/2d-sem-label-vaihingen.html."},{"key":"ref_167","unstructured":"(2020, December 21). ISPRS Potsdam 2D Semantic Labeling Dataset. Available online: http:\/\/www2.isprs.org\/commissions\/comm3\/wg4\/2d-sem-label-potsdam.html."},{"key":"ref_168","doi-asserted-by":"crossref","first-page":"40","DOI":"10.1109\/MGRS.2015.2397448","article-title":"2015 IEEE GRSS data fusion contest: Extremely high resolution LidAR and optical data [technical committees]","volume":"3","author":"Moser","year":"2015","journal-title":"IEEE Geosci. Remote Sens. Mag."},{"key":"ref_169","doi-asserted-by":"crossref","unstructured":"Qin, Y., Wu, Y., Li, B., Gao, S., Liu, M., and Zhan, Y. (2019). Semantic segmentation of building roof in dense urban environment with deep convolutional neural network: A case study using GF2 VHR imagery in China. Sensors, 19.","DOI":"10.3390\/s19051164"},{"key":"ref_170","unstructured":"Mnih, V. (2013). Machine Learning for Aerial Image Labeling, Citeseer."},{"key":"ref_171","unstructured":"Gerke, M. (2020, December 21). Use of the Stair Vision Library within the ISPRS 2D Semantic Labeling Benchmark (Vaihingen). Available online: https:\/\/doi.org\/10.13140\/2.1.5015.9683."},{"key":"ref_172","doi-asserted-by":"crossref","unstructured":"Huang, G., Liu, Z., Van Der Maaten, L., and Weinberger, K.Q. (2017, January 21\u201326). Densely connected convolutional networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.243"},{"key":"ref_173","doi-asserted-by":"crossref","unstructured":"Bisong, E. (2019). Google colaboratory. Building Machine Learning and Deep Learning Models on Google Cloud Platform, Springer.","DOI":"10.1007\/978-1-4842-4470-8"},{"key":"ref_174","doi-asserted-by":"crossref","first-page":"2868","DOI":"10.1109\/JSTARS.2016.2582921","article-title":"Semantic labeling of aerial and satellite imagery","volume":"9","author":"Paisitkriangkrai","year":"2016","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_175","unstructured":"Kipf, T.N., and Welling, M. (2016). Semi-supervised classification with graph convolutional networks. arXiv."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/4\/808\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T05:26:45Z","timestamp":1760160405000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/4\/808"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,2,23]]},"references-count":175,"journal-issue":{"issue":"4","published-online":{"date-parts":[[2021,2]]}},"alternative-id":["rs13040808"],"URL":"https:\/\/doi.org\/10.3390\/rs13040808","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,2,23]]}}}