{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,24]],"date-time":"2026-06-24T15:14:47Z","timestamp":1782314087952,"version":"3.54.5"},"reference-count":41,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2021,2,1]],"date-time":"2021-02-01T00:00:00Z","timestamp":1612137600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2021,2,12]],"date-time":"2021-02-12T00:00:00Z","timestamp":1613088000000},"content-version":"vor","delay-in-days":11,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Mobile Netw Appl"],"published-print":{"date-parts":[[2021,2]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>In recent years, the success of deep learning in natural scene image processing has boosted its application in the analysis of remote sensing images. In this paper, we apply Convolutional Neural Networks (CNN) to the semantic segmentation of remote sensing images. We improve the Encoder-Decoder CNN structure SegNet with index pooling and U-net to make them suitable for multi-target semantic segmentation of remote sensing images. The results show that these two models have their own advantages and disadvantages in the segmentation of different objects. In addition, we propose an integrated algorithm that integrates these two models. 
Experimental results show that the presented integrated algorithm can exploit the advantages of both models for multi-target segmentation and achieve better segmentation than either model alone.<\/jats:p>","DOI":"10.1007\/s11036-020-01703-3","type":"journal-article","created":{"date-parts":[[2021,2,13]],"date-time":"2021-02-13T19:10:20Z","timestamp":1613243420000},"page":"200-215","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":95,"title":["Convolutional Neural Network for the Semantic Segmentation of Remote Sensing Images"],"prefix":"10.1007","volume":"26","author":[{"given":"Muhammad","family":"Alam","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jian-Feng","family":"Wang","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Cong","family":"Guangpei","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"LV","family":"Yunrong","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Yuanfang","family":"Chen","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2021,2,12]]},"reference":[{"key":"1703_CR1","doi-asserted-by":"crossref","unstructured":"Maggiori E, Tarabalka Y, Charpiat G, Alliez P (2016) Fully convolutional neural networks for remote sensing image classification. In: Geoscience remote sensing symposium","DOI":"10.1109\/IGARSS.2016.7730322"},{"issue":"8","key":"1703_CR2","doi-asserted-by":"publisher","first-page":"2936","DOI":"10.1109\/TGRS.2011.2113186","volume":"49","author":"G Bilgin","year":"2011","unstructured":"Bilgin G, Erturk S, Yildirim T (2011) Segmentation of hyperspectral images via subtractive clustering and cluster validation using one-class support vector machines. 
IEEE Trans Geosci Remote Sens 49 (8):2936\u20132944","journal-title":"IEEE Trans Geosci Remote Sens"},{"issue":"99","key":"1703_CR3","first-page":"1","volume":"PP","author":"E Maggiori","year":"2017","unstructured":"Maggiori E, Tarabalka Y, Charpiat G, Alliez P (2017) Recurrent neural networks to enhance satellite image classification maps. IEEE Trans Geosci Remote Sens PP(99):1\u201310","journal-title":"IEEE Trans Geosci Remote Sens"},{"key":"1703_CR4","doi-asserted-by":"crossref","unstructured":"Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A (2015) Going deeper with convolutions:1\u20139","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"1703_CR5","doi-asserted-by":"crossref","unstructured":"Paisitkriangkrai S, Sherrah J, Janney P, Hengel VD (2015) Effective semantic pixel labelling with convolutional networks and conditional random fields. In: Computer Vision Pattern Recognition Workshops","DOI":"10.1109\/CVPRW.2015.7301381"},{"key":"1703_CR6","unstructured":"Chen L-C, Papandreou G, Kokkinos I, Murphy K, Yuille AL (2014) Semantic image segmentation with deep convolutional nets and fully connected crfs, arXiv:1412.7062"},{"issue":"7","key":"1703_CR7","first-page":"1","volume":"5","author":"M Johnson","year":"2008","unstructured":"Johnson M, Shotton J, Cipolla R (2008) Semantic texton forests for image categorization and segmentation. Proc IEEE Cvpr 5(7):1\u20138","journal-title":"Proc IEEE Cvpr"},{"issue":"4","key":"1703_CR8","doi-asserted-by":"publisher","first-page":"193","DOI":"10.1007\/BF00344251","volume":"36","author":"K Fukushima","year":"1980","unstructured":"Fukushima K (1980) Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position. 
Biol Cybern 36(4):193\u2013202","journal-title":"Biol Cybern"},{"issue":"1","key":"1703_CR9","doi-asserted-by":"publisher","first-page":"62","DOI":"10.1109\/TSMC.1979.4310076","volume":"9","author":"N Ohtsu","year":"2007","unstructured":"Ohtsu N (2007) A threshold selection method from gray-level histograms. IEEE Trans Sys Man Cybern 9(1):62\u201366","journal-title":"IEEE Trans Sys Man Cybern"},{"key":"1703_CR10","unstructured":"Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. In: International Conference on Neural Information Processing Systems"},{"key":"1703_CR11","doi-asserted-by":"crossref","unstructured":"Bosch A, Zisserman A, Munoz X (2007) Image classification using random forests and ferns. In: IEEE International Conference on Computer Vision","DOI":"10.1109\/ICCV.2007.4409066"},{"key":"1703_CR12","doi-asserted-by":"publisher","first-page":"48","DOI":"10.1016\/j.isprsjprs.2017.08.011","volume":"132","author":"W Zhao","year":"2017","unstructured":"Zhao W, Du S, Qiao W, Emery WJ (2017) Contextually guided very-high-resolution imagery classification with semantic segments. ISPRS J Photogramm Remote Sens 132:48\u201360","journal-title":"ISPRS J Photogramm Remote Sens"},{"key":"1703_CR13","unstructured":"Nair V, Hinton GE (2010) Rectified linear units improve restricted boltzmann machines. In: Proceedings of the 27th international conference on machine learning (ICML-10), pp 807\u2013814"},{"key":"1703_CR14","first-page":"649","volume":"2015","author":"X Zhang","year":"2015","unstructured":"Zhang X, Zhao J, LeCun Y (2015) Character-level convolutional networks for text classification. Adv Neural Inf Process Syst 2015:649\u2013657","journal-title":"Adv Neural Inf Process Syst"},{"key":"1703_CR15","doi-asserted-by":"crossref","unstructured":"Graves A, Mohamed AR, Hinton G (2013) Speech recognition with deep recurrent neural networks. 
In: IEEE international conference on acoustics, speech and signal processing (pp. 6645\u20136649)","DOI":"10.1109\/ICASSP.2013.6638947"},{"key":"1703_CR16","doi-asserted-by":"crossref","unstructured":"Hoque MM, Quaresma P (2016) A semantic-based technique for question classification in question answering systems \u2014 a hybrid approach. In: International Conference on Computer Information Technology","DOI":"10.1109\/ICCITechn.2015.7488039"},{"issue":"6","key":"1703_CR17","doi-asserted-by":"publisher","first-page":"1003","DOI":"10.1152\/jn.1963.26.6.1003","volume":"26","author":"TN Wiesel","year":"1963","unstructured":"Wiesel TN, Hubel DH (1963) Single-cell responses in striate cortex of kittens deprived of vision in one eye. J Neurophysiol 26(6):1003\u20131017","journal-title":"J Neurophysiol"},{"key":"1703_CR18","doi-asserted-by":"crossref","unstructured":"Chen LC, Zhu Y, Papandreou G, Schroff F, Adam H (2018) Encoder-decoder with atrous separable convolution for semantic image segmentation. In: Proceedings of the European conference on computer vision (ECCV) (pp. 801\u2013818)","DOI":"10.1007\/978-3-030-01234-2_49"},{"key":"1703_CR19","doi-asserted-by":"crossref","unstructured":"Kim Y, Jernite Y, Sontag D, Rush AM (2016) Character-aware neural language models. In: Thirtieth AAAI Conference on Artificial Intelligence","DOI":"10.1609\/aaai.v30i1.10362"},{"key":"1703_CR20","doi-asserted-by":"crossref","unstructured":"He K, Zhang X, Ren S, Sun J (2015) Delving deep into rectifiers: Surpassing human-level performance on imagenet classification. In: Proceedings of the IEEE international conference on computer vision, pp 1026\u20131034","DOI":"10.1109\/ICCV.2015.123"},{"key":"1703_CR21","doi-asserted-by":"crossref","unstructured":"Huang G, Liu Z, Van Der Maaten L, Weinberger KQ (2017) Densely connected convolutional networks. 
In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4700\u20134708","DOI":"10.1109\/CVPR.2017.243"},{"issue":"7","key":"1703_CR22","doi-asserted-by":"publisher","first-page":"1312","DOI":"10.1109\/TPAMI.2011.231","volume":"34","author":"J Carreira","year":"2012","unstructured":"Carreira J, Sminchisescu C (2012) Cpmc: Automatic object segmentation using constrained parametric min-cuts. IEEE Trans Pattern Anal Mach Intell 34(7):1312\u20131328","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"1703_CR23","unstructured":"Belongie S, Malik J, Shi J, Leung T (1998) Image and video segmentation: the normalized cut framework. In: International conference on image processing"},{"key":"1703_CR24","unstructured":"Freedman D, Zhang T (2005) Interactive graph cut based segmentation with shape priors. In: IEEE Computer Society Conference on Computer Vision Pattern Recognition"},{"issue":"12","key":"1703_CR25","doi-asserted-by":"publisher","first-page":"1684","DOI":"10.1109\/83.730380","volume":"7","author":"K Haris","year":"2002","unstructured":"Haris K, Efstratiadis SN, Maglaveras N, Katsaggelos AK (2002) Hybrid image segmentation using watersheds and fast region merging. IEEE Trans Image Process 7(12):1684\u20131699","journal-title":"IEEE Trans Image Process"},{"key":"1703_CR26","doi-asserted-by":"crossref","unstructured":"Hou L, Samaras D, Kurc TM, Gao Y, Davis JE, Saltz JH (2016) Patch-based convolutional neural network for whole slide tissue image classification. In: Computer Vision Pattern Recognition","DOI":"10.1109\/CVPR.2016.266"},{"issue":"3","key":"1703_CR27","doi-asserted-by":"publisher","first-page":"302","DOI":"10.1007\/s11263-018-1140-0","volume":"127","author":"B Zhou","year":"2019","unstructured":"Zhou B, Zhao H, Puig X, Xiao T, Fidler S, Barriuso A, Torralba A (2019) Semantic understanding of scenes through the ade20k dataset. 
Int J Comput Vis 127(3):302\u2013321","journal-title":"Int J Comput Vis"},{"key":"1703_CR28","doi-asserted-by":"publisher","first-page":"214","DOI":"10.1016\/j.neuroimage.2014.12.061","volume":"108","author":"W Zhang","year":"2015","unstructured":"Zhang W, Li R, Deng H, Wang L, Lin W, Ji S, Shen D (2015) Deep convolutional neural networks for multi-modality isointense infant brain image segmentation. Neuroimage 108:214\u2013224","journal-title":"Neuroimage"},{"key":"1703_CR29","doi-asserted-by":"crossref","unstructured":"Ronneberger O, Fischer P, Brox T (2015) U-net: Convolutional networks for biomedical image segmentation. In: International conference on medical image computing and computer-assisted intervention. Springer, pp 234\u2013241","DOI":"10.1007\/978-3-319-24574-4_28"},{"issue":"4","key":"1703_CR30","doi-asserted-by":"publisher","first-page":"834","DOI":"10.1109\/TPAMI.2017.2699184","volume":"40","author":"L-C Chen","year":"2018","unstructured":"Chen L-C, Papandreou G, Kokkinos I, Murphy K, Yuille AL (2018) Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. IEEE Trans Pattern Anal Mach Intell 40(4):834\u2013848","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"1703_CR31","doi-asserted-by":"crossref","unstructured":"Noh H, Hong S, Han B (2015) Learning deconvolution network for semantic segmentation. In: IEEE International Conference on Computer Vision","DOI":"10.1109\/ICCV.2015.178"},{"key":"1703_CR32","doi-asserted-by":"crossref","unstructured":"Yang J, Price B, Cohen S, Lee H, Yang MH (2016) Object contour detection with a fully convolutional encoder-decoder network. 
In: Computer Vision Pattern Recognition","DOI":"10.1109\/CVPR.2016.28"},{"issue":"99","key":"1703_CR33","first-page":"1","volume":"PP","author":"V Badrinarayanan","year":"2017","unstructured":"Badrinarayanan V, Kendall A, Cipolla R (2017) Segnet: A deep convolutional encoder-decoder architecture for scene segmentation. IEEE Trans Pattern Anal Mach Intell PP(99):1\u20131","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"1703_CR34","doi-asserted-by":"crossref","unstructured":"Lin G, Milan A, Shen C, Reid I (2017) Refinenet: Multi-path refinement networks for high-resolution semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1925\u20131934","DOI":"10.1109\/CVPR.2017.549"},{"issue":"9","key":"1703_CR35","doi-asserted-by":"publisher","first-page":"1067","DOI":"10.1016\/j.patrec.2004.03.004","volume":"25","author":"P Mitra","year":"2004","unstructured":"Mitra P, Shankar BU, Pal SK (2004) Segmentation of multispectral remote sensing images using active support vector machines. Pattern Recogn Lett 25(9):1067\u20131074","journal-title":"Pattern Recogn Lett"},{"key":"1703_CR36","doi-asserted-by":"publisher","first-page":"158","DOI":"10.1016\/j.isprsjprs.2017.11.009","volume":"135","author":"D Marmanis","year":"2017","unstructured":"Marmanis D, Schindler K, Wegner JD, Galliani S, Datcu M, Stilla U (2017) Classification with an edge: improving semantic image segmentation with boundary detection. 
Isprs J Photogramm Remote Sens 135:158\u2013172","journal-title":"Isprs J Photogramm Remote Sens"},{"key":"1703_CR37","unstructured":"Chen LC, Papandreou G, Schroff F, Adam H (2017) Rethinking atrous convolution for semantic image segmentation, arXiv:1706.05587"},{"issue":"3","key":"1703_CR38","doi-asserted-by":"publisher","first-page":"617","DOI":"10.1080\/01431160701352154","volume":"29","author":"JF Mas","year":"2008","unstructured":"Mas JF, Flores JJ (2008) The application of artificial neural networks to the analysis of remotely sensed data. Int J Remote Sens 29(3):617\u2013663","journal-title":"Int J Remote Sens"},{"key":"1703_CR39","unstructured":"Ioffe S, Szegedy C (2015) Batch normalization: Accelerating deep network training by reducing internal covariate shift, arXiv:1502.03167"},{"key":"1703_CR40","unstructured":"BDCI (2017) Big data and computing intelligence contest. https:\/\/www.datafountain.cn\/competitions\/270\/details\/"},{"issue":"5","key":"1703_CR41","doi-asserted-by":"publisher","first-page":"1974","DOI":"10.1109\/JSTARS.2014.2357832","volume":"8","author":"R Qin","year":"2015","unstructured":"Qin R (2015) A mean shift vector-based shape feature for classification of high spatial resolution remotely sensed imagery. 
IEEE J Sel Top Appl Earth Obs Remote Sens 8(5):1974\u20131985","journal-title":"IEEE J Sel Top Appl Earth Obs Remote Sens"}],"container-title":["Mobile Networks and Applications"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s11036-020-01703-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.1007\/s11036-020-01703-3\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1007\/s11036-020-01703-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,12,17]],"date-time":"2022-12-17T06:44:21Z","timestamp":1671259461000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1007\/s11036-020-01703-3"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,2]]},"references-count":41,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2021,2]]}},"alternative-id":["1703"],"URL":"https:\/\/doi.org\/10.1007\/s11036-020-01703-3","relation":{},"ISSN":["1383-469X","1572-8153"],"issn-type":[{"value":"1383-469X","type":"print"},{"value":"1572-8153","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,2]]},"assertion":[{"value":"25 November 2020","order":1,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"12 February 2021","order":2,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}