{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,19]],"date-time":"2025-12-19T15:36:25Z","timestamp":1766158585767,"version":"build-2065373602"},"reference-count":51,"publisher":"MDPI AG","issue":"24","license":[{"start":{"date-parts":[[2020,12,8]],"date-time":"2020-12-08T00:00:00Z","timestamp":1607385600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["No.41674015"],"award-info":[{"award-number":["No.41674015"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>Low inter-class variance and complex spatial details exist in ground objects of the coastal zone, which leads to a challenging task for coastal land cover classification (CLCC) from high-resolution remote sensing images. Recently, fully convolutional neural networks have been widely used in CLCC. However, the inherent structure of the convolutional operator limits the receptive field, resulting in capturing the local context. Additionally, complex decoders bring additional information redundancy and computational burden. Therefore, this paper proposes a novel attention-driven context encoding network to solve these problems. Among them, lightweight global feature attention modules are employed to aggregate multi-scale spatial details in the decoding stage. Meanwhile, position and channel attention modules with long-range dependencies are embedded to enhance feature representations of specific categories by capturing the multi-dimensional global context. Additionally, multiple objective functions are introduced to supervise and optimize feature information at specific scales. We apply the proposed method in CLCC tasks of two study areas and compare it with other state-of-the-art approaches. Experimental results indicate that the proposed method achieves the optimal performances in encoding long-range context and recognizing spatial details and obtains the optimum representations in evaluation indexes.<\/jats:p>","DOI":"10.3390\/s20247032","type":"journal-article","created":{"date-parts":[[2020,12,8]],"date-time":"2020-12-08T20:27:15Z","timestamp":1607459235000},"page":"7032","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":11,"title":["Coastal Land Cover Classification of High-Resolution Remote Sensing Images Using Attention-Driven Context Encoding Network"],"prefix":"10.3390","volume":"20","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-7578-6376","authenticated-orcid":false,"given":"Jifa","family":"Chen","sequence":"first","affiliation":[{"name":"College of Marine Science and Technology, China University of Geosciences, Wuhan 430074, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Gang","family":"Chen","sequence":"additional","affiliation":[{"name":"College of Marine Science and Technology, China University of Geosciences, Wuhan 430074, China"},{"name":"Key Laboratory of Geological Survey and Evaluation of Ministry of Education, China University of Geosciences, Wuhan 430074, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Lizhe","family":"Wang","sequence":"additional","affiliation":[{"name":"Key Laboratory of Geological Survey and Evaluation of Ministry of Education, China University of Geosciences, Wuhan 430074, China"},{"name":"School of Computer Science, China University of Geosciences, Wuhan 430074, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Bo","family":"Fang","sequence":"additional","affiliation":[{"name":"College of Marine Science and Technology, China University of Geosciences, Wuhan 430074, China"},{"name":"Key Laboratory of Geological Survey and Evaluation of Ministry of Education, China University of Geosciences, Wuhan 430074, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9874-2659","authenticated-orcid":false,"given":"Ping","family":"Zhou","sequence":"additional","affiliation":[{"name":"College of Marine Science and Technology, China University of Geosciences, Wuhan 430074, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Mingjie","family":"Zhu","sequence":"additional","affiliation":[{"name":"College of Marine Science and Technology, China University of Geosciences, Wuhan 430074, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2020,12,8]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"12070","DOI":"10.3390\/rs61212070","article-title":"Global Land Cover Mapping: A Review and Uncertainty Analysis","volume":"6","author":"Congalton","year":"2014","journal-title":"Remote Sens."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"3440","DOI":"10.1080\/01431161.2014.903435","article-title":"Land-use\/cover classification in a heterogeneous coastal landscape using RapidEye imagery: Evaluating the performance of random forest and support vector machines classifiers","volume":"35","author":"Adam","year":"2014","journal-title":"Int. J. Remote Sens."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"549","DOI":"10.1109\/LGRS.2017.2657778","article-title":"Training Deep Convolutional Neural Networks for Land\u2013Cover Classification of High-Resolution Imagery","volume":"14","author":"Scott","year":"2017","journal-title":"IEEE Geosci. Remote Sens."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Fang, B., Kou, R., Pan, L., and Chen, P. (2019). Category sensitive domain adaptation for land cover mapping in aerial scenes. Remote Sens., 11.","DOI":"10.3390\/rs11222631"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Thanh Noi, P., and Kappas, M. (2018). Comparison of Random Forest, k-Nearest Neighbor, and Support Vector Machine Classifiers for Land Cover Classification Using Sentinel-2 Imagery. Sensors, 18.","DOI":"10.3390\/s18010018"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Long, J., Shelhamer, E., and Darrell, T. (2015, January 7\u201312). Fully convolutional networks for semantic segmentation. Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298965"},{"key":"ref_7","unstructured":"Chen, L., Papandreou, G., Kokkinos, I., Murphy, K., and Yuille, A.L. (2014). Semantic image segmentation with deep convolutional nets and fully connected crfs. arXiv."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"1904","DOI":"10.1109\/TPAMI.2015.2389824","article-title":"Spatial pyramid pooling in deep convolutional networks for visual recognition","volume":"37","author":"He","year":"2015","journal-title":"IEEE Trans. Pattern Anal."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"834","DOI":"10.1109\/TPAMI.2017.2699184","article-title":"Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs","volume":"40","author":"Chen","year":"2017","journal-title":"IEEE Trans. Pattern Anal."},{"key":"ref_10","unstructured":"Chen, L., Papandreou, G., Schroff, F., and Adam, H. (2017). Rethinking atrous convolution for semantic image segmentation. arXiv."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Chen, L., Zhu, Y., Papandreou, G., Schroff, F., and Adam, H. (2018, January 8\u201314). Encoder-decoder with atrous separable convolution for semantic image segmentation. Proceedings of the 2018 European Conference on Computer Vision (ECCV), Cham, Switzerland.","DOI":"10.1007\/978-3-030-01234-2_49"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Zhao, H., Shi, J., Qi, X., Wang, X., and Jia, J. (2017, January 21\u201326). Pyramid scene parsing network. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.660"},{"key":"ref_13","unstructured":"Li, H., Xiong, P., An, J., and Wang, L. (2018). Pyramid attention network for semantic segmentation. arXiv."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"3639","DOI":"10.1109\/TGRS.2016.2636241","article-title":"Deep recurrent neural networks for hyperspectral image classification","volume":"55","author":"Mou","year":"2017","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"1480","DOI":"10.1109\/TPAMI.2017.2712691","article-title":"Scene segmentation with dag-recurrent neural networks","volume":"40","author":"Shuai","year":"2018","journal-title":"IEEE Trans. Pattern Anal."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Ronneberger, O., Fischer, P., and Brox, T. (2015, January 5\u20139). U-Net: Convolutional networks for biomedical image segmentation. Proceedings of the 2015 Medical Image Computing and Computer-Assisted Intervention (MICCAI), Cham, Switzerland.","DOI":"10.1007\/978-3-319-24574-4_28"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"2481","DOI":"10.1109\/TPAMI.2016.2644615","article-title":"Segnet: A deep convolutional encoder-decoder architecture for image segmentation","volume":"39","author":"Badrinarayanan","year":"2017","journal-title":"IEEE Trans. Pattern Anal."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Lin, G., Milan, A., Shen, C., and Reid, I. (2017, January 21\u201326). Refinenet: Multi-path refinement networks for high-resolution semantic segmentation. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.549"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"525","DOI":"10.1016\/j.apgeog.2010.11.007","article-title":"A comparison of classification techniques to support land cover and land use analysis in tropical coastal zones","volume":"31","author":"Szuster","year":"2011","journal-title":"Appl. Geogr."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Hua, L., Zhang, X., Chen, X., Yin, K., and Tang, L. (2017). A feature-based approach of decision tree classification to map time series urban land use and land cover with Landsat 5 TM and Landsat 8 OLI in a Coastal City, China. ISPRS Int. J. Geo-Inf., 6.","DOI":"10.3390\/ijgi6110331"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Shang, R., Zhang, J., Jiao, L., Li, Y., Marturi, N., and Stolkin, R. (2020). Multi-scale Adaptive Feature Fusion Network for Semantic Segmentation in Remote Sensing Images. Remote Sens., 12.","DOI":"10.3390\/rs12050872"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Huang, G., Liu, Z., Van Der Maaten, L., and Weinberger, K.Q. (2017, January 21\u201326). Densely connected convolutional networks. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.243"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Liu, C., Zeng, D., Wu, H., Wang, Y., Jia, S., and Xin, L. (2020). Urban land cover classification of high-resolution aerial imagery using a relation-enhanced multiscale convolutional network. Remote Sens., 12.","DOI":"10.3390\/rs12020311"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Guo, M., Liu, H., Xu, Y., and Huang, Y. (2020). Building Extraction Based on U-Net with an Attention Block and Multiple Losses. Remote Sens., 12.","DOI":"10.3390\/rs12091400"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Cao, K., and Zhang, X. (2020). An Improved Res-UNet Model for Tree Species Classification Using Airborne High-Resolution Images. Remote Sens., 12.","DOI":"10.3390\/rs12071128"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"He, K., Gkioxari, G., Doll\u00e1r, P., and Girshick, R. (2017, January 22\u201329). Mask r-cnn. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Venice, Italy.","DOI":"10.1109\/ICCV.2017.322"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Zhang, L., Wu, J., Fan, Y., Gao, H., and Shao, Y. (2020). An Efficient Building Extraction Method from High Spatial Resolution Remote Sensing Images Based on Improved Mask R-CNN. Sensors, 20.","DOI":"10.3390\/s20051465"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Liu, S., Ding, W., Liu, C., Liu, Y., Wang, Y., and Li, H. (2018). ERN: Edge loss reinforced semantic segmentation network for remote sensing images. Remote Sens., 10.","DOI":"10.3390\/rs10091339"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"He, C., Li, S., Xiong, D., Fang, P., and Liao, M. (2020). Remote Sensing Image Semantic Segmentation Based on Edge Information Guidance. Remote Sens., 12.","DOI":"10.3390\/rs12091501"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"84","DOI":"10.1145\/3065386","article-title":"Imagenet classification with deep convolutional neural networks","volume":"60","author":"Krizhevsky","year":"2017","journal-title":"Commun. ACM"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_32","unstructured":"Liu, W., Rabinovich, A., and Berg, A.C. (2015). Parsenet: Looking wider to see better. arXiv."},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Peng, C., Zhang, X., Yu, G., Luo, G., and Sun, J. (2017, January 21\u201326). Large kernel matters-improve semantic segmentation by global convolutional network. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.189"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Ghiasi, G., and Fowlkes, C.C. (2016, January 8\u201316). Laplacian pyramid reconstruction and refinement for semantic segmentation. Proceedings of the 2016 European Conference on Computer Vision (ECCV), Amsterdam, The Netherlands.","DOI":"10.1007\/978-3-319-46487-9_32"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Zhao, H., Zhang, Y., Liu, S., Shi, J., Loy, C.C., Lin, D., and Jia, J. (2018, January 8\u201314). PSANet: Point-wise Spatial Attention Network for Scene Parsing. Proceedings of the 2018 European Conference on Computer Vision (ECCV), Cham, Switzerland.","DOI":"10.1007\/978-3-030-01240-3_17"},{"key":"ref_36","unstructured":"Yuan, Y., and Wang, J. (2018). Ocnet: Object context network for scene parsing. arXiv."},{"key":"ref_37","unstructured":"Huang, Z., Wang, X., Huang, L., Huang, C., Wei, Y., and Liu, W. (November, January 27). CCNet: Criss-cross attention for semantic segmentation. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Seoul, Korea."},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Hu, J., Shen, L., and Sun, G. (2018, January 18\u201323). Squeeze-and-excitation networks. Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00745"},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Zhang, H., Dana, K., Shi, J., Zhang, Z., Wang, X., Tyagi, A., and Agrawal, A. (2018, January 18\u201323). Context encoding for semantic segmentation. Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00747"},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Fu, J., Liu, J., Tian, H., Li, Y., Bao, Y., Fang, Z., and Lu, H. (2019, January 16\u201320). Dual attention network for scene segmentation. Proceedings of the 2019 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00326"},{"key":"ref_41","unstructured":"Ioffe, S., and Szegedy, C. (2015). Batch normalization: Accelerating deep network training by reducing internal covariate shift. arXiv."},{"key":"ref_42","unstructured":"Glorot, X., Bordes, A., and Bengio, Y. (2011, January 11\u201313). Deep sparse rectifier neural networks. Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, Ft. Lauderdale, FL, USA."},{"key":"ref_43","unstructured":"Simonyan, K., and Zisserman, A. (2014). Very deep convolutional networks for large scale image recognition. arXiv."},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., and Rabinovich, A. (2015, January 7\u201312). In Going deeper with convolutions. Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Mou, L., Hua, Y., and Zhu, X.X. (2019, January 16\u201320). A relation-augmented fully convolutional network for semantic segmentation in aerial scenes. Proceedings of the 2019 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.01270"},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"Sinha, A., and Dolz, J. (2020). Multi-scale self-guided attention for medical image segmentation. IEEE J. Biomed. Health, in press.","DOI":"10.1109\/JBHI.2020.2986926"},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"197","DOI":"10.1016\/j.media.2019.01.012","article-title":"Attention gated networks: Learning to leverage salient regions in medical images","volume":"53","author":"Schlemper","year":"2019","journal-title":"Med. Image Anal."},{"key":"ref_48","unstructured":"(2020, March 23). Available online: http:\/\/labelme.csail.mit.edu\/Release3.0\/."},{"key":"ref_49","doi-asserted-by":"crossref","first-page":"21","DOI":"10.1016\/j.rse.2011.11.001","article-title":"A novel algorithm for land use and land cover classification using RADARSAT-2 polarimetric SAR data","volume":"118","author":"Qi","year":"2012","journal-title":"Remote Sens. Environ."},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Zhou, P., Chen, G., Wang, M., Chen, J., and Li, Y. (2020). Sediment Classification of Acoustic Backscatter Image Based on Stacked Denoising Autoencoder and Modified Extreme Learning Machine. Remote Sens., 12.","DOI":"10.3390\/rs12223762"},{"key":"ref_51","unstructured":"Paszke, A., Gross, S., Chintala, S., Chanan, G., Yang, E., DeVito, Z., Lin, Z., Desmaison, A., Antiga, L., and Lerer, A. (2017, January 4\u201310). Automatic differentiation in pytorch. Proceedings of the 2017 Conference and Workshop on Neural Information Processing Systems (NIPS), Long Beach, CA, USA."}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/20\/24\/7032\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T10:42:29Z","timestamp":1760179349000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/20\/24\/7032"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,12,8]]},"references-count":51,"journal-issue":{"issue":"24","published-online":{"date-parts":[[2020,12]]}},"alternative-id":["s20247032"],"URL":"https:\/\/doi.org\/10.3390\/s20247032","relation":{},"ISSN":["1424-8220"],"issn-type":[{"type":"electronic","value":"1424-8220"}],"subject":[],"published":{"date-parts":[[2020,12,8]]}}}