{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,30]],"date-time":"2025-12-30T17:57:15Z","timestamp":1767117435615,"version":"build-2065373602"},"reference-count":57,"publisher":"MDPI AG","issue":"7","license":[{"start":{"date-parts":[[2021,3,30]],"date-time":"2021-03-30T00:00:00Z","timestamp":1617062400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"National Key R &amp; D Program of China","award":["2018YFC0810600, 2018YFC0810605"],"award-info":[{"award-number":["2018YFC0810600, 2018YFC0810605"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>The pixel-based semantic segmentation methods take pixels as recognitions units, and are restricted by the limited range of receptive fields, so they cannot carry richer and higher-level semantics. These reduce the accuracy of remote sensing (RS) semantic segmentation to a certain extent. Comparing with the pixel-based methods, the graph neural networks (GNNs) usually use objects as input nodes, so they not only have relatively small computational complexity, but also can carry richer semantic information. However, the traditional GNNs are more rely on the context information of the individual samples and lack geographic prior knowledge that reflects the overall situation of the research area. Therefore, these methods may be disturbed by the confusion of \u201cdifferent objects with the same spectrum\u201d or \u201cviolating the first law of geography\u201d in some areas. To address the above problems, we propose a remote sensing semantic segmentation model called knowledge and spatial pyramid distance-based gated graph attention network (KSPGAT), which is based on prior knowledge, spatial pyramid distance and a graph attention network (GAT) with gating mechanism. The model first uses superpixels (geographical objects) to form the nodes of a graph neural network and then uses a novel spatial pyramid distance recognition algorithm to recognize the spatial relationships. Finally, based on the integration of feature similarity and the spatial relationships of geographic objects, a multi-source attention mechanism and gating mechanism are designed to control the process of node aggregation, as a result, the high-level semantics, spatial relationships and prior knowledge can be introduced into a remote sensing semantic segmentation network. The experimental results show that our model improves the overall accuracy by 4.43% compared with the U-Net Network, and 3.80% compared with the baseline GAT network.<\/jats:p>","DOI":"10.3390\/rs13071312","type":"journal-article","created":{"date-parts":[[2021,3,31]],"date-time":"2021-03-31T00:13:10Z","timestamp":1617149590000},"page":"1312","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":19,"title":["Knowledge and Spatial Pyramid Distance-Based Gated Graph Attention Network for Remote Sensing Semantic Segmentation"],"prefix":"10.3390","volume":"13","author":[{"given":"Wei","family":"Cui","sequence":"first","affiliation":[{"name":"School of Resources and Environmental Engineering, Wuhan University of Technology, Wuhan 430070, China"}]},{"given":"Xin","family":"He","sequence":"additional","affiliation":[{"name":"School of Resources and Environmental Engineering, Wuhan University of Technology, Wuhan 430070, China"}]},{"given":"Meng","family":"Yao","sequence":"additional","affiliation":[{"name":"School of Resources and Environmental Engineering, Wuhan University of Technology, Wuhan 430070, China"}]},{"given":"Ziwei","family":"Wang","sequence":"additional","affiliation":[{"name":"School of Resources and Environmental Engineering, Wuhan University of Technology, Wuhan 430070, China"}]},{"given":"Yuanjie","family":"Hao","sequence":"additional","affiliation":[{"name":"School of Resources and Environmental Engineering, Wuhan University of Technology, Wuhan 430070, China"}]},{"given":"Jie","family":"Li","sequence":"additional","affiliation":[{"name":"School of Resources and Environmental Engineering, Wuhan University of Technology, Wuhan 430070, China"}]},{"given":"Weijie","family":"Wu","sequence":"additional","affiliation":[{"name":"School of Resources and Environmental Engineering, Wuhan University of Technology, Wuhan 430070, China"}]},{"given":"Huilin","family":"Zhao","sequence":"additional","affiliation":[{"name":"School of Resources and Environmental Engineering, Wuhan University of Technology, Wuhan 430070, China"}]},{"given":"Cong","family":"Xia","sequence":"additional","affiliation":[{"name":"School of Resources and Environmental Engineering, Wuhan University of Technology, Wuhan 430070, China"}]},{"given":"Jin","family":"Li","sequence":"additional","affiliation":[{"name":"School of Resources and Environmental Engineering, Wuhan University of Technology, Wuhan 430070, China"}]},{"given":"Wenqi","family":"Cui","sequence":"additional","affiliation":[{"name":"School of Resources and Environmental Engineering, Wuhan University of Technology, Wuhan 430070, China"}]}],"member":"1968","published-online":{"date-parts":[[2021,3,30]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Cui, W., Wang, F., He, X., Zhang, D., Xu, X., Yao, M., Wang, Z., and Huang, J. (2019). Multi-Scale Semantic Segmentation and Spatial Relationship Recognition of Remote Sensing Images Based on an Attention Model. Remote Sens., 11.","DOI":"10.3390\/rs11091044"},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Yi, Y., Zhang, Z., Zhang, W., Zhang, C., Li, W., and Zhao, T. (2019). Semantic Segmentation of Urban Buildings from VHR Remote Sensing Imagery Using a Deep Convolutional Neural Network. Remote Sens., 11.","DOI":"10.3390\/rs11151774"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Shao, Z., Tang, P., Wang, Z., Saleem, N., Yam, S., and Sommai, C. (2020). BRRNet: A Fully Convolutional Neural Network for Automatic Building Extraction From High-Resolution Remote Sensing Images. Remote Sens., 12.","DOI":"10.3390\/rs12061050"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"He, C., Li, S., Xiong, D., Fang, P., and Liao, M. (2020). Remote Sensing Image Semantic Segmentation Based on Edge Information Guidance. Remote Sens., 12.","DOI":"10.3390\/rs12091501"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Xu, Z., Zhang, W., Zhang, T., and Li, J. (2020). HRCNet: High-Resolution Context Extraction Network for Semantic Segmentation of Remote Sensing Images. Remote Sens., 13.","DOI":"10.3390\/rs13010071"},{"key":"ref_6","unstructured":"Liu, W., Rabinovich, A., and Berg, A.C. (2015). ParseNet: Looking Wider to See Better. arXiv."},{"key":"ref_7","unstructured":"Luo, W., Li, Y., Urtasun, R., and Zemel, R. (2017). Understanding the Effective Receptive Field in Deep Convolutional Neural Networks. arXiv."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Wang, X., Girshick, R., Gupta, A., and He, K. (2018). Non-Local Neural Networks. arXiv.","DOI":"10.1109\/CVPR.2018.00813"},{"key":"ref_9","unstructured":"Yu, F., and Koltun, V. (2016). Multi-Scale Context Aggregation by Dilated Convolutions. arXiv."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Li, D., Shen, X., Yu, Y., Guan, H., Li, J., Zhang, G., and Li, D. (2020). Building Extraction from Airborne Multi-Spectral LiDAR Point Clouds Based on Graph Geometric Moments Convolutional Neural Networks. Remote Sens., 12.","DOI":"10.3390\/rs12193186"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Ma, F., Gao, F., Sun, J., Zhou, H., and Hussain, A. (2019). Attention Graph Convolution Network for Image Segmentation in Big SAR Imagery Data. Remote Sens., 11.","DOI":"10.3390\/rs11212586"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Zhao, W., Emery, W., Bo, Y., and Chen, J. (2018). Land Cover Mapping with Higher Order Graph-Based Co-Occurrence Model. Remote Sens., 10.","DOI":"10.3390\/rs10111713"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"234","DOI":"10.2307\/143141","article-title":"A Computer Movie Simulating Urban Growth in the Detroit Region","volume":"46","author":"Tobler","year":"1970","journal-title":"Econ. Geogr."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"471","DOI":"10.1023\/A:1013101931793","article-title":"A Multiscale Framework for Landscape Analysis: Object-Specific Analysis and Upscaling","volume":"16","author":"Hay","year":"2001","journal-title":"Landsc. Ecol."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Huang, H., Chen, J., Li, Z., Gong, F., and Chen, N. (2017). Ontology-Guided Image Interpretation for GEOBIA of High Spatial Resolution Remote Sense Imagery: A Coastal Area Case Study. IJGI, 6.","DOI":"10.3390\/ijgi6040105"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Merciol, F., Faucqueur, L., Damodaran, B., R\u00e9my, P.-Y., Descl\u00e9e, B., Dazin, F., Lef\u00e8vre, S., Masse, A., and Sannier, C. (2019). GEOBIA at the Terapixel Scale: Toward Efficient Mapping of Small Woody Features from Heterogeneous VHR Scenes. IJGI, 8.","DOI":"10.3390\/ijgi8010046"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Zhou, Z., Ma, L., Fu, T., Zhang, G., Yao, M., and Li, M. (2018). Change Detection in Coral Reef Environment Using High-Resolution Images: Comparison of Object-Based and Pixel-Based Paradigms. IJGI, 7.","DOI":"10.3390\/ijgi7110441"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Knevels, R., Petschko, H., Leopold, P., and Brenning, A. (2019). Geographic Object-Based Image Analysis for Automated Landslide Detection Using Open Source GIS Software. IJGI, 8.","DOI":"10.3390\/ijgi8120551"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Mishra, N., Mainali, K., Shrestha, B., Radenz, J., and Karki, D. (2018). Species-Level Vegetation Mapping in a Himalayan Treeline Ecotone Using Unmanned Aerial System (UAS) Imagery. IJGI, 7.","DOI":"10.3390\/ijgi7110445"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Lef\u00e8vre, S., Sheeren, D., and Tasar, O. (2019). A Generic Framework for Combining Multiple Segmentations in Geographic Object-Based Image Analysis. IJGI, 8.","DOI":"10.3390\/ijgi8020070"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Alganci, U. (2019). Dynamic Land Cover Mapping of Urbanized Cities with Landsat 8 Multi-Temporal Images: Comparative Evaluation of Classification Algorithms and Dimension Reduction Methods. IJGI, 8.","DOI":"10.3390\/ijgi8030139"},{"key":"ref_22","unstructured":"Cui, W. (2016). Geographical Ontology Modeling Based on Object-Oriented Remote Sensing Technology, The Science Publishing Compan."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"334","DOI":"10.1080\/2150704X.2017.1420265","article-title":"Application of a Parallel Spectral\u2013Spatial Convolution Neural Network in Object-Oriented Remote Sensing Land Use Classification","volume":"9","author":"Cui","year":"2018","journal-title":"Remote Sens. Lett."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Hamedianfar, A., Gibril, M.B.A., Hosseinpoor, M., and Pellikka, P.K.E. (2020). Synergistic Use of Particle Swarm Optimization, Artificial Neural Network, and Extreme Gradient Boosting Algorithms for Urban LULC Mapping from WorldView-3 Images. Geocarto Int., 1\u201319.","DOI":"10.1080\/10106049.2020.1737974"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"18","DOI":"10.1109\/MSP.2017.2693418","article-title":"Geometric Deep Learning: Going beyond Euclidean Data","volume":"34","author":"Bronstein","year":"2017","journal-title":"IEEE Signal. Process. Mag."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Zhou, J., Cui, G., Zhang, Z., Yang, C., Liu, Z., Wang, L., Li, C., and Sun, M. (2019). Graph Neural Networks: A Review of Methods and Applications. arXiv.","DOI":"10.1016\/j.aiopen.2021.01.001"},{"key":"ref_27","unstructured":"Kipf, T.N., and Welling, M. (2017). Semi-Supervised Classification with Graph Convolutional Networks. arXiv."},{"key":"ref_28","unstructured":"Veli\u010dkovi\u0107, P., Cucurull, G., Casanova, A., Romero, A., Li\u00f2, P., and Bengio, Y. (2018). Graph Attention Networks. arXiv."},{"key":"ref_29","unstructured":"Hechtlinger, Y., Chakravarti, P., and Qin, J. (2017). A Generalization of Convolutional Neural Networks to Graph-Structured Data. arXiv."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Liu, Q., Kampffmeyer, M., Jenssen, R., and Salberg, A.-B. (2020). Self-Constructing Graph Convolutional Networks for Semantic Labeling. arXiv.","DOI":"10.1109\/IGARSS39084.2020.9324719"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Chen, Y., Rohrbach, M., Yan, Z., Yan, S., Feng, J., and Kalantidis, Y. (2018). Graph-Based Global Reasoning Networks. arXiv.","DOI":"10.1109\/CVPR.2019.00052"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Lu, Y., Chen, Y., Zhao, D., and Chen, J. (2020). Graph-FCN for Image Semantic Segmentation. arXiv.","DOI":"10.1007\/978-3-030-22796-8_11"},{"key":"ref_33","unstructured":"Abu-El-Haija, S., Kapoor, A., Perozzi, B., and Lee, J. (2018). N-GCN: Multi-Scale Graph Convolution for Semi-Supervised Node Classification. arXiv."},{"key":"ref_34","unstructured":"Hamilton, W.L., Ying, R., and Leskovec, J. (2018). Inductive Representation Learning on Large Graphs. arXiv."},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Chiang, W.-L., Liu, X., Si, S., Li, Y., Bengio, S., and Hsieh, C.-J. (2019, January 25). Cluster-GCN: An Efficient Algorithm for Training Deep and Large Graph Convolutional Networks. Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, Anchorage, AK, USA.","DOI":"10.1145\/3292500.3330925"},{"key":"ref_36","unstructured":"Rong, Y., Huang, W., Xu, T., and Huang, J. (2020). DropEdge: Towards Deep Graph Convolutional Networks on Node Classification. arXiv."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Wang, G., Ying, R., Huang, J., and Leskovec, J. (2020). Direct Multi-Hop Attention Based Graph Neural Network. arXiv.","DOI":"10.24963\/ijcai.2021\/425"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Kampffmeyer, M., Chen, Y., Liang, X., Wang, H., Zhang, Y., and Xing, E.P. (2019). Rethinking Knowledge Graph Propagation for Zero-Shot Learning. arXiv.","DOI":"10.1109\/CVPR.2019.01175"},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Singh, K.K., Divvala, S., Farhadi, A., and Lee, Y.J. (2018). DOCK: Detecting Objects by Transferring Common-Sense Knowledge. arXiv.","DOI":"10.1007\/978-3-030-01261-8_30"},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Marino, K., Salakhutdinov, R., and Gupta, A. (2017). The More You Know: Using Knowledge Graphs for Image Classification. arXiv.","DOI":"10.1109\/CVPR.2017.10"},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"10973","DOI":"10.1609\/aaai.v34i07.6731","article-title":"Joint Commonsense and Relation Reasoning for Image and Video Captioning","volume":"34","author":"Hou","year":"2020","journal-title":"AAAI"},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"You, R., Guo, Z., Cui, L., Long, X., Bao, Y., and Wen, S. (2020). Cross-Modality Attention with Semantic Graph Embedding for Multi-Label Classification. arXiv.","DOI":"10.1609\/aaai.v34i07.6964"},{"key":"ref_43","unstructured":"Xie, Y., Xu, Z., Kankanhalli, M.S., Meel, K.S., and Soh, H. (2019). Embedding Symbolic Knowledge into Deep Networks. arXiv."},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Chen, T., Yu, W., Chen, R., and Lin, L. (2019). Knowledge-Embedded Routing Network for Scene Graph Generation. arXiv.","DOI":"10.1109\/CVPR.2019.00632"},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Li, M., and Stein, A. (2020). Mapping Land Use from High Resolution Satellite Images by Exploiting the Spatial Arrangement of Land Cover Objects. Remote Sens., 12.","DOI":"10.3390\/rs12244158"},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"Li, Y., Chen, R., Zhang, Y., Zhang, M., and Chen, L. (2020). Multi-Label Remote Sensing Image Scene Classification by Combining a Convolutional Neural Network and a Graph Neural Network. Remote Sens., 12.","DOI":"10.3390\/rs12234003"},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Iddianozie, C., and McArdle, G. (2020). Improved Graph Neural Networks for Spatial Networks Using Structure-Aware Sampling. IJGI, 9.","DOI":"10.3390\/ijgi9110674"},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Jimenez-Sierra, D.A., Ben\u00edtez-Restrepo, H.D., Vargas-Cardona, H.D., and Chanussot, J. (2020). Graph-Based Data Fusion Applied to: Change Detection and Biomass Estimation in Rice Crops. Remote Sens., 12.","DOI":"10.3390\/rs12172683"},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Niu, Y., and Wang, B. (2016, January 10\u201315). A Novel Hyperspectral Anomaly Detector Based on Low-Rank Representation and Learned Dictionary. Proceedings of the 2016 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Beijing, China.","DOI":"10.1109\/IGARSS.2016.7730531"},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Liu, J., Xiao, Z., Chen, Y., and Yang, J. (2017). Spatial-Spectral Graph Regularized Kernel Sparse Representation for Hyperspectral Image Classification. IJGI, 6.","DOI":"10.3390\/ijgi6080258"},{"key":"ref_51","doi-asserted-by":"crossref","first-page":"1096","DOI":"10.1109\/LGRS.2014.2384051","article-title":"Airport Target Detection in Remote Sensing Images: A New Method Based on Two-Way Saliency","volume":"12","author":"Zhu","year":"2015","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_52","doi-asserted-by":"crossref","first-page":"2461","DOI":"10.1109\/JSTARS.2018.2833102","article-title":"Stepwise Evolution Analysis of the Region-Merging Segmentation for Scale Parameterization","volume":"11","author":"Hu","year":"2018","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_53","doi-asserted-by":"crossref","first-page":"2394","DOI":"10.1109\/TIP.2017.2676342","article-title":"Unsupervised Simplification of Image Hierarchies via Evolution Analysis in Scale-Sets Framework","volume":"26","author":"Hu","year":"2017","journal-title":"IEEE Trans. Image Process."},{"key":"ref_54","doi-asserted-by":"crossref","first-page":"7366","DOI":"10.1109\/TGRS.2016.2600636","article-title":"A Bilevel Scale-Sets Model for Hierarchical Representation of Large Remote Sensing Images","volume":"54","author":"Hu","year":"2016","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_55","doi-asserted-by":"crossref","unstructured":"Tong, X.-Y., Xia, G.-S., Lu, Q., Shen, H., Li, S., You, S., and Zhang, L. (2019). Land-Cover Classification with High-Resolution Remote Sensing Images Using Transferable Deep Models. arXiv.","DOI":"10.1016\/j.rse.2019.111322"},{"key":"ref_56","doi-asserted-by":"crossref","first-page":"1040","DOI":"10.1080\/13658816.2019.1568441","article-title":"Examining the sensitivity of spatial scale in cellular automata Markov chain simulation of land use change","volume":"33","author":"Hao","year":"2019","journal-title":"Int. J. Geogr. Inf. Sci."},{"key":"ref_57","doi-asserted-by":"crossref","first-page":"110331","DOI":"10.1016\/j.jenvman.2020.110331","article-title":"Analysis of Sustainable Utilization of Water Resources Based on the Improved Water Resources Ecological Footprint Model: A Case Study of Hubei Province, China","volume":"262","author":"Wang","year":"2020","journal-title":"J. Environ. Manag."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/7\/1312\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,13]],"date-time":"2025-10-13T13:25:42Z","timestamp":1760361942000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/7\/1312"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,3,30]]},"references-count":57,"journal-issue":{"issue":"7","published-online":{"date-parts":[[2021,4]]}},"alternative-id":["rs13071312"],"URL":"https:\/\/doi.org\/10.3390\/rs13071312","relation":{},"ISSN":["2072-4292"],"issn-type":[{"type":"electronic","value":"2072-4292"}],"subject":[],"published":{"date-parts":[[2021,3,30]]}}}