{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,24]],"date-time":"2026-03-24T07:21:30Z","timestamp":1774336890296,"version":"3.50.1"},"reference-count":31,"publisher":"MDPI AG","issue":"4","license":[{"start":{"date-parts":[[2024,4,4]],"date-time":"2024-04-04T00:00:00Z","timestamp":1712188800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"National Natural Science Foundation of China","award":["32372239"],"award-info":[{"award-number":["32372239"]}]},{"name":"National Natural Science Foundation of China","award":["221100110700"],"award-info":[{"award-number":["221100110700"]}]},{"name":"Henan Province Major Science and Technology Project","award":["32372239"],"award-info":[{"award-number":["32372239"]}]},{"name":"Henan Province Major Science and Technology Project","award":["221100110700"],"award-info":[{"award-number":["221100110700"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Algorithms"],"abstract":"<jats:p>Semantic segmentation algorithms leveraging deep convolutional neural networks often encounter challenges due to their extensive parameters, high computational complexity, and slow execution. To address these issues, we introduce a semantic segmentation network model emphasizing the rapid generation of redundant features and multi-level spatial aggregation. This model applies cost-efficient linear transformations instead of standard convolution operations during feature map generation, effectively managing memory usage and reducing computational complexity. To enhance the feature maps\u2019 representation ability post-linear transformation, a specifically designed dual-attention mechanism is implemented, enhancing the model\u2019s capacity for semantic understanding of both local and global image information. Moreover, the model integrates sparse self-attention with multi-scale contextual strategies, effectively combining features across different scales and spatial extents. This approach optimizes computational efficiency and retains crucial information, enabling precise and quick image segmentation. To assess the model\u2019s segmentation performance, we conducted experiments in Changge City, Henan Province, using datasets such as LoveDA, PASCAL VOC, LandCoverNet, and DroneDeploy. These experiments demonstrated the model\u2019s outstanding performance on public remote sensing datasets, significantly reducing the parameter count and computational complexity while maintaining high accuracy in segmentation tasks. This advancement offers substantial technical benefits for applications in agriculture and forestry, including land cover classification and crop health monitoring, thereby underscoring the model\u2019s potential to support these critical sectors effectively.<\/jats:p>","DOI":"10.3390\/a17040151","type":"journal-article","created":{"date-parts":[[2024,4,4]],"date-time":"2024-04-04T00:42:26Z","timestamp":1712191346000},"page":"151","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":2,"title":["Research on Efficient Feature Generation and Spatial Aggregation for Remote Sensing Semantic Segmentation"],"prefix":"10.3390","volume":"17","author":[{"ORCID":"https:\/\/orcid.org\/0009-0001-8290-3611","authenticated-orcid":false,"given":"Ruoyang","family":"Li","sequence":"first","affiliation":[{"name":"College of Information and Management Science, Henan Agricultural University, Zhengzhou 450046, China"}]},{"given":"Shuping","family":"Xiong","sequence":"additional","affiliation":[{"name":"College of Agronomy, Henan Agricultural University, Zhengzhou 450046, China"}]},{"given":"Yinchao","family":"Che","sequence":"additional","affiliation":[{"name":"College of Information and Management Science, Henan Agricultural University, Zhengzhou 450046, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1536-4427","authenticated-orcid":false,"given":"Lei","family":"Shi","sequence":"additional","affiliation":[{"name":"College of Information and Management Science, Henan Agricultural University, Zhengzhou 450046, China"}]},{"given":"Xinming","family":"Ma","sequence":"additional","affiliation":[{"name":"College of Information and Management Science, Henan Agricultural University, Zhengzhou 450046, China"},{"name":"College of Agronomy, Henan Agricultural University, Zhengzhou 450046, China"}]},{"given":"Lei","family":"Xi","sequence":"additional","affiliation":[{"name":"College of Information and Management Science, Henan Agricultural University, Zhengzhou 450046, China"}]}],"member":"1968","published-online":{"date-parts":[[2024,4,4]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"4786","DOI":"10.1080\/01431161.2018.1434329","article-title":"Estimation of positions and heights from UAV-sensed imagery in tree plantations in agrosilvopastoral systems","volume":"39","author":"Ribeiro","year":"2018","journal-title":"Int. J. Remote Sens."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Long, J., Shelhamer, E., and Darrell, T. (2015). Fully Convolutional Networks for Semantic Segmentation. arXiv.","DOI":"10.1109\/CVPR.2015.7298965"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Zhao, H., Shi, J., Qi, X., Wang, X., and Jia, J. (2017, January 21\u201326). Pyramid scene parsing network. Proceedings of the 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.660"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Chen, L.-C., Zhu, Y., Papandreou, G., Schroff, F., and Adam, H. (2018, January 8\u201314). Encoder-decoder with atrous separable convolution for semantic image segmentation. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01234-2_49"},{"key":"ref_5","unstructured":"Bahdanau, D., Cho, K., and Bengio, Y. (2014). Neural machine translation by jointly learning to align and translate. arXiv."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Sandler, M., Howard, A., Zhu, M., Zhmoginov, A., and Chen, L.-C. (2018, January 18\u201323). Mobilenetv2: Inverted residuals and linear bottlenecks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00474"},{"key":"ref_7","unstructured":"Howard, A., Sandler, M., Chu, G., Chen, L.C., Chen, B., Tan, M., Wang, W., Zhu, Y., Pang, R., and Vasudevan, V. (November, January 27). Searching for mobilenetv3. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Seoul, Republic of Korea."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Hu, J., Shen, L., and Sun, G. (2018, January 18\u201323). Squeeze-and-excitation networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00745"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Zhang, X., Zhou, X., Lin, M., and Sun, J. (2018, January 18\u201323). Shufflenet: An extremely efficient convolutional neural network for mobile devices. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00716"},{"key":"ref_10","first-page":"1963","article-title":"Pelee: A real-time object detection system on mobile devices","volume":"31","author":"Wang","year":"2018","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_11","unstructured":"Howard, A.G., Zhu, M., Chen, B., Kalenichenko, D., Wang, W., Weyand, T., Andreetto, M., and Adam, H. (2017). Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv."},{"key":"ref_12","first-page":"102702","article-title":"Exploring the potential of Chinese GF-6 images for crop mapping in regions with complex agricultural landscapes","volume":"107","author":"Xia","year":"2022","journal-title":"Int. J. Appl. Earth Obs. Geoinf."},{"key":"ref_13","unstructured":"Wang, J. (2021). LoveDA: A Remote Sensing Land-Cover Dataset for Domain Adaptive Semantic Segmentation. arXiv."},{"key":"ref_14","unstructured":"Alemohammad, H., and Booth, K. (2020). LandCoverNet: A global benchmark land cover classification training dataset. arXiv."},{"key":"ref_15","unstructured":"Parmar, V., Bhatia, N., Negi, S., and Suri, M. (2020). Exploration of optimized semantic segmentation architectures for edge-deployment on drones. arXiv."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Noh, H., Hong, S., and Han, B. (2015, January 7\u201313). Learning deconvolution network for semantic segmentation. Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.178"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Fan, M., Lai, S., Huang, J., Wei, X., Chai, Z., Luo, J., and Wei, X. (2021, January 20\u201325). Rethinking bisenet for real-time semantic segmentation. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.00959"},{"key":"ref_18","unstructured":"Kampffmeyer, M., Salberg, A.-B., and Jenssen, R. (July, January 26). Semantic segmentation of small objects and modeling of uncertainty in urban remote sensing images using deep convolutional neural networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, Las Vegas, NV, USA."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Luo, J.H., Wu, J., and Lin, W. (2017, January 22\u201329). Thinet: A filter level pruning method for deep neural network compression. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.541"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Yu, R., Li, A., Chen, C.F., Lai, J.H., Morariu, V.I., Han, X., Gao, M., Lin, C.Y., and Davis, L.S. (2018, January 18\u201323). Nisp: Pruning networks using neuron importance score propagation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00958"},{"key":"ref_21","first-page":"1608","article-title":"Learning versatile filters for efficient convolutional neural networks","volume":"31","author":"Wang","year":"2018","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Huang, Z., and Wang, N. (2018, January 8\u201314). Data-driven sparse structure selection for deep neural networks. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01270-0_19"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"114532","DOI":"10.1016\/j.eswa.2020.114532","article-title":"Optimized HRNet for image semantic segmentation","volume":"174","author":"Wu","year":"2021","journal-title":"Expert Syst. Appl."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Lin, G., Milan, A., Shen, C., and Reid, I. (2017, January 21\u201326). Refinenet: Multi-path refinement networks for high-resolution semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.549"},{"key":"ref_25","first-page":"1853","article-title":"Symbolic graph reasoning meets convolutions","volume":"31","author":"Liang","year":"2018","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_26","unstructured":"Ding, X., Guo, Y., Ding, G., and Han, J. (November, January 27). Acnet: Strengthening the kernel skeletons for powerful cnn via asymmetric convolution blocks. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Seoul, Republic of Korea."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"39","DOI":"10.1037\/0096-3445.115.1.39","article-title":"Attention, similarity, and the identification\u2013categorization relationship","volume":"115","author":"Nosofsky","year":"1986","journal-title":"J. Exp. Psychol. Gen."},{"key":"ref_28","first-page":"4814","article-title":"Devil in the details: Towards accurate single and multiple human parsing","volume":"33","author":"Ruan","year":"2019","journal-title":"Proc. AAAI Conf. Artif. Intell."},{"key":"ref_29","first-page":"3508","article-title":"Hierarchical human semantic parsing with comprehensive part-relation modeling","volume":"44","author":"Wang","year":"2021","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"82031","DOI":"10.1109\/ACCESS.2021.3086020","article-title":"U-net and its variants for medical image segmentation: A review of theory and applications","volume":"9","author":"Siddique","year":"2021","journal-title":"IEEE Access"},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"2481","DOI":"10.1109\/TPAMI.2016.2644615","article-title":"Segnet: A deep convolutional encoder-decoder architecture for image segmentation","volume":"39","author":"Badrinarayanan","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."}],"container-title":["Algorithms"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1999-4893\/17\/4\/151\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T14:23:13Z","timestamp":1760106193000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1999-4893\/17\/4\/151"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,4,4]]},"references-count":31,"journal-issue":{"issue":"4","published-online":{"date-parts":[[2024,4]]}},"alternative-id":["a17040151"],"URL":"https:\/\/doi.org\/10.3390\/a17040151","relation":{},"ISSN":["1999-4893"],"issn-type":[{"value":"1999-4893","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,4,4]]}}}