{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,23]],"date-time":"2026-03-23T17:13:45Z","timestamp":1774286025500,"version":"3.50.1"},"reference-count":48,"publisher":"MDPI AG","issue":"3","license":[{"start":{"date-parts":[[2026,3,23]],"date-time":"2026-03-23T00:00:00Z","timestamp":1774224000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Research Funding of Wuhan Polytechnic University","award":["2023RZ036"],"award-info":[{"award-number":["2023RZ036"]}]},{"name":"Research Fund of Hubei Provincial Department of Education Scientific Research Plan Guiding Project","award":["B2021122"],"award-info":[{"award-number":["B2021122"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["J. Imaging"],"abstract":"<jats:p>Semantic segmentation of remote sensing images (RSIs) is a fundamental task in geoscience research. However, designing efficient feature fusion modules remains challenging for existing dual-branch or multi-branch architectures. Furthermore, existing deep learning-based architectures predominantly concentrate on spatial feature modeling and context capturing while inherently neglecting the exploration and utilization of critical frequency-domain features, which is crucial for addressing issues of semantic confusion and blurred boundaries in complex remote sensing scenes. To address the challenges of feature fusion and the lack of frequency-domain information, we propose a novel dual-path feature extraction network (DFENet) in this paper. Specifically, a dual-path module (DPM) is developed in DFENet to extract global and local features, respectively. In the global path, after applying the channel splitting strategy, four feature extraction strategies are innovatively integrated to extract global features from different granularities. According to the strategy of supplementing frequency-domain information, a frequency-domain feature extraction block (FFEB) dominated by discrete Wavelet transform (DWT) is designed to effectively captures both high- and low-frequency components. Experimental results show that our method outperforms existing state-of-the-art methods in terms of segmentation performance, achieving a mean intersection over union (mIoU) of 83.09% on the ISPRS Vaihingen dataset and 86.05% on the ISPRS Potsdam dataset.<\/jats:p>","DOI":"10.3390\/jimaging12030141","type":"journal-article","created":{"date-parts":[[2026,3,23]],"date-time":"2026-03-23T16:40:20Z","timestamp":1774284020000},"page":"141","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["DFENet: A Novel Dual-Path Feature Extraction Network for Semantic Segmentation of Remote Sensing Images"],"prefix":"10.3390","volume":"12","author":[{"ORCID":"https:\/\/orcid.org\/0009-0005-1848-0051","authenticated-orcid":false,"given":"Li","family":"Cao","sequence":"first","affiliation":[{"name":"School of Electrical and Electronic Engineering, Wuhan Polytechnic University, Wuhan 430023, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0009-5040-6379","authenticated-orcid":false,"given":"Zishang","family":"Liu","sequence":"additional","affiliation":[{"name":"School of Electrical and Electronic Engineering, Wuhan Polytechnic University, Wuhan 430023, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0009-0503-4879","authenticated-orcid":false,"given":"Yan","family":"Wang","sequence":"additional","affiliation":[{"name":"School of Electrical and Electronic Engineering, Wuhan Polytechnic University, Wuhan 430023, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Run","family":"Gao","sequence":"additional","affiliation":[{"name":"School of Electrical and Electronic Engineering, Wuhan Polytechnic University, Wuhan 430023, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2026,3,23]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"549","DOI":"10.1038\/s41598-024-84134-4","article-title":"DSIA U-Net: Deep shallow interaction with attention mechanism UNet for remote sensing satellite images","volume":"15","author":"Jonnala","year":"2025","journal-title":"Sci. Rep."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"16099","DOI":"10.1038\/s41598-025-99322-z","article-title":"AER U-Net: Attention-enhanced multi-scale residual U-Net structure for water body segmentation using Sentinel-2 satellite images","volume":"15","author":"Jonnala","year":"2025","journal-title":"Sci. Rep."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Long, J., Shelhamer, E., and Darrell, T. (2015, January 7\u201312). Fully convolutional networks for semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298965"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Ronneberger, O., Fischer, P., and Brox, T. (2015). U-Net: Convolutional networks for biomedical image segmentation. International Conference on Medical Image Computing and Computer-Assisted Intervention, Springer.","DOI":"10.1007\/978-3-319-24574-4_28"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Zhao, H., Shi, J., Qi, X., Wang, X., and Jia, J. (2017, January 21\u201326). Pyramid Scene Parsing Network. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.660"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Kirillov, A., Girshick, R., He, K.-M., and Dollar, P. (2019, January 15\u201320). Panoptic Feature Pyramid Networks. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00656"},{"key":"ref_7","unstructured":"Ma, A.-L., Wang, J.-J., Zhong, Y.-F., and Zheng, Z. (2020, January 13\u201319). Foreground-Aware Relation Network for Geospatial Object Segmentation in High Spatial Resolution Remote Sensing Imagery. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"110415","DOI":"10.1016\/j.knosys.2023.110415","article-title":"Orientation Attention Network for semantic segmentation of remote sensing images","volume":"267","author":"Wang","year":"2023","journal-title":"Knowl. Based Syst."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"4701517","DOI":"10.1109\/TGRS.2024.3504733","article-title":"Stair fusion network with context-refined attention for remote sensing image semantic segmentation","volume":"62","author":"Liu","year":"2024","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"5400515","DOI":"10.1109\/TGRS.2023.3334294","article-title":"Unsupervised domain adaptation augmented by mutually boosted attention for semantic segmentation of VHR remote sensing images","volume":"61","author":"Ma","year":"2023","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"103280","DOI":"10.1016\/j.media.2024.103280","article-title":"TransUNet: Rethinking the U-Net architecture design for medical image segmentation through the lens of transformers","volume":"97","author":"Chen","year":"2024","journal-title":"Med. Image Anal."},{"key":"ref_12","unstructured":"Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G., and Gelly, S. (2020). An image is worth 16 \u00d7 16 words: Transformers for image recognition at scale. arXiv."},{"key":"ref_13","unstructured":"Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, \u0141, and Polosukhin, I. (2017). Attention is all you need. Adv. Neural Inf. Process. Syst., 30."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"4407115","DOI":"10.1109\/TGRS.2023.3300706","article-title":"Boundary-aware multiscale learning perception for remote sensing image segmentation","volume":"61","author":"You","year":"2023","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_15","first-page":"5613415","article-title":"Mmt: Mixed-mask transformer for remote sensing image semantic segmentation","volume":"61","author":"Xu","year":"2023","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_16","unstructured":"Gu, A., and Dao, T. (2023). Mamba: Linear-time sequence modeling with selective state spaces. arXiv."},{"key":"ref_17","unstructured":"Gu, A., Goel, K., and R\u00e9, C. (2021). Efficiently modeling long sequences with structured state spaces. arXiv."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"He, X., Cao, K., Yan, K., Li, R., Xie, C., Zhang, J., and Zhou, M. (2024). Pan-Mamba: Effective pan-sharpening with state space model. arXiv.","DOI":"10.1016\/j.inffus.2024.102779"},{"key":"ref_19","first-page":"8002605","article-title":"RS-Mamba: Remote sensing image classification with state space model","volume":"21","author":"Chen","year":"2024","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Lin, T.-Y., Doll\u00e1r, P., Girshick, R., He, K., Hariharan, B., and Belongie, S. (2017, January 21\u201326). Feature pyramid networks for object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.106"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Tan, M., Pang, R., and Le, Q.V. (2020, January 13\u201319). EfficientDet: Scalable and efficient object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01079"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Qiao, S., Chen, L.-C., and Yuille, A. (2021, January 20\u201325). DetectoRS: Detecting objects with recursive feature pyramid and switchable atrous convolution. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.01008"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"6011405","DOI":"10.1109\/LGRS.2024.3414293","article-title":"RS-3-Mamba: Visual state space model for remote sensing image semantic segmentation","volume":"21","author":"Ma","year":"2024","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_24","unstructured":"Lu, W., Chen, S.-B., Ding, C.H.Q., Tang, J., and Luo, B. (2025). LWGANet: A lightweight group attention backbone for remote sensing visual tasks. arXiv."},{"key":"ref_25","unstructured":"Hwang, S., Han, D., Jung, C., and Jeon, M. (2024). WaveDH: Wavelet sub-bands guided ConvNet for efficient image dehazing. arXiv."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Xu, Z., Zhang, W., Zhang, T., Yang, Z., and Li, J. (2021). Efficient Transformer for remote sensing image segmentation. Remote Sens., 13.","DOI":"10.3390\/rs13183585"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Liu, Z., Lin, Y., Cao, Y., Hu, H., Wei, Y., Zhang, Z., Lin, S., and Guo, B. (2021, January 10\u201317). Swin Transformer: Hierarchical vision transformer using shifted windows. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Montreal, QC, Canada.","DOI":"10.1109\/ICCV48922.2021.00986"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Yang, C., Wang, Y., Zhang, J., Zhang, H., Wei, Z., Lin, Z., and Yuille, A. (2022, January 18\u201324). Lite Vision Transformer with enhanced self-attention. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA.","DOI":"10.1109\/CVPR52688.2022.01169"},{"key":"ref_29","unstructured":"Zhu, L., Liao, B., Zhang, Q., Wang, X., Liu, W., and Wang, X. (2024). Vision Mamba: Efficient visual representation learning with bidirectional state space model. arXiv."},{"key":"ref_30","first-page":"103031","article-title":"VMamba: Visual state space model","volume":"37","author":"Liu","year":"2024","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"107638","DOI":"10.1016\/j.engappai.2023.107638","article-title":"Frequency-aware robust multidimensional information fusion framework for remote sensing image segmentation","volume":"129","author":"Fan","year":"2024","journal-title":"Eng. Appl. Artif. Intell."},{"key":"ref_32","unstructured":"Zou, Z., Yu, H., Huang, J., and Zhao, F. (November, January 28). FreqMamba: Viewing Mamba from a frequency perspective for image deraining. Proceedings of the 32nd ACM International Conference on Multimedia, Melbourne, VIC, Australia."},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Li, Y., Liu, Z., Yang, J., and Zhang, H. (2023). Wavelet transform feature enhancement for semantic segmentation of remote sensing images. Remote Sens., 15.","DOI":"10.3390\/rs15245644"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Zhou, Y., Huang, J., Wang, C., Song, L., and Yang, G. (2023, January 1\u20136). XNet: Wavelet-based low and high frequency fusion networks for semantic segmentation. Proceedings of the IEEE\/CVF International Conference on Computer Vision (ICCV), Paris, France.","DOI":"10.1109\/ICCV51070.2023.01928"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Wei, G., Xu, J., Yan, W., Chong, Q., Xing, H., and Ni, M. (2024). Dual-domain fusion network based on wavelet frequency decomposition and fuzzy spatial constraint for remote sensing image segmentation. Remote Sens., 16.","DOI":"10.3390\/rs16193594"},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"1600","DOI":"10.1007\/s40815-021-01053-6","article-title":"Wavelet K-means clustering and fuzzy-based method for segmenting MRI images depicting Parkinson\u2019s disease","volume":"23","author":"Huang","year":"2021","journal-title":"Int. J. Fuzzy Syst."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"00368504241232537","DOI":"10.1177\/00368504241232537","article-title":"WET-UNet: Wavelet integrated efficient transformer networks for nasopharyngeal carcinoma tumor segmentation","volume":"107","author":"Zeng","year":"2024","journal-title":"Sci. Prog."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"196","DOI":"10.1016\/j.isprsjprs.2022.06.008","article-title":"UNetFormer: A UNet-like transformer for efficient semantic segmentation of remote sensing urban scene imagery","volume":"190","author":"Wang","year":"2022","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_39","unstructured":"Hendrycks, D., and Gimpel, K. (2016). Gaussian error linear units (GELUs). arXiv."},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Liu, Y., Meng, F., Zhang, J., Zhou, J., Chen, Y., and Xu, J. (2019). CM-Net: A novel collaborative memory network for spoken language understanding. arXiv.","DOI":"10.18653\/v1\/D19-1097"},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Zeng, Y., Luo, A., Zhan, K., Li, J., Zhang, Y., and Hu, K. (July, January 30). Multiscale feature enhancement and adaptive receptive field for tiny object detection in remote sensing images. Proceedings of the 2025 International Conference on Multimedia Retrieval, Chicago, IL, USA.","DOI":"10.1145\/3731715.3733404"},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Zhao, C., Cai, W., Dong, C., and Hu, C. (2024, January 16\u201322). Wavelet-based Fourier information interaction with frequency diffusion adjustment for underwater image restoration. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR52733.2024.00791"},{"key":"ref_43","first-page":"3000617","article-title":"SFFNet: A wavelet-based spatial and frequency domain fusion network for remote sensing segmentation","volume":"62","author":"Yang","year":"2024","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_44","first-page":"1","article-title":"Multistage attention ResU-Net for semantic segmentation of fine-resolution remote sensing images","volume":"19","author":"Li","year":"2021","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_45","unstructured":"Chen, J., Lu, Y., Yu, Q., Luo, X., Adeli, E., Wang, Y., Lu, L., Yuille, A.L., and Zhou, Y. (2021). TransUNet: Transformers make strong encoders for medical image segmentation. arXiv."},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"2004612","DOI":"10.1109\/TGRS.2023.3314641","article-title":"CMTFNet: CNN and multiscale transformer fusion network for remote-sensing image semantic segmentation","volume":"61","author":"Wu","year":"2023","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_47","first-page":"6001205","article-title":"UNetMamba: An efficient UNet-like Mamba for semantic segmentation of high-resolution remote sensing images","volume":"22","author":"Zhu","year":"2024","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"14","DOI":"10.1016\/j.eswa.2024.124950","article-title":"A dual encoder crack segmentation network with Haar wavelet-based high\u2013low frequency attention","volume":"256","author":"Zhang","year":"2024","journal-title":"Expert Syst. Appl."}],"container-title":["Journal of Imaging"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2313-433X\/12\/3\/141\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,3,23]],"date-time":"2026-03-23T16:42:10Z","timestamp":1774284130000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2313-433X\/12\/3\/141"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,3,23]]},"references-count":48,"journal-issue":{"issue":"3","published-online":{"date-parts":[[2026,3]]}},"alternative-id":["jimaging12030141"],"URL":"https:\/\/doi.org\/10.3390\/jimaging12030141","relation":{},"ISSN":["2313-433X"],"issn-type":[{"value":"2313-433X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2026,3,23]]}}}