{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,10]],"date-time":"2026-01-10T01:32:52Z","timestamp":1768008772194,"version":"3.49.0"},"reference-count":58,"publisher":"MDPI AG","issue":"24","license":[{"start":{"date-parts":[[2023,12,13]],"date-time":"2023-12-13T00:00:00Z","timestamp":1702425600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["42001362"],"award-info":[{"award-number":["42001362"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>This paper presents the MSSFF (multistage spectral\u2013spatial feature fusion) framework, which introduces a novel approach for semantic segmentation from hyperspectral imagery (HSI). The framework aims to simplify the modeling of spectral relationships in HSI sequences and unify the architecture for semantic segmentation of HSIs. It incorporates a spectral\u2013spatial feature fusion module and a multi-attention mechanism to efficiently extract hyperspectral features. The MSSFF framework reevaluates the potential impact of spectral and spatial features on segmentation models and leverages the spectral\u2013spatial fusion module (SSFM) in the encoder component to effectively extract and enhance these features. Additionally, an efficient Transformer (ET) is introduced in the skip connection part of deep features to capture long-term dependent features and extract global spectral\u2013spatial information from the entire feature map. This highlights the significant potential of Transformers in modeling spectral\u2013spatial feature maps within the context of hyperspectral remote sensing. Moreover, a spatial attention mechanism is adopted in the shallow skip connection part to extract local features. The framework demonstrates promising capabilities in hyperspectral remote sensing applications. The conducted experiments provide valuable insights for optimizing the model depth and the order of feature fusion, thereby contributing to the advancement of hyperspectral semantic segmentation research.<\/jats:p>","DOI":"10.3390\/rs15245717","type":"journal-article","created":{"date-parts":[[2023,12,13]],"date-time":"2023-12-13T08:55:16Z","timestamp":1702457716000},"page":"5717","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":4,"title":["MSSFF: Advancing Hyperspectral Classification through Higher-Accuracy Multistage Spectral\u2013Spatial Feature Fusion"],"prefix":"10.3390","volume":"15","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-5723-4789","authenticated-orcid":false,"given":"Yuhan","family":"Chen","sequence":"first","affiliation":[{"name":"School of Remote Sensing and Geomatics Engineering, Nanjing University of Information Science and Technology, Nanjing 210044, China"},{"name":"Qingdao Innovation and Development Center (Base), Harbin Engineering University, Qingdao 266000, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6693-957X","authenticated-orcid":false,"given":"Qingyun","family":"Yan","sequence":"additional","affiliation":[{"name":"School of Remote Sensing and Geomatics Engineering, Nanjing University of Information Science and Technology, Nanjing 210044, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9622-5041","authenticated-orcid":false,"given":"Weimin","family":"Huang","sequence":"additional","affiliation":[{"name":"Faculty of Engineering and Applied Science, Memorial University, St. John\u2019s, NL A1B 3X5, Canada"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2023,12,13]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"101","DOI":"10.1109\/MGRS.2019.2902525","article-title":"Hypersectral Imaging for Military and Security Applications: Combining Myriad Processing and Sensing Techniques","volume":"7","author":"Shimoni","year":"2019","journal-title":"IEEE Geosci. Remote Sens. Mag."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"523","DOI":"10.1016\/B978-0-444-63977-6.00021-3","article-title":"Hyperspectral imaging in medical applications","volume":"32","author":"Fei","year":"2019","journal-title":"Data Handling in Science and Technology"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Liu, H., Yu, T., Hu, B., Hou, X., Zhang, Z., Liu, X., Liu, J., Wang, X., Zhong, J., and Tan, Z. (2021). Uav-borne hyperspectral imaging remote sensing system based on acousto-optic tunable filter for water quality monitoring. Remote Sens., 13.","DOI":"10.3390\/rs13204069"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"5506305","DOI":"10.1109\/LGRS.2021.3079317","article-title":"Multitask Learning of Alfalfa Nutritive Value From UAV-Based Hyperspectral Images","volume":"19","author":"Feng","year":"2022","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"8693","DOI":"10.1109\/TGRS.2020.3047363","article-title":"Exploring the relationship between 2D\/3D convolution for hyperspectral image super-resolution","volume":"59","author":"Li","year":"2021","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"279","DOI":"10.1016\/j.isprsjprs.2019.09.006","article-title":"Deep learning classifiers for hyperspectral imaging: A review","volume":"158","author":"Paoletti","year":"2019","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"4544","DOI":"10.1109\/TGRS.2016.2543748","article-title":"Spectral\u2013spatial feature extraction for hyperspectral image classification: A dimension reduction and deep learning approach","volume":"54","author":"Zhao","year":"2016","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"4581","DOI":"10.1109\/TGRS.2018.2828029","article-title":"SuperPCA: A superpixelwise PCA approach for unsupervised feature extraction of hyperspectral imagery","volume":"56","author":"Jiang","year":"2018","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"1510","DOI":"10.1109\/LGRS.2018.2852143","article-title":"Sea ice sensing from GNSS-R data using convolutional neural networks","volume":"15","author":"Yan","year":"2018","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Chen, Y., Yan, Q., and Huang, W. (2023). MFTSC: A Semantically Constrained Method for Urban Building Height Estimation Using Multiple Source Images. Remote Sens., 15.","DOI":"10.3390\/rs15235552"},{"key":"ref_11","first-page":"1500305","article-title":"Inland Water Mapping Based on GA-LinkNet from CyGNSS Data","volume":"20","author":"Yan","year":"2022","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_12","first-page":"1","article-title":"Leveraging Machine Learning for Enhanced Business Intelligence","volume":"7","author":"Bharadiya","year":"2023","journal-title":"Int. J. Comput. Sci. Technol."},{"key":"ref_13","unstructured":"Dhamo, H., Navab, N., and Tombari, F. (November, January 27). Object-driven multi-layer scene decomposition from a single image. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Seoul, Repbulic of Korea."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"88","DOI":"10.1016\/j.neucom.2016.09.010","article-title":"Convolutional neural networks for hyperspectral image classification","volume":"219","author":"Yu","year":"2017","journal-title":"Neurocomputing"},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"6232","DOI":"10.1109\/TGRS.2016.2584107","article-title":"Deep Feature Extraction and Classification of Hyperspectral Images Based on Convolutional Neural Networks","volume":"54","author":"Chen","year":"2016","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"7570","DOI":"10.1109\/JSTARS.2021.3099118","article-title":"Hyperspectral Image Classification Using a Hybrid 3D-2D Convolutional Neural Networks","volume":"14","author":"Ghaderizadeh","year":"2021","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"2448","DOI":"10.1109\/TGRS.2020.3005623","article-title":"Geometry-Aware Deep Recurrent Neural Networks for Hyperspectral Image Classification","volume":"59","author":"Hao","year":"2021","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"292","DOI":"10.1109\/LGRS.2017.2786272","article-title":"Classification of Hyperspectral Imagery Using a New Fully Convolutional Neural Network","volume":"15","author":"Li","year":"2018","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"5518615","DOI":"10.1109\/TGRS.2021.3130716","article-title":"SpectralFormer: Rethinking Hyperspectral Image Classification With Transformers","volume":"60","author":"Hong","year":"2022","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Chen, Y., Liu, P., Zhao, J., Huang, K., and Yan, Q. (2023). Shallow-Guided Transformer for Semantic Segmentation of Hyperspectral Remote Sensing Imagery. Remote Sens., 15.","DOI":"10.3390\/rs15133366"},{"key":"ref_21","first-page":"1228002","article-title":"Hyperspectral Remote-Sensing Classification Combining Transformer and Multiscale Residual Mechanisms","volume":"60","author":"Chen","year":"2023","journal-title":"Laser Optoelectron. Prog."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Chen, Y., and Yan, Q. (2022, January 19\u201321). Vision Transformer is Required for Hyperspectral Semantic Segmentation. Proceedings of the 2022 5th International Conference on Pattern Recognition and Artificial Intelligence (PRAI), Chengdu, China.","DOI":"10.1109\/PRAI55851.2022.9904012"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"5523815","DOI":"10.1109\/TGRS.2023.3314550","article-title":"Multiscale Neighborhood Attention Transformer With Optimized Spatial Pattern for Hyperspectral Image Classification","volume":"61","author":"Qiao","year":"2023","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"5530515","DOI":"10.1109\/TGRS.2022.3179513","article-title":"Unsupervised Hyperspectral Band Selection via Hybrid Graph Convolutional Network","volume":"60","author":"Yu","year":"2022","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"5512505","DOI":"10.1109\/LGRS.2023.3316732","article-title":"Graph Guided Transformer: An Image-Based Global Learning Framework for Hyperspectral Image Classification","volume":"20","author":"Shi","year":"2023","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_26","first-page":"5532513","article-title":"MSTNet: A multilevel spectral\u2013spatial transformer network for hyperspectral image classification","volume":"60","author":"Yu","year":"2022","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"11709","DOI":"10.1109\/TCYB.2021.3070577","article-title":"A spectral-spatial-dependent global learning framework for insufficient and imbalanced hyperspectral image classification","volume":"52","author":"Zhu","year":"2021","journal-title":"IEEE Trans. Cybern."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"49","DOI":"10.1016\/j.isprsjprs.2014.04.004","article-title":"Land cover classification of finer resolution remote sensing data integrating temporal features from time series coarser resolution data","volume":"93","author":"Jia","year":"2014","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"652","DOI":"10.1109\/JPROC.2012.2197589","article-title":"Advances in spectral-spatial classification of hyperspectral images","volume":"101","author":"Fauvel","year":"2012","journal-title":"Proc. IEEE"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"1045","DOI":"10.1080\/10106049.2015.1110207","article-title":"Segmentation-based classification of hyperspectral imagery using projected and correlation clustering techniques","volume":"31","author":"Mehta","year":"2016","journal-title":"Geocarto Int."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Li, Y., Zhang, H., and Shen, Q. (2017). Spectral\u2013spatial classification of hyperspectral imagery with 3D convolutional neural network. Remote Sens., 9.","DOI":"10.3390\/rs9010067"},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"790","DOI":"10.1007\/s10851-019-00925-9","article-title":"A two-stage method for spectral\u2013spatial classification of hyperspectral images","volume":"62","author":"Chan","year":"2020","journal-title":"J. Math. Imaging Vis."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"5387","DOI":"10.1109\/JSTARS.2023.3283342","article-title":"Rotation is All You Need: Cross Dimensional Residual Interaction for Hyperspectral Image Classification","volume":"16","author":"Qiao","year":"2023","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Liu, J.J., Hou, Q., Cheng, M.M., Wang, C., and Feng, J. (2020, January 13\u201319). Improving convolutional networks with self-calibrated convolutions. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01011"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Li, J., Wen, Y., and He, L. (2023, January 18\u201322). SCConv: Spatial and Channel Reconstruction Convolution for Feature Redundancy. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada.","DOI":"10.1109\/CVPR52729.2023.00596"},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"5115","DOI":"10.1109\/JSTARS.2022.3185125","article-title":"Multiscale adaptive convolution for hyperspectral image classification","volume":"15","author":"Ren","year":"2022","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_37","first-page":"5500917","article-title":"A novel hyperspectral image classification model using bole convolution with three-direction attention mechanism: Small sample and unbalanced learning","volume":"61","author":"Cai","year":"2022","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Woo, S., Park, J., Lee, J.Y., and Kweon, I.S. (2018, January 8\u201314). Cbam: Convolutional block attention module. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01234-2_1"},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Hu, J., Shen, L., and Sun, G. (2018, January 18\u201322). Squeeze-and-excitation networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00745"},{"key":"ref_41","unstructured":"Fu, J., Liu, J., Tian, H., Li, Y., Bao, Y., Fang, Z., and Lu, H. (November, January 27). Dual attention network for scene segmentation. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seoul, Repbulic of Korea."},{"key":"ref_42","unstructured":"Li, X., Wang, W., Hu, X., and Yang, J. (November, January 27). Selective kernel networks. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seoul, Repbulic of Korea."},{"key":"ref_43","unstructured":"Wang, S., Li, B.Z., Khabsa, M., Fang, H., and Ma, H. (2020). Linformer: Self-attention with linear complexity. arXiv."},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Liu, Z., Lin, Y., Cao, Y., Hu, H., Wei, Y., Zhang, Z., Lin, S., and Guo, B. (2021, January 11\u201317). Swin transformer: Hierarchical vision transformer using shifted windows. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Montreal, BC, Canada.","DOI":"10.1109\/ICCV48922.2021.00986"},{"key":"ref_45","unstructured":"Jiang, Y., Chang, S., and Wang, Z. (2021). Transgan: Two transformers can make one strong gan. arXiv."},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"12581","DOI":"10.1109\/TPAMI.2023.3282631","article-title":"Uniformer: Unifying convolution and self-attention for visual recognition","volume":"45","author":"Li","year":"2023","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_47","first-page":"12077","article-title":"SegFormer: Simple and efficient design for semantic segmentation with transformers","volume":"34","author":"Xie","year":"2021","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Zhao, H., Shi, J., Qi, X., Wang, X., and Jia, J. (2017, January 21\u201326). Pyramid scene parsing network. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.660"},{"key":"ref_49","unstructured":"Chen, L.C., Papandreou, G., Schroff, F., and Adam, H. (2017). Rethinking atrous convolution for semantic image segmentation. arXiv."},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"He, M., Li, B., and Chen, H. (2017, January 17\u201320). Multi-scale 3D deep convolutional neural network for hyperspectral image classification. Proceedings of the 2017 IEEE International Conference on Image Processing (ICIP), Beijing, China.","DOI":"10.1109\/ICIP.2017.8297014"},{"key":"ref_51","doi-asserted-by":"crossref","first-page":"277","DOI":"10.1109\/LGRS.2019.2918719","article-title":"HybridSN: Exploring 3-D\u20132-D CNN feature hierarchy for hyperspectral image classification","volume":"17","author":"Roy","year":"2019","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_52","doi-asserted-by":"crossref","first-page":"7831","DOI":"10.1109\/TGRS.2020.3043267","article-title":"Attention-based adaptive spectral\u2013spatial kernel ResNet for hyperspectral image classification","volume":"59","author":"Roy","year":"2020","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_53","unstructured":"Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G., and Gelly, S. (2020). An image is worth 16 \u00d7 16 words: Transformers for image recognition at scale. arXiv."},{"key":"ref_54","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1109\/TGRS.2022.3231215","article-title":"Spectral\u2013spatial feature tokenization transformer for hyperspectral image classification","volume":"60","author":"Sun","year":"2022","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_55","doi-asserted-by":"crossref","unstructured":"Ronneberger, O., Fischer, P., and Brox, T. (2015, January 5\u20139). U-net: Convolutional networks for biomedical image segmentation. Proceedings of the 18th International Conference, Munich, Germany.","DOI":"10.1007\/978-3-319-24574-4_28"},{"key":"ref_56","doi-asserted-by":"crossref","unstructured":"Misra, D., Nalamada, T., Arasanipalai, A.U., and Hou, Q. (2021, January 5\u20139). Rotate to attend: Convolutional triplet attention module. Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision, Virtual Conference.","DOI":"10.1109\/WACV48630.2021.00318"},{"key":"ref_57","doi-asserted-by":"crossref","first-page":"5506105","DOI":"10.1109\/LGRS.2023.3287277","article-title":"Hybrid Conv-ViT Network for Hyperspectral Image Classification","volume":"20","author":"Yan","year":"2023","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_58","doi-asserted-by":"crossref","first-page":"5519115","DOI":"10.1109\/TGRS.2023.3301310","article-title":"SSRNet: A Lightweight Successive Spatial Rectified Network with Non-Central Positional Sampling Strategy for Hyperspectral Images Classification","volume":"61","author":"Song","year":"2023","journal-title":"IEEE Trans. Geosci. Remote Sens."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/15\/24\/5717\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T21:38:06Z","timestamp":1760132286000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/15\/24\/5717"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,12,13]]},"references-count":58,"journal-issue":{"issue":"24","published-online":{"date-parts":[[2023,12]]}},"alternative-id":["rs15245717"],"URL":"https:\/\/doi.org\/10.3390\/rs15245717","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,12,13]]}}}