{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,16]],"date-time":"2026-04-16T16:33:13Z","timestamp":1776357193004,"version":"3.51.2"},"reference-count":43,"publisher":"MDPI AG","issue":"8","license":[{"start":{"date-parts":[[2019,4,23]],"date-time":"2019-04-23T00:00:00Z","timestamp":1555977600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61805181, 61705170, 61605146"],"award-info":[{"award-number":["61805181, 61705170, 61605146"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Many deep learning models, such as convolutional neural network (CNN) and recurrent neural network (RNN), have been successfully applied to extracting deep features for hyperspectral tasks. Hyperspectral image classification allows distinguishing the characterization of land covers by utilizing their abundant information. Motivated by the attention mechanism of the human visual system, in this study, we propose a spectral-spatial attention network for hyperspectral image classification. In our method, RNN with attention can learn inner spectral correlations within a continuous spectrum, while CNN with attention is designed to focus on saliency features and spatial relevance between neighboring pixels in the spatial dimension. Experimental results demonstrate that our method can fully utilize the spectral and spatial information to obtain competitive performance.<\/jats:p>","DOI":"10.3390\/rs11080963","type":"journal-article","created":{"date-parts":[[2019,4,24]],"date-time":"2019-04-24T03:14:28Z","timestamp":1556075668000},"page":"963","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":263,"title":["Spectral-Spatial Attention Networks for Hyperspectral Image Classification"],"prefix":"10.3390","volume":"11","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-0239-8580","authenticated-orcid":false,"given":"Xiaoguang","family":"Mei","sequence":"first","affiliation":[{"name":"Electronic Information School, Wuhan University, Wuhan 430072, China"},{"name":"Institute of Aerospace Science and Technology, Wuhan University, Wuhan 430072, China"}]},{"given":"Erting","family":"Pan","sequence":"additional","affiliation":[{"name":"Electronic Information School, Wuhan University, Wuhan 430072, China"}]},{"given":"Yong","family":"Ma","sequence":"additional","affiliation":[{"name":"Electronic Information School, Wuhan University, Wuhan 430072, China"},{"name":"Institute of Aerospace Science and Technology, Wuhan University, Wuhan 430072, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0190-2538","authenticated-orcid":false,"given":"Xiaobing","family":"Dai","sequence":"additional","affiliation":[{"name":"Electronic Information School, Wuhan University, Wuhan 430072, China"},{"name":"Institute of Aerospace Science and Technology, Wuhan University, Wuhan 430072, China"}]},{"given":"Jun","family":"Huang","sequence":"additional","affiliation":[{"name":"Electronic Information School, Wuhan University, Wuhan 430072, China"},{"name":"Institute of Aerospace Science and Technology, Wuhan University, Wuhan 430072, China"}]},{"given":"Fan","family":"Fan","sequence":"additional","affiliation":[{"name":"Electronic Information School, Wuhan University, Wuhan 430072, China"},{"name":"Institute of Aerospace Science and Technology, Wuhan University, Wuhan 430072, China"}]},{"given":"Qinglei","family":"Du","sequence":"additional","affiliation":[{"name":"Electronic Information School, Wuhan University, Wuhan 430072, China"}]},{"given":"Hong","family":"Zheng","sequence":"additional","affiliation":[{"name":"Electronic Information School, Wuhan University, Wuhan 430072, China"}]},{"given":"Jiayi","family":"Ma","sequence":"additional","affiliation":[{"name":"Electronic Information School, Wuhan University, Wuhan 430072, China"},{"name":"Institute of Aerospace Science and Technology, Wuhan University, Wuhan 430072, China"},{"name":"Hubei Key Laboratory of Advanced Control and Intelligent Automation for Complex Systems, Wuhan 430074, China"}]}],"member":"1968","published-online":{"date-parts":[[2019,4,23]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"120","DOI":"10.1016\/j.isprsjprs.2017.11.021","article-title":"A new deep convolutional neural network for fast hyperspectral image classification","volume":"145","author":"Paoletti","year":"2018","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"22","DOI":"10.1109\/MGRS.2016.2540798","article-title":"Deep learning for remote sensing data: A technical tutorial on the state of the art","volume":"4","author":"Zhang","year":"2016","journal-title":"IEEE Geosci. Remote Sens. Mag."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"2305","DOI":"10.1109\/TGRS.2018.2872850","article-title":"Centroid and Covariance Alignment-Based Domain Adaptation for Unsupervised Classification of Remote Sensing Images","volume":"57","author":"Ma","year":"2019","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"964","DOI":"10.1016\/j.scitotenv.2019.03.186","article-title":"A transferable spectroscopic diagnosis model for predicting arsenic contamination in soil","volume":"669","author":"Tao","year":"2019","journal-title":"Sci. Total. Environ."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"48","DOI":"10.1016\/j.ins.2017.02.044","article-title":"Hyperspectral image denoising with superpixel segmentation and low-rank representation","volume":"397","author":"Fan","year":"2017","journal-title":"Inf. Sci."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"153","DOI":"10.1016\/j.inffus.2018.02.004","article-title":"Infrared and visible image fusion methods and applications: A survey","volume":"45","author":"Ma","year":"2019","journal-title":"Inf. Fusion"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"2094","DOI":"10.1109\/JSTARS.2014.2329330","article-title":"Deep learning-based classification of hyperspectral data","volume":"7","author":"Chen","year":"2014","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"4581","DOI":"10.1109\/TGRS.2018.2828029","article-title":"SuperPCA: A Superpixelwise PCA Approach for Unsupervised Feature Extraction of Hyperspectral Imagery","volume":"56","author":"Jiang","year":"2018","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"3463","DOI":"10.1109\/TGRS.2012.2184292","article-title":"Incremental import vector machines for classifying hyperspectral data","volume":"50","author":"Roscher","year":"2012","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"4816","DOI":"10.1109\/TGRS.2012.2230268","article-title":"Generalized composite kernel framework for hyperspectral image classification","volume":"51","author":"Li","year":"2013","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"851","DOI":"10.1109\/TGRS.2018.2861992","article-title":"Hyperspectral Image Classification in the Presence of Noisy Labels","volume":"57","author":"Jiang","year":"2019","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Hu, W., Huang, Y., Wei, L., Zhang, F., and Li, H. (2015). Deep convolutional neural networks for hyperspectral image classification. J. Sens., 2015.","DOI":"10.1155\/2015\/258619"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"3639","DOI":"10.1109\/TGRS.2016.2636241","article-title":"Deep recurrent neural networks for hyperspectral image classification","volume":"55","author":"Mou","year":"2017","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Wu, H., and Prasad, S. (2017). Convolutional recurrent neural networks forhyperspectral data classification. Remote Sens., 9.","DOI":"10.3390\/rs9030298"},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"4729","DOI":"10.1109\/TGRS.2017.2698503","article-title":"Learning and transferring deep joint spectral\u2013spatial features for hyperspectral classification","volume":"55","author":"Yang","year":"2017","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"6232","DOI":"10.1109\/TGRS.2016.2584107","article-title":"Deep feature extraction and classification of hyperspectral images based on convolutional neural networks","volume":"54","author":"Chen","year":"2016","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"2354","DOI":"10.1109\/TIP.2018.2799324","article-title":"Hyperspectral image classification with Markov random fields and a convolutional neural network","volume":"27","author":"Cao","year":"2018","journal-title":"IEEE Trans. Image Process."},{"key":"ref_18","first-page":"5893","article-title":"Spectral-Spatial Unified Networks for Hyperspectral Image Classification","volume":"56","author":"Xu","year":"2018","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Chen, L.C., Yang, Y., Wang, J., Xu, W., and Yuille, A.L. (2016, January 27\u201330). Attention to scale: Scale-aware semantic image segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.396"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Xu, R., Tao, Y., Lu, Z., and Zhong, Y. (2018). Attention-Mechanism-Containing Neural Networks for High-Resolution Remote Sensing Image Classification. Remote Sens., 10.","DOI":"10.3390\/rs10101602"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Chen, L., Zhang, H., Xiao, J., Nie, L., Shao, J., Liu, W., and Chua, T.S. (2017, January 21\u201326). Sca-cnn: Spatial and channel-wise attention in convolutional networks for image captioning. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.667"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Nam, H., Ha, J.W., and Kim, J. (2017, January 21\u201326). Dual attention networks for multimodal reasoning and matching. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.232"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"512","DOI":"10.1007\/s11263-018-1117-z","article-title":"Locality preserving matching","volume":"127","author":"Ma","year":"2019","journal-title":"Int. J. Comput. Vis."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"765","DOI":"10.1109\/TCYB.2017.2655538","article-title":"Adaptive discrete hypergraph matching","volume":"48","author":"Yan","year":"2018","journal-title":"IEEE Trans. Cybern."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"4435","DOI":"10.1109\/TGRS.2018.2820040","article-title":"Guided locality preserving feature matching for remote sensing image registration","volume":"56","author":"Ma","year":"2018","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Kuen, J., Wang, Z., and Gang, W. (2016, January 21\u201326). Recurrent Attentional Networks for Saliency Detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2016.399"},{"key":"ref_27","unstructured":"Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, \u0141., and Polosukhin, I. (2017, January 4\u20139). Attention is all you need. Proceedings of the Advances in Neural Information Processing Systems, Long Beach, CA, USA."},{"key":"ref_28","unstructured":"Bahdanau, D., Cho, K., and Bengio, Y. (2014). Neural machine translation by jointly learning to align and translate. arXiv."},{"key":"ref_29","unstructured":"Lin, Z., Feng, M., Santos, C.N.d., Yu, M., Xiang, B., Zhou, B., and Bengio, Y. (2017). A structured self-attentive sentence embedding. arXiv."},{"key":"ref_30","unstructured":"Pei, W., Dibeklio\u011flu, H., Baltru\u0161aitis, T., and Tax, D.M. (2017). Attended End-to-end Architecture for Age Estimation from Facial Expression Videos. arXiv."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Graves, A., Mohamed, A.R., and Hinton, G. (2013, January 26\u201331). Speech recognition with deep recurrent neural networks. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, Vancouver, DC, Canada.","DOI":"10.1109\/ICASSP.2013.6638947"},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"517","DOI":"10.1109\/TASLP.2015.2400218","article-title":"From Feedforward to Recurrent LSTM Neural Networks for Language Modeling","volume":"23","author":"Sundermeyer","year":"2015","journal-title":"IEEE\/ACM Trans. Audio Speech Lang. Process."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"1735","DOI":"10.1162\/neco.1997.9.8.1735","article-title":"Long short-term memory","volume":"9","author":"Hochreiter","year":"1997","journal-title":"Neural Comput."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Cho, K., Van Merri\u00ebnboer, B., Gulcehre, C., Bahdanau, D., Bougares, F., Schwenk, H., and Bengio, Y. (2014). Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv.","DOI":"10.3115\/v1\/D14-1179"},{"key":"ref_35","unstructured":"Krizhevsky, A., Sutskever, I., and Hinton, G.E. (2012, January 3\u20136). Imagenet classification with deep convolutional neural networks. Proceedings of the Advances in Neural Information Processing Systems, Lake Tahoe, NV, USA."},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"2530","DOI":"10.1109\/TIP.2018.2887017","article-title":"Multi-memory convolutional neural network for video super-resolution","volume":"28","author":"Wang","year":"2019","journal-title":"IEEE Trans. Image Process."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1016\/j.inffus.2018.09.004","article-title":"FusionGAN: A generative adversarial network for infrared and visible image fusion","volume":"48","author":"Ma","year":"2019","journal-title":"Inf. Fusion"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Tian, T., Li, C., Xu, J., and Ma, J. (2018). Urban area detection in very high resolution remote sensing images using deep convolutional neural networks. Sensors, 18.","DOI":"10.3390\/s18030904"},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Li, Y., Zhang, Y., Huang, X., and Ma, J. (2018). Learning source-invariant deep hashing convolutional neural networks for cross-source remote sensing image retrieval. IEEE Trans. Geosci. Remote. Sens., 1\u201316.","DOI":"10.1109\/TGRS.2018.2839705"},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"20707","DOI":"10.1109\/ACCESS.2017.2757765","article-title":"Robust topological navigation via convolutional neural network feature and sharpness measure","volume":"5","author":"Ma","year":"2017","journal-title":"IEEE Access"},{"key":"ref_41","unstructured":"Jaderberg, M., Simonyan, K., Zisserman, A., and Kavukcuoglu, K. (2015, January 7\u201312). Spatial transformer networks. Proceedings of the Neural Information Processing Systems 2015, Montreal, QC, Canada."},{"key":"ref_42","unstructured":"Kim, J.H., Lee, S.W., Kwak, D.H., Heo, M.O., Kim, J., Ha, J.W., and Zhang, B.T. (2016, January 5\u201310). Multimodal Residual Learning for Visual QA. Proceedings of the 30th International Conference on Neural Information Processing Systems (NIPS\u201916), Barcelona, Spain."},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Yang, Y., Zhong, Z., Shen, T., and Lin, Z. (2018, January 18\u201323). Convolutional Neural Networks with Alternately Updated Clique. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00256"}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/11\/8\/963\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T12:46:26Z","timestamp":1760186786000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/11\/8\/963"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,4,23]]},"references-count":43,"journal-issue":{"issue":"8","published-online":{"date-parts":[[2019,4]]}},"alternative-id":["rs11080963"],"URL":"https:\/\/doi.org\/10.3390\/rs11080963","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,4,23]]}}}