{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,9]],"date-time":"2026-04-09T04:13:04Z","timestamp":1775707984156,"version":"3.50.1"},"reference-count":47,"publisher":"MDPI AG","issue":"24","license":[{"start":{"date-parts":[[2019,12,13]],"date-time":"2019-12-13T00:00:00Z","timestamp":1576195200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61790550"],"award-info":[{"award-number":["61790550"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61790554"],"award-info":[{"award-number":["61790554"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["91538201"],"award-info":[{"award-number":["91538201"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Remote sensing image scene classification (RSISC) is an active task in the remote sensing community and has attracted great attention due to its wide applications. Recently, the deep convolutional neural networks (CNNs)-based methods have witnessed a remarkable breakthrough in performance of remote sensing image scene classification. However, the problem that the feature representation is not discriminative enough still exists, which is mainly caused by the characteristic of inter-class similarity and intra-class diversity. In this paper, we propose an efficient end-to-end local-global-fusion feature extraction (LGFFE) network for a more discriminative feature representation. Specifically, global and local features are extracted from channel and spatial dimensions respectively, based on a high-level feature map from deep CNNs. For the local features, a novel recurrent neural network (RNN)-based attention module is first proposed to capture the spatial layout information and context information across different regions. Gated recurrent units (GRUs) is then exploited to generate the important weight of each region by taking a sequence of features from image patches as input. A reweighed regional feature representation can be obtained by focusing on the key region. Then, the final feature representation can be acquired by fusing the local and global features. The whole process of feature extraction and feature fusion can be trained in an end-to-end manner. Finally, extensive experiments have been conducted on four public and widely used datasets and experimental results show that our method LGFFE outperforms baseline methods and achieves state-of-the-art results.<\/jats:p>","DOI":"10.3390\/rs11243006","type":"journal-article","created":{"date-parts":[[2019,12,13]],"date-time":"2019-12-13T11:27:22Z","timestamp":1576236442000},"page":"3006","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":47,"title":["An End-to-End Local-Global-Fusion Feature Extraction Network for Remote Sensing Image Scene Classification"],"prefix":"10.3390","volume":"11","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-2779-5099","authenticated-orcid":false,"given":"Yafei","family":"Lv","sequence":"first","affiliation":[{"name":"Research Institute of information Fusion, Naval Aviation University, Yantai 264001, China"}]},{"given":"Xiaohan","family":"Zhang","sequence":"additional","affiliation":[{"name":"Research Institute of information Fusion, Naval Aviation University, Yantai 264001, China"}]},{"given":"Wei","family":"Xiong","sequence":"additional","affiliation":[{"name":"Research Institute of information Fusion, Naval Aviation University, Yantai 264001, China"}]},{"given":"Yaqi","family":"Cui","sequence":"additional","affiliation":[{"name":"Research Institute of information Fusion, Naval Aviation University, Yantai 264001, China"}]},{"given":"Mi","family":"Cai","sequence":"additional","affiliation":[{"name":"Research Institute of information Fusion, Naval Aviation University, Yantai 264001, China"}]}],"member":"1968","published-online":{"date-parts":[[2019,12,13]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"4238","DOI":"10.1109\/TGRS.2015.2393857","article-title":"Effective and Efficient Midlevel Visual Elements-Oriented Land-Use Classification Using VHR Remote Sensing Images","volume":"53","author":"Cheng","year":"2015","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"1560","DOI":"10.1109\/JPROC.2015.2449668","article-title":"Multimodal Classification of Remote Sensing Images: A Review and Future Directions","volume":"103","author":"Tuia","year":"2015","journal-title":"Proc. IEEE."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"119","DOI":"10.1016\/j.isprsjprs.2014.10.002","article-title":"Multi-class geospatial object detection and geographic image classification based on collection of part detectors","volume":"98","author":"Cheng","year":"2014","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_4","first-page":"1097","article-title":"ImageNet Classification with Deep Convolutional Neural Networks","volume":"141","author":"Krizhevsky","year":"2012","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"22","DOI":"10.1109\/MGRS.2016.2540798","article-title":"Deep Learning for Remote Sensing Data: A Technical Tutorial on the State of the Art","volume":"4","author":"Zhang","year":"2016","journal-title":"IEEE Geosci. Remote Sens. Mag."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Cheng, G., Ma, C., Zhou, P., Yao, X., and Han, J. (2016, January 10\u201315). Scene classification of high resolution remote sensing images using convolutional neural networks. Proceedings of the International Geoscience and Remote Sensing Symposium, Beijing, China.","DOI":"10.1109\/IGARSS.2016.7729193"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep Residual Learning for Image Recognition. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_8","unstructured":"Simonyan, K., and Zisserman, A. (2015, January 7\u20139). Very Deep Convolutional Networks for Large-Scale Image Recognition. Proceedings of the 2015 International Conference on Learning Representations (ICLR), San Diego, CA, USA."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Huang, G., Liu, Z., Van Der Maaten, L., and Weinberger, K.Q. (2017, January 21\u201326). Densely Connected Convolutional Networks. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.243"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Hu, F., Xia, G., Yang, W., and Zhang, L.J. Recent Advances and Opportunities in Scene Classification of Aerial Images with Deep Models. Proceedings of the 2018 IEEE International Geoscience and Remote Sensing Symposium (IGARSS 2018), Valencia, Spain.","DOI":"10.1109\/IGARSS.2018.8518336"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"2811","DOI":"10.1109\/TGRS.2017.2783902","article-title":"When Deep Learning Meets Metric Learning: Remote Sensing Image Scene Classification via Learning Discriminative CNNs","volume":"56","author":"Cheng","year":"2018","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"7109","DOI":"10.1109\/TGRS.2018.2848473","article-title":"Scene Classification Based on Multiscale Convolutional Neural Network","volume":"56","author":"Liu","year":"2018","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"96","DOI":"10.1016\/j.isprsjprs.2018.01.021","article-title":"Land cover mapping at very high resolution with rotation equivariant CNNs: Towards small yet accurate models","volume":"145","author":"Marcos","year":"2018","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_14","unstructured":"Cheng, G., Zhou, P., and Han, J. (July, January 26). RIFD-CNN: Rotation-Invariant and Fisher Discriminative Convolutional Neural Networks for Object Detection. Proceedings of the Computer Vision and Pattern Recognition, Xi\u2019an, China."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Li, P., Ren, P., Zhang, X., Wang, Q., Zhu, X., and Wang, L. (2018). Region-Wise Deep Feature Representation for Remote Sensing Images. Remote Sens., 10.","DOI":"10.3390\/rs10060871"},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"1779","DOI":"10.1109\/TGRS.2018.2869101","article-title":"Remote Sensing Image Scene Classification Using Rearranged Local Features","volume":"57","author":"Yuan","year":"2019","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"1155","DOI":"10.1109\/TGRS.2018.2864987","article-title":"Scene Classification with Recurrent Attention of VHR Remote Sensing Images","volume":"57","author":"Wang","year":"2019","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1007\/BF00130487","article-title":"Color indexing","volume":"7","author":"Swain","year":"1991","journal-title":"Int. J. Comput. Vis."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"971","DOI":"10.1109\/TPAMI.2002.1017623","article-title":"Multiresolution gray-scale and rotation invariant texture classification with local binary patterns","volume":"24","author":"Ojala","year":"2002","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"295","DOI":"10.1016\/S0031-3203(96)00068-4","article-title":"Object detection using gabor filters","volume":"30","author":"Jain","year":"1997","journal-title":"Pattern Recognit."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"145","DOI":"10.1023\/A:1011139631724","article-title":"Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope","volume":"42","author":"Oliva","year":"2001","journal-title":"Int. J. Comput. Vis."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1023\/B:VISI.0000029664.99615.94","article-title":"Distinctive Image Features from Scale-Invariant Keypoints","volume":"60","author":"Lowe","year":"2004","journal-title":"Int. J. Comput. Vis."},{"key":"ref_23","first-page":"404","article-title":"SURF: Speeded up robust features","volume":"3951","author":"Bay","year":"2006","journal-title":"Int. J. Comput. Vis."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Yang, Y., and Newsam, S. (2010, January 2\u20135). Bag-of-visual-words and spatial extensions for land-use classification. Proceedings of the 18th SIGSPATIAL International Conference on Advances in Geographic Information Systems, San Jose, CA, USA.","DOI":"10.1145\/1869790.1869829"},{"key":"ref_25","first-page":"993","article-title":"Latent dirichlet allocation","volume":"3","author":"Blei","year":"2003","journal-title":"J. Mach. Learn. Res."},{"key":"ref_26","first-page":"095064","article-title":"Scene classification based on multifeature probabilistic latent semantic analysis for high spatial resolution remote sensing images","volume":"9","author":"Zhong","year":"2015","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Bosch, A., Zisserman, A., and Mu\u00f1oz, X. (2006). Scene Classification Via pLSA, Springer.","DOI":"10.1007\/11744085_40"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Penatti, O.A.B., Nogueira, K., and Santos, J.A.D. (2015, January 7\u201312). Do deep features generalize from everyday objects to remote sensing and aerial scenes domains?. Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Boston, MA, USA.","DOI":"10.1109\/CVPRW.2015.7301382"},{"key":"ref_29","unstructured":"Castelluccio, M., Poggi, G., and Sansone, C. (2015). Verdoliva. Land Use Classification in Remote Sensing Images by Convolutional Neural Networks. arXiv."},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"539","DOI":"10.1016\/j.patcog.2016.07.001","article-title":"Towards better exploiting convolutional neural networks for remote sensing scene classification","volume":"61","author":"Nogueira","year":"2017","journal-title":"Pattern Recognit."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"2321","DOI":"10.1109\/LGRS.2015.2475299","article-title":"Deep Learning Based Feature Selection for Remote Sensing Scene Classification","volume":"12","author":"Zou","year":"2015","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"1793","DOI":"10.1109\/TGRS.2015.2488681","article-title":"Scene Classification via a Gradient Boosting Random Convolutional Network Framework","volume":"54","author":"Zhang","year":"2016","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"568","DOI":"10.3390\/rs10040568","article-title":"A Deep-Local-Global Feature Fusion Framework for High Spatial Resolution Imagery Scene Classification","volume":"10","author":"Zhu","year":"2018","journal-title":"Remote Sens."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"2183","DOI":"10.1109\/TGRS.2017.2776321","article-title":"Exploring Models and Data for Remote Sensing Image Caption Generation","volume":"56","author":"Lu","year":"2018","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Yang, Z., He, X., Gao, J., Deng, L., and Smola, A. (2016, January 27\u201330). Stacked Attention Networks for Image Question Answering. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), as Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.10"},{"key":"ref_36","unstructured":"Gregor, K., Danihelka, I., Graves, A., Rezende, D.J., and Wierstra, D.J. (2015, January 6\u201311). DRAW: A Recurrent Neural Network for Image Generation. Proceedings of the 32nd International Conference on Machine Learning (ICML), Lille, France."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Rush, A.M., Chopra, S., and Weston, J. (2015). A Neural Attention Model for Abstractive Sentence Summarization. arXiv.","DOI":"10.18653\/v1\/D15-1044"},{"key":"ref_38","unstructured":"Hermann, K.M., Ko\u010disk\u00fd, T., Grefenstette, E., Espeholt, L., Kay, W., Suleyman, M., and Blunsom, P. (2015, January 7\u201312). Teaching machines to read and comprehend. Proceedings of the 28th International Conference on Neural Information Processing Systems-Volume 1, Montreal, QC, Canada."},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Xiong, W., Lv, Y., Cui, Y., Zhang, X., and Gu, X. (2019). A Discriminative Feature Learning Approach for Remote Sensing Image Retrieval. Remote Sens., 11.","DOI":"10.3390\/rs11030281"},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"4775","DOI":"10.1109\/TGRS.2017.2700322","article-title":"Deep Feature Fusion for VHR Remote Sensing Scene Classification","volume":"55","author":"Chaib","year":"2017","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_41","unstructured":"Xia, G., Tong, X., Hu, F., Zhong, Y., Datcu, M., and Zhang, L. (2017). Exploiting Deep Features for Remote Sensing Image Retrieval: A Systematic Investigation. arXiv."},{"key":"ref_42","unstructured":"Lin, Y., Pang, Z., Wang, D., Zhuang, Y., and Recognition, P. (2017). Task-driven Visual Saliency and Attention-based Visual Question Answering. arxiv."},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Cho, K., Van Merrienboer, B., Gulcehre, C., Bahdanau, D., Bougares, F., Schwenk, H., and Bengio, Y.J. (2014, January 25\u201329). Learning Phrase Representations using RNN Encoder\u2013Decoder for Statistical Machine Translation. Proceedings of the 2014 IEEE Conference on Computation and Language 2014, Doha, Qatar.","DOI":"10.3115\/v1\/D14-1179"},{"key":"ref_44","unstructured":"Chung, J., Gulcehre, C., Cho, K., Bengio, Y., and Computing, E. (2014). Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling. arXiv."},{"key":"ref_45","unstructured":"Jozefowicz, R., Zaremba, W., and Sutskever, I. (2015, January 6\u201311). An empirical exploration of recurrent network architectures. Proceedings of the 32nd International Conference on International Conference on Machine Learning-Volume 37, Lille, France."},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"3965","DOI":"10.1109\/TGRS.2017.2685945","article-title":"AID: A Benchmark Data Set for Performance Evaluation of Aerial Scene Classification","volume":"55","author":"Xia","year":"2017","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"1865","DOI":"10.1109\/JPROC.2017.2675998","article-title":"Remote Sensing Image Scene Classification: Benchmark and State of the Art","volume":"105","author":"Cheng","year":"2017","journal-title":"Proc. IEEE"}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/11\/24\/3006\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T13:42:13Z","timestamp":1760190133000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/11\/24\/3006"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,12,13]]},"references-count":47,"journal-issue":{"issue":"24","published-online":{"date-parts":[[2019,12]]}},"alternative-id":["rs11243006"],"URL":"https:\/\/doi.org\/10.3390\/rs11243006","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2019,12,13]]}}}