{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,12]],"date-time":"2025-10-12T02:44:47Z","timestamp":1760237087302,"version":"build-2065373602"},"reference-count":45,"publisher":"MDPI AG","issue":"4","license":[{"start":{"date-parts":[[2020,2,19]],"date-time":"2020-02-19T00:00:00Z","timestamp":1582070400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61802419"],"award-info":[{"award-number":["61802419"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100012166","name":"National Key Research and Development Program of China","doi-asserted-by":"publisher","award":["2018YFB1003405"],"award-info":[{"award-number":["2018YFB1003405"]}],"id":[{"id":"10.13039\/501100012166","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Aircraft recognition has great application value, but aircraft in remote sensing images have some problems such as low resolution, poor contrasts, poor sharpness, and lack of details caused by the vertical view, which make the aircraft recognition very difficult. Especially when there are many kinds of aircraft and the differences between aircraft are subtle, the fine-grained recognition of aircraft is more challenging. In this paper, we propose a non-locally enhanced feature fusion network(NLFFNet) and attempt to make full use of the features from discriminative parts of aircraft. First, according to the long-distance self-correlation in aircraft images, we adopt non-locally enhanced operation and guide the network to pay more attention to the discriminating areas and enhance the features beneficial to classification. Second, we propose a part-level feature fusion mechanism(PFF), which crops 5 parts of the aircraft on the shared feature maps, then extracts the subtle features inside the parts through the part full connection layer(PFC) and fuses the features of these parts together through the combined full connection layer(CFC). In addition, by adopting the improved loss function, we can enhance the weight of hard examples in the loss function meanwhile reducing the weight of excessively hard examples, which improves the overall recognition ability of the network. The dataset includes 47 categories of aircraft, including many aircraft of the same family with slight differences in appearance, and our method can achieve 89.12% accuracy on the test dataset, which proves the effectiveness of our method.<\/jats:p>","DOI":"10.3390\/rs12040681","type":"journal-article","created":{"date-parts":[[2020,2,20]],"date-time":"2020-02-20T03:20:03Z","timestamp":1582168803000},"page":"681","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["Non-locally Enhanced Feature Fusion Network for Aircraft Recognition in Remote Sensing Images"],"prefix":"10.3390","volume":"12","author":[{"given":"Yunsheng","family":"Xiong","sequence":"first","affiliation":[{"name":"Key Laboratory for Parallel and Distributed Processing, Changsha 410005, China"},{"name":"College of Computer, National University of Defense Technology, Changsha 410005, China"}]},{"given":"Xin","family":"Niu","sequence":"additional","affiliation":[{"name":"Key Laboratory for Parallel and Distributed Processing, Changsha 410005, China"},{"name":"College of Computer, National University of Defense Technology, Changsha 410005, China"}]},{"given":"Yong","family":"Dou","sequence":"additional","affiliation":[{"name":"Key Laboratory for Parallel and Distributed Processing, Changsha 410005, China"},{"name":"College of Computer, National University of Defense Technology, Changsha 410005, China"}]},{"given":"Hang","family":"Qie","sequence":"additional","affiliation":[{"name":"Key Laboratory for Parallel and Distributed Processing, Changsha 410005, China"},{"name":"College of Computer, National University of Defense Technology, Changsha 410005, China"}]},{"given":"Kang","family":"Wang","sequence":"additional","affiliation":[{"name":"Key Laboratory for Parallel and Distributed Processing, Changsha 410005, China"},{"name":"College of Computer, National University of Defense Technology, Changsha 410005, China"}]}],"member":"1968","published-online":{"date-parts":[[2020,2,19]]},"reference":[{"key":"ref_1","first-page":"112","article-title":"Aircraft Recognition in High-Resolution Optical Satellite Remote Sensing Images","volume":"12","author":"Wu","year":"2014","journal-title":"IEEE Geosci. Remote. Sens. Lett."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Zhang, Y., Sun, H., Zuo, J., Wang, H., Xu, G., and Sun, X. (2018). Aircraft type recognition in remote sensing images based on feature learning with conditional generative adversarial networks. Remote. Sens., 10.","DOI":"10.3390\/rs10071123"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"012015","DOI":"10.1088\/1742-6596\/1061\/1\/012015","article-title":"Aircraft Type Recognition Based on Target Track","volume":"1061","author":"Huang","year":"2018","journal-title":"J. Phys. Conf. Ser."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Fu, K., Dai, W., Zhang, Y., Wang, Z., Yan, M., and Sun, X. (2019). Multicam: Multiple class activation mapping for aircraft recognition in remote sensing images. Remote. Sens., 11.","DOI":"10.3390\/rs11050544"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"39","DOI":"10.1109\/TC.1977.5009272","article-title":"Aircraft identification by moment invariants","volume":"100","author":"Dudani","year":"1977","journal-title":"IEEE Trans. Comput."},{"key":"ref_6","first-page":"51","article-title":"Research concerning aircraft recognition of remote sensing images based on ICA Zernike invariant moments","volume":"6","author":"Liu","year":"2011","journal-title":"Caai Trans. Intell. Syst."},{"key":"ref_7","first-page":"3771","article-title":"Plane Recognition Based on Moment Invariants and Neural Networks","volume":"5","author":"Zhang","year":"2009","journal-title":"Comput. Knowl. Technol."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Lowe, D.G., and Lowe, D. (1999, January 20\u201327). Object recognition from local scale-invariant features. Proceedings of the Seventh IEEE International Conference on Computer Vision, Kerkyra, Greece.","DOI":"10.1109\/ICCV.1999.790410"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"307","DOI":"10.1049\/ip-vis:20049020","article-title":"Aircraft type recognition in satellite images","volume":"152","author":"Hsieh","year":"2005","journal-title":"IEE Proc. Vis. Image Signal Process."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"1759","DOI":"10.1016\/j.patrec.2009.11.018","article-title":"Artificial bee colony (ABC) optimized edge potential function (EPF) approach to target recognition for low-altitude aircraft","volume":"31","author":"Xu","year":"2010","journal-title":"Pattern Recognit. Lett."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"573","DOI":"10.1109\/LGRS.2012.2214022","article-title":"Aircraft Recognition in High-Resolution Satellite Images Using Coarse-to-Fine Shape Prior","volume":"10","author":"Ge","year":"2013","journal-title":"IEEE Geosci. Remote. Sens. Lett."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"1413","DOI":"10.1109\/LGRS.2017.2715858","article-title":"Aircraft Recognition Based on Landmark Detection in Remote Sensing Images","volume":"14","author":"An","year":"2017","journal-title":"IEEE Geosci. Remote. Sens. Lett."},{"key":"ref_13","first-page":"7","article-title":"An aircraft recognition method based on principal component analysis and image model-matching","volume":"3","author":"Shao","year":"2009","journal-title":"Chin. J. Stereol. Image Anal."},{"key":"ref_14","first-page":"1097","article-title":"Imagenet classification with deep convolutional neural networks","volume":"25","author":"Krizhevsky","year":"2012","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_15","unstructured":"Simonyan, K., and Zisserman, A. (2014). Very Deep Convolutional Networks for Large-Scale Image Recognition. arXiv."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Henan, W., Dejun, L., Hongwei, W., Ying, L., and Xiaorui, S. (2012, January 23\u201325). Research on aircraft object recognition model based on neural networks. Proceedings of the 2012 International Conference on Computer Science and Electronics Engineering, Hangzhou, China.","DOI":"10.1109\/ICCSEE.2012.330"},{"key":"ref_18","unstructured":"Fang, Z., Yao, G., and Zhang, Y. (2012, January 24\u201328). Target recognition of aircraft based on moment invariants and BP neural network. Proceedings of the World Automation Congress 2012, Puerto Vallarta, Mexico."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"745","DOI":"10.1080\/2150704X.2015.1072288","article-title":"Object recognition in remote sensing images using sparse deep belief networks","volume":"6","author":"Diao","year":"2015","journal-title":"Remote. Sens. Lett."},{"key":"ref_20","first-page":"1","article-title":"Aircraft Type Recognition Based on Segmentation With Deep Convolutional Neural Networks","volume":"PP","author":"Zuo","year":"2018","journal-title":"IEEE Geosci. Remote. Sens. Lett."},{"key":"ref_21","unstructured":"Kim, H., Choi, W.C., and Kim, H. (1994, January 2\u20135). A hierarchical approach to extracting polygons based on perceptual grouping. Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, San Antonio, TX, USA."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Randall, J., Guan, L., Zhang, X., and Li, W. (2002, January 13\u201317). Hierarchical cluster model for perceptual image processing. Proceedings of the 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing, Orlando, FL, USA.","DOI":"10.1109\/ICASSP.2002.1005922"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Michaelsen, E., Doktorski, L., Soergel, U., and Stilla, U. (2007, January 11\u201313). Perceptual grouping for building recognition in high-resolution SAR images using the GESTALT-system. Proceedings of the 2007 Urban Remote Sensing Joint Event, Paris, France.","DOI":"10.1109\/URS.2007.371818"},{"key":"ref_24","unstructured":"Csurka, G., Dance, C., Fan, L., Willamowski, J., and Bray, C. (2004, January 11\u201314). Visual categorization with bags of keypoints. Proceedings of the Workshop on statistical learning in computer vision, ECCV. Prague, Prague, Slovansky Ostrov."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Batista, N.C., Lopes, A.P.B., and Ara\u00fajo, A.d.A. (2009, January 11\u201315). Detecting buildings in historical photographs using bag-of-keypoints. Proceedings of the 2009 XXII Brazilian Symposium on Computer Graphics and Image Processing, Rio De Janiero, Brazil.","DOI":"10.1109\/SIBGRAPI.2009.31"},{"key":"ref_26","unstructured":"Buades, A., Coll, B., and Morel, J.M. (2005, January 20\u201326). A non-local algorithm for image denoising. Proceedings of the IEEE Computer Society Conference on Computer Vision & Pattern Recognition, San Diego, CA, USA."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"1889","DOI":"10.1109\/JSTARS.2013.2280063","article-title":"Non-Local Sparse Unmixing for Hyperspectral Remote Sensing Imagery","volume":"7","author":"Zhong","year":"2014","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens."},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Deledalle, C.A., Tupin, F., and Denis, L. (2010, January 25\u201330). Polarimetric SAR estimation based on non-local means. Proceedings of the Geoscience & Remote Sensing Symposium, Honolulu, HI, USA.","DOI":"10.1109\/IGARSS.2010.5653936"},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"294","DOI":"10.1016\/j.rse.2003.08.005","article-title":"A multi-spectral non-local method for retrieval of boundary layer cloud properties from optical remote sensing data","volume":"88","author":"Iwabuchi","year":"2003","journal-title":"Remote. Sens. Environ."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Zhang, N., Donahue, J., Girshick, R., and Darrell, T. (2014, January 6\u201312). Part-based R-CNNs for fine-grained category detection. Proceedings of the European Conference on Computer Vision, Zurich, Switzerland.","DOI":"10.1007\/978-3-319-10590-1_54"},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"154","DOI":"10.1007\/s11263-013-0620-5","article-title":"Selective Search for Object Recognition","volume":"104","author":"Uijlings","year":"2013","journal-title":"Int. J. Comput. Vis."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Girshick, R., Donahue, J., Darrell, T., and Malik, J. (2014, January 24\u201327). Rich feature hierarchies for accurate object detection and semantic segmentation. Proceedings of the IEEE conference on computer vision and pattern recognition, Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.81"},{"key":"ref_33","unstructured":"Wah, C., Branson, S., Welinder, P., Perona, P., and Belongie, S. (2016, December 13). The Caltech-Ucsd Birds-200-2011 Dataset. Available online: http:\/\/www.vision.caltech.edu\/visipedia\/CUB-200-2011.html."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Huang, S., Xu, Z., Tao, D., and Zhang, Y. (2015, December 26). Part-Stacked CNN for Fine-Grained Visual Categorization. Computer Vision & Pattern Recognition. Available online: https:\/\/arxiv.org\/abs\/1512.08086.","DOI":"10.1109\/CVPR.2016.132"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Zhou, B., Khosla, A., Lapedriza, A., Oliva, A., and Torralba, A. (2016, January 27\u201330). Learning deep features for discriminative localization. Proceedings of the IEEE conference on computer vision and pattern recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.319"},{"key":"ref_36","unstructured":"Peng, Y., He, X., and Zhao, J. (2017). Object-Part Attention Model for Fine-grained Image Classification. IEEE Trans. Image Process., 1."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Wang, X., Girshick, R., Gupta, A., and He, K. (2018, January 18\u201322). Non-local neural networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00813"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Li, G., He, X., Zhang, W., Chang, H., Dong, L., and Lin, L. (2018). Non-locally enhanced encoder-decoder network for single image de-raining. arXiv.","DOI":"10.1145\/3240508.3240636"},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Xiao, B., Wu, H., and Wei, Y. (2018, January 8\u201314). Simple baselines for human pose estimation and tracking. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01231-1_29"},{"key":"ref_40","unstructured":"Li, B., Liu, Y., and Wang, X. (2019, January 27\u201331). Gradient harmonized single-stage detector. Proceedings of the AAAI Conference on Artificial Intelligence, Honolulu, HI, USA."},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Xia, G.S., Bai, X., Ding, J., Zhu, Z., Belongie, S., Luo, J., Datcu, M., Pelillo, M., and Zhang, L. (2018, January 18\u201322). DOTA: A large-scale dataset for object detection in aerial images. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00418"},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"3965","DOI":"10.1109\/TGRS.2017.2685945","article-title":"AID: A Benchmark Data Set for Performance Evaluation of Aerial Scene Classification","volume":"55","author":"Xia","year":"2017","journal-title":"IEEE Trans. Geosci. Remote. Sens."},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"157","DOI":"10.1007\/s11263-007-0090-8","article-title":"LabelMe: A database and web-based tool for image annotation","volume":"77","author":"Russell","year":"2008","journal-title":"Int. J. Comput. Vis."},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Smith, L.N. (2017, January 24\u201331). Cyclical learning rates for training neural networks. Proceedings of the 2017 IEEE Winter Conference on Applications of Computer Vision (WACV), Santa Rosa, CA, USA.","DOI":"10.1109\/WACV.2017.58"},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Selvaraju, R.R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., and Batra, D. (2017, January 22\u201329). Grad-cam: Visual explanations from deep networks via gradient-based localization. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.74"}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/12\/4\/681\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T08:59:09Z","timestamp":1760173149000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/12\/4\/681"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,2,19]]},"references-count":45,"journal-issue":{"issue":"4","published-online":{"date-parts":[[2020,2]]}},"alternative-id":["rs12040681"],"URL":"https:\/\/doi.org\/10.3390\/rs12040681","relation":{},"ISSN":["2072-4292"],"issn-type":[{"type":"electronic","value":"2072-4292"}],"subject":[],"published":{"date-parts":[[2020,2,19]]}}}