{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,19]],"date-time":"2026-01-19T08:48:12Z","timestamp":1768812492358,"version":"3.49.0"},"reference-count":53,"publisher":"MDPI AG","issue":"18","license":[{"start":{"date-parts":[[2021,9,8]],"date-time":"2021-09-08T00:00:00Z","timestamp":1631059200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61603233"],"award-info":[{"award-number":["61603233"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["51909206"],"award-info":[{"award-number":["51909206"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Remote sensing image scene classification acts as an important task in remote sensing image applications, which benefits from the pleasing performance brought by deep convolution neural networks (CNNs). When applying deep models in this task, the challenges are, on one hand, that the targets with highly different scales may exist in the image simultaneously and the small targets could be lost in the deep feature maps of CNNs; and on the other hand, the remote sensing image data exhibits the properties of high inter-class similarity and high intra-class variance. Both factors could limit the performance of the deep models, which motivates us to develop an adaptive decision-level information fusion framework that can incorporate with any CNN backbones. Specifically, given a CNN backbone that predicts multiple classification scores based on the feature maps of different layers, we develop a pluginable importance factor generator that aims at predicting a factor for each score. The factors measure how confident the scores in different layers are with respect to the final output. Formally, the final score is obtained by a class-wise and weighted summation based on the scores and the corresponding factors. To reduce the co-adaptation effect among the scores of different layers, we propose a stochastic decision-level fusion training strategy that enables each classification score to randomly participate in the decision-level fusion. Experiments on four popular datasets including the UC Merced Land-Use dataset, the RSSCN 7 dataset, the AID dataset, and the NWPU-RESISC 45 dataset demonstrate the superiority of the proposed method over other state-of-the-art methods.<\/jats:p>","DOI":"10.3390\/rs13183579","type":"journal-article","created":{"date-parts":[[2021,9,8]],"date-time":"2021-09-08T21:28:45Z","timestamp":1631136525000},"page":"3579","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":11,"title":["Decision-Level Fusion with a Pluginable Importance Factor Generator for Remote Sensing Image Scene Classification"],"prefix":"10.3390","volume":"13","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-6563-9206","authenticated-orcid":false,"given":"Junge","family":"Shen","sequence":"first","affiliation":[{"name":"Unmanned System Research Institute, Northwestern Polytechnical University, Xi\u2019an 710072, China"}]},{"given":"Chi","family":"Zhang","sequence":"additional","affiliation":[{"name":"Unmanned System Research Institute, Northwestern Polytechnical University, Xi\u2019an 710072, China"}]},{"given":"Yu","family":"Zheng","sequence":"additional","affiliation":[{"name":"Unmanned System Research Institute, Northwestern Polytechnical University, Xi\u2019an 710072, China"},{"name":"China Academy of Launch Vehicle Technology, Beijing 100076, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2730-9409","authenticated-orcid":false,"given":"Ruxin","family":"Wang","sequence":"additional","affiliation":[{"name":"Engineering Research Center of Cyberspace, School of Software, Yunnan University, Kunming 650504, China"}]}],"member":"1968","published-online":{"date-parts":[[2021,9,8]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"1865","DOI":"10.1109\/JPROC.2017.2675998","article-title":"Remote sensing image scene classification: Benchmark and state of the art","volume":"105","author":"Cheng","year":"2017","journal-title":"Proc. IEEE"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"4928","DOI":"10.1109\/TGRS.2011.2151866","article-title":"Segment optimization and data-driven thresholding for knowledge-based landslide detection by object-based image analysis","volume":"49","author":"Martha","year":"2011","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"819","DOI":"10.14358\/PERS.75.7.819","article-title":"Forest type mapping using object-specific texture measures from multispectral Ikonos imagery","volume":"75","author":"Kim","year":"2009","journal-title":"Photogramm. Eng. Remote Sens."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"119","DOI":"10.1016\/j.isprsjprs.2014.10.002","article-title":"Multi-class geospatial object detection and geographic image classification based on collection of part detectors","volume":"98","author":"Cheng","year":"2014","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"747","DOI":"10.1109\/LGRS.2015.2513443","article-title":"Bag-of-visual-words scene classifier with local and global features for high spatial resolution remote sensing imagery","volume":"13","author":"Zhu","year":"2016","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"147","DOI":"10.1109\/LGRS.2015.2501383","article-title":"An informative feature selection method based on sparse PCA for VHR scene classification","volume":"13","author":"Chaib","year":"2015","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_7","unstructured":"Ke, Y., and Sukthankar, R. (July, January 27). PCA-SIFT: A more distinctive representation for local image descriptors. Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), Washington, DC, USA."},{"key":"ref_8","unstructured":"Dalal, N., and Triggs, B. (2005, January 20\u201325). Histograms of oriented gradients for human detection. Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), San Diego, CA, USA."},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Zhao, F., Sun, H., Liu, S., and Zhou, S. (2015, January 14). Combining low level features and visual attributes for VHR remote sensing image classification. Proceedings of the MIPPR 2015: Remote Sensing Image Processing, Geographic Information Systems, and Other Applications, International Society for Optics and Photonics, Enshi, China.","DOI":"10.1117\/12.2205566"},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"741","DOI":"10.1080\/15481603.2017.1323377","article-title":"Deep learning in remote sensing scene classification: A data augmentation enhanced convolutional neural network framework","volume":"54","author":"Yu","year":"2017","journal-title":"GISci. Remote Sens."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Zhou, W., Shao, Z., and Cheng, Q. (2016, January 4\u20136). Deep feature representations for high-resolution remote sensing scene classification. Proceedings of the 2016 IEEE International Workshop on Earth Observation and Remote Sensing Applications (EORSA), Guangzhou, China.","DOI":"10.1109\/EORSA.2016.7552825"},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Cheng, G., Ma, C., Zhou, P., Yao, X., and Han, J. (2016, January 10\u201315). Scene classification of high resolution remote sensing images using convolutional neural networks. Proceedings of the 2016 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Beijing, China.","DOI":"10.1109\/IGARSS.2016.7729193"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"156","DOI":"10.1016\/j.ins.2020.05.062","article-title":"Global context based automatic road segmentation via dilated convolutional neural network","volume":"535","author":"Lan","year":"2020","journal-title":"Inf. Sci."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"343","DOI":"10.1016\/j.neucom.2020.05.059","article-title":"SemiText: Scene text detection with semi-supervised learning","volume":"407","author":"Liu","year":"2020","journal-title":"Neurocomputing"},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"3750","DOI":"10.1109\/JSTARS.2021.3066508","article-title":"Hyperspectral Anomaly Change Detection Based on Autoencoder","volume":"14","author":"Hu","year":"2021","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"1540","DOI":"10.1109\/LGRS.2019.2902675","article-title":"A Combined Deep Learning Model for the Scene Classification of High-Resolution Remote Sensing Image","volume":"16","author":"Dong","year":"2019","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"250","DOI":"10.1016\/j.ins.2020.06.011","article-title":"Two-stream feature aggregation deep neural network for scene classification of remote sensing images","volume":"539","author":"Xu","year":"2020","journal-title":"Inf. Sci."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"7492","DOI":"10.1109\/TGRS.2019.2913816","article-title":"Robust space\u2013frequency joint representation for remote sensing image scene classification","volume":"57","author":"Fang","year":"2019","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"5194","DOI":"10.1109\/JSTARS.2020.3018307","article-title":"Branch Feature Fusion Convolution Network for Remote Sensing Scene Classification","volume":"13","author":"Shi","year":"2020","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"1894","DOI":"10.1109\/LGRS.2019.2960026","article-title":"Multilayer feature fusion network for scene classification in remote sensing","volume":"17","author":"Xu","year":"2020","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"2811","DOI":"10.1109\/TGRS.2017.2783902","article-title":"When deep learning meets metric learning: Remote sensing image scene classification via learning discriminative CNNs","volume":"56","author":"Cheng","year":"2018","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"968","DOI":"10.1109\/LGRS.2019.2938996","article-title":"Marginal center loss for deep remote sensing image scene classification","volume":"17","author":"Wei","year":"2019","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_23","unstructured":"Hinton, G.E., Srivastava, N., Krizhevsky, A., Sutskever, I., and Salakhutdinov, R.R. (2012). Improving neural networks by preventing co-adaptation of feature detectors. arXiv."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"539","DOI":"10.1016\/j.patcog.2016.07.001","article-title":"Towards better exploiting convolutional neural networks for remote sensing scene classification","volume":"61","author":"Nogueira","year":"2017","journal-title":"Pattern Recognit."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"1793","DOI":"10.1109\/TGRS.2015.2488681","article-title":"Scene classification via a gradient boosting random convolutional network framework","volume":"54","author":"Zhang","year":"2016","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Shen, J., Zhang, T., Wang, Y., Wang, R., Wang, Q., and Qi, M. (2021). A Dual-Model Architecture with Grouping-Attention-Fusion for Remote Sensing Scene Classification. Remote Sens., 13.","DOI":"10.3390\/rs13030433"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Zhang, W., Tang, P., and Zhao, L. (2019). Remote sensing image scene classification using CNN-CapsNet. Remote Sens., 11.","DOI":"10.3390\/rs11050494"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"732","DOI":"10.1109\/LGRS.2018.2880136","article-title":"Deep network ensembles for aerial scene classification","volume":"16","author":"Dede","year":"2018","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Shi, C., Zhao, X., and Wang, L. (2021). A Multi-Branch Feature Fusion Strategy Based on an Attention Mechanism for Remote Sensing Image Scene Classification. Remote Sens., 13.","DOI":"10.3390\/rs13101950"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"7894","DOI":"10.1109\/TGRS.2019.2917161","article-title":"A feature aggregation convolutional neural network for remote sensing scene classification","volume":"57","author":"Lu","year":"2019","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"82","DOI":"10.1109\/TGRS.2019.2931801","article-title":"Remote sensing scene classification by gated bidirectional network","volume":"58","author":"Sun","year":"2019","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"1647","DOI":"10.1109\/LGRS.2019.2949253","article-title":"Combining multilevel features for remote sensing image scene classification with attention model","volume":"17","author":"Ji","year":"2019","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"6372","DOI":"10.1109\/JSTARS.2020.3030257","article-title":"Hierarchical Attention and Bilinear Fusion for Remote Sensing Image Scene Classification","volume":"13","author":"Yu","year":"2020","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_34","doi-asserted-by":"crossref","first-page":"943","DOI":"10.1109\/LGRS.2019.2937811","article-title":"Positional context aggregation network for remote sensing scene classification","volume":"17","author":"Zhang","year":"2019","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Li, X., Jiang, B., Sun, T., and Wang, S. (2018, January 14\u201316). Remote sensing scene classification based on decision-level fusion. Proceedings of the 2018 IEEE Information Technology and Mechatronics Engineering Conference (ITOEC), Chongqing, China.","DOI":"10.1109\/ITOEC.2018.8740526"},{"key":"ref_36","first-page":"1","article-title":"Looking Closer at the Scene: Multiscale Representation Learning for Remote Sensing Image Scene Classification","volume":"1","author":"Wang","year":"2020","journal-title":"IEEE Trans. Neural Netw. Learn. Syst."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Yang, Y., and Newsam, S. (2010, January 2\u20135). Bag-of-visual-words and spatial extensions for land-use classification. Proceedings of the 18th SIGSPATIAL International Conference on Advances in Geographic Information Systems (ACM SIGSPATIAL), San Jose, CA, USA.","DOI":"10.1145\/1869790.1869829"},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"2321","DOI":"10.1109\/LGRS.2015.2475299","article-title":"Deep learning based feature selection for remote sensing scene classification","volume":"12","author":"Zou","year":"2015","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"3965","DOI":"10.1109\/TGRS.2017.2685945","article-title":"AID: A benchmark data set for performance evaluation of aerial scene classification","volume":"55","author":"Xia","year":"2017","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"2636","DOI":"10.1109\/JSTARS.2019.2919317","article-title":"A lightweight and discriminative model for remote sensing scene classification with multidilation pooling module","volume":"12","author":"Zhang","year":"2019","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_42","doi-asserted-by":"crossref","first-page":"1324","DOI":"10.1109\/LGRS.2019.2896411","article-title":"Domain adaptation for convolutional neural networks-based remote sensing scene classification","volume":"16","author":"Song","year":"2019","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_43","doi-asserted-by":"crossref","first-page":"3508","DOI":"10.1109\/JSTARS.2019.2934165","article-title":"Aggregated deep fisher feature for VHR remote sensing scene classification","volume":"12","author":"Li","year":"2019","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Lv, Y., Zhang, X., Xiong, W., Cui, Y., and Cai, M. (2019). An end-to-end local-global-fusion feature extraction network for remote sensing image scene classification. Remote Sens., 11.","DOI":"10.3390\/rs11243006"},{"key":"ref_45","doi-asserted-by":"crossref","first-page":"74","DOI":"10.1016\/j.isprsjprs.2018.01.023","article-title":"Binary patterns encoded convolutional neural networks for texture recognition and remote sensing scene classification","volume":"138","author":"Anwer","year":"2018","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"1200","DOI":"10.1109\/LGRS.2019.2894399","article-title":"Siamese convolutional neural networks for remote sensing scene classification","volume":"16","author":"Liu","year":"2019","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Liu, B.D., Meng, J., Xie, W.Y., Shao, S., Li, Y., and Wang, Y. (2019). Weighted spatial pyramid matching collaborative representation for remote-sensing-image scene classification. Remote Sens., 11.","DOI":"10.3390\/rs11050518"},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s13640-018-0398-z","article-title":"Remote sensing scene classification based on rotation-invariant feature learning and joint decision making","volume":"2019","author":"Zhou","year":"2019","journal-title":"EURASIP J. Image Video Process."},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Qi, K., Yang, C., Hu, C., Shen, Y., Shen, S., and Wu, H. (2021). Rotation Invariance Regularization for Remote Sensing Image Scene Classification with Convolutional Neural Networks. Remote Sens., 13.","DOI":"10.3390\/rs13040569"},{"key":"ref_50","doi-asserted-by":"crossref","first-page":"6899","DOI":"10.1109\/TGRS.2018.2845668","article-title":"Remote sensing scene classification using multilayer stacked covariance pooling","volume":"56","author":"He","year":"2018","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_51","doi-asserted-by":"crossref","first-page":"1155","DOI":"10.1109\/TGRS.2018.2864987","article-title":"Scene classification with recurrent attention of VHR remote sensing images","volume":"57","author":"Wang","year":"2018","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_52","doi-asserted-by":"crossref","unstructured":"Zeng, D., Chen, S., Chen, B., and Li, S. (2018). Improving remote sensing scene classification by integrating global-context and local-object features. Remote Sens., 10.","DOI":"10.3390\/rs10050734"},{"key":"ref_53","doi-asserted-by":"crossref","first-page":"3685","DOI":"10.1109\/TGRS.2019.2960889","article-title":"Deep multiple instance convolutional neural networks for learning robust scene representations","volume":"58","author":"Li","year":"2020","journal-title":"IEEE Trans. Geosci. Remote Sens."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/18\/3579\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T06:58:56Z","timestamp":1760165936000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/13\/18\/3579"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,9,8]]},"references-count":53,"journal-issue":{"issue":"18","published-online":{"date-parts":[[2021,9]]}},"alternative-id":["rs13183579"],"URL":"https:\/\/doi.org\/10.3390\/rs13183579","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,9,8]]}}}