{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,1]],"date-time":"2026-06-01T17:16:03Z","timestamp":1780334163656,"version":"3.54.1"},"reference-count":83,"publisher":"MDPI AG","issue":"13","license":[{"start":{"date-parts":[[2022,7,5]],"date-time":"2022-07-05T00:00:00Z","timestamp":1656979200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Object recognition, as one of the most fundamental and challenging problems in high-resolution remote sensing image interpretation, has received increasing attention in recent years. However, most conventional object recognition pipelines aim to recognize instances with bounding boxes in a supervised learning strategy, which require intensive and manual labor for instance annotation creation. In this paper, we propose a weakly supervised learning method to alleviate this problem. The core idea of our method is to recognize multiple objects in an image using only image-level semantic labels and indicate the recognized objects with location points instead of box extent. Specifically, a deep convolutional neural network is first trained to perform semantic scene classification, of which the result is employed for the categorical determination of objects in an image. Then, by back-propagating the categorical feature from the fully connected layer to the deep convolutional layer, the categorical and spatial information of an image are combined to obtain an object discriminative localization map, which can effectively indicate the salient regions of objects. Next, a dynamic updating method of local response extremum is proposed to further determine the locations of objects in an image. Finally, extensive experiments are conducted to localize aircraft and oiltanks in remote sensing images based on different convolutional neural networks. Experimental results show that the proposed method outperforms the-state-of-the-art methods, achieving the precision, recall, and F1-score at 94.50%, 88.79%, and 91.56% for aircraft localization and 89.12%, 83.04%, and 85.97% for oiltank localization, respectively. We hope that our work could serve as a basic reference for remote sensing object localization via a weakly supervised strategy and provide new opportunities for further research.<\/jats:p>","DOI":"10.3390\/rs14133230","type":"journal-article","created":{"date-parts":[[2022,7,6]],"date-time":"2022-07-06T21:15:52Z","timestamp":1657142152000},"page":"3230","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":5,"title":["Object Localization in Weakly Labeled Remote Sensing Images Based on Deep Convolutional Features"],"prefix":"10.3390","volume":"14","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-6281-6043","authenticated-orcid":false,"given":"Yang","family":"Long","sequence":"first","affiliation":[{"name":"Guangzhou Urban Planning & Survey Design Research Institute, Guangzhou 510060, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Xiaofang","family":"Zhai","sequence":"additional","affiliation":[{"name":"College of Urban and Environment Sciences, Hubei Normal University, Huangshi 435000, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Qiao","family":"Wan","sequence":"additional","affiliation":[{"name":"State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan 430079, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Xiaowei","family":"Tan","sequence":"additional","affiliation":[{"name":"State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan 430079, China"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2022,7,5]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"22","DOI":"10.1016\/j.isprsjprs.2015.10.004","article-title":"Remote sensing platforms and sensors: A survey","volume":"115","author":"Toth","year":"2016","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"29","DOI":"10.1109\/MGRS.2019.2918840","article-title":"Mini-Unmanned Aerial Vehicle-Based Remote Sensing: Techniques, applications, and prospects","volume":"7","author":"Xiang","year":"2019","journal-title":"IEEE Geosci. Remote Sens. Mag."},{"key":"ref_3","unstructured":"Zou, Z., Shi, Z., Guo, Y., and Ye, J. (2019). Object detection in 20 years: A survey. arXiv."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"34","DOI":"10.1109\/TGRS.2019.2930246","article-title":"Context-Aware Convolutional Neural Network for Object Detection in VHR Remote Sensing Imagery","volume":"58","author":"Gong","year":"2020","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"296","DOI":"10.1016\/j.isprsjprs.2019.11.023","article-title":"Object detection in optical remote sensing images: A survey and a new benchmark","volume":"159","author":"Li","year":"2020","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Ding, J., Xue, N., Xia, G.S., Bai, X., Yang, W., Yang, M.Y., Belongie, S., Luo, J., Datcu, M., and Pelillo, M. (2022). Object detection in aerial images: A large-scale benchmark and challenges. IEEE Trans. Pattern Anal. Mach. Intell., 1\u201318.","DOI":"10.1109\/TPAMI.2021.3117983"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"116","DOI":"10.1016\/j.isprsjprs.2021.12.004","article-title":"FAIR1M: A benchmark dataset for fine-grained object recognition in high-resolution remote sensing imagery","volume":"184","author":"Sun","year":"2022","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Hoeser, T., and Kuenzer, C. (2020). Object Detection and Image Segmentation with Deep Learning on Earth Observation Data: A Review\u2014Part I: Evolution and Recent Trends. Remote Sens., 12.","DOI":"10.3390\/rs12101667"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Hoeser, T., Bachofer, F., and Kuenzer, C. (2020). Object Detection and Image Segmentation with Deep Learning on Earth Observation Data: A Review\u2014Part II: Applications. Remote Sens., 12.","DOI":"10.3390\/rs12183053"},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1016\/j.isprsjprs.2016.03.014","article-title":"A survey on object detection in optical remote sensing images","volume":"117","author":"Cheng","year":"2016","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"2486","DOI":"10.1109\/TGRS.2016.2645610","article-title":"Accurate Object Localization in Remote Sensing Images Based on Convolutional Neural Networks","volume":"55","author":"Long","year":"2017","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Xia, G.S., Bai, X., Ding, J., Zhu, Z., Belongie, S., Luo, J., Datcu, M., Pelillo, M., and Zhang, L. (2018, January 18\u201323). DOTA: A large-scale dataset for object detection in aerial images. Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00418"},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"39","DOI":"10.1016\/j.neucom.2020.01.085","article-title":"Recent advances in deep learning for object detection","volume":"396","author":"Wu","year":"2020","journal-title":"Neurocomputing"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Wang, H., Li, H., Qian, W., Diao, W., Zhao, L., Zhang, J., and Zhang, D. (2021). Dynamic pseudo-label generation for weakly supervised object detection in remote sensing images. Remote Sens., 13.","DOI":"10.3390\/rs13081461"},{"key":"ref_15","first-page":"1","article-title":"Multipatch Feature Pyramid Network for Weakly Supervised Object Detection in Optical Remote Sensing Images","volume":"60","author":"Shamsolmoali","year":"2022","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Guo, G., Han, J., Wan, F., and Zhang, D. (2021, January 20\u201325). Strengthen learning tolerance for weakly supervised object localization. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.00732"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Zhang, D., Han, J., Cheng, G., and Yang, M.H. (2021). Weakly Supervised Object Localization and Detection: A Survey. IEEE Trans. Pattern Anal. Mach. Intell., 1\u201318.","DOI":"10.1109\/TPAMI.2021.3074313"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"192","DOI":"10.1016\/j.neucom.2022.01.095","article-title":"Deep Learning for Weakly-Supervised Object Detection and Localization: A Survey","volume":"496","author":"Shao","year":"2022","journal-title":"Neurocomputing"},{"key":"ref_19","unstructured":"Waqas Zamir, S., Arora, A., Gupta, A., Khan, S., Sun, G., Shahbaz Khan, F., Zhu, F., Shao, L., Xia, G.S., and Bai, X. (2019, January 16\u201320). iSAID: A Large-scale Dataset for Instance Segmentation in Aerial Images. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Long Beach, CA, USA."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Oquab, M., Bottou, L., Laptev, I., and Sivic, J. (2015, January 7\u201312). Is object localization for free?\u2014Weakly-supervised learning with convolutional neural networks. Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298668"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"1141","DOI":"10.1007\/s11263-019-01266-1","article-title":"The unmanned aerial vehicle benchmark: Object detection, tracking and baseline","volume":"128","author":"Yu","year":"2020","journal-title":"Int. J. Comput. Vis."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Zhang, T., Zhang, X., Li, J., Xu, X., Wang, B., Zhan, X., Xu, Y., Ke, X., Zeng, T., and Su, H. (2021). Sar ship detection dataset (ssdd): Official release and comprehensive data analysis. Remote Sens., 13.","DOI":"10.3390\/rs13183690"},{"key":"ref_23","first-page":"1","article-title":"Faster r-cnn: Towards real-time object detection with region proposal networks","volume":"28","author":"Ren","year":"2015","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Ding, J., Xue, N., Long, Y., Xia, G.S., and Lu, Q. (2019, January 15\u201320). Learning roi transformer for oriented object detection in aerial images. Proceedings of the 2019 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00296"},{"key":"ref_25","unstructured":"Dalal, N., and Triggs, B. (2005, January 20\u201325). Histograms of oriented gradients for human detection. Proceedings of the 2005 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), San Diego, CA, USA."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1023\/B:VISI.0000029664.99615.94","article-title":"Distinctive Image Features from Scale-Invariant Keypoints","volume":"60","author":"Lowe","year":"2004","journal-title":"Int. J. Comput. Vis."},{"key":"ref_27","unstructured":"Li, F., and Perona, P. (2005, January 20\u201325). A Bayesian Hierarchical Model for Learning Natural Scene Categories. Proceedings of the 2005 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), San Diego, CA, USA."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"417","DOI":"10.1016\/j.patcog.2016.10.033","article-title":"Weakly supervised vehicle detection in satellite images via multi-instance discriminative learning","volume":"64","author":"Cao","year":"2017","journal-title":"Pattern Recognit."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Tang, Y., Wang, X., Dellandrea, E., Masnou, S., and Chen, L. (2014, January 27\u201330). Fusing generic objectness and deformable part-based models for weakly supervised object detection. Proceedings of the 2014 IEEE International Conference on Image Processing (ICIP), Paris, France.","DOI":"10.1109\/ICIP.2014.7025827"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Wang, W., Wang, Y., Chen, F., and Sowmya, A. (2013, January 15\u201317). A weakly supervised approach for object detection based on soft-label boosting. Proceedings of the 2013 IEEE Workshop on Applications of Computer Vision (WACV), Clearwater Beach, FL, USA.","DOI":"10.1109\/WACV.2013.6475037"},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"275","DOI":"10.1007\/s11263-012-0538-3","article-title":"Weakly supervised localization and learning with generic knowledge","volume":"100","author":"Deselaers","year":"2012","journal-title":"Int. J. Comput. Vis."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Shi, Z., Hospedales, T.M., and Xiang, T. (2013, January 1\u20138). Bayesian joint topic modelling for weakly supervised object localisation. Proceedings of the 2013 IEEE International Conference on Computer Vision (ICCV), Sydney, Australia.","DOI":"10.1109\/ICCV.2013.371"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Sikka, K., Dhall, A., and Bartlett, M. (2013, January 22\u201326). Weakly supervised pain localization using multiple instance learning. Proceedings of the 2013 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG), Shanghai, China.","DOI":"10.1109\/FG.2013.6553762"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Siva, P., Russell, C., and Xiang, T. (2012, January 7\u201313). In defence of negative mining for annotating weakly labelled data. Proceedings of the European Conference on Computer Vision (ECCV), Florence, Italy.","DOI":"10.1007\/978-3-642-33712-3_43"},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"1313","DOI":"10.1109\/TCYB.2017.2647965","article-title":"Instance annotation via optimal bow for weakly supervised object localization","volume":"47","author":"Wang","year":"2017","journal-title":"IEEE Trans. Cybern."},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"1959","DOI":"10.1109\/TPAMI.2015.2392769","article-title":"Bayesian joint modelling for object localisation in weakly labelled images","volume":"37","author":"Shi","year":"2015","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"1523","DOI":"10.1016\/j.patcog.2013.09.028","article-title":"Learning discriminative localization from weakly labeled data","volume":"47","author":"Hoai","year":"2014","journal-title":"Pattern Recognit."},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Gokberk Cinbis, R., Verbeek, J., and Schmid, C. (2014, January 23\u201328). Multi-fold mil training for weakly supervised object localization. Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.309"},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"3325","DOI":"10.1109\/TGRS.2014.2374218","article-title":"Object Detection in Optical Remote Sensing Images Based on Weakly Supervised Learning and High-Level Feature Learning","volume":"53","author":"Han","year":"2015","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Zhou, P., Zhang, D., Cheng, G., and Han, J. (2015, January 20\u201322). Negative Bootstrapping for Weakly Supervised Target Detection in Remote Sensing Images. Proceedings of the 2015 IEEE International Conference on Multimedia Big Data (BigMM), Beijing, China.","DOI":"10.1109\/BigMM.2015.13"},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"701","DOI":"10.1109\/LGRS.2014.2358994","article-title":"Weakly Supervised Learning for Target Detection in Remote Sensing Images","volume":"12","author":"Zhang","year":"2015","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_42","unstructured":"Krizhevsky, A., Sutskever, I., and Hinton, G.E. (2012, January 3\u20136). Imagenet classification with deep convolutional neural networks. Proceedings of the Advances in Neural Information Processing Systems (NeurIPS), Lake Tahoe, NV, USA."},{"key":"ref_43","unstructured":"Simonyan, K., and Zisserman, A. (2014). Very deep convolutional networks for large-scale image recognition. arXiv."},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., and Rabinovich, A. (2015, January 7\u201312). Going deeper with convolutions. Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep Residual Learning for Image Recognition. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"Huang, G., Liu, Z., Van Der Maaten, L., and Weinberger, K.Q. (2017, January 21\u201326). Densely connected convolutional networks. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.243"},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Hu, J., Shen, L., and Sun, G. (2018, January 18\u201323). Squeeze-and-excitation networks. Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00745"},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Shen, Y., Ji, R., Zhang, S., Zuo, W., and Wang, Y. (2018, January 18\u201323). Generative adversarial learning towards fast weakly supervised detection. Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00604"},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Tang, P., Wang, X., Wang, A., Yan, Y., Liu, W., Huang, J., and Yuille, A. (2018, January 8\u201314). Weakly supervised region proposal network and object detection. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01252-6_22"},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Shen, Y., Ji, R., Wang, Y., Wu, Y., and Cao, L. (2019, January 15\u201320). Cyclic guidance for weakly supervised joint detection and segmentation. Proceedings of the 2019 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00079"},{"key":"ref_51","doi-asserted-by":"crossref","unstructured":"Li, X., Kan, M., Shan, S., and Chen, X. (2019, January 15\u201320). Weakly supervised object detection with segmentation collaboration. Proceedings of the 2019 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/ICCV.2019.00983"},{"key":"ref_52","unstructured":"Gao, Y., Liu, B., Guo, N., Ye, X., Wan, F., You, H., and Fan, D. (November, January 27). C-midn: Coupled multiple instance detection network with segmentation guidance for weakly supervised object detection. Proceedings of the 2019 IEEE International Conference on Computer Vision (ICCV), Seoul, Korea."},{"key":"ref_53","doi-asserted-by":"crossref","unstructured":"Chen, Z., Fu, Z., Jiang, R., Chen, Y., and Hua, X.S. (2020, January 13\u201319). Slv: Spatial likelihood voting for weakly supervised object detection. Proceedings of the 2020 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01301"},{"key":"ref_54","doi-asserted-by":"crossref","unstructured":"Durand, T., Thome, N., and Cord, M. (2016, January 27\u201330). Weldon: Weakly supervised learning of deep convolutional neural networks. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.513"},{"key":"ref_55","doi-asserted-by":"crossref","unstructured":"Zhu, Y., Zhou, Y., Ye, Q., Qiu, Q., and Jiao, J. (2017, January 22\u201329). Soft proposal networks for weakly supervised object localization. Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy.","DOI":"10.1109\/ICCV.2017.204"},{"key":"ref_56","doi-asserted-by":"crossref","unstructured":"Bilen, H., and Vedaldi, A. (2016, January 27\u201330). Weakly supervised deep detection networks. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.311"},{"key":"ref_57","doi-asserted-by":"crossref","first-page":"154","DOI":"10.1007\/s11263-013-0620-5","article-title":"Selective search for object recognition","volume":"104","author":"Uijlings","year":"2013","journal-title":"Int. J. Comput. Vis."},{"key":"ref_58","doi-asserted-by":"crossref","unstructured":"Zitnick, C.L., and Doll\u00e1r, P. (2014, January 6\u201312). Edge boxes: Locating object proposals from edges. Proceedings of the European Conference on Computer Vision (ECCV), Zurich, Switzerland.","DOI":"10.1007\/978-3-319-10602-1_26"},{"key":"ref_59","doi-asserted-by":"crossref","first-page":"5553","DOI":"10.1109\/TGRS.2016.2569141","article-title":"Weakly Supervised Learning Based on Coupled Convolutional Neural Networks for Aircraft Detection","volume":"54","author":"Zhang","year":"2016","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_60","doi-asserted-by":"crossref","unstructured":"Zhou, B., Khosla, A., Lapedriza, A., Oliva, A., and Torralba, A. (2016, January 27\u201330). Learning deep features for discriminative localization. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.319"},{"key":"ref_61","doi-asserted-by":"crossref","unstructured":"Selvaraju, R.R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., and Batra, D. (2017, January 22\u201329). Grad-CAM: Visual explanations from deep networks via gradient-based localization. Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy.","DOI":"10.1109\/ICCV.2017.74"},{"key":"ref_62","doi-asserted-by":"crossref","unstructured":"Xiao, Z., Long, Y., Li, D., Wei, C., Tang, G., and Liu, J. (2017). High-resolution remote sensing image retrieval based on CNNs from a dimensional perspective. Remote Sens., 17.","DOI":"10.3390\/rs9070725"},{"key":"ref_63","doi-asserted-by":"crossref","unstructured":"Zhang, X., Wei, Y., Feng, J., Yang, Y., and Huang, T.S. (2018, January 18\u201323). Adversarial complementary learning for weakly supervised object localization. Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00144"},{"key":"ref_64","doi-asserted-by":"crossref","unstructured":"Zhang, X., Wei, Y., Kang, G., Yang, Y., and Huang, T. (2018, January 8\u201314). Self-produced guidance for weakly-supervised object localization. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01258-8_37"},{"key":"ref_65","doi-asserted-by":"crossref","unstructured":"Choe, J., and Shim, H. (2019, January 15\u201320). Attention-based dropout layer for weakly supervised object localization. Proceedings of the 2019 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00232"},{"key":"ref_66","doi-asserted-by":"crossref","unstructured":"Xue, H., Liu, C., Wan, F., Jiao, J., Ji, X., and Ye, Q. (November, January 27). Danet: Divergent activation for weakly supervised object localization. Proceedings of the 2019 IEEE International Conference on Computer Vision (ICCV), Seoul, Korea.","DOI":"10.1109\/ICCV.2019.00669"},{"key":"ref_67","doi-asserted-by":"crossref","unstructured":"Yang, S., Kim, Y., Kim, Y., and Kim, C. (2020, January 1\u20135). Combinational class activation maps for weakly supervised object localization. Proceedings of the IEEE Winter Conference on Applications of Computer Vision (WACV), Snowmass Village, CO, USA.","DOI":"10.1109\/WACV45572.2020.9093566"},{"key":"ref_68","doi-asserted-by":"crossref","unstructured":"Mai, J., Yang, M., and Luo, W. (2020, January 13\u201319). Erasing integrated learning: A simple yet effective approach for weakly supervised object localization. Proceedings of the 2020 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00879"},{"key":"ref_69","doi-asserted-by":"crossref","unstructured":"Chattopadhay, A., Sarkar, A., Howlader, P., and Balasubramanian, V.N. (2018, January 12\u201315). Grad-cam++: Generalized gradient-based visual explanations for deep convolutional networks. Proceedings of the IEEE Winter Conference on Applications of Computer Vision (WACV), Lake Tahoe, NV, USA.","DOI":"10.1109\/WACV.2018.00097"},{"key":"ref_70","unstructured":"Ramaswamy, H.G. (2020, January 1\u20135). Ablation-cam: Visual explanations for deep convolutional network via gradient-free localization. Proceedings of the IEEE Winter Conference on Applications of Computer Vision (WACV), Snowmass Village, CO, USA."},{"key":"ref_71","doi-asserted-by":"crossref","unstructured":"Wang, H., Wang, Z., Du, M., Yang, F., Zhang, Z., Ding, S., Mardziel, P., and Hu, X. (2020, January 14\u201319). Score-CAM: Score-weighted visual explanations for convolutional neural networks. Proceedings of the 2020 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Seattle, WA, USA.","DOI":"10.1109\/CVPRW50498.2020.00020"},{"key":"ref_72","doi-asserted-by":"crossref","unstructured":"Diba, A., Sharma, V., Pazandeh, A., Pirsiavash, H., and Van Gool, L. (2017, January 21\u201326). Weakly supervised cascaded convolutional networks. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.545"},{"key":"ref_73","doi-asserted-by":"crossref","unstructured":"Wei, Y., Shen, Z., Cheng, B., Shi, H., Xiong, J., Feng, J., and Huang, T. (2018, January 8\u201314). Ts2c: Tight box mining with surrounding segmentation context for weakly supervised object detection. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01252-6_27"},{"key":"ref_74","unstructured":"Shwartz-Ziv, R., and Tishby, N. (2017). Opening the black box of deep neural networks via information. arXiv."},{"key":"ref_75","doi-asserted-by":"crossref","first-page":"3965","DOI":"10.1109\/TGRS.2017.2685945","article-title":"AID: A benchmark data set for performance evaluation of aerial scene classification","volume":"55","author":"Xia","year":"2017","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_76","doi-asserted-by":"crossref","first-page":"1865","DOI":"10.1109\/JPROC.2017.2675998","article-title":"Remote sensing image scene classification: Benchmark and state of the art","volume":"105","author":"Cheng","year":"2017","journal-title":"Proc. IEEE"},{"key":"ref_77","doi-asserted-by":"crossref","first-page":"3735","DOI":"10.1109\/JSTARS.2020.3005403","article-title":"Remote Sensing Image Scene Classification Meets Deep Learning: Challenges, Methods, Benchmarks, and Opportunities","volume":"13","author":"Cheng","year":"2020","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_78","unstructured":"Maas, A.L., Hannun, A.Y., and Ng, A.Y. (2013, January 16\u201321). Rectifier nonlinearities improve neural network acoustic models. Proceedings of the 2013 International Conference on Machine Learning (ICML), Atlanta, GA, USA."},{"key":"ref_79","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2015, January 7\u201313). Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification. Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, Chile.","DOI":"10.1109\/ICCV.2015.123"},{"key":"ref_80","doi-asserted-by":"crossref","unstructured":"Murray, N., and Perronnin, F. (2014, January 23\u201328). Generalized max pooling. Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.317"},{"key":"ref_81","doi-asserted-by":"crossref","first-page":"5875","DOI":"10.1109\/TIP.2021.3089943","article-title":"LayerCAM: Exploring hierarchical class activation maps for localization","volume":"30","author":"Jiang","year":"2021","journal-title":"IEEE Trans. Image Process."},{"key":"ref_82","unstructured":"Fu, R., Hu, Q., Dong, X., Guo, Y., Gao, Y., and Li, B. (2020). Axiom-based grad-cam: Towards accurate visualization and explanation of cnns. arXiv."},{"key":"ref_83","unstructured":"Naidu, R., Ghosh, A., Maurya, Y., and Kundu, S.S. (2020). IS-CAM: Integrated Score-CAM for axiomatic-based explanations. arXiv."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/14\/13\/3230\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T23:42:58Z","timestamp":1760139778000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/14\/13\/3230"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,7,5]]},"references-count":83,"journal-issue":{"issue":"13","published-online":{"date-parts":[[2022,7]]}},"alternative-id":["rs14133230"],"URL":"https:\/\/doi.org\/10.3390\/rs14133230","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,7,5]]}}}