{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T01:37:29Z","timestamp":1760146649049,"version":"build-2065373602"},"reference-count":52,"publisher":"MDPI AG","issue":"23","license":[{"start":{"date-parts":[[2024,11,27]],"date-time":"2024-11-27T00:00:00Z","timestamp":1732665600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>In the realm of remote sensing image object detection compression distillation, establishing an efficient method for instance feature knowledge transfer between teacher and student models holds paramount importance. To this end, this paper introduces an innovative deep structured instance graph distillation method that endeavors to delve into the underlying information between instance features, thereby optimizing detection performance. Specifically, our proposed method incorporates feature instances and their relations into a graph-based structure (SIG). In this graph, feature instances serve as the nodes, while the relations between them serve as the edges. This structure enables us to capture both the individual significance of each feature instance and their collective influence within the context. Furthermore, in the experiment, we found that the index of some dense and small-target objects did not improve much because the edge assembly generated by a large number of background feature nodes in the SIG module inhibited the loss. To address the perennial imbalance between foreground and background features, we introduce an adaptive background feature mining strategy. 
Through carefully calibrated weights, this strategy effectively extracts and integrates background information, thereby minimizing noise interference in detection results and augmenting the expressive capacity of foreground features. We achieved state-of-the-art results on both the challenging DIOR and DOTA datasets, with the two-stage Oriented RCNN-based student Resnet18 model achieving a 73.23 mAP on the DOTA benchmark, close to the teacher Resnet101\u2019s 76.16. In addition, on the DIOR dataset, the student Resnet18 based on the two-stage Faster RCNN achieved 70.13 mAP, higher than the baseline 66.31, and the student Resnet50 achieved 72.28, higher than the teacher\u2019s 72.25.<\/jats:p>","DOI":"10.3390\/rs16234443","type":"journal-article","created":{"date-parts":[[2024,11,27]],"date-time":"2024-11-27T08:17:42Z","timestamp":1732695462000},"page":"4443","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["SIGKD: A Structured Instance Graph Distillation Method for Efficient Object Detection in Remote Sensing Images"],"prefix":"10.3390","volume":"16","author":[{"given":"Fangzhou","family":"Liu","sequence":"first","affiliation":[{"name":"The Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100190, China"},{"name":"Key Laboratory of Spatial Information Processing and Application System Technology, Chinese Academy of Sciences, Beijing 100190, China"},{"name":"Key Laboratory of Target Cognition and Application Technology (TCAT), Chinese Academy of Sciences, Beijing 100190, China"},{"name":"School of Electronic, Electrical and Communication Engineering, University of Chinese Academy of Sciences, Beijing 101408, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Wenzhe","family":"Zhao","sequence":"additional","affiliation":[{"name":"The Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100190, China"},{"name":"Key 
Laboratory of Spatial Information Processing and Application System Technology, Chinese Academy of Sciences, Beijing 100190, China"},{"name":"Key Laboratory of Target Cognition and Application Technology (TCAT), Chinese Academy of Sciences, Beijing 100190, China"},{"name":"School of Electronic, Electrical and Communication Engineering, University of Chinese Academy of Sciences, Beijing 101408, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Haoxiang","family":"Qi","sequence":"additional","affiliation":[{"name":"The Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100190, China"},{"name":"Key Laboratory of Spatial Information Processing and Application System Technology, Chinese Academy of Sciences, Beijing 100190, China"},{"name":"Key Laboratory of Target Cognition and Application Technology (TCAT), Chinese Academy of Sciences, Beijing 100190, China"},{"name":"School of Electronic, Electrical and Communication Engineering, University of Chinese Academy of Sciences, Beijing 101408, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Guangyao","family":"Zhou","sequence":"additional","affiliation":[{"name":"The Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100190, China"},{"name":"Key Laboratory of Spatial Information Processing and Application System Technology, Chinese Academy of Sciences, Beijing 100190, China"},{"name":"Key Laboratory of Target Cognition and Application Technology (TCAT), Chinese Academy of Sciences, Beijing 100190, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2024,11,27]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"6154","DOI":"10.1109\/TGRS.2020.3023928","article-title":"SRAF-Net: Shape Robust Anchor-Free Network for Garbage Dumps in Remote Sensing Imagery","volume":"59","author":"Sun","year":"2021","journal-title":"IEEE Trans. Geosci. 
Remote Sens."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"3377","DOI":"10.1109\/TGRS.2019.2954328","article-title":"FMSSD: Feature-Merged Single-Shot Detection for Multiscale Objects in Large-Scale Remote Sensing Imagery","volume":"58","author":"Wang","year":"2020","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S.E., Fu, C., and Berg, A.C. (2015). SSD: Single Shot MultiBox Detector. arXiv.","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Redmon, J., Divvala, S.K., Girshick, R.B., and Farhadi, A. (2015). You Only Look Once: Unified, Real-Time Object Detection. arXiv.","DOI":"10.1109\/CVPR.2016.91"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Huang, J., Rathod, V., Sun, C., Zhu, M., Korattikara, A., Fathi, A., Fischer, I., Wojna, Z., Song, Y., and Guadarrama, S. (2016). Speed\/accuracy trade-offs for modern convolutional object detectors. arXiv.","DOI":"10.1109\/CVPR.2017.351"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Liu, L., Ouyang, W., Wang, X., Fieguth, P.W., Chen, J., Liu, X., and Pietik\u00e4inen, M. (2018). Deep Learning for Generic Object Detection: A Survey. arXiv.","DOI":"10.1007\/s11263-019-01247-4"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"104110","DOI":"10.1016\/j.earscirev.2022.104110","article-title":"A comprehensive review on deep learning based remote sensing image super-resolution methods","volume":"232","author":"Wang","year":"2022","journal-title":"Earth-Sci. Rev."},{"key":"ref_8","first-page":"102456","article-title":"A review on deep learning in UAV remote sensing","volume":"102","author":"Osco","year":"2021","journal-title":"Int. J. Appl. Earth Obs. Geoinf."},{"key":"ref_9","unstructured":"Howard, A.G., Zhu, M., Chen, B., Kalenichenko, D., Wang, W., Weyand, T., Andreetto, M., and Adam, H. (2017). 
MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications. arXiv."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Wu, Q., and Zhou, Y. (2019, January 24\u201327). Real-Time Object Detection Based on Unmanned Aerial Vehicle. Proceedings of the 2019 IEEE 8th Data Driven Control and Learning Systems Conference (DDCLS), Dali, China.","DOI":"10.1109\/DDCLS.2019.8908984"},{"key":"ref_11","unstructured":"Guo, Y., Yao, A., and Chen, Y. (2016). Dynamic Network Surgery for Efficient DNNs. arXiv."},{"key":"ref_12","unstructured":"Han, S., Pool, J., Tran, J., and Dally, W.J. (2015, January 7\u201312). Learning both Weights and Connections for Efficient Neural Network. Proceedings of the Neural Information Processing Systems, Montreal, QC, Canada."},{"key":"ref_13","unstructured":"Courbariaux, M., and Bengio, Y. (2016). BinaryNet: Training Deep Neural Networks with Weights and Activations Constrained to +1 or \u22121. arXiv."},{"key":"ref_14","unstructured":"Zhu, C., Han, S., Mao, H., and Dally, W.J. (2016). Trained Ternary Quantization. arXiv."},{"key":"ref_15","unstructured":"Hinton, G.E., Vinyals, O., and Dean, J. (2015). Distilling the Knowledge in a Neural Network. arXiv."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Zhu, J., Tang, S., Chen, D., Yu, S., Liu, Y., Yang, A., Rong, M., and Wang, X. (2021). Complementary Relation Contrastive Distillation. arXiv.","DOI":"10.1109\/CVPR46437.2021.00914"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"3048","DOI":"10.1109\/TPAMI.2021.3055564","article-title":"Knowledge Distillation and Student-Teacher Learning for Visual Intelligence: A Review and New Outlooks","volume":"44","author":"Wang","year":"2022","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_18","unstructured":"Tung, F., and Mori, G. (November, January 27). Similarity-Preserving Knowledge Distillation. 
Proceedings of the IEEE\/CVF International Conference on Computer Vision (ICCV), Seoul, South Korea."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"11","DOI":"10.1016\/j.isprsjprs.2016.03.014","article-title":"A survey on object detection in optical remote sensing images","volume":"117","author":"Cheng","year":"2016","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"166","DOI":"10.1016\/j.isprsjprs.2019.04.015","article-title":"Deep learning in remote sensing applications: A meta-analysis and review","volume":"152","author":"Ma","year":"2019","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Guo, W., Yang, W., Zhang, H., and Hua, G. (2018). Geospatial Object Detection in High Resolution Satellite Images Based on Multi-Scale Convolutional Neural Network. Remote Sens., 10.","DOI":"10.3390\/rs10010131"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Xie, X., Cheng, G., Wang, J., Yao, X., and Han, J. (2021, January 11\u201317). Oriented R-CNN for Object Detection. Proceedings of the 2021 IEEE\/CVF International Conference on Computer Vision (ICCV), Montreal, QC, Canada.","DOI":"10.1109\/ICCV48922.2021.00350"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Li, Y., Hou, Q., Zheng, Z., Cheng, M.M., Yang, J., and Li, X. (2023, January 1\u20136). Large Selective Kernel Network for Remote Sensing Object Detection. Proceedings of the 2023 IEEE\/CVF International Conference on Computer Vision (ICCV), Paris, France.","DOI":"10.1109\/ICCV51070.2023.01540"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Himeur, Y., Aburaed, N., Elharrouss, O., Varlamis, I., Atalla, S., Mansoor, W., and Ahmad, H.A. (2024). Applications of Knowledge Distillation in Remote Sensing: A Survey. 
arXiv.","DOI":"10.1016\/j.inffus.2024.102742"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Wang, T., Yuan, L., Zhang, X., and Feng, J. (2019, January 15\u201320). Distilling Object Detectors With Fine-Grained Feature Imitation. Proceedings of the 2019 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00507"},{"key":"ref_26","first-page":"1","article-title":"Adaptive Knowledge Distillation for Lightweight Remote Sensing Object Detectors Optimizing","volume":"60","author":"Yang","year":"2022","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_27","first-page":"1","article-title":"Learning Efficient and Accurate Detectors With Dynamic Knowledge Distillation in Remote Sensing Imagery","volume":"60","author":"Zhang","year":"2022","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_28","first-page":"1","article-title":"Instance-Aware Distillation for Efficient Object Detection in Remote Sensing Images","volume":"61","author":"Li","year":"2023","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"75","DOI":"10.1016\/j.inffus.2016.03.003","article-title":"A review of remote sensing image fusion methods","volume":"32","author":"Ghassemian","year":"2016","journal-title":"Inf. Fusion"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2015). Deep Residual Learning for Image Recognition. arXiv.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_31","unstructured":"Ren, S., He, K., Girshick, R., and Sun, J. (2015, January 8\u201312). Faster R-CNN: Towards real-time object detection with region proposal networks. 
Proceedings of the 28th International Conference on Neural Information Processing Systems\u2014Volume 1, NIPS\u201915, Montreal, QC, Canada."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"He, K., Gkioxari, G., Doll\u00e1r, P., and Girshick, R.B. (2017). Mask R-CNN. arXiv.","DOI":"10.1109\/ICCV.2017.322"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Lin, T., Goyal, P., Girshick, R.B., He, K., and Doll\u00e1r, P. (2017). Focal Loss for Dense Object Detection. arXiv.","DOI":"10.1109\/ICCV.2017.324"},{"key":"ref_34","unstructured":"O\u2019Shea, K., and Nash, R. (2015). An Introduction to Convolutional Neural Networks. arXiv."},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Lin, T., Doll\u00e1r, P., Girshick, R.B., He, K., Hariharan, B., and Belongie, S.J. (2016). Feature Pyramid Networks for Object Detection. arXiv.","DOI":"10.1109\/CVPR.2017.106"},{"key":"ref_36","unstructured":"Kipf, T.N., and Welling, M. (2016). Semi-Supervised Classification with Graph Convolutional Networks. arXiv."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Shrivastava, A., Gupta, A., and Girshick, R.B. (2016). Training Region-based Object Detectors with Online Hard Example Mining. arXiv.","DOI":"10.1109\/CVPR.2016.89"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Xia, G.S., Bai, X., Ding, J., Zhu, Z., Belongie, S.J., Luo, J., Datcu, M., Pelillo, M., and Zhang, L. (2017, January 18\u201323). DOTA: A Large-Scale Dataset for Object Detection in Aerial Images. Proceedings of the 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00418"},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Li, K., Wan, G., Cheng, G., Meng, L., and Han, J. (2019). Object Detection in Optical Remote Sensing Images: A Survey and A New Benchmark. 
arXiv.","DOI":"10.1016\/j.isprsjprs.2019.11.023"},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"2278","DOI":"10.1109\/5.726791","article-title":"Gradient-based learning applied to document recognition","volume":"86","author":"Lecun","year":"1998","journal-title":"Proc. IEEE"},{"key":"ref_41","unstructured":"Bridle, J.S. (March, January 27). Probabilistic Interpretation of Feedforward Classification Network Outputs, with Relationships to Statistical Pattern Recognition. Proceedings of the NATO Neurocomputing, Les Arcs, France."},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Chen, Y., Chen, P., Liu, S., Wang, L., and Jia, J. (2021, January 11\u201317). Deep Structured Instance Graph for Distilling Object Detectors. Proceedings of the IEEE\/CVF International Conference on Computer Vision (ICCV), Montreal, QC, Canada.","DOI":"10.1109\/ICCV48922.2021.00432"},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Kim, T., Oh, J., Kim, N., Cho, S., and Yun, S. (2021). Comparing Kullback-Leibler Divergence and Mean Squared Error Loss in Knowledge Distillation. arXiv.","DOI":"10.24963\/ijcai.2021\/362"},{"key":"ref_44","unstructured":"Guo, C., Pleiss, G., Sun, Y., and Weinberger, K.Q. (2017). On Calibration of Modern Neural Networks. arXiv."},{"key":"ref_45","doi-asserted-by":"crossref","first-page":"119","DOI":"10.1016\/j.isprsjprs.2014.10.002","article-title":"Multi-class geospatial object detection and geographic image classification based on collection of part detectors","volume":"98","author":"Cheng","year":"2014","journal-title":"ISPRS J. Photogramm. Remote Sens."},{"key":"ref_46","doi-asserted-by":"crossref","unstructured":"Henderson, P., and Ferrari, V. (2016). End-to-end training of object class detectors for mean average precision. 
arXiv.","DOI":"10.1007\/978-3-319-54193-8_13"},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"1627","DOI":"10.1109\/TPAMI.2009.167","article-title":"Object Detection with Discriminatively Trained Part-Based Models","volume":"32","author":"Felzenszwalb","year":"2010","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_48","unstructured":"Romero, A., Ballas, N., Kahou, S.E., Chassang, A., Gatta, C., and Bengio, Y. (2014). FitNets: Hints for Thin Deep Nets. arXiv."},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Shen, Z., and Xing, E.P. (2021). A Fast Knowledge Distillation Framework for Visual Recognition. arXiv.","DOI":"10.1007\/978-3-031-20053-3_39"},{"key":"ref_50","doi-asserted-by":"crossref","unstructured":"Guo, J., Han, K., Wang, Y., Wu, H., Chen, X., Xu, C., and Xu, C. (2021). Distilling Object Detectors via Decoupled Features. arXiv.","DOI":"10.1109\/CVPR46437.2021.00219"},{"key":"ref_51","doi-asserted-by":"crossref","unstructured":"Yang, Z., Li, Z., Jiang, X., Gong, Y., Yuan, Z., Zhao, D., and Yuan, C. (2021). Focal and Global Knowledge Distillation for Detectors. arXiv.","DOI":"10.1109\/CVPR52688.2022.00460"},{"key":"ref_52","doi-asserted-by":"crossref","unstructured":"Ding, J., Xue, N., Long, Y., Xia, G., and Lu, Q. (2018). Learning RoI Transformer for Detecting Oriented Objects in Aerial Images. 
arXiv.","DOI":"10.1109\/CVPR.2019.00296"}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/16\/23\/4443\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T16:40:37Z","timestamp":1760114437000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/16\/23\/4443"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,11,27]]},"references-count":52,"journal-issue":{"issue":"23","published-online":{"date-parts":[[2024,12]]}},"alternative-id":["rs16234443"],"URL":"https:\/\/doi.org\/10.3390\/rs16234443","relation":{},"ISSN":["2072-4292"],"issn-type":[{"type":"electronic","value":"2072-4292"}],"subject":[],"published":{"date-parts":[[2024,11,27]]}}}