{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,8]],"date-time":"2026-04-08T16:58:35Z","timestamp":1775667515108,"version":"3.50.1"},"reference-count":23,"publisher":"MDPI AG","issue":"15","license":[{"start":{"date-parts":[[2022,8,4]],"date-time":"2022-08-04T00:00:00Z","timestamp":1659571200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"National Natural Science Foundation of Jilin province of China","award":["YDZJ202201ZYTS419"],"award-info":[{"award-number":["YDZJ202201ZYTS419"]}]},{"name":"National Natural Science Foundation of Jilin province of China","award":["61903048"],"award-info":[{"award-number":["61903048"]}]},{"name":"National Natural Science Foundation of China","award":["YDZJ202201ZYTS419"],"award-info":[{"award-number":["YDZJ202201ZYTS419"]}]},{"name":"National Natural Science Foundation of China","award":["61903048"],"award-info":[{"award-number":["61903048"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>To solve the problem of low accuracy and slow speed of drone detection in high-resolution images with fixed cameras, we propose a detection method combining background difference and lightweight network SAG-YOLOv5s. First, background difference is used to extract potential drone targets in high-resolution images, eliminating most of the background to reduce computational overhead. Secondly, the Ghost module and SimAM attention mechanism are introduced on the basis of YOLOv5s to reduce the total number of model parameters and improve feature extraction, and \u03b1-DIoU loss is used to replace the original DIoU loss to improve the accuracy of bounding box regression. Finally, to verify the effectiveness of our method, a high-resolution drone dataset is made based on the public data set. Experimental results show that the detection accuracy of the proposed method reaches 97.6%, 24.3 percentage points higher than that of YOLOv5s, and the detection speed in 4K video reaches 13.2 FPS, which meets the actual demand and is significantly better than similar algorithms. It achieves a good balance between detection accuracy and detection speed and provides a method benchmark for high-resolution drone detection under a fixed camera.<\/jats:p>","DOI":"10.3390\/s22155825","type":"journal-article","created":{"date-parts":[[2022,8,5]],"date-time":"2022-08-05T02:12:39Z","timestamp":1659665559000},"page":"5825","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":34,"title":["High-Resolution Drone Detection Based on Background Difference and SAG-YOLOv5s"],"prefix":"10.3390","volume":"22","author":[{"given":"Yaowen","family":"Lv","sequence":"first","affiliation":[{"name":"College of Optoelectronic Engineering, Changchun University of Science and Technology, Changchun 130022, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zhiqing","family":"Ai","sequence":"additional","affiliation":[{"name":"College of Optoelectronic Engineering, Changchun University of Science and Technology, Changchun 130022, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Manfei","family":"Chen","sequence":"additional","affiliation":[{"name":"College of Optoelectronic Engineering, Changchun University of Science and Technology, Changchun 130022, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8585-0295","authenticated-orcid":false,"given":"Xuanrui","family":"Gong","sequence":"additional","affiliation":[{"name":"College of Optoelectronic Engineering, Changchun University of Science and Technology, Changchun 130022, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yuxuan","family":"Wang","sequence":"additional","affiliation":[{"name":"College of Optoelectronic Engineering, Changchun University of Science and Technology, Changchun 130022, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zhenghai","family":"Lu","sequence":"additional","affiliation":[{"name":"College of Optoelectronic Engineering, Changchun University of Science and Technology, Changchun 130022, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2022,8,4]]},"reference":[{"key":"ref_1","unstructured":"Yoshihashi, R., Kawakami, R., You, S., Trinh, T.T., Iida, M., and Naemura, T. (2021). Finding a Needle in a Haystack: Tiny Flying Object Detection in 4K Videos using a Joint Detection-and-Tracking Approach. arXiv."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Samaras, S., Diamantidou, E., Ataloglou, D., Sakellariou, N., Vafeiadis, A., Magoulianitis, V., Lalas, A., Dimou, A., Zarpalas, D., and Votis, K. (2019). Deep Learning on Multi Sensor Data for Counter UAV Applications\u2014A Systematic Review. Sensors, 19.","DOI":"10.3390\/s19224837"},{"key":"ref_3","unstructured":"Aker, C., and Kalkan, S. (September, January 29). Using deep networks for drone detection. Proceedings of the 14th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), Lecce, Italy."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Craye, C., and Ardjoune, S. (2019, January 18\u201321). Spatio-temporal semantic segmentation for drone detection. Proceedings of the 16th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), Taipei, Taiwan.","DOI":"10.1109\/AVSS.2019.8909854"},{"key":"ref_5","unstructured":"Sommer, L., Schumann, A., M\u00fcller, T., Schuchert, T., and Beyerer, J. (September, January 29). Flying object detection for automatic UAV recognition. Proceedings of the 14th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), Lecce, Italy."},{"key":"ref_6","unstructured":"Ren, S., He, K., Girshick, R., and Sun, J. (2015). Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. arXiv."},{"key":"ref_7","unstructured":"Duan, K., Bai, S., Xie, L., Qi, H., Huang, Q., and Tian, Q. (November, January 27). Centernet: Keypoint triplets for object detection. Proceedings of the IEEE\/CVF International Conference on Computer Vision (ICCV), Seoul, Korea."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Han, K., Wang, Y., Tian, Q., Guo, J., Xu, C., and Xu, C. (2020, January 13\u201319). Ghostnet: More features from cheap operations. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00165"},{"key":"ref_9","unstructured":"Yang, L., Zhang, R.Y., Li, L., and Xie, X. (2021, January 18\u201324). Simam: A simple, parameter-free attention module for convolutional neural networks. Proceedings of the 38th International Conference on Machine Learning, Virtual."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Zheng, Z., Wang, P., Liu, W., Li, J., Ye, R., and Ren, D. (2020, January 7\u201312). Distance-IoU loss: Faster and better learning for bounding box regression. Proceedings of the AAAI Conference on Artificial Intelligence, New York, NY, USA.","DOI":"10.1609\/aaai.v34i07.6999"},{"key":"ref_11","first-page":"20230","article-title":"Alpha-IoU: A Family of Power Intersection over Union Losses for Bounding Box Regression","volume":"34","author":"He","year":"2021","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"100204","DOI":"10.1016\/j.cosrev.2019.100204","article-title":"Background subtraction in real applications: Challenges, current models and future directions","volume":"35","author":"Bouwmans","year":"2020","journal-title":"Comput. Sci. Rev."},{"key":"ref_13","unstructured":"(2021, May 20). Ultralytics YOLOv5. Available online: https:\/\/github.com\/ultralytics\/yolov5."},{"key":"ref_14","unstructured":"Redmon, J., and Farhadi, A. (2018). YOLOv3: An Incremental Improvement. arXiv."},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Liu, S., Qi, L., Qin, H., Shi, J., and Jia, J. (2018, January 18\u201323). Path aggregation network for instance segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00913"},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"2011","DOI":"10.1109\/TPAMI.2019.2913372","article-title":"Squeeze-and-excitation networks","volume":"42","author":"Hu","year":"2020","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Woo, S., Park, J., Lee, J.Y., and Kweon, I.S. (2018, January 8\u201314). Cbam: Convolutional block attention module. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01234-2_1"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Yu, J., Jiang, Y., Wang, Z., Cao, Z., and Huang, T. (2016, January 15\u201319). Unitbox: An advanced object detection network. Proceedings of the 24th ACM International Conference on Multimedia, Amsterdam, The Netherlands.","DOI":"10.1145\/2964284.2967274"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Rezatofighi, H., Tsoi, N., Gwak, J., Sadeghian, A., Reid, I., and Savarese, S. (2019, January 15\u201320). Generalized intersection over union: A metric and a loss for bounding box regression. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00075"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Coluccia, A., Fascista, A., Schumann, A., Sommer, L., Dimou, A., Zarpalas, D., Akyon, C.F., Eryuksel, O., Ozfuttu, A.K., and Altinuc, O.S. (2021, January 16\u201319). Drone-vs-Bird Detection Challenge at IEEE AVSS2021. Proceedings of the 17th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), Washington, DC, USA.","DOI":"10.1109\/AVSS52988.2021.9663844"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Coluccia, A., Fascista, A., Schumann, A., Sommer, L., Ghenescu, M., Piatrik, T., De Cubber, G., Nalamati, M., Kapoor, A., and Saqib, M. (2019, January 18\u201321). Drone vs. Bird Detection Challenge at IEEE AVSS2019. Proceedings of the 16th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), Taipei, Taiwan.","DOI":"10.1109\/AVSS.2019.8909876"},{"key":"ref_22","unstructured":"Van Etten, A. (2018). You only look twice: Rapid multi-scale object detection in satellite imagery. arXiv."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Selvaraju, R.R., Cogswell, M., Das, A., Vedantam, R., Parikh, D., and Batra, D. (2017, January 22\u201329). Grad-cam: Visual explanations from deep networks via gradient-based localization. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.74"}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/22\/15\/5825\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T00:04:04Z","timestamp":1760141044000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/22\/15\/5825"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,8,4]]},"references-count":23,"journal-issue":{"issue":"15","published-online":{"date-parts":[[2022,8]]}},"alternative-id":["s22155825"],"URL":"https:\/\/doi.org\/10.3390\/s22155825","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,8,4]]}}}