{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,7,18]],"date-time":"2026-07-18T22:19:47Z","timestamp":1784413187829,"version":"3.55.0"},"reference-count":60,"publisher":"MDPI AG","issue":"16","license":[{"start":{"date-parts":[[2024,8,20]],"date-time":"2024-08-20T00:00:00Z","timestamp":1724112000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100012166","name":"National Key Research and Development Program of China","doi-asserted-by":"publisher","award":["2021YFB1407005"],"award-info":[{"award-number":["2021YFB1407005"]}],"id":[{"id":"10.13039\/501100012166","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100012166","name":"National Key Research and Development Program of China","doi-asserted-by":"publisher","award":["2021SFGC0401"],"award-info":[{"award-number":["2021SFGC0401"]}],"id":[{"id":"10.13039\/501100012166","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100014103","name":"Key Technology Research and Development Program of Shandong Province","doi-asserted-by":"publisher","award":["2021YFB1407005"],"award-info":[{"award-number":["2021YFB1407005"]}],"id":[{"id":"10.13039\/100014103","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100014103","name":"Key Technology Research and Development Program of Shandong Province","doi-asserted-by":"publisher","award":["2021SFGC0401"],"award-info":[{"award-number":["2021SFGC0401"]}],"id":[{"id":"10.13039\/100014103","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>The rapid development of unmanned aerial vehicle (UAV) technology has contributed to the increasing sophistication of UAV-based object-detection systems, which are now extensively utilized in civilian and military sectors. However, object detection from UAV images has numerous challenges, including significant variations in the object size, changing spatial configurations, and cluttered backgrounds with multiple interfering elements. To address these challenges, we propose SOD-YOLO, an innovative model based on the YOLOv8 model, to detect small objects in UAV images. The model integrates the receptive field convolutional block attention module (RFCBAM) in the backbone network to perform downsampling, improving feature extraction efficiency and mitigating the spatial information sparsity caused by downsampling. Additionally, we developed a novel neck architecture called the balanced spatial and semantic information fusion pyramid network (BSSI-FPN) designed for multi-scale feature fusion. The BSSI-FPN effectively balances spatial and semantic information across feature maps using three primary strategies: fully utilizing large-scale features, increasing the frequency of multi-scale feature fusion, and implementing dynamic upsampling. The experimental results on the VisDrone2019 dataset demonstrate that SOD-YOLO-s improves the mAP50 indicator by 3% compared to YOLOv8s while reducing the number of parameters and computational complexity by 84.2% and 30%, respectively. Compared to YOLOv8l, SOD-YOLO-l improves the mAP50 indicator by 7.7% and reduces the number of parameters by 59.6%. Compared to other existing methods, SODA-YOLO-l achieves the highest detection accuracy, demonstrating the superiority of the proposed method.<\/jats:p>","DOI":"10.3390\/rs16163057","type":"journal-article","created":{"date-parts":[[2024,8,20]],"date-time":"2024-08-20T06:07:35Z","timestamp":1724134055000},"page":"3057","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":168,"title":["SOD-YOLO: Small-Object-Detection Algorithm Based on Improved YOLOv8 for UAV Images"],"prefix":"10.3390","volume":"16","author":[{"given":"Yangang","family":"Li","sequence":"first","affiliation":[{"name":"Qilu Aerospace Information Research Institute, Jinan 250132, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Qi","family":"Li","sequence":"additional","affiliation":[{"name":"Qilu Aerospace Information Research Institute, Jinan 250132, China"},{"name":"Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100190, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jie","family":"Pan","sequence":"additional","affiliation":[{"name":"Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100190, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Ying","family":"Zhou","sequence":"additional","affiliation":[{"name":"Qilu Aerospace Information Research Institute, Jinan 250132, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Hongliang","family":"Zhu","sequence":"additional","affiliation":[{"name":"Qilu Aerospace Information Research Institute, Jinan 250132, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Hongwei","family":"Wei","sequence":"additional","affiliation":[{"name":"Qilu Aerospace Information Research Institute, Jinan 250132, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Chong","family":"Liu","sequence":"additional","affiliation":[{"name":"Qilu Aerospace Information Research Institute, Jinan 250132, China"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2024,8,20]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Chang, Y.C., Chen, H.T., Chuang, J.H., and Liao, I.C. (2018, January 7\u201310). Pedestrian detection in aerial images using vanishing point transformation and deep learning. Proceedings of the 2018 25th IEEE International Conference on Image Processing (ICIP), Athens, Greece.","DOI":"10.1109\/ICIP.2018.8451144"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"1256","DOI":"10.1007\/s11263-019-01177-1","article-title":"Deep learning approach in aerial imagery for supporting land search and rescue missions","volume":"127","author":"Gotovac","year":"2019","journal-title":"Int. J. Comput. Vis."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Byun, S., Shin, I.K., Moon, J., Kang, J., and Choi, S.I. (2021). Road traffic monitoring from UAV images using deep learning networks. Remote Sens., 13.","DOI":"10.3390\/rs13204027"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Muhmad Kamarulzaman, A.M., Wan Mohd Jaafar, W.S., Mohd Said, M.N., Saad, S.N.M., and Mohan, M. (2023). UAV Implementations in Urban Planning and Related Sectors of Rapidly Developing Nations: A Review and Future Perspectives for Malaysia. Remote Sens., 15.","DOI":"10.3390\/rs15112845"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"1894","DOI":"10.1109\/LGRS.2019.2912582","article-title":"Vehicle detection from high-resolution remote sensing imagery using convolutional capsule networks","volume":"16","author":"Yu","year":"2019","journal-title":"IEEE Geosci. Remote Sens. Lett."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Li, Z., Zhang, Y., Wu, H., Suzuki, S., Namiki, A., and Wang, W. (2023). Design and application of a UAV autonomous inspection system for high-voltage power transmission lines. Remote Sens., 15.","DOI":"10.3390\/rs15030865"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"1829","DOI":"10.1108\/APJML-07-2020-0476","article-title":"Application of UAVs for tourism security and safety","volume":"33","author":"Ko","year":"2021","journal-title":"Asia Pac. J. Mark. Logist."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Jin, W., Yang, J., Fang, Y., and Feng, W. (2020, January 17\u201319). Research on application and deployment of UAV in emergency response. Proceedings of the 2020 IEEE 10th International Conference on Electronics Information and Emergency Communication (ICEIEC), Beijing, China.","DOI":"10.1109\/ICEIEC49280.2020.9152338"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Cao, J., Bao, W., Shang, H., Yuan, M., and Cheng, Q. (2023). GCL-YOLO: A GhostConv-based lightweight yolo network for UAV small object detection. Remote Sens., 15.","DOI":"10.3390\/rs15204932"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Girshick, R., Donahue, J., Darrell, T., and Malik, J. (2014, January 23\u201328). Rich feature hierarchies for accurate object detection and semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.81"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Girshick, R. (2015, January 7\u201313). Fast r-cnn. Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.169"},{"key":"ref_12","unstructured":"Ren, S., He, K., Girshick, R., and Sun, J. (2015). Faster r-cnn: Towards real-time object detection with region proposal networks. Advances in Neural Information Processing Systems, Proceedings of the 28th International Conference on Neural Information Processing Systems, Montreal, QC, Canada, 7\u201312 December 2015, MIT Press."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"1734","DOI":"10.1109\/JSTARS.2023.3339235","article-title":"Small Object Detection Algorithm Based on Improved YOLOv8 for Remote Sensing","volume":"17","author":"Yi","year":"2023","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Tahir, N.U.A., Long, Z., Zhang, Z., Asim, M., and ELAffendi, M. (2024). PVswin-YOLOv8s: UAV-Based Pedestrian and Vehicle Detection for Traffic Management in Smart Cities Using Improved YOLOv8. Drones, 8.","DOI":"10.3390\/drones8030084"},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"1556","DOI":"10.1109\/TIP.2020.3045636","article-title":"A global-local self-adaptive network for drone-view object detection","volume":"30","author":"Deng","year":"2020","journal-title":"IEEE Trans. Image Process."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Domozi, Z., Stojcsics, D., Benhamida, A., Kozlovszky, M., and Molnar, A. (2020, January 2\u20134). Real time object detection for aerial search and rescue missions for missing persons. Proceedings of the 2020 IEEE 15th International Conference of System of Systems Engineering (SoSE), Budapest, Hungary.","DOI":"10.1109\/SoSE50414.2020.9130475"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"3735","DOI":"10.1109\/JSTARS.2020.3005403","article-title":"Remote sensing image scene classification meets deep learning: Challenges, methods, benchmarks, and opportunities","volume":"13","author":"Cheng","year":"2020","journal-title":"IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens."},{"key":"ref_18","unstructured":"Adaimi, G., Kreiss, S., and Alahi, A. (2020). Perceiving traffic from aerial images. arXiv."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"6047","DOI":"10.1109\/TNNLS.2021.3080276","article-title":"Vehicle detection from UAV imagery with deep learning: A review","volume":"33","author":"Bouguettaya","year":"2021","journal-title":"IEEE Trans. Neural Netw. Learn. Syst."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Cao, Y., He, Z., Wang, L., Wang, W., Yuan, Y., Zhang, D., Zhang, J., Zhu, P., Van Gool, L., and Han, J. (2021, January 11\u201317). VisDrone-DET2021: The vision meets drone object detection challenge results. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Montreal, BC, Canada.","DOI":"10.1109\/ICCVW54120.2021.00319"},{"key":"ref_21","unstructured":"Zhang, X., Liu, C., Yang, D., Song, T., Ye, Y., Li, K., and Song, Y. (2023). Rfaconv: Innovating spatital attention and standard convolutional operation. arXiv."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Redmon, J., Divvala, S., Girshick, R., and Farhadi, A. (2016, January 27\u201330). You only look once: Unified, real-time object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.91"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Redmon, J., and Farhadi, A. (2017, January 21\u201326). YOLO9000: Better, faster, stronger. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.690"},{"key":"ref_24","unstructured":"Ioffe, S., and Szegedy, C. (2015, January 7\u20139). Batch normalization: Accelerating deep network training by reducing internal covariate shift. Proceedings of the International Conference on Machine Learning. Pmlr, Lille, France."},{"key":"ref_25","unstructured":"Redmon, J., and Farhadi, A. (2018). Yolov3: An incremental improvement. arXiv."},{"key":"ref_26","unstructured":"Bochkovskiy, A., Wang, C.Y., and Liao, H.Y.M. (2020). Yolov4: Optimal speed and accuracy of object detection. arXiv."},{"key":"ref_27","unstructured":"Jocher, G., Stoken, A., Chaurasia, A., Borovec, J., Kwon, Y., Michael, K., Changyu, L., Fang, J., Skalski, P., and Hogan, A. (2023, December 15). Ultralytics\/Yolov5: V6.0\u2014YOLOv5n \u2018Nano\u2019 Models, Roboflow Integration, TensorFlow Export, OpenCV DNN Support. Available online: https:\/\/zenodo.org\/record\/5563715."},{"key":"ref_28","unstructured":"Li, C., Li, L., Jiang, H., Weng, K., Geng, Y., Li, L., Ke, Z., Li, Q., Cheng, M., and Nie, W. (2022). YOLOv6: A single-stage object detection framework for industrial applications. arXiv."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Wang, C.Y., Bochkovskiy, A., and Liao, H.Y.M. (2023, January 17\u201324). YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada.","DOI":"10.1109\/CVPR52729.2023.00721"},{"key":"ref_30","unstructured":"Joher, G., Chaurasia, A., and Qiu, J. (2023, December 15). YOLO by Ultralytics. Available online: https:\/\/github.com\/ultralytics\/ultralytics\/blob\/main\/CITATION.cff."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Xu, H., Zheng, W., Liu, F., Li, P., and Wang, R. (2023). Unmanned aerial vehicle perspective small target recognition algorithm based on improved yolov5. Remote Sens., 15.","DOI":"10.3390\/rs15143583"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Yue, M., Zhang, L., Huang, J., and Zhang, H. (2024). Lightweight and Efficient Tiny-Object Detection Based on Improved YOLOv8n for UAV Aerial Images. Drones, 8.","DOI":"10.3390\/drones8070276"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Li, Y., Fan, Q., Huang, H., Han, Z., and Gu, Q. (2023). A modified YOLOv8 detection network for UAV aerial image recognition. Drones, 7.","DOI":"10.3390\/drones7050304"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Qu, J., Tang, Z., Zhang, L., Zhang, Y., and Zhang, Z. (2023). Remote sensing small object detection network based on attention mechanism and multi-scale feature fusion. Remote Sens., 15.","DOI":"10.3390\/rs15112728"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Yuan, X., Cheng, G., Yan, K., Zeng, Q., and Han, J. (2023, January 2\u20136). Small object detection via coarse-to-fine proposal generation and imitation learning. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Paris, France.","DOI":"10.1109\/ICCV51070.2023.00581"},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"5015214","DOI":"10.1109\/TIM.2024.3381272","article-title":"MFFSODNet: Multi-Scale Feature Fusion Small Object Detection Network for UAV Aerial Images","volume":"73","author":"Jiang","year":"2024","journal-title":"IEEE Trans. Instrum. Meas."},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"24","DOI":"10.1016\/j.patrec.2024.04.002","article-title":"Towards better small object detection in UAV scenes: Aggregating more object-oriented information","volume":"182","author":"Yang","year":"2024","journal-title":"Pattern Recognit. Lett."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"1483","DOI":"10.1109\/TPAMI.2019.2956516","article-title":"Cascade R-CNN: High quality object detection and instance segmentation","volume":"43","author":"Cai","year":"2019","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Goyal, P., Girshick, R., He, K., and Doll\u00e1r, P. (2017, January 22\u201329). Focal loss for dense object detection. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.324"},{"key":"ref_40","unstructured":"Duan, K., Bai, S., Xie, L., Qi, H., Huang, Q., and Tian, Q. (November, January 27). Centernet: Keypoint triplets for object detection. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Seoul, Republic of Korea."},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Hu, J., Shen, L., and Sun, G. (2018, January 18\u201323). Squeeze-and-excitation networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00745"},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Doll\u00e1r, P., Girshick, R., He, K., Hariharan, B., and Belongie, S. (2017, January 21\u201326). Feature pyramid networks for object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.106"},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Liu, S., Qi, L., Qin, H., Shi, J., and Jia, J. (2018, January 18\u201323). Path aggregation network for instance segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00913"},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Chollet, F. (2017, January 21\u201326). Xception: Deep learning with depthwise separable convolutions. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.195"},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Liu, W., Lu, H., Fu, H., and Cao, Z. (2023, January 2\u20136). Learning to Upsample by Learning to Sample. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Paris, France.","DOI":"10.1109\/ICCV51070.2023.00554"},{"key":"ref_46","first-page":"13467","article-title":"Towards large-scale small object detection: Survey and benchmarks","volume":"45","author":"Cheng","year":"2023","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Maire, M., Belongie, S., Hays, J., Perona, P., Ramanan, D., Doll\u00e1r, P., and Zitnick, C.L. (2014, January 6\u201312). Microsoft coco: Common objects in context. Proceedings of the Computer Vision\u2013ECCV 2014: 13th European Conference, Zurich, Switzerland. Proceedings, Part V 13.","DOI":"10.1007\/978-3-319-10602-1_48"},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Tan, M., Pang, R., and Le, Q.V. (2020, January 13\u201319). Efficientdet: Scalable and efficient object detection. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01079"},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Yang, G., Lei, J., Zhu, Z., Cheng, S., Feng, Z., and Liang, R. (2023, January 1\u20134). AFPN: Asymptotic feature pyramid network for object detection. Proceedings of the 2023 IEEE International Conference on Systems, Man, and Cybernetics (SMC), Oahu, HI, USA.","DOI":"10.1109\/SMC53992.2023.10394415"},{"key":"ref_50","doi-asserted-by":"crossref","first-page":"105057","DOI":"10.1016\/j.imavis.2024.105057","article-title":"ASF-YOLO: A novel YOLO model with attentional scale sequence fusion for cell instance segmentation","volume":"147","author":"Kang","year":"2024","journal-title":"Image Vis. Comput."},{"key":"ref_51","unstructured":"Wang, A., Chen, H., Liu, L., Chen, K., Lin, Z., Han, J., and Ding, G. (2024). YOLOv10: Real-Time End-to-End Object Detection. arXiv."},{"key":"ref_52","unstructured":"Wang, C.Y., Yeh, I.H., and Liao, H.Y.M. (2024). YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information. arXiv."},{"key":"ref_53","doi-asserted-by":"crossref","unstructured":"Wang, G., Chen, Y., An, P., Hong, H., Hu, J., and Huang, T. (2023). UAV-YOLOv8: A small-object-detection model based on improved YOLOv8 for UAV aerial photography scenarios. Sensors, 23.","DOI":"10.3390\/s23167190"},{"key":"ref_54","doi-asserted-by":"crossref","unstructured":"Zhang, Z. (2023). Drone-YOLO: An efficient neural network method for target detection in drone images. Drones, 7.","DOI":"10.3390\/drones7080526"},{"key":"ref_55","doi-asserted-by":"crossref","unstructured":"Law, H., and Deng, J. (2018, January 8\u201314). Cornernet: Detecting objects as paired keypoints. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01264-9_45"},{"key":"ref_56","first-page":"1922","article-title":"FCOS: A simple and strong anchor-free object detector","volume":"44","author":"Tian","year":"2020","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_57","unstructured":"Yang, Z., Liu, S., Hu, H., Wang, L., and Lin, S. (November, January 27). Reppoints: Point set representation for object detection. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Seoul, Republic of Korea."},{"key":"ref_58","doi-asserted-by":"crossref","unstructured":"Zhang, S., Chi, C., Yao, Y., Lei, Z., and Li, S.Z. (2020, January 13\u201319). Bridging the gap between anchor-based and anchor-free detection via adaptive training sample selection. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00978"},{"key":"ref_59","doi-asserted-by":"crossref","unstructured":"Sun, P., Zhang, R., Jiang, Y., Kong, T., Xu, C., Zhan, W., Tomizuka, M., Li, L., Yuan, Z., and Wang, C. (2021, January 20\u201325). Sparse r-cnn: End-to-end object detection with learnable proposals. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.01422"},{"key":"ref_60","unstructured":"Ge, Z., Liu, S., Wang, F., Li, Z., and Sun, J. (2021). Yolox: Exceeding yolo series in 2021. arXiv."}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/16\/16\/3057\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T15:39:34Z","timestamp":1760110774000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/16\/16\/3057"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,8,20]]},"references-count":60,"journal-issue":{"issue":"16","published-online":{"date-parts":[[2024,8]]}},"alternative-id":["rs16163057"],"URL":"https:\/\/doi.org\/10.3390\/rs16163057","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,8,20]]}}}