{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,7,16]],"date-time":"2026-07-16T02:41:35Z","timestamp":1784169695575,"version":"3.55.0"},"reference-count":37,"publisher":"MDPI AG","issue":"20","license":[{"start":{"date-parts":[[2023,10,12]],"date-time":"2023-10-12T00:00:00Z","timestamp":1697068800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Scientific Research Foundation of Hubei University of Technology","award":["BSQD2020055"],"award-info":[{"award-number":["BSQD2020055"]}]},{"name":"Scientific Research Foundation of Hubei University of Technology","award":["XBY-ZDKJ-2020-08"],"award-info":[{"award-number":["XBY-ZDKJ-2020-08"]}]},{"name":"Northwest Engineering Corporation Limited Major Science and Technology Projects","award":["BSQD2020055"],"award-info":[{"award-number":["BSQD2020055"]}]},{"name":"Northwest Engineering Corporation Limited Major Science and Technology Projects","award":["XBY-ZDKJ-2020-08"],"award-info":[{"award-number":["XBY-ZDKJ-2020-08"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Remote Sensing"],"abstract":"<jats:p>Precise object detection for unmanned aerial vehicle (UAV) images is a prerequisite for many UAV image applications. Compared with natural scene images, UAV images often have many small objects with few image pixels. These small objects are often obscured, densely distributed, or in complex scenes, which causes great interference to object detection. Aiming to solve this problem, a GhostConv-based lightweight YOLO network (GCL-YOLO) is proposed. In the proposed network, a GhostConv-based backbone network with a few parameters was firstly built. Then, a new prediction head for UAV small objects was designed, and the original prediction head for large natural scene objects was removed. Finally, the focal-efficient intersection over union (Focal-EIOU) loss was used as the localization loss. The experimental results of the VisDrone-DET2021 dataset and the UAVDT dataset showed that, compared with the YOLOv5-S network, the mean average precision at IOU = 0.5 achieved by the proposed GCL-YOLO-S network was improved by 6.9% and 1.8%, respectively, while the parameter amount and the calculation amount were reduced by 76.7% and 32.3%, respectively. Compared with some excellent lightweight networks, the proposed network achieved the highest and second-highest detection accuracy on the two datasets with the smallest parameter amount and a medium calculation amount, respectively.<\/jats:p>","DOI":"10.3390\/rs15204932","type":"journal-article","created":{"date-parts":[[2023,10,12]],"date-time":"2023-10-12T12:46:13Z","timestamp":1697114773000},"page":"4932","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":129,"title":["GCL-YOLO: A GhostConv-Based Lightweight YOLO Network for UAV Small Object Detection"],"prefix":"10.3390","volume":"15","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-2266-5620","authenticated-orcid":false,"given":"Jinshan","family":"Cao","sequence":"first","affiliation":[{"name":"School of Computer Science, Hubei University of Technology, Wuhan 430068, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Wenshu","family":"Bao","sequence":"additional","affiliation":[{"name":"School of Computer Science, Hubei University of Technology, Wuhan 430068, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Haixing","family":"Shang","sequence":"additional","affiliation":[{"name":"Northwest Engineering Corporation Limited, Power China Group, Xi\u2019an 710064, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Ming","family":"Yuan","sequence":"additional","affiliation":[{"name":"School of Computer Science, Hubei University of Technology, Wuhan 430068, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Qian","family":"Cheng","sequence":"additional","affiliation":[{"name":"School of Computer Science, Hubei University of Technology, Wuhan 430068, China"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2023,10,12]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"20","DOI":"10.1016\/j.isprsjprs.2017.11.011","article-title":"Beyond RGB: Very High Resolution Urban Remote Sensing with Multimodal Deep Networks","volume":"140","author":"Audebert","year":"2018","journal-title":"ISPRS J. Photogram. Remote Sens."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"19","DOI":"10.3390\/drones6010019","article-title":"Improving the Model for Person Detection in Aerial Image Sequences Using the Displacement Vector: A Search and Rescue Scenario","volume":"6","year":"2022","journal-title":"Drones"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"3203","DOI":"10.3390\/rs15123203","article-title":"Implementing Cloud Computing for the Digital Mapping of Agricultural Soil Properties from High Resolution UAV Multispectral Imagery","volume":"15","author":"Pizarro","year":"2023","journal-title":"Remote Sens."},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"2845","DOI":"10.3390\/rs15112845","article-title":"UAV Implementations in Urban Planning and Related Sectors of Rapidly Developing Nations: A Review and Future Perspectives for Malaysia","volume":"15","author":"Saad","year":"2023","journal-title":"Remote Sens."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Lin, T.-Y., Maire, M., Belongie, S., Hays, J., Perona, P., Ramanan, D., Doll\u00e1r, P., and Zitnick, C.L. (2014, January 6\u201312). Microsoft COCO: Common Objects in Context. Proceedings of the Computer Vision\u2014ECCV 2014, Zurich, Switzerland.","DOI":"10.1007\/978-3-319-10602-1_48"},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"303","DOI":"10.1007\/s11263-009-0275-4","article-title":"The Pascal Visual Object Classes (VOC) Challenge","volume":"88","author":"Everingham","year":"2010","journal-title":"Int. J. Comput. Vision"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Cao, Y., He, Z., Wang, L., Wang, W., Yuan, Y., Zhang, D., Zhang, J., Zhu, P., Van Gool, L., and Han, J. (2021, January 11\u201317). VisDrone-DET2021: The Vision Meets Drone Object Detection Challenge Results. Proceedings of the 2021 IEEE\/CVF International Conference on Computer Vision Workshops, Montreal, QC, Canada.","DOI":"10.1109\/ICCVW54120.2021.00319"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"1141","DOI":"10.1007\/s11263-019-01266-1","article-title":"The Unmanned Aerial Vehicle Benchmark: Object Detection, Tracking and Baseline","volume":"128","author":"Yu","year":"2020","journal-title":"Int. J. Comput. Vision"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Redmon, J., Divvala, S., Girshick, R., and Farhadi, A. (2016, January 27\u201330). You Only Look Once: Unified, Real-Time Object Detection. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.91"},{"key":"ref_10","unstructured":"Redmon, J., and Farhadi, A. (2018). YOLOv3: An Incremental Improvement. arXiv."},{"key":"ref_11","unstructured":"Bochkovskiy, A., Wang, C.-Y., and Liao, H.-Y.M. (2020). YOLOv4: Optimal Speed and Accuracy of Object Detection. arXiv."},{"key":"ref_12","unstructured":"Ge, Z., Liu, S., Wang, F., Li, Z., and Sun, J. (2021). YOLOX: Exceeding YOLO Series in 2021. arXiv."},{"key":"ref_13","unstructured":"Li, C., Li, L., Jiang, H., Weng, K., Geng, Y., Li, L., Ke, Z., Li, Q., Cheng, M., and Nie, W. (2022). YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications. arXiv."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Wang, C.-Y., Bochkovskiy, A., and Liao, H.-Y.M. (2023, January 18\u201322). YOLOv7: Trainable Bag-of-Freebies Sets New State-of-the-Art for Real-Time Object Detectors. Proceedings of the 2023 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada.","DOI":"10.1109\/CVPR52729.2023.00721"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Kisantal, M., Wojna, Z., Murawski, J., Naruniec, J., and Cho, K. (2019). Augmentation for Small Object Detection. arXiv.","DOI":"10.5121\/csit.2019.91713"},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"108998","DOI":"10.1016\/j.patcog.2022.108998","article-title":"A Full Data Augmentation Pipeline for Small Object Detection Based on Generative Adversarial Networks","volume":"133","author":"Bosquet","year":"2023","journal-title":"Pattern Recognit."},{"key":"ref_17","first-page":"1","article-title":"Feature Split\u2013Merge\u2013Enhancement Network for Remote Sensing Object Detection","volume":"60","author":"Ma","year":"2022","journal-title":"IEEE Trans. Geosci. Remote Sens."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Zhu, X., Lyu, S., Wang, X., and Zhao, Q. (2021, January 11\u201317). TPH-YOLOv5: Improved YOLOv5 Based on Transformer Prediction Head for Object Detection on Drone-Captured Scenarios. Proceedings of the 2021 IEEE\/CVF International Conference on Computer Vision Workshops, Montreal, QC, Canada.","DOI":"10.1109\/ICCVW54120.2021.00312"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"8448","DOI":"10.1007\/s10489-021-02893-3","article-title":"RSOD: Real-Time Small Object Detection Algorithm in UAV-Based Traffic Monitoring","volume":"52","author":"Sun","year":"2022","journal-title":"Appl. Intell."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Li, C., Yang, T., Zhu, S., Chen, C., and Guan, S. (2020, January 14\u201319). Density Map Guided Object Detection in Aerial Images. Proceedings of the 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition Workshops, Seattle, WA, USA.","DOI":"10.1109\/CVPRW50498.2020.00103"},{"key":"ref_21","first-page":"1","article-title":"Real-Time Object Detection for the Running Train Based on the Improved YOLO V4 Neural Network","volume":"2022","author":"Liu","year":"2022","journal-title":"J. Adv. Transp."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Zhang, P., Zhong, Y., and Li, X. (2019, January 27\u201328). SlimYOLOv3: Narrower, Faster and Better for Real-Time UAV Applications. Proceedings of the 2019 IEEE\/CVF International Conference on Computer Vision Workshop, Seoul, Republic of Korea.","DOI":"10.1109\/ICCVW.2019.00011"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Sandler, M., Howard, A., Zhu, M., Zhmoginov, A., and Chen, L.-C. (2018, January 18\u201323). Mobilenetv2: Inverted Residuals and Linear Bottlenecks. Proceedings of the 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00474"},{"key":"ref_24","unstructured":"Howard, A.G., Zhu, M., Chen, B., Kalenichenko, D., Wang, W., Weyand, T., Andreetto, M., and Adam, H. (2017). Mobilenets: Efficient Convolutional Neural Networks for Mobile Vision Applications. arXiv."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Howard, A., Sandler, M., Chu, G., Chen, L.-C., Chen, B., Tan, M., Wang, W., Zhu, Y., Pang, R., and Vasudevan, V. (November, January 27). Searching for Mobilenetv3. Proceedings of the 2019 IEEE\/CVF International Conference on Computer Vision, Seoul, Republic of Korea.","DOI":"10.1109\/ICCV.2019.00140"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Zhang, X., Zhou, X., Lin, M., and Sun, J. (2018, January 18\u201323). ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices. Proceedings of the 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00716"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Ma, N., Zhang, X., Zheng, H.-T., and Sun, J. (2018). ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design. arXiv.","DOI":"10.1007\/978-3-030-01264-9_8"},{"key":"ref_28","first-page":"9969","article-title":"GhostNetv2: Enhance Cheap Operation with Long-Range Attention","volume":"35","author":"Tang","year":"2022","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Han, K., Wang, Y., Tian, Q., Guo, J., Xu, C., and Xu, C. (2020, January 13\u201319). GhostNet: More Features from Cheap Operations. Proceedings of the 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00165"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Wang, Q., Wu, B., Zhu, P., Li, P., Zuo, W., and Hu, Q. (2020, January 13\u201319). ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks. Proceedings of the 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01155"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Wang, C.-Y., Liao, H.-Y.M., Wu, Y.-H., Chen, P.-Y., Hsieh, J.-W., and Yeh, I.-H. (2020, January 14\u201319). CSPNet: A New Backbone that can Enhance Learning Capability of CNN. Proceedings of the 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition Workshops, Seattle, WA, USA.","DOI":"10.1109\/CVPRW50498.2020.00203"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Zheng, Z., Wang, P., Liu, W., Li, J., Ye, R., and Ren, D. (2019). Distance-IoU Loss: Faster and Better Learning for Bounding Box Regression. arXiv.","DOI":"10.1609\/aaai.v34i07.6999"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Zhang, Y., Ren, W., Zhang, Z., Jia, Z., Wang, L., and Tan, T. (2021). Focal and Efficient IOU Loss for Accurate Bounding Box Regression. arXiv.","DOI":"10.1016\/j.neucom.2022.07.042"},{"key":"ref_34","unstructured":"Yu, G., Chang, Q., Lv, W., Xu, C., Cui, C., Ji, W., Dang, Q., Deng, K., Wang, G., and Du, Y. (2021). PP-PicoDet: A Better Real-Time Object Detector on Mobile Devices. arXiv."},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Feng, C., Zhong, Y., Gao, Y., Scott, M.R., and Huang, W. (2021, January 10\u201317). Tood: Task-Aligned One-Stage Object Detection. Proceedings of the 2021 IEEE\/CVF International Conference on Computer Vision, Montreal, QC, Canada.","DOI":"10.1109\/ICCV48922.2021.00349"},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Zhang, H., Wang, Y., Dayoub, F., and Sunderhauf, N. (2021, January 20\u201325). VarifocalNet: An IoU-Aware Dense Object Detector. Proceedings of the 2021 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.00841"},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Li, Y., Chen, Y., Wang, N., and Zhang, Z. (2019\u20132, January 27). Scale-Aware Trident Networks for Object Detection. Proceedings of the 2019 IEEE\/CVF International Conference on Computer Vision, Seoul, Republic of Korea.","DOI":"10.1109\/ICCV.2019.00615"}],"container-title":["Remote Sensing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2072-4292\/15\/20\/4932\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T21:05:37Z","timestamp":1760130337000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2072-4292\/15\/20\/4932"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,10,12]]},"references-count":37,"journal-issue":{"issue":"20","published-online":{"date-parts":[[2023,10]]}},"alternative-id":["rs15204932"],"URL":"https:\/\/doi.org\/10.3390\/rs15204932","relation":{},"ISSN":["2072-4292"],"issn-type":[{"value":"2072-4292","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,10,12]]}}}