{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,21]],"date-time":"2026-01-21T18:54:46Z","timestamp":1769021686905,"version":"3.49.0"},"reference-count":52,"publisher":"MDPI AG","issue":"10","license":[{"start":{"date-parts":[[2025,10,8]],"date-time":"2025-10-08T00:00:00Z","timestamp":1759881600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Information"],"abstract":"<jats:p>Object detection has a vital impact on the analysis and interpretation of visual scenes. It is widely utilized in various fields, including healthcare, autonomous driving, and vehicle surveillance. However, complex scenes containing small, occluded, and multiscale objects present significant difficulties for object detection. This paper introduces a lightweight object detection algorithm, utilizing YOLOv8n as the baseline model, to address these problems. Our method focuses on four steps. Firstly, we add a layer for small object detection to enhance the feature expression capability of small objects. Secondly, to handle complex forms and appearances, we employ the C2f-DCNv2 module. This module integrates advanced DCNv2 (Deformable Convolutional Networks v2) by substituting the final C2f module in the backbone. Thirdly, we designed the CBAM, a lightweight attention module. We integrate it into the neck section to address missed detections. Finally, we use Ghost Convolution (GhostConv) as a light convolutional layer. This alternates with ordinary convolution in the neck. It ensures good detection performance while decreasing the number of parameters. Experimental performance on the PASCAL VOC dataset demonstrates that our approach lowers the number of model parameters by approximately 9.37%. The mAP@0.5:0.95 increased by 0.9%, recall (R) increased by 0.8%, mAP@0.5 increased by 0.3%, and precision (P) increased by 0.1% compared to the baseline model. To better evaluate the model\u2019s generalization performance in real-world driving scenarios, we conducted additional experiments using the KITTI dataset. Compared to the baseline model, our approach yielded a 0.8% improvement in mAP@0.5 and 1.3% in mAP@0.5:0.95. This result indicates strong performance in more dynamic and challenging conditions.<\/jats:p>","DOI":"10.3390\/info16100871","type":"journal-article","created":{"date-parts":[[2025,10,8]],"date-time":"2025-10-08T08:22:08Z","timestamp":1759911728000},"page":"871","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":1,"title":["Enhanced Lightweight Object Detection Model in Complex Scenes: An Improved YOLOv8n Approach"],"prefix":"10.3390","volume":"16","author":[{"ORCID":"https:\/\/orcid.org\/0009-0002-2473-9443","authenticated-orcid":false,"given":"Sohaya","family":"El Hamdouni","sequence":"first","affiliation":[{"name":"Advanced Digital Enterprise Modeling and Information Retrieval (ADMIR) Laboratory, Rabat IT Center, Information Retrieval and Data Analytics Team (IRDA), ENSIAS, Mohammed V University in Rabat, Rabat 11000, Morocco"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4582-7418","authenticated-orcid":false,"given":"Boutaina","family":"Hdioud","sequence":"additional","affiliation":[{"name":"Advanced Digital Enterprise Modeling and Information Retrieval (ADMIR) Laboratory, Rabat IT Center, Information Retrieval and Data Analytics Team (IRDA), ENSIAS, Mohammed V University in Rabat, Rabat 11000, Morocco"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5255-4406","authenticated-orcid":false,"given":"Sanaa","family":"El Fkihi","sequence":"additional","affiliation":[{"name":"Advanced Digital Enterprise Modeling and Information Retrieval (ADMIR) Laboratory, Rabat IT Center, Information Retrieval and Data Analytics Team (IRDA), ENSIAS, Mohammed V University in Rabat, Rabat 11000, Morocco"}]}],"member":"1968","published-online":{"date-parts":[[2025,10,8]]},"reference":[{"key":"ref_1","unstructured":"Ang, G.J.N., Goil, A.K., Chan, H., Lee, X.C., Mustaffa, R.B.A., Jason, T., Woon, Z.T., and Shen, B. (2023). A novel application for real-time arrhythmia detection using YOLOv8. arXiv."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Salinas-Medina, A., and Neme, A. (2023, January 11\u201313). Enhancing Hospital Efficiency Through Web-Deployed Object Detection: A YOLOv8-Based Approach for Automating Healthcare Operations. Proceedings of the 2023 IEEE Mexican International Conference on Computer Science (ENC), Guanajuato, Mexico.","DOI":"10.1109\/ENC60556.2023.10508642"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Shi, Z. (2021, January 15\u201317). Object detection models and research directions. Proceedings of the 2021 IEEE International Conference on Consumer Electronics and Computer Engineering (ICCECE), Guangzhou, China.","DOI":"10.1109\/ICCECE51280.2021.9342049"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Li, J., Xu, R., Ma, J., Zou, Q., Ma, J., and Yu, H. (2023, January 2\u20137). Domain adaptive object detection for autonomous driving under foggy weather. Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision, Waikoloa, HI, USA.","DOI":"10.1109\/WACV56688.2023.00068"},{"key":"ref_5","unstructured":"Mao, J., Shi, S., Wang, X., and Li, H. (2022). 3d object detection for autonomous driving: A review and new outlooks. arXiv."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"936","DOI":"10.1109\/TSMC.2020.3005231","article-title":"A survey of the four pillars for small object detection: Multiscale representation, contextual information, super-resolution, and region proposal","volume":"52","author":"Chen","year":"2020","journal-title":"IEEE Trans. Syst. Man Cybern. Syst."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Redmon, J., Divvala, S., Girshick, R., and Farhadi, A. (2016, January 27\u201330). You only look once: Unified, real-time object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.91"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"24344","DOI":"10.1109\/ACCESS.2020.2971026","article-title":"DF-SSD: An improved SSD object detection algorithm based on DenseNet and feature fusion","volume":"8","author":"Zhai","year":"2020","journal-title":"IEEE Access"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Xie, X., Cheng, G., Wang, J., Yao, X., and Han, J. (2021, January 10\u201317). Oriented R-CNN for object detection. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Montreal, QC, Canada.","DOI":"10.1109\/ICCV48922.2021.00350"},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"171","DOI":"10.1109\/TETCI.2020.3041019","article-title":"Granulated RCNN and multi-class deep sort for multi-object detection and tracking","volume":"6","author":"Pramanik","year":"2021","journal-title":"IEEE Trans. Emerg. Top. Comput. Intell."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"127447","DOI":"10.1016\/j.eswa.2025.127447","article-title":"SO-YOLOv8: A novel deep learning-based approach for small object detection with YOLO beyond COCO","volume":"280","author":"Giri","year":"2025","journal-title":"Expert Syst. Appl."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"5459","DOI":"10.1007\/s40747-024-01448-6","article-title":"TA-YOLO: A lightweight small object detection model based on multi-dimensional trans-attention module for remote sensing images","volume":"10","author":"Li","year":"2024","journal-title":"Complex Intell. Syst."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"29294","DOI":"10.1109\/ACCESS.2024.3368848","article-title":"Layn: Lightweight multi-scale attention yolov8 network for small object detection","volume":"12","author":"Ma","year":"2024","journal-title":"IEEE Access"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"4240","DOI":"10.1109\/TPAMI.2025.3538473","article-title":"YOLO-MS: Rethinking multi-scale representation learning for real-time object detection","volume":"47","author":"Chen","year":"2025","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"19","DOI":"10.1007\/s11063-024-11471-w","article-title":"A Vision Enhancement and Feature Fusion Multiscale Detection Network","volume":"56","author":"Qian","year":"2024","journal-title":"Neural Process. Lett."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"17021","DOI":"10.1007\/s11227-024-06121-w","article-title":"SPD-YOLOv8: An small-size object detection model of UAV imagery in complex scene","volume":"80","author":"Zhong","year":"2024","journal-title":"J. Supercomput."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"14987","DOI":"10.1109\/ACCESS.2024.3356869","article-title":"A Small Target Strawberry Recognition Method Based on Improved YOLOv8n Model","volume":"12","author":"Luo","year":"2024","journal-title":"IEEE Access"},{"key":"ref_18","first-page":"1095","article-title":"Improved faster R-CNN for multi-scale object detection","volume":"31","author":"Li","year":"2019","journal-title":"J. Comput.-Aided Des. Comput. Graph."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"1680","DOI":"10.3390\/make5040083","article-title":"A comprehensive review of yolo architectures in computer vision: From yolov1 to yolov8 and yolo-nas","volume":"5","author":"Terven","year":"2023","journal-title":"Mach. Learn. Knowl. Extr."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Dai, J., Qi, H., Xiong, Y., Li, Y., Zhang, G., Hu, H., and Wei, Y. (2017, January 22\u201329). Deformable convolutional networks. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.89"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Zhu, X., Hu, H., Lin, S., and Dai, J. (2019, January 15\u201320). Deformable convnets v2: More deformable, better results. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00953"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Jacobsen, J.H., Van Gemert, J., Lou, Z., and Smeulders, A.W. (2016, January 27\u201330). Structured receptive fields in cnns. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.286"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"e1145","DOI":"10.7717\/peerj-cs.1145","article-title":"Lightweight multi-scale network for small object detection","volume":"8","author":"Li","year":"2022","journal-title":"PeerJ Comput. Sci."},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Woo, S., Park, J., Lee, J.Y., and Kweon, I.S. (2018, January 8\u201314). Cbam: Convolutional block attention module. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01234-2_1"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"102907","DOI":"10.1109\/ACCESS.2020.2997466","article-title":"Research on recognition of fly species based on improved RetinaNet and CBAM","volume":"8","author":"Chen","year":"2020","journal-title":"IEEE Access"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"He, K., and Sun, J. (2015, January 7\u201312). Convolutional neural networks at constrained time cost. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7299173"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Han, K., Wang, Y., Tian, Q., Guo, J., Xu, C., and Xu, C. (2020, January 13\u201319). Ghostnet: More features from cheap operations. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00165"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"69","DOI":"10.1007\/s11554-023-01323-6","article-title":"Lightweight real-time lane detection algorithm based on ghost convolution and self batch normalization","volume":"20","author":"Yang","year":"2023","journal-title":"J. Real-Time Image Process."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"303","DOI":"10.1007\/s11263-009-0275-4","article-title":"The pascal visual object classes (voc) challenge","volume":"88","author":"Everingham","year":"2010","journal-title":"Int. J. Comput. Vis."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Geiger, A., Lenz, P., and Urtasun, R. (2012, January 16\u201321). Are we ready for autonomous driving? The kitti vision benchmark suite. Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, Providence, RI, USA.","DOI":"10.1109\/CVPR.2012.6248074"},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"1231","DOI":"10.1177\/0278364913491297","article-title":"Vision meets robotics: The kitti dataset","volume":"32","author":"Geiger","year":"2013","journal-title":"Int. J. Robot. Res."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Gidaris, S., and Komodakis, N. (2015, January 7\u201313). Object detection via a multi-region and semantic segmentation-aware cnn model. Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.135"},{"key":"ref_34","unstructured":"Dai, J., Li, Y., He, K., and Sun, J. (2016). R-fcn: Object detection via region-based fully convolutional networks. Advances in Neural Information Processing Systems, The MIT Press."},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Zhu, Y., Zhao, C., Wang, J., Zhao, X., Wu, Y., and Lu, H. (2017, January 22\u201329). Couplenet: Coupling global structure with local parts for object detection. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.444"},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"150","DOI":"10.1016\/j.neucom.2023.01.088","article-title":"Boosting R-CNN: Reweighting R-CNN samples by RPN\u2019s error for underwater object detection","volume":"530","author":"Song","year":"2023","journal-title":"Neurocomputing"},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Shen, Z., Liu, Z., Li, J., Jiang, Y.G., Chen, Y., and Xue, X. (2017, January 22\u201329). Dsod: Learning deeply supervised object detectors from scratch. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.212"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.Y., and Berg, A.C. (2016, January 11\u201314). Ssd: Single shot multibox detector. Proceedings of the Computer Vision\u2013ECCV 2016: 14th European Conference, Amsterdam, The Netherlands. Proceedings, Part I 14.","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Zhou, P., Ni, B., Geng, C., Hu, J., and Xu, Y. (2018, January 18\u201323). Scale-transferrable object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00062"},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Zhang, S., Wen, L., Bian, X., Lei, Z., and Li, S.Z. (2018, January 18\u201323). Single-shot refinement neural network for object detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00442"},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Kong, T., Sun, F., Tan, C., Liu, H., and Huang, W. (2018, January 8\u201314). Deep feature pyramid reconfiguration for object detection. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01228-1_11"},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Fan, B., Chen, W., Cong, Y., and Tian, J. (2020, January 23\u201328). Dual refinement underwater object detection network. Proceedings of the Computer Vision\u2013ECCV 2020: 16th European Conference, Glasgow, UK. Proceedings, Part XX 16.","DOI":"10.1007\/978-3-030-58565-5_17"},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Zhang, Z., Qiao, S., Xie, C., Shen, W., Wang, B., and Yuille, A.L. (2018, January 18\u201323). Single-shot object detection with enriched semantics. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00609"},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Pang, Y., Wang, T., Anwer, R.M., Khan, F.S., and Shao, L. (2019, January 15\u201320). Efficient featurized image pyramid network for single shot detector. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00751"},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Liu, S., and Huang, D. (2018, January 8\u201314). Receptive field block net for accurate and fast object detection. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01252-6_24"},{"key":"ref_46","unstructured":"Li, Y., Li, J., Lin, W., and Li, J. (2018). Tiny-DSOD: Lightweight object detection for resource-restricted usages. arXiv."},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"101670","DOI":"10.1016\/j.jksuci.2023.101670","article-title":"Bitnet: A lightweight object detection network for real-time classroom behavior recognition with transformer and bi-directional pyramid network","volume":"35","author":"Zhao","year":"2023","journal-title":"J. King Saud Univ.-Comput. Inf. Sci."},{"key":"ref_48","unstructured":"Bochkovskiy, A., Wang, C.Y., and Liao, H.Y.M. (2020). Yolov4: Optimal speed and accuracy of object detection. arXiv."},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Wang, C.Y., Bochkovskiy, A., and Liao, H.Y.M. (2021, January 20\u201325). Scaled-yolov4: Scaling cross stage partial network. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.01283"},{"key":"ref_50","doi-asserted-by":"crossref","first-page":"e24143","DOI":"10.1016\/j.heliyon.2024.e24143","article-title":"YOLO-SK: A lightweight multiscale object detection algorithm","volume":"10","author":"Wang","year":"2024","journal-title":"Heliyon"},{"key":"ref_51","doi-asserted-by":"crossref","unstructured":"Wang, C.Y., Bochkovskiy, A., and Liao, H.Y.M. (2023, January 17\u201324). YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada.","DOI":"10.1109\/CVPR52729.2023.00721"},{"key":"ref_52","doi-asserted-by":"crossref","first-page":"165","DOI":"10.1007\/s10462-024-10788-1","article-title":"Dynamic YOLO for small underwater object detection","volume":"57","author":"Chen","year":"2024","journal-title":"Artif. Intell. Rev."}],"container-title":["Information"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2078-2489\/16\/10\/871\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,9]],"date-time":"2025-10-09T18:50:15Z","timestamp":1760035815000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2078-2489\/16\/10\/871"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,10,8]]},"references-count":52,"journal-issue":{"issue":"10","published-online":{"date-parts":[[2025,10]]}},"alternative-id":["info16100871"],"URL":"https:\/\/doi.org\/10.3390\/info16100871","relation":{},"ISSN":["2078-2489"],"issn-type":[{"value":"2078-2489","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,10,8]]}}}