{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,11]],"date-time":"2026-03-11T11:29:40Z","timestamp":1773228580699,"version":"3.50.1"},"reference-count":34,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2024,5,20]],"date-time":"2024-05-20T00:00:00Z","timestamp":1716163200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,5,20]],"date-time":"2024-05-20T00:00:00Z","timestamp":1716163200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61702464"],"award-info":[{"award-number":["61702464"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["62272423"],"award-info":[{"award-number":["62272423"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61702462"],"award-info":[{"award-number":["61702462"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100017700","name":"Henan Provincial Science and Technology Research Project","doi-asserted-by":"publisher","award":["222102210103"],"award-info":[{"award-number":["222102210103"]}],"id":[{"id":"10.13039\/501100017700","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100017700","name":"Henan Provincial Science and Technology Research Project","doi-asserted-by":"publisher","award":["232102211014"],"award-info":[{"award-number":["232102211014"]}],"id":[{"id":"10.13039\/501100017700","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100017700","name":"Henan Provincial Science and Technology Research Project","doi-asserted-by":"publisher","award":["222102210010"],"award-info":[{"award-number":["222102210010"]}],"id":[{"id":"10.13039\/501100017700","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100017700","name":"Henan Provincial Science and Technology Research Project","doi-asserted-by":"publisher","award":["232102211006"],"award-info":[{"award-number":["232102211006"]}],"id":[{"id":"10.13039\/501100017700","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100017700","name":"Henan Provincial Science and Technology Research Project","doi-asserted-by":"publisher","award":["232102210044"],"award-info":[{"award-number":["232102210044"]}],"id":[{"id":"10.13039\/501100017700","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Basic Research Projects of Education Department of Henan","award":["21zx003"],"award-info":[{"award-number":["21zx003"]}]},{"name":"the Key 365 projects Natural Science Foundation of Henan","award":["232300421150"],"award-info":[{"award-number":["232300421150"]}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Image Video Proc."],"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>In real scenes, small target faces often encounter various conditions, such as intricate background, occlusion and scale change, which leads to the problem of omission or misdetection of face detection results. To solve this puzzle, an improved algorithm of small target face detection 4AC-YOLOv5 is proposed. First, the algorithm by introducing a new layer to detect faces at a much smaller size, through the fusion of more shallow information, enhance the network perception of small objects, the accuracy of small target detection is improved; second, to improve the neck structure, to add the adaptive feature fusion network AFPN to replace FPN\u2009+\u2009PAN, to prevent the large information gap between non-adjacent Level to some extent, and to fully retain and integrate different scale characteristic information; and finally, improve the C3 module and propose a new multiscale residual module C3_MultiRes. Improving the expressive power of the network by introducing a multibranched structure and gradually increasing resolution somewhat reduces the complexity of the model calculation. The experimental results show that the precision of the improved model reached 94.54%, 93.08% and 84.98% in easy, medium and hard levels of WiderFace data set, respectively, and the results of detection are better than the original network. 4AC-YOLOv5 can meet the requirements of small target face detection in complex environment.<\/jats:p>","DOI":"10.1186\/s13640-024-00625-4","type":"journal-article","created":{"date-parts":[[2024,5,20]],"date-time":"2024-05-20T18:01:19Z","timestamp":1716228079000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":9,"title":["4AC-YOLOv5: an improved algorithm for small target face detection"],"prefix":"10.1186","volume":"2024","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-6338-4051","authenticated-orcid":false,"given":"Bin","family":"Jiang","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Hongbin","family":"Jiang","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Huanlong","family":"Zhang","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Qiuwen","family":"Zhang","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zuhe","family":"Li","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Lixun","family":"Huang","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2024,5,20]]},"reference":[{"issue":"4","key":"625_CR1","first-page":"122","volume":"9","author":"Y Liu","year":"2021","unstructured":"Y. Liu, L. Geng, W. Zhang et al., Survey of video based small target detection. J Image Graph 9(4), 122\u2013134 (2021)","journal-title":"J Image Graph"},{"issue":"1","key":"625_CR2","doi-asserted-by":"publisher","DOI":"10.1088\/1742-6596\/1544\/1\/012033","volume":"1544","author":"L Du","year":"2020","unstructured":"L. Du, R. Zhang, X. Wang, Overview of two-stage object detection algorithms. J Physics Conf Ser 1544(1), 012033 (2020)","journal-title":"J Physics Conf Ser"},{"key":"625_CR3","doi-asserted-by":"crossref","unstructured":"Y. Zhang, X Li, F. Wang, et al. A comprehensive review of one-stage networks for object detection. 2021 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC). IEEE, 2021: 1\u20136.","DOI":"10.1109\/ICSPCC52875.2021.9564613"},{"key":"625_CR4","doi-asserted-by":"crossref","unstructured":"R. Girshick, J. Donahue, T. Darrell, et al. Rich feature hierarchies for accurate object detection and semantic segmentation. Proceedings of the IEEE conference on computer vision and pattern recognition. 2014: 580\u2013587.","DOI":"10.1109\/CVPR.2014.81"},{"issue":"9","key":"625_CR5","doi-asserted-by":"publisher","first-page":"1904","DOI":"10.1109\/TPAMI.2015.2389824","volume":"37","author":"K He","year":"2015","unstructured":"K. He, X. Zhang, S. Ren et al., Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Trans. Pattern Anal. Mach. Intell. 37(9), 1904\u20131916 (2015)","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"625_CR6","doi-asserted-by":"crossref","unstructured":"R. Girshick. \u201cFast r-cnn\u201d, Proceedings of the IEEE international conference on computer vision. 1440\u20131448 (2015)","DOI":"10.1109\/ICCV.2015.169"},{"key":"625_CR7","unstructured":"S. Ren, K. He, R. Girshick, et al. Faster r-cnn: towards real-time object detection with region proposal networks. Adv. Neural Inf. Process Syst. 28 (2015)."},{"key":"625_CR8","unstructured":"J. Dai, Y. Li, K. He, et al. R-fcn: Object detection via region-based fully convolutional networks. Adv. Neural Inf. Proc. Syst. 29 (2016)."},{"key":"625_CR9","doi-asserted-by":"crossref","unstructured":"T.Y. Lin, P. Goyal, R. Girshick, et al. Focal loss for dense object detection. Proceedings of the IEEE international conference on computer vision. 2980\u20132988 (2017).","DOI":"10.1109\/ICCV.2017.324"},{"key":"625_CR10","doi-asserted-by":"crossref","unstructured":"J. Redmon, S. Divvala, R. Girshick, et al. You only look once: unified, real-time object detection. Proceedings of the IEEE conference on computer vision and pattern recognition. 779\u2013788 (2016).","DOI":"10.1109\/CVPR.2016.91"},{"key":"625_CR11","doi-asserted-by":"crossref","unstructured":"J. Redmon, A. Farhadi. YOLO9000: better, faster, stronger. Proceedings of the IEEE conference on computer vision and pattern recognition. 7263\u20137271 (2017).","DOI":"10.1109\/CVPR.2017.690"},{"key":"625_CR12","unstructured":"J. Redmon, A. Farhadi. Yolov3: an incremental improvement. arXiv preprint arXiv:1804.02767 (2018)."},{"key":"625_CR13","unstructured":"A. Bochkovskiy, C.Y. Wang, H.Y.M. Liao. Yolov4: optimal speed and accuracy of object detection. arXiv preprint arXiv:2004.10934 (2020)."},{"key":"625_CR14","unstructured":"C. Li, L. Li, H. Jiang, et al. YOLOv6: a single-stage object detection framework for industrial applications. arXiv preprint arXiv:2209.02976 (2022)."},{"key":"625_CR15","doi-asserted-by":"crossref","unstructured":"C. Y. Wang, A. Bochkovskiy, H. Y. M. Liao. YOLOv7: trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 7464\u20137475 (2023).","DOI":"10.1109\/CVPR52729.2023.00721"},{"key":"625_CR16","unstructured":"Z. Ge, S. Liu, F. Wang, et al. Yolox: exceeding yolo series in 2021. arXiv preprint arXiv:2107.08430 (2021)."},{"key":"625_CR17","doi-asserted-by":"crossref","unstructured":"W. Liu, D. Anguelov, D. Erhan, et al. Ssd: single shot multibox detector. Computer Vision\u2013ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11\u201314, 2016, Proceedings, Part I 14. Springer International Publishing, 2016: 21\u201337.","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"625_CR18","doi-asserted-by":"publisher","first-page":"927","DOI":"10.1007\/s10462-018-9650-2","volume":"52","author":"A Kumar","year":"2019","unstructured":"A. Kumar, A. Kaur, M. Kumar, Face detection techniques: a review. Artif. Intell. Rev. 52, 927\u2013948 (2019)","journal-title":"Artif. Intell. Rev."},{"issue":"1","key":"625_CR19","doi-asserted-by":"publisher","first-page":"502","DOI":"10.3390\/s23010502","volume":"23","author":"D Mamieva","year":"2023","unstructured":"D. Mamieva, A.B. Abdusalomov, M. Mukhiddinov et al., Improved face detection method via learning small faces on hard images based on a deep learning approach. Sensors 23(1), 502 (2023)","journal-title":"Sensors"},{"key":"625_CR20","doi-asserted-by":"crossref","unstructured":"C. Zhu, Y. Zheng, K. Luu, et al. Cms-rcnn: contextual multi-scale region-based cnn for unconstrained face detection. Deep Learn. Biometric. 57\u201379 (2017).","DOI":"10.1007\/978-3-319-61657-5_3"},{"key":"625_CR21","doi-asserted-by":"crossref","unstructured":"J. Deng, J. Guo, Y. Zhou et al. Retinaface: single-stage dense face localisation in the wild. arXiv preprint arXiv:1905.00641 (2019).","DOI":"10.1109\/CVPR42600.2020.00525"},{"key":"625_CR22","doi-asserted-by":"crossref","unstructured":"J. Li, Y. Wang, C. Wang, et al. DSFD: dual shot face detector. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 5060\u20135069 (2019).","DOI":"10.1109\/CVPR.2019.00520"},{"key":"625_CR23","doi-asserted-by":"crossref","unstructured":"D. Qi, W. Tan, Q. Yao, et al. YOLO5Face: why reinventing a face detector. European Conference on Computer Vision. Cham: Springer Nature Switzerland. 228\u2013244 (2022).","DOI":"10.1007\/978-3-031-25072-9_15"},{"key":"625_CR24","doi-asserted-by":"crossref","unstructured":"S. Liu, L. Qi, H. Qin et al. Path aggregation network for instance segmentation. Proceedings of the IEEE conference on computer vision and pattern recognition. 8759\u20138768 (2018).","DOI":"10.1109\/CVPR.2018.00913"},{"issue":"8","key":"625_CR25","doi-asserted-by":"publisher","first-page":"1295","DOI":"10.3390\/electronics9081295","volume":"9","author":"M Ahmed","year":"2020","unstructured":"M. Ahmed, R. Seraj, S.M.S. Islam, The k-means algorithm: a comprehensive survey and performance evaluation. Electronics 9(8), 1295 (2020)","journal-title":"Electronics"},{"key":"625_CR26","first-page":"1","volume":"71","author":"N Zeng","year":"2022","unstructured":"N. Zeng, P. Wu, Z. Wang et al., A small-sized object detection oriented multi-scale feature fusion approach with application to defect detection. IEEE Trans. Instrum. Meas. 71, 1\u201314 (2022)","journal-title":"IEEE Trans. Instrum. Meas."},{"key":"625_CR27","doi-asserted-by":"crossref","unstructured":"G. Yang, J. Lei, Z. Zhu, et al. AFPN: asymptotic feature pyramid network for object detection. arXiv preprint arXiv:2306.15988 (2023).","DOI":"10.1109\/SMC53992.2023.10394415"},{"key":"625_CR28","unstructured":"S. Liu, D. Huang, Y. Wang. Learning spatial fusion for single-shot object detection. arXiv preprint arXiv:1911.09516 (2019)."},{"key":"625_CR29","doi-asserted-by":"crossref","unstructured":"K. He, X. Zhang, S. Ren, et al. Deep residual learning for image recognition. Proceedings of the IEEE conference on computer vision and pattern recognition. 770\u2013778 (2016).","DOI":"10.1109\/CVPR.2016.90"},{"issue":"2","key":"625_CR30","doi-asserted-by":"publisher","first-page":"652","DOI":"10.1109\/TPAMI.2019.2938758","volume":"43","author":"SH Gao","year":"2019","unstructured":"S.H. Gao, M.M. Cheng, K. Zhao et al., Res2net: a new multi-scale backbone architecture. IEEE Trans. Pattern Anal. Mach. Intell. 43(2), 652\u2013662 (2019)","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"625_CR31","doi-asserted-by":"publisher","first-page":"26595","DOI":"10.1007\/s11042-023-14859-3","volume":"82","author":"J He","year":"2023","unstructured":"J. He, X. Song, Z. Feng et al., ETM-face: effective training sample selection and multi-scale feature learning for face detection. Multimed. Tools Appl. 82, 26595\u201326611 (2023)","journal-title":"Multimed. Tools Appl."},{"key":"625_CR32","doi-asserted-by":"crossref","unstructured":"S. Yang, P. Luo, C.C. Loy, et al. Wider face: a face detection benchmark. Proceedings of the IEEE conference on computer vision and pattern recognition. 5525\u20135533 (2016).","DOI":"10.1109\/CVPR.2016.596"},{"key":"625_CR33","unstructured":"V. Jain, E. Learned-Miller. Fddb: a benchmark for face detection in unconstrained settings. UMass Amherst technical report (2010)."},{"key":"625_CR34","unstructured":"J. Guo, J. Deng, A. Lattas, et al. Sample and computation redistribution for efficient face detection. arXiv preprint arXiv:2105.04714 (2021)."}],"container-title":["EURASIP Journal on Image and Video Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13640-024-00625-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13640-024-00625-4\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13640-024-00625-4.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,5,20]],"date-time":"2024-05-20T18:04:31Z","timestamp":1716228271000},"score":1,"resource":{"primary":{"URL":"https:\/\/jivp-eurasipjournals.springeropen.com\/articles\/10.1186\/s13640-024-00625-4"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,5,20]]},"references-count":34,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2024,12]]}},"alternative-id":["625"],"URL":"https:\/\/doi.org\/10.1186\/s13640-024-00625-4","relation":{"has-preprint":[{"id-type":"doi","id":"10.21203\/rs.3.rs-3871414\/v1","asserted-by":"object"}]},"ISSN":["1687-5281"],"issn-type":[{"value":"1687-5281","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,5,20]]},"assertion":[{"value":"17 January 2024","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"22 April 2024","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"20 May 2024","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that they do not have any competing interests.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"10"}}