{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,7,11]],"date-time":"2026-07-11T01:48:18Z","timestamp":1783734498955,"version":"3.55.0"},"reference-count":34,"publisher":"MDPI AG","issue":"13","license":[{"start":{"date-parts":[[2023,6,26]],"date-time":"2023-06-26T00:00:00Z","timestamp":1687737600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"National Key Research and Development Program of China","award":["2018YFB601003"],"award-info":[{"award-number":["2018YFB601003"]}]},{"name":"National Key Research and Development Program of China","award":["CIT&TCD20190304"],"award-info":[{"award-number":["CIT&TCD20190304"]}]},{"name":"Beijing Great Wall Scholar Training Program","award":["2018YFB601003"],"award-info":[{"award-number":["2018YFB601003"]}]},{"name":"Beijing Great Wall Scholar Training Program","award":["CIT&TCD20190304"],"award-info":[{"award-number":["CIT&TCD20190304"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>The detection algorithm commonly misses obscured pedestrians in traffic scenes with a high pedestrian density because mutual occlusion among pedestrians reduces the prediction box score of the concealed pedestrians. 
Taking the YOLOv7 algorithm as the baseline, this paper analyzes the factors that influence detection performance and makes three improvements. First, the YOLOv7 backbone network is replaced with the lightweight feature extraction network MobileNetV3, because pedestrian detectors are often deployed on driverless mobile platforms that require fast inference. Second, to address missed detections of occluded pedestrians, a high-resolution feature pyramid structure is proposed: it upscales the feature maps generated by the feature pyramid to increase the resolution of the output feature maps and introduces shallow feature maps to strengthen the distinctions between adjacent sub-features, which improves the network\u2019s ability to extract features from the visible regions of occluded pedestrians and from small pedestrians and yields deeper, more discriminative pedestrian features. Third, a detection head based on an attention mechanism is proposed to lower the confidence of sub-features adjacent to a target, which reduces the number of redundant detection boxes and the cost of the subsequent NMS computation. In experiments on the crowded-pedestrian CrowdHuman dataset, the proposed approach achieves an mAP of 89.75%, which is 9.5 percentage points higher than the YOLOv7 detection algorithm. 
The algorithm proposed in this paper can considerably increase the detection performance of the detection algorithm, particularly for obscured pedestrians and small-sized pedestrians in the dataset, according to the experimental effect plots.<\/jats:p>","DOI":"10.3390\/s23135912","type":"journal-article","created":{"date-parts":[[2023,6,27]],"date-time":"2023-06-27T02:11:22Z","timestamp":1687831882000},"page":"5912","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":33,"title":["An Improved YOLOv7 Lightweight Detection Algorithm for Obscured Pedestrians"],"prefix":"10.3390","volume":"23","author":[{"given":"Chang","family":"Li","sequence":"first","affiliation":[{"name":"College of Electrical and Control Engineering, North China University of Technology, Beijing 100144, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Yiding","family":"Wang","sequence":"additional","affiliation":[{"name":"College of Information Science and Technology, North China University of Technology, Beijing 100144, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Xiaoming","family":"Liu","sequence":"additional","affiliation":[{"name":"College of Electrical and Control Engineering, North China University of Technology, Beijing 100144, China"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2023,6,26]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"1215001","DOI":"10.3788\/AOS202040.1215001","article-title":"Traffic light detection based on optimized YOLOv3 algorithm","volume":"40","author":"Sun","year":"2020","journal-title":"Acta Opt. Sin."},{"key":"ref_2","unstructured":"Wu, B., and Nevatia, R. (2005, January 17\u201321). Detection of multiple, partially occluded humans in a single image by Bayesian combination of edgelet part detectors. 
Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV\u201905), Beijing, China."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Sabzmeydani, P., and Mori, G. (2007, January 17\u201322). Detecting pedestrians by learning shapelet features. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Minneapolis, MN, USA.","DOI":"10.1109\/CVPR.2007.383134"},{"key":"ref_4","unstructured":"Ross, G., Jeff, D., and Trevor, D. (2014, January 23\u201328). Rich feature hierarchies for accurate object detection and semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Columbus, OH, USA."},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Girshick, R. (2015, January 7\u201313). Fast R-CNN. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Santiago, Chile.","DOI":"10.1109\/ICCV.2015.169"},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"1137","DOI":"10.1109\/TPAMI.2016.2577031","article-title":"Faster R-CNN: Towards real-time object detection with region proposal networks","volume":"39","author":"Ren","year":"2015","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"21","DOI":"10.1007\/978-3-319-46448-0_2","article-title":"SSD: Single shot multiBox detector","volume":"Volume 9905","author":"Leibe","year":"2016","journal-title":"Proceedings of the Computer Vision\u2013ECCV 2016: 14th European Conference"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Redmon, J., Divvala, S., Girshick, R., and Farhadi, A. (2016, January 27\u201330). You only look once: Unified, real-time object detection. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.91"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Redmon, J., and Farhadi, A. (2017, January 21\u201326). 
YOLO9000: Better, faster, stronger. Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.690"},{"key":"ref_10","unstructured":"Redmon, J., and Farhadi, A. (2018). YOLOv3: An Incremental Improvement. arXiv."},{"key":"ref_11","unstructured":"Bochkovskiy, A., Wang, C.-Y., and Liao, H.-Y.M. (2020). YOLOv4: Optimal Speed and Accuracy of Object Detection. arXiv."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"Zhu, X., Lyu, S., Wang, X., and Zhao, Q. (2021, January 10\u201317). TPH-YOLOv5: Improved YOLOv5 Based on Transformer Prediction Head for Object Detection on Drone-Captured Scenarios. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Montreal, QC, Canada.","DOI":"10.1109\/ICCVW54120.2021.00312"},{"key":"ref_13","unstructured":"Chu, Y.L., Hong, L.J., and Kai, H.W. (2022, January 18\u201324). YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications. Proceedings of the 2022 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, USA."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Wang, C.-Y., Bochkovskiy, A., and Liao, H.-Y.M. (2022, January 18\u201324). YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. Proceedings of the 2022 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, USA.","DOI":"10.1109\/CVPR52729.2023.00721"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Lin, T.Y., Goyal, P., Girshick, R., He, K., and Doll\u00e1r, P. (2017, January 22\u201327). Focal Loss for Dense Object Detection. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Venice, Italy.","DOI":"10.1109\/ICCV.2017.324"},{"key":"ref_16","unstructured":"Zhou, X., Wang, D., and Kr\u00e4henb\u00fchl, P. (2019). Objects as Points. 
arXiv."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Neubeck, A., and Van Gool, L. (2006, January 20\u201324). Efficient Non-Maximum Suppression. Proceedings of the 18th International Conference on Pattern Recognition (ICPR), Hong Kong, China.","DOI":"10.1109\/ICPR.2006.479"},{"key":"ref_18","unstructured":"Navaneeth, B., Bharat, S., Rama, C., and Larry, S. (2017, January 22\u201327). Soft-NMS: Improving Object Detection with One Line of Code. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Venice, Italy."},{"key":"ref_19","unstructured":"Song, T.L., Di, H., and Yun, H.W. (2019, January 16\u201320). Adaptive NMS: Refining Pedestrian Detection in a Crowd. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"1965","DOI":"10.1007\/s11554-021-01074-2","article-title":"An Improved one-stage pedestrian detection method based on multi-scale attention feature extraction","volume":"18","author":"Ma","year":"2021","journal-title":"J. Real-Time Image Process."},{"key":"ref_21","first-page":"1515001","article-title":"Occluded pedestrian detection algorithm based on attention mechanism","volume":"41","author":"Zou","year":"2021","journal-title":"Acta Opt. Sin."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Howard, A., Sandler, M., Chu, G., Chen, L.-C., Chen, B., Tan, M., Wang, W., Zhu, Y., Pang, R., and Vasudevan, V. (2019, January 16\u201320). Searching for MobileNetV3. Proceedings of the IEEE International Conference on Computer Vision (ICCV), Long Beach, CA, USA.","DOI":"10.1109\/ICCV.2019.00140"},{"key":"ref_23","unstructured":"Shao, S., Zhao, Z., Li, B., Xiao, T., Yu, G., Zhang, X., and Sun, J. (2018, January 18\u201323). CrowdHuman: A Benchmark for Detecting Human in a Crowd. 
Proceedings of the IEEE International Conference of Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA."},{"key":"ref_24","unstructured":"Loy, C.C., Lin, D., Ouyang, W., Xiong, Y., Yang, S., Huang, Q., Zhou, D., Xia, W., Li, Q., and Luo, P. (2018, January 18\u201323). WIDER Face and Pedestrian Challenge 2018: Methods and Results. Proceedings of the 2018 IEEE International Conference of Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., and Fei-Fei, L. (2009, January 22\u201324). ImageNet: A large-scale hierarchical image database. Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA.","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"ref_26","unstructured":"Chen, X., Fang, H., Lin, T.-Y., Vedantam, R., Gupta, S., Dollar, P., and Zitnick, C.L. (2015). Microsoft COCO Captions: Data Collection and Evaluation Server. arXiv."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Liu, S., Qi, L., Qin, H., Shi, J., and Jia, J. (2018, January 18\u201323). Path Aggregation Network for Instance Segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00913"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Lin, T.-Y., Doll\u00e1r, P., Girshick, R., He, K., Hariharan, B., and Belongie, S. (2017, January 21\u201326). Feature Pyramid Networks for Object Detection. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.106"},{"key":"ref_29","unstructured":"Howard, A.G., Zhu, M., Chen, B., Kalenichenko, D., Wang, W., Weyand, T., Andreetto, M., and Adam, H. (2017, January 21\u201326). MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications. 
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Sandler, M., Howard, A., Zhu, M., Zhmoginov, A., and Chen, L.-C. (2018, January 18\u201323). MobileNetV2: Inverted Residuals and Linear Bottlenecks. Proceedings of the 2018 IEEE of Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00474"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Hu, J., Shen, L., and Sun, G. (2018, January 18\u201323). Squeeze-and-Excitation Networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00745"},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Woo, S., Park, J., Lee, J.-Y., and Kweon, I.S. (2018, January 8\u201314). CBAM: Convolutional Block Attention Module. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01234-2_1"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Wang, Q., Wu, B., Zhu, P., Li, P., Zuo, W., and Hu, Q. (2020, January 23\u201328). ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks. Proceedings of the European Conference on Computer Vision (ECCV), Glasgow, UK.","DOI":"10.1109\/CVPR42600.2020.01155"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Wang, C.-Y., Bochkovskiy, A., and Liao, H.-Y.M. (2021, January 20\u201325). Scaled-YOLOv4: Scaling Cross Stage Partial Network. 
Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.01283"}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/23\/13\/5912\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T20:00:55Z","timestamp":1760126455000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/23\/13\/5912"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,6,26]]},"references-count":34,"journal-issue":{"issue":"13","published-online":{"date-parts":[[2023,7]]}},"alternative-id":["s23135912"],"URL":"https:\/\/doi.org\/10.3390\/s23135912","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,6,26]]}}}