{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,12]],"date-time":"2025-10-12T02:19:59Z","timestamp":1760235599524,"version":"build-2065373602"},"reference-count":41,"publisher":"MDPI AG","issue":"17","license":[{"start":{"date-parts":[[2021,8,30]],"date-time":"2021-08-30T00:00:00Z","timestamp":1630281600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100010906","name":"NSAF Joint Fund","doi-asserted-by":"publisher","award":["U2030204"],"award-info":[{"award-number":["U2030204"]}],"id":[{"id":"10.13039\/501100010906","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>Pedestrian detection has been widely used in applications such as video surveillance and intelligent robots. Recently, deep learning-based pedestrian detection engines have attracted lots of attention. However, the computational complexity of these engines is high, which makes them unsuitable for hardware- and power-constrained mobile applications, such as drones for surveillance. In this paper, we propose a lightweight pedestrian detection engine with a two-stage low-complexity detection network and adaptive region focusing technique, to reduce the computational complexity in pedestrian detection, while maintaining sufficient detection accuracy. The proposed pedestrian detection engine has significantly reduced the number of parameters (0.73 M) and operations (1.04 B), while achieving a comparable precision (85.18%) and miss rate (25.16%) to many existing designs. Moreover, the proposed engine, together with YOLOv3 and YOLOv3-Tiny, has been implemented on a Xilinx FPGA Zynq7020 for comparison. It is able to achieve 16.3 Fps while consuming 0.59 W, which outperforms the results of YOLOv3 (5.3 Fps, 2.43 W) and YOLOv3-Tiny (12.8 Fps, 0.95 W).<\/jats:p>","DOI":"10.3390\/s21175851","type":"journal-article","created":{"date-parts":[[2021,8,31]],"date-time":"2021-08-31T22:58:15Z","timestamp":1630450695000},"page":"5851","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["A Lightweight Pedestrian Detection Engine with Two-Stage Low-Complexity Detection Network and Adaptive Region Focusing Technique"],"prefix":"10.3390","volume":"21","author":[{"given":"Luying","family":"Que","sequence":"first","affiliation":[{"name":"School of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China"}]},{"given":"Teng","family":"Zhang","sequence":"additional","affiliation":[{"name":"School of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China"}]},{"given":"Hongtao","family":"Guo","sequence":"additional","affiliation":[{"name":"School of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China"}]},{"given":"Conghan","family":"Jia","sequence":"additional","affiliation":[{"name":"School of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China"}]},{"given":"Yuchuan","family":"Gong","sequence":"additional","affiliation":[{"name":"School of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6685-5576","authenticated-orcid":false,"given":"Liang","family":"Chang","sequence":"additional","affiliation":[{"name":"School of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China"}]},{"given":"Jun","family":"Zhou","sequence":"additional","affiliation":[{"name":"School of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China"}]}],"member":"1968","published-online":{"date-parts":[[2021,8,30]]},"reference":[{"key":"ref_1","unstructured":"Dalal, N., and Triggs, B. (2005, January 20\u201325). Histograms of oriented gradients for human detection. Proceedings of the In IEEE Conference on Computer Vision and Pattern Recognition, San Diego, CA, USA."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"1627","DOI":"10.1109\/TPAMI.2009.167","article-title":"Object Detection with Discriminatively Trained Part-Based Models","volume":"32","author":"Felzenszwalb","year":"2010","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_3","first-page":"82","article-title":"A performance evaluation of single and multi-feature people detection","volume":"5096","author":"Rigoll","year":"2008","journal-title":"Pattern Recognition"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"1532","DOI":"10.1109\/TPAMI.2014.2300479","article-title":"Fast Feature Pyramids for Object Detection","volume":"36","author":"Dollar","year":"2014","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_5","first-page":"51","article-title":"A Pedestrian Detection Method Using SVM and CNN Multistage Classification","volume":"9","author":"He","year":"2018","journal-title":"J. Inf. Hiding Multim. Signal Process."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"3703","DOI":"10.1109\/TIP.2018.2818018","article-title":"Too Far to See? Not Really!\u2014Pedestrian Detection With Scale-Aware Localization Policy","volume":"27","author":"Zhang","year":"2018","journal-title":"IEEE Trans. Image Process."},{"key":"ref_7","first-page":"354","article-title":"A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection","volume":"9908","author":"Cai","year":"2016","journal-title":"European Conference on Computer Vision"},{"key":"ref_8","first-page":"745","article-title":"Graininess-Aware Deep Feature Learning for Pedestrian Detection","volume":"11213","author":"Ferrari","year":"2018","journal-title":"European Conference on Computer Vision"},{"key":"ref_9","first-page":"215","article-title":"Pedestrian Detection Method Based on YOLO Network","volume":"44","author":"Gao","year":"2018","journal-title":"Comput. Eng."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Peng, Q., Luo, W., Hong, G., Feng, M., Xia, Y., Yu, L., Hao, X., Wang, X., and Li, M. (2016, January 27\u201328). Pedestrian Detection for Transformer Substation Based on Gaussian Mixture Model and YOLO. Proceedings of the 2016 8th International Conference on Intelligent Human-Machine Systems and Cybernetics, Hangzhou, China.","DOI":"10.1109\/IHMSC.2016.130"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Byeon, Y.-H., and Kwak, K.-C. (2017, January 9\u201313). A Performance Comparison of Pedestrian Detection Using Faster RCNN and ACF. Proceedings of the 2017 6th IIAI International Congress on Advanced Applied Informatics (IIAI-AAI), Hamamatsu, Japan.","DOI":"10.1109\/IIAI-AAI.2017.196"},{"key":"ref_12","unstructured":"Xiaoqian, Y., Yujuan, S., and Liangliang, L. (2019, January 11\u201313). Pedestrian detection based on improved Faster RCNN algorithm. Proceedings of the 2019 IEEE\/CIC International Conference on Communications in China (ICCC), Changchun, China."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Felzenszwalb, P., McAllester, D., and Ramanan, D. (2008, January 23\u201328). A discriminatively trained, multiscale, deformable part model. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Anchorage, AK, USA.","DOI":"10.1109\/CVPR.2008.4587597"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Wang, X., Han, T.X., and Yan, S. (October, January 29). An HOG-LBP Human Detector with Partial Occlusion Handling. Proceedings of the 2009 IEEE 12th International Conference on Computer Vision, Kyoto, Japan.","DOI":"10.1109\/ICCV.2009.5459207"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Sabzmeydani, P., and Mori, G. (2007, January 17\u201322). Detecting pedestrians by learning shapelet features. Proceedings of the 2007 IEEE Conference on Computer Vision and Pattern Recognition, Minneapolis, MN, USA.","DOI":"10.1109\/CVPR.2007.383134"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Schwartz, W.R., Kembhavi, A., Harwood, D., and Davis, L.S. (October, January 29). Human Detection Using Partial Least Squares Analysis. Proceedings of the 2009 IEEE 12th International Conference on Computer Vision, Kyoto, Japan.","DOI":"10.1109\/ICCV.2009.5459205"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"3148","DOI":"10.1109\/TMM.2018.2829602","article-title":"Pedestrian Detection via Body Part Semantic and Contextual Information With DNN","volume":"20","author":"Wang","year":"2018","journal-title":"IEEE Trans. Multimed."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Liu, T., and Stathaki, T. (2017, January 23\u201325). Enhanced Pedestrian Detection using Deep Learning based Semantic Image Segmentation. Proceedings of the 2017 22nd International Conference on Digital Signal Processing, London, UK.","DOI":"10.1109\/ICDSP.2017.8096045"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"47687","DOI":"10.1109\/ACCESS.2019.2910201","article-title":"PedJointNet: Joint Head-Shoulder and Full body Deep Network for Pedestrian Detection","volume":"7","author":"Lin","year":"2019","journal-title":"IEEE Access"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"3143","DOI":"10.1109\/TIP.2019.2957927","article-title":"Taking a Look at Small-Scale Pedestrians and Occluded Pedestrians","volume":"29","author":"Cao","year":"2020","journal-title":"IEEE Trans. Image Process."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Yin, R. (2019, January 22\u201325). Multi-resolution generative adversarial networks for tiny-scale pedestrian detection. Proceedings of the 2019 IEEE International Conference on Image Processing, Taipei, Taiwan.","DOI":"10.1109\/ICIP.2019.8803030"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Li, X., Liu, Y., Chen, Z., Zhou, J., and Wu, Y. (2018, January 7\u201310). Fused discriminative metric learning for low resolution pedestrian detection. Proceedings of the 2018 25th IEEE International Conference on Image Processing, Athens, Greece.","DOI":"10.1109\/ICIP.2018.8451791"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Kruthiventi, S.S.S., Sahay, P., and Biswal, R. (2017, January 17\u201320). Low-light pedestrian detection from rgb images using multi-modal knowledge distillation. Proceedings of the 2017 24th IEEE International Conference on Image Processing, Beijing, China.","DOI":"10.1109\/ICIP.2017.8297075"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Xu, D., Ouyang, W., Ricci, E., Wang, X., and Sebe, N. (2017, January 21\u201326). Learning Cross-Modal Deep Representations for Robust Pedestrian Detection. Proceedings of the 30th IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA.","DOI":"10.1109\/CVPR.2017.451"},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"012053","DOI":"10.1088\/1742-6596\/1544\/1\/012053","article-title":"Design of lightweight pedestrian detection network in railway scenes","volume":"1544","author":"Chuanyi","year":"2020","journal-title":"J. Phys. Conf. Ser."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Huang, R., Pedoeem, J., and Chen, C. (2018, January 10\u201313). YOLO-LITE: A Real-Time Object Detection Algorithm Optimized for Non-GPU Computers. Proceedings of the 2018 IEEE International Conference on Big Data, Seattle, WA, USA.","DOI":"10.1109\/BigData.2018.8621865"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Dollar, P., Wojek, C., Schiele, B., and Perona, P. (2009;, January 20\u201325). Pedestrian Detection: A Benchmark. Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA.","DOI":"10.1109\/CVPR.2009.5206631"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Luo, P., Tian, Y., Wang, X., and Tang, X. (2014, January 23\u201328). Switchable Deep Network for Pedestrian Detection. Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA.","DOI":"10.1109\/CVPR.2014.120"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Tian, Y., Luo, P., Wang, X., and Tang, X. (2015, January 7\u201313). Deep Learning Strong Parts for Pedestrian Detection. Proceedings of the 2015 IEEE International Conference on Computer Vision, Santiago, Chile.","DOI":"10.1109\/ICCV.2015.221"},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Angelova, A., Krizhevsky, A., and Vanhoucke, V. (2015, January 26\u201330). Pedestrian Detection with a Large-Field-Of-View Deep Network. Proceedings of the 2015 IEEE International Conference on Robotics and Automation, Seattle, WA, USA.","DOI":"10.1109\/ICRA.2015.7139256"},{"key":"ref_31","first-page":"985","article-title":"Scale-Aware Fast R-CNN for Pedestrian Detection","volume":"20","author":"Li","year":"2018","journal-title":"IEEE Trans. Multimed."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Zhang, S., Wen, L., Bian, X., Lei, Z., and Li, S.Z. (2018, January 8\u201314). Occlusion-Aware R-CNN: Detecting Pedestrians in a Crowd. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01219-9_39"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Liu, W., Liao, S., Hu, W., Liang, X., and Chen, X. (2018, January 8\u201314). Learning Efficient Single-Stage Pedestrian Detectors by Asymptotic Localization Fitting. Proceedings of the Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01264-9_38"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Liu, W., Liao, S., Ren, W., Hu, W., Yu, Y., and Soc, I.C. (2019, January 15\u201320). High-level Semantic Feature Detection: A New Perspective for Pedestrian Detection. Proceedings of the Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00533"},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"76243","DOI":"10.1109\/ACCESS.2020.2986476","article-title":"CSANet: Channel and Spatial Mixed Attention CNN for Pedestrian Detection","volume":"8","author":"Zhang","year":"2020","journal-title":"IEEE Access"},{"key":"ref_36","unstructured":"Redmon, J., and Farhadi, A. (2018). Yolov3: An incremental improvement. arXiv."},{"key":"ref_37","unstructured":"Bochkovskiy, A., Wang, C.-Y., and Liao, H.-Y.M. (2020). YOLOv4: Optimal Speed and Accuracy of Object Detection. arXiv."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"3608","DOI":"10.1109\/TCSVT.2018.2883558","article-title":"Multi-Grained Deep Feature Learning for Robust Pedestrian Detection","volume":"29","author":"Lin","year":"2019","journal-title":"IEEE Trans. Circuits Syst. Video Technol."},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Zeng, X., Ouyang, W., and Wang, X. (2013, January 1\u20138). Multi-stage Contextual Deep Learning for Pedestrian Detection. Proceedings of the 2013 IEEE International Conference on Computer Vision, Sydney, NSW, Australia.","DOI":"10.1109\/ICCV.2013.22"},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Paisitkriangkrai, S., Shen, C., and van den Hengel, A. (2014). Strengthening the Effectiveness of Pedestrian Detection with Spatially Pooled Features. European Conference on Computer Vision, Springer.","DOI":"10.1007\/978-3-319-10593-2_36"},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Hosang, J., Omran, M., Benenson, R., and Schiele, B. (2015, January 7\u201312). Taking a deeper look at pedestrians. Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7299034"}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/21\/17\/5851\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T06:56:01Z","timestamp":1760165761000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/21\/17\/5851"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,8,30]]},"references-count":41,"journal-issue":{"issue":"17","published-online":{"date-parts":[[2021,9]]}},"alternative-id":["s21175851"],"URL":"https:\/\/doi.org\/10.3390\/s21175851","relation":{},"ISSN":["1424-8220"],"issn-type":[{"type":"electronic","value":"1424-8220"}],"subject":[],"published":{"date-parts":[[2021,8,30]]}}}