{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,17]],"date-time":"2026-06-17T06:01:13Z","timestamp":1781676073085,"version":"3.54.5"},"reference-count":45,"publisher":"Wiley","issue":"1","license":[{"start":{"date-parts":[[2021,11,8]],"date-time":"2021-11-08T00:00:00Z","timestamp":1636329600000},"content-version":"vor","delay-in-days":311,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["51874300"],"award-info":[{"award-number":["51874300"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["onlinelibrary.wiley.com"],"crossmark-restriction":true},"short-container-title":["Computational Intelligence and Neuroscience"],"published-print":{"date-parts":[[2021,1]]},"abstract":"<jats:p>Frequent occurrence and long\u2010term existence of respiratory diseases such as COVID\u201019 and influenza require bus drivers to wear masks correctly during driving. To quickly detect whether the mask is worn correctly on resource\u2010constrained devices, a lightweight target detection network SAI\u2010YOLO is proposed. Based on YOLOv4\u2010Tiny, the network incorporates the Inception V3 structure, replaces two CSPBlock modules with the RES\u2010SEBlock modules to reduce the number of parameters and computational difficulty, and adds a convolutional block attention module and a squeeze\u2010and\u2010excitation module to extract key feature information. Moreover, a modified ReLU (M\u2010ReLU) activation function is introduced to replace the original Leaky_ReLU function. The experimental results show that SAI\u2010YOLO reduces the number of network parameters and calculation difficulty and improves the detection speed of the network while maintaining certain recognition accuracy. The mean average precision (mAP) for face\u2010mask\u2010wearing detection reaches 86% and the average precision (AP) for mask\u2010wearing normative detection reaches 88%. In the resource\u2010constrained device Raspberry Pi 4B, the average detection time after acceleration is 197\u2009ms, which meets the actual application requirements.<\/jats:p>","DOI":"10.1155\/2021\/4529107","type":"journal-article","created":{"date-parts":[[2021,11,8]],"date-time":"2021-11-08T23:50:13Z","timestamp":1636415413000},"update-policy":"https:\/\/doi.org\/10.1002\/crossmark_policy","source":"Crossref","is-referenced-by-count":21,"title":["SAI\u2010YOLO: A Lightweight Network for Real\u2010Time Detection of Driver Mask\u2010Wearing Specification on Resource\u2010Constrained Devices"],"prefix":"10.1155","volume":"2021","author":[{"given":"Zuopeng","family":"Zhao","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3566-1573","authenticated-orcid":false,"given":"Kai","family":"Hao","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Xiaoping","family":"Ma","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4517-4413","authenticated-orcid":false,"given":"Xiaofeng","family":"Liu","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2968-7999","authenticated-orcid":false,"given":"Tianci","family":"Zheng","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4892-6390","authenticated-orcid":false,"given":"Junjie","family":"Xu","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-2921-4331","authenticated-orcid":false,"given":"Shuya","family":"Cui","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"311","published-online":{"date-parts":[[2021,11,8]]},"reference":[{"key":"e_1_2_11_1_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.scs.2020.102568"},{"key":"e_1_2_11_2_2","doi-asserted-by":"crossref","unstructured":"ShorfuzzamanM. HossainM. S. andAlhamidM. F. Towards the sustainable development of smart cities through mass video surveillance: a response to the COVID-19 pandemic Sustainable Cities and Society 64 no. 2021 102582 https:\/\/doi.org\/10.1016\/j.scs.2020.102582.","DOI":"10.1016\/j.scs.2020.102582"},{"key":"e_1_2_11_3_2","doi-asserted-by":"publisher","DOI":"10.1186\/s40001-020-00430-5"},{"key":"e_1_2_11_4_2","doi-asserted-by":"publisher","DOI":"10.1155\/2020\/6616584"},{"key":"e_1_2_11_5_2","doi-asserted-by":"publisher","DOI":"10.1016\/s0896-6273(00)80984-8"},{"key":"e_1_2_11_6_2","doi-asserted-by":"publisher","DOI":"10.1145\/3065386"},{"key":"e_1_2_11_7_2","doi-asserted-by":"crossref","unstructured":"CaiZ. VasconcelosN. andCascadeR.-C. N. N. Delving into high quality object detection Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition 2018 Salt Lake City UT USA 6154\u20136162.","DOI":"10.1109\/CVPR.2018.00644"},{"key":"e_1_2_11_8_2","doi-asserted-by":"crossref","unstructured":"GirshickR. Fast R-CNN Proceedings of the IEEE International Conference on Computer Vision December 2015 Santiago Chile 1440\u20131448 https:\/\/doi.org\/10.1109\/iccv.2015.169 2-s2.0-84964588182.","DOI":"10.1109\/ICCV.2015.169"},{"key":"e_1_2_11_9_2","unstructured":"RenS. HeK. GirshickR. andSunJ. Faster R-CNN: towards real-time object detection with region proposal networks Proceedings of the Advances in Neural Information Processing Systems December 2015 Montreal Canada MIT Press 91\u201399."},{"key":"e_1_2_11_10_2","doi-asserted-by":"crossref","unstructured":"RedmonJ. DivvalaS. GirshickR. andFarhadiA. You only look once: unified real-time object detection Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition June 2016 NV USA IEEE Computer Society Las Vegas 779\u2013788 https:\/\/doi.org\/10.1109\/cvpr.2016.91 2-s2.0-84986308404.","DOI":"10.1109\/CVPR.2016.91"},{"key":"e_1_2_11_11_2","doi-asserted-by":"crossref","unstructured":"LiuW. AnguelovD. ErhanD. SzegedyC. ReedS. FuC.-Y. andBergA. C. SSD: single shot multibox detector Proceedings of the European Conference on Computer Vision October 2016 Cham Switzerland Springer 21\u201337 https:\/\/doi.org\/10.1007\/978-3-319-46448-0_2 2-s2.0-84990068627.","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"e_1_2_11_12_2","doi-asserted-by":"crossref","unstructured":"RedmonJ.andFarhadiA. YOLO9000: better faster stronger Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition July 2017 Honolulu HI USA 7263\u20137271 https:\/\/doi.org\/10.1109\/cvpr.2017.690 2-s2.0-85041900441.","DOI":"10.1109\/CVPR.2017.690"},{"key":"e_1_2_11_13_2","unstructured":"RedmonJ.andFarhadiA. YOLOv3: an incremental improvement 2018 https:\/\/arxiv.org\/abs\/1804.02767."},{"key":"e_1_2_11_14_2","unstructured":"BochkovskiyA. WangC.-Y. andLiaoH. YOLOv4: Optimal Speed and Accuracy of Object Detection 2020 https:\/\/arxiv.org\/abs\/2004.10934."},{"key":"e_1_2_11_15_2","unstructured":"HowardA. G. ZhuM. ChenB. KalenichenkoD. WangW. WeyandT. AndreettoM. andAdamH. Mobilenets: efcient convolutional neural networks for mobile vision applications 2017 https:\/\/arxiv.org\/abs\/1704.04861."},{"key":"e_1_2_11_16_2","unstructured":"IandolaF. N. HanS. andMoskewiczM. W. Squeezenet: AlexNet-Level accuracy with 50x fewer parameters and <0.5 MB model size 2017 https:\/\/arxiv.org\/abs\/1602.07360."},{"key":"e_1_2_11_17_2","doi-asserted-by":"crossref","unstructured":"ZhangX. ZhouX. LinM. andSunJ. Shufflenet: an extremely efficient convolutional neural network for mobile devices Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition 2018 Salt Lake City UT USA 6848\u20136856 https:\/\/doi.org\/10.1109\/cvpr.2018.00716 2-s2.0-85061081999.","DOI":"10.1109\/CVPR.2018.00716"},{"key":"e_1_2_11_18_2","unstructured":"chuanqi305\/MobileNet-SSD 2018 http:\/\/github.com\/chuanqi305\/MobileNet-SSD."},{"key":"e_1_2_11_19_2","doi-asserted-by":"crossref","unstructured":"EjazM. S. IslamM. SifatullahM. andSarkerA. Implementation of principal component analysis on masked and non-masked face recognition Proceedings of the 2019 1st International Conference on Advances in Science Engineering and Robotics Technology (ICASERT) 2019 Dhaka Bangladesh 1\u20135 https:\/\/doi.org\/10.1109\/icasert.2019.8934543.","DOI":"10.1109\/ICASERT.2019.8934543"},{"key":"e_1_2_11_20_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.measurement.2020.108288"},{"key":"e_1_2_11_21_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.scs.2020.102600"},{"key":"e_1_2_11_22_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-19390-8_16"},{"key":"e_1_2_11_23_2","volume-title":"MaskNet: Detecting Different Kinds of Face Mask for Indian Ethnicity","author":"Gola A.","year":"2021"},{"key":"e_1_2_11_24_2","doi-asserted-by":"publisher","DOI":"10.17632\/xz5hbd6zds.2"},{"key":"e_1_2_11_25_2","doi-asserted-by":"publisher","DOI":"10.3390\/s20185236"},{"key":"e_1_2_11_26_2","doi-asserted-by":"crossref","unstructured":"WangC.-Y. LiaoH. YehI.-H. WuY.-H. ChenP.-Y. andHsiehJ.-W. CSPNet: a new backbone that can enhance learning capability of CNN Proceedings of the 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) June 2020 Seattle WA USA 1571\u20131580 https:\/\/doi.org\/10.1109\/cvprw50498.2020.00203.","DOI":"10.1109\/CVPRW50498.2020.00203"},{"key":"e_1_2_11_27_2","unstructured":"LinT. DollarP. andGirshickR. Feature pyramid networks for object detection 2016 https:\/\/arxiv.org\/abs\/1612.03144."},{"key":"e_1_2_11_28_2","doi-asserted-by":"crossref","unstructured":"LiuS. LuQ. QinH. ShiJ. andJiaJ. Path aggregation network for instance segmentation Proceedings of the 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition June 2018 Salt Lake City UT USA 8759\u20138768 https:\/\/doi.org\/10.1109\/cvpr.2018.00913 2-s2.0-85060854014.","DOI":"10.1109\/CVPR.2018.00913"},{"key":"e_1_2_11_29_2","doi-asserted-by":"crossref","unstructured":"HuJ. ShenL. andSunG. Squeeze-and-excitation networks Proceedings of the 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition June 2018 Salt Lake City UT USA 7132\u20137141.","DOI":"10.1109\/CVPR.2018.00745"},{"key":"e_1_2_11_30_2","doi-asserted-by":"crossref","unstructured":"WangF. JiangM. QianC. YangS. ChengLi ZhangH. WangX. andTangX. Residual attention network for image classification Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) July 2017 Honolulu HI USA 6450\u20136458 https:\/\/doi.org\/10.1109\/cvpr.2017.683 2-s2.0-85044522649.","DOI":"10.1109\/CVPR.2017.683"},{"key":"e_1_2_11_31_2","doi-asserted-by":"publisher","DOI":"10.1038\/35058500"},{"key":"e_1_2_11_32_2","unstructured":"JaderbergM. SimonyanK. ZissermanA. andKavukcuogluK. Spatial transformer networks 2015 https:\/\/arxiv.org\/abs\/1506.02025."},{"key":"e_1_2_11_33_2","doi-asserted-by":"crossref","unstructured":"FuJ. LiuJ. TianH. FangZ. andLuH. Dual attention network for scene segmentation Proceedings of the 2019 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR) June 2019 Long Beach CA USA 3141\u20133149 https:\/\/doi.org\/10.1109\/cvpr.2019.00326.","DOI":"10.1109\/CVPR.2019.00326"},{"key":"e_1_2_11_34_2","unstructured":"WooS. ParkJ. LeeJ.-Y. andKweonI.-S. CBAM: Convolutional Block Attention Module 2018 https:\/\/arxiv.org\/abs\/1807.06521."},{"key":"e_1_2_11_35_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.imavis.2019.10.005"},{"key":"e_1_2_11_36_2","doi-asserted-by":"crossref","unstructured":"SzegedyC. IoffeS. VanhouckeV. andAlemiA. A. Inception-V4 inception-ResNet and the impact of residual connections on learning 2017 https:\/\/arxiv.org\/abs\/1602.07261.","DOI":"10.1609\/aaai.v31i1.11231"},{"key":"e_1_2_11_37_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.suscom.2020.100473"},{"key":"e_1_2_11_38_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-009-0275-4"},{"key":"e_1_2_11_39_2","doi-asserted-by":"crossref","unstructured":"GeS. JiaL. YeQ. andZhaoL. Detecting masked faces in the wild with LLE-CNNs Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition July 2017 Honolulu HI USA CVPR) 426\u2013434.","DOI":"10.1109\/CVPR.2017.53"},{"key":"e_1_2_11_40_2","unstructured":"LabelImg 2016 https:\/\/github.com\/tzutalin\/labelImg."},{"key":"e_1_2_11_41_2","unstructured":"BernardinK. AlexanderE. andStiefelhagenR. Multiple object tracking performance metrics and evaluation in a smart room environment 2006 https:\/\/cvhci.anthropomatik.kit.edu\/%7Estiefel\/papers\/ECCV2006WorkshopCameraReady.pdf."},{"key":"e_1_2_11_42_2","doi-asserted-by":"publisher","DOI":"10.1155\/2019\/7401235"},{"key":"e_1_2_11_43_2","doi-asserted-by":"publisher","DOI":"10.1155\/2020\/9606908"},{"key":"e_1_2_11_44_2","doi-asserted-by":"publisher","DOI":"10.1155\/2020\/7251280"},{"key":"e_1_2_11_45_2","unstructured":"Ncnn 2017 https:\/\/github.com\/Tencent\/ncnn."}],"container-title":["Computational Intelligence and Neuroscience"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/downloads.hindawi.com\/journals\/cin\/2021\/4529107.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/downloads.hindawi.com\/journals\/cin\/2021\/4529107.xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/pdf\/10.1155\/2021\/4529107","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,8,6]],"date-time":"2024-08-06T11:55:38Z","timestamp":1722945338000},"score":1,"resource":{"primary":{"URL":"https:\/\/onlinelibrary.wiley.com\/doi\/10.1155\/2021\/4529107"}},"subtitle":[],"editor":[{"given":"Gaurav","family":"Singal","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"editor"}]}],"short-title":[],"issued":{"date-parts":[[2021,1]]},"references-count":45,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2021,1]]}},"alternative-id":["10.1155\/2021\/4529107"],"URL":"https:\/\/doi.org\/10.1155\/2021\/4529107","archive":["Portico"],"relation":{},"ISSN":["1687-5265","1687-5273"],"issn-type":[{"value":"1687-5265","type":"print"},{"value":"1687-5273","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,1]]},"assertion":[{"value":"2021-07-31","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-10-25","order":2,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-11-08","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}],"article-number":"4529107"}}