{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,24]],"date-time":"2026-01-24T16:59:38Z","timestamp":1769273978999,"version":"3.49.0"},"reference-count":23,"publisher":"MDPI AG","issue":"15","license":[{"start":{"date-parts":[[2022,8,5]],"date-time":"2022-08-05T00:00:00Z","timestamp":1659657600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Open Foundation of Key Laboratory of Power Machinery and Engineering, Ministry of Education of China","award":["2955232"],"award-info":[{"award-number":["2955232"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>Risky driving behavior seriously affects the driver\u2019s ability to react, execute and judge, which is one of the major causes of traffic accidents. The timely and accurate identification of the driving status of drivers is particularly important, since drivers can quickly adjust their driving status to avoid safety accidents. In order to further improve the identification accuracy, this paper proposes a risky-driving image-recognition system based on the visual attention mechanism and deep-learning technology to identify four types of driving status images including normal driving, driving while smoking, driving while drinking and driving while talking. With reference to ResNet, we build four deep-learning models with different depths and embed the proposed visual attention blocks into the image-classification model. The experimental results indicate that the classification accuracy of the ResNet models with lower depth can exceed the ResNet models with higher depth by embedding the visual attention modules, while there is no significant change in model complexity, which could improve the model recognition accuracy without reducing the recognition efficiency.<\/jats:p>","DOI":"10.3390\/s22155868","type":"journal-article","created":{"date-parts":[[2022,8,9]],"date-time":"2022-08-09T04:16:55Z","timestamp":1660018615000},"page":"5868","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":7,"title":["Risky-Driving-Image Recognition Based on Visual Attention Mechanism and Deep Learning"],"prefix":"10.3390","volume":"22","author":[{"given":"Wei","family":"Song","sequence":"first","affiliation":[{"name":"School of Automobile and Traffic Engineering, Wuhan University of Science and Technology, Wuhan 430081, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Guangde","family":"Zhang","sequence":"additional","affiliation":[{"name":"School of Automobile and Traffic Engineering, Wuhan University of Science and Technology, Wuhan 430081, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2022,8,5]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Khan, M.Q., and Lee, S. (2019). A comprehensive survey of driving monitoring and assistance systems. Sensors, 19.","DOI":"10.3390\/s19112574"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"577","DOI":"10.1109\/JAS.2017.7510598","article-title":"Parallel driving in CPSS: A unified approach for transport automation and vehicle intelligence","volume":"4","author":"Wang","year":"2017","journal-title":"IEEE\/CAA J. Autom. Sin."},{"key":"ref_3","first-page":"170","article-title":"Detecting distracted driving with deep learning","volume":"Volume 10459","author":"Okon","year":"2017","journal-title":"ICR 2017: Interactive Collaborative Robotics"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"2014","DOI":"10.1109\/TITS.2015.2396031","article-title":"Driver Gaze Tracking and Eyes off the Road Detection System","volume":"16","author":"Vicente","year":"2015","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"ref_5","first-page":"3583","article-title":"A framework of vehicle trajectory replanning in lane exchanging with considerations of driver characteristics","volume":"66","author":"Wang","year":"2017","journal-title":"IEEE Trans. Veh. Technol."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"10","DOI":"10.1016\/j.media.2019.02.010","article-title":"Medical image classification using synergic deep learning","volume":"54","author":"Zhang","year":"2019","journal-title":"Med. Image Anal."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"259","DOI":"10.1108\/RPJ-03-2019-0066","article-title":"Quality assurance in metal powder bed fusion via deep-learning-based image classification","volume":"26","author":"Kunkel","year":"2020","journal-title":"Rapid Prototyp. J."},{"key":"ref_8","first-page":"51","article-title":"Deep Learning for Remote Sensing Image Classification for Agriculture Applications","volume":"44","author":"Gebrehiwot","year":"2020","journal-title":"ISPRS-Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"617","DOI":"10.1007\/s11760-019-01589-z","article-title":"Distracted driver classification using deep learning","volume":"14","author":"Alotaibi","year":"2020","journal-title":"Signal Image Video Process."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"106657","DOI":"10.1016\/j.asoc.2020.106657","article-title":"Distracted driver detection by combining in-vehicle and image data using deep learning","volume":"96","author":"Omerustaoglu","year":"2020","journal-title":"Appl. Soft Comput. J."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"5379","DOI":"10.1109\/TVT.2019.2908425","article-title":"Driver activity recognition for intelligent vehicles: A deep learning approach","volume":"68","author":"Xing","year":"2019","journal-title":"IEEE Trans. Veh. Technol."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"1410","DOI":"10.1049\/iet-its.2018.5392","article-title":"Fatigue driving recognition network: Fatigue driving recognition via convolutional neural network and long short-term memory units","volume":"13","author":"Xiao","year":"2019","journal-title":"IET Intell. Transp. Syst."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"1401","DOI":"10.1049\/iet-its.2018.5590","article-title":"Fatigue driving detection model based on multi-feature fusion and semi-supervised active learning","volume":"13","author":"Li","year":"2019","journal-title":"IET Intell. Transp. Syst."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Du, Y., Black, A.W., Morency, L.P., and Eskenazi, M. (2018). Multimodal polynomial fusion for detecting driver distraction. arXiv.","DOI":"10.21437\/Interspeech.2018-2011"},{"key":"ref_15","first-page":"173","article-title":"A Multi-Modal Driver Fatigue and Distraction Assessment System","volume":"14","author":"Craye","year":"2016","journal-title":"Int. J. Intell. Transp. Syst. Res."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"51","DOI":"10.1109\/TITS.2014.2324414","article-title":"Predicting perceived visual and cognitive distractions of drivers with multimodal features","volume":"16","author":"Li","year":"2015","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"436","DOI":"10.1038\/nature14539","article-title":"Deep learning","volume":"521","author":"Lecun","year":"2015","journal-title":"Nature"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep residual learning for image recognition. Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"2011","DOI":"10.1109\/TPAMI.2019.2913372","article-title":"Squeeze-and-Excitation Networks","volume":"42","author":"Hu","year":"2017","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Woo, S., Park, J., Lee, J.Y., and Kweon, I.S. (2018, January 8\u201314). CBAM: Convolutional block attention module. Proceedings of the European Conference on Computer Vision, Munich, Germany.","DOI":"10.1007\/978-3-030-01234-2_1"},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"283","DOI":"10.1016\/S0001-2998(78)80014-2","article-title":"Basic principles of ROC analysis","volume":"8","author":"Metz","year":"1978","journal-title":"Semin. Nucl. Med."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"861","DOI":"10.1016\/j.patrec.2005.10.010","article-title":"An introduction to ROC analysis","volume":"27","author":"Fawcett","year":"2006","journal-title":"Pattern Recognit. Lett."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"336","DOI":"10.1007\/s11263-019-01228-7","article-title":"Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization","volume":"128","author":"Selvaraju","year":"2020","journal-title":"Int. J. Comput. Vis."}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/22\/15\/5868\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T00:04:44Z","timestamp":1760141084000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/22\/15\/5868"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,8,5]]},"references-count":23,"journal-issue":{"issue":"15","published-online":{"date-parts":[[2022,8]]}},"alternative-id":["s22155868"],"URL":"https:\/\/doi.org\/10.3390\/s22155868","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,8,5]]}}}