{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,21]],"date-time":"2025-10-21T15:51:16Z","timestamp":1761061876411,"version":"3.41.2"},"reference-count":37,"publisher":"Frontiers Media SA","license":[{"start":{"date-parts":[[2023,10,16]],"date-time":"2023-10-16T00:00:00Z","timestamp":1697414400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["frontiersin.org"],"crossmark-restriction":true},"short-container-title":["Front. Comput. Sci."],"abstract":"<jats:p>Classification of school violence has been proven to be an effective solution for preventing violence within educational institutions. As a result, technical proposals aimed at enhancing the efficacy of violence classification are of considerable interest to researchers. This study explores the utilization of the SORT tracking method for localizing and tracking objects in videos related to school violence, coupled with the application of LSTM and GRU methods to enhance the accuracy of the violence classification model. Furthermore, we introduce the concept of a padding box to localize, identify actions, and recover tracked objects lost during video playback. The integration of these techniques offers a robust and efficient system for analyzing and preventing violence in educational environments. The results demonstrate that object localization and recovery algorithms yield improved violent classification outcomes compared to both the SORT tracking and violence classification algorithms alone, achieving an impressive accuracy rate of 72.13%. These experimental findings hold promise, especially in educational settings, where the assumption of camera stability is justifiable. This distinction is crucial due to the unique characteristics of violence in educational environments, setting it apart from other forms of violence.<\/jats:p>","DOI":"10.3389\/fcomp.2023.1274928","type":"journal-article","created":{"date-parts":[[2023,10,16]],"date-time":"2023-10-16T04:57:37Z","timestamp":1697432257000},"update-policy":"https:\/\/doi.org\/10.3389\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["Violence region localization in video and the school violent actions classification"],"prefix":"10.3389","volume":"5","author":[{"given":"Ngo Duong","family":"Ha","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Nhu Y.","family":"Tran","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Le Nhi Lam","family":"Thuy","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ikuko","family":"Shimizu","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Pham The","family":"Bao","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1965","published-online":{"date-parts":[[2023,10,16]]},"reference":[{"key":"B1","doi-asserted-by":"publisher","first-page":"853","DOI":"10.3844\/jcssp.2012.853.858","article-title":"Video retrieval using histogram and sift combined with graph-based image segmentation","volume":"8","author":"Anh","year":"2012","journal-title":"J. Comp. Sci"},{"key":"B2","first-page":"3464","article-title":"\u201cSimple online and realtime tracking,\u201d","volume-title":"Proceedings of the 2016 IEEE International Conference on Image Processing","author":"Bewley","year":"2016"},{"key":"B3","first-page":"30","article-title":"\u201cHuman violence recognition and detection in surveillance videos,\u201d","volume-title":"Proceedings of the 13th IEEE International Conference on Advanced Video and Signal Based Surveillance","author":"Bilinski","year":"2016"},{"key":"B4","doi-asserted-by":"publisher","first-page":"505","DOI":"10.1007\/s40998-019-00213-7","article-title":"Dilated deep neural network for segmentation of retinal blood vessels in fundus images, Iranian Journal of Science and Technology","volume":"44","author":"Biswas","year":"2020","journal-title":"Trans. Electr. Eng"},{"key":"B5","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/D14-1179","article-title":"Learning phrase representations using rnn encoder-decoder for statistical machine translation","author":"Cho","year":"2014","journal-title":"arXiv"},{"key":"B6","doi-asserted-by":"publisher","first-page":"191","DOI":"10.1016\/j.comnet.2019.01.028","article-title":"Real time violence detection framework for football stadium comprising of big data analysis and deep learning through bidirectional LSTM","volume":"151","author":"Dinesh","year":"2019","journal-title":"Comp. Networks."},{"key":"B7","doi-asserted-by":"publisher","first-page":"14617","DOI":"10.1007\/s11042-016-3316-3","article-title":"Abnormal event detection in crowded scenes based on deep learning","volume":"75","author":"Fang","year":"2016","journal-title":"Multim. Tools Appl."},{"key":"B8","doi-asserted-by":"publisher","first-page":"37","DOI":"10.1016\/j.imavis.2016.01.006","article-title":"Violence detection using oriented violent flows","volume":"48","author":"Gao","year":"2016","journal-title":"Image Vision Comp."},{"key":"B9","doi-asserted-by":"publisher","first-page":"1318","DOI":"10.1109\/TCYB.2013.2265378","article-title":"Enhanced computer vision with microsoft kinect sensor: a review","volume":"43","author":"Han","year":"2013","journal-title":"IEEE Trans. Cybern."},{"key":"B10","doi-asserted-by":"publisher","first-page":"1735","DOI":"10.1162\/neco.1997.9.8.1735","article-title":"Long short-term memory","volume":"9","author":"Hochreiter","year":"1997","journal-title":"Neural Comput."},{"key":"B11","first-page":"565","article-title":"\u201cDetection of violent crowd behavior based on statistical characteristics of the optical flow,\u201d","volume-title":"Proceedings of the 11th International Conference on Fuzzy Systems and Knowledge Discovery","author":"Huang","year":"2014"},{"key":"B12","doi-asserted-by":"crossref","first-page":"35","DOI":"10.1115\/1.3662552","article-title":"A new approach to linear filtering and prediction problems","volume":"82","author":"Kalman","year":"1960","journal-title":"J. Basic Eng."},{"key":"B13","doi-asserted-by":"publisher","first-page":"76270","DOI":"10.1109\/ACCESS.2021.3083273","article-title":"Efficient spatio-temporal modeling methods for real-time violence recognition","volume":"9","author":"Kang","year":"2021","journal-title":"IEEE Access"},{"key":"B14","first-page":"772","article-title":"\u201cAdaptive real-time video-tracking for arbitrary objects,\u201d","volume-title":"Proceedings of the 2010 IEEE\/RSJ International Conference on Intelligent Robots and Systems","author":"Klein","year":"2010"},{"key":"B15","doi-asserted-by":"publisher","first-page":"83","DOI":"10.1002\/nav.3800020109","article-title":"The Hungarian method for the assignment problem","volume":"2","author":"Kuhn","year":"1955","journal-title":"Naval Res. Logist. Quart."},{"key":"B16","doi-asserted-by":"publisher","first-page":"1","DOI":"10.3390\/s16050631","article-title":"Automatic recognition of aggressive behavior in pigs using a kinect depth sensor","volume":"16","author":"Lee","year":"2016","journal-title":"Sensors."},{"key":"B17","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2306.05238","article-title":"SparseTrack: multi-object tracking by performing scene decomposition based on pseudo-depth","author":"Liu","year":"2023","journal-title":"arXiv"},{"key":"B18","doi-asserted-by":"publisher","first-page":"121","DOI":"10.1016\/j.eswa.2019.02.032","article-title":"A classification method based on optical flow for violence detection","volume":"127","author":"Mahmoodi","year":"2019","journal-title":"Expert Syst. With Appl."},{"key":"B19","unstructured":"NaikA. J.\n            GopalakrishnaM. T.\n          Violence detection in surveillance video-a survey. 2016"},{"key":"B20","first-page":"23","article-title":"\u201cHuman behavioral analytics system for video surveillance,\u201d","volume-title":"Proceedings of the 2014 IEEE International Conference on Control System","author":"Pang","year":"2014"},{"key":"B21","doi-asserted-by":"publisher","first-page":"107560","DOI":"10.1109\/ACCESS.2019.2932114","article-title":"A. Mahmood, A review on state-of-the-art violence detection techniques","volume":"7","author":"Ramzan","year":"2019","journal-title":"IEEE Access."},{"key":"B22","doi-asserted-by":"publisher","first-page":"1137","DOI":"10.1109\/TPAMI.2016.2577031","article-title":"\u201cFaster R-CNN: towards real-time object detection with region proposal networks,\u201d","volume":"39","author":"Ren","year":"2017","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"B23","doi-asserted-by":"publisher","first-page":"915","DOI":"10.3390\/app10144915","article-title":"Incremental dilations using CNN for brain tumor classification","volume":"10","author":"Roy","year":"2020","journal-title":"Appl. Sci."},{"key":"B24","doi-asserted-by":"publisher","first-page":"503","DOI":"10.14569\/IJACSA.2020.0111163","article-title":"Moment features based violence action detection using optical flow","volume":"11","author":"Saif","year":"2020","journal-title":"Int. J. Adv. Comp. Sci. Appl."},{"key":"B25","first-page":"163","article-title":"\u201cMulti-Person tracking in smart surveillance system for crowd counting and normal\/abnormal events detection,\u201d","volume-title":"Proceedings of the 2019 International Conference on Applied and Engineering Mathematics","author":"Shehzed","year":"2019"},{"key":"B26","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.1409.1556","author":"Simonyan","year":"2014","journal-title":"Very deep convolutional networks for large-scale image recognition"},{"key":"B27","doi-asserted-by":"crossref","first-page":"224","DOI":"10.1109\/SIBGRAPI.2010.38","article-title":"\u201cViolence detection in video using spatio-temporal features,\u201d","volume-title":"Proceedings of the 23rd SIBGRAPI Conference on Graphics, Patterns and Images","author":"Souza","year":"2010"},{"key":"B28","first-page":"1","article-title":"\u201cLearning to detect violent videos using convolutional long short-term memory,\u201d","volume-title":"Proceedings of the 14th IEEE International Conference on Advanced Video and Signal Based Surveillance","author":"Sudhakaran","year":"2017"},{"key":"B29","doi-asserted-by":"publisher","first-page":"377","DOI":"10.1109\/TII.2021.3116377","article-title":"AI assisted edge vision for violence detection in IoT based industrial surveillance networks","volume":"18","author":"Ullah","year":"2021","journal-title":"IEEE Trans. Indust. Inform"},{"key":"B30","first-page":"20","article-title":"\u201cTemporal segment networks: towards good practices for deep action recognition,\u201d","author":"Wang","year":"2016"},{"key":"B31","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1142\/S0218001416550077","article-title":"Hybrid histogram of oriented optical flow for abnormal behavior detection in crowd scene","volume":"30","author":"Wang","year":"2016","journal-title":"Int. J. Pattern Recog. Artif. Intellig."},{"key":"B32","doi-asserted-by":"crossref","first-page":"1145","DOI":"10.1109\/KAMW.2008.4810698","article-title":"\u201cCurrent status, causes and intervention strategies of soccer violence in chinese professional football league,\u201d","volume-title":"Proceedings of the 2008 IEEE International Symposium on Knowledge Acquisition and Modeling Workshop","author":"Wen","year":"2008"},{"key":"B33","doi-asserted-by":"publisher","first-page":"628","DOI":"10.3390\/rs13040628","article-title":"Campus violence detection based on artificial intelligent interpretation of surveillance video sequences","volume":"13","author":"Ye","year":"2021","journal-title":"Remote Sensing."},{"key":"B34","doi-asserted-by":"publisher","first-page":"7327","DOI":"10.1007\/s11042-015-2648-8","article-title":"A new method for violence detection in surveillance scenes","volume":"75","author":"Zhang","year":"2015","journal-title":"Multimedia Tools Appl."},{"key":"B35","first-page":"880","article-title":"\u201cEnd-to-end video violence detection with transformer,\u201d","author":"Zhou","year":"2022","journal-title":"Proceedings of the 5th International Conference on Pattern Recognition and Artificial Intelligence (IEEE)"},{"key":"B36","doi-asserted-by":"crossref","DOI":"10.1088\/1742-6596\/844\/1\/012044","article-title":"\u201cViolent interaction detection in video based on deep learning,\u201d","volume-title":"Proceedings of the 6th Conference on Advances in Optoelectronics and Micro\/nano-optics","author":"Zhou","year":"2017"},{"key":"B37","unstructured":"ZhuS.\n            CholletF.\n          Keras2015"}],"container-title":["Frontiers in Computer Science"],"original-title":[],"link":[{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fcomp.2023.1274928\/full","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,10,16]],"date-time":"2023-10-16T04:57:52Z","timestamp":1697432272000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fcomp.2023.1274928\/full"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,10,16]]},"references-count":37,"alternative-id":["10.3389\/fcomp.2023.1274928"],"URL":"https:\/\/doi.org\/10.3389\/fcomp.2023.1274928","relation":{},"ISSN":["2624-9898"],"issn-type":[{"type":"electronic","value":"2624-9898"}],"subject":[],"published":{"date-parts":[[2023,10,16]]},"article-number":"1274928"}}