{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,20]],"date-time":"2025-10-20T10:24:28Z","timestamp":1760955868411,"version":"build-2065373602"},"reference-count":50,"publisher":"MDPI AG","issue":"6","license":[{"start":{"date-parts":[[2018,6,8]],"date-time":"2018-06-08T00:00:00Z","timestamp":1528416000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["J. Imaging"],"abstract":"<jats:p>The automatic detection and recognition of anomalous events in crowded and complex scenes on video are the research objectives of this paper. The main challenge in this system is to create models for detecting such events due to their changeability and the territory of the context of the scenes. Due to these challenges, this paper proposed a novel HOME FAST (Histogram of Orientation, Magnitude, and Entropy with Fast Accelerated Segment Test) spatiotemporal feature extraction approach based on optical flow information to capture anomalies. This descriptor performs the video analysis within the smart surveillance domain and detects anomalies. In deep learning, the training step learns all the normal patterns from the high-level and low-level information. The events are described in testing and, if they differ from the normal pattern, are considered as anomalous. The overall proposed system robustly identifies both local and global abnormal events from complex scenes and solves the problem of detection under various transformations with respect to the state-of-the-art approaches. The performance assessment of the simulation outcome validated that the projected model could handle different anomalous events in a crowded scene and automatically recognize anomalous events with success.<\/jats:p>","DOI":"10.3390\/jimaging4060079","type":"journal-article","created":{"date-parts":[[2018,6,8]],"date-time":"2018-06-08T11:19:31Z","timestamp":1528456771000},"page":"79","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":20,"title":["Deep Learning with a Spatiotemporal Descriptor of Appearance and Motion Estimation for Video Anomaly Detection"],"prefix":"10.3390","volume":"4","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-6554-8139","authenticated-orcid":false,"given":"Kishanprasad G.","family":"Gunale","sequence":"first","affiliation":[{"name":"Department of E&amp;TC, Sinhgad College of Engineering, Vadgaon, S.P.P.U., Pune 411 041, India"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Prachi","family":"Mukherji","sequence":"additional","affiliation":[{"name":"Department of E&amp;TC, Cummins College of Engineering for Women, Karve Nagar, S.P.P.U., Pune 411 052, India"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2018,6,8]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"1257","DOI":"10.1109\/TSMCC.2012.2215319","article-title":"A Review of Abnormality Detection in Automated Surveillance","volume":"42","author":"Sodeman","year":"2012","journal-title":"IEEE Trans. Syst. Man Cybern."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"7252","DOI":"10.1109\/JSEN.2015.2472960","article-title":"Detection of Anomalous Crowd Behaviour Based on the Acceleration Feature","volume":"15","author":"Chen","year":"2015","journal-title":"IEEE Sens. J."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"2153","DOI":"10.1109\/TIP.2015.2409559","article-title":"Swarm Intelligence for Detecting Interesting Events in Crowded Environ","volume":"24","author":"Kaltsa","year":"2015","journal-title":"IEEE Trans. Image Process"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"2431","DOI":"10.1109\/JSEN.2014.2381260","article-title":"Crowd Escape Behaviour Detection and Localization Based on Divergent Centers","volume":"15","author":"Chen","year":"2015","journal-title":"IEEE Sens. J."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"2129","DOI":"10.1109\/JSEN.2013.2245889","article-title":"Visual-Based Human Crowds Behaviour Analysis Based on Graph Modelling and Matching","volume":"13","author":"Chen","year":"2013","journal-title":"IEEE Sens. J."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"32","DOI":"10.1109\/MIS.2010.38","article-title":"Surveillance-Oriented Event Detection in Video Streams","volume":"26","author":"Piciarelli","year":"2011","journal-title":"IEEE Intell. Syst."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"935","DOI":"10.1049\/el.2009.1000","article-title":"Detecting irregular camera events in time-multiplexed videos","volume":"45","author":"Utasi","year":"2009","journal-title":"Electron. Lett."},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"810","DOI":"10.1049\/iet-its.2014.0238","article-title":"Trajectory-based anomalous behaviour detection for intelligent traffic surveillance","volume":"9","author":"Cai","year":"2015","journal-title":"IET Intell. Transp. Syst."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"18","DOI":"10.1109\/TPAMI.2013.111","article-title":"Anomaly Detection and Localization in Crowded Scenes","volume":"36","author":"Li","year":"2014","journal-title":"IEEE Trans. Patterns Anal. Mach. Intell."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"1590","DOI":"10.1109\/TIFS.2013.2272243","article-title":"Video Anomaly Search in crowded scenes via Spatio-temporal Motion Context","volume":"8","author":"Cong","year":"2013","journal-title":"IEEE Trans. Inf. Forensics Secur."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"309","DOI":"10.1016\/j.neucom.2014.12.064","article-title":"Spatio-temporal context analysis within video volumes for anomalies-event detection and localization","volume":"155","author":"Li","year":"2015","journal-title":"Neurocomputing"},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"11099","DOI":"10.1007\/s11042-014-2219-4","article-title":"Anomaly detection in compressed, H. 264\/AVC video","volume":"74","author":"Biswas","year":"2015","journal-title":"Multimedia Tools Appl."},{"key":"ref_13","first-page":"1","article-title":"Statistical Hypothesis Detector for Anomalous Detection in Crowded Scenes","volume":"99","author":"Yuan","year":"2016","journal-title":"IEEE Trans. Cybern."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"265","DOI":"10.1016\/j.patcog.2017.01.001","article-title":"Graph formulation of video activities for abnormal event recognition","volume":"65","author":"Singh","year":"2017","journal-title":"Pattern Recognit."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"302","DOI":"10.1016\/j.patcog.2015.11.018","article-title":"Video anomaly detection based on locality sensitive hashing filters","volume":"59","author":"Zhang","year":"2016","journal-title":"Pattern Recognit."},{"key":"ref_16","first-page":"358","article-title":"Spatial-temporal convolution neural networks for anomaly detection and localization in crowded scenes","volume":"47","author":"Zhou","year":"2016","journal-title":"Image Commun."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"683","DOI":"10.1109\/TCSVT.2016.2589859","article-title":"Toward Abnormal Trajectory and Event Detection in video Surveillance","volume":"27","author":"Cosar","year":"2017","journal-title":"IEEE Trans. Circuits Syst. Video Technol."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"187","DOI":"10.1016\/j.patcog.2016.09.016","article-title":"Online growing neural gas for anomaly detect in changing surveillance scene","volume":"64","author":"Sun","year":"2017","journal-title":"Pattern Recognit."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"15101","DOI":"10.1007\/s11042-015-2453-4","article-title":"An efficient subsequence search for video anomaly detection and localization","volume":"75","author":"Cheng","year":"2016","journal-title":"Multimedia Tools Appl."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"166","DOI":"10.1016\/j.cviu.2015.09.010","article-title":"Online real-time crowd behaviour detection in video sequences","volume":"144","author":"Pennisi","year":"2016","journal-title":"Comput. Vis. Image Underst."},{"key":"ref_21","doi-asserted-by":"crossref","first-page":"1419","DOI":"10.1007\/s11042-015-3133-0","article-title":"MOWLD: A robust motion image descriptor for violence detection","volume":"76","author":"Zhang","year":"2017","journal-title":"Multimedia Tools Appl."},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"320","DOI":"10.1016\/j.cviu.2011.09.009","article-title":"Multi-scale and real-time non-parametric approach for anomaly detection and localization","volume":"116","author":"Bertini","year":"2012","journal-title":"Comput. Vis. Image Underst."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"265","DOI":"10.1007\/s00371-015-1192-x","article-title":"A Visual-Numeric Approach to clustering and Anomaly Detection for Trajectory Data","volume":"33","author":"Kumar","year":"2017","journal-title":"Vis. Comput."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"3463","DOI":"10.1109\/TIP.2017.2695105","article-title":"Video Anomaly Detection with Compact Feature Sets for Online Performance","volume":"26","author":"Leyva","year":"2017","journal-title":"IEEE Trans. Image Process."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"443","DOI":"10.1016\/j.patcog.2015.09.005","article-title":"Combining motion and appearance cues for anomaly detection","volume":"51","author":"Zhang","year":"2016","journal-title":"Pattern Recognit."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"23213","DOI":"10.1007\/s11042-016-4100-0","article-title":"Abnormal event detection and localization in crowded scenes based on PCANet","volume":"76","author":"Bao","year":"2017","journal-title":"Multimedia Tools Appl."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"144","DOI":"10.1016\/j.neucom.2014.06.011","article-title":"Video anomaly detection based on a hierarchical activity discovery within spatio-temporal contexts","volume":"143","author":"Xu","year":"2014","journal-title":"Neurocomputing"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"548","DOI":"10.1016\/j.neucom.2016.09.063","article-title":"Learning deep event models for crowd anomaly detection","volume":"219","author":"Feng","year":"2017","journal-title":"Neurocomputing"},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"13","DOI":"10.1016\/j.patrec.2017.07.016","article-title":"A study of deep convolutional auto-encoders for anomaly detection in videos","volume":"105","author":"Ribeiro","year":"2018","journal-title":"Pattern Recognit. Lett."},{"key":"ref_30","doi-asserted-by":"crossref","unstructured":"Xu, D., Ricci, E., and Yan, Y. (arXiv, 2015). Learning deep representation of appearance and motion for anomalous event detection, arXiv.","DOI":"10.5244\/C.29.8"},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Caetano, C.A., De Melo, V.H., dos Santos, J.A., and Schwartz, W.R. (2017, January 17\u201320). Activity Recognition based on a Magnitude-Orientation Stream Network. Proceedings of the 2017 30th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI), Niteroi, Brazil.","DOI":"10.1109\/SIBGRAPI.2017.13"},{"key":"ref_32","first-page":"1","article-title":"Pyramidal implementation of the affine lucaskanade feature tracker description of the algorithm","volume":"5","author":"Bouguet","year":"2001","journal-title":"Intel Corp."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"2543","DOI":"10.1016\/j.patcog.2011.11.023","article-title":"An entropy approach for abnormal activities detection in video streams","volume":"45","author":"Sharif","year":"2012","journal-title":"Pattern Recognit."},{"key":"ref_34","unstructured":"Statistical Visual Computing Lab (2013, February 26). UCSD Anomaly Data Set. Available online: http:\/\/www.svcl.ucsd.edu\/projects\/anomaly\/."},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"Lu, C., Shi, J., and Jia, J. (2013, January 1\u20138). Abnormal event detection at 150 FPS in MATLAB. Proceedings of the 2013 IEEE International Conference on Computer Vision (ICCV), Sydney, Australia.","DOI":"10.1109\/ICCV.2013.338"},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Leyva, R., Sanchez, V., and Li, C.T. (2017, January 4\u20135). The LV dataset: A realistic surveillance video dataset for abnormal event detection. Proceedings of the 2017 5th International Workshop on Biometrics and Forensics (IWBF), Coventry, UK.","DOI":"10.1109\/IWBF.2017.7935096"},{"key":"ref_37","unstructured":"Glorot, X., and Bengio, Y. (2010, January 13\u201315). Understanding the difficulty of training deep feedforward neural networks. Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, Sardinia, Italy."},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"6263","DOI":"10.1007\/s11042-015-3199-8","article-title":"Anomaly detection based on spatio-temporal sparse representation and visual attention analysis","volume":"76","author":"Wang","year":"2017","journal-title":"Multimedia Tools Appl."},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"35","DOI":"10.1007\/s00138-016-0800-8","article-title":"Abnormality detection in crowd videos by tracking sparse components","volume":"28","author":"Biswas","year":"2017","journal-title":"Mach. Vis. Appl."},{"key":"ref_40","doi-asserted-by":"crossref","first-page":"291","DOI":"10.1007\/s11760-016-0935-0","article-title":"An efficient system for anomaly detection using deep learning classifier","volume":"11","author":"Revathi","year":"2017","journal-title":"Signal Image Video Process."},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"6","DOI":"10.1016\/j.image.2017.09.002","article-title":"Anomaly detection based on two global grid motion templates","volume":"60","author":"Li","year":"2018","journal-title":"Signal Process. Image Commun."},{"key":"ref_42","first-page":"1","article-title":"Detection and localization of crowd behavior using a novel tracklet-based model","volume":"8","author":"Rabiee","year":"2017","journal-title":"Int. J. Mach. Learn. Cybern."},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Klaser, A., Marsza\u0142ek, M., and Schmid, C. (2008, January 1\u20134). A spatio-temporal descriptor based on 3D-gradients. Proceedings of the British Machine Vision Conference, Leeds, UK.","DOI":"10.5244\/C.22.99"},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Chaudhry, R., Ravichandran, A., Hager, G., and Vidal, R. (2009, January 20\u201325). Histograms of oriented optical flow and binet-cauchy kernels on nonlinear dynamical systems for the recognition of human actions. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2009), Miami, FL, USA.","DOI":"10.1109\/CVPRW.2009.5206821"},{"key":"ref_45","doi-asserted-by":"crossref","first-page":"60","DOI":"10.1007\/s11263-012-0594-8","article-title":"Dense trajectories and motion boundary descriptors for action recognition","volume":"103","author":"Wang","year":"2013","journal-title":"Int. J. Comput. Vis."},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"673","DOI":"10.1109\/TCSVT.2016.2637778","article-title":"Histograms of Optical Flow Orientation and Magnitude and Entropy to Detect Anomalous Events in Videos","volume":"27","author":"Colque","year":"2017","journal-title":"IEEE Trans. Circuits Syst. Video Technol."},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Colque, R.V.H.M., Junior, C.A.C., and Schwartz, W.R. (2015, January 26\u201329). Histograms of optical flow orientation and magnitude to detect anomalous events in videos. Proceedings of the 2015 28th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI), Salvador, Brazil.","DOI":"10.1109\/SIBGRAPI.2015.21"},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Mahadevan, V., Li, W., Bhalodia, V., and Vasconcelos, N. (2010, January 13\u201318). Anomaly detection in crowded scenes. Proceedings of the 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), San Francisco, CA, USA.","DOI":"10.1109\/CVPR.2010.5539872"},{"key":"ref_49","doi-asserted-by":"crossref","unstructured":"Cong, Y., Yuan, J., and Liu, J. (2011, January 20\u201325). Sparse reconstruction cost for abnormal event detection. Proceedings of the 2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Providence, RI, USA.","DOI":"10.1109\/CVPR.2011.5995434"},{"key":"ref_50","doi-asserted-by":"crossref","first-page":"117","DOI":"10.1016\/j.cviu.2016.10.010","article-title":"Detecting anomalous events in videos by learning deep representations of appearance and motion","volume":"156","author":"Xu","year":"2017","journal-title":"Comput. Vis. Image Underst."}],"container-title":["Journal of Imaging"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2313-433X\/4\/6\/79\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T15:07:56Z","timestamp":1760195276000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2313-433X\/4\/6\/79"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,6,8]]},"references-count":50,"journal-issue":{"issue":"6","published-online":{"date-parts":[[2018,6]]}},"alternative-id":["jimaging4060079"],"URL":"https:\/\/doi.org\/10.3390\/jimaging4060079","relation":{},"ISSN":["2313-433X"],"issn-type":[{"type":"electronic","value":"2313-433X"}],"subject":[],"published":{"date-parts":[[2018,6,8]]}}}