{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,14]],"date-time":"2026-03-14T06:38:37Z","timestamp":1773470317288,"version":"3.50.1"},"reference-count":89,"publisher":"Springer Science and Business Media LLC","issue":"19","license":[{"start":{"date-parts":[[2022,3,29]],"date-time":"2022-03-29T00:00:00Z","timestamp":1648512000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2022,3,29]],"date-time":"2022-03-29T00:00:00Z","timestamp":1648512000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Multimed Tools Appl"],"published-print":{"date-parts":[[2022,8]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Gathering a large number of people in a shared physical area is very common in urban culture. Although there are limitless examples of mega crowds, the Islamic religious ritual, the Hajj, is considered as one of the greatest crowd scenarios in the world. The Hajj is carried out once in a year with a congregation of millions of people when the Muslims visit the holy city of Makkah at a given time and date. Such a big crowd is always prone to public safety issues, and therefore requires proper measures to ensure safe and comfortable arrangement. Through the advances in computer vision based scene understanding, automatic analysis of crowd scenes is gaining popularity. However, existing crowd analysis algorithms might not be able to correctly interpret the video content in the context of the Hajj. This is because the Hajj is a unique congregation of millions of people crowded in a small area, which can overwhelm the use of existing video and computer vision based sophisticated algorithms. Through our studies on crowd analysis, crowd counting, density estimation, and the Hajj crowd behavior, we faced the need of a review work to get a research direction for abnormal behavior analysis of Hajj pilgrims. Therefore, this review aims to summarize the research works relevant to the broader field of video analytics using deep learning with a special focus on the visual surveillance in the Hajj. The review identifies the challenges and leading-edge techniques of visual surveillance in general, which may gracefully be adaptable to the applications of Hajj and Umrah. The paper presents detailed reviews on existing techniques and approaches employed for crowd analysis from crowd videos, specifically the techniques that use deep learning in detecting abnormal behavior. These observations give us the impetus to undertake a painstaking yet exhilarating journey on crowd analysis, classification and detection of any abnormal movement of the Hajj pilgrims. Furthermore, because the Hajj pilgrimage is the most crowded domain for video-related extensive research activities, this study motivates us to critically analyze the crowd on a large scale.<\/jats:p>","DOI":"10.1007\/s11042-022-12833-z","type":"journal-article","created":{"date-parts":[[2022,3,29]],"date-time":"2022-03-29T06:14:26Z","timestamp":1648534466000},"page":"27895-27922","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":31,"title":["Video analytics using deep learning for crowd analysis: a review"],"prefix":"10.1007","volume":"81","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-8919-4459","authenticated-orcid":false,"given":"Md Roman","family":"Bhuiyan","sequence":"first","affiliation":[]},{"given":"Junaidi","family":"Abdullah","sequence":"additional","affiliation":[]},{"given":"Noramiza","family":"Hashim","sequence":"additional","affiliation":[]},{"given":"Fahmid","family":"Al Farid","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2022,3,29]]},"reference":[{"key":"12833_CR1","unstructured":"\"True Islam\". Quran-Islam.org. Archived from the original on 2013-03-13. Retrieved 2013-07-31."},{"key":"12833_CR2","unstructured":"\"A history of hajj tragedies | world news\". London: theguardian.com. January 13, 2006. Archived from the original on August 29, 2013. Retrieved 2013-07-31"},{"issue":"2","key":"12833_CR3","doi-asserted-by":"publisher","first-page":"2183","DOI":"10.32604\/cmc.2020.014227","volume":"66","author":"W Albattah","year":"2020","unstructured":"Albattah W et al (2020) Hajj crowd management using CNN-based approach. Comput, Mat Continua 66(2):2183\u20132197. https:\/\/doi.org\/10.32604\/cmc.2020.014227","journal-title":"Comput, Mat Continua"},{"key":"12833_CR4","doi-asserted-by":"publisher","unstructured":"Amirgholipour S et al. (2018). 'A-CCNN : Adaptive CCNN for Density Estimation and Crowd Counting Global Big Data Technologies Centre , University of Technology Sydney , Australia Quantitative Imaging , CSIRO Data61 , Australia Institute for Sustainable Futures , University of Technology'. 2018 25th IEEE International Conference on Image Processing (ICIP). IEEE, pp. 948\u2013952. https:\/\/doi.org\/10.1109\/ICIP.2018.8451399.","DOI":"10.1109\/ICIP.2018.8451399"},{"issue":"11","key":"12833_CR5","doi-asserted-by":"publisher","first-page":"1555","DOI":"10.1109\/TCSVT.2008.2005603","volume":"18","author":"N Anjum","year":"2008","unstructured":"Anjum N, Cavallaro A (2008) Multifeature object trajectory clustering for video analysis. IEEE Trans Circ Syst Video Technol 18(11):1555\u20131564. https:\/\/doi.org\/10.1109\/TCSVT.2008.2005603","journal-title":"IEEE Trans Circ Syst Video Technol"},{"key":"12833_CR6","doi-asserted-by":"crossref","unstructured":"Arteta C, Lempitsky V. Zisserman A (2016) Counting in the wild. In Proceedings of the European Conference on Computer Vision, Amsterdam, the Netherlands, 8\u201316; pp. 483\u2013498.","DOI":"10.1007\/978-3-319-46478-7_30"},{"issue":"12","key":"12833_CR7","doi-asserted-by":"publisher","first-page":"18839","DOI":"10.1007\/s11042-021-10646-0","volume":"80","author":"M Bansal","year":"2021","unstructured":"Bansal M, Kumar M, Kumar M (2021) 2D object recognition: a comparative analysis of SIFT, SURF and ORB feature descriptors. Multimed Tools Appl 80(12):18839\u201318857. https:\/\/doi.org\/10.1007\/s11042-021-10646-0","journal-title":"Multimed Tools Appl"},{"key":"12833_CR8","doi-asserted-by":"crossref","unstructured":"Basharat A, Gritai A, Shah M (2008) Learning object motion patterns for anomaly detection and improved object detection, in: proceedings of the IEEE conference on computer vision and pattern recognition, pp. 1\u20138.","DOI":"10.1109\/CVPR.2008.4587510"},{"key":"12833_CR9","doi-asserted-by":"publisher","unstructured":"Bendali-Braham M et al (2021) \u2018Recent trends in crowd analysis: a review\u2019, machine learning with applications. Elsevier ltd., 4(October 2020), p. 100023. https:\/\/doi.org\/10.1016\/j.mlwa.2021.100023","DOI":"10.1016\/j.mlwa.2021.100023"},{"key":"12833_CR10","doi-asserted-by":"publisher","unstructured":"Boominathan L, Kruthiventi SSS, Venkatesh Babu R (2016) \u2018CrowdNet: a deep convolutional network for dense crowd counting\u2019. MM 2016 - Proceed 2016 ACM Multimedia Conf pp. 640\u2013644. https:\/\/doi.org\/10.1145\/2964284.2967300","DOI":"10.1145\/2964284.2967300"},{"key":"12833_CR11","doi-asserted-by":"publisher","unstructured":"Brostow GJ, Cipolla R (2006) \u2018Unsupervised bayesian detection of independent motion in crowds\u2019, Proceedings of the IEEE computer society conference on computer vision and pattern recognition, 1, pp. 594\u2013601. https:\/\/doi.org\/10.1109\/CVPR.2006.320","DOI":"10.1109\/CVPR.2006.320"},{"issue":"1","key":"12833_CR12","doi-asserted-by":"publisher","first-page":"206","DOI":"10.1109\/TITS.2009.2030963","volume":"11","author":"J Candamo","year":"2010","unstructured":"Candamo J, \u2026 Kasturi R (2010) Understanding transit scenes: a survey on human behavior-recognition algorithms. IEEE Trans Intell Transp Syst 11(1):206\u2013224. https:\/\/doi.org\/10.1109\/TITS.2009.2030963","journal-title":"IEEE Trans Intell Transp Syst"},{"key":"12833_CR13","doi-asserted-by":"publisher","unstructured":"Chan AB, Vasconcelos N (2009) \u2018Bayesian poisson regression for crowd counting\u2019. Proceed IEEE Int Conf Comput Vision, (Iccv). 545\u2013551. https:\/\/doi.org\/10.1109\/ICCV.2009.5459191","DOI":"10.1109\/ICCV.2009.5459191"},{"key":"12833_CR14","doi-asserted-by":"publisher","unstructured":"Chan AB, Liang ZSJ, Vasconcelos N (2008) \u2018Privacy preserving crowd monitoring: counting people without people models or tracking\u2019, 26th IEEE conference on computer vision and pattern recognition. CVPR. https:\/\/doi.org\/10.1109\/CVPR.2008.4587569","DOI":"10.1109\/CVPR.2008.4587569"},{"issue":"12","key":"12833_CR15","doi-asserted-by":"publisher","first-page":"5017","DOI":"10.1109\/TIP.2015.2475625","volume":"24","author":"TH Chan","year":"2015","unstructured":"Chan TH, \u2026 Ma Y (2015) PCANet: a simple deep learning baseline for image classification? IEEE Trans Image Process 24(12):5017\u20135032. https:\/\/doi.org\/10.1109\/TIP.2015.2475625","journal-title":"IEEE Trans Image Process"},{"key":"12833_CR16","doi-asserted-by":"publisher","unstructured":"Chattopadhyay P et al (2017) \u2018Counting everyday objects in everyday scenes\u2019, proceedings - 30th IEEE conference on computer vision and pattern recognition, CVPR 2017, 2017-January, pp. 4428\u20134437. https:\/\/doi.org\/10.1109\/CVPR.2017.471","DOI":"10.1109\/CVPR.2017.471"},{"key":"12833_CR17","doi-asserted-by":"publisher","unstructured":"Chen K et al (2012) \u2018Feature mining for localised crowd counting\u2019, BMVC 2012 - electronic proceedings of the British machine vision conference 2012. https:\/\/doi.org\/10.5244\/C.26.21","DOI":"10.5244\/C.26.21"},{"issue":"7","key":"12833_CR18","doi-asserted-by":"publisher","first-page":"673","DOI":"10.1016\/j.jvcir.2011.07.001","volume":"22","author":"HY Cheng","year":"2011","unstructured":"Cheng HY, Hwang JN (2011) Integrated video object tracking with applications in trajectory-based event detection. J Visual Comm Image Represent. Elsevier Inc 22(7):673\u2013685. https:\/\/doi.org\/10.1016\/j.jvcir.2011.07.001","journal-title":"J Visual Comm Image Represent. Elsevier Inc"},{"key":"12833_CR19","doi-asserted-by":"publisher","unstructured":"Cohen JP et al (2017) \u2018Count-ception: counting by fully convolutional redundant counting\u2019, proceedings - 2017 IEEE international conference on computer vision workshops, ICCVW 2017, 2018-January, pp. 18\u201326. https:\/\/doi.org\/10.1109\/ICCVW.2017.9","DOI":"10.1109\/ICCVW.2017.9"},{"key":"12833_CR20","doi-asserted-by":"publisher","unstructured":"Cui X et al (2011) \u2018Abnormal detection using interaction energy potentials\u2019, proceedings of the IEEE computer society conference on computer vision and pattern recognition, pp. 3161\u20133167. https:\/\/doi.org\/10.1109\/CVPR.2011.5995558","DOI":"10.1109\/CVPR.2011.5995558"},{"issue":"4","key":"12833_CR21","doi-asserted-by":"publisher","first-page":"1071","DOI":"10.1007\/s11831-019-09344-w","volume":"27","author":"S Dargan","year":"2020","unstructured":"Dargan S, \u2026 Kumar G (2020) A survey of deep learning and its applications: a new paradigm to machine learning. Arch Comput Methods Eng. Springer Netherlands 27(4):1071\u20131092. https:\/\/doi.org\/10.1007\/s11831-019-09344-w","journal-title":"Arch Comput Methods Eng. Springer Netherlands"},{"key":"12833_CR22","doi-asserted-by":"publisher","unstructured":"Deb D, Ventura J (2018) \u2018An aggregated multicolumn dilated convolution network for perspective-free counting\u2019, IEEE computer society conference on computer vision and pattern recognition workshops. IEEE, 2018-June, pp. 308\u2013317. https:\/\/doi.org\/10.1109\/CVPRW.2018.00057","DOI":"10.1109\/CVPRW.2018.00057"},{"issue":"4","key":"12833_CR23","doi-asserted-by":"publisher","first-page":"743","DOI":"10.1109\/TPAMI.2011.155","volume":"34","author":"P Doll\u00e1r","year":"2012","unstructured":"Doll\u00e1r P et al (2012) Pedestrian detection: an evaluation of the state of the art. IEEE Trans Pattern Anal Mach Intell 34(4):743\u2013761. https:\/\/doi.org\/10.1109\/TPAMI.2011.155","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"12833_CR24","unstructured":"Du D, Qi H, Huang Q, Zeng W, Zhang C (2013) Abnormal event detection in crowded scenes based on structural multi-scale motion interrelated patterns, in: proceedings of the IEEE international conference on multimedia and Expo, pp. 1\u20136."},{"issue":"9","key":"12833_CR25","doi-asserted-by":"publisher","first-page":"95","DOI":"10.3390\/jimaging6090095","volume":"6","author":"S Elbishlawi","year":"2020","unstructured":"Elbishlawi S, Abdelpakey MH, Eltantawy A, Shehata MS, Mohamed MM (2020) Deep learning-based crowd scene analysis survey. J Imaging 6(9):95. https:\/\/doi.org\/10.3390\/jimaging6090095","journal-title":"J Imaging"},{"issue":"July 2016","key":"12833_CR26","doi-asserted-by":"publisher","first-page":"548","DOI":"10.1016\/j.neucom.2016.09.063","volume":"219","author":"Y Feng","year":"2017","unstructured":"Feng Y, Yuan Y, Lu X (2017) \u2018Learning deep event models for crowd anomaly detection\u2019, Neurocomputing. Elsevier 219(July 2016):548\u2013556. https:\/\/doi.org\/10.1016\/j.neucom.2016.09.063","journal-title":"Elsevier"},{"issue":"5","key":"12833_CR27","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0034782","volume":"7","author":"T Fiaschi","year":"2012","unstructured":"Fiaschi T, Giannoni E, Taddei ML, Chiarugi P (2012) Globular adiponectin activates motility and regenerative traits of muscle satellite cells. PLoS One 7(5):e34782. https:\/\/doi.org\/10.1371\/journal.pone.0034782","journal-title":"PLoS One"},{"issue":"2","key":"12833_CR28","doi-asserted-by":"publisher","first-page":"6","DOI":"10.1109\/MC.2014.42","volume":"47","author":"D Forsyth","year":"2014","unstructured":"Forsyth D (2014) Object detection with discriminatively trained part-based models\u2019. Computer 47(2):6\u20137. https:\/\/doi.org\/10.1109\/MC.2014.42","journal-title":"Computer"},{"key":"12833_CR29","doi-asserted-by":"publisher","first-page":"81","DOI":"10.1016\/j.engappai.2015.04.006","volume":"43","author":"M Fu","year":"2015","unstructured":"Fu M, \u2026 Zhu C (2015) \u2018Fast crowd density estimation with convolutional neural networks\u2019, engineering applications of artificial intelligence. Elsevier 43:81\u201388. https:\/\/doi.org\/10.1016\/j.engappai.2015.04.006","journal-title":"Elsevier"},{"key":"12833_CR30","unstructured":"Gao, G. et al. (2020). 'CNN-based density estimation and crowd counting: a Survey', pp. 1\u201325. Available at: http:\/\/arxiv.org\/abs\/2003.12783."},{"key":"12833_CR31","doi-asserted-by":"publisher","unstructured":"Gao J, Wang Q, Li X (2019) PCC net: perspective crowd counting via spatial convolutional network. IEEE Trans Circ Syst Video Technol 30:1\u20131, 3498. https:\/\/doi.org\/10.1109\/tcsvt.2019.2919139","DOI":"10.1109\/tcsvt.2019.2919139"},{"key":"12833_CR32","doi-asserted-by":"publisher","unstructured":"Gupta, S., Kumar, M. and Garg, A. (2019) \u2018Improved object recognition results using SIFT and ORB feature detector\u2019, multimedia tools and applications. Multimedia Tools and Applications, 78(23), pp. 34157\u201334171. doi: https:\/\/doi.org\/10.1007\/s11042-019-08232-6.","DOI":"10.1007\/s11042-019-08232-6"},{"issue":"4","key":"12833_CR33","doi-asserted-by":"publisher","first-page":"632","DOI":"10.20965\/jaciii.2017.p0632","volume":"21","author":"K Han","year":"2017","unstructured":"Han K et al (2017) Image crowd counting using convolutional neural network and Markov random field. J Advanc Comput Intell Intell Inform 21(4):632\u2013638. https:\/\/doi.org\/10.20965\/jaciii.2017.p0632","journal-title":"J Advanc Comput Intell Intell Inform"},{"key":"12833_CR34","doi-asserted-by":"publisher","unstructured":"Hossain MA et al (2019) 'Crowd counting using scale-aware attention networks', proceedings - 2019 IEEE winter conference on applications of computer vision, WACV 2019, pp. 1280\u20131288. https:\/\/doi.org\/10.1109\/WACV.2019.00141","DOI":"10.1109\/WACV.2019.00141"},{"key":"12833_CR35","doi-asserted-by":"publisher","first-page":"530","DOI":"10.1016\/j.jvcir.2016.03.021","volume":"38","author":"Y Hu","year":"2016","unstructured":"Hu Y, \u2026 Li T (2016) Dense crowd counting from still images with convolutional neural networks. J Visual Commun Image Representation Elsevier Inc 38:530\u2013539. https:\/\/doi.org\/10.1016\/j.jvcir.2016.03.021","journal-title":"J Visual Commun Image Representation Elsevier Inc"},{"issue":"3","key":"12833_CR36","doi-asserted-by":"publisher","first-page":"1049","DOI":"10.1109\/TIP.2017.2740160","volume":"27","author":"S Huang","year":"2018","unstructured":"Huang S, \u2026 Han J (2018) Body structure aware deep crowd counting. IEEE Trans Image Process 27(3):1049\u20131059. https:\/\/doi.org\/10.1109\/TIP.2017.2740160","journal-title":"IEEE Trans Image Process"},{"key":"12833_CR37","doi-asserted-by":"publisher","unstructured":"Idrees H et al (2013) \u2018Multi-source multi-scale counting in extremely dense crowd images\u2019, Proceedings of the IEEE computer society conference on computer vision and pattern recognition, pp. 2547\u20132554. https:\/\/doi.org\/10.1109\/CVPR.2013.329","DOI":"10.1109\/CVPR.2013.329"},{"key":"12833_CR38","doi-asserted-by":"publisher","unstructured":"Idrees H et al (2018) \u2018Composition loss for counting, density map estimation and localization in dense crowds\u2019, lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics), 11206 LNCS, pp. 544\u2013559. https:\/\/doi.org\/10.1007\/978-3-030-01216-8_33","DOI":"10.1007\/978-3-030-01216-8_33"},{"key":"12833_CR39","doi-asserted-by":"publisher","unstructured":"Ilyas N, Shahzad A, Kim K (2020) Convolutional-neural network-based image crowd counting: review, categorisation, analysis, and performance evaluation. Sensors (Switzerland) 20(1). https:\/\/doi.org\/10.3390\/s20010043","DOI":"10.3390\/s20010043"},{"key":"12833_CR40","doi-asserted-by":"publisher","unstructured":"Ji S, \u2026 Yu K (2013) \u20183D convolutional neural networks for human action recognition\u2019, IEEE transactions on pattern analysis and machine intelligence. IEEE 35(1):221\u2013231. https:\/\/doi.org\/10.1109\/TPAMI.2012.59","DOI":"10.1109\/TPAMI.2012.59"},{"key":"12833_CR41","unstructured":"Kang D, Chan A (2019) \u2018Crowd counting by adaptively fusing predictions from an image pyramid\u2019, British machine vision conference 2018. BMVC 2018:1\u201312"},{"issue":"7","key":"12833_CR42","first-page":"2322","volume":"12","author":"A Kumar","year":"2021","unstructured":"Kumar A, Arunnehru J (2021) Crowd behavior monitoring and analysis in surveillance applications: a survey crowd behavior monitoring and analysis in surveillance applications: a survey. Turkish J Comput Math Educ 12(7):2322\u20132336","journal-title":"Turkish J Comput Math Educ"},{"key":"12833_CR43","unstructured":"Lempitsky V and Zisserman A (2010) \u201cLearning To Count Objects in Images,\u201d pp. 1\u20139"},{"key":"12833_CR44","doi-asserted-by":"publisher","unstructured":"Li W, Zhao R, Xiao T, and Wang X (2014), \u201cDeepReID: Deep filter pairing neural network for person re-identification,\u201d Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit., pp. 152\u2013159, doi: https:\/\/doi.org\/10.1109\/CVPR.2014.27.","DOI":"10.1109\/CVPR.2014.27"},{"key":"12833_CR45","doi-asserted-by":"crossref","unstructured":"Li Y, Zhang X, Chen D (2018) \u2018CSRNet : dilated convolutional neural networks for understanding the highly congested scenes\u2019, pp. 1091\u20131100.","DOI":"10.1109\/CVPR.2018.00120"},{"key":"12833_CR46","doi-asserted-by":"publisher","unstructured":"Li Y, Zhang X, Chen D (2018) 'CSRNet: dilated convolutional neural networks for understanding the highly congested Scenes'. Proceed IEEE comput Soc Conf Comput Vision Patt Recogn. pp. 1091\u20131100. https:\/\/doi.org\/10.1109\/CVPR.2018.00120","DOI":"10.1109\/CVPR.2018.00120"},{"issue":"0","key":"12833_CR47","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1080\/15472450.2020.1746909","volume":"0","author":"Y Li","year":"2020","unstructured":"Li Y, Sarvi M, Khoshelham K, Haghani M (2020) Multi-view crowd congestion monitoring system based on an ensemble of convolutional neural network classifiers. J Intell Trans Syst: Technol, Plann, Oper. Taylor & Francis 0(0):1\u201312. https:\/\/doi.org\/10.1080\/15472450.2020.1746909","journal-title":"J Intell Trans Syst: Technol, Plann, Oper. Taylor & Francis"},{"key":"12833_CR48","doi-asserted-by":"publisher","first-page":"978","DOI":"10.1109\/TPAMI.2010.147","volume":"33","author":"C Liu","year":"2011","unstructured":"Liu C, Yuen J, Torralba A (2011) Sift flow: dense correspondence across scenes and its applications. TPAMI 33:978\u2013994","journal-title":"TPAMI"},{"key":"12833_CR49","doi-asserted-by":"publisher","unstructured":"Liu J et al (2018) \u2018DecideNet: counting varying density crowds through attention guided detection and density estimation\u2019, proceedings of the IEEE computer society conference on computer vision and pattern recognition, pp. 5197\u20135206. https:\/\/doi.org\/10.1109\/CVPR.2018.00545","DOI":"10.1109\/CVPR.2018.00545"},{"key":"12833_CR50","doi-asserted-by":"publisher","unstructured":"Liu L et al (2018) \u2018Crowd counting using deep recurrent spatial-aware network\u2019, IJCAI international joint conference on artificial intelligence, 2018-July, pp. 849\u2013855. https:\/\/doi.org\/10.24963\/ijcai.2018\/118.","DOI":"10.24963\/ijcai.2018\/118"},{"key":"12833_CR51","doi-asserted-by":"publisher","unstructured":"Liu W, Salzmann M, Fua P (2019) 'Context-aware crowd counting', proceedings of the IEEE computer society conference on computer vision and pattern recognition, 2019-June, pp. 5094\u20135103. https:\/\/doi.org\/10.1109\/CVPR.2019.00524","DOI":"10.1109\/CVPR.2019.00524"},{"key":"12833_CR52","doi-asserted-by":"publisher","unstructured":"Lu C, Shi J, Jia J (2013) \u2018Abnormal event detection at 150 FPS in MATLAB\u2019, Proceedings of the IEEE International Conference on Computer Vision, pp. 2720\u20132727. https:\/\/doi.org\/10.1109\/ICCV.2013.338","DOI":"10.1109\/ICCV.2013.338"},{"key":"12833_CR53","doi-asserted-by":"crossref","unstructured":"Lu T, Wu L, Ma X, Shivakumara P, Tan CL (2014) Anomaly detection through spatiotemporal context modeling in crowded scenes, in: proceedings of the international conference on pattern recognition, pp. 2203\u20132208.","DOI":"10.1109\/ICPR.2014.383"},{"key":"12833_CR54","doi-asserted-by":"publisher","unstructured":"Marsden M, McGuinness K et al (2017) 'Fully convolutional crowd counting on highly congested scenes', VISIGRAPP 2017 - proceedings of the 12th international joint conference on computer vision, imaging and computer graphics theory and applications, 5(Visigrapp), pp. 27\u201333. https:\/\/doi.org\/10.5220\/0006097300270033","DOI":"10.5220\/0006097300270033"},{"key":"12833_CR55","doi-asserted-by":"publisher","unstructured":"Marsden M et al (2017) \u2018ResnetCrowd: a residual deep learning architecture for crowd counting, violent behaviour detection and crowd density level classification\u2019, 2017 14th IEEE international conference on advanced video and signal based surveillance, AVSS 2017. https:\/\/doi.org\/10.1109\/AVSS.2017.8078482","DOI":"10.1109\/AVSS.2017.8078482"},{"key":"12833_CR56","doi-asserted-by":"crossref","unstructured":"Mundhenk TN, Konjevod G, Sakla WA, Boakye K (2016). A large contextual dataset for classification, detection and counting of cars with deep learning. In Proceedings of the European Conference on Computer Vision, Munich, Germany, 8\u201316; pp. 785\u2013800","DOI":"10.1007\/978-3-319-46487-9_48"},{"key":"12833_CR57","unstructured":"Onoro-Rubio D, Niepert M, Lopez-Sastre RJ (2019) \u2018Learning short-cut connections for object counting\u2019, British machine vision conference 2018. BMVC 2018:1\u201312"},{"key":"12833_CR58","doi-asserted-by":"publisher","unstructured":"Ouyang W and Wang X (2013), \u201cJoint deep learning for pedestrian detection,\u201d Proc. IEEE Int. Conf. Comput. Vis., pp. 2056\u20132063, doi: https:\/\/doi.org\/10.1109\/ICCV.2013.257.","DOI":"10.1109\/ICCV.2013.257"},{"key":"12833_CR59","doi-asserted-by":"publisher","unstructured":"Pham VQ et al (2015) \u2018COUNT forest: co-voting uncertain number of targets using random forest for crowd density estimation\u2019, proceedings of the IEEE international conference on computer vision, 2015 international conference on computer vision, ICCV 2015, pp. 3253\u20133261. https:\/\/doi.org\/10.1109\/ICCV.2015.372","DOI":"10.1109\/ICCV.2015.372"},{"key":"12833_CR60","doi-asserted-by":"crossref","unstructured":"Rodriguez M, Sivic J, Laptev I, Audibert J (2011) Datadriven crowd analysis in videos In ICCV","DOI":"10.1109\/ICCV.2011.6126374"},{"key":"12833_CR61","doi-asserted-by":"publisher","unstructured":"Ryan D et al (2009) \u2018Crowd counting using multiple local features\u2019, DICTA 2009 - digital image computing: techniques and applications, pp. 81\u201388. https:\/\/doi.org\/10.1109\/DICTA.2009.22","DOI":"10.1109\/DICTA.2009.22"},{"key":"12833_CR62","doi-asserted-by":"crossref","unstructured":"Sam DB (2017) \u2018Switching convolutional neural network for crowd counting\u2019, pp. 5744\u20135752.","DOI":"10.1109\/CVPR.2017.429"},{"key":"12833_CR63","unstructured":"Sermanet P, Eigen D, Zhang X, Mathieu M, Fergus R, LeCun Y (2013) Overfeat: integrated recognition, localization and detection using convolutional networks. arXiv preprint arXiv:1312.6229."},{"issue":"9","key":"12833_CR64","doi-asserted-by":"publisher","first-page":"2627","DOI":"10.1109\/TCSVT.2018.2803115","volume":"29","author":"MB Shami","year":"2019","unstructured":"Shami MB, \u2026 Cheung SCS (2019) People counting in dense crowd images using sparse head detections. IEEE Trans Circ Syst for Video Technol 29(9):2627\u20132636. https:\/\/doi.org\/10.1109\/TCSVT.2018.2803115","journal-title":"IEEE Trans Circ Syst for Video Technol"},{"key":"12833_CR65","doi-asserted-by":"publisher","unstructured":"Sindagi VA, Patel VM (2017) 'CNN-based cascaded multitask learning of high-level prior and density estimation for crowd counting', 2017 14th IEEE international conference on advanced video and signal based surveillance, AVSS 2017. https:\/\/doi.org\/10.1109\/AVSS.2017.8078491","DOI":"10.1109\/AVSS.2017.8078491"},{"key":"12833_CR66","doi-asserted-by":"publisher","unstructured":"Sindagi VA, Patel VM (2017) Pt us CRPattern Recognition Letters. Elsevier B.V. https:\/\/doi.org\/10.1016\/j.patrec.2017.07.007","DOI":"10.1016\/j.patrec.2017.07.007"},{"key":"12833_CR67","doi-asserted-by":"publisher","unstructured":"Sindagi VA, Patel VM (2017) \u2018Generating high-quality crowd density maps using contextual pyramid CNNs\u2019, proceedings of the IEEE international conference on computer vision, 2017-October, pp. 1879\u20131888. https:\/\/doi.org\/10.1109\/ICCV.2017.206","DOI":"10.1109\/ICCV.2017.206"},{"key":"12833_CR68","doi-asserted-by":"publisher","unstructured":"Singh K, \u2026 Walia GS (2020) 'Crowd anomaly detection using aggregation of ensembles of fine-tuned ConvNets', Neuro computing. Elsevier BV 371:188\u2013198. https:\/\/doi.org\/10.1016\/j.neucom.2019.08.059","DOI":"10.1016\/j.neucom.2019.08.059"},{"key":"12833_CR69","doi-asserted-by":"crossref","unstructured":"Tian Y et al (2020) \u2018PaDNet : pan-density crowd counting\u2019, pp. 1\u201314.","DOI":"10.1109\/TIP.2019.2952083"},{"key":"12833_CR70","doi-asserted-by":"crossref","unstructured":"J. Tighe and S. Lazebnik. (2010). Superparsing: scalable nonparametric image parsing with superpixels. In ECCV. Springer.","DOI":"10.1007\/978-3-642-15555-0_26"},{"issue":"10","key":"12833_CR71","doi-asserted-by":"publisher","first-page":"1713","DOI":"10.1109\/TPAMI.2008.75","volume":"30","author":"O Tuzel","year":"2008","unstructured":"Tuzel O, Porikli F, Meer P (2008) Pedestrian detection via classification on Riemannian manifolds. IEEE Trans Pattern Anal Mach Intell 30(10):1713\u20131727. https:\/\/doi.org\/10.1109\/TPAMI.2008.75","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"12833_CR72","unstructured":"van den Oord A, Schrauwen B (2014) Factoring variations in natural images with deep gaussian mixture models, in: proceedings of the advances in neural information processing systems, pp. 3518\u20133526."},{"key":"12833_CR73","doi-asserted-by":"crossref","unstructured":"Walach E, Wolf L (2016). Learning to count with CNN boosting. In Proceedings of the European Conference on Computer Vision, Amsterdam, the Netherlands, 11\u201314; pp. 660\u2013676","DOI":"10.1007\/978-3-319-46475-6_41"},{"key":"12833_CR74","doi-asserted-by":"publisher","unstructured":"Wang C et al (2015) \u2018Deep people counting in extremely dense crowds\u2019, MM 2015 - proceedings of the 2015 ACM multimedia conference, pp. 1299\u20131302. https:\/\/doi.org\/10.1145\/2733373.2806337","DOI":"10.1145\/2733373.2806337"},{"key":"12833_CR75","unstructured":"Wang, Li et al. (2018). \u2018Crowd Counting with Density Adaption Networks\u2019, pp. 1\u20135. Available at: http:\/\/arxiv.org\/abs\/1806.10040."},{"key":"12833_CR76","unstructured":"Wang, Q. et al. (2020). 'NWPU-Crowd: A Large-Scale Benchmark for Crowd Counting,' 10 jan 2020, pp. 1\u20138. Available at: http:\/\/arxiv.org\/abs\/2001.03360."},{"issue":"2","key":"12833_CR77","doi-asserted-by":"publisher","first-page":"247","DOI":"10.1007\/s11263-006-0027-7","volume":"75","author":"B Wu","year":"2007","unstructured":"Wu B, Nevatia R (2007) Detection and tracking of multiple, partially occluded humans by Bayesian combination of edgelet based part detectors. Int J Comput Vis 75(2):247\u2013266. https:\/\/doi.org\/10.1007\/s11263-006-0027-7","journal-title":"Int J Comput Vis"},{"key":"12833_CR78","doi-asserted-by":"publisher","unstructured":"Wu S, Moore BE, Shah M (2010) \u2018Chaotic invariants of lagrangian particle trajectories for anomaly detection in crowded scenes\u2019, Proceedings of the IEEE computer society conference on computer vision and pattern recognition, pp. 2054\u20132060. https:\/\/doi.org\/10.1109\/CVPR.2010.5539882","DOI":"10.1109\/CVPR.2010.5539882"},{"key":"12833_CR79","doi-asserted-by":"publisher","unstructured":"Xu M et al (2019) \u2018Depth information guided crowd counting for complex crowd scenes\u2019, Pattern Recognition Letters. Elsevier B.V., 125, pp. 563\u2013569. https:\/\/doi.org\/10.1016\/j.patrec.2019.02.026","DOI":"10.1016\/j.patrec.2019.02.026"},{"key":"12833_CR80","doi-asserted-by":"publisher","unstructured":"Xu M et al (2019) \u2018Depth information guided crowd counting for complex crowd scenes\u2019. Patt Recogn Lett. Elsevier B.V., 125, pp. 563\u2013569. https:\/\/doi.org\/10.1016\/j.patrec.2019.02.026","DOI":"10.1016\/j.patrec.2019.02.026"},{"key":"12833_CR81","doi-asserted-by":"publisher","first-page":"118","DOI":"10.1016\/j.image.2018.03.004","volume":"64","author":"B Yang","year":"2018","unstructured":"Yang B, \u2026 Zou L (2018) \u2018Counting challenging crowds robustly using a multi-column multi-task convolutional neural network\u2019, signal processing: image communication. Elsevier Ltd 64:118\u2013129. https:\/\/doi.org\/10.1016\/j.image.2018.03.004","journal-title":"Elsevier Ltd"},{"key":"12833_CR82","doi-asserted-by":"publisher","first-page":"112977","DOI":"10.1016\/j.eswa.2019.112977","volume":"141","author":"X Zeng","year":"2020","unstructured":"Zeng X, \u2026 Ye Y (2020) 'DSPNet: deep scale purifier network for dense crowd counting', expert systems with applications. Elsevier Ltd 141:112977. https:\/\/doi.org\/10.1016\/j.eswa.2019.112977","journal-title":"Elsevier Ltd"},{"key":"12833_CR83","doi-asserted-by":"publisher","unstructured":"Zhang C et al (2015) \u2018Cross-scene crowd counting via deep convolutional neural networks\u2019, Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 07-12-June-2015, pp. 833\u2013841. https:\/\/doi.org\/10.1109\/CVPR.2015.7298684","DOI":"10.1109\/CVPR.2015.7298684"},{"key":"12833_CR84","doi-asserted-by":"publisher","unstructured":"Zhang C et al (2015) \u2018Cross-scene crowd counting via deep convolutional neural networks\u2019, proceedings of the IEEE computer society conference on computer vision and pattern recognition, 07-12-June, pp. 833\u2013841. https:\/\/doi.org\/10.1109\/CVPR.2015.7298684","DOI":"10.1109\/CVPR.2015.7298684"},{"key":"12833_CR85","doi-asserted-by":"publisher","unstructured":"Zhang Y et al (2016) \u2018Single-image crowd counting via multi-column convolutional neural network\u2019, proceedings of the IEEE computer society conference on computer vision and pattern recognition, pp. 589\u2013597. https:\/\/doi.org\/10.1109\/CVPR.2016.70","DOI":"10.1109\/CVPR.2016.70"},{"key":"12833_CR86","doi-asserted-by":"publisher","unstructured":"Zhang Y et al (2016) \u2018Single-image crowd counting via multi-column convolutional neural network\u2019, proceedings of the IEEE computer society conference on computer vision and pattern recognition, 2016-December, pp. 589\u2013597. https:\/\/doi.org\/10.1109\/CVPR.2016.70","DOI":"10.1109\/CVPR.2016.70"},{"key":"12833_CR87","doi-asserted-by":"publisher","unstructured":"Zhang Y et al (2018) \u2018Auxiliary learning for crowd counting via count-net\u2019. Neuro Computing. Elsevier B.V., 273, pp. 190\u2013198. https:\/\/doi.org\/10.1016\/j.neucom.2017.08.018","DOI":"10.1016\/j.neucom.2017.08.018"},{"key":"12833_CR88","doi-asserted-by":"publisher","unstructured":"Zhou S, \u2026 Zhang Z (2016) \u2018Spatial-temporal convolutional neural networks for anomaly detection and localization in crowded scenes\u2019. Signal Process: Image Comm. Elsevier 47:358\u2013368. https:\/\/doi.org\/10.1016\/j.image.2016.06.007","DOI":"10.1016\/j.image.2016.06.007"},{"key":"12833_CR89","doi-asserted-by":"publisher","unstructured":"Zhu J, Feng F and Shen B (2018) \u2018People counting and pedestrian flow statistics based on convolutional neural network and recurrent neural network\u2019, Proceedings - 2018 33rd Youth Academic Annual Conference of Chinese Association of Automation, YAC 2018. IEEE, pp. 993\u2013998. doi: https:\/\/doi.org\/10.1109\/YAC.2018.8406516.","DOI":"10.1109\/YAC.2018.8406516"}],"container-title":["Multimedia Tools and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11042-022-12833-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s11042-022-12833-z\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11042-022-12833-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,7,19]],"date-time":"2022-07-19T07:25:29Z","timestamp":1658215529000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s11042-022-12833-z"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,3,29]]},"references-count":89,"journal-issue":{"issue":"19","published-print":{"date-parts":[[2022,8]]}},"alternative-id":["12833"],"URL":"https:\/\/doi.org\/10.1007\/s11042-022-12833-z","relation":{},"ISSN":["1380-7501","1573-7721"],"issn-type":[{"value":"1380-7501","type":"print"},{"value":"1573-7721","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,3,29]]},"assertion":[{"value":"18 January 2021","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"2 February 2022","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"9 March 2022","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"29 March 2022","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors hereby declare that there are no conflicts of interest in this research.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}]}}