{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,14]],"date-time":"2026-04-14T16:22:46Z","timestamp":1776183766564,"version":"3.50.1"},"reference-count":68,"publisher":"MDPI AG","issue":"14","license":[{"start":{"date-parts":[[2022,7,15]],"date-time":"2022-07-15T00:00:00Z","timestamp":1657843200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Safe Cities\u2014\u201cInova\u00e7\u00e3o para Construir Cidades Seguras\u201d","award":["POCI-01-0247-FEDER-041435"],"award-info":[{"award-number":["POCI-01-0247-FEDER-041435"]}]},{"name":"Safe Cities\u2014\u201cInova\u00e7\u00e3o para Construir Cidades Seguras\u201d","award":["COMPETE 2020"],"award-info":[{"award-number":["COMPETE 2020"]}]},{"name":"European Regional Development Fund (ERDF)","award":["POCI-01-0247-FEDER-041435"],"award-info":[{"award-number":["POCI-01-0247-FEDER-041435"]}]},{"name":"European Regional Development Fund (ERDF)","award":["COMPETE 2020"],"award-info":[{"award-number":["COMPETE 2020"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>The crowd counting task has become a pillar for crowd control as it provides information concerning the number of people in a scene. It is helpful in many scenarios such as video surveillance, public safety, and future event planning. To solve such tasks, researchers have proposed different solutions. In the beginning, researchers went with more traditional solutions, while recently the focus is on deep learning methods and, more specifically, on Convolutional Neural Networks (CNNs), because of their efficiency. This review explores these methods by focusing on their key differences, advantages, and disadvantages. We have systematically analyzed algorithms and works based on the different models suggested and the problems they are trying to solve. The main focus is on the shift made in the history of crowd counting methods, moving from the heuristic models to CNN models by identifying each category and discussing its different methods and architectures. After a deep study of the literature on crowd counting, the survey partitions current datasets into sparse and crowded ones. It discusses the reviewed methods by comparing their results on the different datasets. The findings suggest that the heuristic models could be even more effective than the CNN models in sparse scenarios.<\/jats:p>","DOI":"10.3390\/s22145286","type":"journal-article","created":{"date-parts":[[2022,7,18]],"date-time":"2022-07-18T01:53:22Z","timestamp":1658109202000},"page":"5286","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":12,"title":["Convolutional Neural Networks and Heuristic Methods for Crowd Counting: A Systematic Review"],"prefix":"10.3390","volume":"22","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-2670-3364","authenticated-orcid":false,"given":"Khouloud Ben Ali","family":"Hassen","sequence":"first","affiliation":[{"name":"Faculdade de Engenharia, Universidade do Porto, Rua Dr. Roberto Frias, s\/n, 4200-465 Porto, Portugal"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-1094-0114","authenticated-orcid":false,"given":"Jos\u00e9 J. M.","family":"Machado","sequence":"additional","affiliation":[{"name":"Departamento de Engenharia Mec\u00e2nica, Faculdade de Engenharia, Universidade do Porto, Rua Dr. Roberto Frias, s\/n, 4200-465 Porto, Portugal"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7603-6526","authenticated-orcid":false,"given":"Jo\u00e3o Manuel R. S.","family":"Tavares","sequence":"additional","affiliation":[{"name":"Departamento de Engenharia Mec\u00e2nica, Faculdade de Engenharia, Universidade do Porto, Rua Dr. Roberto Frias, s\/n, 4200-465 Porto, Portugal"}]}],"member":"1968","published-online":{"date-parts":[[2022,7,15]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"5046727","DOI":"10.1155\/2017\/5046727","article-title":"Low-Rank and Sparse Based Deep-Fusion Convolutional Neural Network for Crowd Counting","volume":"2017","author":"Tang","year":"2017","journal-title":"Math. Probl. Eng."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"4822","DOI":"10.1109\/TCYB.2020.3034316","article-title":"Feature-aware adaptation and density alignment for crowd counting in video surveillance","volume":"51","author":"Gao","year":"2020","journal-title":"IEEE Trans. Cybern."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Marsden, M., McGuinness, K., Little, S., and O\u2019Connor, N.E. (2016). Fully convolutional crowd counting on highly congested scenes. arXiv.","DOI":"10.5220\/0006097300270033"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Ding, X., Lin, Z., He, F., Wang, Y., and Huang, Y. (2018, January 15\u201320). A deeply-recursive convolutional network for crowd counting. Proceedings of the 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Calgary, AB, Canada.","DOI":"10.1109\/ICASSP.2018.8461772"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Marsden, M., McGuinness, K., Little, S., and O\u2019Connor, N.E. (September, January 29). Resnetcrowd: A residual deep learning architecture for crowd counting, violent behavior detection and crowd density level classification. Proceedings of the 2017 14th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS), Lecce, Italy.","DOI":"10.1109\/AVSS.2017.8078482"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Pan, X., Mo, H., Zhou, Z., and Wu, W. (2020, January 4\u20138). Attention guided region division for crowd counting. Proceedings of the ICASSP 2020\u20142020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain.","DOI":"10.1109\/ICASSP40776.2020.9053761"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"886","DOI":"10.1109\/CVPR.2005.177","article-title":"Histograms of oriented gradients for human detection","volume":"Volume 1","author":"Dalal","year":"2005","journal-title":"Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR\u201905)"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"878","DOI":"10.1109\/CVPR.2005.272","article-title":"Pedestrian detection in crowded scenes","volume":"Volume 1","author":"Leibe","year":"2005","journal-title":"Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR\u201905)"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"1713","DOI":"10.1109\/TPAMI.2008.75","article-title":"Pedestrian detection via classification on riemannian manifolds","volume":"30","author":"Tuzel","year":"2008","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"1627","DOI":"10.1109\/TPAMI.2009.167","article-title":"Object detection with discriminatively trained part-based models","volume":"32","author":"Felzenszwalb","year":"2009","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"645","DOI":"10.1109\/3468.983420","article-title":"Estimation of number of people in crowded scenes using perspective transformation","volume":"31","author":"Lin","year":"2001","journal-title":"IEEE Trans. Syst. Man Cybern. Part A Syst. Hum."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"1198","DOI":"10.1109\/TPAMI.2007.70770","article-title":"Segmentation and tracking of multiple humans in crowded environments","volume":"30","author":"Zhao","year":"2008","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Ge, W., and Collins, R.T. (2009, January 20\u201325). Marked point processes for crowd counting. Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA.","DOI":"10.1109\/CVPRW.2009.5206621"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Yang, D.B., Gonz\u00e1lez-Banos, H.H., and Guibas, L.J. (2003, January 13\u201316). Counting People in Crowds with a Real-Time Network of Simple Image Sensors. Proceedings of the Ninth IEEE International Conference on Computer Vision, Nice, France.","DOI":"10.1109\/ICCV.2003.1238325"},{"key":"ref_15","unstructured":"Wang, M., Li, W., and Wang, X. (2012, January 16\u201321). Transferring a generic pedestrian detector towards specific scenes. Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, Providence, RI, USA."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Wang, M., and Wang, X. (2011, January 20\u201325). Automatic adaptation of a generic pedestrian detector to a specific traffic scene. Proceedings of the CVPR 2011, Colorado Springs, CO, USA.","DOI":"10.1109\/CVPR.2011.5995698"},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Chan, A.B., Liang, Z.S.J., and Vasconcelos, N. (2008, January 23\u201328). Privacy preserving crowd monitoring: Counting people without people models or tracking. Proceedings of the 2008 IEEE Conference on Computer Vision and Pattern Recognition, Anchorage, Alaska.","DOI":"10.1109\/CVPR.2008.4587569"},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Shehzed, A., Jalal, A., and Kim, K. (2019, January 27\u201329). Multi-person tracking in smart surveillance system for crowd counting and normal\/abnormal events detection. Proceedings of the 2019 International Conference on Applied and Engineering Mathematics (ICAEM), Taxila, Pakistan.","DOI":"10.1109\/ICAEM.2019.8853756"},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"e4017","DOI":"10.1002\/ett.4017","article-title":"Deep learning recognition of diseased and normal cell representation","volume":"32","author":"Iqbal","year":"2021","journal-title":"Trans. Emerg. Telecommun. Technol."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"113","DOI":"10.1007\/s12596-018-0508-4","article-title":"Efficient cell classification of mitochondrial images by using deep learning","volume":"48","author":"Iqbal","year":"2019","journal-title":"J. Opt."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Wang, C., Zhang, H., Yang, L., Liu, S., and Cao, X. (2015, January 26\u201330). Deep people counting in extremely dense crowds. Proceedings of the 23rd ACM International Conference on Multimedia, Brisbane, Australia.","DOI":"10.1145\/2733373.2806337"},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"81","DOI":"10.1016\/j.engappai.2015.04.006","article-title":"Fast crowd density estimation with convolutional neural networks","volume":"43","author":"Fu","year":"2015","journal-title":"Eng. Appl. Artif. Intell."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Zhang, Y., Zhou, D., Chen, S., Gao, S., and Ma, Y. (2016, January 27\u201330). Single-image crowd counting via multi-column convolutional neural network. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.70"},{"key":"ref_24","doi-asserted-by":"crossref","unstructured":"Boominathan, L., Kruthiventi, S.S., and Babu, R.V. (2016, January 15\u201319). Crowdnet: A deep convolutional network for dense crowd counting. Proceedings of the 24th ACM International Conference on Multimedia, Amsterdam, The Netherlands.","DOI":"10.1145\/2964284.2967300"},{"key":"ref_25","unstructured":"Zhang, A., Shen, J., Xiao, Z., Zhu, F., Zhen, X., Cao, X., and Shao, L. (November, January 27). Relational attention network for crowd counting. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Seoul, Korea."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Zhang, L., Shi, M., and Chen, Q. (2018, January 12\u201315). Crowd counting via scale-adaptive convolutional neural network. Proceedings of the 2018 IEEE Winter Conference on Applications of Computer Vision (WACV), Lake Tahoe, NV, USA.","DOI":"10.1109\/WACV.2018.00127"},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Li, Y., Zhang, X., and Chen, D. (2018, January 18\u201322). Csrnet: Dilated convolutional neural networks for understanding the highly congested scenes. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00120"},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Jiang, X., Xiao, Z., Zhang, B., Zhen, X., Cao, X., Doermann, D., and Shao, L. (2019, January 15\u201320). Crowd counting and density estimation by trellis encoder-decoder networks. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00629"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Shi, Z., Zhang, L., Liu, Y., Cao, X., Ye, Y., Cheng, M.M., and Zheng, G. (2018, January 18\u201322). Crowd counting with deep negative correlation learning. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00564"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"2897","DOI":"10.1007\/s00521-018-3810-9","article-title":"Single-column CNN for crowd counting with pixel-wise attention mechanism","volume":"32","author":"Wang","year":"2020","journal-title":"Neural Comput. Appl."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Loy, C.C., Chen, K., Gong, S., and Xiang, T. (2013). Crowd counting and profiling: Methodology and evaluation. Modeling, Simulation and Visual Analysis of Crowds, Springer.","DOI":"10.1007\/978-1-4614-8483-7_14"},{"key":"ref_32","doi-asserted-by":"crossref","first-page":"1187","DOI":"10.1109\/ICPR.2006.197","article-title":"A viewpoint invariant approach for crowd counting","volume":"Volume 3","author":"Kong","year":"2006","journal-title":"Proceedings of the 18th International Conference on Pattern Recognition (ICPR\u201906)"},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"703","DOI":"10.1109\/IAEAC50856.2021.9390989","article-title":"Switching Detection and Density Regression Network for Crowd Counting","volume":"Volume 5","author":"Song","year":"2021","journal-title":"Proceedings of the 2021 IEEE 5th Advanced Information Technology, Electronic and Automation Control Conference (IAEAC)"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Menon, A., Omman, B., and Asha, S. (2021, January 11\u201312). Pedestrian Counting Using Yolo V3. Proceedings of the 2021 International Conference on Innovative Trends in Information Technology (ICITIIT), Kottayam, India.","DOI":"10.1109\/ICITIIT51526.2021.9399607"},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"2188","DOI":"10.1109\/TPAMI.2011.70","article-title":"Hough forests for object detection, tracking, and action recognition","volume":"33","author":"Gall","year":"2011","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Li, M., Zhang, Z., Huang, K., and Tan, T. (2008, January 8\u201311). Estimating the number of people in crowded scenes by mid based foreground segmentation and head-shoulder detection. Proceedings of the 2008 19th International Conference on Pattern Recognition, Tampa, FL, USA.","DOI":"10.1109\/ICPR.2008.4761705"},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Subburaman, V.B., Descamps, A., and Carincotte, C. (2012, January 18\u201321). Counting people in the crowd using a generic head detector. Proceedings of the 2012 IEEE Ninth International Conference on Advanced Video and Signal-Based Surveillance, Beijing, China.","DOI":"10.1109\/AVSS.2012.87"},{"key":"ref_38","first-page":"3","article-title":"Feature mining for localised crowd counting","volume":"1","author":"Chen","year":"2012","journal-title":"Bmvc"},{"key":"ref_39","unstructured":"Marana, A., Costa, L.d.F., Lotufo, R., and Velastin, S. (1998, January 20\u201323). On the efficacy of texture analysis for crowd monitoring. Proceedings of the SIBGRAPI\u201998. International Symposium on Computer Graphics, Image Processing, and Vision (Cat. No. 98EX237), Rio de Janeiro, Brazil."},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Ryan, D., Denman, S., Fookes, C., and Sridharan, S. (2009, January 1\u20133). Crowd counting using multiple local features. Proceedings of the 2009 Digital Image Computing: Techniques and Applications, Melbourne, Australia.","DOI":"10.1109\/DICTA.2009.22"},{"key":"ref_41","first-page":"I","article-title":"A MRF-based approach for real-time subway monitoring","volume":"Volume 1","author":"Paragios","year":"2001","journal-title":"Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001"},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Tian, Y., Sigal, L., Badino, H., Torre, F.D.l., and Liu, Y. (2010, January 8\u201312). Latent gaussian mixture regression for human pose estimation. Proceedings of the Asian Conference on Computer Vision, Queenstown, New Zealand.","DOI":"10.1007\/978-3-642-19318-7_53"},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Idrees, H., Saleemi, I., Seibert, C., and Shah, M. (2013, January 23\u201328). Multi-source multi-scale counting in extremely dense crowd images. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Portland, OR, USA.","DOI":"10.1109\/CVPR.2013.329"},{"key":"ref_44","doi-asserted-by":"crossref","first-page":"2160","DOI":"10.1109\/TIP.2011.2172800","article-title":"Counting people with low-level features and Bayesian regression","volume":"21","author":"Chan","year":"2011","journal-title":"IEEE Trans. Image Process."},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Benabbas, Y., Ihaddadene, N., Yahiaoui, T., Urruty, T., and Djeraba, C. (September, January 29). Spatio-temporal optical flow analysis for people counting. Proceedings of the 2010 7th IEEE International Conference on Advanced Video and Signal Based Surveillance, Boston, MA, USA.","DOI":"10.1109\/AVSS.2010.29"},{"key":"ref_46","unstructured":"Lin, T.Y., Lin, Y.Y., Weng, M.F., Wang, Y.C., Hsu, Y.F., and Liao, H.Y.M. (December, January 29). Cross camera people counting with perspective estimation and occlusion handling. Proceedings of the 2011 IEEE International Workshop on Information Forensics and Security, Iguacu Falls, Brazil."},{"key":"ref_47","doi-asserted-by":"crossref","first-page":"47","DOI":"10.1016\/0165-1684(96)00075-8","article-title":"Distributed data fusion for real-time crowding estimation","volume":"53","author":"Regazzoni","year":"1996","journal-title":"Signal Process."},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"705","DOI":"10.1109\/CVPR.2006.92","article-title":"Counting crowded moving objects","volume":"Volume 1","author":"Rabaud","year":"2006","journal-title":"Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR\u201906)"},{"key":"ref_49","doi-asserted-by":"crossref","first-page":"103","DOI":"10.1016\/j.engappai.2015.01.007","article-title":"Recent survey on crowd density estimation and counting for visual surveillance","volume":"41","author":"Saleh","year":"2015","journal-title":"Eng. Appl. Artif. Intell."},{"key":"ref_50","doi-asserted-by":"crossref","first-page":"594","DOI":"10.1109\/CVPR.2006.320","article-title":"Unsupervised bayesian detection of independent motion in crowds","volume":"Volume 1","author":"Brostow","year":"2006","journal-title":"Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR\u201906)"},{"key":"ref_51","doi-asserted-by":"crossref","first-page":"1533","DOI":"10.1007\/s00371-014-1032-4","article-title":"Estimation of crowd density by clustering motion cues","volume":"31","author":"Rao","year":"2015","journal-title":"Vis. Comput."},{"key":"ref_52","unstructured":"Antonini, G., and Thiran, J. (2004). Trajectories Clustering in ICA Space: An Application to Automatic Counting of Pedestrians in Video Sequences, IEEE. Technical Report."},{"key":"ref_53","unstructured":"O\u2019Shea, K., and Nash, R. (2015). An introduction to convolutional neural networks. arXiv."},{"key":"ref_54","doi-asserted-by":"crossref","unstructured":"Albawi, S., Mohammed, T.A., and Al-Zawi, S. (2017, January 21\u201323). Understanding of a convolutional neural network. Proceedings of the 2017 International Conference on Engineering and Technology (ICET), Antalya, Turkey.","DOI":"10.1109\/ICEngTechnol.2017.8308186"},{"key":"ref_55","first-page":"2-D","article-title":"Classification of image using convolutional neural network (CNN)","volume":"19","author":"Hossain","year":"2019","journal-title":"Glob. J. Comput. Sci. Technol."},{"key":"ref_56","doi-asserted-by":"crossref","first-page":"110722","DOI":"10.1109\/ACCESS.2019.2934354","article-title":"Blind channel identification aided generalized automatic modulation recognition based on deep learning","volume":"7","author":"Gu","year":"2019","journal-title":"IEEE Access"},{"key":"ref_57","doi-asserted-by":"crossref","first-page":"104242","DOI":"10.1016\/j.imavis.2021.104242","article-title":"CFFNet: Coordinated feature fusion network for crowd counting","volume":"112","author":"Xia","year":"2021","journal-title":"Image Vis. Comput."},{"key":"ref_58","doi-asserted-by":"crossref","first-page":"118","DOI":"10.1016\/j.image.2018.03.004","article-title":"Counting challenging crowds robustly using a multi-column multi-task convolutional neural network","volume":"64","author":"Yang","year":"2018","journal-title":"Signal Process. Image Commun."},{"key":"ref_59","unstructured":"Zhang, C., Li, H., Wang, X., and Yang, X. (2015, January 7\u201312). Cross-scene crowd counting via deep convolutional neural networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA."},{"key":"ref_60","doi-asserted-by":"crossref","unstructured":"Idrees, H., Tayyab, M., Athrey, K., Zhang, D., Al-Maadeed, S., Rajpoot, N., and Shah, M. (2018, January 8\u201314). Composition loss for counting, density map estimation and localization in dense crowds. Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany.","DOI":"10.1007\/978-3-030-01216-8_33"},{"key":"ref_61","doi-asserted-by":"crossref","unstructured":"Zeng, L., Xu, X., Cai, B., Qiu, S., and Zhang, T. (2017, January 17\u201320). Multi-scale convolutional neural networks for crowd counting. Proceedings of the 2017 IEEE International Conference on Image Processing (ICIP), Beijing, China.","DOI":"10.1109\/ICIP.2017.8296324"},{"key":"ref_62","doi-asserted-by":"crossref","unstructured":"Xiong, F., Shi, X., and Yeung, D.Y. (2017, January 22\u201329). Spatiotemporal modeling for crowd counting in videos. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.","DOI":"10.1109\/ICCV.2017.551"},{"key":"ref_63","doi-asserted-by":"crossref","unstructured":"Liu, J., Gao, C., Meng, D., and Hauptmann, A.G. (2018, January 18\u201322). Decidenet: Counting varying density crowds through attention guided detection and density estimation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA.","DOI":"10.1109\/CVPR.2018.00545"},{"key":"ref_64","doi-asserted-by":"crossref","unstructured":"Shi, M., Yang, Z., Xu, C., and Chen, Q. (2019, January 15\u201320). Revisiting perspective information for efficient crowd counting. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00745"},{"key":"ref_65","doi-asserted-by":"crossref","unstructured":"Hossain, M., Hosseinzadeh, M., Chanda, O., and Wang, Y. (2019, January 7\u201311). Crowd counting using scale-aware attention networks. Proceedings of the 2019 IEEE Winter Conference on Applications of Computer Vision (WACV), Waikoloa Village, HI, USA.","DOI":"10.1109\/WACV.2019.00141"},{"key":"ref_66","unstructured":"Yan, Z., Yuan, Y., Zuo, W., Tan, X., Wang, Y., Wen, S., and Ding, E. (November, January 27). Perspective-guided convolution networks for crowd counting. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Seoul, Korea."},{"key":"ref_67","doi-asserted-by":"crossref","unstructured":"Bai, S., He, Z., Qiao, Y., Hu, H., Wu, W., and Yan, J. (2020, January 13\u201319). Adaptive dilated network with self-correction supervision for counting. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00465"},{"key":"ref_68","doi-asserted-by":"crossref","unstructured":"Song, Q., Wang, C., Wang, Y., Tai, Y., Wang, C., Li, J., Wu, J., and Ma, J. (2021, January 2\u20139). To choose or to fuse? Scale selection for crowd counting. Proceedings of the AAAI Conference on Artificial Intelligence, Virtual.","DOI":"10.1609\/aaai.v35i3.16360"}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/22\/14\/5286\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T23:50:55Z","timestamp":1760140255000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/22\/14\/5286"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,7,15]]},"references-count":68,"journal-issue":{"issue":"14","published-online":{"date-parts":[[2022,7]]}},"alternative-id":["s22145286"],"URL":"https:\/\/doi.org\/10.3390\/s22145286","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,7,15]]}}}