{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,22]],"date-time":"2026-04-22T17:54:32Z","timestamp":1776880472358,"version":"3.51.2"},"reference-count":82,"publisher":"Springer Science and Business Media LLC","issue":"15","license":[{"start":{"date-parts":[[2023,10,19]],"date-time":"2023-10-19T00:00:00Z","timestamp":1697673600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,10,19]],"date-time":"2023-10-19T00:00:00Z","timestamp":1697673600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/100009567","name":"Budapest University of Technology and Economics","doi-asserted-by":"crossref","id":[{"id":"10.13039\/100009567","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Multimed Tools Appl"],"abstract":"<jats:title>Abstract<\/jats:title><jats:p>The fast improvement of deep learning methods resulted in breakthroughs in image classification, object detection, and object tracking. Autonomous driving and traffic monitoring systems, especially the on-premise installed fixed position multi-camera configurations, benefit greatly from recent advances. In this paper, we propose a Multi-Camera Multi-Target (MCMT) vehicle tracking system using a constrained hierarchical clustering solution, which improves trajectory matching, and thus provides a more robust tracking of objects transitioning between cameras. YOLOv5, ByteTrack, and ResNet50-IBN ReID networks are used for vehicle detection and tracking. Static attributes such as vehicle type and vehicle color are determined from ReID features with SVM. The proposed ReID feature-based attribute categorization shows better performance, than its pure CNN counterpart. 
Single-camera trajectories (SCTs) are combined into multi-camera trajectories (MCTs) using hierarchical agglomerative clustering (HAC) with time and space constraints (our proposed algorithm is denoted by MCT#MAC). Similarities between SCTs are measured by comparing the mean ReID features cumulated on the trajectory. The system was evaluated on several datasets, and our experiments demonstrate that constraining HAC by manipulating the proximity matrix greatly improves the multi-camera IDF1 score.<\/jats:p>","DOI":"10.1007\/s11042-023-17397-0","type":"journal-article","created":{"date-parts":[[2023,10,19]],"date-time":"2023-10-19T07:02:53Z","timestamp":1697698973000},"page":"44879-44902","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":8,"title":["Multi-camera trajectory matching based on hierarchical clustering and constraints"],"prefix":"10.1007","volume":"83","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-5781-1088","authenticated-orcid":false,"given":"G\u00e1bor","family":"Sz\u0171cs","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Reg\u0151","family":"Borsodi","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8814-2745","authenticated-orcid":false,"given":"D\u00e1vid","family":"Papp","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2023,10,19]]},"reference":[{"key":"17397_CR1","doi-asserted-by":"publisher","unstructured":"Amosa TI, Sebastian P, Izhar LI, Ibrahim O, Ayinla Bahashwan AA, Bala A, Samaila YA (2023) Multi-camera multi-object tracking: a review of current trends and future advances. Neurocomputing, Volume 552, 126558. 
https:\/\/doi.org\/10.1016\/j.neucom.2023.126558","DOI":"10.1016\/j.neucom.2023.126558"},{"key":"17397_CR2","doi-asserted-by":"publisher","first-page":"6653","DOI":"10.1007\/s11042-021-11804-0","volume":"81","author":"E Av\u015far","year":"2022","unstructured":"Av\u015far E, Av\u015far Y\u00d6 (2022) Moving vehicle detection and tracking at roundabouts using deep learning with trajectory union. Multimed Tools Appl 81:6653\u20136680. https:\/\/doi.org\/10.1007\/s11042-021-11804-0","journal-title":"Multimed Tools Appl"},{"key":"17397_CR3","doi-asserted-by":"crossref","unstructured":"Bergmann P, Meinhardt T, Leal-Taixe L (2019) Tracking without bells and whistles. In Proceedings of the IEEE\/CVF International Conference on Computer Vision, Seoul, Korea (South), pp. 941\u2013951. https:\/\/doi.org\/10.1109\/ICCV.2019.00103","DOI":"10.1109\/ICCV.2019.00103"},{"key":"17397_CR4","doi-asserted-by":"publisher","unstructured":"Bewley A, Ge Z, Ott L, Ramos F, Upcroft B (2016) Simple online and realtime tracking. In 2016 IEEE International Conference on Image Processing (ICIP), Phoenix, AZ, USA, pp. 3464\u20133468. https:\/\/doi.org\/10.1109\/ICIP.2016.7533003","DOI":"10.1109\/ICIP.2016.7533003"},{"key":"17397_CR5","doi-asserted-by":"publisher","unstructured":"Bochinski E, Eiselein V, Sikora T (2017) High-speed tracking-by-detection without using image information. In 14th IEEE international Conference on Advanced Video and Signal Based Surveillance (AVSS) Lecce, Italy, 2017, pp. 1\u20136. https:\/\/doi.org\/10.1109\/AVSS.2017.8078516","DOI":"10.1109\/AVSS.2017.8078516"},{"key":"17397_CR6","doi-asserted-by":"publisher","unstructured":"Cao L, Chen W, Chen X, Zheng S, Huang K (2015) An equalised global graphical model-based approach for multi-camera object tracking. arXiv preprint arXiv:1502.03532 , 8. 
https:\/\/doi.org\/10.48550\/arXiv.1502.03532","DOI":"10.48550\/arXiv.1502.03532"},{"issue":"3","key":"17397_CR7","doi-asserted-by":"publisher","first-page":"1840","DOI":"10.1109\/TITS.2020.3025687","volume":"22","author":"C Chen","year":"2021","unstructured":"Chen C, Liu B, Wan S, Qiao P, Pei Q (2021) An edge traffic flow detection scheme based on deep learning in an intelligent transportation system. IEEE Trans Intell Transp Syst 22(3):1840\u20131852. https:\/\/doi.org\/10.1109\/TITS.2020.3025687","journal-title":"IEEE Trans Intell Transp Syst"},{"key":"17397_CR8","doi-asserted-by":"publisher","first-page":"103432","DOI":"10.1016\/j.jvcir.2021.103432","volume":"83","author":"Y Chen","year":"2022","unstructured":"Chen Y, Ke W, Lin H, Lam CT, Lv K, Sheng H, Xiong Z (2022) Local perspective based synthesis for vehicle re-identification: A transformation state adversarial method. J Vis Commun Image Represent 83:103432. https:\/\/doi.org\/10.1016\/j.jvcir.2021.103432","journal-title":"J Vis Commun Image Represent"},{"key":"17397_CR9","doi-asserted-by":"publisher","first-page":"845","DOI":"10.1007\/s11263-020-01393-0","volume":"129","author":"P Dendorfer","year":"2021","unstructured":"Dendorfer P, Osep A, Milan A, Schindler K, Cremers D, Reid I, Roth S, Leal-Taix\u00e9 L (2021) Motchallenge: A benchmark for single-camera multiple target tracking. Int J Comput Vision 129:845\u2013881. https:\/\/doi.org\/10.1007\/s11263-020-01393-0","journal-title":"Int J Comput Vision"},{"key":"17397_CR10","doi-asserted-by":"publisher","unstructured":"Girshick R (2015) Fast R-CNN, IEEE International Conference on Computer Vision (ICCV), Santiago, Chile, 2015, pp. 1440\u20131448. https:\/\/doi.org\/10.1109\/ICCV.2015.169","DOI":"10.1109\/ICCV.2015.169"},{"key":"17397_CR11","doi-asserted-by":"publisher","unstructured":"Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. 
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Columbus, OH, USA, 2014, pp. p587. https:\/\/doi.org\/10.1109\/CVPR.2014.81","DOI":"10.1109\/CVPR.2014.81"},{"key":"17397_CR12","doi-asserted-by":"publisher","unstructured":"Gong S, Xiang T (2011) Person Re-identification. In: Visual Analysis of Behaviour. Springer, London. https:\/\/doi.org\/10.1007\/978-0-85729-670-2_14","DOI":"10.1007\/978-0-85729-670-2_14"},{"key":"17397_CR13","doi-asserted-by":"publisher","unstructured":"He K, Gkioxari G, Doll\u00e1r P, Girshick R (2017) Mask R-CNN, IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 2017, pp. 2980\u20132988. https:\/\/doi.org\/10.1109\/ICCV.2017.322","DOI":"10.1109\/ICCV.2017.322"},{"key":"17397_CR14","doi-asserted-by":"publisher","unstructured":"He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In Proceedings of the IEEE conference on Computer Vision and Pattern Recognition (CVPR). Las Vegas, NV, USA: IEEE. pp. 770\u2013778. https:\/\/doi.org\/10.1109\/CVPR.2016.90","DOI":"10.1109\/CVPR.2016.90"},{"key":"17397_CR15","unstructured":"He Z, Lei Y, Bai S, Wu W (2019) Multi-camera vehicle tracking with powerful visual features and spatial-temporal cue. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, Long Beach, CA, USA, 15\u201320 June 2019, pp. 203\u2013212"},{"key":"17397_CR16","unstructured":"Hou Y, Du H, Zheng L (2019) A locality aware city-scale multi-camera vehicle tracking system. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition Workshops, Long Beach, CA, USA, 15\u201320 June 2019, pp. 
167\u2013174"},{"key":"17397_CR17","doi-asserted-by":"publisher","first-page":"5198","DOI":"10.1109\/TIP.2021.3078124","volume":"30","author":"HM Hsu","year":"2021","unstructured":"Hsu HM, Cai J, Wang Y, Hwang JN, Kim KJ (2021) Multi-target multi-camera tracking of vehicles using metadata-aided re-id and trajectory-based camera link model. IEEE Trans Image Process 30:5198\u20135210. https:\/\/doi.org\/10.1109\/TIP.2021.3078124","journal-title":"IEEE Trans Image Process"},{"key":"17397_CR18","unstructured":"Hsu HM, Huang TW, Wang G, Cai J, Lei Z, Hwang JN (2019) Multi-camera tracking of vehicles based on deep features re-id and trajectory-based camera link models. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, Long Beach, CA, USA, 15\u201320 June 2019, pp. 416\u2013424"},{"key":"17397_CR19","doi-asserted-by":"publisher","unstructured":"Huang G, Liu Z, Van Der Maaten L, Weinberger KQ (2017) Densely connected convolutional networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 2017, pp. 2261\u20132269. https:\/\/doi.org\/10.1109\/CVPR.2017.243","DOI":"10.1109\/CVPR.2017.243"},{"issue":"2","key":"17397_CR20","doi-asserted-by":"publisher","first-page":"146","DOI":"10.1016\/j.cviu.2007.01.003","volume":"109","author":"O Javed","year":"2008","unstructured":"Javed O, Shafique K, Rasheed Z, Shah M (2008) Modeling inter-camera space\u2013time and appearance relationships for tracking across non-overlapping views. Comput Vis Image Underst 109(2):146\u2013162","journal-title":"Comput Vis Image Underst"},{"key":"17397_CR21","doi-asserted-by":"publisher","first-page":"1294","DOI":"10.1109\/TMM.2022.3141267","volume":"25","author":"M Jia","year":"2023","unstructured":"Jia M, Cheng X, Lu S, Zhang J (2023) Learning disentangled representation implicitly via transformer for occluded person re-identification. IEEE Trans Multimedia 25:1294\u20131305. 
https:\/\/doi.org\/10.1109\/TMM.2022.3141267","journal-title":"IEEE Trans Multimedia"},{"key":"17397_CR22","doi-asserted-by":"crossref","unstructured":"Jocher G (2020) YOLOv5 by Ultralytics. https:\/\/github.com\/ultralytics\/yolov5. Accessed\u00a01 Aug 2023","DOI":"10.1155\/2023\/9757050"},{"issue":"1","key":"17397_CR23","doi-asserted-by":"publisher","first-page":"35","DOI":"10.1115\/1.3662552","volume":"82","author":"RE Kalman","year":"1960","unstructured":"Kalman RE (1960) A new approach to linear filtering and prediction problems. J Basic Eng 82(1):35\u201345. https:\/\/doi.org\/10.1115\/1.3662552","journal-title":"J Basic Eng"},{"key":"17397_CR24","doi-asserted-by":"publisher","unstructured":"Kanac\u0131 A, Zhu X, Gong S (2019) Vehicle re-identification in context. In Pattern Recognition: 40th German Conference, GCPR 2018, Stuttgart, Germany, October 9\u201312, 2018, Proceedings 40 (pp. 377\u2013390). Springer International Publishing. https:\/\/doi.org\/10.48550\/arXiv.1809.09409","DOI":"10.48550\/arXiv.1809.09409"},{"key":"17397_CR25","doi-asserted-by":"publisher","first-page":"50","DOI":"10.1016\/j.cviu.2019.03.001","volume":"182","author":"SD Khan","year":"2019","unstructured":"Khan SD, Ullah H (2019) A survey of advances in vision-based vehicle re-identification. Comput Vis Image Underst 182:50\u201363. https:\/\/doi.org\/10.1016\/j.cviu.2019.03.001","journal-title":"Comput Vis Image Underst"},{"key":"17397_CR26","doi-asserted-by":"publisher","unstructured":"Khorramshahi P, Peri N, Chen JC, Chellappa R (2020) The devil is in the details: Self-supervised Attention for Vehicle Re-identification. In: Vedaldi A, Bischof H, Brox T, Frahm JM. (eds) Computer Vision \u2013 ECCV 2020. ECCV 2020. Lecture Notes in Computer Science, vol 12359. pp. 369\u2013386. Springer, Cham. 
https:\/\/doi.org\/10.1007\/978-3-030-58568-6_22","DOI":"10.1007\/978-3-030-58568-6_22"},{"key":"17397_CR27","doi-asserted-by":"publisher","first-page":"83","DOI":"10.1002\/nav.3800020109","volume":"2","author":"HW Kuhn","year":"1955","unstructured":"Kuhn HW (1955) The Hungarian method for the assignment problem. Naval Res Logist Q 2:83\u201397. https:\/\/doi.org\/10.1002\/nav.3800020109","journal-title":"Naval Res Logist Q"},{"key":"17397_CR28","doi-asserted-by":"publisher","unstructured":"Kuma R, Weill E, Aghdasi F, Sriram P (2019) Vehicle Re-identification: an Efficient Baseline Using Triplet Embedding, International Joint Conference on Neural Networks (IJCNN), Budapest, Hungary, 2019, pp. 1\u20139, IEEE. https:\/\/doi.org\/10.1109\/IJCNN.2019.8852059","DOI":"10.1109\/IJCNN.2019.8852059"},{"issue":"1","key":"17397_CR29","doi-asserted-by":"publisher","first-page":"27","DOI":"10.2478\/jaiscr-2020-0003","volume":"10","author":"R Kumar","year":"2020","unstructured":"Kumar R, Weill E, Aghdasi F, Sriram P (2020) A strong and efficient baseline for vehicle re-identification using deep triplet embedding. J Artif Intell Soft Comput Res 10(1):27\u201345. https:\/\/doi.org\/10.2478\/jaiscr-2020-0003","journal-title":"J Artif Intell Soft Comput Res"},{"key":"17397_CR30","doi-asserted-by":"publisher","unstructured":"Li F, Wang Z, Nie D, Zhang S, Jiang X, Zhao X, Hu P (2022) Multi-camera vehicle tracking system for AI City Challenge 2022. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA, 2022, pp. 3264-3272. https:\/\/doi.org\/10.1109\/CVPRW56347.2022.00369","DOI":"10.1109\/CVPRW56347.2022.00369"},{"key":"17397_CR31","unstructured":"Li P, Li G, Yan Z, Li Y, Lu M, Xu P, Gu Y, Bai B, Zhang Y, Chuxing D (2019) Spatio-temporal consistency and hierarchical matching for multi-target multi-camera vehicle tracking. 
In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, Long Beach, CA, USA, 15\u201320 June 2019, pp. 222\u2013230"},{"key":"17397_CR32","doi-asserted-by":"publisher","unstructured":"Lin TY, Doll\u00e1r P, Girshick R, He K, Hariharan B, Belongie S (2017) Feature pyramid networks for object detection, IEEE conference on computer vision and pattern recognition (CVPR), Honolulu, HI, USA, 2017, pp. 936\u2013944. https:\/\/doi.org\/10.1109\/CVPR.2017.106","DOI":"10.1109\/CVPR.2017.106"},{"key":"17397_CR33","doi-asserted-by":"publisher","unstructured":"Liu C, Zhang Y, Luo H, Tang J, Chen W, Xu X, Wang F, Li H, Shen YD (2021) City-scale multi-camera vehicle tracking guided by crossroad zones, In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Nashville, TN, USA, 2021, pp. 4124\u20134132. https:\/\/doi.org\/10.1109\/CVPRW53098.2021.00466","DOI":"10.1109\/CVPRW53098.2021.00466"},{"key":"17397_CR34","doi-asserted-by":"publisher","unstructured":"Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu CY, Berg AC (2016) SSD: Single shot multibox detector. In: Leibe B, Matas J, Sebe N, Welling M (eds) Computer Vision \u2013 ECCV 2016. Lecture Notes in Computer Science, vol 9905, pp. 21\u201337, Springer, Cham. https:\/\/doi.org\/10.1007\/978-3-319-46448-0_2","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"17397_CR35","doi-asserted-by":"publisher","unstructured":"Lou Y, Bai Y, Liu J, Wang S, Duan L (2019) Veri-wild: A large dataset and a new method for vehicle re-identification in the wild. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 3235\u20133243). 
https:\/\/doi.org\/10.1109\/CVPR.2019.00335","DOI":"10.1109\/CVPR.2019.00335"},{"issue":"5","key":"17397_CR36","doi-asserted-by":"publisher","first-page":"7063","DOI":"10.1007\/s11042-022-11923-2","volume":"81","author":"E Luna","year":"2022","unstructured":"Luna E, SanMiguel JC, Mart\u00ednez JM, Escudero-Vinolo M (2022) Online clustering-based multi-camera vehicle tracking in scenarios with overlapping FOVs. Multimed Tools Appl 81(5):7063\u20137083","journal-title":"Multimed Tools Appl"},{"key":"17397_CR37","doi-asserted-by":"publisher","unstructured":"Luo H, et al (2021) An empirical study of vehicle re-identification on the AI city challenge, IEEE\/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Nashville, TN, USA, 2021, pp. 4090\u20134097. https:\/\/doi.org\/10.1109\/CVPRW53098.2021.00462","DOI":"10.1109\/CVPRW53098.2021.00462"},{"key":"17397_CR38","doi-asserted-by":"publisher","unstructured":"Luo H, Gu Y, Liao X, Lai S, Jiang W (2019) Bag of tricks and a strong baseline for deep person re-identification, IEEE\/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Long Beach, CA, USA, 2019, pp. 1487\u20131495. https:\/\/doi.org\/10.1109\/CVPRW.2019.00190","DOI":"10.1109\/CVPRW.2019.00190"},{"key":"17397_CR39","doi-asserted-by":"publisher","first-page":"7077","DOI":"10.1007\/s11042-018-6467-6","volume":"78","author":"N Mahmoudi","year":"2019","unstructured":"Mahmoudi N, Ahadi SM, Rahmati M (2019) Multi-target tracking using CNN-based features: CNNMTT. Multimed Tools Appl 78:7077\u20137096. https:\/\/doi.org\/10.1007\/s11042-018-6467-6","journal-title":"Multimed Tools Appl"},{"key":"17397_CR40","doi-asserted-by":"publisher","first-page":"28347","DOI":"10.1007\/s11042-022-12715-4","volume":"81","author":"M Othmani","year":"2022","unstructured":"Othmani M (2022) A vehicle detection and tracking method for traffic video based on faster R-CNN. Multimed Tools Appl 81:28347\u201328365. 
https:\/\/doi.org\/10.1007\/s11042-022-12715-4","journal-title":"Multimed Tools Appl"},{"key":"17397_CR41","doi-asserted-by":"publisher","unstructured":"Pan H, Wang Y, Sz\u0171cs G (2022) Work-traffic crashes and aberrant driving behaviors among full-time ride-hailing and taxi drivers: a comparative study. Transportation Letters. https:\/\/doi.org\/10.1080\/19427867.2022.2157075","DOI":"10.1080\/19427867.2022.2157075"},{"key":"17397_CR42","doi-asserted-by":"publisher","unstructured":"Pan X, Luo P, Shi J, Tang X (2018) Two at Once: Enhancing Learning and Generalization Capacities via IBN-Net. In: Ferrari V, Hebert M, Sminchisescu C, Weiss Y (eds) Computer Vision \u2013 ECCV 2018. Lecture Notes in Computer Science, vol 11208. pp. 464\u2013479, Springer, Cham. https:\/\/doi.org\/10.1007\/978-3-030-01225-0_29","DOI":"10.1007\/978-3-030-01225-0_29"},{"key":"17397_CR43","doi-asserted-by":"publisher","unstructured":"Papp D, Borsodi R (2022) Determining Hybrid re-id features of vehicles in videos for transport analysis. Infocommunications J 4(1);17\u201323.\u00a0\u00a0https:\/\/doi.org\/10.36244\/ICJ.2022.1.3","DOI":"10.36244\/ICJ.2022.1.3"},{"key":"17397_CR44","unstructured":"Papp D, Lovas D, Sz\u0171cs G (2016) Object detection, classification, tracking and individual recognition for sea images and videos. In CLEF (Working Notes) pp. 525\u2013533"},{"key":"17397_CR45","unstructured":"Papp D, Mogyor\u00f3si F, Sz\u0171cs G (2017) Image matching for individual recognition with SIFT, RANSAC and MCL. In CLEF (Working Notes)"},{"key":"17397_CR46","doi-asserted-by":"crossref","unstructured":"Qian Y, Yu L, Liu W, Hauptmann A (2020) Electricity: An efficient multi-camera vehicle tracking system for intelligent city. 2020 IEEE. In CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 
2511\u20132519","DOI":"10.1109\/CVPRW50498.2020.00302"},{"key":"17397_CR47","doi-asserted-by":"publisher","unstructured":"Redmon J, Divvala S, Girshick R, Farhadi A (2016) You only look once: Unified, real-time object detection, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 2016, pp. 779\u2013788. https:\/\/doi.org\/10.1109\/CVPR.2016.91.","DOI":"10.1109\/CVPR.2016.91"},{"key":"17397_CR48","doi-asserted-by":"publisher","first-page":"1137","DOI":"10.1109\/TPAMI.2016.2577031","volume":"39","author":"S Ren","year":"2017","unstructured":"Ren S, He K, Girshick R, Sun J (2017) Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Trans Pattern Anal Mach Intell 39:1137\u20131149. https:\/\/doi.org\/10.1109\/TPAMI.2016.2577031","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"17397_CR49","doi-asserted-by":"publisher","unstructured":"Ristani E, Tomasi C (2018) Features for multi-target multi-camera tracking and re-identification. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA, 2018, pp. 6036\u20136046. https:\/\/doi.org\/10.1109\/CVPR.2018.00632","DOI":"10.1109\/CVPR.2018.00632"},{"key":"17397_CR50","doi-asserted-by":"publisher","unstructured":"Ristani E, Solera F, Zou R, Cucchiara R, Tomasi C (2016) Performance measures and a data set for multi-target, multi-camera tracking. In: Hua G, J\u00e9gou H. (eds) Computer Vision \u2013 ECCV 2016 Workshops. Lecture Notes in Computer Science 9914:17\u201335, Springer, Cham. https:\/\/doi.org\/10.1007\/978-3-319-48881-3_2","DOI":"10.1007\/978-3-319-48881-3_2"},{"issue":"11","key":"17397_CR51","doi-asserted-by":"publisher","first-page":"9049","DOI":"10.1109\/JIOT.2021.3119525","volume":"9","author":"F Shen","year":"2022","unstructured":"Shen F, Zhu J, Zhu X, Huang J, Zeng H, Lei Z, Cai C (2022) An efficient multiresolution network for vehicle reidentification. 
IEEE Internet Things J 9(11):9049\u20139059. https:\/\/doi.org\/10.1109\/JIOT.2021.3119525","journal-title":"IEEE Internet Things J"},{"key":"17397_CR52","doi-asserted-by":"crossref","unstructured":"Specker A, Florin L, Cormier M, Beyerer J (2022) Improving multi-target multi-camera tracking by track refinement and completion. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 3199\u20133209","DOI":"10.1109\/CVPRW56347.2022.00361"},{"key":"17397_CR53","doi-asserted-by":"publisher","unstructured":"Specker A, Stadler D, Florin L, Beyerer J (2021) An occlusion-aware multi-target multi-camera tracking system, IEEE\/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Nashville, TN, USA, 2021, pp. 4168\u20134177. https:\/\/doi.org\/10.1109\/CVPRW53098.2021.00471","DOI":"10.1109\/CVPRW53098.2021.00471"},{"issue":"1","key":"17397_CR54","doi-asserted-by":"publisher","first-page":"26","DOI":"10.36244\/ICJ.2021.1.4","volume":"13","author":"G Sz\u0171cs","year":"2021","unstructured":"Sz\u0171cs G, N\u00e9meth M (2021) Double-view matching network for few-shot learning to classify covid-19 in X-ray images. Infocommunications Journal 13(1):26\u201334","journal-title":"Infocommunications Journal"},{"key":"17397_CR55","unstructured":"Sz\u0171cs G, Papp D, Lovas D (2015) SVM classification of moving objects tracked by Kalman filter and Hungarian method. In Working Notes of CLEF 2015 Conference, Toulouse, France. 10 pages"},{"key":"17397_CR56","unstructured":"Tan M, Le QV (2019) EfficientNet: Rethinking model scaling for convolutional neural networks. Proceedings of the 36th International Conference on Machine Learning, ICML 2019, Long Beach, 9\u201315 June 2019, 6105\u20136114. http:\/\/proceedings.mlr.press\/v97\/tan19a.html"},{"key":"17397_CR57","doi-asserted-by":"crossref","unstructured":"Tan M, Pang R, Le QV (2020) Efficientdet: Scalable and efficient object detection. 
In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 10781\u201310790","DOI":"10.1109\/CVPR42600.2020.01079"},{"key":"17397_CR58","doi-asserted-by":"crossref","unstructured":"Tang Z, Naphade M, Liu MY, Yang X, Birchfield S, Wang S, Kumar R, Anastasiu D, Hwang JN (2019) Cityflow: A city-scale benchmark for multi-target multi-camera vehicle tracking and re-identification. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 8797\u20138806","DOI":"10.1109\/CVPR.2019.00900"},{"key":"17397_CR59","doi-asserted-by":"crossref","unstructured":"Tang Z, Wang G, Xiao H, Zheng A, Hwang JN (2018) Single-camera and inter-camera vehicle tracking and 3D speed estimation based on fusion of visual and semantic features. In Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp. 108\u2013115","DOI":"10.1109\/CVPRW.2018.00022"},{"key":"17397_CR60","doi-asserted-by":"publisher","first-page":"154","DOI":"10.1007\/s11263-013-0620-5","volume":"104","author":"JR Uijlings","year":"2013","unstructured":"Uijlings JR, Van De Sande KE, Gevers T, Smeulders AW (2013) Selective Search for Object Recognition. Int J Comput Vision 104:154\u2013171. https:\/\/doi.org\/10.1007\/s11263-013-0620-5","journal-title":"Int J Comput Vision"},{"key":"17397_CR61","doi-asserted-by":"crossref","unstructured":"Wang CY, Bochkovskiy A, Liao HYM (2023) YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 
7464\u20137475","DOI":"10.1109\/CVPR52729.2023.00721"},{"issue":"10","key":"17397_CR62","doi-asserted-by":"publisher","first-page":"3349","DOI":"10.1109\/TPAMI.2020.2983686","volume":"43","author":"J Wang","year":"2020","unstructured":"Wang J, Sun K, Cheng T, Jiang B, Deng C, Zhao Y, Liu D, Mu Y, Tan M, Wang X, Liu W, Xiao B (2020) Deep high-resolution representation learning for visual recognition. IEEE Trans Pattern Anal Mach Intell 43(10):3349\u20133364","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"issue":"10","key":"17397_CR63","doi-asserted-by":"publisher","first-page":"6631","DOI":"10.1002\/int.22857","volume":"37","author":"X Wang","year":"2022","unstructured":"Wang X, Jin Y, Li C, Cen Y, Li Y (2022) VSLN: View-aware sphere learning network for cross-view vehicle re-identification. Int J Intell Syst 37(10):6631\u20136651. https:\/\/doi.org\/10.1002\/int.22857","journal-title":"Int J Intell Syst"},{"key":"17397_CR64","doi-asserted-by":"publisher","unstructured":"Wang Z, Zheng L, Liu Y, Li Y, Wang S (2020) Towards real-time multi-object tracking. In: Vedaldi A, Bischof H, Brox T, Frahm JM. (eds) Computer Vision \u2013 ECCV 2020. Lecture Notes in Computer Science, vol 12356. pp. 107\u2013122, Springer, Cham. https:\/\/doi.org\/10.1007\/978-3-030-58621-8_7","DOI":"10.1007\/978-3-030-58621-8_7"},{"key":"17397_CR65","doi-asserted-by":"publisher","first-page":"102907","DOI":"10.1016\/j.cviu.2020.102907","volume":"193","author":"L Wen","year":"2020","unstructured":"Wen L, Du D, Cai Z, Lei Z, Chang MC, Qi H, Lim J, Yang MH, Lyu S (2020) UA-DETRAC: A new benchmark and protocol for multi-object detection and tracking. Comput Vis Image Underst 193:102907. https:\/\/doi.org\/10.1016\/j.cviu.2020.102907","journal-title":"Comput Vis Image Underst"},{"key":"17397_CR66","doi-asserted-by":"publisher","unstructured":"Wojke N, Bewley A, Paulus D (2017) Simple online and realtime tracking with a deep association metric. 
In IEEE International Conference on Image Processing (ICIP) Beijing, China, 2017, pp. 3645\u20133649. https:\/\/doi.org\/10.1109\/ICIP.2017.8296962","DOI":"10.1109\/ICIP.2017.8296962"},{"key":"17397_CR67","doi-asserted-by":"publisher","unstructured":"Wu M, Qian Y, Wang C, Yang M (2021) A multi-camera vehicle tracking system based on city-scale vehicle re-id and spatial-temporal information. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA, 2021, pp. 4072-4081. https:\/\/doi.org\/10.1109\/CVPRW53098.2021.00460","DOI":"10.1109\/CVPRW53098.2021.00460"},{"key":"17397_CR68","doi-asserted-by":"publisher","unstructured":"Wu Y, Lin Y, Dong X, Yan Y, Ouyang W, Yang Y (2018) Exploit the unknown gradually: One-shot video-based person re-identification by stepwise learning. In Proceedings of the IEEE conference on computer vision and pattern recognition, Salt Lake City, UT, USA, pp. 5177\u20135186. https:\/\/doi.org\/10.1109\/CVPR.2018.00543","DOI":"10.1109\/CVPR.2018.00543"},{"key":"17397_CR69","doi-asserted-by":"publisher","unstructured":"Xie S, Girshick R, Doll\u00e1r P, Tu Z, He K (2017) Aggregated residual transformations for deep neural networks, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 2017, pp. 5987\u20135995. https:\/\/doi.org\/10.1109\/CVPR.2017.634","DOI":"10.1109\/CVPR.2017.634"},{"key":"17397_CR70","doi-asserted-by":"publisher","unstructured":"Yang X, Ye J, Lu J, Gong C, Jiang M, Lin X, Zhang W, Tan X, Li Y, Ye X, Ding E (2022) Box-grained reranking matching for multi-camera multi-target tracking. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA, 2022, pp. 3095-3105. 
https:\/\/doi.org\/10.1109\/CVPRW56347.2022.00349","DOI":"10.1109\/CVPRW56347.2022.00349"},{"key":"17397_CR71","doi-asserted-by":"publisher","unstructured":"Yao H, Duan Z, Xie Z, Chen J, Wu X, Xu D, Gao Y (2022) City-scale multi-camera vehicle tracking based on space-time-appearance features. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA, 2022, pp. 3309-3317. https:\/\/doi.org\/10.1109\/CVPRW56347.2022.00374","DOI":"10.1109\/CVPRW56347.2022.00374"},{"key":"17397_CR72","doi-asserted-by":"publisher","unstructured":"Ye J, et al (2021) A robust MTMC tracking system for AI-City challenge 2021. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Nashville, TN, USA, 2021, pp. 4039\u20134048. https:\/\/doi.org\/10.1109\/CVPRW53098.2021.00456","DOI":"10.1109\/CVPRW53098.2021.00456"},{"issue":"6","key":"17397_CR73","doi-asserted-by":"publisher","first-page":"2872","DOI":"10.1109\/TPAMI.2021.3054775","volume":"44","author":"M Ye","year":"2022","unstructured":"Ye M, Shen J, Lin G, Xiang T, Shao L, Hoi SC (2022) Deep learning for person re-identification: a survey and outlook. IEEE Trans Pattern Anal Mach Intell 44(6):2872\u20132893. https:\/\/doi.org\/10.1109\/TPAMI.2021.3054775","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"17397_CR74","doi-asserted-by":"publisher","unstructured":"Yu F, Li W, Li Q, Liu Y, Shi X, Yan J (2016) POI: Multiple object tracking with high performance detection and appearance feature. In: Hua, G., J\u00e9gou, H. (eds) Computer Vision \u2013 ECCV 2016 Workshops. ECCV 2016. Lecture Notes in Computer Science, vol 9914. pp. 36\u201342, Springer, Cham. 
https:\/\/doi.org\/10.1007\/978-3-319-48881-3_3","DOI":"10.1007\/978-3-319-48881-3_3"},{"key":"17397_CR75","doi-asserted-by":"publisher","unstructured":"Zhang Y, Sun P, Jiang Y, Yu D, Weng F, Yuan Z, Luo P, Liu W, Wang X (2022) ByteTrack: Multi-object Tracking by Associating Every Detection Box. In: Avidan S, Brostow G, Ciss\u00e9 M, Farinella GM, Hassner T (eds) Computer Vision \u2013 ECCV 2022. ECCV 2022. Lecture Notes in Computer Science, vol 13682. pp. 1\u201321, Springer, Cham. https:\/\/doi.org\/10.1007\/978-3-031-20047-2_1","DOI":"10.1007\/978-3-031-20047-2_1"},{"key":"17397_CR76","doi-asserted-by":"publisher","first-page":"3069","DOI":"10.1007\/s11263-021-01513-4","volume":"129","author":"Y Zhang","year":"2021","unstructured":"Zhang Y, Wang C, Wang X, Zeng W, Liu W (2021) FairMOT: On the fairness of detection and re-identification in multiple object tracking. Int J Comput Vision 129:3069\u20133087. https:\/\/doi.org\/10.1007\/s11263-021-01513-4","journal-title":"Int J Comput Vision"},{"key":"17397_CR77","doi-asserted-by":"publisher","unstructured":"Zheng L, Bie Z, Sun Y, Wang J, Su C, Wang S, Tian Q (2016) MARS: A video benchmark for large-scale person re-identification. In Computer Vision\u2013ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11\u201314, 2016, Proceedings, Part VI 14, pp. 868\u2013884. Springer International Publishing. https:\/\/doi.org\/10.1007\/978-3-319-46466-4_52","DOI":"10.1007\/978-3-319-46466-4_52"},{"key":"17397_CR78","doi-asserted-by":"publisher","unstructured":"Zheng Z, Yang X, Yu Z, Zheng L, Yang Y, Kautz J (2019) Joint discriminative and generative learning for person re-identification. In proceedings of the IEEE\/CVF conference on computer vision and pattern recognition, Long Beach, CA, USA, 2019, pp. 2133-2142. 
https:\/\/doi.org\/10.1109\/CVPR.2019.00224","DOI":"10.1109\/CVPR.2019.00224"},{"issue":"9","key":"17397_CR79","doi-asserted-by":"publisher","first-page":"5056","DOI":"10.1109\/TPAMI.2021.3069237","volume":"44","author":"K Zhou","year":"2021","unstructured":"Zhou K, Yang Y, Cavallaro A, Xiang T (2021) Learning generalisable omni-scale representations for person re-identification. IEEE Trans Pattern Anal Mach Intell 44(9):5056\u20135069. https:\/\/doi.org\/10.1109\/TPAMI.2021.3069237","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"17397_CR80","doi-asserted-by":"publisher","unstructured":"Zhou K, Yang Y, Qiao Y, Xiang T (2021) Domain generalization with mixstyle. ICLR (International Conference on Learning Representations).\u00a0https:\/\/doi.org\/10.48550\/arXiv.2104.02008","DOI":"10.48550\/arXiv.2104.02008"},{"key":"17397_CR81","doi-asserted-by":"publisher","unstructured":"Zhou X, Koltun V, Kr\u00e4henb\u00fchl P (2020) Tracking objects as points. In: Vedaldi A, Bischof H, Brox T, Frahm JM. (eds) Computer Vision \u2013 ECCV 2020. Lecture Notes in Computer Science, vol 12349. Springer, Cham. https:\/\/doi.org\/10.1007\/978-3-030-58548-8_28","DOI":"10.1007\/978-3-030-58548-8_28"},{"key":"17397_CR82","doi-asserted-by":"publisher","unstructured":"Zhu X, Luo Z, Fu P, Ji X (2020) VOC-ReID: Vehicle re-identification based on vehicle-orientation-camera. In Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition Workshops, Seattle, WA, USA, 2020, pp. 2566-2573. 
https:\/\/doi.org\/10.1109\/CVPRW50498.2020.00309","DOI":"10.1109\/CVPRW50498.2020.00309"}],"container-title":["Multimedia Tools and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11042-023-17397-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s11042-023-17397-0\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11042-023-17397-0.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,4,29]],"date-time":"2024-04-29T11:36:47Z","timestamp":1714390607000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s11042-023-17397-0"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,10,19]]},"references-count":82,"journal-issue":{"issue":"15","published-online":{"date-parts":[[2024,5]]}},"alternative-id":["17397"],"URL":"https:\/\/doi.org\/10.1007\/s11042-023-17397-0","relation":{},"ISSN":["1573-7721"],"issn-type":[{"value":"1573-7721","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,10,19]]},"assertion":[{"value":"15 May 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"16 August 2023","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"1 October 2023","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"19 October 2023","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"Not 
Applicable.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Ethical approval"}},{"value":"The authors, D\u00e1vid Papp, Reg\u0151 Borsodi, and G\u00e1bor Sz\u0171cs, declare that they have no conflict of interest.","order":3,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}]}}