{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,24]],"date-time":"2026-02-24T02:55:00Z","timestamp":1771901700195,"version":"3.50.1"},"reference-count":49,"publisher":"Springer Science and Business Media LLC","issue":"4","license":[{"start":{"date-parts":[[2023,1,3]],"date-time":"2023-01-03T00:00:00Z","timestamp":1672704000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,1,3]],"date-time":"2023-01-03T00:00:00Z","timestamp":1672704000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["62072053"],"award-info":[{"award-number":["62072053"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100015401","name":"Key Research and Development Projects of Shaanxi Province","doi-asserted-by":"publisher","award":["2020GY-027"],"award-info":[{"award-number":["2020GY-027"]}],"id":[{"id":"10.13039\/501100015401","id-type":"DOI","asserted-by":"publisher"}]},{"name":"National Key Research and Development Program of China","award":["SQ2019YFB160023"],"award-info":[{"award-number":["SQ2019YFB160023"]}]},{"name":"Key Research and Development Project of Shaanxi Science and Technology Department","award":["2018ZDXM-GY-047"],"award-info":[{"award-number":["2018ZDXM-GY-047"]}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Complex Intell. Syst."],"published-print":{"date-parts":[[2023,8]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Monocular 3D vehicle localization is an important task for vehicle behaviour analysis, traffic flow parameter estimation and autonomous driving in Intelligent Transportation System (ITS) and Cooperative Vehicle Infrastructure System (CVIS), which is usually achieved by monocular 3D vehicle detection. However, monocular cameras cannot obtain depth information directly due to the inherent imaging mechanism, resulting in more challenging monocular 3D tasks. Currently, most of the monocular 3D vehicle detection methods still rely on 2D detectors and additional geometric constraint modules to recover 3D vehicle information, which reduces the efficiency. At the same time, most of the research is based on datasets of onboard scenes, instead of roadside perspective, which is limited in large-scale 3D perception. Therefore, we focus on 3D vehicle detection without 2D detectors in roadside scenes. We propose a 3D vehicle localization network CenterLoc3D for roadside monocular cameras, which directly predicts centroid and eight vertexes in image space, and the dimension of 3D bounding boxes without 2D detectors. To improve the precision of 3D vehicle localization, we propose a multi-scale weighted-fusion module and a loss with spatial constraints embedded in CenterLoc3D. Firstly, the transformation matrix between 2D image space and 3D world space is solved by camera calibration. Secondly, vehicle type, centroid, eight vertexes, and the dimension of 3D vehicle bounding boxes are obtained by CenterLoc3D. Finally, centroid in 3D world space can be obtained by camera calibration and CenterLoc3D for 3D vehicle localization. To the best of our knowledge, this is the first application of 3D vehicle localization for roadside monocular cameras. Hence, we also propose a benchmark for this application including a dataset (SVLD-3D), an annotation tool (LabelImg-3D), and evaluation metrics. Through experimental validation, the proposed method achieves high accuracy with <jats:inline-formula><jats:alternatives><jats:tex-math>$$A{P_{3D}}$$<\/jats:tex-math><mml:math xmlns:mml=\"http:\/\/www.w3.org\/1998\/Math\/MathML\">\n                  <mml:mrow>\n                    <mml:mi>A<\/mml:mi>\n                    <mml:msub>\n                      <mml:mi>P<\/mml:mi>\n                      <mml:mrow>\n                        <mml:mn>3<\/mml:mn>\n                        <mml:mi>D<\/mml:mi>\n                      <\/mml:mrow>\n                    <\/mml:msub>\n                  <\/mml:mrow>\n                <\/mml:math><\/jats:alternatives><\/jats:inline-formula> of 51.30%, average 3D localization precision of 98%, average 3D dimension precision of 85% and real-time performance with FPS of 41.18.<\/jats:p>","DOI":"10.1007\/s40747-022-00962-9","type":"journal-article","created":{"date-parts":[[2023,1,3]],"date-time":"2023-01-03T06:02:55Z","timestamp":1672725775000},"page":"4349-4368","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":18,"title":["CenterLoc3D: monocular 3D vehicle localization network for roadside surveillance cameras"],"prefix":"10.1007","volume":"9","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-2553-3623","authenticated-orcid":false,"given":"Xinyao","family":"Tang","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9775-9774","authenticated-orcid":false,"given":"Wei","family":"Wang","sequence":"additional","affiliation":[]},{"given":"Huansheng","family":"Song","sequence":"additional","affiliation":[]},{"given":"Chunhui","family":"Zhao","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2023,1,3]]},"reference":[{"key":"962_CR1","doi-asserted-by":"publisher","unstructured":"Wang C, Musaev A, Sheinidashtegol P, Atkison T (2019) Towards detection of abnormal vehicle behavior using traffic cameras. In: International Conference on Big Data, pp. 125\u2013136. https:\/\/doi.org\/10.1007\/978-3-030-23551-2_9","DOI":"10.1007\/978-3-030-23551-2_9"},{"key":"962_CR2","doi-asserted-by":"publisher","unstructured":"Huang C, Hu Z, Xu Y, Wang Y (2020) Vision-based abnormal vehicle behavior detection: a survey. Pattern Recognition and Artificial Intelligence 33(3):234\u2013248. https:\/\/doi.org\/10.16451\/j.cnki.issn1003-6059.202003005","DOI":"10.16451\/j.cnki.issn1003-6059.202003005"},{"issue":"4","key":"962_CR3","doi-asserted-by":"publisher","first-page":"114","DOI":"10.1177\/0361198120912742","volume":"2674","author":"H Liang","year":"2020","unstructured":"Liang H, Song H, Li H, Dai Z (2020) Vehicle counting system using deep learning and multi-object tracking. Transp Res Record 2674(4):114\u2013128. https:\/\/doi.org\/10.1177\/0361198120912742","journal-title":"Transp Res Record"},{"key":"962_CR4","doi-asserted-by":"publisher","unstructured":"Tasgaonkar PP, Garg RD, Garg PK (2020) Vehicle detection and traffic estimation with sensors technologies for intelligent transportation systems. Sens Imaging 21(1). https:\/\/doi.org\/10.1007\/s11220-020-00295-2","DOI":"10.1007\/s11220-020-00295-2"},{"key":"962_CR5","doi-asserted-by":"publisher","unstructured":"Dai Z, Song H, Liang H, Wu F, Wang X, Jia J, Fang Y (2020) Traffic parameter estimation and control system based on machine vision. J Ambient Intell Hum Comput: 1\u201313. https:\/\/doi.org\/10.1007\/s12652-020-02052-5","DOI":"10.1007\/s12652-020-02052-5"},{"key":"962_CR6","doi-asserted-by":"publisher","unstructured":"Shi S, Wang X, Li H (2019) Pointrcnn: 3d object proposal generation and detection from point cloud. In: 2019 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770\u2013779. https:\/\/doi.org\/10.1109\/CVPR.2019.00086","DOI":"10.1109\/CVPR.2019.00086"},{"key":"962_CR7","doi-asserted-by":"publisher","unstructured":"Shi S, Guo C, Jiang L, Wang Z, Shi J, Wang X, Li H (2020) Pv-rcnn: Point-voxel feature set abstraction for 3d object detection. In: 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 10526\u201310535. https:\/\/doi.org\/10.1109\/CVPR42600.2020.01054","DOI":"10.1109\/CVPR42600.2020.01054"},{"key":"962_CR8","doi-asserted-by":"publisher","unstructured":"Lang AH, Vora S, Caesar H, Zhou L, Yang J, Beijbom O (2019) Pointpillars: fast encoders for object detection from point clouds. In: 2019 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 12689\u201312697. https:\/\/doi.org\/10.1109\/CVPR.2019.01298","DOI":"10.1109\/CVPR.2019.01298"},{"key":"962_CR9","doi-asserted-by":"publisher","unstructured":"Zhou Y, Tuzel O (2018) Voxelnet: End-to-end learning for point cloud based 3d object detection. In: 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 4490\u20134499. https:\/\/doi.org\/10.1109\/CVPR.2018.00472","DOI":"10.1109\/CVPR.2018.00472"},{"issue":"5","key":"962_CR10","doi-asserted-by":"publisher","first-page":"1259","DOI":"10.1109\/TPAMI.2017.2706685","volume":"40","author":"X Chen","year":"2018","unstructured":"Chen X, Kundu K, Zhu Y, Ma H, Fidler S, Urtasun R (2018) 3d object proposals using stereo imagery for accurate object class detection. IEEE Trans Pattern Anal Mach Intell 40(5):1259\u20131272. https:\/\/doi.org\/10.1109\/TPAMI.2017.2706685","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"962_CR11","doi-asserted-by":"publisher","unstructured":"Qi CR, Liu W, Wu C, Su H, Guibas LJ (2018) Frustum pointnets for 3d object detection from rgb-d data. In: 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 918\u2013927. https:\/\/doi.org\/10.1109\/CVPR.2018.00102","DOI":"10.1109\/CVPR.2018.00102"},{"key":"962_CR12","doi-asserted-by":"publisher","unstructured":"Li P, Chen X, Shen S (2019) Stereo r-cnn based 3d object detection for autonomous driving. In: 2019 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 7636\u20137644. https:\/\/doi.org\/10.1109\/CVPR.2019.00783","DOI":"10.1109\/CVPR.2019.00783"},{"key":"962_CR13","doi-asserted-by":"crossref","unstructured":"Qin Z, Wang J, Lu Y (2019) Monogrnet: a geometric reasoning network for 3d object localization. In: The Thirty-Third AAAI Conference on Artificial Intelligence (AAAI-19)","DOI":"10.1609\/aaai.v33i01.33018851"},{"key":"962_CR14","doi-asserted-by":"publisher","unstructured":"Ku J, Pon AD, Waslander SL (2019) Monocular 3d object detection leveraging accurate proposals and shape reconstruction. In: 2019 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 11859\u201311868. https:\/\/doi.org\/10.1109\/CVPR.2019.01214","DOI":"10.1109\/CVPR.2019.01214"},{"key":"962_CR15","doi-asserted-by":"crossref","unstructured":"Liu Z, Wu Z, T\u00f3th R (2020) SMOKE: single-stage monocular 3d object detection via keypoint estimation. arXiv preprint arXiv:2002.10111","DOI":"10.1109\/CVPRW50498.2020.00506"},{"key":"962_CR16","doi-asserted-by":"crossref","unstructured":"Ding M, Huo Y, Yi H, Wang Z, Shi J, Lu Z, Luo P (2020) Learning depth-guided convolutions for monocular 3d object detection. In: Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 11672\u201311681","DOI":"10.1109\/CVPR42600.2020.01169"},{"key":"962_CR17","doi-asserted-by":"crossref","unstructured":"Zhang Y, Lu J, Zhou J (2021) Objects are different: flexible monocular 3d object detection. In: Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3289\u20133298","DOI":"10.1109\/CVPR46437.2021.00330"},{"key":"962_CR18","doi-asserted-by":"crossref","unstructured":"Wang T, Zhu X, Pang J, Lin D (2021) Fcos3d: fully convolutional one-stage monocular 3d object detection. arXiv preprint arXiv:2104.10956","DOI":"10.1109\/ICCVW54120.2021.00107"},{"key":"962_CR19","doi-asserted-by":"publisher","unstructured":"Mousavian A, Anguelov D, Flynn J, Ko\u0161eck\u00e1 J (2017) 3d bounding box estimation using deep learning and geometry. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 5632\u20135640. https:\/\/doi.org\/10.1109\/CVPR.2017.597","DOI":"10.1109\/CVPR.2017.597"},{"key":"962_CR20","doi-asserted-by":"publisher","unstructured":"Chabot F, Chaouch M, Rabarisoa J, Teuli\u00e8re C, Chateau T (2017) Deep manta: a coarse-to-fine many-task network for joint 2d and 3d vehicle analysis from monocular image. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1827\u20131836. https:\/\/doi.org\/10.1109\/CVPR.2017.198","DOI":"10.1109\/CVPR.2017.198"},{"issue":"4","key":"962_CR21","doi-asserted-by":"publisher","first-page":"925","DOI":"10.1109\/TRO.2019.2909168","volume":"35","author":"S Yang","year":"2019","unstructured":"Yang S, Scherer S (2019) Cubeslam: monocular 3-d object slam. IEEE Trans Robot 35(4):925\u2013938. https:\/\/doi.org\/10.1109\/TRO.2019.2909168","journal-title":"IEEE Trans Robot"},{"key":"962_CR22","doi-asserted-by":"publisher","unstructured":"Li B, Ouyang W, Sheng L, Zeng X, Wang X (2019) Gs3d: an efficient 3d object detection framework for autonomous driving. In: 2019 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1019\u20131028. https:\/\/doi.org\/10.1109\/CVPR.2019.00111","DOI":"10.1109\/CVPR.2019.00111"},{"key":"962_CR23","doi-asserted-by":"publisher","unstructured":"Gupta I, Rangesh A, Trivedi M (2019) 3d bounding boxes for road vehicles: a one-stage, localization prioritized approach using single monocular images. In: 2018 European Conference on Computer Vision Workshops (ECCV), pp. 626\u2013641. https:\/\/doi.org\/10.1007\/978-3-030-11021-5_39","DOI":"10.1007\/978-3-030-11021-5_39"},{"key":"962_CR24","doi-asserted-by":"crossref","unstructured":"Barabanau I, Artemov A, Burnaev E, Murashkin V (2020) Monocular 3d object detection via geometric reasoning on keypoints. In: 15th International Conference on Computer Vision Theory and Applications","DOI":"10.5220\/0009102506520659"},{"key":"962_CR25","doi-asserted-by":"publisher","unstructured":"Li P, Zhao H, Liu P, Cao F (2020) Rtm3d: real-time monocular 3d detection from object keypoints for autonomous driving. In: 2020 European Conference on Computer Vision Workshops (ECCV), pp. 644\u2013660. https:\/\/doi.org\/10.1007\/978-3-030-58580-8_38","DOI":"10.1007\/978-3-030-58580-8_38"},{"issue":"7","key":"962_CR26","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/s00138-020-01117-x","volume":"31","author":"V Kocur","year":"2020","unstructured":"Kocur V, Ft\u00e1nik M (2020) Detection of 3d bounding boxes of vehicles using perspective transformation for accurate speed measurement. Mach Vis Appl 31(7):1\u201315. https:\/\/doi.org\/10.1007\/s00138-020-01117-x","journal-title":"Mach Vis Appl"},{"key":"962_CR27","unstructured":"Yang L, Zhang X, Wang L, Zhu M, Li J (2021) Lite-FPN for Keypoint-based Monocular 3D Object Detection. arXiv:2105.00268"},{"issue":"3","key":"962_CR28","doi-asserted-by":"publisher","first-page":"5565","DOI":"10.1109\/LRA.2021.3061343","volume":"6","author":"P Li","year":"2021","unstructured":"Li P, Zhao H (2021) Monocular 3d detection with geometric constraint embedding and semi-supervised training. IEEE Robot Autom Lett 6(3):5565\u20135572. https:\/\/doi.org\/10.1109\/LRA.2021.3061343","journal-title":"IEEE Robot Autom Lett"},{"issue":"6","key":"962_CR29","doi-asserted-by":"publisher","first-page":"1137","DOI":"10.1109\/TPAMI.2016.2577031","volume":"39","author":"S Ren","year":"2017","unstructured":"Ren S, He K, Girshick R, Sun J (2017) Faster r-cnn: towards real-time object detection with region proposal networks. IEEE Trans Pattern Anal Mach Intell 39(6):1137\u20131149. https:\/\/doi.org\/10.1109\/TPAMI.2016.2577031","journal-title":"IEEE Trans Pattern Anal Mach Intell"},{"key":"962_CR30","doi-asserted-by":"publisher","unstructured":"Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu C-Y, Berg AC (2016) Ssd: single shot multibox detector. In: 2016 European Conference on Computer Vision (ECCV), pp. 21\u201337. https:\/\/doi.org\/10.1007\/978-3-319-46448-0_2","DOI":"10.1007\/978-3-319-46448-0_2"},{"key":"962_CR31","unstructured":"Bochkovskiy A, Wang C-Y, Liao H-YM (2020) YOLOv4: Optimal speed and accuracy of object detection. arXiv:2004.10934pdf"},{"key":"962_CR32","doi-asserted-by":"publisher","unstructured":"Lin T-Y, Goyal P, Girshick R, He K, Doll\u00e1r P (2017) Focal loss for dense object detection. In: 2017 IEEE International Conference on Computer Vision (ICCV), pp. 2999\u20133007. https:\/\/doi.org\/10.1109\/ICCV.2017.324","DOI":"10.1109\/ICCV.2017.324"},{"key":"962_CR33","unstructured":"Zhou X, Wang D, Kr\u00e4henb\u00fchl P (2019) Objects as points. In: arXiv Preprint arXiv:1904.07850"},{"key":"962_CR34","doi-asserted-by":"publisher","unstructured":"Chen X, Kundu K, Zhang Z, Ma H, Fidler S, Urtasun R (2016) Monocular 3d object detection for autonomous driving. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2147\u20132156. https:\/\/doi.org\/10.1109\/CVPR.2016.236","DOI":"10.1109\/CVPR.2016.236"},{"key":"962_CR35","doi-asserted-by":"publisher","unstructured":"Liu L, Lu J, Xu C, Tian Q, Zhou J (2019) Deep fitting degree scoring network for monocular 3d object detection. In: 2019 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1057\u20131066. https:\/\/doi.org\/10.1109\/CVPR.2019.00115","DOI":"10.1109\/CVPR.2019.00115"},{"key":"962_CR36","doi-asserted-by":"publisher","unstructured":"He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 770\u2013778. https:\/\/doi.org\/10.1109\/CVPR.2016.90","DOI":"10.1109\/CVPR.2016.90"},{"key":"962_CR37","doi-asserted-by":"publisher","first-page":"87","DOI":"10.1016\/j.cviu.2017.05.015","volume":"161","author":"J Sochor","year":"2017","unstructured":"Sochor J, Jur\u00e1nek R, Herout A (2017) Traffic surveillance camera calibration by 3d model bounding box alignment for accurate vehicle speed measurement. Comput Vis Image Understand 161:87\u201398. https:\/\/doi.org\/10.1016\/j.cviu.2017.05.015","journal-title":"Comput Vis Image Understand"},{"key":"962_CR38","doi-asserted-by":"publisher","unstructured":"Yu F, Wang D, Shelhamer E, Darrell T (2018) Deep layer aggregation. In: 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 2403\u20132412. https:\/\/doi.org\/10.1109\/CVPR.2018.00255","DOI":"10.1109\/CVPR.2018.00255"},{"issue":"2","key":"962_CR39","doi-asserted-by":"publisher","first-page":"441","DOI":"10.1109\/TITS.2010.2045500","volume":"11","author":"NK Kanhere","year":"2010","unstructured":"Kanhere NK, Birchfield ST (2010) A taxonomy and analysis of camera calibration methods for traffic monitoring applications. IEEE Trans Intell Transp Syst 11(2):441\u2013452. https:\/\/doi.org\/10.1109\/TITS.2010.2045500","journal-title":"IEEE Trans Intell Transp Syst"},{"issue":"11","key":"962_CR40","doi-asserted-by":"publisher","first-page":"1955","DOI":"10.3724\/SP.J.1089.2019.17737","volume":"31","author":"W Wang","year":"2019","unstructured":"Wang W, Zhang C, Tang X, Song H, Cui H (2019) Automatic self-calibration and optimization algorithm of traffic camera in road scene. J Comput Aid Design Comput Graph 31(11):1955\u20131962. https:\/\/doi.org\/10.3724\/SP.J.1089.2019.17737","journal-title":"J Comput Aid Design Comput Graph"},{"key":"962_CR41","doi-asserted-by":"publisher","unstructured":"Lin T-Y, Doll\u00e1r P, Girshick R, He K, Hariharan B, Belongie S (2017) Feature pyramid networks for object detection. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 936\u2013944. https:\/\/doi.org\/10.1109\/CVPR.2017.106","DOI":"10.1109\/CVPR.2017.106"},{"key":"962_CR42","doi-asserted-by":"publisher","unstructured":"Hu J, Shen L, Sun G (2018) Squeeze-and-excitation networks. In: 2018 IEEE\/CVF Conference on Computer Vision and Pattern Recognition, pp. 7132\u20137141. https:\/\/doi.org\/10.1109\/CVPR.2018.00745","DOI":"10.1109\/CVPR.2018.00745"},{"key":"962_CR43","doi-asserted-by":"publisher","unstructured":"Zheng Z, Wang P, Liu W, Li J, Ye R, Ren D (2020) Distance-iou loss: faster and better learning for bounding box regression. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 34, pp. 12993\u201313000. https:\/\/doi.org\/10.1609\/aaai.v34i07.6999","DOI":"10.1609\/aaai.v34i07.6999"},{"key":"962_CR44","doi-asserted-by":"publisher","unstructured":"Geiger A, Lenz P, Urtasun R (2012) Are we ready for autonomous driving? The kitti vision benchmark suite. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition, pp. 3354\u20133361. https:\/\/doi.org\/10.1109\/CVPR.2012.6248074","DOI":"10.1109\/CVPR.2012.6248074"},{"key":"962_CR45","doi-asserted-by":"publisher","unstructured":"Sun P, Kretzschmar H, Dotiwalla X, Chouard A, Patnaik V, Tsui P, Guo J, Zhou Y, Chai Y, Caine B, Vasudevan V, Han W, Ngiam J, Zhao H, Timofeev A, Ettinger S, Krivokon M, Gao A, Joshi A, Zhang Y, Shlens J, Chen Z, Anguelov D (2020) Scalability in perception for autonomous driving: Waymo open dataset. In: 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2443\u20132451. https:\/\/doi.org\/10.1109\/CVPR42600.2020.00252","DOI":"10.1109\/CVPR42600.2020.00252"},{"key":"962_CR46","doi-asserted-by":"publisher","unstructured":"Caesar H, Bankiti V, Lang AH, Vora S, Liong VE, Xu Q, Krishnan A, Pan Y, Baldan G, Beijbom O (2020) nuscenes: A multimodal dataset for autonomous driving. In: 2020 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 11618\u201311628. https:\/\/doi.org\/10.1109\/CVPR42600.2020.01164","DOI":"10.1109\/CVPR42600.2020.01164"},{"issue":"5","key":"962_CR47","doi-asserted-by":"publisher","first-page":"1633","DOI":"10.1109\/TITS.2018.2825609","volume":"20","author":"J Sochor","year":"2019","unstructured":"Sochor J, Jur\u00e1nek R, \u0160pa\u0148hel J, Mar\u0161\u00edk L, \u0160irok\u00fd A, Herout A, Zem\u010d\u00edk P (2019) Comprehensive data set for automatic single camera visual speed measurement. IEEE Trans Intell Transp Syst 20(5):1633\u20131643. https:\/\/doi.org\/10.1109\/TITS.2018.2825609","journal-title":"IEEE Trans Intell Transp Syst"},{"key":"962_CR48","doi-asserted-by":"publisher","unstructured":"Wang W, Tang X, Zhang C, Song H, Cui H (2021) Spatial positioning method of vehicle in cross-camera traffic scene. J Comput Aid Design Comput Graph 33(6):873\u2013882. https:\/\/doi.org\/10.3724\/SP.J.1089.2021.18612","DOI":"10.3724\/SP.J.1089.2021.18612"},{"key":"962_CR49","doi-asserted-by":"publisher","first-page":"303","DOI":"10.1007\/s11263-009-0275-4","volume":"88","author":"M Everingham","year":"2010","unstructured":"Everingham M, Van Gool L, Williams C, Winn J, Zisserman A (2010) The pascal visual object classes (voc) challenge. Int J Comput Vis 88:303\u2013338. https:\/\/doi.org\/10.1007\/s11263-009-0275-4","journal-title":"Int J Comput Vis"}],"container-title":["Complex &amp; Intelligent Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s40747-022-00962-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s40747-022-00962-9\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s40747-022-00962-9.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,7,27]],"date-time":"2023-07-27T13:26:21Z","timestamp":1690464381000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s40747-022-00962-9"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,1,3]]},"references-count":49,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2023,8]]}},"alternative-id":["962"],"URL":"https:\/\/doi.org\/10.1007\/s40747-022-00962-9","relation":{},"ISSN":["2199-4536","2198-6053"],"issn-type":[{"value":"2199-4536","type":"print"},{"value":"2198-6053","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,1,3]]},"assertion":[{"value":"9 June 2022","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"26 December 2022","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"3 January 2023","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}