{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T00:50:29Z","timestamp":1760057429988,"version":"build-2065373602"},"reference-count":130,"publisher":"MDPI AG","issue":"1","license":[{"start":{"date-parts":[[2025,2,2]],"date-time":"2025-02-02T00:00:00Z","timestamp":1738454400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"the Australian Government Research Training Program (RTP) scholarship"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["JSAN"],"abstract":"<jats:p>High-definition (HD) maps aim to provide detailed road information with centimeter-level accuracy, essential for enabling precise navigation and safe operation of autonomous vehicles (AVs). Traditional offline construction methods involve several complex steps, such as data collection, point cloud generation, and feature extraction, but these methods are resource-intensive and struggle to keep pace with the rapidly changing road environments. In contrast, online HD map construction leverages onboard sensor data to dynamically generate local HD maps, offering a bird\u2019s-eye view (BEV) representation of the surrounding road environment. This approach has the potential to improve adaptability to spatial and temporal changes in road conditions while enhancing cost-efficiency by reducing the dependency on frequent map updates and expensive survey fleets. This survey provides a comprehensive analysis of online HD map construction, including the task background, high-level motivations, research methodology, key advancements, existing challenges, and future trends. We systematically review the latest advancements in three key sub-tasks: map segmentation, map element detection, and lane graph construction, aiming to bridge gaps in the current literature. We also discuss existing challenges and future trends, covering standardized map representation design, multitask learning, and multi-modality fusion, while offering suggestions for potential improvements.<\/jats:p>","DOI":"10.3390\/jsan14010015","type":"journal-article","created":{"date-parts":[[2025,2,3]],"date-time":"2025-02-03T04:36:51Z","timestamp":1738557411000},"page":"15","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Online High-Definition Map Construction for Autonomous Vehicles: A Comprehensive Survey"],"prefix":"10.3390","volume":"14","author":[{"ORCID":"https:\/\/orcid.org\/0009-0005-9028-846X","authenticated-orcid":false,"given":"Hongyu","family":"Lyu","sequence":"first","affiliation":[{"name":"Australian Centre for Robotics, The University of Sydney, Camperdown, NSW 2006, Australia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3126-7042","authenticated-orcid":false,"given":"Julie Stephany","family":"Berrio Perez","sequence":"additional","affiliation":[{"name":"Australian Centre for Robotics, The University of Sydney, Camperdown, NSW 2006, Australia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5969-1593","authenticated-orcid":false,"given":"Yaoqi","family":"Huang","sequence":"additional","affiliation":[{"name":"Australian Centre for Robotics, The University of Sydney, Camperdown, NSW 2006, Australia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8527-1875","authenticated-orcid":false,"given":"Kunming","family":"Li","sequence":"additional","affiliation":[{"name":"Australian Centre for Robotics, The University of Sydney, Camperdown, NSW 2006, Australia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5032-6581","authenticated-orcid":false,"given":"Mao","family":"Shan","sequence":"additional","affiliation":[{"name":"Australian Centre for Robotics, The University of Sydney, Camperdown, NSW 2006, Australia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7940-4742","authenticated-orcid":false,"given":"Stewart","family":"Worrall","sequence":"additional","affiliation":[{"name":"Australian Centre for Robotics, The University of Sydney, Camperdown, NSW 2006, Australia"}]}],"member":"1968","published-online":{"date-parts":[[2025,2,2]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"324","DOI":"10.1017\/S0373463319000638","article-title":"High definition map for automated driving: Overview and analysis","volume":"73","author":"Liu","year":"2020","journal-title":"J. Navig."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Fischer, P., Azimi, S.M., Roschlaub, R., and Krau\u00df, T. (2018). Towards hd maps from aerial imagery: Robust lane marking segmentation using country-scale imagery. ISPRS Int. J. Geo-Inf., 7.","DOI":"10.3390\/ijgi7120458"},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"527","DOI":"10.1109\/OJITS.2023.3295502","article-title":"High-definition maps: Comprehensive survey, challenges and future perspectives","volume":"4","author":"Elghazaly","year":"2023","journal-title":"IEEE Open J. Intell. Transp. Syst."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Ebrahimi Soorchaei, B., Razzaghpour, M., Valiente, R., Raftari, A., and Fallah, Y.P. (2022). High-definition map representation techniques for automated vehicles. Electronics, 11.","DOI":"10.20944\/preprints202209.0067.v1"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Diaz-Diaz, A., Oca\u00f1a, M., Llamazares, \u00c1., G\u00f3mez-Hu\u00e9lamo, C., Revenga, P., and Bergasa, L.M. (2022, January 5\u20139). Hd maps: Exploiting opendrive potential for path planning and map monitoring. Proceedings of the 2022 IEEE Intelligent Vehicles Symposium (IV), Aachen, Germany.","DOI":"10.1109\/IV51971.2022.9827297"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Bu\u010dko, B., Z\u00e1bovsk\u00e1, K., Ristvej, J., and J\u00e1no\u0161\u00edkov\u00e1, M. (October, January 27). HD Maps and Usage of Laser Scanned Data as a Potential Map Layer. Proceedings of the 2021 44th International Convention on Information, Communication and Electronic Technology (MIPRO), Opatija, Croatia.","DOI":"10.23919\/MIPRO52101.2021.9596722"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"106125","DOI":"10.1016\/j.engappai.2023.106125","article-title":"A review of high-definition map creation methods for autonomous driving","volume":"122","author":"Bao","year":"2023","journal-title":"Eng. Appl. Artif. Intell."},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Ilci, V., and Toth, C. (2020). High definition 3D map creation using GNSS\/IMU\/LiDAR sensor integration to support autonomous vehicle navigation. Sensors, 20.","DOI":"10.3390\/s20030899"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Asrat, K.T., and Cho, H.J. (2024). A Comprehensive Survey on High-Definition Map Generation and Maintenance. ISPRS Int. J. Geo-Inf., 13.","DOI":"10.3390\/ijgi13070232"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Javanmardi, M., Javanmardi, E., Gu, Y., and Kamijo, S. (2017). Towards high-definition 3D urban mapping: Road feature-based registration of mobile mapping systems and aerial imagery. Remote Sens., 9.","DOI":"10.3390\/rs9100975"},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Massow, K., Kwella, B., Pfeifer, N., H\u00e4usler, F., Pontow, J., Radusch, I., Hipp, J., D\u00f6litzscher, F., and Haueis, M. (2016, January 1\u20134). Deriving HD maps for highly automated driving from vehicular probe data. Proceedings of the 2016 IEEE 19th International Conference on Intelligent Transportation Systems (ITSC), Rio de Janeiro, Brazil.","DOI":"10.1109\/ITSC.2016.7795794"},{"key":"ref_12","first-page":"1","article-title":"LOAM: Lidar odometry and mapping in real-time","volume":"2","author":"Zhang","year":"2014","journal-title":"Proceedings of the Robotics: Science and Systems"},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Shan, T., Englot, B., Meyers, D., Wang, W., Ratti, C., and Rus, D. (2020, January 25\u201329). Lio-sam: Tightly-coupled lidar inertial odometry via smoothing and mapping. Proceedings of the 2020 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS), Las Vegas, NV, USA.","DOI":"10.1109\/IROS45743.2020.9341176"},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Yi, S., Worrall, S., and Nebot, E. (2019, January 27\u201330). Geographical map registration and fusion of lidar-aerial orthoimagery in gis. Proceedings of the 2019 IEEE Intelligent Transportation Systems Conference (ITSC), Auckland, New Zealand.","DOI":"10.1109\/ITSC.2019.8917305"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"Huang, X., Mei, G., and Zhang, J. (2020, January 14\u201319). Feature-metric registration: A fast semi-supervised approach for robust point cloud registration without correspondences. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01138"},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Shan, M., Narula, K., Worrall, S., Wong, Y.F., Perez, J.S.B., Gray, P., and Nebot, E. (2022, January 8\u201312). A Novel Probabilistic V2X Data Fusion Framework for Cooperative Perception. Proceedings of the 2022 IEEE 25th International Conference on Intelligent Transportation Systems (ITSC), Macau, China.","DOI":"10.1109\/ITSC55140.2022.9922251"},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"7637","DOI":"10.1109\/TITS.2021.3071647","article-title":"Camera-LIDAR integration: Probabilistic sensor fusion for semantic mapping","volume":"23","author":"Berrio","year":"2021","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"10427","DOI":"10.1109\/TITS.2021.3094485","article-title":"Long-term map maintenance pipeline for autonomous vehicles","volume":"23","author":"Berrio","year":"2021","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Can, Y.B., Liniger, A., Paudel, D.P., and Van Gool, L. (2021, January 10\u201317). Structured bird\u2019s-eye-view traffic scene understanding from onboard images. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Montreal, BC, Canada.","DOI":"10.1109\/ICCV48922.2021.01537"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Can, Y.B., Liniger, A., Paudel, D.P., and Van Gool, L. (2022, January 18\u201324). Topology preserving local road network estimation from single onboard camera image. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Orleans, LA, USA.","DOI":"10.1109\/CVPR52688.2022.01675"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Li, Q., Wang, Y., Wang, Y., and Zhao, H. (2022, January 19\u201323). Hdmapnet: An online hd map construction and evaluation framework. Proceedings of the 2022 International Conference on Robotics and Automation (ICRA), Philadelphia, PA, USA.","DOI":"10.1109\/ICRA46639.2022.9812383"},{"key":"ref_22","unstructured":"Liu, Y., Yuan, T., Wang, Y., Wang, Y., and Zhao, H. (2023, January 23\u201329). Vectormapnet: End-to-end vectorized hd map learning. Proceedings of the International Conference on Machine Learning, PMLR, Honolulu, HI, USA."},{"key":"ref_23","first-page":"1","article-title":"High-Definition Maps Construction Based on Visual Sensor: A Comprehensive Survey","volume":"99","author":"Tang","year":"2023","journal-title":"IEEE Trans. Intell. Veh."},{"key":"ref_24","unstructured":"Ma, Y., Wang, T., Bai, X., Yang, H., Hou, Y., Wang, Y., Qiao, Y., Yang, R., Manocha, D., and Zhu, X. (2022). Vision-centric bev perception: A survey. arXiv."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"2151","DOI":"10.1109\/TPAMI.2023.3333838","article-title":"Delving into the devils of bird\u2019s-eye-view perception: A review, evaluation and recipe","volume":"46","author":"Li","year":"2023","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Murciego, E., Hu\u00e9lamo, C.G., Barea, R., Bergasa, L.M., Romera, E., Arango, J.F., Tradacete, M., and S\u00e1ez, \u00c1. (2018, January 22\u201323). Topological road mapping for autonomous driving applications. Proceedings of the Advances in Physical Agents: Proceedings of the 19th International Workshop of Physical Agents (WAF 2018), Madrid, Spain.","DOI":"10.1007\/978-3-319-99885-5_18"},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"187","DOI":"10.5194\/isprs-archives-XLVIII-2-2024-187-2024","article-title":"A Review on End-to-End High-Definition Map Generation","volume":"48","author":"Kwag","year":"2024","journal-title":"Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"210","DOI":"10.1186\/s13643-016-0384-4","article-title":"Rayyan\u2014A web and mobile app for systematic reviews","volume":"5","author":"Ouzzani","year":"2016","journal-title":"Syst. Rev."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Caesar, H., Bankiti, V., Lang, A.H., Vora, S., Liong, V.E., Xu, Q., Krishnan, A., Pan, Y., Baldan, G., and Beijbom, O. (2020, January 13\u201319). nuscenes: A multimodal dataset for autonomous driving. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01164"},{"key":"ref_30","unstructured":"Wilson, B., Qi, W., Agarwal, T., Lambert, J., Singh, J., Khandelwal, S., Pan, B., Kumar, R., Hartnett, A., and Pontes, J.K. (2021, January 6\u201314). Argoverse 2: Next generation datasets for self-driving perception and forecasting. Proceedings of the 35th Conference on Neural Information Processing Systems (NeurIPS 2021) Track on Datasets and Benchmarks, Vistual."},{"key":"ref_31","unstructured":"Wang, H., Li, T., Li, Y., Chen, L., Sima, C., Liu, Z., Wang, B., Jia, P., Wang, Y., and Jiang, S. (2024). Openlane-v2: A topology reasoning benchmark for unified 3d hd mapping. Proceedings of the 37th Conference on Neural Information Processing Systems (NeurIPS 2023) Track on Datasets and Benchmarks, IEEE."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Sun, P., Kretzschmar, H., Dotiwalla, X., Chouard, A., Patnaik, V., Tsui, P., Guo, J., Zhou, Y., Chai, Y., and Caine, B. (2020, January 13\u201319). Scalability in perception for autonomous driving: Waymo open dataset. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.00252"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Chang, M.F., Lambert, J., Sangkloy, P., Singh, J., Bak, S., Hartnett, A., Wang, D., Carr, P., Lucey, S., and Ramanan, D. (2019, January 15\u201320). Argoverse: 3d tracking and forecasting with rich maps. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.00895"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Forsyth, D., Torr, P., and Zisserman, A. (2008). Learning to Localize Objects with Structured Output Regression. Lecture Notes in Computer Science, Proceedings of the Computer Vision\u2014ECCV 2008, Marseille, France, 12\u201318 October 2008, Springer.","DOI":"10.1007\/978-3-540-88693-8"},{"key":"ref_35","doi-asserted-by":"crossref","unstructured":"He, Y., Bian, C., Xia, J., Shi, S., Yan, Z., Song, Q., and Xing, G. (2023, January 2\u20136). VI-Map: Infrastructure-Assisted Real-Time HD Mapping for Autonomous Driving. Proceedings of the 29th Annual International Conference on Mobile Computing and Networking, ACM MobiCom \u201923, New York, NY, USA.","DOI":"10.1145\/3570361.3613280"},{"key":"ref_36","doi-asserted-by":"crossref","unstructured":"Peng, R., Cai, X., Xu, H., Lu, J., Wen, F., Zhang, W., and Zhang, L. (2024). LaneGraph2Seq: Lane Topology Extraction with Language Model via Vertex-Edge Encoding and Connectivity Enhancement. arXiv.","DOI":"10.1609\/aaai.v38i5.28248"},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"3292","DOI":"10.1109\/TPAMI.2022.3179507","article-title":"Kitti-360: A novel dataset and benchmarks for urban scene understanding in 2d and 3d","volume":"45","author":"Liao","year":"2022","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Roddick, T., and Cipolla, R. (2020, January 13\u201319). Predicting semantic map representations from images using pyramid occupancy networks. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR42600.2020.01115"},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Philion, J., and Fidler, S. (2020, January 23\u201328). Lift, splat, shoot: Encoding images from arbitrary camera rigs by implicitly unprojecting to 3d. Proceedings of the Computer Vision\u2013ECCV 2020: 16th European Conference, Glasgow, UK. Proceedings, Part XIV 16.","DOI":"10.1007\/978-3-030-58568-6_12"},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Reiher, L., Lampe, B., and Eckstein, L. (2020, January 20\u201323). A sim2real deep learning approach for the transformation of images from multiple vehicle-mounted cameras to a semantically segmented image in bird\u2019s eye view. Proceedings of the 2020 IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC), Rhodes, Greece.","DOI":"10.1109\/ITSC45102.2020.9294462"},{"key":"ref_41","doi-asserted-by":"crossref","first-page":"4867","DOI":"10.1109\/LRA.2020.3004325","article-title":"Cross-view semantic segmentation for sensing surroundings","volume":"5","author":"Pan","year":"2020","journal-title":"IEEE Robot. Autom. Lett."},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Yang, W., Li, Q., Liu, W., Yu, Y., Ma, Y., He, S., and Pan, J. (2021, January 20\u201325). Projecting your view attentively: Monocular road scene layout estimation via cross-view transformation. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.01528"},{"key":"ref_43","doi-asserted-by":"crossref","unstructured":"Saha, A., Mendez, O., Russell, C., and Bowden, R. (June, January 30). Enabling spatio-temporal aggregation in birds-eye-view vehicle estimation. Proceedings of the 2021 IEEE International Conference on Robotics and Automation (ICRA), Xi\u2019an, China.","DOI":"10.1109\/ICRA48506.2021.9561169"},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"Zhou, B., and Kr\u00e4henb\u00fchl, P. (2023, January 13). Cross-view transformers for real-time map-view semantic segmentation. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada.","DOI":"10.1109\/CVPR52688.2022.01339"},{"key":"ref_45","unstructured":"Li, Z., Wang, W., Li, H., Xie, E., Sima, C., Lu, T., Qiao, Y., and Dai, J. Bevformer: Learning bird\u2019s-eye-view representation from multi-camera images via spatiotemporal transformers. Proceedings of the European Conference on Computer Vision."},{"key":"ref_46","unstructured":"Lu, J., Zhou, Z., Zhu, X., Xu, H., and Zhang, L. Learning ego 3d representation as ray tracing. Proceedings of the European Conference on Computer Vision."},{"key":"ref_47","unstructured":"Bartoccioni, F., Zablocki, \u00c9., Bursuc, A., P\u00e9rez, P., Cord, M., and Alahari, K. (2023, January 6\u20139). Lara: Latents and rays for multi-camera bird\u2019s-eye-view semantic segmentation. Proceedings of the Conference on Robot Learning, PMLR, Atlanta, GA, USA."},{"key":"ref_48","doi-asserted-by":"crossref","first-page":"1968","DOI":"10.1109\/LRA.2022.3142418","article-title":"Bird\u2019s-eye-view panoptic segmentation using monocular frontal view images","volume":"7","author":"Gosala","year":"2022","journal-title":"IEEE Robot. Autom. Lett."},{"key":"ref_49","unstructured":"Xie, E., Yu, Z., Zhou, D., Philion, J., Anandkumar, A., Fidler, S., Luo, P., and Alvarez, J.M. (2022). BEV: Multi-Camera Joint 3D Detection and Segmentation with Unified Birds-Eye View Representation. arXiv."},{"key":"ref_50","unstructured":"Zhang, Y., Zhu, Z., Zheng, W., Huang, J., Huang, G., Zhou, J., and Lu, J. (2022). Beverse: Unified perception and prediction in birds-eye-view for vision-centric autonomous driving. arXiv."},{"key":"ref_51","doi-asserted-by":"crossref","unstructured":"Peng, L., Chen, Z., Fu, Z., Liang, P., and Cheng, E. (2023, January 3\u20137). BEVSegFormer: Bird\u2019s Eye View Semantic Segmentation From Arbitrary Camera Rigs. Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision, Waikoloa, HI, USA.","DOI":"10.1109\/WACV56688.2023.00588"},{"key":"ref_52","doi-asserted-by":"crossref","unstructured":"Liu, Z., Tang, H., Amini, A., Yang, X., Mao, H., Rus, D.L., and Han, S. (June, January 29). Bevfusion: Multi-task multi-sensor fusion with unified bird\u2019s-eye view representation. Proceedings of the 2023 IEEE international conference on robotics and automation (ICRA), London, UK.","DOI":"10.1109\/ICRA48891.2023.10160968"},{"key":"ref_53","doi-asserted-by":"crossref","unstructured":"Harley, A.W., Fang, Z., Li, J., Ambrus, R., and Fragkiadaki, K. (June, January 29). Simple-bev: What really matters for multi-sensor bev perception?. Proceedings of the 2023 IEEE International Conference on Robotics and Automation (ICRA), London, UK.","DOI":"10.1109\/ICRA48891.2023.10160831"},{"key":"ref_54","doi-asserted-by":"crossref","unstructured":"Zou, J., Zhu, Z., Huang, J., Yang, T., Huang, G., and Wang, X. (2023, January 29). HFT: Lifting Perspective Representations via Hybrid Feature Transformation for BEV Perception. Proceedings of the 2023 IEEE International Conference on Robotics and Automation (ICRA), London, UK.","DOI":"10.1109\/ICRA48891.2023.10161214"},{"key":"ref_55","doi-asserted-by":"crossref","unstructured":"Liu, Y., Yan, J., Jia, F., Li, S., Gao, A., Wang, T., and Zhang, X. (2023, January 2\u20133). Petrv2: A unified framework for 3d perception from multi-camera images. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Paris, France.","DOI":"10.1109\/ICCV51070.2023.00302"},{"key":"ref_56","doi-asserted-by":"crossref","unstructured":"Liao, B., Chen, S., Wang, X., Cheng, T., Zhang, Q., Liu, W., and Huang, C. (2023, January 1). MapTR: Structured Modeling and Learning for Online Vectorized HD Map Construction. Proceedings of the International Conference on Learning Representations, Kigali, Rwanda.","DOI":"10.1007\/s11263-024-02235-z"},{"key":"ref_57","doi-asserted-by":"crossref","unstructured":"Qiao, L., Ding, W., Qiu, X., and Zhang, C. (2023, January 17\u201324). End-to-End Vectorized HD-Map Construction With Piecewise Bezier Curve. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada.","DOI":"10.1109\/CVPR52729.2023.01270"},{"key":"ref_58","doi-asserted-by":"crossref","unstructured":"Xiong, X., Liu, Y., Yuan, T., Wang, Y., Wang, Y., and Zhao, H. (2023, January 18\u201322). Neural map prior for autonomous driving. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada.","DOI":"10.1109\/CVPR52729.2023.01682"},{"key":"ref_59","unstructured":"Shin, J., Rameau, F., Jeong, H., and Kum, D. (2023, January 17\u201324). Instagram: Instance-level graph modeling for vectorized hd map learning. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops 2023, Vancouver, BC, Canada."},{"key":"ref_60","doi-asserted-by":"crossref","unstructured":"Ding, W., Qiao, L., Qiu, X., and Zhang, C. (2023, January 4\u20136). Pivotnet: Vectorized pivot learning for end-to-end hd map construction. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Paris, France.","DOI":"10.1109\/ICCV51070.2023.00340"},{"key":"ref_61","unstructured":"Yu, J., Zhang, Z., Xia, S., and Sang, J. (2023, January 6\u20139). ScalableMap: Scalable Map Learning for Online Long-Range Vectorized HD Map Construction. Proceedings of the Conference on Robot Learning, PMLR, Atlanta, GA, USA."},{"key":"ref_62","unstructured":"Zhang, G., Lin, J., Wu, S., Luo, Z., Xue, Y., Lu, S., and Wang, Z. (2023). Online map vectorization for autonomous driving: A rasterization perspective. Adv. Neural Inf. Process. Syst., 36."},{"key":"ref_63","doi-asserted-by":"crossref","unstructured":"Yuan, T., Liu, Y., Wang, Y., Wang, Y., and Zhao, H. (2024, January 3\u20138). Streammapnet: Streaming mapping network for vectorized online hd map construction. Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision, Waikoloa, HI, USA.","DOI":"10.1109\/WACV57701.2024.00719"},{"key":"ref_64","doi-asserted-by":"crossref","unstructured":"Liu, R., and Yuan, Z. (2024, January 18\u201322). Compact HD Map Construction via Douglas\u2013Peucker Point Transformer. Proceedings of the AAAI Conference on Artificial Intelligence, Vancouver, BC, Canada.","DOI":"10.1609\/aaai.v38i4.28160"},{"key":"ref_65","doi-asserted-by":"crossref","unstructured":"Dong, H., Gu, W., Zhang, X., Xu, J., Ai, R., Lu, H., Kannala, J., and Chen, X. (2024, January 13\u201317). Superfusion: Multilevel LiDAR\u2013camera fusion for long-range hd map generation. Proceedings of the 2024 IEEE International Conference on Robotics and Automation (ICRA), Yokohama, Japan.","DOI":"10.1109\/ICRA57147.2024.10611320"},{"key":"ref_66","unstructured":"Zhou, Y., Zhang, H., Yu, J., Yang, Y., Jung, S., Park, S.I., and Yoo, B. HIMap: HybrId Representation Learning for End-to-end Vectorized HD Map Construction. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition."},{"key":"ref_67","doi-asserted-by":"crossref","unstructured":"Liu, X., Wang, S., Li, W., Yang, R., Chen, J., and Zhu, J. (2024, January 17\u201321). Mgmap: Mask-guided learning for online vectorized hd map construction. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA.","DOI":"10.1109\/CVPR52733.2024.01403"},{"key":"ref_68","unstructured":"Xu, Z., K Wong, K.Y., and Zhao, H. InsMapper: Exploring inner-instance information for vectorized HD mapping. Proceedings of the European Conference on Computer Vision."},{"key":"ref_69","unstructured":"Zhang, Z., Zhang, Y., Ding, X., Jin, F., and Yue, X. Online vectorized hd map construction using geometry. Proceedings of the European Conference on Computer Vision."},{"key":"ref_70","unstructured":"Zhang, X., Liu, G., Liu, Z., Xu, N., Liu, Y., and Zhao, J. (October, January 29). Enhancing vectorized map perception with historical rasterized maps. Proceedings of the European Conference on Computer Vision, Milan, Italy."},{"key":"ref_71","unstructured":"Hao, X., Li, R., Zhang, H., Li, D., Yin, R., Jung, S., Park, S.I., Yoo, B., Zhao, H., and Zhang, J. (October, January 29). Mapdistill: Boosting efficient camera-based hd map construction via camera-lidar fusion model distillation. Proceedings of the European Conference on Computer Vision, Milan, Italy."},{"key":"ref_72","doi-asserted-by":"crossref","unstructured":"Wang, S., Jia, F., Liu, Y., Zhao, Y., Chen, Z., Wang, T., Zhang, C., Zhang, X., and Zhao, F. (2025). Stream Query Denoising for Vectorized HD Map Construction. European Conference on Computer Vision, Springer.","DOI":"10.1007\/978-3-031-72655-2_12"},{"key":"ref_73","unstructured":"Chen, J., Wu, Y., Tan, J., Ma, H., and Furukawa, Y. Maptracker: Tracking with strided memory fusion for consistent vector hd mapping. Proceedings of the European Conference on Computer Vision."},{"key":"ref_74","doi-asserted-by":"crossref","unstructured":"Hu, H., Wang, F., Wang, Y., Hu, L., Xu, J., and Zhang, Z. (2025, January 26\u201327). ADMap: Anti-disturbance framework for reconstructing online vectorized HD map. Proceedings of the European Conference on Computer Vision, Paris, France.","DOI":"10.1007\/978-3-031-72673-6_17"},{"key":"ref_75","unstructured":"Liu, Z., Zhang, X., Liu, G., Zhao, J., and Xu, N. Leveraging enhanced queries of point sets for vectorized map construction. Proceedings of the European Conference on Computer Vision."},{"key":"ref_76","doi-asserted-by":"crossref","unstructured":"Liao, B., Chen, S., Zhang, Y., Jiang, B., Zhang, Q., Liu, W., Huang, C., and Wang, X. (2024). Maptrv2: An end-to-end framework for online vectorized hd map construction. Int. J. Comput. Vis., 1\u201323.","DOI":"10.1007\/s11263-024-02235-z"},{"key":"ref_77","doi-asserted-by":"crossref","first-page":"21672","DOI":"10.1109\/TITS.2024.3450704","article-title":"DTCLMapper: Dual Temporal Consistent Learning for Vectorized HD Map Construction","volume":"25","author":"Li","year":"2024","journal-title":"Trans. Intell. Transport. Sys."},{"key":"ref_78","doi-asserted-by":"crossref","first-page":"8539","DOI":"10.1109\/LRA.2024.3447450","article-title":"P-MapNet: Far-Seeing Map Generator Enhanced by Both SDMap and HDMap Priors","volume":"9","author":"Jiang","year":"2024","journal-title":"IEEE Robot. Autom. Lett."},{"key":"ref_79","unstructured":"Peng, N., Zhou, X., Wang, M., Yang, X., Chen, S., and Chen, G. (2024, January 3\u20138). PrevPredMap: Exploring Temporal Modeling with Previous Predictions for Online Vectorized HD Map Construction. Proceedings of the IEEE\/CVF Winter Conference on Applications of Computer Vision, Waikoloa, HI, USA."},{"key":"ref_80","doi-asserted-by":"crossref","unstructured":"Xu, Z., Liu, Y., Sun, Y., Liu, M., and Wang, L. (June, January 29). Centerlinedet: Centerline graph detection for road lanes with vehicle-mounted sensors by transformer for hd map generation. Proceedings of the 2023 IEEE International Conference on Robotics and Automation (ICRA), London, UK.","DOI":"10.1109\/ICRA48891.2023.10161508"},{"key":"ref_81","doi-asserted-by":"crossref","unstructured":"Can, Y.B., Liniger, A., Paudel, D., and Van Gool, L. (2023, January 24\u201328). Online Lane Graph Extraction from Onboard Video. Proceedings of the 2023 IEEE 26th International Conference on Intelligent Transportation Systems (ITSC), Bilbao, Spain.","DOI":"10.1109\/ITSC57777.2023.10421898"},{"key":"ref_82","doi-asserted-by":"crossref","unstructured":"Can, Y.B., Liniger, A., Paudel, D., and Van Gool, L. (2023, January 24\u201328). Prior Based Online Lane Graph Extraction from Single Onboard Camera Image. Proceedings of the 2023 IEEE 26th International Conference on Intelligent Transportation Systems (ITSC), Bilbao, Spain.","DOI":"10.1109\/ITSC57777.2023.10422171"},{"key":"ref_83","doi-asserted-by":"crossref","unstructured":"Can, Y.B., Liniger, A., Paudel, D.P., and Van Gool, L. (2023, January 4\u20136). Improving online lane graph extraction by object-lane clustering. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Paris, France.","DOI":"10.1109\/ICCV51070.2023.00789"},{"key":"ref_84","doi-asserted-by":"crossref","unstructured":"Lu, J., Peng, R., Cai, X., Xu, H., Li, H., Wen, F., Zhang, W., and Zhang, L. (2023, January 1\u20136). Translating Images to Road Network: A Non-Autoregressive Sequence-to-Sequence Approach. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Paris, France.","DOI":"10.1109\/ICCV51070.2023.00009"},{"key":"ref_85","unstructured":"Li, T., Chen, L., Wang, H., Li, Y., Yang, J., Geng, X., Jiang, S., Wang, Y., Xu, H., and Xu, C. (2023). Graph-based topology reasoning for driving scenes. arXiv."},{"key":"ref_86","unstructured":"Li, T., Jia, P., Wang, B., Chen, L., Jiang, K., Yan, J., and Li, H. (2024, January 7\u201311). LaneSegNet: Map Learning with Lane Segment Perception for Autonomous Driving. Proceedings of the ICLR, Paris, France."},{"key":"ref_87","unstructured":"Wu, D., Chang, J., Jia, F., Liu, Y., Wang, T., and Shen, J. (2024, January 7\u201311). TopoMLP: An Simple yet Strong Pipeline for Driving Topology Reasoning. Proceedings of the International Conference on Learning Representations, Vienna, Austria."},{"key":"ref_88","doi-asserted-by":"crossref","unstructured":"Luo, K.Z., Weng, X., Wang, Y., Wu, S., Li, J., Weinberger, K.Q., Wang, Y., and Pavone, M. (2024, January 13\u201317). Augmenting lane perception and topology understanding with standard definition navigation maps. Proceedings of the 2024 IEEE International Conference on Robotics and Automation (ICRA), Yokohama, Japan.","DOI":"10.1109\/ICRA57147.2024.10610276"},{"key":"ref_89","doi-asserted-by":"crossref","unstructured":"Zhu, T., Leng, J., Zhong, J., Zhang, Z., and Sun, C. (2024, January 2\u20135). Lanemapnet: Lane network recognization and hd map construction using curve region aware temporal bird\u2019s-eye-view perception. Proceedings of the 2024 IEEE Intelligent Vehicles Symposium (IV), Jeju Island, Republic of Korea.","DOI":"10.1109\/IV55156.2024.10588419"},{"key":"ref_90","unstructured":"Han, Y., Yu, K., and Li, Z. Continuity preserving online centerline graph learning. Proceedings of the European Conference on Computer Vision."},{"key":"ref_91","unstructured":"Ma, Z., Liang, S., Wen, Y., Lu, W., and Wan, G. Roadpainter: Points are ideal navigators for topology transformer. Proceedings of the European Conference on Computer Vision."},{"key":"ref_92","unstructured":"Liao, B., Chen, S., Jiang, B., Cheng, T., Zhang, Q., Liu, W., Huang, C., and Wang, X. Lane graph as path: Continuity-preserving path-wise modeling for online lane graph construction. Proceedings of the European Conference on Computer Vision."},{"key":"ref_93","unstructured":"Kalfaoglu, M., Ozturk, H.I., Kilinc, O., and Temizel, A. (2024, January 17\u201318). TopoMaskV2: Enhanced Instance-Mask-Based Formulation for the Road Topology Problem. Proceedings of the IEEE\/CVF European Conference on Computer Vision Workshops, Seattle, WA, USA."},{"key":"ref_94","unstructured":"Fu, Y., Liao, W., Liu, X., Ma, Y., Dai, F., and Zhang, Y. (2024). TopoLogic: An Interpretable Pipeline for Lane Topology Reasoning on Driving Scenes. Advances in Neural Information Processing Systems, Morgan Kaufmann Publishers Inc."},{"key":"ref_95","doi-asserted-by":"crossref","first-page":"1909","DOI":"10.1007\/s11263-023-01790-1","article-title":"3D object detection for autonomous driving: A comprehensive survey","volume":"131","author":"Mao","year":"2023","journal-title":"Int. J. Comput. Vis."},{"key":"ref_96","doi-asserted-by":"crossref","first-page":"107623","DOI":"10.1016\/j.patcog.2020.107623","article-title":"A review of lane detection methods based on deep learning","volume":"111","author":"Tang","year":"2021","journal-title":"Pattern Recognit."},{"key":"ref_97","doi-asserted-by":"crossref","first-page":"177","DOI":"10.1007\/BF00201978","article-title":"Inverse perspective mapping simplifies optical flow computation and obstacle detection","volume":"64","author":"Mallot","year":"1991","journal-title":"Biol. Cybern."},{"key":"ref_98","doi-asserted-by":"crossref","first-page":"108","DOI":"10.1016\/j.inffus.2014.09.003","article-title":"Multimodal inverse perspective mapping","volume":"24","author":"Oliveira","year":"2015","journal-title":"Inf. Fusion"},{"key":"ref_99","doi-asserted-by":"crossref","first-page":"585","DOI":"10.1016\/S0262-8856(97)00093-0","article-title":"Stereo inverse perspective mapping: Theory and applications","volume":"16","author":"Bertozz","year":"1998","journal-title":"Image Vis. Comput."},{"key":"ref_100","doi-asserted-by":"crossref","first-page":"4350","DOI":"10.1109\/TITS.2019.2939832","article-title":"Restricted deformable convolution-based road scene semantic segmentation using surround view cameras","volume":"21","author":"Deng","year":"2019","journal-title":"IEEE Trans. Intell. Transp. Syst."},{"key":"ref_101","unstructured":"S\u00e4mann, T., Amende, K., Milz, S., Witt, C., Simon, M., and Petzold, J. Efficient semantic segmentation for visual bird\u2019s-eye view interpretation. Proceedings of the Intelligent Autonomous Systems 15: Proceedings of the 15th International Conference IAS-15."},{"key":"ref_102","unstructured":"Zhu, X., Su, W., Lu, L., Li, B., Wang, X., and Dai, J. (2020). Deformable detr: Deformable transformers for end-to-end object detection. arXiv."},{"key":"ref_103","doi-asserted-by":"crossref","unstructured":"Samani, E.U., Tao, F., Dasari, H.R., Ding, S., and Banerjee, A.G. (, January 1\u20135). F2BEV: Bird\u2019s Eye View Generation from Surround-View Fisheye Camera Images for Automated Driving. Proceedings of the 2023 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS), Detroit, MI, USA.","DOI":"10.1109\/IROS55552.2023.10341862"},{"key":"ref_104","doi-asserted-by":"crossref","unstructured":"Dutta, P., Sistu, G., Yogamani, S., Galv\u00e1n, E., and McDonald, J. (2023, January 18\u201323). ViT-BEVSeg: A hierarchical transformer network for monocular birds-eye-view segmentation. Proceedings of the 2022 International Joint Conference on Neural Networks (IJCNN), Padua, Italy.","DOI":"10.1109\/IJCNN55064.2022.9891987"},{"key":"ref_105","doi-asserted-by":"crossref","first-page":"3302","DOI":"10.1109\/LRA.2022.3146898","article-title":"Understanding bird\u2019s-eye view of road semantics using an onboard camera","volume":"7","author":"Can","year":"2022","journal-title":"IEEE Robot. Autom. Lett."},{"key":"ref_106","unstructured":"Tesla (2024, November 19). Tesla AI Day 2021. Available online: https:\/\/www.youtube.com\/watch?v=j0z4FweCy4M."},{"key":"ref_107","doi-asserted-by":"crossref","first-page":"445","DOI":"10.1109\/LRA.2019.2891028","article-title":"Monocular semantic occupancy grid mapping with convolutional variational encoder\u2013decoder networks","volume":"4","author":"Lu","year":"2019","journal-title":"IEEE Robot. Autom. Lett."},{"key":"ref_108","unstructured":"Roddick, T., Kendall, A., and Cipolla, R. (2018). Orthographic feature transform for monocular 3d object detection. arXiv."},{"key":"ref_109","doi-asserted-by":"crossref","unstructured":"Long, J., Shelhamer, E., and Darrell, T. (2015, January 7\u201312). Fully convolutional networks for semantic segmentation. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA.","DOI":"10.1109\/CVPR.2015.7298965"},{"key":"ref_110","unstructured":"Carion, N., Massa, F., Synnaeve, G., Usunier, N., Kirillov, A., and Zagoruyko, S. End-to-end object detection with transformers. Proceedings of the European Conference on Computer Vision."},{"key":"ref_111","unstructured":"Tan, M., and Le, Q. (2019, January 9\u201315). Efficientnet: Rethinking model scaling for convolutional neural networks. Proceedings of the International Conference on Machine Learning, PMLR, Long Beach, CA, USA."},{"key":"ref_112","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., and Sun, J. (2016, January 27\u201330). Deep residual learning for image recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.","DOI":"10.1109\/CVPR.2016.90"},{"key":"ref_113","doi-asserted-by":"crossref","unstructured":"Lang, A.H., Vora, S., Caesar, H., Zhou, L., Yang, J., and Beijbom, O. (2019, January 15\u201320). Pointpillars: Fast encoders for object detection from point clouds. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA.","DOI":"10.1109\/CVPR.2019.01298"},{"key":"ref_114","doi-asserted-by":"crossref","unstructured":"Yan, Y., Mao, Y., and Li, B. (2018). Second: Sparsely embedded convolutional detection. Sensors, 18.","DOI":"10.3390\/s18103337"},{"key":"ref_115","unstructured":"Liu, Y., Wang, T., Zhang, X., and Sun, J. Petr: Position embedding transformation for multi-view 3d object detection. Proceedings of the European Conference on Computer Vision."},{"key":"ref_116","doi-asserted-by":"crossref","unstructured":"Gu, X., Song, G., Gilitschenski, I., Pavone, M., and Ivanovic, B. (2024). Producing and Leveraging Online Map Uncertainty in Trajectory Prediction. arXiv.","DOI":"10.1109\/CVPR52733.2024.01376"},{"key":"ref_117","doi-asserted-by":"crossref","first-page":"41","DOI":"10.1023\/A:1007379606734","article-title":"Multitask learning","volume":"28","author":"Caruana","year":"1997","journal-title":"Mach. Learn."},{"key":"ref_118","doi-asserted-by":"crossref","unstructured":"Ranganatha, N.E., Zhang, H., Venkatramani, S., Liao, J.Y., and Christensen, H.I. (2024). SemVecNet: Generalizable Vector Map Generation for Arbitrary Sensor Configurations. arXiv.","DOI":"10.1109\/IV55156.2024.10588555"},{"key":"ref_119","doi-asserted-by":"crossref","unstructured":"Choi, S., Kim, J., Shin, H., and Choi, J.W. (2024). Mask2map: Vectorized hd map construction using bird\u2019s eye view segmentation masks. arXiv.","DOI":"10.1007\/978-3-031-72890-7_2"},{"key":"ref_120","doi-asserted-by":"crossref","unstructured":"Berrio, J.S., Zhou, W., Ward, J., Worrall, S., and Nebot, E. (2018, January 1\u20135). Octree map based on sparse point cloud and heuristic probability distribution for labeled images. Proceedings of the 2018 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS), Madrid, Spain.","DOI":"10.1109\/IROS.2018.8594024"},{"key":"ref_121","doi-asserted-by":"crossref","unstructured":"Cao, A.Q., and De Charette, R. (2022, January 18\u201324). Monoscene: Monocular 3D semantic scene completion. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA.","DOI":"10.1109\/CVPR52688.2022.00396"},{"key":"ref_122","doi-asserted-by":"crossref","unstructured":"Li, Y., Yu, Z., Choy, C., Xiao, C., Alvarez, J.M., Fidler, S., Feng, C., and Anandkumar, A. (2023, January 17\u201324). Voxformer: Sparse voxel transformer for camera-based 3D semantic scene completion. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada.","DOI":"10.1109\/CVPR52729.2023.00877"},{"key":"ref_123","doi-asserted-by":"crossref","unstructured":"Zhang, Y., Zhu, Z., and Du, D. (2023, January 2\u20136). Occformer: Dual-path transformer for vision-based 3d semantic occupancy prediction. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Paris, France.","DOI":"10.1109\/ICCV51070.2023.00865"},{"key":"ref_124","doi-asserted-by":"crossref","unstructured":"Ming, Z., Berrio, J.S., Shan, M., and Worrall, S. (2024). InverseMatrixVT3D: An Efficient Projection Matrix-Based Approach for 3D Occupancy Prediction. arXiv.","DOI":"10.1109\/IROS58592.2024.10802434"},{"key":"ref_125","doi-asserted-by":"crossref","unstructured":"Ming, Z., Berrio, J.S., Shan, M., and Worrall, S. (2024). OccFusion: Multi-Sensor Fusion Framework for 3D Semantic Occupancy Prediction. IEEE Trans. Intell. Veh., early access.","DOI":"10.1109\/TIV.2024.3453293"},{"key":"ref_126","doi-asserted-by":"crossref","unstructured":"Casas, S., Sadat, A., and Urtasun, R. (2021, January 20\u201325). Mp3: A unified model to map, perceive, predict and plan. Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA.","DOI":"10.1109\/CVPR46437.2021.01417"},{"key":"ref_127","doi-asserted-by":"crossref","unstructured":"Chitta, K., Prakash, A., and Geiger, A. (2021, January 11\u2013177). Neat: Neural attention fields for end-to-end autonomous driving. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Montreal, BC, Canada.","DOI":"10.1109\/ICCV48922.2021.01550"},{"key":"ref_128","doi-asserted-by":"crossref","unstructured":"Sima, C., Renz, K., Chitta, K., Chen, L., Zhang, H., Xie, C., Bei\u00dfwenger, J., Luo, P., Geiger, A., and Li, H. (2025, January 26\u201327). Drivelm: Driving with graph visual question answering. Proceedings of the European Conference on Computer Vision, Paris, France.","DOI":"10.1007\/978-3-031-72943-0_15"},{"key":"ref_129","unstructured":"Marcu, A.M., Chen, L., H\u00fcnermann, J., Karnsund, A., Hanotte, B., Chidananda, P., Nair, S., Badrinarayanan, V., Kendall, A., and Shotton, J. (October, January 29). LingoQA: Visual question answering for autonomous driving. Proceedings of the European Conference on Computer Vision, Milan, Italy."},{"key":"ref_130","doi-asserted-by":"crossref","unstructured":"Guo, Y., Yin, F., Li, X.h., Yan, X., Xue, T., Mei, S., and Liu, C.L. (2023, January 1\u20136). Visual traffic knowledge graph generation from scene images. Proceedings of the IEEE\/CVF International Conference on Computer Vision, Paris, France.","DOI":"10.1109\/ICCV51070.2023.01975"}],"container-title":["Journal of Sensor and Actuator Networks"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2224-2708\/14\/1\/15\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,9]],"date-time":"2025-10-09T16:25:56Z","timestamp":1760027156000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2224-2708\/14\/1\/15"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,2,2]]},"references-count":130,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2025,2]]}},"alternative-id":["jsan14010015"],"URL":"https:\/\/doi.org\/10.3390\/jsan14010015","relation":{},"ISSN":["2224-2708"],"issn-type":[{"type":"electronic","value":"2224-2708"}],"subject":[],"published":{"date-parts":[[2025,2,2]]}}}