{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,5]],"date-time":"2026-05-05T12:01:26Z","timestamp":1777982486791,"version":"3.51.4"},"reference-count":76,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2026,3,1]],"date-time":"2026-03-01T00:00:00Z","timestamp":1772323200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2026,3,10]],"date-time":"2026-03-10T00:00:00Z","timestamp":1773100800000},"content-version":"vor","delay-in-days":9,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Auton Robot"],"published-print":{"date-parts":[[2026,3]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>Recent advances in mapping techniques have enabled the creation of highly accurate dense 3D maps during robotic missions, such as point clouds, meshes, or NeRF-based representations. These developments present new opportunities for reusing these maps for localization. However, there remains a lack of a unified approach that can operate seamlessly across different map representations. This paper presents and evaluates a global visual localization system capable of localizing a single camera image across various 3D map representations built using both visual and lidar sensing. Our system generates a database by synthesizing novel views of the scene, creating RGB and depth image pairs. Leveraging the precise 3D geometric map, our method automatically defines rendering poses, reducing the number of database images while preserving retrieval performance. To bridge the domain gap between real query camera images and synthetic database images, our approach utilizes learning-based descriptors and feature detectors. We evaluate the system\u2019s performance through extensive real-world experiments conducted in both indoor and outdoor settings, assessing the effectiveness of each map representation and demonstrating its advantages over traditional structure-from-motion (SfM) localization approaches. The results show that all three map representations can achieve consistent localization success rates of 55% and higher across various environments. NeRF synthesized images show superior performance, localizing query images at an average success rate of 72%. Furthermore, we demonstrate an advantage over SfM-based approaches that our synthesized database enables localization in the reverse travel direction which is unseen during the mapping process. Our system, operating in real-time on a mobile laptop equipped with a GPU, achieves a processing rate of 1\u00a0Hz.<\/jats:p>","DOI":"10.1007\/s10514-025-10232-5","type":"journal-article","created":{"date-parts":[[2026,3,10]],"date-time":"2026-03-10T07:09:16Z","timestamp":1773126556000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Visual localization in 3D maps: comparing point cloud, mesh, and NeRF representations"],"prefix":"10.1007","volume":"50","author":[{"given":"Lintong","family":"Zhang","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yifu","family":"Tao","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jiarong","family":"Lin","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Fu","family":"Zhang","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Maurice","family":"Fallon","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2026,3,10]]},"reference":[{"key":"10232_CR1","doi-asserted-by":"publisher","unstructured":"Alcantarilla, P.F., Nuevo, J., & Bartoli, A. (2013). Fast explicit diffusion for accelerated features in nonlinear scale spaces. In: British Machine Vision Conf. (BMVC). https:\/\/doi.org\/10.5244\/C.27.13","DOI":"10.5244\/C.27.13"},{"key":"10232_CR2","doi-asserted-by":"crossref","unstructured":"Ali-bey, A., Chaib-draa, B., & Gigu\u00e8re, P. (2023). MixVPR: Feature mixing for visual place recognition. In: IEEE\/CVF Winter Conf. on Applications of Computer Vision, pp. 2998\u20133007.","DOI":"10.1109\/WACV56688.2023.00301"},{"issue":"6","key":"10232_CR3","doi-asserted-by":"publisher","first-page":"1437","DOI":"10.1109\/TPAMI.2017.2711011","volume":"40","author":"R Arandjelovi\u0107","year":"2018","unstructured":"Arandjelovi\u0107, R., Gronat, P., Torii, A., Pajdla, T., & Sivic, J. (2018). NetVLAD: CNN architecture for weakly supervised place recognition. IEEE Trans. Pattern Anal. Mach. Intell., 40(6), 1437\u20131451. https:\/\/doi.org\/10.1109\/TPAMI.2017.2711011","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"10232_CR4","volume-title":"Graphics Gems II","author":"J Arvo","year":"1991","unstructured":"Arvo, J. (1991). Graphics Gems II. San Francisco, CA, USA: Morgan Kaufmann Publishers Inc."},{"key":"10232_CR5","doi-asserted-by":"publisher","unstructured":"Barron, J.T., Mildenhall, B., Verbin, D., Srinivasan, P.P., & Hedman, P. (2022). Mip-NeRF 360: Unbounded anti-aliased neural radiance fields. IEEE Intl. Conf. Computer Vision and Pattern Recognition (CVPR). https:\/\/doi.org\/10.1109\/CVPR52688.2022.00539","DOI":"10.1109\/CVPR52688.2022.00539"},{"key":"10232_CR6","doi-asserted-by":"publisher","unstructured":"Bernreiter, L., Ott, L., Nieto, J., Siegwart, R., & Cadena, C. (2021). Spherical multi-modal place recognition for heterogeneous sensor systems. In: IEEE Intl. Conf. on Robotics and Automation (ICRA), pp. 1743\u20131750. https:\/\/doi.org\/10.1109\/ICRA48506.2021.9561078 . IEEE","DOI":"10.1109\/ICRA48506.2021.9561078"},{"key":"10232_CR7","doi-asserted-by":"publisher","unstructured":"Berton, G., Trivigno, G., Caputo, B., & Masone, C. (2023). EigenPlaces: Training viewpoint robust models for visual place recognition. In: IEEE Intl. Conf. Computer Vision and Pattern Recognition (CVPR), pp. 11080\u201311090. https:\/\/doi.org\/10.1109\/ICCV51070.2023.01017","DOI":"10.1109\/ICCV51070.2023.01017"},{"issue":"2","key":"10232_CR8","doi-asserted-by":"publisher","first-page":"239","DOI":"10.1109\/34.121791","volume":"14","author":"PJ Besl","year":"1992","unstructured":"Besl, P. J., & McKay, N. D. (1992). A method for registration of 3D shapes. IEEE Trans. Pattern Anal. Mach. Intell., 14(2), 239\u2013256. https:\/\/doi.org\/10.1109\/34.121791","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"10232_CR9","doi-asserted-by":"publisher","unstructured":"Borges, P., Zlot, R., Bosse, M., Nuske, S., & Tews, A. (2010). Vision-based localization using an edge map extracted from 3D laser range data. In: IEEE Intl. Conf. on Robotics and Automation (ICRA), pp. 4902\u20134909. https:\/\/doi.org\/10.1109\/ROBOT.2010.5509517","DOI":"10.1109\/ROBOT.2010.5509517"},{"key":"10232_CR10","doi-asserted-by":"publisher","unstructured":"Botsch, M., Kobbelt, L., Pauly, M., Alliez, P., & L\u00e9vy, B. (2010). Polygon Mesh Processing. A K Peters, Natick, Massachusetts. https:\/\/doi.org\/10.1201\/b10688","DOI":"10.1201\/b10688"},{"key":"10232_CR11","doi-asserted-by":"publisher","unstructured":"Cattaneo, D., Vaghi, M., Ballardini, A.L., Fontana, S., Sorrenti, D.G., & Burgard, W. (2019). Cmrnet: Camera to lidar-map registration. In: IEEE Intelligent Transportation Systems Conference (ITSC), pp. 1283\u20131289. https:\/\/doi.org\/10.1109\/ITSC.2019.8917470","DOI":"10.1109\/ITSC.2019.8917470"},{"key":"10232_CR12","doi-asserted-by":"publisher","unstructured":"Chen, X., Vizzo, I., L\u00e4be, T., Behley, J., & Stachniss, C. (2021). Range image-based lidar localization for autonomous vehicles. In: 2021 IEEE International Conference on Robotics and Automation (ICRA), pp. 5802\u20135808. https:\/\/doi.org\/10.1109\/ICRA48506.2021.9561335","DOI":"10.1109\/ICRA48506.2021.9561335"},{"key":"10232_CR13","doi-asserted-by":"publisher","unstructured":"Chen, K., Yu, H., Yang, W., Yu, L., Scherer, S., & Xia, G.-S. (2022). I2d-loc: Camera localization via image to lidar depth flow. ISPRS Journal of Photogrammetry and Remote Sensing 194, 209\u2013221. https:\/\/doi.org\/10.1016\/j.isprsjprs.2022.10.009","DOI":"10.1016\/j.isprsjprs.2022.10.009"},{"issue":"6","key":"10232_CR14","doi-asserted-by":"publisher","first-page":"647","DOI":"10.1177\/0278364908090961","volume":"27","author":"M Cummins","year":"2008","unstructured":"Cummins, M., & Newman, P. (2008). FAB-MAP: Probabilistic localization and mapping in the space of appearance. Intl. J. of Robot. Res., 27(6), 647\u2013665. https:\/\/doi.org\/10.1177\/0278364908090961","journal-title":"Intl. J. of Robot. Res."},{"key":"10232_CR15","doi-asserted-by":"publisher","unstructured":"Dai, A., Chang, A.X., Savva, M., Halber, M., Funkhouser, T., & Nie\u00dfner, M. (2017). ScanNet: Richly-annotated 3D reconstructions of indoor scenes. In: IEEE Intl. Conf. Computer Vision and Pattern Recognition (CVPR). https:\/\/doi.org\/10.1109\/CVPR.2017.261","DOI":"10.1109\/CVPR.2017.261"},{"key":"10232_CR16","doi-asserted-by":"publisher","unstructured":"DeTone, D., Malisiewicz, T., & Rabinovich, A. (2018). SuperPoint: Self-supervised interest point detection and description. In: IEEE Intl. Conf. Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 224\u2013236. https:\/\/doi.org\/10.1109\/CVPRW.2018.00060","DOI":"10.1109\/CVPRW.2018.00060"},{"key":"10232_CR17","doi-asserted-by":"publisher","unstructured":"Deng, K., Liu, A., Zhu, J.-Y., & Ramanan, D. (2022). Depth-supervised nerf: Fewer views and faster training for free. In: IEEE Intl. Conf. Computer Vision and Pattern Recognition (CVPR), pp. 12882\u201312891. https:\/\/doi.org\/10.1109\/CVPR52688.2022.01254","DOI":"10.1109\/CVPR52688.2022.01254"},{"issue":"2\u20133","key":"10232_CR18","doi-asserted-by":"publisher","first-page":"339","DOI":"10.1177\/0278364919863090","volume":"39","author":"R Dub\u00e9","year":"2020","unstructured":"Dub\u00e9, R., Cramariuc, A., Dugas, D., Sommer, H., Dymczyk, M., Nieto, J., Siegwart, R., & Cadena, C. (2020). Segmap: Segment-based mapping and localization using data-driven descriptors. Intl. J. of Robot. Res., 39(2\u20133), 339\u2013355. https:\/\/doi.org\/10.1177\/0278364919863090","journal-title":"Intl. J. of Robot. Res."},{"key":"10232_CR19","doi-asserted-by":"publisher","unstructured":"Dusmanu, M., Rocco, I., Pajdla, T., Pollefeys, M., Sivic, J., Torii, A., & Sattler, T. (2019). D2-Net: A trainable cnn for joint detection and description of local features. In: IEEE Intl. Conf. Computer Vision and Pattern Recognition (CVPR). https:\/\/doi.org\/10.1109\/CVPR.2019.00828","DOI":"10.1109\/CVPR.2019.00828"},{"key":"10232_CR20","doi-asserted-by":"publisher","unstructured":"Fridovich-Keil, S., Yu, A., Tancik, M., Chen, Q., Recht, B., & Kanazawa, A. (2022). Plenoxels: Radiance fields without neural networks. In: IEEE Intl. Conf. Computer Vision and Pattern Recognition (CVPR), pp. 5501\u20135510. https:\/\/doi.org\/10.1109\/CVPR52688.2022.00542","DOI":"10.1109\/CVPR52688.2022.00542"},{"key":"10232_CR21","doi-asserted-by":"publisher","unstructured":"Fu, L.F.T., Chebrolu, N., & Fallon, M. (2023). Extrinsic calibration of camera to lidar using a differentiable checkerboard model. In: IEEE\/RSJ Intl. Conf. on Intelligent Robots and Systems (IROS), pp. 1825\u20131831. https:\/\/doi.org\/10.1109\/IROS55552.2023.10341781","DOI":"10.1109\/IROS55552.2023.10341781"},{"issue":"5","key":"10232_CR22","doi-asserted-by":"publisher","first-page":"1188","DOI":"10.1109\/TRO.2012.2197158","volume":"28","author":"D Galvez-L\u00f3pez","year":"2012","unstructured":"Galvez-L\u00f3pez, D., & Tardos, J. D. (2012). Bags of binary words for fast place recognition in image sequences. IEEE Trans. Robotics, 28(5), 1188\u20131197. https:\/\/doi.org\/10.1109\/TRO.2012.2197158","journal-title":"IEEE Trans. Robotics"},{"key":"10232_CR23","doi-asserted-by":"publisher","unstructured":"Gehring, C., Fankhauser, P., Isler, L., Diethelm, R., Bachmann, S., Potz, M., Gerstenberg, L., & Hutter, M. (2021). ANYmal in the Field: Solving industrial inspection of an offshore hvdc platform with a quadrupedal robot. In: Field and Service Robotics, pp. 247\u2013260. Springer, Singapore. https:\/\/doi.org\/10.1007\/978-981-15-9460-1_18","DOI":"10.1007\/978-981-15-9460-1_18"},{"key":"10232_CR24","doi-asserted-by":"publisher","unstructured":"Hausler, S., Garg, S., Xu, M., Milford, M., & Fischer, T. (2021). Patch-NetVLAD: Multi-scale fusion of locally-global descriptors for place recognition. In: IEEE Intl. Conf. Computer Vision and Pattern Recognition (CVPR), pp. 14141\u201314152 . https:\/\/doi.org\/10.1109\/CVPR46437.2021.01392","DOI":"10.1109\/CVPR46437.2021.01392"},{"key":"10232_CR25","doi-asserted-by":"publisher","unstructured":"He, L., Wang, X., & Zhang, H. (2016). M2DP: A novel 3D point cloud descriptor and its application in loop closure detection. In: IEEE\/RSJ Intl. Conf. on Intelligent Robots and Systems (IROS), pp. 231\u2013237. https:\/\/doi.org\/10.1109\/IROS.2016.7759060","DOI":"10.1109\/IROS.2016.7759060"},{"key":"10232_CR26","doi-asserted-by":"publisher","unstructured":"Himstedt, M., Frost, J., Hellbach, S., B\u00f6hme, H.-J., & Maehle, E. (2014). Large scale place recognition in 2D lidar scans using geometrical landmark relations. In: IEEE\/RSJ Intl. Conf. on Intelligent Robots and Systems (IROS), pp. 5030\u20135035. https:\/\/doi.org\/10.1109\/IROS.2014.6943277","DOI":"10.1109\/IROS.2014.6943277"},{"issue":"3","key":"10232_CR27","doi-asserted-by":"publisher","first-page":"165","DOI":"10.1145\/800031.808594","volume":"18","author":"JT Kajiya","year":"1984","unstructured":"Kajiya, J. T., & Von Herzen, B. P. (1984). Ray tracing volume densities. SIGGRAPH, 18(3), 165\u2013174. https:\/\/doi.org\/10.1145\/800031.808594","journal-title":"Ray tracing volume densities. SIGGRAPH"},{"key":"10232_CR28","doi-asserted-by":"publisher","unstructured":"Kerbl, B., Kopanas, G., Leimk\u00fchler, T., & Drettakis, G. (2023). 3D Gaussian splatting for real-time radiance field rendering. ACM Trans. on Graphics 42(4). https:\/\/doi.org\/10.1145\/3592433","DOI":"10.1145\/3592433"},{"key":"10232_CR29","doi-asserted-by":"publisher","unstructured":"Kim, G., Kim, A. (2018). Scan Context: Egocentric spatial descriptor for place recognition within 3D point cloud map. In: IEEE\/RSJ Intl. Conf. on Intelligent Robots and Systems (IROS), pp. 4802\u20134809. https:\/\/doi.org\/10.1109\/IROS.2018.8593953","DOI":"10.1109\/IROS.2018.8593953"},{"key":"10232_CR30","doi-asserted-by":"publisher","unstructured":"Kong, X., Yang, X., Zhai, G., Zhao, X., Zeng, X., Wang, M., Liu, Y., Li, W., & Wen, F. (2020). Semantic graph based place recognition for 3D point clouds. In: IEEE\/RSJ Intl. Conf. on Intelligent Robots and Systems (IROS), pp. 8216\u20138223. https:\/\/doi.org\/10.1109\/IROS45743.2020.9341060","DOI":"10.1109\/IROS45743.2020.9341060"},{"key":"10232_CR31","doi-asserted-by":"publisher","unstructured":"Lai, H., Yin, P., & Scherer, S. (2022). AdaFusion: Visual-lidar fusion with adaptive weights for place recognition. IEEE Robot. Autom. Lett. (RA-L) 7(4), 12038\u201312045 https:\/\/doi.org\/10.1109\/LRA.2022.3210880","DOI":"10.1109\/LRA.2022.3210880"},{"key":"10232_CR32","doi-asserted-by":"publisher","unstructured":"Lee, A.J., Song, S., Lim, H., Lee, W., & Myung, H. (2023). (LC)$$^{2}$$: Lidar-camera loop constraints for cross-modal place recognition. IEEE Robot. Autom. Lett. (RA-L) 8(6) https:\/\/doi.org\/10.1109\/LRA.2023.3268848","DOI":"10.1109\/LRA.2023.3268848"},{"key":"10232_CR33","doi-asserted-by":"publisher","unstructured":"Li, Z., & Snavely, N. (2018). Megadepth: Learning single-view depth prediction from internet photos. In: IEEE Intl. Conf. Computer Vision and Pattern Recognition (CVPR). https:\/\/doi.org\/10.1109\/CVPR.2018.00218","DOI":"10.1109\/CVPR.2018.00218"},{"key":"10232_CR34","doi-asserted-by":"publisher","unstructured":"Liang, S., Zhang, Y., Tian, R., Zhu, D., Yang, L., & Cao, Z. (2022). SemLoc: Accurate and robust visual localization with semantic and structural constraints from prior maps. In: IEEE Intl. Conf. on Robotics and Automation (ICRA), pp. 4135\u20134141. https:\/\/doi.org\/10.1109\/ICRA46639.2022.9811925","DOI":"10.1109\/ICRA46639.2022.9811925"},{"key":"10232_CR35","doi-asserted-by":"publisher","DOI":"10.1109\/TRO.2023.3321227","author":"J Lin","year":"2023","unstructured":"Lin, J., Yuan, C., Cai, Y., Li, H., Ren, Y., Zou, Y., Hong, X., & Zhang, F. (2023). ImMesh: An immediate lidar localization and meshing framework. IEEE Trans. Robotics. https:\/\/doi.org\/10.1109\/TRO.2023.3321227","journal-title":"IEEE Trans. Robotics"},{"key":"10232_CR36","doi-asserted-by":"publisher","unstructured":"Lin, J., & Zhang, F. (2022). $$\\text{R}^3$$LIVE++: a robust, real-time, radiance reconstruction package with a tightly-coupled lidar-inertial-visual state estimator. arXiv preprint arXiv:2209.03666. https:\/\/doi.org\/10.48550\/ARXIV.2209.03666","DOI":"10.48550\/ARXIV.2209.03666"},{"key":"10232_CR37","doi-asserted-by":"publisher","unstructured":"Liu, J., Nie, Q., Liu, Y., & Wang, C. (2023). NeRF-Loc: Visual localization with conditional neural radiance field. In: IEEE Intl. Conf. on Robotics and Automation (ICRA), pp. 9385\u20139392 . https:\/\/doi.org\/10.1109\/ICRA48891.2023.10161420","DOI":"10.1109\/ICRA48891.2023.10161420"},{"key":"10232_CR38","doi-asserted-by":"publisher","unstructured":"Liu, J., Nie, Q., Liu, Y., & Wang, C. (2023). Nerf-loc: Visual localization with conditional neural radiance field. In: 2023 IEEE International Conference on Robotics and Automation (ICRA), pp. 9385\u20139392. https:\/\/doi.org\/10.1109\/ICRA48891.2023.10161420","DOI":"10.1109\/ICRA48891.2023.10161420"},{"key":"10232_CR39","doi-asserted-by":"publisher","first-page":"91","DOI":"10.1023\/B:VISI.0000029664.99615.94","volume":"60","author":"DG Lowe","year":"2004","unstructured":"Lowe, D. G. (2004). Distinctive image features from scale-invariant keypoints. Intl. J. of Computer Vision, 60, 91\u2013110. https:\/\/doi.org\/10.1023\/B:VISI.0000029664.99615.94","journal-title":"Intl. J. of Computer Vision"},{"key":"10232_CR41","doi-asserted-by":"publisher","unstructured":"Maggio, D., Abate, M., Shi, J., Mario, C., & Carlone, L. (2023). Loc-nerf: Monte carlo localization using neural radiance fields. In: 2023 IEEE International Conference on Robotics and Automation (ICRA), pp. 4018\u20134025. https:\/\/doi.org\/10.1109\/ICRA48891.2023.10160782","DOI":"10.1109\/ICRA48891.2023.10160782"},{"key":"10232_CR42","doi-asserted-by":"publisher","unstructured":"Marchand, E., Uchiyama, H., & Spindler, F. (2016). Pose estimation for augmented reality: A hands-on survey. IEEE Trans. on Visualization and Computer Graphics 22(12), 2633\u20132651. https:\/\/doi.org\/10.1109\/TVCG.2015.2513408","DOI":"10.1109\/TVCG.2015.2513408"},{"key":"10232_CR43","doi-asserted-by":"publisher","unstructured":"Martin-Brualla, R., Radwan, N., Sajjadi, M.S.M., Barron, J.T., Dosovitskiy, A., & Duckworth, D. (2021). NeRF in the Wild: Neural radiance fields for unconstrained photo collections. In: IEEE Intl. Conf. Computer Vision and Pattern Recognition (CVPR). https:\/\/doi.org\/10.1109\/CVPR46437.2021.00713","DOI":"10.1109\/CVPR46437.2021.00713"},{"key":"10232_CR44","doi-asserted-by":"publisher","unstructured":"Matsuki, H., Murai, R., Kelly, P.H.J., & Davison, A.J. (2024). Gaussian splatting SLAM. In: IEEE Intl. Conf. Computer Vision and Pattern Recognition (CVPR), pp. 18039\u201318048 https:\/\/doi.org\/10.1109\/CVPR52733.2024.01708","DOI":"10.1109\/CVPR52733.2024.01708"},{"key":"10232_CR45","doi-asserted-by":"publisher","unstructured":"Max, N. (1995). Optical models for direct volume rendering. IEEE Trans. on Visualization and Computer Graphics 1(2), 99\u2013108. https:\/\/doi.org\/10.1109\/2945.468400","DOI":"10.1109\/2945.468400"},{"key":"10232_CR46","doi-asserted-by":"publisher","unstructured":"Mildenhall, B., Srinivasan, P.P., Tancik, M., Barron, J.T., Ramamoorthi, R., & Ng, R. (2020). Nerf: Representing scenes as neural radiance fields for view synthesis. In: Eur. Conf. on Computer Vision (ECCV). https:\/\/doi.org\/10.1007\/978-3-030-58452-8_24","DOI":"10.1007\/978-3-030-58452-8_24"},{"key":"10232_CR47","doi-asserted-by":"publisher","unstructured":"M\u00fcller, T., Evans, A., Schied, C., & Keller, A. (2022). Instant neural graphics primitives with a multiresolution hash encoding. In: SIGGRAPH. https:\/\/doi.org\/10.1145\/3528223.3530127","DOI":"10.1145\/3528223.3530127"},{"key":"10232_CR48","doi-asserted-by":"publisher","unstructured":"Panek, V., Kukelova, Z., & Sattler, T. (2022). MeshLoc: Mesh-based visual localization. In: Eur. Conf. on Computer Vision (ECCV). https:\/\/doi.org\/10.1007\/978-3-031-20047-2_34","DOI":"10.1007\/978-3-031-20047-2_34"},{"key":"10232_CR49","doi-asserted-by":"publisher","unstructured":"Pascoe, G., Maddern, W., & Newman, P. (2015). Direct visual localisation and calibration for road vehicles in changing city environments. In: IEEE Intl. Conf. on Computer Vision Workshop (ICCVW), pp. 98\u2013105. https:\/\/doi.org\/10.1109\/ICCVW.2015.23","DOI":"10.1109\/ICCVW.2015.23"},{"key":"10232_CR50","doi-asserted-by":"publisher","unstructured":"Ratz, S., Dymczyk, M., Siegwart, R.Y., & Dub\u00e9, R. (2020). OneShot Global Localization: Instant lidar-visual pose estimation. In: IEEE Intl. Conf. on Robotics and Automation (ICRA), pp. 5415\u20135421. https:\/\/doi.org\/10.1109\/ICRA40945.2020.9197458","DOI":"10.1109\/ICRA40945.2020.9197458"},{"key":"10232_CR51","unstructured":"Revaud, J., Weinzaepfel, P., Souza, C.R., & Humenberger, M. (2019). R2D2: Repeatable and reliable detector and descriptor. In: Intl. Conf. on Neural Information Processing Systems (NeurIPS). Curran Associates Inc., Red Hook, NY, USA."},{"key":"10232_CR52","doi-asserted-by":"publisher","unstructured":"Sarlin, P.-E., Cadena, C., Siegwart, R., & Dymczyk, M. (2019). From Coarse to Fine: Robust hierarchical localization at large scale. In: IEEE Intl. Conf. Computer Vision and Pattern Recognition (CVPR) https:\/\/doi.org\/10.1109\/CVPR.2019.01300.","DOI":"10.1109\/CVPR.2019.01300"},{"key":"10232_CR53","doi-asserted-by":"publisher","unstructured":"Sarlin, P.-E., DeTone, D., Malisiewicz, T., & Rabinovich, A. (2020). SuperGlue: Learning feature matching with graph neuralnetworks. In: IEEE Intl. Conf. Computer Vision and Pattern Recognition (CVPR). https:\/\/doi.org\/10.1109\/CVPR42600.2020.00499","DOI":"10.1109\/CVPR42600.2020.00499"},{"issue":"9","key":"10232_CR54","doi-asserted-by":"publisher","first-page":"1744","DOI":"10.1109\/TPAMI.2016.2611662","volume":"39","author":"T Sattler","year":"2017","unstructured":"Sattler, T., Leibe, B., & Kobbelt, L. (2017). Efficient & effective prioritized matching for large-scale image-based localization. IEEE Trans. Pattern Anal. Mach. Intell., 39(9), 1744\u20131756. https:\/\/doi.org\/10.1109\/TPAMI.2016.2611662","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"10232_CR55","doi-asserted-by":"publisher","unstructured":"Sch\u00f6nberger, J.L., & Frahm, J.-M. (2016). Structure-from-motion revisited. In: IEEE Intl. Conf. Computer Vision and Pattern Recognition (CVPR) . https:\/\/doi.org\/10.1109\/CVPR.2016.445","DOI":"10.1109\/CVPR.2016.445"},{"key":"10232_CR56","unstructured":"Shubodh, S., Omama, M., Zaidi, H., Parihar, U.S., & Krishna, M. (2024). Lip-loc: Lidar image pretraining for cross-modal localization. In: IEEE\/CVF Winter Conference on Applications of Computer Vision (WACV) Workshops, pp. 948\u2013957."},{"key":"10232_CR57","doi-asserted-by":"publisher","unstructured":"Tancik, M., Weber, E., Ng, E., Li, R., Yi, B., Wang, T., Kristoffersen, A., Austin, J., Salahi, K., & Ahuja, A., etal.: (2023). Nerfstudio: A modular framework for neural radiance field development. In: SIGGRAPH. https:\/\/doi.org\/10.1145\/3588432.3591516","DOI":"10.1145\/3588432.3591516"},{"key":"10232_CR58","doi-asserted-by":"crossref","unstructured":"Tao, Y., Mu\u00f1oz-Ba\u00f1\u00f3n, M.\u00c1., Zhang, L., Wang, J., Fu, L.F.T., & Fallon, M. (2025). The oxford spires dataset: Benchmarking large-scale lidar-visual localisation, reconstruction and radiance field methods. Intl. J. of Robot. Res.","DOI":"10.1177\/02783649251369905"},{"key":"10232_CR59","doi-asserted-by":"publisher","unstructured":"Tao, Y., Bhalgat, Y., Fu, L.F.T., Mattamala, M., Chebrolu, N., & Fallon, M. (2024). SiLVR: Scalable lidar-visual reconstruction with neural radiance fields for robotic inspection. In: IEEE Intl. Conf. on Robotics and Automation (ICRA). https:\/\/doi.org\/10.1109\/ICRA57147.2024.10611278","DOI":"10.1109\/ICRA57147.2024.10611278"},{"key":"10232_CR60","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2015.2409868","author":"A Torii","year":"2015","unstructured":"Torii, A., Sivic, J., Okutomi, M., & Pajdla, T. (2015). Visual place recognition with repetitive structures. IEEE Trans. Pattern Anal. Mach. Intell. https:\/\/doi.org\/10.1109\/TPAMI.2015.2409868","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell."},{"key":"10232_CR61","doi-asserted-by":"crossref","unstructured":"Trivigno, G., Masone, C., Caputo, B., & Sattler, T. (2024). The Unreasonable Effectiveness of Pre-Trained Features for Camera Pose Refinement https:\/\/arxiv.org\/abs\/2404.10438","DOI":"10.1109\/CVPR52733.2024.01215"},{"key":"10232_CR62","doi-asserted-by":"publisher","unstructured":"Uy, M.A., & Lee, G.H. (2018). PointNetVLAD: Deep point cloud based retrieval for large-scale place recognition. In: IEEE Intl. Conf. Computer Vision and Pattern Recognition (CVPR) . https:\/\/doi.org\/10.1109\/CVPR.2018.00470","DOI":"10.1109\/CVPR.2018.00470"},{"key":"10232_CR63","doi-asserted-by":"publisher","unstructured":"Vidanapathirana, K., Moghadam, P., Harwood, B., Zhao, M., Sridharan, S., & Fookes, C. (2020). Locus: Lidar-based place recognition using spatiotemporal higher-order pooling. In: IEEE Intl. Conf. on Robotics and Automation (ICRA), pp. 5075\u20135081. https:\/\/doi.org\/10.1109\/ICRA48506.2021.9560915","DOI":"10.1109\/ICRA48506.2021.9560915"},{"key":"10232_CR64","doi-asserted-by":"publisher","unstructured":"Vidanapathirana, K., Ramezani, M., Moghadam, P., Sridharan, S., & Fookes, C. (2022). LoGG3D-Net: Locally guided global descriptor learning for 3D place recognition. In: IEEE Intl. Conf. on Robotics and Automation (ICRA), pp. 2215\u20132221. https:\/\/doi.org\/10.1109\/ICRA46639.2022.9811753","DOI":"10.1109\/ICRA46639.2022.9811753"},{"key":"10232_CR40","doi-asserted-by":"publisher","first-page":"395","DOI":"10.1007\/S11222-007-9033-Z","volume":"17","author":"U Von Luxburg","year":"2007","unstructured":"Von Luxburg, U. (2007). A tutorial on spectral clustering. Statistics Comput, 17, 395\u2013416. https:\/\/doi.org\/10.1007\/S11222-007-9033-Z","journal-title":"Statistics Comput"},{"key":"10232_CR65","doi-asserted-by":"publisher","first-page":"4643","DOI":"10.1109\/TRO.2025.3588454","volume":"41","author":"G Wang","year":"2025","unstructured":"Wang, G., Zheng, Y., Wu, Y., Guo, Y., Liu, Z., Zhu, Y., Burgard, W., & Wang, H. (2025). End-to-end 2d\u20133d registration between image and lidar point cloud for vehicle localization. IEEE Transactions on Robotics, 41, 4643\u20134662. https:\/\/doi.org\/10.1109\/TRO.2025.3588454","journal-title":"IEEE Transactions on Robotics"},{"key":"10232_CR66","doi-asserted-by":"publisher","unstructured":"Wolcott, R.W., & Eustice, R.M. (2014). Visual localization within lidar maps for automated urban driving. In: IEEE\/RSJ Intl. Conf. on Intelligent Robots and Systems (IROS), pp. 176\u2013183. https:\/\/doi.org\/10.1109\/IROS.2014.6942558","DOI":"10.1109\/IROS.2014.6942558"},{"issue":"4","key":"10232_CR67","doi-asserted-by":"publisher","first-page":"2053","DOI":"10.1109\/TRO.2022.3141876","volume":"38","author":"W Xu","year":"2022","unstructured":"Xu, W., Cai, Y., He, D., Lin, J., & Zhang, F. (2022). FAST-LIO2: Fast direct lidar-inertial odometry. IEEE Trans. Robotics, 38(4), 2053\u20132073. https:\/\/doi.org\/10.1109\/TRO.2022.3141876","journal-title":"IEEE Trans. Robotics"},{"key":"10232_CR68","doi-asserted-by":"publisher","DOI":"10.1016\/j.autcon.2022.104641","volume":"146","author":"H Yin","year":"2023","unstructured":"Yin, H., Lin, Z., & Yeoh, J. K. W. (2023). Semantic localization on BIM-generated maps using a 3D lidar sensor. Automation in Construction, 146, Article 104641. https:\/\/doi.org\/10.1016\/j.autcon.2022.104641","journal-title":"Automation in Construction"},{"key":"10232_CR69","doi-asserted-by":"publisher","unstructured":"Yu, H., Zhen, W., Yang, W., Zhang, J., & Scherer, S. (2020). Monocular camera localization in prior lidar maps with 2D-3D line correspondences. IEEE\/RSJ Intl. Conf. on Intelligent Robots and Systems (IROS), 4588\u20134594. https:\/\/doi.org\/10.1109\/IROS45743.2020.9341690","DOI":"10.1109\/IROS45743.2020.9341690"},{"key":"10232_CR70","unstructured":"Yu, Z., Peng, S., Niemeyer, M., Sattler, T., & Geiger, A. (2022). MonoSDF: Exploring monocular geometric cues for neural implicit surface reconstruction. Intl. Conf. on Neural Information Processing Systems (NeurIPS)."},{"key":"10232_CR71","doi-asserted-by":"publisher","unstructured":"Zeisl, B., Sattler, T., & Pollefeys, M. (2015). Camera pose voting for large-scale image-based localization. In: Intl. Conf. on Computer Vision (ICCV), pp. 2704\u20132712. https:\/\/doi.org\/10.1109\/ICCV.2015.310","DOI":"10.1109\/ICCV.2015.310"},{"key":"10232_CR72","doi-asserted-by":"publisher","unstructured":"Zhang, Z., Sattler, T., & Scaramuzza, D. (2021). Reference pose generation for long-term visual localization via learned features and view synthesis. ijcv 129(4), 821\u2013844. https:\/\/doi.org\/10.1007\/s11263-020-01399-8","DOI":"10.1007\/s11263-020-01399-8"},{"key":"10232_CR73","doi-asserted-by":"publisher","unstructured":"Zhang, L., Tejaswi\u00a0Digumarti, S., Tinchev, G., & Fallon, M. (2023). InstaLoc: One-shot global lidar localisation in indoor environments through instance learning. In: Robotics: Science and Systems (RSS). https:\/\/doi.org\/10.15607\/RSS.2023.XIX.070","DOI":"10.15607\/RSS.2023.XIX.070"},{"key":"10232_CR74","unstructured":"Zhang, L., Camurri, M., Wisth, D., & Fallon, M. (2021). Multi-Camera LiDAR Inertial Extension to the Newer College Dataset. arXiv:2112.08854."},{"key":"10232_CR75","doi-asserted-by":"publisher","unstructured":"Zhou, Q.-Y., Park, J., & Koltun, V. (2018). Open3D: A modern library for 3D data processing. arXiv:1801.09847. https:\/\/doi.org\/10.48550\/ARXIV.1801.09847","DOI":"10.48550\/ARXIV.1801.09847"},{"issue":"6","key":"10232_CR76","doi-asserted-by":"publisher","first-page":"1003","DOI":"10.1002\/ROB.21936","volume":"37","author":"X Zuo","year":"2020","unstructured":"Zuo, X., Ye, W., Yang, Y., Zheng, R., Vidal-Calleja, T., Huang, G., & Liu, Y. (2020). Multimodal localization: Stereo over LiDAR map. J. Field Robot., 37(6), 1003\u20131026. https:\/\/doi.org\/10.1002\/ROB.21936","journal-title":"J. Field Robot."}],"container-title":["Autonomous Robots"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10514-025-10232-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10514-025-10232-5","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10514-025-10232-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,3,25]],"date-time":"2026-03-25T15:02:24Z","timestamp":1774450944000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10514-025-10232-5"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,3]]},"references-count":76,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2026,3]]}},"alternative-id":["10232"],"URL":"https:\/\/doi.org\/10.1007\/s10514-025-10232-5","relation":{"has-preprint":[{"id-type":"doi","id":"10.21203\/rs.3.rs-5293991\/v1","asserted-by":"object"}]},"ISSN":["0929-5593","1573-7527"],"issn-type":[{"value":"0929-5593","type":"print"},{"value":"1573-7527","type":"electronic"}],"subject":[],"published":{"date-parts":[[2026,3]]},"assertion":[{"value":"19 October 2024","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"16 September 2025","order":2,"name":"revised","label":"Revised","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"24 November 2025","order":3,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"10 March 2026","order":4,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare no Conflict of interest.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}],"article-number":"14"}}