{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,28]],"date-time":"2026-03-28T03:11:26Z","timestamp":1774667486005,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":80,"publisher":"ACM","license":[{"start":{"date-parts":[[2019,9,20]],"date-time":"2019-09-20T00:00:00Z","timestamp":1568937600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2019,9,20]]},"DOI":"10.1145\/3366194.3366211","type":"proceedings-article","created":{"date-parts":[[2019,11,20]],"date-time":"2019-11-20T13:56:52Z","timestamp":1574258212000},"page":"94-105","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":19,"title":["A review of Visual-Based Localization"],"prefix":"10.1145","author":[{"given":"Xing","family":"Xin","sequence":"first","affiliation":[{"name":"College of Systems Engineering, National University of Defense Technology, Changsha, China"}]},{"given":"Jie","family":"Jiang","sequence":"additional","affiliation":[{"name":"College of Systems Engineering, National University of Defense Technology, Changsha, China"}]},{"given":"Yin","family":"Zou","sequence":"additional","affiliation":[{"name":"College of Systems Engineering, National University of Defense Technology, Changsha, China"}]}],"member":"320","published-online":{"date-parts":[[2019,9,20]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2017.09.013"},{"key":"e_1_3_2_1_2_1","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 4104--4113)","author":"Schonberger J. L.","unstructured":"Schonberger , J. L. , and Frahm , J. M . 2016. Structure-from-motion revisited . 
In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 4104--4113) . Schonberger, J. L., and Frahm, J. M. 2016. Structure-from-motion revisited. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 4104--4113)."},{"key":"e_1_3_2_1_3_1","series-title":"Vol. 1","volume-title":"Robotics: Science and Systems","author":"Lynen S.","year":"2015","unstructured":"Lynen , S. , Sattler , T. , Bosse , M. , Hesch , J. A. , Pollefeys , M. , and Siegwart , R . 2015 . Get Out of My Lab: Large-scale , Real-Time Visual-Inertial Localization . In Robotics: Science and Systems ( Vol. 1 ). Lynen, S., Sattler, T., Bosse, M., Hesch, J. A., Pollefeys, M., and Siegwart, R. 2015. Get Out of My Lab: Large-scale, Real-Time Visual-Inertial Localization. In Robotics: Science and Systems (Vol. 1)."},{"key":"e_1_3_2_1_4_1","volume-title":"2013 IEEE Intelligent Vehicles Symposium (IV) (pp. 449--454)","author":"Schreiber M.","unstructured":"Schreiber , M. , Kn\u00f6ppel , C. , and Franke , U . 2013. Laneloc: Lane marking based localization using highly accurate maps . In 2013 IEEE Intelligent Vehicles Symposium (IV) (pp. 449--454) . Schreiber, M., Kn\u00f6ppel, C., and Franke, U. 2013. Laneloc: Lane marking based localization using highly accurate maps. In 2013 IEEE Intelligent Vehicles Symposium (IV) (pp. 449--454)."},{"key":"e_1_3_2_1_5_1","volume-title":"Segmatch: Segment based loop-closure for 3d point clouds. In ICRA, arXiv preprint arXiv:1609.07720.","author":"Dub\u00e9 R.","year":"2016","unstructured":"Dub\u00e9 , R. , Dugas , D. , Stumm , E. , Nieto , J. , Siegwart , R. , and Cadena , C . 2016 . Segmatch: Segment based loop-closure for 3d point clouds. In ICRA, arXiv preprint arXiv:1609.07720. Dub\u00e9, R., Dugas, D., Stumm, E., Nieto, J., Siegwart, R., and Cadena, C. 2016. Segmatch: Segment based loop-closure for 3d point clouds. 
In ICRA, arXiv preprint arXiv:1609.07720."},{"key":"e_1_3_2_1_6_1","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 5297--5307)","author":"Arandjelovic R.","unstructured":"Arandjelovic , R. , Gronat , P. , Torii , A. , Pajdla , T. , and Sivic , J . 2016. NetVLAD: CNN architecture for weakly supervised place recognition . In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 5297--5307) . Arandjelovic, R., Gronat, P., Torii, A., Pajdla, T., and Sivic, J. 2016. NetVLAD: CNN architecture for weakly supervised place recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 5297--5307)."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"crossref","unstructured":"Mur-Artal R. Montiel J. M. M. and Tardos J. D. 2015. ORB-SLAM: a versatile and accurate monocular SLAM system. IEEE transactions on robotics 31(5) 1147--1163.  Mur-Artal R. Montiel J. M. M. and Tardos J. D. 2015. ORB-SLAM: a versatile and accurate monocular SLAM system. IEEE transactions on robotics 31(5) 1147--1163.","DOI":"10.1109\/TRO.2015.2463671"},{"key":"e_1_3_2_1_8_1","volume-title":"2017 IEEE International Conference on Robotics and Automation (ICRA) (pp. 3223--3230)","author":"Chen Z.","unstructured":"Chen , Z. , Jacobson , A. , S\u00fcnderhauf , N. , and Milford , M . 2017. Deep learning features at scale for visual place recognition . In 2017 IEEE International Conference on Robotics and Automation (ICRA) (pp. 3223--3230) . Chen, Z., Jacobson, A., S\u00fcnderhauf, N., and Milford, M. 2017. Deep learning features at scale for visual place recognition. In 2017 IEEE International Conference on Robotics and Automation (ICRA) (pp. 3223--3230)."},{"key":"e_1_3_2_1_9_1","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 8601--8610)","author":"Sattler T.","unstructured":"Sattler , T. , Maddern , W. , Toft , C. , Torii , A. and Kahl , F . 2018. 
Benchmarking 6dof outdoor visual localization in changing conditions . In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 8601--8610) . Sattler, T., Maddern, W., Toft, C., Torii, A. and Kahl, F. 2018. Benchmarking 6dof outdoor visual localization in changing conditions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 8601--8610)."},{"key":"e_1_3_2_1_10_1","volume-title":"European conference on computer vision (pp. 15--29)","author":"Li Y.","unstructured":"Li , Y. , Snavely , N. , Huttenlocher , D. , and Fua , P . 2012. Worldwide pose estimation using 3d point clouds . In European conference on computer vision (pp. 15--29) . Li, Y., Snavely, N., Huttenlocher, D., and Fua, P. 2012. Worldwide pose estimation using 3d point clouds. In European conference on computer vision (pp. 15--29)."},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"crossref","unstructured":"Lowe D. G. 2004. Distinctive image features from scale-invariant keypoints. International journal of computer vision 60(2) 91--110.  Lowe D. G. 2004. Distinctive image features from scale-invariant keypoints. International journal of computer vision 60(2) 91--110.","DOI":"10.1023\/B:VISI.0000029664.99615.94"},{"key":"e_1_3_2_1_12_1","volume-title":"European Conference on Computer Vision (pp. 467--483)","author":"Yi K. M.","unstructured":"Yi , K. M. , Trulls , E. , Lepetit , V. , and Fua , P . 2016. Lift: Learned invariant feature transform . In European Conference on Computer Vision (pp. 467--483) . Yi, K. M., Trulls, E., Lepetit, V., and Fua, P. 2016. Lift: Learned invariant feature transform. In European Conference on Computer Vision (pp. 467--483)."},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10044-017-0611-1"},{"key":"e_1_3_2_1_14_1","volume-title":"Cham: Springer.","author":"Hakeem A.","year":"2016","unstructured":"Hakeem , A. , Zamir , L. , Van Gool , Shah, M., and Szeliski , R . 2016 . 
Large-scale visual geo-localization. Cham: Springer. Hakeem, A., Zamir, L., Van Gool, Shah, M., and Szeliski, R. 2016. Large-scale visual geo-localization. Cham: Springer."},{"key":"e_1_3_2_1_15_1","volume-title":"2012 IEEE Conference on Computer Vision and Pattern Recognition (pp. 2911--2918)","author":"Arandjelovi\u0107 R.","unstructured":"Arandjelovi\u0107 , R. , and Zisserman , A . 2012. Three things everyone should know to improve object retrieval . In 2012 IEEE Conference on Computer Vision and Pattern Recognition (pp. 2911--2918) . Arandjelovi\u0107, R., and Zisserman, A. 2012. Three things everyone should know to improve object retrieval. In 2012 IEEE Conference on Computer Vision and Pattern Recognition (pp. 2911--2918)."},{"key":"e_1_3_2_1_16_1","volume-title":"CVPR 2010-23rd IEEE Conference on Computer Vision & Pattern Recognition (pp. 3304--3311)","author":"J\u00e9gou H.","unstructured":"J\u00e9gou , H. , Douze , M. , Schmid , C. , and P\u00e9rez , P . 2010. Aggregating local descriptors into a compact image representation . In CVPR 2010-23rd IEEE Conference on Computer Vision & Pattern Recognition (pp. 3304--3311) . J\u00e9gou, H., Douze, M., Schmid, C., and P\u00e9rez, P. 2010. Aggregating local descriptors into a compact image representation. In CVPR 2010-23rd IEEE Conference on Computer Vision & Pattern Recognition (pp. 3304--3311)."},{"key":"e_1_3_2_1_17_1","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 1808--1817)","author":"Torii A.","unstructured":"Torii , A. , Arandjelovic , R. , Sivic , J. , Okutomi , M. , and Pajdla , T . 2015. 24\/7 place recognition by view synthesis . In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 1808--1817) . Torii, A., Arandjelovic, R., Sivic, J., Okutomi, M., and Pajdla, T. 2015. 24\/7 place recognition by view synthesis. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 
1808--1817)."},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.139"},{"key":"e_1_3_2_1_19_1","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 3310--3317)","author":"J\u00e9gou H.","unstructured":"J\u00e9gou , H. , and Zisserman , A . 2014. Triangulation embedding and democratic aggregation for image search . In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 3310--3317) . J\u00e9gou, H., and Zisserman, A. 2014. Triangulation embedding and democratic aggregation for image search. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 3310--3317)."},{"key":"e_1_3_2_1_20_1","volume-title":"FAB-MAP: Probabilistic Localization and Mapping in the Space of Appearance. IJRR, 27(6):647--665","author":"Cummins M.","year":"2008","unstructured":"Cummins , M. , and Newman , P . FAB-MAP: Probabilistic Localization and Mapping in the Space of Appearance. IJRR, 27(6):647--665 , 2008 . Cummins, M., and Newman, P. FAB-MAP: Probabilistic Localization and Mapping in the Space of Appearance. IJRR, 27(6):647--665, 2008."},{"key":"e_1_3_2_1_21_1","first-page":"1","volume-title":"British Machine Vision Conference (BMVC). No. 2.","author":"Azzi C.","unstructured":"Azzi , C. , Asmar , D. , Fakih , A. , and Zelek , J ., 2016. Filtering 3D Keypoints Using GIST For Accurate Image-Based Localization . In: British Machine Vision Conference (BMVC). No. 2. pp. 1 -- 12 . Azzi, C., Asmar, D., Fakih, A., and Zelek, J., 2016. Filtering 3D Keypoints Using GIST For Accurate Image-Based Localization. In: British Machine Vision Conference (BMVC). No. 2. pp. 1--12."},{"key":"e_1_3_2_1_22_1","first-page":"3","volume-title":"Proceedings of the IEEE European Conference on Computer Vision (ECCV).","volume":"9905","author":"Radenovi\u0107 F.","unstructured":"Radenovi\u0107 , F. , Tolias , G. , and Chum , O ., 2016. 
CNN Image Retrieval Learns from BoW: Unsupervised Fine-Tuning with Hard Examples . In: Proceedings of the IEEE European Conference on Computer Vision (ECCV). Vol. 9905 . pp. 3 -- 20 Radenovi\u0107, F., Tolias, G., and Chum, O., 2016. CNN Image Retrieval Learns from BoW: Unsupervised Fine-Tuning with Hard Examples. In: Proceedings of the IEEE European Conference on Computer Vision (ECCV). Vol. 9905. pp. 3--20"},{"key":"e_1_3_2_1_23_1","volume-title":"Video Google: A text retrieval approach to object matching in videos. In null (pp. 1470--1477).","author":"Sivic J.","year":"2003","unstructured":"Sivic , J. , and Zisserman , A . 2003 . Video Google: A text retrieval approach to object matching in videos. In null (pp. 1470--1477). Sivic, J., and Zisserman, A. 2003. Video Google: A text retrieval approach to object matching in videos. In null (pp. 1470--1477)."},{"key":"e_1_3_2_1_24_1","volume-title":"Lost in Quantization: Improving Particular Object Retrieval in Large Scale Image Databases. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).","author":"Philbin J.","unstructured":"Philbin , J. , Chum , O. , Isard , M. , Sivic , J. , and Zisserman , A ., 2008 . Lost in Quantization: Improving Particular Object Retrieval in Large Scale Image Databases. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Philbin, J., Chum, O., Isard, M., Sivic, J., and Zisserman, A., 2008. Lost in Quantization: Improving Particular Object Retrieval in Large Scale Image Databases. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)."},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2011.235"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2015.2409868"},{"key":"e_1_3_2_1_27_1","volume-title":"Proceedings of Robotics: Science and Systems XII.","author":"S\u00fcnderhauf N.","unstructured":"S\u00fcnderhauf , N. , Shirazi , S. 
, Jacobson , A. , Dayoub , F. , Pepperell , E. , Upcroft , B. , and Milford , M . 2015. Place recognition with convnet landmarks: Viewpoint-robust, condition-robust, training-free . Proceedings of Robotics: Science and Systems XII. S\u00fcnderhauf, N., Shirazi, S., Jacobson, A., Dayoub, F., Pepperell, E., Upcroft, B., and Milford, M. 2015. Place recognition with convnet landmarks: Viewpoint-robust, condition-robust, training-free. Proceedings of Robotics: Science and Systems XII."},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-014-0774-9"},{"key":"e_1_3_2_1_29_1","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2930--2937","author":"Shotton J.","unstructured":"Shotton , J. , Glocker , B. , Zach , C. , Izadi , S. , Criminisi , A. , and Fitzgibbon , A ., 2013. Scene coordinate regression forests for camera relocalization in RGB-D images . Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2930--2937 Shotton, J., Glocker, B., Zach, C., Izadi, S., Criminisi, A., and Fitzgibbon, A., 2013. Scene coordinate regression forests for camera relocalization in RGB-D images. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2930--2937"},{"key":"e_1_3_2_1_30_1","first-page":"1","volume-title":"Multi-Output Learning for Camera Relocalization. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).","author":"Guzman-Rivera A.","unstructured":"Guzman-Rivera , A. , Pushmeet , K. , Glocker , B. , and Shotton , J ., 2014 . Multi-Output Learning for Camera Relocalization. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). pp. 1 -- 6 . Guzman-Rivera, A., Pushmeet, K., Glocker, B., and Shotton, J., 2014. Multi-Output Learning for Camera Relocalization. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). pp. 
1--6."},{"key":"e_1_3_2_1_31_1","first-page":"4400","volume-title":"Exploiting Uncertainty in Regression Forests for Accurate Camera Relocalization. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).","author":"Valentin J.","unstructured":"Valentin , J. , Fitzgibbon , A. , Shotton , J. , and Torr , P. H. S. , 2015 . Exploiting Uncertainty in Regression Forests for Accurate Camera Relocalization. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). pp. 4400 -- 4408 . Valentin, J., Fitzgibbon, A., Shotton, J., and Torr, P. H. S., 2015. Exploiting Uncertainty in Regression Forests for Accurate Camera Relocalization. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). pp. 4400--4408."},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.5244\/C.30.59"},{"key":"e_1_3_2_1_33_1","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 3364--3372)","author":"Brachmann E.","unstructured":"Brachmann , E. , Michel , F. , Krull , A. , Ying Yang , M. , and Gumhold , S . 2016. Uncertainty-driven 6D pose estimation of objects and scenes from a single RGB image . In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 3364--3372) . Brachmann, E., Michel, F., Krull, A., Ying Yang, M., and Gumhold, S. 2016. Uncertainty-driven 6D pose estimation of objects and scenes from a single RGB image. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 3364--3372)."},{"key":"e_1_3_2_1_34_1","volume-title":"2017 IEEE International Conference on Robotics and Automation (ICRA) (pp. 5118--5125)","author":"Massiceti D.","unstructured":"Massiceti , D. , Krull , A. , Brachmann , E. , Rother , C. , and Torr , P. H . 2017. Random forests versus Neural Networks---What's best for camera localization? In 2017 IEEE International Conference on Robotics and Automation (ICRA) (pp. 
5118--5125) . Massiceti, D., Krull, A., Brachmann, E., Rother, C., and Torr, P. H. 2017. Random forests versus Neural Networks---What's best for camera localization? In 2017 IEEE International Conference on Robotics and Automation (ICRA) (pp. 5118--5125)."},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"crossref","unstructured":"Glocker B. Shotton J. Criminisi A. and Izadi S. 2014. Real-time RGB-D camera relocalization via randomized ferns for keyframe encoding. IEEE transactions on visualization and computer graphics 21(5) 571--583.  Glocker B. Shotton J. Criminisi A. and Izadi S. 2014. Real-time RGB-D camera relocalization via randomized ferns for keyframe encoding. IEEE transactions on visualization and computer graphics 21(5) 571--583.","DOI":"10.1109\/TVCG.2014.2360403"},{"key":"e_1_3_2_1_36_1","volume-title":"On-the-Fly Adaptation of Regression Forests for Online Camera Relocalisation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).","author":"Cavallari T.","unstructured":"Cavallari , T. , Golodetz , S. , Lord , N. A. , and Valentin , J ., 2017 . On-the-Fly Adaptation of Regression Forests for Online Camera Relocalisation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Cavallari, T., Golodetz, S., Lord, N. A., and Valentin, J., 2017. On-the-Fly Adaptation of Regression Forests for Online Camera Relocalisation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)."},{"key":"e_1_3_2_1_37_1","unstructured":"Krizhevsky A. Sutskever I. and Hinton G. E. 2012. Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems (pp. 1097--1105).  Krizhevsky A. Sutskever I. and Hinton G. E. 2012. Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems (pp. 
1097--1105)."},{"key":"e_1_3_2_1_38_1","volume-title":"Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 770--778)","author":"He K.","unstructured":"He , K. , Zhang , X. , Ren , S. , and Sun , J . 2016. Deep residual learning for image recognition . In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 770--778) . He, K., Zhang, X., Ren, S., and Sun, J. 2016. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 770--778)."},{"key":"e_1_3_2_1_39_1","volume-title":"Fast R-CNN. In Proceedings of the IEEE international conference on computer vision (pp. 1440--1448)","author":"Girshick R.","year":"2015","unstructured":"Girshick , R. 2015 . Fast R-CNN. In Proceedings of the IEEE international conference on computer vision (pp. 1440--1448) . Girshick, R. 2015. Fast R-CNN. In Proceedings of the IEEE international conference on computer vision (pp. 1440--1448)."},{"key":"e_1_3_2_1_40_1","volume-title":"2016 IEEE international conference on Robotics and Automation (ICRA) (pp. 4762--4769)","author":"Kendall A.","unstructured":"Kendall , A. , and Cipolla , R . 2016. Modelling uncertainty in deep learning for camera relocalization . In 2016 IEEE international conference on Robotics and Automation (ICRA) (pp. 4762--4769) . Kendall, A., and Cipolla, R. 2016. Modelling uncertainty in deep learning for camera relocalization. In 2016 IEEE international conference on Robotics and Automation (ICRA) (pp. 4762--4769)."},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"crossref","unstructured":"Kendall A. Grimes M. and Cipolla R. 2015. Convolutional networks for real-time 6-DOF camera relocalization. CoRR abs\/1505.07427.  Kendall A. Grimes M. and Cipolla R. 2015. Convolutional networks for real-time 6-DOF camera relocalization. 
CoRR abs\/1505.07427.","DOI":"10.1109\/ICCV.2015.336"},{"key":"e_1_3_2_1_42_1","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 5974--5983)","author":"Kendall A.","unstructured":"Kendall , A. , and Cipolla , R . 2017. Geometric loss functions for camera pose regression with deep learning . In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 5974--5983) . Kendall, A., and Cipolla, R. 2017. Geometric loss functions for camera pose regression with deep learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 5974--5983)."},{"key":"e_1_3_2_1_43_1","unstructured":"Walch F. Hazirbas C. Leal-Taix\u00e9 L. Sattler T. and Cremers D. 2016. Image-based localization with spatial lstms. arXiv preprint arXiv:1611.07890 2(6).  Walch F. Hazirbas C. Leal-Taix\u00e9 L. Sattler T. and Cremers D. 2016. Image-based localization with spatial lstms. arXiv preprint arXiv:1611.07890 2(6)."},{"key":"e_1_3_2_1_45_1","volume-title":"Depth-Based Local Feature Selection for Mobile Visual Search. In: Proceedings of the IEEE International Conference on Image Processing (ICIP).","author":"Liu Z.","unstructured":"Liu , Z. , Duan , L.-Y. , Chen , J. , and Huang , T ., 2016 . Depth-Based Local Feature Selection for Mobile Visual Search. In: Proceedings of the IEEE International Conference on Image Processing (ICIP). Liu, Z., Duan, L.-Y., Chen, J., and Huang, T., 2016. Depth-Based Local Feature Selection for Mobile Visual Search. In: Proceedings of the IEEE International Conference on Image Processing (ICIP)."},{"key":"e_1_3_2_1_46_1","unstructured":"Jia D. Su Y. and Li C. 2016. Deep Convolutional Neural Network for 6-DOF Image Localization. arXiv preprint (413113) 1790--1798.  Jia D. Su Y. and Li C. 2016. Deep Convolutional Neural Network for 6-DOF Image Localization. 
arXiv preprint (413113) 1790--1798."},{"key":"e_1_3_2_1_47_1","doi-asserted-by":"crossref","unstructured":"Contreras L. and Mayol-Cuevas W. 2017. Towards CNN Map Compression for camera relocalisation. arXiv preprint arXiv:1703.00845.  Contreras L. and Mayol-Cuevas W. 2017. Towards CNN Map Compression for camera relocalisation. arXiv preprint arXiv:1703.00845.","DOI":"10.1109\/CVPRW.2018.00067"},{"key":"e_1_3_2_1_48_1","first-page":"37","volume-title":"PlaNet - Photo Geolocation with Convolutional Neural Networks. In: Proceedings of the IEEE European Conference on Computer Vision (ECCV).","volume":"9905","author":"Weyand T.","unstructured":"Weyand , T. , Kostrikov , I. , and Philbin , J ., 2016 . PlaNet - Photo Geolocation with Convolutional Neural Networks. In: Proceedings of the IEEE European Conference on Computer Vision (ECCV). Vol. 9905 . pp. 37 -- 55 . Weyand, T., Kostrikov, I., and Philbin, J., 2016. PlaNet - Photo Geolocation with Convolutional Neural Networks. In: Proceedings of the IEEE European Conference on Computer Vision (ECCV). Vol. 9905. pp. 37--55."},{"key":"e_1_3_2_1_49_1","volume-title":"2018 IEEE International Conference on Robotics and Automation (ICRA) (pp. 6939--6946)","author":"Valada A.","unstructured":"Valada , A. , Radwan , N. , and Burgard , W . 2018, May. Deep auxiliary learning for visual localization and odometry . In 2018 IEEE International Conference on Robotics and Automation (ICRA) (pp. 6939--6946) . Valada, A., Radwan, N., and Burgard, W. 2018, May. Deep auxiliary learning for visual localization and odometry. In 2018 IEEE International Conference on Robotics and Automation (ICRA) (pp. 6939--6946)."},{"key":"e_1_3_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1109\/LRA.2018.2869640"},{"key":"e_1_3_2_1_51_1","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 12716--12725)","author":"Sarlin P. E.","unstructured":"Sarlin , P. E. , Cadena , C. , Siegwart , R. , and Dymczyk , M . 
2019. From coarse to fine: Robust hierarchical localization at large scale . In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 12716--12725) . Sarlin, P. E., Cadena, C., Siegwart, R., and Dymczyk, M. 2019. From coarse to fine: Robust hierarchical localization at large scale. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 12716--12725)."},{"key":"e_1_3_2_1_52_1","doi-asserted-by":"crossref","unstructured":"Bui M. Baur C. Navab N. Ilic S. and Albarqouni S. 2019. Adversarial Joint Image and Pose Distribution Learning for Camera Pose Regression and Refinement. arXiv preprint arXiv:1903.06646.  Bui M. Baur C. Navab N. Ilic S. and Albarqouni S. 2019. Adversarial Joint Image and Pose Distribution Learning for Camera Pose Regression and Refinement. arXiv preprint arXiv:1903.06646.","DOI":"10.1109\/ICCVW.2019.00470"},{"key":"e_1_3_2_1_53_1","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 3302--3312)","author":"Sattler T.","unstructured":"Sattler , T. , Zhou , Q. , Pollefeys , M. , and Leal-Taixe , L . 2019. Understanding the Limitations of CNN-based Absolute Camera Pose Regression . In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 3302--3312) . Sattler, T., Zhou, Q., Pollefeys, M., and Leal-Taixe, L. 2019. Understanding the Limitations of CNN-based Absolute Camera Pose Regression. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 3302--3312)."},{"key":"e_1_3_2_1_54_1","doi-asserted-by":"crossref","unstructured":"Cavallari T. Bertinetto L. Mukhoti J. Torr P. and Golodetz S. 2019. Let's Take This Online: Adapting Scene Coordinate Regression Network Predictions for Online RGB-D Camera Relocalisation. arXiv preprint arXiv:1906.08744.  Cavallari T. Bertinetto L. Mukhoti J. Torr P. and Golodetz S. 2019. 
Let's Take This Online: Adapting Scene Coordinate Regression Network Predictions for Online RGB-D Camera Relocalisation. arXiv preprint arXiv:1906.08744.","DOI":"10.1109\/3DV.2019.00068"},{"key":"e_1_3_2_1_55_1","volume-title":"Proceedings of the IEEE International Conference on Computer Vision.","author":"Zeisl B.","unstructured":"Zeisl , B. , Sattler , T. , and Pollefeys , M . 2015. Camera Pose Voting for Large-Scale ImageBased Localization . In Proceedings of the IEEE International Conference on Computer Vision. Zeisl, B., Sattler, T., and Pollefeys, M. 2015. Camera Pose Voting for Large-Scale ImageBased Localization. In Proceedings of the IEEE International Conference on Computer Vision."},{"key":"e_1_3_2_1_56_1","volume-title":"Proceedings of the IEEE International Conference on Computer Vision (pp. 2372--2381)","author":"Liu L.","unstructured":"Liu , L. , Li , H. , and Dai , Y . 2017. Efficient global 2d-3d matching for camera localization in a large-scale 3d map . In Proceedings of the IEEE International Conference on Computer Vision (pp. 2372--2381) . Liu, L., Li, H., and Dai, Y. 2017. Efficient global 2d-3d matching for camera localization in a large-scale 3d map. In Proceedings of the IEEE International Conference on Computer Vision (pp. 2372--2381)."},{"key":"e_1_3_2_1_57_1","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 1637--1646)","author":"Sattler T.","unstructured":"Sattler , T. , Torii , A. , Sivic , J. , Pollefeys , M. , and Taira , H . 2017. Are large-scale 3d models really necessary for accurate visual localization? In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 1637--1646) . Sattler, T., Torii, A., Sivic, J., Pollefeys, M., and Taira, H. 2017. Are large-scale 3d models really necessary for accurate visual localization? In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 
1637--1646)."},{"key":"e_1_3_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.1145\/358669.358692"},{"key":"e_1_3_2_1_59_1","volume-title":"CVPR 2011 (pp. 2969--2976)","author":"Kneip L.","unstructured":"Kneip , L. , Scaramuzza , D. , and Siegwart , R . 2011. A novel parametrization of the perspective-three-point problem for a direct computation of absolute camera position and orientation . In CVPR 2011 (pp. 2969--2976) . Kneip, L., Scaramuzza, D., and Siegwart, R. 2011. A novel parametrization of the perspective-three-point problem for a direct computation of absolute camera position and orientation. In CVPR 2011 (pp. 2969--2976)."},{"key":"e_1_3_2_1_60_1","doi-asserted-by":"crossref","unstructured":"Sattler T. Leibe B. and Kobbelt L. 2016. Efficient & effective prioritized matching for large-scale image-based localization. IEEE transactions on pattern analysis and machine intelligence 39(9) 1744--1756.  Sattler T. Leibe B. and Kobbelt L. 2016. Efficient & effective prioritized matching for large-scale image-based localization. IEEE transactions on pattern analysis and machine intelligence 39(9) 1744--1756.","DOI":"10.1109\/TPAMI.2016.2611662"},{"key":"e_1_3_2_1_61_1","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 4545--4553)","author":"Camposeco F.","unstructured":"Camposeco , F. , Sattler , T. , Cohen , A. , and Pollefeys , M . 2017. Toroidal constraints for two-point localization under high outlier ratios . In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 4545--4553) . Camposeco, F., Sattler, T., Cohen, A., and Pollefeys, M. 2017. Toroidal constraints for two-point localization under high outlier ratios. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 4545--4553)."},{"key":"e_1_3_2_1_62_1","doi-asserted-by":"crossref","unstructured":"Sv\u00e4rm L. Enqvist O. and Oskarsson M. 2016. 
City-scale localization for cameras with known vertical direction. IEEE transactions on pattern analysis and machine intelligence 39(7) 1455--1461.  Sv\u00e4rm L. Enqvist O. and Oskarsson M. 2016. City-scale localization for cameras with known vertical direction. IEEE transactions on pattern analysis and machine intelligence 39(7) 1455--1461.","DOI":"10.1109\/TPAMI.2016.2598331"},{"key":"e_1_3_2_1_63_1","volume-title":"Proceedings of the IEEE international conference on computer vision (pp. 2808--2815)","author":"Zeisl B.","unstructured":"Zeisl , B. , Koser , K. , and Pollefeys , M . 2013. Automatic registration of RGB-D scans via salient directions . In Proceedings of the IEEE international conference on computer vision (pp. 2808--2815) . Zeisl, B., Koser, K., and Pollefeys, M. 2013. Automatic registration of RGB-D scans via salient directions. In Proceedings of the IEEE international conference on computer vision (pp. 2808--2815)."},{"key":"e_1_3_2_1_64_1","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).","author":"Irschara A.","unstructured":"Irschara , A. , Zach , C. , Frahm , J.-m. , and Bischof , H ., 2009. From Structure-from-Motion Point Clouds to Fast Location Recognition . In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Irschara, A., Zach, C., Frahm, J.-m., and Bischof, H., 2009. From Structure-from-Motion Point Clouds to Fast Location Recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR)."},{"key":"e_1_3_2_1_65_1","volume-title":"Proceedings of the IEEE International Conference on Computer Vision (ICCV), 667--674","author":"Sattler T.","unstructured":"Sattler , T. , Leibe , B. , and Kobbelt , L ., 2011. Fast image-based localization using direct 2D-to-3D matching . Proceedings of the IEEE International Conference on Computer Vision (ICCV), 667--674 . Sattler, T., Leibe, B., and Kobbelt, L., 2011. 
Fast image-based localization using direct 2D-to-3D matching. Proceedings of the IEEE International Conference on Computer Vision (ICCV), 667--674."},{"key":"e_1_3_2_1_66_1","volume-title":"Proceedings of the IEEE International Conference on Computer Vision (ICCV).","volume":"2106","author":"Sattler T.","unstructured":"Sattler, T., Havlena, M., Radenovi\u0107, F., Schindler, K., and Pollefeys, M., 2015. Hyperpoints and fine vocabularies for large-scale location recognition. In: Proceedings of the IEEE International Conference on Computer Vision (ICCV). Vol. 11-18-Dece. pp. 2102--2106."},{"key":"e_1_3_2_1_67_1","first-page":"327","volume-title":"IEEE International Conference on Consumer Electronics Berlin (ICCE-Berlin).","author":"Heisterklaus I.","unstructured":"Heisterklaus, I., Qian, N., and Miller, A., 2014. Image-based pose estimation using a compact 3D model. In: IEEE International Conference on Consumer Electronics Berlin (ICCE-Berlin). pp. 327--330."},{"key":"e_1_3_2_1_68_1","first-page":"516","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).","author":"Donoser M.","unstructured":"Donoser, M., and Schmalstieg, D., 2014. Discriminative feature-to-point matching in image-based localization.
In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). pp. 516--523."},{"key":"e_1_3_2_1_69_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIP.2015.2500030"},{"key":"e_1_3_2_1_70_1","volume-title":"Proceedings of the IEEE International Conference on Computer Vision (pp. 2129--2137)","author":"Cohen A.","unstructured":"Cohen, A., Sattler, T., and Pollefeys, M. 2015. Merging the unmatchable: Stitching visually disconnected sfm models. In Proceedings of the IEEE International Conference on Computer Vision (pp. 2129--2137)."},{"key":"e_1_3_2_1_71_1","volume-title":"European Conference on Computer Vision (pp. 285--300)","author":"Cohen A.","unstructured":"Cohen, A., Sch\u00f6nberger, J. L., Speciale, P., Sattler, T., Frahm, J. M., and Pollefeys, M. 2016. Indoor-outdoor 3d reconstruction alignment. In European Conference on Computer Vision (pp. 285--300)."},{"key":"e_1_3_2_1_72_1","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 1722--1731)","author":"Yu F.","unstructured":"Yu, F., Xiao, J., and Funkhouser, T. 2015. Semantic alignment of LiDAR data at city scale. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp.
1722--1731)."},{"key":"e_1_3_2_1_73_1","doi-asserted-by":"publisher","DOI":"10.1177\/0278364915596589"},{"key":"e_1_3_2_1_74_1","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 1482--1491)","author":"Schonberger J. L.","unstructured":"Schonberger, J. L., Hardmeier, H., Sattler, T., and Pollefeys, M. 2017. Comparative evaluation of hand-crafted and learned local features. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 1482--1491)."},{"key":"e_1_3_2_1_75_1","doi-asserted-by":"crossref","unstructured":"Hornung A. Wurm K. M. Bennewitz M. Stachniss C. and Burgard W. 2013. OctoMap: An efficient probabilistic 3D mapping framework based on octrees. Autonomous robots 34(3) 189--206.","DOI":"10.1007\/s10514-012-9321-0"},{"key":"e_1_3_2_1_76_1","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 6896--6906)","author":"Sch\u00f6nberger J. L.","unstructured":"Sch\u00f6nberger, J. L., Pollefeys, M., Geiger, A., and Sattler, T. 2018. Semantic visual localization. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 6896--6906)."},{"key":"e_1_3_2_1_77_1","volume-title":"Proceedings of the European Conference on Computer Vision (ECCV) (pp.
383--399)","author":"Toft C.","unstructured":"Toft, C., Stenborg, E., Hammarstrand, L., Brynte, L., Pollefeys, M., Sattler, T., and Kahl, F. 2018. Semantic match consistency for long-term visual localization. In Proceedings of the European Conference on Computer Vision (ECCV) (pp. 383--399)."},{"key":"e_1_3_2_1_78_1","doi-asserted-by":"crossref","unstructured":"Geppert M. Liu P. Cui Z. Pollefeys M. and Sattler T. 2018. Efficient 2D-3D Matching for Multi-Camera Visual Localization. arXiv preprint arXiv:1809.06445.","DOI":"10.1109\/ICRA.2019.8794280"},{"key":"e_1_3_2_1_79_1","volume-title":"2015 IEEE International Conference on Robotics and Automation (ICRA) (pp. 6374--6379)","author":"Li S.","unstructured":"Li, S., and Calway, A. 2015. RGBD relocalisation using pairwise geometry and concise key point sets. In 2015 IEEE International Conference on Robotics and Automation (ICRA) (pp. 6374--6379)."},{"key":"e_1_3_2_1_80_1","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 4654--4662)","author":"Brachmann E.","unstructured":"Brachmann, E., and Rother, C. 2018.
Learning less is more-6d camera localization via 3d surface regression. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 4654--4662)."},{"key":"e_1_3_2_1_81_1","doi-asserted-by":"crossref","unstructured":"Nadeem U. Jalwana M. A. Bennamoun M. Togneri R. and Sohel F. 2019. Direct Image to Point Cloud Descriptors Matching for 6-DOF Camera Localization in Dense 3D Point Cloud. arXiv preprint arXiv:1906.06064.","DOI":"10.1007\/978-3-030-36711-4_20"}],"event":{"name":"RICAI 2019: 2019 International Conference on Robotics, Intelligent Control and Artificial Intelligence","location":"Shanghai China","acronym":"RICAI 2019"},"container-title":["Proceedings of the 2019 International Conference on Robotics, Intelligent Control and Artificial Intelligence"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3366194.3366211","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3366194.3366211","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T23:13:40Z","timestamp":1750202020000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3366194.3366211"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,9,20]]},"references-count":80,"alternative-id":["10.1145\/3366194.3366211","10.1145\/3366194"],"URL":"https:\/\/doi.org\/10.1145\/3366194.3366211","relation":{},"subject":[],"published":{"date-parts":[[2019,9,20]]},"assertion":[{"value":"2019-09-20","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}