{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,6]],"date-time":"2026-05-06T15:07:37Z","timestamp":1778080057129,"version":"3.51.4"},"reference-count":139,"publisher":"Springer Science and Business Media LLC","issue":"7","license":[{"start":{"date-parts":[[2021,5,7]],"date-time":"2021-05-07T00:00:00Z","timestamp":1620345600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2021,5,7]],"date-time":"2021-05-07T00:00:00Z","timestamp":1620345600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100000266","name":"Engineering and Physical Sciences Research Council","doi-asserted-by":"publisher","award":["EP\/R02572X\/1"],"award-info":[{"award-number":["EP\/R02572X\/1"]}],"id":[{"id":"10.13039\/501100000266","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100000266","name":"Engineering and Physical Sciences Research Council","doi-asserted-by":"publisher","award":["EP\/P017487\/1"],"award-info":[{"award-number":["EP\/P017487\/1"]}],"id":[{"id":"10.13039\/501100000266","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100000266","name":"Engineering and Physical Sciences Research Council","doi-asserted-by":"publisher","award":["EP\/R026173\/1"],"award-info":[{"award-number":["EP\/R026173\/1"]}],"id":[{"id":"10.13039\/501100000266","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Int J Comput Vis"],"published-print":{"date-parts":[[2021,7]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Visual place recognition (VPR) is the process of recognising a previously visited place using visual information, often under varying appearance conditions and viewpoint changes and with computational constraints. VPR is related to the concepts of localisation, loop closure, image retrieval and is a critical component of many autonomous navigation systems ranging from autonomous vehicles to drones and computer vision systems. While the concept of place recognition has been around for many years, VPR research has grown rapidly as a field over the past decade due to improving camera hardware and its potential for deep learning-based techniques, and has become a widely studied topic in both the computer vision and robotics communities. This growth however has led to fragmentation and a lack of standardisation in the field, especially concerning performance evaluation. Moreover, the notion of viewpoint and illumination invariance of VPR techniques has largely been assessed qualitatively and hence ambiguously in the past. In this paper, we address these gaps through a new comprehensive open-source framework for assessing the performance of VPR techniques, dubbed \u201cVPR-Bench\u201d. VPR-Bench (Open-sourced at: <jats:ext-link xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" ext-link-type=\"uri\" xlink:href=\"https:\/\/github.com\/MubarizZaffar\/VPR-Bench\">https:\/\/github.com\/MubarizZaffar\/VPR-Bench<\/jats:ext-link>) introduces two much-needed capabilities for VPR researchers: firstly, it contains a benchmark of 12 fully-integrated datasets and 10 VPR techniques, and secondly, it integrates a comprehensive variation-quantified dataset for quantifying viewpoint and illumination invariance. We apply and analyse popular evaluation metrics for VPR from both the computer vision and robotics communities, and discuss how these different metrics complement and\/or replace each other, depending upon the underlying applications and system requirements. Our analysis reveals that no universal SOTA VPR technique exists, since: (a) state-of-the-art (SOTA) performance is achieved by 8 out of the 10 techniques on at least one dataset, (b) SOTA technique in one community does not necessarily yield SOTA performance in the other given the differences in datasets and metrics. Furthermore, we identify key open challenges since: (c) all 10 techniques suffer greatly in perceptually-aliased and less-structured environments, (d) all techniques suffer from viewpoint variance where lateral change has less effect than 3D change, and (e) directional illumination change has more adverse effects on matching confidence than uniform illumination change. We also present detailed meta-analyses regarding the roles of varying ground-truths, platforms, application requirements and technique parameters. Finally, VPR-Bench provides a unified implementation to deploy these VPR techniques, metrics and datasets, and is extensible through templates.<\/jats:p>","DOI":"10.1007\/s11263-021-01469-5","type":"journal-article","created":{"date-parts":[[2021,5,7]],"date-time":"2021-05-07T04:02:53Z","timestamp":1620360173000},"page":"2136-2174","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":124,"title":["VPR-Bench: An Open-Source Visual Place Recognition Evaluation Framework with Quantifiable Viewpoint and Appearance Change"],"prefix":"10.1007","volume":"129","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-9368-2391","authenticated-orcid":false,"given":"Mubariz","family":"Zaffar","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Sourav","family":"Garg","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Michael","family":"Milford","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Julian","family":"Kooij","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"David","family":"Flynn","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Klaus","family":"McDonald-Maier","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Shoaib","family":"Ehsan","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2021,5,7]]},"reference":[{"issue":"1","key":"1469_CR1","doi-asserted-by":"publisher","first-page":"18","DOI":"10.1007\/s11263-011-0473-8","volume":"97","author":"H Aan\u00e6s","year":"2012","unstructured":"Aan\u00e6s, H., Dahl, A. L., & Pedersen, K. S. (2012). Interesting interest points. International Journal of Computer Vision, 97(1), 18\u201335.","journal-title":"International Journal of Computer Vision"},{"issue":"10","key":"1469_CR2","doi-asserted-by":"publisher","first-page":"105","DOI":"10.1145\/2001269.2001293","volume":"54","author":"S Agarwal","year":"2011","unstructured":"Agarwal, S., Furukawa, Y., Snavely, N., Simon, I., Curless, B., Seitz, S. M., et al. (2011). Building Rome in a day. Communications of the ACM, 54(10), 105\u2013112.","journal-title":"Communications of the ACM"},{"key":"1469_CR3","doi-asserted-by":"crossref","unstructured":"Agrawal, M., Konolige, K., & Blas, M. R. (2008). Censure: Center surround extremas for realtime feature detection and matching. In European conference on computer vision (pp. 102\u2013115). Springer.","DOI":"10.1007\/978-3-540-88693-8_8"},{"issue":"8","key":"1469_CR4","doi-asserted-by":"publisher","first-page":"36","DOI":"10.1016\/S1474-6670(17)31947-X","volume":"37","author":"H Andreasson","year":"2004","unstructured":"Andreasson, H., & Duckett, T. (2004). Topological localization for mobile robots using omni-directional vision and local features. IFAC Proceedings Volumes, 37(8), 36\u201341.","journal-title":"IFAC Proceedings Volumes"},{"key":"1469_CR5","doi-asserted-by":"crossref","unstructured":"Angeli, A., Doncieux, S., Meyer, J. A., & Filliat, D. (2008). Incremental vision-based topological slam. In IROS (pp. 1031\u20131036) IEEE.","DOI":"10.1109\/IROS.2008.4650675"},{"key":"1469_CR6","doi-asserted-by":"crossref","unstructured":"Arandjelovi\u0107, R., & Zisserman, A. (2014a). Dislocation: Scalable descriptor distinctiveness for location recognition. In Asian conference on computer vision (pp. 188\u2013204). Springer.","DOI":"10.1007\/978-3-319-16817-3_13"},{"key":"1469_CR7","doi-asserted-by":"crossref","unstructured":"Arandjelovi\u0107, R., & Zisserman, A. (2014b). Visual vocabulary with a semantic twist. In Asian conference on computer vision (pp. 178\u2013195). Springer.","DOI":"10.1007\/978-3-319-16865-4_12"},{"key":"1469_CR8","doi-asserted-by":"crossref","unstructured":"Arandjelovic, R., Gronat, P., Torii, A., Pajdla, T., & Sivic, J. (2016). NetVLAD: CNN architecture for weakly supervised place recognition. In CVPR (pp. 5297\u20135307).","DOI":"10.1109\/CVPR.2016.572"},{"key":"1469_CR9","doi-asserted-by":"crossref","unstructured":"Babenko, A., Slesarev, A., Chigorin, A., & Lempitsky, V. (2014). Neural codes for image retrieval. In European conference on computer vision (pp. 584\u2013599). Springer","DOI":"10.1007\/978-3-319-10590-1_38"},{"key":"1469_CR10","doi-asserted-by":"crossref","unstructured":"Badino, H., Huber, D., & Kanade, T. (2012). Real-time topometric localization. In ICRA (pp. 1635\u20131642). IEEE.","DOI":"10.1109\/ICRA.2012.6224716"},{"key":"1469_CR11","doi-asserted-by":"crossref","unstructured":"Bay, H., Tuytelaars, T., & Van\u00a0Gool, L. (2006). Surf: Speeded up robust features. In ECCV (pp. 404\u2013417). Springer.","DOI":"10.1007\/11744023_32"},{"issue":"6","key":"1469_CR12","doi-asserted-by":"publisher","first-page":"1309","DOI":"10.1109\/TRO.2016.2624754","volume":"32","author":"C Cadena","year":"2016","unstructured":"Cadena, C., Carlone, L., Carrillo, H., Latif, Y., Scaramuzza, D., Neira, J., et al. (2016). Past, present, and future of simultaneous localization and mapping: Toward the robust-perception age. IEEE T-RO, 32(6), 1309\u20131332.","journal-title":"IEEE T-RO"},{"issue":"7","key":"1469_CR13","doi-asserted-by":"publisher","first-page":"1281","DOI":"10.1109\/TPAMI.2011.222","volume":"34","author":"M Calonder","year":"2011","unstructured":"Calonder, M., Lepetit, V., Ozuysal, M., Trzcinski, T., Strecha, C., & Fua, P. (2011). Brief: Computing a local binary descriptor very fast. IEEE T-PAMI, 34(7), 1281\u20131298.","journal-title":"IEEE T-PAMI"},{"key":"1469_CR14","doi-asserted-by":"crossref","unstructured":"Camara, L. G., G\u00e4bert, C., & Preucil, L. (2019). Highly robust visual place recognition through spatial matching of CNN features. ResearchGate Preprint.","DOI":"10.1109\/ICRA40945.2020.9196967"},{"key":"1469_CR15","doi-asserted-by":"crossref","unstructured":"Camara, L. G., & P\u0159eu\u010dil, L. (2019). Spatio-semantic convnet-based visual place recognition. In 2019 European conference on mobile robots (ECMR) (pp. 1\u20138). IEEE.","DOI":"10.1109\/ECMR.2019.8870948"},{"key":"1469_CR16","doi-asserted-by":"crossref","unstructured":"Cao, B., Araujo, A., & Sim, J. (2020). Unifying deep local and global features for image search. arXiv:2001.05027","DOI":"10.1007\/978-3-030-58565-5_43"},{"issue":"2","key":"1469_CR17","doi-asserted-by":"publisher","first-page":"993","DOI":"10.1109\/LRA.2020.2967324","volume":"5","author":"M Chanc\u00e1n","year":"2020","unstructured":"Chanc\u00e1n, M., Hernandez-Nunez, L., Narendra, A., Barron, A. B., & Milford, M. (2020). A hybrid compact neural architecture for visual place recognition. IEEE Robotics and Automation Letters, 5(2), 993\u20131000.","journal-title":"IEEE Robotics and Automation Letters"},{"key":"1469_CR18","doi-asserted-by":"crossref","unstructured":"Chen, D. M., Baatz, G., K\u00f6ser, K., Tsai, S. S., Vedantham, R., Pylv\u00e4n\u00e4inen, T., Roimela, K., Chen, X., Bach, J., Pollefeys, M., et\u00a0al. (2011). City-scale landmark identification on mobile devices. In CVPR 2011 (pp. 737\u2013744).","DOI":"10.1109\/CVPR.2011.5995610"},{"key":"1469_CR19","doi-asserted-by":"crossref","unstructured":"Chen, Z., Jacobson, A., Erdem, U. M., Hasselmo, M. E., & Milford, M. (2014a). Multi-scale bio-inspired place recognition. In 2014 IEEE international conference on robotics and automation (ICRA). IEEE","DOI":"10.1109\/ICRA.2014.6907109"},{"key":"1469_CR20","unstructured":"Chen, Z., Lam, O., Jacobson, A., & Milford, M. (2014b). Convolutional neural network-based place recognition. preprint arXiv:1411.1509."},{"key":"1469_CR21","doi-asserted-by":"crossref","unstructured":"Chen, Z., Maffra, F., Sa, I., & Chli, M. (2017a). Only look once, mining distinctive landmarks from convnet for visual place recognition. In IROS (pp. 9\u201316). IEEE.","DOI":"10.1109\/IROS.2017.8202131"},{"issue":"4","key":"1469_CR22","doi-asserted-by":"publisher","first-page":"4015","DOI":"10.1109\/LRA.2018.2859916","volume":"3","author":"Z Chen","year":"2018","unstructured":"Chen, Z., Liu, L., Sa, I., Ge, Z., & Chli, M. (2018). Learning context flexible attention model for long-term visual place recognition. IEEE Robotics and Automation Letters, 3(4), 4015\u20134022.","journal-title":"IEEE Robotics and Automation Letters"},{"key":"1469_CR23","doi-asserted-by":"crossref","unstructured":"Chen, Z., et\u00a0al. (2017b). Deep learning features at scale for visual place recognition. In ICRA (pp. 3223\u20133230). IEEE.","DOI":"10.1109\/ICRA.2017.7989366"},{"key":"1469_CR24","unstructured":"Ch\u00e9ron, C. T. E. (2018). An evaluation of features for pose estimation and its application to free viewpoint video. PhD thesis, Trinity College."},{"key":"1469_CR25","doi-asserted-by":"crossref","unstructured":"Cieslewski, T., & Scaramuzza, D. (2017). Efficient decentralized visual place recognition from full-image descriptors. In 2017 International symposium on multi-robot and multi-agent systems (MRS) (pp. 78\u201382). IEEE.","DOI":"10.1109\/MRS.2017.8250934"},{"key":"1469_CR26","doi-asserted-by":"crossref","unstructured":"Cieslewski, T., Choudhary, S., & Scaramuzza, D. (2018). Data-efficient decentralized visual slam. In 2018 IEEE international conference on robotics and automation (ICRA) (pp. 2466\u20132473). IEEE.","DOI":"10.1109\/ICRA.2018.8461155"},{"issue":"9","key":"1469_CR27","first-page":"1100","volume":"30","author":"M Cummins","year":"2011","unstructured":"Cummins, M., & Newman, P. (2011). Appearance-only slam at large scale with fab-map 2.0. IJRR, 30(9), 1100\u20131123.","journal-title":"IJRR"},{"issue":"6","key":"1469_CR28","doi-asserted-by":"publisher","first-page":"1052","DOI":"10.1109\/TPAMI.2007.1049","volume":"29","author":"AJ Davison","year":"2007","unstructured":"Davison, A. J., Reid, I. D., Molton, N. D., & Stasse, O. (2007). MonoSLAM: Real-time single camera slam. IEEE Transactions on Pattern analysis and Machine Intelligence, 29(6), 1052\u20131067.","journal-title":"IEEE Transactions on Pattern analysis and Machine Intelligence"},{"key":"1469_CR29","doi-asserted-by":"crossref","unstructured":"Demir, M., & Bozma, H. I. (2018). Automated place detection based on coherent segments. In 2018 IEEE 12th international conference on semantic computing (ICSC) (pp. 71\u201376). IEEE.","DOI":"10.1109\/ICSC.2018.00019"},{"key":"1469_CR30","doi-asserted-by":"crossref","unstructured":"DeTone, D., Malisiewicz, T., & Rabinovich, A. (2018). Superpoint: Self-supervised interest point detection and description. In CVPR workshops (pp. 224\u2013236).","DOI":"10.1109\/CVPRW.2018.00060"},{"key":"1469_CR31","doi-asserted-by":"crossref","unstructured":"Dusmanu, M., et\u00a0al. (2019). D2-net: A trainable CNN for joint description and detection of local features. In CVPR (pp. 8092\u20138101).","DOI":"10.1109\/CVPR.2019.00828"},{"issue":"2","key":"1469_CR32","doi-asserted-by":"publisher","first-page":"1688","DOI":"10.1109\/LRA.2020.2969197","volume":"5","author":"B Ferrarini","year":"2020","unstructured":"Ferrarini, B., Waheed, M., Waheed, S., Ehsan, S., Milford, M. J., & McDonald-Maier, K. D. (2020). Exploring performance bounds of visual place recognition using extended precision. IEEE Robotics and Automation Letters, 5(2), 1688\u20131695.","journal-title":"IEEE Robotics and Automation Letters"},{"key":"1469_CR33","doi-asserted-by":"crossref","unstructured":"Filliat, D. (2007). A visual bag of words method for interactive qualitative localization and mapping. In ICRA (pp. 3921\u20133926). IEEE.","DOI":"10.1109\/ROBOT.2007.364080"},{"key":"1469_CR34","doi-asserted-by":"crossref","unstructured":"Fraundorfer, F., Engels, C., & Nist\u00e9r, D. (2007). Topological mapping, localization and navigation using image collections. In 2007 IEEE\/RSJ international conference on intelligent robots and systems (pp. 3872\u20133877). IEEE.","DOI":"10.1109\/IROS.2007.4399123"},{"issue":"6","key":"1469_CR35","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3130800.3130891","volume":"36","author":"MA Gardner","year":"2017","unstructured":"Gardner, M. A., Sunkavalli, K., Yumer, E., Shen, X., Gambaretto, E., Gagn\u00e9, C., et al. (2017). Learning to predict indoor illumination from a single image. ACM Transactions on Graphics (TOG), 36(6), 1\u201314.","journal-title":"ACM Transactions on Graphics (TOG)"},{"key":"1469_CR36","unstructured":"Garg, S., Fischer, T., & Milford, M. (2021). Where is your place, visual place recognition? arXiv preprint arXiv:2103.06443."},{"key":"1469_CR37","doi-asserted-by":"crossref","unstructured":"Garg, S., Suenderhauf, N., & Milford, M. (2018a). Don\u2019t look back: Robustifying place categorization for viewpoint- and condition-invariant place recognition. In IEEE international conference on robotics and automation (ICRA).","DOI":"10.1109\/ICRA.2018.8461051"},{"key":"1469_CR38","doi-asserted-by":"crossref","unstructured":"Garg, S., Suenderhauf, N., & Milford, M. (2018b). Lost? appearance-invariant place recognition for opposite viewpoints using visual semantics. In Proceedings of robotics: Science and systems XIV.","DOI":"10.15607\/RSS.2018.XIV.022"},{"issue":"1\u20132","key":"1469_CR39","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1561\/2300000059","volume":"8","author":"S Garg","year":"2020","unstructured":"Garg, S., S\u00fcnderhauf, N., Dayoub, F., Morrison, D., Cosgun, A., Carneiro, G., et al. (2020). Semantics for robotic mapping, perception and interaction: A survey. Found Trends Robot, 8(1\u20132), 1\u2013224. https:\/\/doi.org\/10.1561\/2300000059.","journal-title":"Found Trends Robot"},{"key":"1469_CR40","doi-asserted-by":"crossref","unstructured":"Girdhar, Y., & Dudek, G. (2010). Online navigation summaries. In 2010 IEEE international conference on robotics and automation (pp 5035\u20135040). IEEE.","DOI":"10.1109\/ROBOT.2010.5509464"},{"key":"1469_CR41","doi-asserted-by":"publisher","unstructured":"Glover, A. (2014). Day and night, left and right. https:\/\/doi.org\/10.5281\/zenodo.4590133","DOI":"10.5281\/zenodo.4590133"},{"key":"1469_CR42","doi-asserted-by":"crossref","unstructured":"Gordo, A., Almaz\u00e1n, J., Revaud, J., & Larlus, D. (2016). Deep image retrieval: Learning global representations for image search. In European conference on computer vision. (pp 241\u2013257). Springer.","DOI":"10.1007\/978-3-319-46466-4_15"},{"issue":"2","key":"1469_CR43","doi-asserted-by":"publisher","first-page":"237","DOI":"10.1007\/s11263-017-1016-8","volume":"124","author":"A Gordo","year":"2017","unstructured":"Gordo, A., Almazan, J., Revaud, J., & Larlus, D. (2017). End-to-end learning of deep visual representations for image retrieval. International Journal of Computer Vision, 124(2), 237\u2013254.","journal-title":"International Journal of Computer Vision"},{"issue":"2","key":"1469_CR44","doi-asserted-by":"publisher","first-page":"1924","DOI":"10.1109\/LRA.2019.2898427","volume":"4","author":"S Hausler","year":"2019","unstructured":"Hausler, S., Jacobson, A., & Milford, M. (2019). Multi-process fusion: Visual place recognition using multiple image processing methods. IEEE Robotics and Automation Letters, 4(2), 1924\u20131931.","journal-title":"IEEE Robotics and Automation Letters"},{"issue":"3","key":"1469_CR45","doi-asserted-by":"publisher","first-page":"261","DOI":"10.1007\/s11263-006-0020-1","volume":"74","author":"KL Ho","year":"2007","unstructured":"Ho, K. L., & Newman, P. (2007). Detecting loop closure with scene sequences. IJCV, 74(3), 261\u2013286.","journal-title":"IJCV"},{"key":"1469_CR46","doi-asserted-by":"crossref","unstructured":"Hold-Geoffroy, Y., Sunkavalli, K., Hadap, S., Gambaretto, E., & Lalonde, J. F. (2017). Deep outdoor illumination estimation. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 7312\u20137321).","DOI":"10.1109\/CVPR.2017.255"},{"issue":"3\u20134","key":"1469_CR47","doi-asserted-by":"publisher","first-page":"505","DOI":"10.1007\/s10846-017-0735-y","volume":"92","author":"Y Hou","year":"2018","unstructured":"Hou, Y., Zhang, H., & Zhou, S. (2018). Evaluation of object proposals and convnet features for landmark-based visual place recognition. Journal of Intelligent & Robotic Systems, 92(3\u20134), 505\u2013520.","journal-title":"Journal of Intelligent & Robotic Systems"},{"key":"1469_CR48","doi-asserted-by":"crossref","unstructured":"Jegou, H., Douze, M., & Schmid, C. (2008). Hamming embedding and weak geometric consistency for large scale image search. In European conference on computer vision (pp. 304\u2013317). Springer.","DOI":"10.1007\/978-3-540-88682-2_24"},{"key":"1469_CR49","doi-asserted-by":"crossref","unstructured":"J\u00e9gou, H., Douze, M., Schmid, C., & P\u00e9rez, P. (2010). Aggregating local descriptors into a compact image representation. In CVPR (pp. 3304\u20133311). IEEE Computer Society.","DOI":"10.1109\/CVPR.2010.5540039"},{"key":"1469_CR50","doi-asserted-by":"crossref","unstructured":"Jenicek, T., & Chum, O. (2019). No fear of the dark: Image retrieval under varying illumination conditions. In Proceedings of the IEEE international conference on computer vision (pp. 9696\u20139704).","DOI":"10.1109\/ICCV.2019.00979"},{"key":"1469_CR51","doi-asserted-by":"crossref","unstructured":"Jin, Y., Mishkin, D., Mishchuk, A., Matas, J., Fua, P., Yi, K. M., & Trulls, E. (2020). Image matching across wide baselines: From paper to practice. arXiv preprint arXiv:2003.01587.","DOI":"10.1007\/s11263-020-01385-0"},{"key":"1469_CR52","doi-asserted-by":"crossref","unstructured":"Johns, E., & Yang, G. Z. (2011). From images to scenes: Compressing an image cluster into a single scene model for place recognition. In 2011 International conference on computer vision (pp 874\u2013881). IEEE.","DOI":"10.1109\/ICCV.2011.6126328"},{"key":"1469_CR53","doi-asserted-by":"crossref","unstructured":"Khaliq, A., Ehsan, S., Chen, Z., Milford, M., & McDonald-Maier, K. (2019). A holistic visual place recognition approach using lightweight CNNs for significant viewpoint and appearance changes. IEEE Transactions on Robotics.","DOI":"10.1109\/TRO.2019.2956352"},{"issue":"5","key":"1469_CR54","doi-asserted-by":"publisher","first-page":"1066","DOI":"10.1109\/TRO.2008.2004832","volume":"24","author":"K Konolige","year":"2008","unstructured":"Konolige, K., & Agrawal, M. (2008). FrameSLAM: From bundle adjustment to real-time visual mapping. IEEE Transactions on Robotics, 24(5), 1066\u20131077.","journal-title":"IEEE Transactions on Robotics"},{"key":"1469_CR55","doi-asserted-by":"crossref","unstructured":"Kopitkov, D., & Indelman, V. (2018). Bayesian information recovery from cnn for probabilistic inference. In 2018 IEEE\/RSJ international conference on intelligent robots and systems (IROS) (pp. 7795\u20137802). IEEE.","DOI":"10.1109\/IROS.2018.8594506"},{"issue":"1","key":"1469_CR56","doi-asserted-by":"publisher","first-page":"27","DOI":"10.1016\/j.robot.2005.03.008","volume":"52","author":"J Ko\u0161eck\u00e1","year":"2005","unstructured":"Ko\u0161eck\u00e1, J., Li, F., & Yang, X. (2005). Global localization and relative positioning based on scale-invariant keypoints. Robotics and Autonomous Systems, 52(1), 27\u201338.","journal-title":"Robotics and Autonomous Systems"},{"key":"1469_CR57","first-page":"86","volume":"66","author":"I Kostavelis","year":"2015","unstructured":"Kostavelis, I., & Gasteratos, A. (2015). Semantic mapping for mobile robotics tasks: A survey. RAS, 66, 86\u2013103.","journal-title":"RAS"},{"key":"1469_CR58","unstructured":"Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2012). Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems (pp. 1097\u20131105)."},{"key":"1469_CR59","doi-asserted-by":"crossref","unstructured":"Larsson, M., Stenborg, E., Hammarstrand, L., Pollefeys, M., Sattler, T., & Kahl, F. (2019). A cross-season correspondence dataset for robust semantic segmentation. In CVPR (pp. 9532\u20139542).","DOI":"10.1109\/CVPR.2019.00976"},{"key":"1469_CR60","doi-asserted-by":"crossref","unstructured":"Lategahn, H., Beck, J., Kitt, B., & Stiller, C. (2013). How to learn an illumination robust image feature for place recognition. In 2013 IEEE intelligent vehicles symposium (IV) (pp. 285\u2013291). IEEE.","DOI":"10.1109\/IVS.2013.6629483"},{"issue":"2","key":"1469_CR61","doi-asserted-by":"publisher","first-page":"91","DOI":"10.1023\/B:VISI.0000029664.99615.94","volume":"60","author":"DG Lowe","year":"2004","unstructured":"Lowe, D. G. (2004). Distinctive image features from scale-invariant keypoints. IJCV, Springer, 60(2), 91\u2013110.","journal-title":"IJCV, Springer"},{"issue":"1","key":"1469_CR62","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1109\/TRO.2015.2496823","volume":"32","author":"S Lowry","year":"2015","unstructured":"Lowry, S., S\u00fcnderhauf, N., Newman, P., Leonard, J. J., Cox, D., Corke, P., et al. (2015). Visual place recognition: A survey. IEEE Transactions on Robotics, 32(1), 1\u201319.","journal-title":"IEEE Transactions on Robotics"},{"issue":"4","key":"1469_CR63","first-page":"429","volume":"31","author":"W Maddern","year":"2012","unstructured":"Maddern, W., Milford, M., & Wyeth, G. (2012). CAT-SLAM: Probabilistic localisation and mapping using a continuous appearance-based trajectory. IJRR, 31(4), 429\u2013451.","journal-title":"IJRR"},{"issue":"1","key":"1469_CR64","doi-asserted-by":"publisher","first-page":"3","DOI":"10.1177\/0278364916679498","volume":"36","author":"W Maddern","year":"2017","unstructured":"Maddern, W., Pascoe, G., Linegar, C., & Newman, P. (2017). 1 year, 1000 km: The oxford robotcar dataset. The International Journal of Robotics Research, 36(1), 3\u201315.","journal-title":"The International Journal of Robotics Research"},{"key":"1469_CR65","doi-asserted-by":"publisher","first-page":"19516","DOI":"10.1109\/ACCESS.2021.3054937","volume":"9","author":"C Masone","year":"2021","unstructured":"Masone, C., & Caputo, B. (2021). A survey on deep visual place recognition. IEEE Access, 9, 19516\u201319547.","journal-title":"IEEE Access"},{"key":"1469_CR66","doi-asserted-by":"crossref","unstructured":"McManus, C., Upcroft, B., & Newmann, P. (2014). Scene signatures: Localised and point-less features for localisation. In Robotics, science and systems conference.","DOI":"10.15607\/RSS.2014.X.023"},{"key":"1469_CR67","doi-asserted-by":"crossref","unstructured":"Mei, C., Sibley, G., Cummins, M., Newman, P., & Reid, I. (2009). A constant-time efficient stereo slam system. In Proceedings of the British machine vision conference (Vol. 1). BMVA Press","DOI":"10.5244\/C.23.54"},{"key":"1469_CR68","doi-asserted-by":"crossref","unstructured":"Merrill, N., & Huang, G. (2018). Lightweight unsupervised deep loop closure. Robotics Science and Systems Conference. arXiv preprint arXiv:1805.07703.","DOI":"10.15607\/RSS.2018.XIV.032"},{"issue":"7","key":"1469_CR69","doi-asserted-by":"publisher","first-page":"766","DOI":"10.1177\/0278364913490323","volume":"32","author":"M Milford","year":"2013","unstructured":"Milford, M. (2013). Vision-based place recognition: How low can you go? The International Journal of Robotics Research, 32(7), 766\u2013789.","journal-title":"The International Journal of Robotics Research"},{"key":"1469_CR70","doi-asserted-by":"crossref","unstructured":"Milford, M. J., & Wyeth, G. F. (2012). SeqSLAM: Visual route-based navigation for sunny summer days and stormy winter nights. In International conference on robotics and automation (pp. 1643\u20131649). IEEE.","DOI":"10.1109\/ICRA.2012.6224623"},{"key":"1469_CR71","unstructured":"Mishkin, D., Perdoch, M., & Matas, J. (2015). Place recognition with WxBS retrieval. In PCVPR 2015 workshop on visual place recognition in changing environments (Vol. 30)."},{"issue":"4","key":"1469_CR72","doi-asserted-by":"publisher","first-page":"652","DOI":"10.1109\/TVCG.2007.1008","volume":"13","author":"A Mohan","year":"2007","unstructured":"Mohan, A., Bailey, R., Waite, J., Tumblin, J., Grimm, C., & Bodenheimer, B. (2007). Tabletop computed lighting for practical digital photography. IEEE Transactions on Visualization and Computer Graphics, 13(4), 652\u2013662.","journal-title":"IEEE Transactions on Visualization and Computer Graphics"},{"key":"1469_CR73","doi-asserted-by":"crossref","unstructured":"Mount, J., & Milford, M. (2016). 2d visual place recognition for domestic service robots at night. In 2016 IEEE international conference on robotics and automation (ICRA) (pp. 4822\u20134829). IEEE.","DOI":"10.1109\/ICRA.2016.7487686"},{"key":"1469_CR74","doi-asserted-by":"crossref","unstructured":"Mousavian, A., Ko\u0161eck\u00e1, J., & Lien, J. M. (2015). Semantically guided location recognition for outdoors scenes. In 2015 IEEE international conference on robotics and automation (ICRA) (pp. 4882\u20134889). IEEE.","DOI":"10.1109\/ICRA.2015.7139877"},{"key":"1469_CR75","doi-asserted-by":"crossref","unstructured":"Murillo, A. C., & Kosecka, J. (2009). Experiments in place recognition using gist panoramas. In ICCV workshops (pp 2196\u20132203). IEEE.","DOI":"10.1109\/ICCVW.2009.5457552"},{"key":"1469_CR76","doi-asserted-by":"crossref","unstructured":"Murillo, A. C., Guerrero, J. J., & Sagues, C. (2007). Surf features for efficient robot localization with omnidirectional images. In Proceedings of IEEE ICRA (pp. 3901\u20133907).","DOI":"10.1109\/ROBOT.2007.364077"},{"issue":"6","key":"1469_CR77","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/2980179.2980219","volume":"35","author":"L Murmann","year":"2016","unstructured":"Murmann, L., Davis, A., Kautz, J., & Durand, F. (2016). Computational bounce flash for indoor portraits. ACM Transactions on Graphics (TOG), 35(6), 1\u20139.","journal-title":"ACM Transactions on Graphics (TOG)"},{"key":"1469_CR78","doi-asserted-by":"crossref","unstructured":"Murmann, L., Gharbi, M., Aittala, M., & Durand, F. (2019). A multi-illumination dataset of indoor object appearance. In 2019 IEEE international conference on computer vision (ICCV).","DOI":"10.1109\/ICCV.2019.00418"},{"key":"1469_CR79","doi-asserted-by":"crossref","unstructured":"Nardi, L., Bodin, B., Zia, M. Z., Mawer, J., Nisbet, A., Kelly, P. H., Davison, A. J., Luj\u00e1n, M., O\u2019Boyle, M. F., Riley, G., et\u00a0al. (2015). Introducing slambench, a performance and accuracy benchmarking methodology for slam. In 2015 IEEE international conference on robotics and automation (ICRA) (pp. 5783\u20135790). IEEE.","DOI":"10.1109\/ICRA.2015.7140009"},{"key":"1469_CR80","doi-asserted-by":"crossref","unstructured":"Naseer, T., Oliveira, G.L., Brox, T., & Burgard, W. (2017). Semantics-aware visual localization under challenging perceptual conditions. In 2017 IEEE ICRA (pp. 2614\u20132620).","DOI":"10.1109\/ICRA.2017.7989305"},{"key":"1469_CR81","doi-asserted-by":"crossref","unstructured":"Noh, H., Araujo, A., Sim, J., Weyand, T., & Han, B. (2017). Large-scale image retrieval with attentive deep local features. In Proceedings of the IEEE international conference on computer vision (pp. 3456\u20133465).","DOI":"10.1109\/ICCV.2017.374"},{"key":"1469_CR82","doi-asserted-by":"crossref","unstructured":"Odo, A., McKenna, S., Flynn, D., & Vorstius, J. (2020). Towards the automatic visual monitoring of electricity pylons from aerial images. In 15th International joint conference on computer vision, imaging and computer graphics theory and applications 2020 (pp. 566\u2013573). SciTePress.","DOI":"10.5220\/0009345005660573"},{"key":"1469_CR83","doi-asserted-by":"publisher","first-page":"23","DOI":"10.1016\/S0079-6123(06)55002-2","volume":"155","author":"A Oliva","year":"2006","unstructured":"Oliva, A., & Torralba, A. (2006). Building the gist of a scene: The role of global image features in recognition. Progress in Brain Research, 155, 23\u201336.","journal-title":"Progress in Brain Research"},{"key":"1469_CR84","doi-asserted-by":"crossref","unstructured":"Paul, R., Feldman, D., Rus, D., & Newman, P. (2014). Visual precis generation using coresets. In 2014 IEEE international conference on robotics and automation (ICRA) (pp. 1304\u20131311). IEEE.","DOI":"10.1109\/ICRA.2014.6907021"},{"key":"1469_CR85","doi-asserted-by":"crossref","unstructured":"Pepperell, E., Corke, P. I., & Milford, M. J. (2014). All-environment visual place recognition with smart. In 2014 IEEE international conference on robotics and automation (ICRA) (pp. 1612\u20131618). IEEE.","DOI":"10.1109\/ICRA.2014.6907067"},{"key":"1469_CR86","doi-asserted-by":"crossref","unstructured":"Pepperell, E., Corke, P. I., & Milford, M. J. (2015). Automatic image scaling for place recognition in changing environments. In 2015 IEEE international conference on robotics and automation (ICRA) (pp. 1118\u20131124). IEEE.","DOI":"10.1109\/ICRA.2015.7139316"},{"key":"1469_CR87","doi-asserted-by":"crossref","unstructured":"Perronnin, F., Liu, Y., S\u00e1nchez, J., & Poirier, H. (2010). Large-scale image retrieval with compressed fisher vectors. In 2010 IEEE computer society conference on computer vision and pattern recognition (pp. 3384\u20133391). IEEE.","DOI":"10.1109\/CVPR.2010.5540009"},{"key":"1469_CR88","doi-asserted-by":"crossref","unstructured":"Philbin, J., Chum, O., Isard, M., Sivic, J., & Zisserman, A. (2007). Object retrieval with large vocabularies and fast spatial matching. In IEEE conference on computer vision and pattern recognition.","DOI":"10.1109\/CVPR.2007.383172"},{"key":"1469_CR89","doi-asserted-by":"crossref","unstructured":"Philbin, J., Chum, O., Isard, M., Sivic, J., & Zisserman, A. (2008). Lost in quantization: Improving particular object retrieval in large scale image databases. In IEEE conference on computer vision and pattern recognition.","DOI":"10.1109\/CVPR.2008.4587635"},{"key":"1469_CR90","doi-asserted-by":"crossref","unstructured":"Porav, H., Maddern, W., & Newman, P. (2018). Adversarial training for adverse conditions: Robust metric localisation using appearance transfer. In 2018 IEEE international conference on robotics and automation (ICRA) (pp. 1011\u20131018). IEEE.","DOI":"10.1109\/ICRA.2018.8462894"},{"key":"1469_CR91","doi-asserted-by":"crossref","unstructured":"Radenovi\u0107, F., Iscen, A., Tolias, G., Avrithis, Y., & Chum, O. (2018). Revisiting oxford and paris: Large-scale image retrieval benchmarking. In Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR).","DOI":"10.1109\/CVPR.2018.00598"},{"issue":"7","key":"1469_CR92","doi-asserted-by":"publisher","first-page":"1655","DOI":"10.1109\/TPAMI.2018.2846566","volume":"41","author":"F Radenovi\u0107","year":"2018","unstructured":"Radenovi\u0107, F., Tolias, G., & Chum, O. (2018). Fine-tuning CNN image retrieval with no human annotation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 41(7), 1655\u20131668.","journal-title":"IEEE Transactions on Pattern Analysis and Machine Intelligence"},{"key":"1469_CR93","unstructured":"Ranganathan, A. (2013). Detecting and labeling places using runtime change-point detection and place labeling classifiers. US Patent 8,559,717."},{"key":"1469_CR94","doi-asserted-by":"crossref","unstructured":"Revaud, J., Almaz\u00e1n, J., Rezende, R. S., & Souza, C. R. D. (2019a). Learning with average precision: Training image retrieval with a listwise loss. In Proceedings of the IEEE international conference on computer vision (pp. 5107\u20135116).","DOI":"10.1109\/ICCV.2019.00521"},{"key":"1469_CR95","unstructured":"Revaud, J., De\u00a0Souza, C., Humenberger, M., & Weinzaepfel, P. (2019b). R2d2: Reliable and repeatable detector and descriptor. In Advances in neural information processing systems (pp. 12405\u201312415)."},{"key":"1469_CR96","doi-asserted-by":"crossref","unstructured":"Robertson, D. P., & Cipolla, R. (2004). An image-based system for urban navigation. In BMVC (Vol. 19, p. 165). Citeseer.","DOI":"10.5244\/C.18.84"},{"key":"1469_CR97","doi-asserted-by":"crossref","unstructured":"Ros, G., Sellart, L., Materzynska, J., Vazquez, D., & Lopez, A. M. (2016). The synthia dataset: A large collection of synthetic images for semantic segmentation of urban scenes. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 3234\u20133243).","DOI":"10.1109\/CVPR.2016.352"},{"key":"1469_CR98","doi-asserted-by":"crossref","unstructured":"Rosten, E., & Drummond, T. (2006). Machine learning for high-speed corner detection. In ECCV (pp. 430\u2013443). Springer.","DOI":"10.1007\/11744023_34"},{"key":"1469_CR99","doi-asserted-by":"crossref","unstructured":"Sahdev, R., & Tsotsos, J. K. (2016). Indoor place recognition system for localization of mobile robots. In 2016 13th Conference on computer and robot vision (CRV) (pp. 53\u201360). IEEE.","DOI":"10.1109\/CRV.2016.38"},{"key":"1469_CR100","doi-asserted-by":"crossref","unstructured":"Sarlin, P. E., Cadena, C., Siegwart, R., & Dymczyk, M. (2019). From coarse to fine: Robust hierarchical localization at large scale. In CVPR (pp .12716\u201312725).","DOI":"10.1109\/CVPR.2019.01300"},{"key":"1469_CR101","doi-asserted-by":"crossref","unstructured":"Sattler, T., Havlena, M., Schindler, K., & Pollefeys, M. (2016). Large-scale location recognition and the geometric burstiness problem. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 1582\u20131590).","DOI":"10.1109\/CVPR.2016.175"},{"key":"1469_CR102","doi-asserted-by":"crossref","unstructured":"Sattler, T., Maddern, W., Toft, C., Torii, A., Hammarstrand, L., Stenborg, E., Safari, D., Okutomi, M., Pollefeys, M., Sivic, J., et\u00a0al. (2018). Benchmarking 6dof outdoor visual localization in changing conditions. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 8601\u20138610).","DOI":"10.1109\/CVPR.2018.00897"},{"key":"1469_CR103","doi-asserted-by":"crossref","unstructured":"Sch\u00f6nberger, J. L., Pollefeys, M., Geiger, A., & Sattler, T. (2018). Semantic visual localization. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 6896\u20136906).","DOI":"10.1109\/CVPR.2018.00721"},{"issue":"8","key":"1469_CR104","first-page":"735","volume":"21","author":"S Se","year":"2002","unstructured":"Se, S., Lowe, D., & Little, J. (2002). Mobile robot localization and mapping with uncertainty using scale-invariant visual landmarks. IJRR, 21(8), 735\u2013758.","journal-title":"IJRR"},{"key":"1469_CR105","unstructured":"Sermanet, P., Eigen, D., Zhang, X., Mathieu, M., Fergus, R., & LeCun, Y. (2014). Overfeat: Integrated recognition, localization and detection using convolutional networks. In 2nd International conference on learning representations, ICLR 2014."},{"key":"1469_CR106","doi-asserted-by":"crossref","unstructured":"Sim\u00e9oni, O., Avrithis, Y., & Chum, O. (2019). Local features and visual words emerge in activations. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 11651\u201311660).","DOI":"10.1109\/CVPR.2019.01192"},{"key":"1469_CR107","unstructured":"Singh, G., & Kosecka, J. (2010). Visual loop closing using gist descriptors in manhattan world. In ICRA omnidirectional vision workshop (pp. 4042\u20134047)."},{"key":"1469_CR108","doi-asserted-by":"crossref","unstructured":"Sivic, J., & Zisserman, A. (2003). Video google: A text retrieval approach to object matching in videos. In Null (p. 1470). IEEE.","DOI":"10.1109\/ICCV.2003.1238663"},{"key":"1469_CR109","doi-asserted-by":"crossref","unstructured":"Skinner, J., Garg, S., S\u00fcnderhauf, N., Corke, P., Upcroft, B., & Milford, M. (2016). High-fidelity simulation for evaluating robotic vision performance. In 2016 IEEE\/RSJ international conference on intelligent robots and systems (IROS) (pp. 2737\u20132744). IEEE.","DOI":"10.1109\/IROS.2016.7759425"},{"key":"1469_CR110","unstructured":"Skrede, S. (2013). Nordland dataset. https:\/\/bit.ly\/2QVBOym."},{"key":"1469_CR111","doi-asserted-by":"crossref","unstructured":"Stenborg, E., Toft, C., & Hammarstrand, L. (2018). Long-term visual localization using semantically segmented images. In 2018 IEEE ICRA (pp. 6484\u20136490).","DOI":"10.1109\/ICRA.2018.8463150"},{"key":"1469_CR112","doi-asserted-by":"crossref","unstructured":"Stumm, E., Mei, C., & Lacroix, S. (2013). Probabilistic place recognition with covisibility maps. In IROS (pp. 4158\u20134163). IEEE.","DOI":"10.1109\/IROS.2013.6696952"},{"key":"1469_CR113","doi-asserted-by":"crossref","unstructured":"Sturm, J., Engelhard, N., Endres, F., Burgard, W., & Cremers, D. (2012). A benchmark for the evaluation of RGB-D slam systems. In 2012 IEEE\/RSJ international conference on intelligent robots and systems. (pp. 573\u2013580). IEEE.","DOI":"10.1109\/IROS.2012.6385773"},{"key":"1469_CR114","doi-asserted-by":"crossref","unstructured":"S\u00fcnderhauf, N., & Protzel, P. (2011). Brief-gist-closing the loop by simple means. In IROS (pp. 1234\u20131241). IEEE.","DOI":"10.1109\/IROS.2011.6048590"},{"key":"1469_CR115","unstructured":"S\u00fcnderhauf, N., Neubert, P., & Protzel, P. (2013). Are we there yet? challenging SeqSLAM on a 3000 km journey across all four seasons. In Proc. of workshop on long-term autonomy, IEEE international conference on robotics and automation (ICRA) (p. 2013). Citeseer."},{"key":"1469_CR116","doi-asserted-by":"crossref","unstructured":"S\u00fcnderhauf, N., Shirazi, S., Dayoub, F., Upcroft, B., & Milford, M. (2015). On the performance of convnet features for place recognition. In IROS (pp. 4297\u20134304). IEEE.","DOI":"10.1109\/IROS.2015.7353986"},{"key":"1469_CR117","doi-asserted-by":"crossref","unstructured":"Talbot, B., Garg, S., & Milford, M. (2018). OpenSeqSLAM2. 0: An open source toolbox for visual place recognition under changing conditions. In 2018 IEEE\/RSJ international conference on intelligent robots and systems (IROS) (pp. 7758\u20137765). IEEE.","DOI":"10.1109\/IROS.2018.8593761"},{"key":"1469_CR118","doi-asserted-by":"crossref","unstructured":"Tipaldi, G. D., Spinello, L., & Burgard, W. (2013). Geometrical flirt phrases for large scale place recognition in 2d range data. In 2013 IEEE international conference on robotics and automation (pp. 2693\u20132698). IEEE.","DOI":"10.1109\/ICRA.2013.6630947"},{"key":"1469_CR119","doi-asserted-by":"crossref","unstructured":"Tolias, G., Avrithis, Y., & J\u00e9gou, H. (2013). To aggregate or not to aggregate: Selective match kernels for image search. In Proceedings of the IEEE international conference on computer vision (pp. 1401\u20131408).","DOI":"10.1109\/ICCV.2013.177"},{"issue":"3","key":"1469_CR120","doi-asserted-by":"publisher","first-page":"247","DOI":"10.1007\/s11263-015-0810-4","volume":"116","author":"G Tolias","year":"2016","unstructured":"Tolias, G., Avrithis, Y., & J\u00e9gou, H. (2016a). Image search with selective match kernels: aggregation across single and multiple images. International Journal of Computer Vision, 116(3), 247\u2013261.","journal-title":"International Journal of Computer Vision"},{"key":"1469_CR121","unstructured":"Tolias, G., Sicre, R., & J\u00e9gou, H. (2016b). Particular object retrieval with integral max-pooling of CNN activations. In ICLR. arXiv:1511.05879."},{"key":"1469_CR122","unstructured":"Tomit\u0103, M. A., Zaffar, M., Milford, M., McDonald-Maier, K., & Ehsan, S. (2020). ConvSequential-SLAM: A sequence-based, training-less visual place recognition technique for changing environments. arXiv preprint arXiv:2009.13454."},{"key":"1469_CR123","unstructured":"Tomit\u0103, M. A., Zaffar, M., Milford, M., McDonald-Maier, K., & Ehsan, S. (2021). Sequence-based filtering for visual route-based navigation: Analysing the benefits, trade-offs and design choices. arXiv preprint arXiv:2103.01994."},{"key":"1469_CR124","doi-asserted-by":"crossref","unstructured":"Topp, E. A., & Christensen, H. I. (2008). Detecting structural ambiguities and transitions during a guided tour. In 2008 IEEE international conference on robotics and automation (pp. 2564\u20132570). IEEE.","DOI":"10.1109\/ROBOT.2008.4543599"},{"key":"1469_CR125","doi-asserted-by":"crossref","unstructured":"Torii, A., Arandjelovic, R., Sivic, J., Okutomi, M., Pajdla, T. (2015). 24\/7 Place recognition by view synthesis. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 1808\u20131817).","DOI":"10.1109\/CVPR.2015.7298790"},{"key":"1469_CR126","doi-asserted-by":"crossref","unstructured":"Torii, A., Sivic, J., Pajdla, T., & Okutomi, M. (2013). Visual place recognition with repetitive structures. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 883\u2013890).","DOI":"10.1109\/CVPR.2013.119"},{"key":"1469_CR127","unstructured":"Torii, A., Taira, H., Sivic, J., Pollefeys, M., Okutomi, M., Pajdla, T., & Sattler, T. (2019). Are large-scale 3d models really necessary for accurate visual localization? IEEE Transactions on Pattern Analysis and Machine Intelligence."},{"key":"1469_CR128","doi-asserted-by":"crossref","unstructured":"Uy, M. A., & Lee, G. H. (2018). Pointnetvlad: Deep point cloud based retrieval for large-scale place recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 4470\u20134479).","DOI":"10.1109\/CVPR.2018.00470"},{"key":"1469_CR129","doi-asserted-by":"crossref","unstructured":"Wang, J., Zha, H., & Cipolla, R. (2005). Combining interest points and edges for content-based image retrieval. In IEEE international conference on image processing 2005 (Vol. 3, pp. III\u20131256). IEEE.","DOI":"10.1109\/ICIP.2005.1530627"},{"key":"1469_CR130","doi-asserted-by":"crossref","unstructured":"Warburg, F., Hauberg, S., L\u00f3pez-Antequera, M., Gargallo, P., Kuang, Y., & Civera, J. (2020). Mapillary street-level sequences: A dataset for lifelong place recognition. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 2626\u20132635).","DOI":"10.1109\/CVPR42600.2020.00270"},{"key":"1469_CR131","doi-asserted-by":"crossref","unstructured":"Weyand, T., Araujo, A., Cao, B., & Sim, J. (2020). Google landmarks dataset v2-a large-scale benchmark for instance-level recognition and retrieval. In Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition (pp. 2575\u20132584).","DOI":"10.1109\/CVPR42600.2020.00265"},{"key":"1469_CR132","doi-asserted-by":"crossref","unstructured":"Ye, Y., Cieslewski, T., Loquercio, A., & Scaramuzza, D. (2017). Place recognition in semi-dense maps: Geometric and learning-based approaches. In British machine vision conference (BMVC).","DOI":"10.5244\/C.31.74"},{"key":"1469_CR133","doi-asserted-by":"crossref","unstructured":"Yi, K. M., Trulls, E., Lepetit, V., & Fua, P. (2016). Lift: Learned invariant feature transform. In European conference on computer vision. (pp 467\u2013483). Springer.","DOI":"10.1007\/978-3-319-46466-4_28"},{"key":"1469_CR134","unstructured":"Zaffar, M., Ehsan, S., Milford, M., & Maier, K. M. (2018). Memorable maps: A framework for re-defining places in visual place recognition. arXiv preprint arXiv:1811.03529."},{"issue":"2","key":"1469_CR135","doi-asserted-by":"publisher","first-page":"1835","DOI":"10.1109\/LRA.2020.2969917","volume":"5","author":"M Zaffar","year":"2020","unstructured":"Zaffar, M., Ehsan, S., Milford, M., & McDonald-Maier, K. (2020). Cohog: A light-weight, compute-efficient, and training-free visual place recognition technique for changing environments. IEEE Robotics and Automation Letters, 5(2), 1835\u20131842.","journal-title":"IEEE Robotics and Automation Letters"},{"key":"1469_CR136","unstructured":"Zaffar, M., Khaliq, A., Ehsan, S., Milford, M., Alexis, K., & McDonald-Maier, K. (2019a). Are state-of-the-art visual place recognition techniques any good for aerial robotics? In ICRA 2019 workshop on aerial robotics. arXiv preprint arXiv:1904.07967."},{"key":"1469_CR137","unstructured":"Zaffar, M., Khaliq, A., Ehsan, S., Milford, M., & McDonald-Maier, K. (2019b). Levelling the playing field: A comprehensive comparison of visual place recognition approaches under changing conditions. In IEEE ICRA workshop on database generation and benchmarking. arXiv preprint arXiv:1903.09107."},{"key":"1469_CR138","doi-asserted-by":"crossref","unstructured":"Zeng, F., Jacobson, A., Smith, D., Boswell, N., Peynot, T., & Milford, M. (2019). Lookup: Vision-only real-time precise underground localisation for autonomous mining vehicles. In 2019 International conference on robotics and automation (ICRA) (pp. 1444\u20131450). IEEE.","DOI":"10.1109\/ICRA.2019.8794453"},{"key":"1469_CR139","doi-asserted-by":"publisher","first-page":"107760","DOI":"10.1016\/j.patcog.2020.107760","volume":"113","author":"X Zhang","year":"2021","unstructured":"Zhang, X., Wang, L., & Su, Y. (2021). Visual place recognition: A survey from deep learning perspective. Pattern Recognition, 113, 107760.","journal-title":"Pattern Recognition"}],"container-title":["International Journal of Computer Vision"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11263-021-01469-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s11263-021-01469-5\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11263-021-01469-5.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2021,6,9]],"date-time":"2021-06-09T07:10:53Z","timestamp":1623222653000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s11263-021-01469-5"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,5,7]]},"references-count":139,"journal-issue":{"issue":"7","published-print":{"date-parts":[[2021,7]]}},"alternative-id":["1469"],"URL":"https:\/\/doi.org\/10.1007\/s11263-021-01469-5","relation":{},"ISSN":["0920-5691","1573-1405"],"issn-type":[{"value":"0920-5691","type":"print"},{"value":"1573-1405","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,5,7]]},"assertion":[{"value":"15 May 2020","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"7 April 2021","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"7 May 2021","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}