{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,12,24]],"date-time":"2025-12-24T12:20:39Z","timestamp":1766578839821,"version":"3.41.2"},"reference-count":34,"publisher":"Frontiers Media SA","license":[{"start":{"date-parts":[[2024,1,19]],"date-time":"2024-01-19T00:00:00Z","timestamp":1705622400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["frontiersin.org"],"crossmark-restriction":true},"short-container-title":["Front. Neurorobot."],"abstract":"<jats:p>Loop closure detection is an important module for simultaneous localization and mapping (SLAM). Correct detection of loops can reduce the cumulative drift in positioning. Because traditional detection methods rely on handicraft features, false positive detections can occur when the environment changes, resulting in incorrect estimates and an inability to obtain accurate maps. In this research paper, a loop closure detection method based on a variational autoencoder (VAE) is proposed. It is intended to be used as a feature extractor to extract image features through neural networks to replace the handicraft features used in traditional methods. This method extracts a low-dimensional vector as the representation of the image. At the same time, the attention mechanism is added to the network and constraints are added to improve the loss function for better image representation. In the back-end feature matching process, geometric checking is used to filter out the wrong matching for the false positive problem. Finally, through numerical experiments, the proposed method is demonstrated to have a better precision-recall curve than the traditional method of the bag-of-words model and other deep learning methods and is highly robust to environmental changes. In addition, experiments on datasets from three different scenarios also demonstrate that the method can be applied in real-world scenarios and that it has a good performance.<\/jats:p>","DOI":"10.3389\/fnbot.2023.1301785","type":"journal-article","created":{"date-parts":[[2024,1,19]],"date-time":"2024-01-19T04:17:31Z","timestamp":1705637851000},"update-policy":"https:\/\/doi.org\/10.3389\/crossmark-policy","source":"Crossref","is-referenced-by-count":6,"title":["Loop closure detection of visual SLAM based on variational autoencoder"],"prefix":"10.3389","volume":"17","author":[{"given":"Shibin","family":"Song","sequence":"first","affiliation":[]},{"given":"Fengjie","family":"Yu","sequence":"additional","affiliation":[]},{"given":"Xiaojie","family":"Jiang","sequence":"additional","affiliation":[]},{"given":"Jie","family":"Zhu","sequence":"additional","affiliation":[]},{"given":"Weihao","family":"Cheng","sequence":"additional","affiliation":[]},{"given":"Xiao","family":"Fang","sequence":"additional","affiliation":[]}],"member":"1965","published-online":{"date-parts":[[2024,1,19]]},"reference":[{"key":"B1","first-page":"5297","article-title":"NetVLAD: CNN architecture for weakly supervised place recognition","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"Arandjelovic","year":"2016"},{"key":"B2","doi-asserted-by":"publisher","first-page":"1243","DOI":"10.3390\/s21041243","article-title":"Role of deep learning in loop closure detection for visual and lidar slam: a survey","volume":"21","author":"Arshad","year":"2021","journal-title":"Sensors"},{"key":"B3","doi-asserted-by":"publisher","first-page":"404","DOI":"10.1007\/11744023_32","article-title":"Surf: speeded up robust features","volume":"3951","author":"Bay","year":"2006","journal-title":"Lect. Notes Comput. Sci."},{"key":"B4","doi-asserted-by":"publisher","first-page":"1798","DOI":"10.1109\/TPAMI.2013.50","article-title":"Representation learning: a review and new perspectives","volume":"35","author":"Bengio","year":"2013","journal-title":"Trans. Pattern Anal. Mach. Intell"},{"key":"B5","doi-asserted-by":"crossref","DOI":"10.1109\/CVPR.2018.00132","article-title":"COCO-stuff: thing and stuff classes in context","volume-title":"2018 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)","author":"Caesar","year":"2018"},{"key":"B6","doi-asserted-by":"crossref","first-page":"778","DOI":"10.1007\/978-3-642-15561-1_56","article-title":"Brief: binary robust independent elementary features","volume-title":"Computer Vision\u2013ECCV 2010: 11th European Conference on Computer Vision, Heraklion, Crete, Greece, September 5-11, 2010 Proceedings, Part IV 11","author":"Calonder","year":"2010"},{"key":"B7","doi-asserted-by":"publisher","first-page":"647","DOI":"10.1177\/0278364908090961","article-title":"FAB-MAP: probabilistic localization and mapping in the space of appearance","volume":"27","author":"Cummins","year":"2008","journal-title":"Int. J. Rob. Res"},{"key":"B8","doi-asserted-by":"publisher","first-page":"4168","DOI":"10.3390\/electronics11244168","article-title":"FPSNET: an architecture for neural-network-based feature point extraction for SLAM","volume":"11","author":"Farrukh","year":"2022","journal-title":"Electronics"},{"key":"B9","doi-asserted-by":"publisher","first-page":"2006","DOI":"10.3390\/electronics12092006","article-title":"Deep learning for visual SLAM: the state-of-the-art and future trends","volume":"12","author":"Favorskaya","year":"2023","journal-title":"Electronics"},{"key":"B10","first-page":"3921","article-title":"A visual bag of words method for interactive qualitative localization and mapping","volume-title":"International Conference on Robotics and Automation","author":"Filliat","year":"2007"},{"key":"B11","doi-asserted-by":"publisher","first-page":"1188","DOI":"10.1109\/TRO.2012.2197158","article-title":"Bags of binary words for fast place recognition in image sequences","volume":"28","author":"G\u00e1lvez-L\u00f3pez","year":"2012","journal-title":"IEEE Trans. Robot"},{"key":"B12","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1007\/s10514-015-9516-2","article-title":"Unsupervised learning to detect loops using deep neural networks for visual SLAM system","volume":"41","author":"Gao","year":"2017","journal-title":"Auton. Robots"},{"key":"B13","doi-asserted-by":"publisher","first-page":"3051","DOI":"10.1109\/LRA.2018.2849609","article-title":"IBoW-LCD: an appearance-based loop-closure detection approach using incremental bags of binary words","volume":"3","author":"Garcia-Fidalgo","year":"2018","journal-title":"IEEE Robot. Autom. Lett"},{"key":"B14","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.1804.05526","article-title":"Lost? appearance-invariant place recognition for opposite viewpoints using visual semantics","author":"Garg","year":"2018","journal-title":"arXiv"},{"key":"B15","doi-asserted-by":"crossref","first-page":"3354","DOI":"10.1109\/CVPR.2012.6248074","article-title":"Are we ready for autonomous driving? The kitti vision benchmark suite","volume-title":"2012 IEEE Conference on Computer Vision and Pattern Recognition","author":"Geiger","year":"2012"},{"key":"B16","first-page":"2238","article-title":"Convolutional neural network-based image representation for visual loop closure detection","volume-title":"IEEE International Conference on Information and Automation","author":"Hou","year":"2015"},{"key":"B17","first-page":"7132","article-title":"Squeeze-and-excitation networks","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"Hu","year":"2018"},{"key":"B18","doi-asserted-by":"publisher","first-page":"201","DOI":"10.1016\/j.robot.2018.11.009","article-title":"Semi-direct monocular visual and visual-inertial SLAM with loop closure detection","volume":"112","author":"Li","year":"2019","journal-title":"Robot. Auton. Syst"},{"key":"B19","doi-asserted-by":"publisher","first-page":"2876","DOI":"10.1109\/TMECH.2023.3253715","article-title":"Dlc-slam: a robust lidar-slam system with learning-based denoising and loop closure","volume":"28","author":"Liu","year":"2023","journal-title":"IEEE\/ASME Trans. Mechatron"},{"key":"B20","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.2012.09439","article-title":"FG-Net: fast large-scale LiDAR point clouds understanding network leveraging correlated feature mining and geometric-aware modelling","author":"Liu","year":"2020","journal-title":"arXiv"},{"key":"B21","doi-asserted-by":"publisher","first-page":"91","DOI":"10.1023\/B:VISI.0000029664.99615.94","article-title":"Distinctive image features from scale-invariant keypoints","volume":"60","author":"Lowe","year":"2004","journal-title":"Int. J. Comput. Vis"},{"key":"B22","doi-asserted-by":"publisher","first-page":"824","DOI":"10.1109\/TPAMI.2018.2889473","article-title":"Efficient and robust approximate nearest neighbor search using hierarchical navigable small worldgraphs","volume":"42","author":"Malkov","year":"2018","journal-title":"IEEE Trans. Pattern Anal. Mach. Intell"},{"key":"B23","doi-asserted-by":"publisher","first-page":"103470","DOI":"10.1016\/j.robot.2020.103470","article-title":"Loop closure detection using supervised and unsupervised deep neural networks for monocular SLAM systems","volume":"126","author":"Memon","year":"2020","journal-title":"Rob. Auton. Syst"},{"key":"B24","doi-asserted-by":"crossref","DOI":"10.15607\/RSS.2018.XIV.032","article-title":"Lightweight unsupervised deep loop closure","volume-title":"Proc. of Robotics: Science and Systems (RSS)","author":"Merrill","year":"2018"},{"key":"B25","doi-asserted-by":"publisher","first-page":"1147","DOI":"10.1109\/TRO.2015.2463671","article-title":"ORB-SLAM: a versatile and accurate monocular SLAM system","volume":"31","author":"Mur-Artal","year":"2015","journal-title":"IEEE Trans. Robot."},{"key":"B26","doi-asserted-by":"publisher","first-page":"1255","DOI":"10.1109\/TRO.2017.2705103","article-title":"Orb-slam2: an open-source slam system for monocular, stereo, and RGB-D cameras","volume":"33","author":"Mur-Artal","year":"2017","journal-title":"IEEE Trans. Robot."},{"key":"B27","doi-asserted-by":"publisher","first-page":"105797","DOI":"10.1016\/j.engappai.2022.105797","article-title":"PlaceNet: a multi-scale semantic-aware model for visual loop closure detection","volume":"119","author":"Osman","year":"2023","journal-title":"Eng. Appl. Artif. Intell."},{"key":"B28","first-page":"426","article-title":"Loop closure detection in SLAM by combining visual CNN features and submaps","volume-title":"Proceedings of the 4th International Conference on Control, Automation and Robotics, ICCAR, Auckland, New Zealand, 20\u201323 April","author":"Qin","year":"2018"},{"key":"B29","doi-asserted-by":"publisher","first-page":"787659","DOI":"10.3389\/fnsys.2022.787659","article-title":"Generalized simultaneous localization and mapping (G-SLAM) as unification framework for natural and artificial intelligences: towards reverse engineering the hippocampal\/entorhinal system and principles of high-level cognition","volume":"16","author":"Safron","year":"2022","journal-title":"Front. Syst. Neurosci."},{"key":"B30","first-page":"6896","article-title":"Semantic visual localization","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition","author":"Sch\u00f6nberger","year":"2018"},{"key":"B31","doi-asserted-by":"crossref","first-page":"888","DOI":"10.1109\/IEEECONF44664.2019.9048921","article-title":"A closer look at disentangling in \u03b2-VAE","volume-title":"2019 53rd Asilomar Conference on Signals, Systems, and Computers","author":"Sikka","year":"2019"},{"key":"B32","doi-asserted-by":"publisher","first-page":"60552","DOI":"10.1109\/ACCESS.2020.2982228","article-title":"Compressed holistic convnet representations for detecting loop closures in dynamic environments","volume":"8","author":"Wang","year":"2020","journal-title":"IEEE Access"},{"key":"B33","doi-asserted-by":"publisher","first-page":"1087","DOI":"10.1109\/JAS.2022.105635","article-title":"Loop closure detection with reweighting NetVLAD and local motion and structure consensus","volume":"9","author":"Zhang","year":"2022","journal-title":"IEEE\/CAA J. Autom. Sin."},{"key":"B34","first-page":"1","article-title":"Loop closure detection for visual SLAM systems using convolutional neural network","volume-title":"23rd International Conference on Automation and Computing (ICAC)","author":"Zhang","year":"2017"}],"container-title":["Frontiers in Neurorobotics"],"original-title":[],"link":[{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fnbot.2023.1301785\/full","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,1,19]],"date-time":"2024-01-19T04:17:41Z","timestamp":1705637861000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.frontiersin.org\/articles\/10.3389\/fnbot.2023.1301785\/full"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,1,19]]},"references-count":34,"alternative-id":["10.3389\/fnbot.2023.1301785"],"URL":"https:\/\/doi.org\/10.3389\/fnbot.2023.1301785","relation":{},"ISSN":["1662-5218"],"issn-type":[{"type":"electronic","value":"1662-5218"}],"subject":[],"published":{"date-parts":[[2024,1,19]]},"article-number":"1301785"}}