Lightweight Unsupervised Deep Loop Closure
Nate Merrill, Guoquan Huang

TL;DR
This paper introduces an unsupervised deep neural network for visual loop closure detection in SLAM, using a robust feature embedding trained with geometric and appearance invariance, achieving real-time performance.
Contribution
It presents a novel autoencoder-based architecture that learns robust features without labeled data, tailored for efficient and reliable loop closure detection.
Findings
Outperforms state-of-the-art methods in effectiveness
Operates in real time without dimensionality reduction
Robust to extreme appearance variations
Abstract
Robust efficient loop closure detection is essential for large-scale real-time SLAM. In this paper, we propose a novel unsupervised deep neural network architecture of a feature embedding for visual loop closure that is both reliable and compact. Our model is built upon the autoencoder architecture, tailored specifically to the problem at hand. To train our network, we inflict random noise on our input data as the denoising autoencoder does, but, instead of applying random dropout, we warp images with randomized projective transformations to emulate natural viewpoint changes due to robot motion. Moreover, we utilize the geometric information and illumination invariance provided by histogram of oriented gradients (HOG), forcing the encoder to reconstruct a HOG descriptor instead of the original image. As a result, our trained model extracts features robust to extreme variations in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques · Advanced Vision and Imaging
