CALC2.0: Combining Appearance, Semantic and Geometric Information for Robust and Efficient Visual Loop Closure
Nathaniel Merrill, Guoquan Huang

TL;DR
CALC2.0 introduces a deep learning-based loop closure detection method that combines appearance, semantic, and geometric information from a single CNN, eliminating thresholding and improving robustness and speed over existing methods.
Contribution
The paper presents a novel CNN architecture that integrates semantic, appearance, and geometric features for robust, threshold-free visual loop closure detection.
Findings
Outperforms state-of-the-art NetVLAD in precision-recall.
Achieves real-time speeds for loop closure detection.
Requires no user-defined thresholding, reducing false positives.
Abstract
Traditional attempts for loop closure detection typically use hand-crafted features, relying on geometric and visual information only, whereas more modern approaches tend to use semantic, appearance or geometric features extracted from deep convolutional neural networks (CNNs). While these approaches are successful in many applications, they do not utilize all of the information that a monocular image provides, and many of them, particularly the deep-learning based methods, require user-chosen thresholding to actually close loops -- which may impact generality in practical applications. In this work, we address these issues by extracting all three modes of information from a custom deep CNN trained specifically for the task of place recognition. Our network is built upon a combination of a semantic segmentator, Variational Autoencoder (VAE) and triplet embedding network. The network is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques · Advanced Vision and Imaging
