CALC2.0: Combining Appearance, Semantic and Geometric Information for   Robust and Efficient Visual Loop Closure

Nathaniel Merrill; Guoquan Huang

arXiv:1910.14103·cs.RO·November 1, 2019·1 cites

CALC2.0: Combining Appearance, Semantic and Geometric Information for Robust and Efficient Visual Loop Closure

Nathaniel Merrill, Guoquan Huang

PDF

Open Access 1 Repo

TL;DR

CALC2.0 introduces a deep learning-based loop closure detection method that combines appearance, semantic, and geometric information from a single CNN, eliminating thresholding and improving robustness and speed over existing methods.

Contribution

The paper presents a novel CNN architecture that integrates semantic, appearance, and geometric features for robust, threshold-free visual loop closure detection.

Findings

01

Outperforms state-of-the-art NetVLAD in precision-recall.

02

Achieves real-time speeds for loop closure detection.

03

Requires no user-defined thresholding, reducing false positives.

Abstract

Traditional attempts for loop closure detection typically use hand-crafted features, relying on geometric and visual information only, whereas more modern approaches tend to use semantic, appearance or geometric features extracted from deep convolutional neural networks (CNNs). While these approaches are successful in many applications, they do not utilize all of the information that a monocular image provides, and many of them, particularly the deep-learning based methods, require user-chosen thresholding to actually close loops -- which may impact generality in practical applications. In this work, we address these issues by extracting all three modes of information from a custom deep CNN trained specifically for the task of place recognition. Our network is built upon a combination of a semantic segmentator, Variational Autoencoder (VAE) and triplet embedding network. The network is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rpng/calc2.0
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques · Advanced Vision and Imaging