Leveraging Deep Visual Descriptors for Hierarchical Efficient   Localization

Paul-Edouard Sarlin; Fr\'ed\'eric Debraine; Marcin Dymczyk; Roland; Siegwart; Cesar Cadena

arXiv:1809.01019·cs.CV·September 20, 2018

Leveraging Deep Visual Descriptors for Hierarchical Efficient Localization

Paul-Edouard Sarlin, Fr\'ed\'eric Debraine, Marcin Dymczyk, Roland, Siegwart, Cesar Cadena

PDF

1 Repo

TL;DR

This paper introduces a hierarchical visual localization method that combines global deep descriptors and local 2D-3D matching, achieving real-time, high-accuracy pose estimation on resource-limited robotic platforms.

Contribution

It proposes a novel hierarchical approach leveraging deep learning for efficient localization, overcoming limitations of binary descriptors in large-scale environments.

Findings

01

Achieves state-of-the-art localization accuracy.

02

Runs in real-time on mobile robotic platforms.

03

Effectively handles perceptual aliasing and environmental changes.

Abstract

Many robotics applications require precise pose estimates despite operating in large and changing environments. This can be addressed by visual localization, using a pre-computed 3D model of the surroundings. The pose estimation then amounts to finding correspondences between 2D keypoints in a query image and 3D points in the model using local descriptors. However, computational power is often limited on robotic platforms, making this task challenging in large-scale environments. Binary feature descriptors significantly speed up this 2D-3D matching, and have become popular in the robotics community, but also strongly impair the robustness to perceptual aliasing and changes in viewpoint, illumination and scene structure. In this work, we propose to leverage recent advances in deep learning to perform an efficient hierarchical localization. We first localize at the map level using learned…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ethz-asl/hierarchical_loc
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings