Ultimate SLAM? Combining Events, Images, and IMU for Robust Visual SLAM in HDR and High Speed Scenarios

Antoni Rosinol Vidal; Henri Rebecq; Timo Horstschaefer; Davide Scaramuzza

arXiv:1709.06310 · cs.CV · April 6, 2018

TL;DR

This paper introduces a hybrid visual SLAM system that fuses event camera data, standard images, and inertial measurements to achieve robust and accurate state estimation in HDR and high-speed scenarios, enabling autonomous drone flights in challenging environments.

Contribution

The paper presents the first tightly-coupled fusion pipeline combining events, frames, and IMU data for robust visual SLAM. It reports accuracy improvements of 130% over event-only methods and 85% over standard-frames-only systems, and demonstrates autonomous quadrotor flight using an event camera for state estimation.

Findings

1. 130% accuracy improvement over event-only methods

2. 85% accuracy improvement over standard-frames-only systems

3. First autonomous quadrotor flight using event camera for state estimation

Abstract

Event cameras are bio-inspired vision sensors that output pixel-level brightness changes instead of standard intensity frames. These cameras do not suffer from motion blur and have a very high dynamic range, which enables them to provide reliable visual information during high speed motions or in scenes characterized by high dynamic range. However, event cameras output only little information when the amount of motion is limited, such as in the case of almost still motion. Conversely, standard cameras provide instant and rich information about the environment most of the time (in low-speed and good lighting scenarios), but they fail severely in case of fast motions, or difficult lighting such as high dynamic range or low light scenes. In this paper, we present the first state estimation pipeline that leverages the complementary advantages of these two sensors by fusing in a…
