Ultimate SLAM? Combining Events, Images, and IMU for Robust Visual SLAM in HDR and High Speed Scenarios
Antoni Rosinol Vidal, Henri Rebecq, Timo Horstschaefer, Davide, Scaramuzza

TL;DR
This paper introduces a hybrid visual SLAM system that fuses event camera data, standard images, and inertial measurements to achieve robust and accurate state estimation in HDR and high-speed scenarios, enabling autonomous drone flights in challenging environments.
Contribution
The paper presents the first tightly-coupled fusion pipeline combining events, frames, and IMU data for robust visual SLAM, demonstrating significant accuracy improvements and enabling new autonomous flight capabilities.
Findings
130% accuracy improvement over event-only methods
85% accuracy improvement over standard-frames-only systems
First autonomous quadrotor flight using event camera for state estimation
Abstract
Event cameras are bio-inspired vision sensors that output pixel-level brightness changes instead of standard intensity frames. These cameras do not suffer from motion blur and have a very high dynamic range, which enables them to provide reliable visual information during high speed motions or in scenes characterized by high dynamic range. However, event cameras output only little information when the amount of motion is limited, such as in the case of almost still motion. Conversely, standard cameras provide instant and rich information about the environment most of the time (in low-speed and good lighting scenarios), but they fail severely in case of fast motions, or difficult lighting such as high dynamic range or low light scenes. In this paper, we present the first state estimation pipeline that leverages the complementary advantages of these two sensors by fusing in a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
