EMA-VIO: Deep Visual-Inertial Odometry with External Memory Attention

Zheming Tu; Changhao Chen; Xianfei Pan; Ruochen Liu; Jiarui Cui; Jun; Mao

arXiv:2209.08490·cs.CV·September 20, 2022

EMA-VIO: Deep Visual-Inertial Odometry with External Memory Attention

Zheming Tu, Changhao Chen, Xianfei Pan, Ruochen Liu, Jiarui Cui, Jun, Mao

PDF

Open Access

TL;DR

This paper introduces EMA-VIO, a deep learning visual-inertial odometry model with external memory attention, achieving accurate and robust localization in challenging environments by effectively fusing multimodal sensor data.

Contribution

The paper presents a novel VIO framework utilizing external memory attention, improving data fusion efficiency and robustness over existing recurrent-based models.

Findings

01

Outperforms traditional and learning-based VIO methods in various scenes.

02

Maintains accuracy in challenging conditions like overcast days and water-filled terrains.

03

Demonstrates robustness and efficiency in pose estimation.

Abstract

Accurate and robust localization is a fundamental need for mobile agents. Visual-inertial odometry (VIO) algorithms exploit the information from camera and inertial sensors to estimate position and translation. Recent deep learning based VIO models attract attentions as they provide pose information in a data-driven way, without the need of designing hand-crafted algorithms. Existing learning based VIO models rely on recurrent models to fuse multimodal data and process sensor signal, which are hard to train and not efficient enough. We propose a novel learning based VIO framework with external memory attention that effectively and efficiently combines visual and inertial features for states estimation. Our proposed model is able to estimate pose accurately and robustly, even in challenging scenarios, e.g., on overcast days and water-filled ground , which are difficult for traditional…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization · Advanced Vision and Imaging · 3D Surveying and Cultural Heritage