SRVIO: Super Robust Visual Inertial Odometry for dynamic environments and challenging Loop-closure conditions
Ali Samadzadeh, Ahmad Nickabadi

TL;DR
This paper introduces SRVIO, a hybrid visual-inertial odometry framework combining geometric and deep learning methods to achieve robust localization in dynamic and challenging environments, outperforming existing approaches.
Contribution
The paper presents a novel hybrid framework based on Vins-Mono that integrates DNNs with geometric SLAM to handle complex real-world challenges more effectively.
Findings
Achieves state-of-the-art results on multiple datasets.
Robustly handles dynamic objects and challenging lighting conditions.
Performs well in extreme simulated scenarios.
Abstract
There has been extensive research on visual localization and odometry for autonomous robots and virtual reality during the past decades. Traditionally, this problem has been solved with the help of expensive sensors, such as lidars. Nowadays, the focus of the leading research in this field is on robust localization using more economic sensors, such as cameras and IMUs. Consequently, geometric visual localization methods have become more accurate in time. However, these methods still suffer from significant loss and divergence in challenging environments, such as a room full of moving people. Scientists started using deep neural networks (DNNs) to mitigate this problem. The main idea behind using DNNs is to better understand challenging aspects of the data and overcome complex conditions such as the movement of a dynamic object in front of the camera that covers the full view of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Vision and Imaging · Advanced Image and Video Retrieval Techniques
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
