UMS-VINS: United Monocular-Stereo Features for Visual-Inertial Tightly Coupled Odometry
Chaoyang Jiang, Xiaoni Zheng, Zhe Jin, and Chengpu Yu

TL;DR
UMS-VINS enhances visual-inertial odometry by integrating monocular-stereo features, improving robustness and accuracy in pose estimation through novel feature extraction and fallback strategies.
Contribution
It introduces a unified approach combining monocular and stereo features into a tightly coupled visual-inertial odometry system, with improved feature tracking and robustness mechanisms.
Findings
Outperforms VINS-FUSION in accuracy and robustness
Effective in real-world and public datasets
Handles visual initialization and alignment failures
Abstract
This paper introduces the united monocular-stereo features into a visual-inertial tightly coupled odometry (UMS-VINS) for robust pose estimation. UMS-VINS requires two cameras and a low-cost inertial measurement unit (IMU). The UMS-VINS is an evolution of VINS-FUSION, which modifies the VINS-FUSION from the following three perspectives. 1) UMS-VINS extracts and tracks features from the sub-pixel plane to achieve better positions of the features. 2) UMS-VINS introduces additional 2-dimensional features from the left and/or right cameras. 3) If the visual initialization fails, the IMU propagation is directly used for pose estimation, and if the visual-IMU alignment fails, UMS-VINS estimates the pose via the visual odometry. The performances on both public datasets and new real-world experiments indicate that the proposed UMS-VINS outperforms the VINS-FUSION from the perspective of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Vision and Imaging · 3D Surveying and Cultural Heritage
