TL;DR
This paper introduces a feature-free initialization method for monocular visual-inertial systems using a feed-forward 3D model, significantly reducing initialization time and increasing robustness in degraded environments.
Contribution
The authors propose a novel feature-free initialization framework leveraging predicted point clouds, eliminating the need for visual feature tracking in VINS.
Findings
Achieves over 90% success rate in initialization.
Reduces data duration needed for initialization to under 1.2 seconds.
Demonstrates robustness in visually degraded environments.
Abstract
Fast and reliable initialization is critical for monocular visual-inertial navigation systems (VINS), as it establishes the starting conditions for subsequent state estimation. Despite steady progress, most existing methods heavily rely on visual feature correspondences and require 3-4 seconds of sensory data for successful initialization, which limits their applicability and efficiency. With the advent of feed-forward 3D models that can directly predict point clouds from images, we revisit the visual-inertial initialization problem from a concise perspective. In this work, we propose a feature-free initialization framework that leverages up-to-scale point clouds predicted by a feed-forward 3D model, thereby obviating the need for visual feature tracking and estimation. This design substantially reduces system complexity and improves the reliability of initialization. Experiments on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
