Efficient Feature-Free Initialization for Monocular Visual-Inertial Systems Using a Feed-Forward 3D Model

Yuantai Zhang; Jiaqi Yang; Huajian Zeng; Changhao Chen; Haoang Li; Liang Li; Dezhen Song; Xingxing Zuo

arXiv:2605.17327·cs.RO·May 19, 2026

Efficient Feature-Free Initialization for Monocular Visual-Inertial Systems Using a Feed-Forward 3D Model

Yuantai Zhang, Jiaqi Yang, Huajian Zeng, Changhao Chen, Haoang Li, Liang Li, Dezhen Song, Xingxing Zuo

PDF

1 Repo

TL;DR

This paper introduces a feature-free initialization method for monocular visual-inertial systems using a feed-forward 3D model, significantly reducing initialization time and increasing robustness in degraded environments.

Contribution

The authors propose a novel feature-free initialization framework leveraging predicted point clouds, eliminating the need for visual feature tracking in VINS.

Findings

01

Achieves over 90% success rate in initialization.

02

Reduces data duration needed for initialization to under 1.2 seconds.

03

Demonstrates robustness in visually degraded environments.

Abstract

Fast and reliable initialization is critical for monocular visual-inertial navigation systems (VINS), as it establishes the starting conditions for subsequent state estimation. Despite steady progress, most existing methods heavily rely on visual feature correspondences and require 3-4 seconds of sensory data for successful initialization, which limits their applicability and efficiency. With the advent of feed-forward 3D models that can directly predict point clouds from images, we revisit the visual-inertial initialization problem from a concise perspective. In this work, we propose a feature-free initialization framework that leverages up-to-scale point clouds predicted by a feed-forward 3D model, thereby obviating the need for visual feature tracking and estimation. This design substantially reduces system complexity and improves the reliability of initialization. Experiments on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Yuantai-Z/FF-VIO-Init
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.