HyVGGT-VO: Tightly Coupled Hybrid Dense Visual Odometry with Feed-Forward Models

Junxiang Pan; Lipu Zhou; Baojie Chen

arXiv:2604.02107·cs.RO·April 3, 2026

HyVGGT-VO: Tightly Coupled Hybrid Dense Visual Odometry with Feed-Forward Models

Junxiang Pan, Lipu Zhou, Baojie Chen

PDF

1 Repo

TL;DR

HyVGGT-VO is a novel dense visual odometry framework that combines traditional sparse methods with feed-forward models for real-time dense mapping, achieving significant speed and accuracy improvements.

Contribution

This work introduces the first tight coupling of traditional VO with VGGT feed-forward models, including an adaptive hybrid tracker and hierarchical optimization for global scale consistency.

Findings

01

Achieves 5x speedup over existing VGGT methods.

02

Reduces trajectory error by 85% on EuRoC dataset.

03

Reduces trajectory error by 12% on KITTI benchmark.

Abstract

Dense visual odometry (VO), which provides pose estimation and dense 3D reconstruction, serves as the cornerstone for applications ranging from robotics to augmented reality. Recently, feed-forward models have demonstrated remarkable capabilities in dense mapping. However, when these models are used in dense visual SLAM systems, their heavy computational burden restricts them to yielding sparse pose outputs at keyframes while still failing to achieve real-time pose estimation. In contrast, traditional sparse methods provide high computational efficiency and high-frequency pose outputs, but lack the capability for dense reconstruction. To address these limitations, we propose HyVGGT-VO, a novel framework that combines the computational efficiency of sparse VO with the dense reconstruction capabilities of feed-forward models. To the best of our knowledge, this is the first work to tightly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://geneta2580.github.io/HyVGGT-VO.io
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.