OpenVO: Open-World Visual Odometry with Temporal Dynamics Awareness
Phuc D.A. Nguyen, Anh N. Nhu, Ming C. Lin

TL;DR
OpenVO is a new visual odometry framework that robustly estimates ego-motion from monocular dashcam footage under limited input conditions by encoding temporal dynamics and using geometric priors.
Contribution
It introduces a method that explicitly encodes temporal dynamics and leverages geometric priors, improving robustness across varying observation rates and uncalibrated cameras.
Findings
Achieves over 20% performance improvement on major benchmarks.
Reduces errors by 46%-92% under varying observation rates.
Effective for real-world 3D reconstruction and downstream tasks.
Abstract
We introduce OpenVO, a novel framework for Open-world Visual Odometry (VO) with temporal awareness under limited input conditions. OpenVO effectively estimates real-world-scale ego-motion from monocular dashcam footage with varying observation rates and uncalibrated cameras, enabling robust trajectory dataset construction from rare driving events recorded in dashcam. Existing VO methods are trained on fixed observation frequency (e.g., 10Hz or 12Hz), completely overlooking temporal dynamics information. Many prior methods also require calibrated cameras with known intrinsic parameters. Consequently, their performance degrades when (1) deployed under unseen observation frequencies or (2) applied to uncalibrated cameras. These significantly limit their generalizability to many downstream tasks, such as extracting trajectories from dashcam footage. To address these challenges, OpenVO (1)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
