YOPO-Nav: Visual Navigation using 3DGS Graphs from One-Pass Videos
Ryan Meegan, Adam D'Souza, Bryan Bo Cao, Shubham Jain, Kristin Dana

TL;DR
YOPO-Nav is a novel visual navigation method that encodes environments into compact 3D Gaussian Splatting models from one-pass videos, enabling robots to retrace trajectories without detailed maps.
Contribution
The paper introduces YOPO-Nav, a new approach that uses 3DGS models and hierarchical localization for efficient visual navigation from single-pass videos.
Findings
YOPO-Nav achieves high accuracy in image-goal navigation tasks.
The YOPO-Campus dataset provides a new benchmark for real-world visual navigation.
Experimental results demonstrate YOPO-Nav's effectiveness on physical robots.
Abstract
Visual navigation has emerged as a practical alternative to traditional robotic navigation pipelines that rely on detailed mapping and path planning. However, constructing and maintaining 3D maps is often computationally expensive and memory-intensive. We address the problem of visual navigation when exploration videos of a large environment are available. The videos serve as a visual reference, allowing a robot to retrace the explored trajectories without relying on metric maps. Our proposed method, YOPO-Nav (You Only Pass Once), encodes an environment into a compact spatial representation composed of interconnected local 3D Gaussian Splatting (3DGS) models. During navigation, the framework aligns the robot's current visual observation with this representation and predicts actions that guide it back toward the demonstrated trajectory. YOPO-Nav employs a hierarchical design: a visual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Multimodal Machine Learning Applications · Advanced Vision and Imaging
