ViKiNG: Vision-Based Kilometer-Scale Navigation with Geographic Hints
Dhruv Shah, Sergey Levine

TL;DR
ViKiNG is a vision-based navigation system that combines learned local traversability models with heuristic planning using geographic hints, enabling long-range navigation in unseen environments without explicit 3D reconstruction.
Contribution
It introduces a novel integration of learning and planning that utilizes side information like maps and GPS as heuristics, not relying on precise geometric reconstruction.
Findings
Successfully navigates up to 3 km in unseen environments
Operates effectively with unreliable maps and GPS
Demonstrates complex behaviors like probing and backtracking
Abstract
Robotic navigation has been approached as a problem of 3D reconstruction and planning, as well as an end-to-end learning problem. However, long-range navigation requires both planning and reasoning about local traversability, as well as being able to utilize general knowledge about global geography, in the form of a roadmap, GPS, or other side information providing important cues. In this work, we propose an approach that integrates learning and planning, and can utilize side information such as schematic roadmaps, satellite maps and GPS coordinates as a planning heuristic, without relying on them being accurate. Our method, ViKiNG, incorporates a local traversability model, which looks at the robot's current camera observation and a potential subgoal to infer how easily that subgoal can be reached, as well as a heuristic model, which looks at overhead maps for hints and attempts to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsGreedy Policy Search
