Versatile Navigation under Partial Observability via Value-guided Diffusion Policy
Gengyu Zhang, Hao Tang, Yan Yan

TL;DR
This paper introduces a versatile, value-guided diffusion policy for 2D and 3D route planning under partial observability, outperforming existing methods by predicting actions with foresight and generalizing across environments.
Contribution
The paper presents a novel value-guided diffusion approach with a differentiable planner and zero-shot transfer capability from 2D to 3D environments, addressing partial observability challenges.
Findings
Outperforms state-of-the-art autoregressive and diffusion baselines.
Enables zero-shot transfer from 2D to 3D environments.
Achieves higher planning success rates in complex scenarios.
Abstract
Route planning for navigation under partial observability plays a crucial role in modern robotics and autonomous driving. Existing route planning approaches can be categorized into two main classes: traditional autoregressive and diffusion-based methods. The former often fails due to its myopic nature, while the latter either assumes full observability or struggles to adapt to unfamiliar scenarios, due to strong couplings with behavior cloning from experts. To address these deficiencies, we propose a versatile diffusion-based approach for both 2D and 3D route planning under partial observability. Specifically, our value-guided diffusion policy first generates plans to predict actions across various timesteps, providing ample foresight to the planning. It then employs a differentiable planner with state estimations to derive a value function, directing the agent's exploration and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAuction Theory and Applications · Optimization and Search Problems · Game Theory and Voting Systems
MethodsDiffusion
