Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous Driving and Zero-Shot Instruction Following
Brian Yang, Huangyuan Su, Nikolaos Gkanatsios, Tsung-Wei Ke, Ayush, Jain, Jeff Schneider, Katerina Fragkiadaki

TL;DR
DiffusionES introduces a gradient-free, diffusion-based planning method that efficiently optimizes complex, non-differentiable objectives for autonomous driving and instruction following, outperforming existing approaches on challenging benchmarks.
Contribution
The paper presents DiffusionES, a novel gradient-free trajectory optimization method combining diffusion models with evolutionary search for non-differentiable objectives.
Findings
Achieves state-of-the-art results on nuPlan benchmark.
Effectively optimizes non-differentiable language-shaped reward functions.
Enables complex behaviors like aggressive lane weaving.
Abstract
Diffusion models excel at modeling complex and multimodal trajectory distributions for decision-making and control. Reward-gradient guided denoising has been recently proposed to generate trajectories that maximize both a differentiable reward function and the likelihood under the data distribution captured by a diffusion model. Reward-gradient guided denoising requires a differentiable reward function fitted to both clean and noised samples, limiting its applicability as a general trajectory optimizer. In this paper, we propose DiffusionES, a method that combines gradient-free optimization with trajectory denoising to optimize black-box non-differentiable objectives while staying in the data manifold. Diffusion-ES samples trajectories during evolutionary search from a diffusion model and scores them using a black-box reward function. It mutates high-scoring trajectories using a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotic Path Planning Algorithms · Robotic Mechanisms and Dynamics · Teaching and Learning Programming
MethodsDiffusion
