Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous   Driving and Zero-Shot Instruction Following

Brian Yang; Huangyuan Su; Nikolaos Gkanatsios; Tsung-Wei Ke; Ayush; Jain; Jeff Schneider; Katerina Fragkiadaki

arXiv:2402.06559·cs.LG·July 18, 2024·1 cites

Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous Driving and Zero-Shot Instruction Following

Brian Yang, Huangyuan Su, Nikolaos Gkanatsios, Tsung-Wei Ke, Ayush, Jain, Jeff Schneider, Katerina Fragkiadaki

PDF

Open Access 1 Repo

TL;DR

DiffusionES introduces a gradient-free, diffusion-based planning method that efficiently optimizes complex, non-differentiable objectives for autonomous driving and instruction following, outperforming existing approaches on challenging benchmarks.

Contribution

The paper presents DiffusionES, a novel gradient-free trajectory optimization method combining diffusion models with evolutionary search for non-differentiable objectives.

Findings

01

Achieves state-of-the-art results on nuPlan benchmark.

02

Effectively optimizes non-differentiable language-shaped reward functions.

03

Enables complex behaviors like aggressive lane weaving.

Abstract

Diffusion models excel at modeling complex and multimodal trajectory distributions for decision-making and control. Reward-gradient guided denoising has been recently proposed to generate trajectories that maximize both a differentiable reward function and the likelihood under the data distribution captured by a diffusion model. Reward-gradient guided denoising requires a differentiable reward function fitted to both clean and noised samples, limiting its applicability as a general trajectory optimizer. In this paper, we propose DiffusionES, a method that combines gradient-free optimization with trajectory denoising to optimize black-box non-differentiable objectives while staying in the data manifold. Diffusion-ES samples trajectories during evolutionary search from a diffusion model and scores them using a black-box reward function. It mutates high-scoring trajectories using a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

bhyang/diffusion-es
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotic Path Planning Algorithms · Robotic Mechanisms and Dynamics · Teaching and Learning Programming

MethodsDiffusion