Don't Start from Scratch: Behavioral Refinement via Interpolant-based   Policy Diffusion

Kaiqi Chen; Eugene Lim; Kelvin Lin; Yiyang Chen; Harold Soh

arXiv:2402.16075·cs.LG·July 12, 2024·1 cites

Don't Start from Scratch: Behavioral Refinement via Interpolant-based Policy Diffusion

Kaiqi Chen, Eugene Lim, Kelvin Lin, Yiyang Chen, Harold Soh

PDF

Open Access 1 Repo

TL;DR

This paper introduces BRIDGER, a diffusion-based imitation learning method that starts from an informative source policy rather than Gaussian noise, improving performance especially with limited data and diffusion steps.

Contribution

The work presents a novel stochastic interpolants framework for policy diffusion, allowing flexible source policies and demonstrating superior results over existing methods.

Findings

01

BRIDGER outperforms state-of-the-art diffusion policies on benchmarks.

02

Using informative source policies enhances diffusion learning.

03

Theoretical analysis supports the benefits of policy initialization.

Abstract

Imitation learning empowers artificial agents to mimic behavior by learning from demonstrations. Recently, diffusion models, which have the ability to model high-dimensional and multimodal distributions, have shown impressive performance on imitation learning tasks. These models learn to shape a policy by diffusing actions (or states) from standard Gaussian noise. However, the target policy to be learned is often significantly different from Gaussian and this mismatch can result in poor performance when using a small number of diffusion steps (to improve inference speed) and under limited data. The key idea in this work is that initiating from a more informative source than Gaussian enables diffusion methods to mitigate the above limitations. We contribute both theoretical results, a new method, and empirical findings that show the benefits of using an informative source policy. Our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

clear-nus/bridger
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTraffic Prediction and Management Techniques

MethodsDiffusion