Don't Start from Scratch: Behavioral Refinement via Interpolant-based Policy Diffusion
Kaiqi Chen, Eugene Lim, Kelvin Lin, Yiyang Chen, Harold Soh

TL;DR
This paper introduces BRIDGER, a diffusion-based imitation learning method that starts from an informative source policy rather than Gaussian noise, improving performance especially with limited data and diffusion steps.
Contribution
The work presents a novel stochastic interpolants framework for policy diffusion, allowing flexible source policies and demonstrating superior results over existing methods.
Findings
BRIDGER outperforms state-of-the-art diffusion policies on benchmarks.
Using informative source policies enhances diffusion learning.
Theoretical analysis supports the benefits of policy initialization.
Abstract
Imitation learning empowers artificial agents to mimic behavior by learning from demonstrations. Recently, diffusion models, which have the ability to model high-dimensional and multimodal distributions, have shown impressive performance on imitation learning tasks. These models learn to shape a policy by diffusing actions (or states) from standard Gaussian noise. However, the target policy to be learned is often significantly different from Gaussian and this mismatch can result in poor performance when using a small number of diffusion steps (to improve inference speed) and under limited data. The key idea in this work is that initiating from a more informative source than Gaussian enables diffusion methods to mitigate the above limitations. We contribute both theoretical results, a new method, and empirical findings that show the benefits of using an informative source policy. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic Prediction and Management Techniques
MethodsDiffusion
