AdaFlow: Imitation Learning with Variance-Adaptive Flow-Based Policies
Xixi Hu, Bo Liu, Xingchao Liu, Qiang Liu

TL;DR
AdaFlow introduces a variance-adaptive flow-based policy model for imitation learning that achieves fast inference and maintains diverse, multi-modal action generation by adaptively solving ODEs during decision-making.
Contribution
It proposes a novel variance-adaptive ODE solver for flow-based policies, enabling rapid inference while preserving diversity in multi-modal decision-making.
Findings
AdaFlow outperforms diffusion-based methods in inference speed.
It maintains high diversity in generated actions.
Automatically reduces to a one-step generator for uni-modal actions.
Abstract
Diffusion-based imitation learning improves Behavioral Cloning (BC) on multi-modal decision-making, but comes at the cost of significantly slower inference due to the recursion in the diffusion process. It urges us to design efficient policy generators while keeping the ability to generate diverse actions. To address this challenge, we propose AdaFlow, an imitation learning framework based on flow-based generative modeling. AdaFlow represents the policy with state-conditioned ordinary differential equations (ODEs), which are known as probability flows. We reveal an intriguing connection between the conditional variance of their training loss and the discretization error of the ODEs. With this insight, we propose a variance-adaptive ODE solver that can adjust its step size in the inference stage, making AdaFlow an adaptive decision-maker, offering rapid inference without sacrificing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Human Motion and Animation · Reinforcement Learning in Robotics
MethodsDiffusion
