Imitation Learning with Limited Actions via Diffusion Planners and Deep Koopman Controllers
Jianxin Bi, Kelvin Lim, Kaiqi Chen, Yifei Huang, and Harold Soh

TL;DR
This paper introduces a plan-then-control framework using Deep Koopman Operators to improve data efficiency in imitation learning, enabling robots to learn multi-modal behaviors with minimal action-labeled data.
Contribution
It presents a novel approach combining diffusion planners with deep Koopman controllers to reduce the need for extensive action-labeled demonstration data in robot imitation learning.
Findings
Significantly improves action-data efficiency in robot imitation learning.
Achieves high task success rates with limited demonstration data.
Effective on both simulated and real robot tasks.
Abstract
Recent advances in diffusion-based robot policies have demonstrated significant potential in imitating multi-modal behaviors. However, these approaches typically require large quantities of demonstration data paired with corresponding robot action labels, creating a substantial data collection burden. In this work, we propose a plan-then-control framework aimed at improving the action-data efficiency of inverse dynamics controllers by leveraging observational demonstration data. Specifically, we adopt a Deep Koopman Operator framework to model the dynamical system and utilize observation-only trajectories to learn a latent action representation. This latent representation can then be effectively mapped to real high-dimensional continuous actions using a linear action decoder, requiring minimal action-labeled data. Through experiments on simulated robot manipulation tasks and a real…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Robot Manipulation and Learning
