Imitation Learning with Limited Actions via Diffusion Planners and Deep   Koopman Controllers

Jianxin Bi; Kelvin Lim; Kaiqi Chen; Yifei Huang; and Harold Soh

arXiv:2410.07584·cs.RO·March 26, 2025

Imitation Learning with Limited Actions via Diffusion Planners and Deep Koopman Controllers

Jianxin Bi, Kelvin Lim, Kaiqi Chen, Yifei Huang, and Harold Soh

PDF

Open Access 1 Repo

TL;DR

This paper introduces a plan-then-control framework using Deep Koopman Operators to improve data efficiency in imitation learning, enabling robots to learn multi-modal behaviors with minimal action-labeled data.

Contribution

It presents a novel approach combining diffusion planners with deep Koopman controllers to reduce the need for extensive action-labeled demonstration data in robot imitation learning.

Findings

01

Significantly improves action-data efficiency in robot imitation learning.

02

Achieves high task success rates with limited demonstration data.

03

Effective on both simulated and real robot tasks.

Abstract

Recent advances in diffusion-based robot policies have demonstrated significant potential in imitating multi-modal behaviors. However, these approaches typically require large quantities of demonstration data paired with corresponding robot action labels, creating a substantial data collection burden. In this work, we propose a plan-then-control framework aimed at improving the action-data efficiency of inverse dynamics controllers by leveraging observational demonstration data. Specifically, we adopt a Deep Koopman Operator framework to model the dynamical system and utilize observation-only trajectories to learn a latent action representation. This latent representation can then be effectively mapped to real high-dimensional continuous actions using a linear action decoder, requiring minimal action-labeled data. Through experiments on simulated robot manipulation tasks and a real…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jxbi1010/koap
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Robot Manipulation and Learning