Improving Robustness to Out-of-Distribution States in Imitation Learning via Deep Koopman-Boosted Diffusion Policy
Dianye Huang, Nassir Navab, Zhongliang Jiang

TL;DR
This paper introduces D3P, a dual-branch diffusion policy with Koopman dynamics for improved out-of-distribution robustness in imitation learning, enabling better task recovery and higher success rates in robotic manipulation.
Contribution
The paper proposes a novel dual-branch architecture combined with Koopman operators to enhance temporal modeling and robustness in diffusion-based imitation learning policies.
Findings
D3P outperforms state-of-the-art diffusion policies by 14.6% in simulation.
Achieves 15.0% improvement on real-world robotic tasks.
Effective recovery from intermediate goal failures through dynamic action chunk switching.
Abstract
Integrating generative models with action chunking has shown significant promise in imitation learning for robotic manipulation. However, the existing diffusion-based paradigm often struggles to capture strong temporal dependencies across multiple steps, particularly when incorporating proprioceptive input. This limitation can lead to task failures, where the policy overfits to proprioceptive cues at the expense of capturing the visually derived features of the task. To overcome this challenge, we propose the Deep Koopman-boosted Dual-branch Diffusion Policy (D3P) algorithm. D3P introduces a dual-branch architecture to decouple the roles of different sensory modality combinations. The visual branch encodes the visual observations to indicate task progression, while the fused branch integrates both visual and proprioceptive inputs for precise manipulation. Within this architecture, when…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Generative Adversarial Networks and Image Synthesis · Reinforcement Learning in Robotics
