Improving Robustness to Out-of-Distribution States in Imitation Learning via Deep Koopman-Boosted Diffusion Policy

Dianye Huang; Nassir Navab; Zhongliang Jiang

arXiv:2511.00555·cs.RO·November 11, 2025

Improving Robustness to Out-of-Distribution States in Imitation Learning via Deep Koopman-Boosted Diffusion Policy

Dianye Huang, Nassir Navab, Zhongliang Jiang

PDF

Open Access

TL;DR

This paper introduces D3P, a dual-branch diffusion policy with Koopman dynamics for improved out-of-distribution robustness in imitation learning, enabling better task recovery and higher success rates in robotic manipulation.

Contribution

The paper proposes a novel dual-branch architecture combined with Koopman operators to enhance temporal modeling and robustness in diffusion-based imitation learning policies.

Findings

01

D3P outperforms state-of-the-art diffusion policies by 14.6% in simulation.

02

Achieves 15.0% improvement on real-world robotic tasks.

03

Effective recovery from intermediate goal failures through dynamic action chunk switching.

Abstract

Integrating generative models with action chunking has shown significant promise in imitation learning for robotic manipulation. However, the existing diffusion-based paradigm often struggles to capture strong temporal dependencies across multiple steps, particularly when incorporating proprioceptive input. This limitation can lead to task failures, where the policy overfits to proprioceptive cues at the expense of capturing the visually derived features of the task. To overcome this challenge, we propose the Deep Koopman-boosted Dual-branch Diffusion Policy (D3P) algorithm. D3P introduces a dual-branch architecture to decouple the roles of different sensory modality combinations. The visual branch encodes the visual observations to indicate task progression, while the fused branch integrates both visual and proprioceptive inputs for precise manipulation. Within this architecture, when…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Generative Adversarial Networks and Image Synthesis · Reinforcement Learning in Robotics