Improving Generalization Ability of Robotic Imitation Learning by Resolving Causal Confusion in Observations

Yifei Chen; Yuzhe Zhang; Giovanni D'urso; Nicholas Lawrance; Brendan Tidd

arXiv:2507.22380·cs.RO·July 31, 2025

Improving Generalization Ability of Robotic Imitation Learning by Resolving Causal Confusion in Observations

Yifei Chen, Yuzhe Zhang, Giovanni D'urso, Nicholas Lawrance, Brendan Tidd

PDF

TL;DR

This paper introduces a causal structure learning framework to improve the generalization of robotic imitation learning, effectively handling domain shifts by disentangling relevant observation features from irrelevant ones.

Contribution

It proposes a simple, easily integrable causal structure learning method that enhances generalization in complex imitation learning tasks without requiring disentangled feature representations.

Findings

01

Significantly reduces generalization errors in robotic imitation learning

02

Successfully applied to Mujoco simulation with bimanual robot arms

03

Improves robustness against domain shifts in imitation tasks

Abstract

Recent developments in imitation learning have considerably advanced robotic manipulation. However, current techniques in imitation learning can suffer from poor generalization, limiting performance even under relatively minor domain shifts. In this work, we aim to enhance the generalization capabilities of complex imitation learning algorithms to handle unpredictable changes from the training environments to deployment environments. To avoid confusion caused by observations that are not relevant to the target task, we propose to explicitly learn the causal relationship between observation components and expert actions, employing a framework similar to [6], where a causal structural function is learned by intervention on the imitation learning policy. Disentangling the feature representation from image input as in [6] is hard to satisfy in complex imitation learning process in robotic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.