Improving Generalization Ability of Robotic Imitation Learning by Resolving Causal Confusion in Observations
Yifei Chen, Yuzhe Zhang, Giovanni D'urso, Nicholas Lawrance, Brendan Tidd

TL;DR
This paper introduces a causal structure learning framework to improve the generalization of robotic imitation learning, effectively handling domain shifts by disentangling relevant observation features from irrelevant ones.
Contribution
It proposes a simple, easily integrable causal structure learning method that enhances generalization in complex imitation learning tasks without requiring disentangled feature representations.
Findings
Significantly reduces generalization errors in robotic imitation learning
Successfully applied to Mujoco simulation with bimanual robot arms
Improves robustness against domain shifts in imitation tasks
Abstract
Recent developments in imitation learning have considerably advanced robotic manipulation. However, current techniques in imitation learning can suffer from poor generalization, limiting performance even under relatively minor domain shifts. In this work, we aim to enhance the generalization capabilities of complex imitation learning algorithms to handle unpredictable changes from the training environments to deployment environments. To avoid confusion caused by observations that are not relevant to the target task, we propose to explicitly learn the causal relationship between observation components and expert actions, employing a framework similar to [6], where a causal structural function is learned by intervention on the imitation learning policy. Disentangling the feature representation from image input as in [6] is hard to satisfy in complex imitation learning process in robotic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
