Causal Confusion in Imitation Learning
Pim de Haan, Dinesh Jayaraman, and Sergey Levine

TL;DR
This paper highlights the importance of causal reasoning in imitation learning, demonstrating that ignoring causality can cause performance issues, and proposes interventions to correctly identify causal models for improved policy learning.
Contribution
It reveals the problem of causal misidentification in imitation learning and introduces targeted interventions to address it, improving policy robustness.
Findings
Causal misidentification occurs in benchmark and real-world domains.
Targeted interventions improve causal model identification.
Proposed method outperforms DAgger and baselines.
Abstract
Behavioral cloning reduces policy learning to supervised learning by training a discriminative model to predict expert actions given observations. Such discriminative models are non-causal: the training procedure is unaware of the causal structure of the interaction between the expert and the environment. We point out that ignoring causality is particularly damaging because of the distributional shift in imitation learning. In particular, it leads to a counter-intuitive "causal misidentification" phenomenon: access to more information can yield worse performance. We investigate how this problem arises, and propose a solution to combat it through targeted interventions---either environment interaction or expert queries---to determine the correct causal model. We show that causal misidentification occurs in several benchmark control domains as well as realistic driving settings, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · AI-based Problem Solving and Planning
