Causal Confusion in Imitation Learning

Pim de Haan, Dinesh Jayaraman, and Sergey Levine

arXiv:1905.11979 · cs.LG · November 5, 2019 · 127 citations

TL;DR

This paper argues that causal reasoning matters in imitation learning: because of distributional shift, a policy that ignores causal structure can perform worse when given more information. It proposes targeted interventions to identify the correct causal model and so improve policy learning.

Contribution

The paper identifies the problem of causal misidentification in imitation learning and introduces targeted interventions (environment interaction or expert queries) to address it, improving policy robustness.

Findings

01 Causal misidentification occurs in benchmark and real-world domains.

02 Targeted interventions improve causal model identification.

03 Proposed method outperforms DAgger and baselines.

Abstract

Behavioral cloning reduces policy learning to supervised learning by training a discriminative model to predict expert actions given observations. Such discriminative models are non-causal: the training procedure is unaware of the causal structure of the interaction between the expert and the environment. We point out that ignoring causality is particularly damaging because of the distributional shift in imitation learning. In particular, it leads to a counter-intuitive "causal misidentification" phenomenon: access to more information can yield worse performance. We investigate how this problem arises, and propose a solution to combat it through targeted interventions (either environment interaction or expert queries) to determine the correct causal model. We show that causal misidentification occurs in several benchmark control domains as well as realistic driving settings, and…
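
To make the abstract's setup concrete, the following is a minimal sketch of behavioral cloning as plain supervised regression. It is not the authors' implementation. The toy data, the linear policy, and the names `fit_bc_policy`, `noisy_state`, and `prev_action` are all assumptions made for illustration. The expert's action depends only on the true state, but the observation also contains the expert's previous action, a non-causal input that happens to predict the next action well in the demonstrations.

```python
import numpy as np

def fit_bc_policy(observations, expert_actions, mask=None):
    """Fit a linear policy by least squares so that (obs * mask) @ w ~ action.

    `mask` is a hypothetical 0/1 vector used here only to hide inputs.
    """
    obs = np.asarray(observations, dtype=float)
    actions = np.asarray(expert_actions, dtype=float)
    if mask is not None:
        obs = obs * np.asarray(mask, dtype=float)  # zero out hidden inputs
    # lstsq returns the minimum-norm solution, so zeroed columns get weight 0.
    w, *_ = np.linalg.lstsq(obs, actions, rcond=None)
    return w

# Toy demonstrations (assumed): a slowly drifting state and an expert that
# always acts to cancel it.
rng = np.random.default_rng(0)
n_steps = 2000
state = np.cumsum(rng.normal(0.0, 0.1, size=n_steps))
expert_action = -state

# The learner sees a noisy state reading plus the expert's previous action.
noisy_state = state + rng.normal(0.0, 0.5, size=n_steps)
prev_action = np.concatenate([[0.0], expert_action[:-1]])
observations = np.column_stack([noisy_state, prev_action])

w_full = fit_bc_policy(observations, expert_action)
w_masked = fit_bc_policy(observations, expert_action, mask=[1.0, 0.0])

print("weights with previous action visible:", w_full)
print("weights with previous action hidden: ", w_masked)
```

In this toy data the fit with both inputs places most of its weight on the previous action, since that input tracks the expert's next action more closely than the noisy state reading does. At deployment the "previous action" would be the learner's own, so a policy that mostly copies it no longer responds to the state; this is one simplified way to picture how extra information can hurt under distributional shift. Hiding the input with a mask is used here only to show the contrast. The paper's intervention-based procedure for finding the correct causal model is not reproduced.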

Taxonomy

Topics: Robot Manipulation and Learning · Reinforcement Learning in Robotics · AI-based Problem Solving and Planning