Can Active Sampling Reduce Causal Confusion in Offline Reinforcement   Learning?

Gunshi Gupta; Tim G. J. Rudner; Rowan Thomas McAllister; Adrien; Gaidon; Yarin Gal

arXiv:2312.17168·cs.LG·December 29, 2023·1 cites

Can Active Sampling Reduce Causal Confusion in Offline Reinforcement Learning?

Gunshi Gupta, Tim G. J. Rudner, Rowan Thomas McAllister, Adrien, Gaidon, Yarin Gal

PDF

Open Access 1 Repo

TL;DR

This paper explores how active sampling can mitigate causal confusion in offline reinforcement learning, leading to safer and more reliable policies by better identifying true causal relationships in data.

Contribution

It demonstrates that active sampling techniques can effectively reduce causal confusion in offline RL, outperforming uniform sampling in efficiency and accuracy.

Findings

01

Active sampling reduces causal confusion during training.

02

Active sampling is more efficient than uniform sampling.

03

Models trained with active sampling perform better in real-world deployment.

Abstract

Causal confusion is a phenomenon where an agent learns a policy that reflects imperfect spurious correlations in the data. Such a policy may falsely appear to be optimal during training if most of the training data contain such spurious correlations. This phenomenon is particularly pronounced in domains such as robotics, with potentially large gaps between the open- and closed-loop performance of an agent. In such settings, causally confused models may appear to perform well according to open-loop metrics during training but fail catastrophically when deployed in the real world. In this paper, we study causal confusion in offline reinforcement learning. We investigate whether selectively sampling appropriate points from a dataset of demonstrations may enable offline reinforcement learning agents to disambiguate the underlying causal mechanisms of the environment, alleviate causal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

gunshi/offline_active_rl
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Auction Theory and Applications · Machine Learning and Algorithms

MethodsSparse Evolutionary Training