GalilAI: Out-of-Task Distribution Detection using Causal Active Experimentation for Safe Transfer RL

Sumedh A Sontakke; Stephen Iota; Zizhao Hu; Arash Mehrjou; Laurent Itti; Bernhard Schölkopf

arXiv:2110.15489·cs.LG·November 1, 2021

Open Access

TL;DR

GalilAI introduces a causal active experimentation approach for out-of-task distribution detection in reinforcement learning, enabling agents to identify environment shifts through active testing, thus improving robustness and safety.

Contribution

The paper proposes a novel causal framework and an active experimentation method, GalilAI, for out-of-task distribution detection in RL, addressing a key gap in safe transfer learning.

Findings

01. GalilAI outperforms the baseline in OOTD detection accuracy.

02. Active experimentation improves environment shift detection.

03. The causal framework guides effective exploration for OOD detection.

Abstract

Out-of-distribution (OOD) detection is a well-studied topic in supervised learning. Extending the successes of supervised learning methods to the reinforcement learning (RL) setting, however, is difficult because of the data-generating process: RL agents actively query their environment for data, and the data are a function of the policy followed by the agent. An agent could thus miss a shift in the environment if its policy did not lead it to explore the aspect of the environment that shifted. Therefore, to achieve safe and robust generalization in RL, there is an unmet need for OOD detection through active experimentation. Here, we attempt to fill this gap by first defining a causal framework for OOD scenarios or environments encountered by RL agents in the wild. Then, we propose a novel task: that of Out-of-Task Distribution (OOTD) detection. We introduce an RL agent that…
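
The abstract's central observation, that an agent can miss an environment shift its policy never probes, can be illustrated with a toy sketch. This is not the paper's method or code; `ToyEnv`, `detect_shift`, the two causal factors (`mass`, `friction`), and the actions `lift` and `slide` are all hypothetical names chosen for illustration, and the deterministic outcomes are an assumption made to keep the example small.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ToyEnv:
    """Hypothetical environment with two hidden causal factors."""
    mass: float
    friction: float

    def step(self, action: str) -> float:
        # Assumption: each action's outcome depends on exactly one factor.
        if action == "lift":
            return 1.0 / self.mass        # outcome depends only on mass
        if action == "slide":
            return 1.0 - self.friction    # outcome depends only on friction
        raise ValueError(f"unknown action: {action}")


def detect_shift(train_env: ToyEnv, test_env: ToyEnv, actions, tol: float = 1e-6) -> bool:
    """Flag a shift if any probed action yields a different outcome."""
    return any(
        abs(train_env.step(a) - test_env.step(a)) > tol
        for a in actions
    )


train = ToyEnv(mass=1.0, friction=0.2)
shifted = ToyEnv(mass=1.0, friction=0.6)  # only friction has changed

# A task policy that only ever lifts never observes the friction change.
passive_detects = detect_shift(train, shifted, ["lift"])

# Probing each causal factor in turn exposes the shift.
active_detects = detect_shift(train, shifted, ["lift", "slide"])

print(passive_detects, active_detects)
```

In this sketch the passive probe reports no shift while the active one does, which mirrors the motivation for detection through active experimentation. A realistic setting would involve stochastic dynamics and learned behaviors rather than a fixed list of hand-picked actions.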

Taxonomy

Topics: Adversarial Robustness in Machine Learning · Reinforcement Learning in Robotics · Machine Learning and Algorithms

Methods: Test