Measuring Interventional Robustness in Reinforcement Learning

Katherine Avery; Jack Kenney; Pracheta Amaranath; Erica Cai; David; Jensen

arXiv:2209.09058·cs.LG·September 20, 2022

Measuring Interventional Robustness in Reinforcement Learning

Katherine Avery, Jack Kenney, Pracheta Amaranath, Erica Cai, David, Jensen

PDF

Open Access 1 Repo

TL;DR

This paper introduces a new measure called interventional robustness (IR) to evaluate how consistent reinforcement learning policies are under different incidental training conditions, revealing that high performance does not always mean high IR.

Contribution

The paper defines and quantifies interventional robustness in reinforcement learning, providing a new way to assess policy stability across various training interventions.

Findings

01

IR varies with training amount and algorithm type

02

High performance does not necessarily imply high IR

03

IR measurement applied to eight algorithms in Atari environments

Abstract

Recent work in reinforcement learning has focused on several characteristics of learned policies that go beyond maximizing reward. These properties include fairness, explainability, generalization, and robustness. In this paper, we define interventional robustness (IR), a measure of how much variability is introduced into learned policies by incidental aspects of the training procedure, such as the order of training data or the particular exploratory actions taken by agents. A training procedure has high IR when the agents it produces take very similar actions under intervention, despite variation in these incidental aspects of the training procedure. We develop an intuitive, quantitative measure of IR and calculate it for eight algorithms in three Atari environments across dozens of interventions and states. From these experiments, we find that IR varies with the amount of training and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kdl-umass/xai-interventional-robustness
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning