Global and Local Analysis of Interestingness for Competency-Aware Deep   Reinforcement Learning

Pedro Sequeira; Jesse Hostetler; Melinda Gervasio

arXiv:2211.06376·cs.AI·November 14, 2022

Global and Local Analysis of Interestingness for Competency-Aware Deep Reinforcement Learning

Pedro Sequeira, Jesse Hostetler, Melinda Gervasio

PDF

Open Access 1 Repo

TL;DR

This paper introduces a comprehensive framework for analyzing and interpreting the competence of reinforcement learning agents through interestingness measures, enabling better understanding of their capabilities and limitations.

Contribution

It extends explainable RL with new interestingness-based analysis tools, including behavior clustering and task element attribution using SHAP values, applicable across various RL algorithms.

Findings

01

Provides measures of RL agent competence based on interestingness analysis.

02

Introduces clustering of behavior traces to identify competency patterns.

03

Uses SHAP values for global and local analysis of task elements influencing behavior.

Abstract

In recent years, advances in deep learning have resulted in a plethora of successes in the use of reinforcement learning (RL) to solve complex sequential decision tasks with high-dimensional inputs. However, existing systems lack the necessary mechanisms to provide humans with a holistic view of their competence, presenting an impediment to their adoption, particularly in critical applications where the decisions an agent makes can have significant consequences. Yet, existing RL-based systems are essentially competency-unaware in that they lack the necessary interpretation mechanisms to allow human operators to have an insightful, holistic view of their competency. In this paper, we extend a recently-proposed framework for explainable RL that is based on analyses of "interestingness." Our new framework provides various measures of RL agent competence stemming from interestingness…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sri-aic/22-llaama-ixdrl-data
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Reinforcement Learning in Robotics

MethodsShapley Additive Explanations