Visualizing and Understanding Atari Agents

Sam Greydanus; Anurag Koul; Jonathan Dodge; Alan Fern

arXiv:1711.00138·cs.AI·September 12, 2018·91 cites

Visualizing and Understanding Atari Agents

Sam Greydanus, Anurag Koul, Jonathan Dodge, Alan Fern

PDF

Open Access 3 Repos

TL;DR

This paper uses saliency maps to interpret Atari reinforcement learning agents, revealing their decision-making strategies, attention focus, and learning evolution, thereby enhancing understanding of deep RL behaviors.

Contribution

Introduces a novel saliency map method for analyzing Atari agents, enabling insights into their attention, decision reasons, and learning dynamics.

Findings

01

Saliency maps reveal what agents attend to during decision-making

02

Agents sometimes make decisions for the wrong reasons

03

Saliency visualization improves human understanding of RL agents

Abstract

While deep reinforcement learning (deep RL) agents are effective at maximizing rewards, it is often unclear what strategies they use to do so. In this paper, we take a step toward explaining deep RL agents through a case study using Atari 2600 environments. In particular, we focus on using saliency maps to understand how an agent learns and executes a policy. We introduce a method for generating useful saliency maps and use it to show 1) what strong agents attend to, 2) whether agents are making decisions for the right or wrong reasons, and 3) how agents evolve during learning. We also test our method on non-expert human subjects and find that it improves their ability to reason about these agents. Overall, our results show that saliency information can provide significant insight into an RL agent's decisions and learning behavior.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Reinforcement Learning in Robotics · Ethics and Social Impacts of AI