Towards Interpretable Reinforcement Learning Using Attention Augmented   Agents

Alex Mott; Daniel Zoran; Mike Chrzanowski; Daan Wierstra; Danilo J.; Rezende

arXiv:1906.02500·cs.LG·June 7, 2019·26 cites

Towards Interpretable Reinforcement Learning Using Attention Augmented Agents

Alex Mott, Daniel Zoran, Mike Chrzanowski, Daan Wierstra, Danilo J., Rezende

PDF

Open Access 1 Repo

TL;DR

This paper introduces an attention-based reinforcement learning model that enhances interpretability by revealing the information used for decision-making, while maintaining competitive performance on ATARI games.

Contribution

It presents a novel soft attention mechanism for RL agents that improves interpretability without sacrificing performance.

Findings

01

Agents learn multiple strategies across games

02

The model queries space and content separately

03

Achieves state-of-the-art performance on ATARI

Abstract

Inspired by recent work in attention models for image captioning and question answering, we present a soft attention model for the reinforcement learning domain. This model uses a soft, top-down attention mechanism to create a bottleneck in the agent, forcing it to focus on task-relevant information by sequentially querying its view of the environment. The output of the attention mechanism allows direct observation of the information used by the agent to select its actions, enabling easier interpretation of this model than of traditional models. We analyze different strategies that the agents learn and show that a handful of strategies arise repeatedly across different games. We also show that the model learns to query separately about space and content (`where' vs. `what'). We demonstrate that an agent using this mechanism can achieve performance competitive with state-of-the-art…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cjlovering/Towards-Interpretable-Reinforcement-Learning-Using-Attention-Augmented-Agents-Replication
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Topic Modeling · Domain Adaptation and Few-Shot Learning