Attention or memory? Neurointerpretable agents in space and time

Lennart Bramlage; Aurelio Cortese

arXiv:2007.04862 · cs.AI · July 14, 2020 · 1 citation

Open Access

TL;DR

This paper demonstrates that self-attention mechanisms in deep reinforcement learning agents improve robustness to noise, enable working-memory functions, and enhance interpretability of decision features in dynamic environments.

Contribution

It introduces a self-attention-based deep RL model that is more robust to noise than benchmark models, supports working memory, and makes feature selection and composition interpretable.

Findings

01 Self-attention increases robustness to irrelevant features.

02 The model can implement a transient working memory.

03 Semantic observations reveal feature selection and composition.
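
To make the first finding concrete, the following is a minimal sketch of single-head scaled dot-product self-attention applied to a small set of feature vectors padded with low-magnitude distractors. It is not the authors' model: the identity query/key/value projections, the function names (`self_attention`, `add_distractors`), the feature dimensions, and the Gaussian noise scale are all assumptions chosen for illustration.

```python
import math
import random


def softmax(xs):
    # Numerically stable softmax over a list of floats.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def self_attention(tokens):
    """Single-head scaled dot-product self-attention.

    Assumption: queries, keys, and values are the tokens themselves
    (identity projections); a trained agent would learn these maps.
    Returns (outputs, weights), where weights[i][j] is how much
    token i attends to token j.
    """
    d = len(tokens[0])
    scale = math.sqrt(d)
    weights = [softmax([dot(q, k) / scale for k in tokens]) for q in tokens]
    outputs = [
        [sum(w * v[j] for w, v in zip(row, tokens)) for j in range(d)]
        for row in weights
    ]
    return outputs, weights


def add_distractors(tokens, n_noise, rng, scale=0.1):
    # Hypothetical helper: append task-irrelevant Gaussian feature vectors.
    d = len(tokens[0])
    noise = [[rng.gauss(0.0, scale) for _ in range(d)] for _ in range(n_noise)]
    return tokens + noise


rng = random.Random(0)
relevant = [[2.0, 0.0, 0.0, 0.0], [0.0, 2.0, 0.0, 0.0]]  # toy "task" features
noisy = add_distractors(relevant, n_noise=6, rng=rng)
outputs, weights = self_attention(noisy)
```

In this toy setup each relevant token assigns its largest attention weight to itself, because its self-similarity score far exceeds its similarity to any distractor. Whether a learned mechanism behaves this way on Atari observations is an empirical question that the paper addresses, not something this sketch shows.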

Abstract

In neuroscience, attention has been shown to bidirectionally interact with reinforcement learning (RL) processes. This interaction is thought to support dimensionality reduction of task representations, restricting computations to relevant features. However, it remains unclear whether these properties can translate into real algorithmic advantages for artificial agents, especially in dynamic environments. We design a model incorporating a self-attention mechanism that implements task-state representations in semantic feature-space, and test it on a battery of Atari games. To evaluate the agent's selective properties, we add a large volume of task-irrelevant features to observations. In line with neuroscience predictions, self-attention leads to increased robustness to noise compared to benchmark models. Strikingly, this self-attention mechanism is general enough, such that it can be…

Taxonomy

Topics: Explainable Artificial Intelligence (XAI) · Reinforcement Learning in Robotics · Neural dynamics and brain function

Methods: Interpretability