Memory Lens: How Much Memory Does an Agent Use?

Christoph Dann; Katja Hofmann; Sebastian Nowozin

arXiv:1611.06928·cs.AI·November 22, 2016·2 cites

Memory Lens: How Much Memory Does an Agent Use?

Christoph Dann, Katja Hofmann, Sebastian Nowozin

PDF

Open Access

TL;DR

This paper introduces a method to quantify the memory usage of reinforcement learning agents by estimating mutual information between past behaviors and current actions, providing insights into their internal memory mechanisms.

Contribution

It presents a novel, implementation-independent approach to measure the minimal memory capacity of RL policies through mutual information estimation, validated on Atari games.

Findings

01

Memory usage varies significantly across different Atari games.

02

The method provides a lower bound on the minimal memory capacity of policies.

03

Memory estimation reveals insights into agent behavior and policy complexity.

Abstract

We propose a new method to study the internal memory used by reinforcement learning policies. We estimate the amount of relevant past information by estimating mutual information between behavior histories and the current action of an agent. We perform this estimation in the passive setting, that is, we do not intervene but merely observe the natural behavior of the agent. Moreover, we provide a theoretical justification for our approach by showing that it yields an implementation-independent lower bound on the minimal memory capacity of any agent that implement the observed policy. We demonstrate our approach by estimating the use of memory of DQN policies on concatenated Atari frames, demonstrating sharply different use of memory across 49 games. The study of memory as information that flows from the past to the current action opens avenues to understand and improve successful…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications · Game Theory and Applications

MethodsQ-Learning · Dense Connections · Convolution · Deep Q-Network