Generalization of Reinforcement Learners with Working and Episodic   Memory

Meire Fortunato; Melissa Tan; Ryan Faulkner; Steven Hansen; Adri\`a; Puigdom\`enech Badia; Gavin Buttimore; Charlie Deck; Joel Z Leibo; Charles; Blundell

arXiv:1910.13406·cs.LG·February 20, 2020·28 cites

Generalization of Reinforcement Learners with Working and Episodic Memory

Meire Fortunato, Melissa Tan, Ryan Faulkner, Steven Hansen, Adri\`a, Puigdom\`enech Badia, Gavin Buttimore, Charlie Deck, Joel Z Leibo, Charles, Blundell

PDF

Open Access 1 Repo

TL;DR

This paper develops a methodology to evaluate how different memory systems in reinforcement learning agents generalize to holdout data, using diverse tasks and ablation studies to analyze performance.

Contribution

It introduces a comprehensive testing framework for memory in reinforcement learning, including diverse tasks and ablation analysis of combined memory systems.

Findings

01

Memory systems improve generalization on specific tasks

02

Combined memory architectures outperform single systems

03

Evaluation methodology reveals strengths and limitations of memory types

Abstract

Memory is an important aspect of intelligence and plays a role in many deep reinforcement learning models. However, little progress has been made in understanding when specific memory systems help more than others and how well they generalize. The field also has yet to see a prevalent consistent and rigorous approach for evaluating agent performance on holdout data. In this paper, we aim to develop a comprehensive methodology to test different kinds of memory in an agent and assess how well the agent can apply what it learns in training to a holdout set that differs from the training set along dimensions that we suggest are relevant for evaluating memory-specific generalization. To that end, we first construct a diverse set of memory tasks that allow us to evaluate test-time generalization across multiple dimensions. Second, we develop and perform multiple ablations on an agent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

deepmind/dm_memorytasks
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications

MethodsTest