Towards mental time travel: a hierarchical memory for reinforcement   learning agents

Andrew Kyle Lampinen; Stephanie C.Y. Chan; Andrea Banino; Felix Hill

arXiv:2105.14039·cs.LG·December 9, 2021·5 cites

Towards mental time travel: a hierarchical memory for reinforcement learning agents

Andrew Kyle Lampinen, Stephanie C.Y. Chan, Andrea Banino, Felix Hill

PDF

Open Access 5 Repos 1 Models 1 Video

TL;DR

This paper introduces HCAM, a hierarchical memory architecture that enables reinforcement learning agents to recall detailed past events over long timescales, improving their performance and generalization in complex tasks.

Contribution

The paper proposes HCAM, a novel hierarchical chunk attention memory that enhances long-term recall and reasoning in reinforcement learning agents, surpassing existing memory architectures.

Findings

01

HCAM significantly outperforms other memory models in long-term recall tasks.

02

Agents with HCAM demonstrate improved generalization and sample efficiency.

03

HCAM enables zero-shot transfer and better handling of temporally extended environments.

Abstract

Reinforcement learning agents often forget details of the past, especially after delays or distractor tasks. Agents with common memory architectures struggle to recall and integrate across multiple timesteps of a past event, or even to recall the details of a single timestep that is followed by distractor tasks. To address these limitations, we propose a Hierarchical Chunk Attention Memory (HCAM), which helps agents to remember the past in detail. HCAM stores memories by dividing the past into chunks, and recalls by first performing high-level attention over coarse summaries of the chunks, and then performing detailed attention within only the most relevant chunks. An agent with HCAM can therefore "mentally time-travel" -- remember past events in detail without attending to all intervening events. We show that agents with HCAM substantially outperform agents with other memory…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

🤗
LilHairdy/cleanrl_memory_gym
model

Videos

Towards mental time travel: a hierarchical memory for reinforcement learning agents· slideslive

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Reinforcement Learning in Robotics · Topic Modeling

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Label Smoothing · Layer Normalization · Byte Pair Encoding · Residual Connection · Dropout