Towards mental time travel: a hierarchical memory for reinforcement learning agents
Andrew Kyle Lampinen, Stephanie C.Y. Chan, Andrea Banino, Felix Hill

TL;DR
This paper introduces HCAM, a hierarchical memory architecture that enables reinforcement learning agents to recall detailed past events over long timescales, improving their performance and generalization in complex tasks.
Contribution
The paper proposes HCAM, a novel hierarchical chunk attention memory that enhances long-term recall and reasoning in reinforcement learning agents, surpassing existing memory architectures.
Findings
HCAM significantly outperforms other memory models in long-term recall tasks.
Agents with HCAM demonstrate improved generalization and sample efficiency.
HCAM enables zero-shot transfer and better handling of temporally extended environments.
Abstract
Reinforcement learning agents often forget details of the past, especially after delays or distractor tasks. Agents with common memory architectures struggle to recall and integrate across multiple timesteps of a past event, or even to recall the details of a single timestep that is followed by distractor tasks. To address these limitations, we propose a Hierarchical Chunk Attention Memory (HCAM), which helps agents to remember the past in detail. HCAM stores memories by dividing the past into chunks, and recalls by first performing high-level attention over coarse summaries of the chunks, and then performing detailed attention within only the most relevant chunks. An agent with HCAM can therefore "mentally time-travel" -- remember past events in detail without attending to all intervening events. We show that agents with HCAM substantially outperform agents with other memory…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Reinforcement Learning in Robotics · Topic Modeling
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Label Smoothing · Layer Normalization · Byte Pair Encoding · Residual Connection · Dropout
