Evaluating Long-Term Memory in 3D Mazes
Jurgis Pasukonis, Timothy Lillicrap, Danijar Hafner

TL;DR
This paper introduces the Memory Maze, a 3D environment designed to evaluate long-term memory in reinforcement learning agents, highlighting current algorithms' limitations and providing benchmarks for future improvements.
Contribution
The paper presents the Memory Maze environment, a new benchmark for assessing long-term memory in agents, along with datasets, evaluation methods, and baseline comparisons.
Findings
Current algorithms perform well on small mazes but struggle with large mazes.
Training with truncated backpropagation through time improves performance.
Humans outperform algorithms on complex maze navigation tasks.
Abstract
Intelligent agents need to remember salient information to reason in partially-observed environments. For example, agents with a first-person view should remember the positions of relevant objects even if they go out of view. Similarly, to effectively navigate through rooms agents need to remember the floor plan of how rooms are connected. However, most benchmark tasks in reinforcement learning do not test long-term memory in agents, slowing down progress in this important research direction. In this paper, we introduce the Memory Maze, a 3D domain of randomized mazes specifically designed for evaluating long-term memory in agents. Unlike existing benchmarks, Memory Maze measures long-term memory separate from confounding agent abilities and requires the agent to localize itself by integrating information over time. With Memory Maze, we propose an online reinforcement learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsReinforcement Learning in Robotics · Domain Adaptation and Few-Shot Learning · Data Stream Mining Techniques
MethodsTest
