Evaluating Long-Term Memory in 3D Mazes

Jurgis Pasukonis; Timothy Lillicrap; Danijar Hafner

arXiv:2210.13383·cs.AI·October 25, 2022

Evaluating Long-Term Memory in 3D Mazes

Jurgis Pasukonis, Timothy Lillicrap, Danijar Hafner

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces the Memory Maze, a 3D environment designed to evaluate long-term memory in reinforcement learning agents, highlighting current algorithms' limitations and providing benchmarks for future improvements.

Contribution

The paper presents the Memory Maze environment, a new benchmark for assessing long-term memory in agents, along with datasets, evaluation methods, and baseline comparisons.

Findings

01

Current algorithms perform well on small mazes but struggle with large mazes.

02

Training with truncated backpropagation through time improves performance.

03

Humans outperform algorithms on complex maze navigation tasks.

Abstract

Intelligent agents need to remember salient information to reason in partially-observed environments. For example, agents with a first-person view should remember the positions of relevant objects even if they go out of view. Similarly, to effectively navigate through rooms agents need to remember the floor plan of how rooms are connected. However, most benchmark tasks in reinforcement learning do not test long-term memory in agents, slowing down progress in this important research direction. In this paper, we introduce the Memory Maze, a 3D domain of randomized mazes specifically designed for evaluating long-term memory in agents. Unlike existing benchmarks, Memory Maze measures long-term memory separate from confounding agent abilities and requires the agent to localize itself by integrating information over time. With Memory Maze, we propose an online reinforcement learning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jurgisp/memory-maze
noneOfficial

Videos

Evaluating Long-Term Memory in 3D Mazes· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics · Domain Adaptation and Few-Shot Learning · Data Stream Mining Techniques

MethodsTest