Neural Map: Structured Memory for Deep Reinforcement Learning

Emilio Parisotto; Ruslan Salakhutdinov

arXiv:1702.08360·cs.LG·February 28, 2017·102 cites

Neural Map: Structured Memory for Deep Reinforcement Learning

Emilio Parisotto, Ruslan Salakhutdinov

PDF

Open Access 1 Repo

TL;DR

This paper introduces the Neural Map, a structured 2D memory architecture for deep reinforcement learning that enables agents to remember and reason over long-term spatial information in complex environments.

Contribution

The Neural Map provides a novel spatially structured memory system with an adaptable write operator, improving long-term memory capabilities in DRL agents for 3D environments.

Findings

01

Outperforms previous memory architectures on maze tasks

02

Capable of generalizing to unseen environments

03

Effectively stores long-term spatial information

Abstract

A critical component to enabling intelligent reasoning in partially observable environments is memory. Despite this importance, Deep Reinforcement Learning (DRL) agents have so far used relatively simple memory architectures, with the main methods to overcome partial observability being either a temporal convolution over the past k frames or an LSTM layer. More recent work (Oh et al., 2016) has went beyond these architectures by using memory networks which can allow more sophisticated addressing schemes over the past k frames. But even these architectures are unsatisfactory due to the reason that they are limited to only remembering information from the last k frames. In this paper, we develop a memory system with an adaptable write operator that is customized to the sorts of 3D environments that DRL agents typically interact with. This architecture, called the Neural Map, uses a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zuoxingdong/VIN_PyTorch_Visdom
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Multimodal Machine Learning Applications

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory