Integrating Episodic Memory into a Reinforcement Learning Agent using   Reservoir Sampling

Kenny J. Young; Richard S. Sutton; Shuo Yang

arXiv:1806.00540·cs.LG·June 5, 2018·6 cites

Integrating Episodic Memory into a Reinforcement Learning Agent using Reservoir Sampling

Kenny J. Young, Richard S. Sutton, Shuo Yang

PDF

Open Access

TL;DR

This paper introduces a novel reservoir sampling-based external memory mechanism for reinforcement learning agents, enabling efficient online learning and improved recall of useful past states for better decision-making.

Contribution

It presents a new algorithm that maintains a fixed-size external memory of past states using reservoir sampling, allowing online reinforcement learning to selectively remember useful information.

Findings

01

Enables online gradient computation for external memory updates.

02

Improves the agent's ability to recall relevant past states.

03

Supports efficient long-term dependency learning in reinforcement learning.

Abstract

Episodic memory is a psychology term which refers to the ability to recall specific events from the past. We suggest one advantage of this particular type of memory is the ability to easily assign credit to a specific state when remembered information is found to be useful. Inspired by this idea, and the increasing popularity of external memory mechanisms to handle long-term dependencies in deep learning systems, we propose a novel algorithm which uses a reservoir sampling procedure to maintain an external memory consisting of a fixed number of past states. The algorithm allows a deep reinforcement learning agent to learn online to preferentially remember those states which are found to be useful to recall later on. Critically this method allows for efficient online computation of gradient estimates with respect to the write process of the external memory. Thus unlike most prior…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Reservoir Computing · Reinforcement Learning in Robotics · Data Stream Mining Techniques