Retrieval-Augmented Reinforcement Learning

Anirudh Goyal; Abram L. Friesen; Andrea Banino; Theophane Weber; Nan; Rosemary Ke; Adria Puigdomenech Badia; Arthur Guez; Mehdi Mirza; Peter C.; Humphreys; Ksenia Konyushkova; Laurent Sifre; Michal Valko; Simon Osindero,; Timothy Lillicrap; Nicolas Heess; Charles Blundell

arXiv:2202.08417·cs.LG·May 25, 2022·6 cites

Retrieval-Augmented Reinforcement Learning

Anirudh Goyal, Abram L. Friesen, Andrea Banino, Theophane Weber, Nan, Rosemary Ke, Adria Puigdomenech Badia, Arthur Guez, Mehdi Mirza, Peter C., Humphreys, Ksenia Konyushkova, Laurent Sifre, Michal Valko, Simon Osindero,, Timothy Lillicrap, Nicolas Heess, Charles Blundell

PDF

Open Access 1 Datasets

TL;DR

This paper introduces a retrieval-augmented reinforcement learning framework that enhances learning efficiency by accessing a dataset of past experiences, enabling faster and more effective policy learning across tasks.

Contribution

It proposes a novel retrieval-based approach integrated into RL agents, allowing direct access to experience datasets to improve learning speed and task performance.

Findings

01

Retrieval-augmented DQN avoids task interference and learns faster.

02

Retrieval-augmented R2D2 outperforms baseline in Atari, achieving higher scores.

03

Extensive ablations confirm the effectiveness of the retrieval components.

Abstract

Most deep reinforcement learning (RL) algorithms distill experience into parametric behavior policies or value functions via gradient updates. While effective, this approach has several disadvantages: (1) it is computationally expensive, (2) it can take many updates to integrate experiences into the parametric model, (3) experiences that are not fully integrated do not appropriately influence the agent's behavior, and (4) behavior is limited by the capacity of the model. In this paper we explore an alternative paradigm in which we train a network to map a dataset of past experiences to optimal behavior. Specifically, we augment an RL agent with a retrieval process (parameterized as a neural network) that has direct access to a dataset of experiences. This dataset can come from the agent's past experiences, expert demonstrations, or any other relevant source. The retrieval process is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

misovalko/my-research-papers
dataset· 21 dl
21 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Data Stream Mining Techniques

MethodsRecurrent Replay Distributed DQN · Q-Learning · Convolution · Dense Connections · Deep Q-Network