Retrieval-Augmented Reinforcement Learning
Anirudh Goyal, Abram L. Friesen, Andrea Banino, Theophane Weber, Nan, Rosemary Ke, Adria Puigdomenech Badia, Arthur Guez, Mehdi Mirza, Peter C., Humphreys, Ksenia Konyushkova, Laurent Sifre, Michal Valko, Simon Osindero,, Timothy Lillicrap, Nicolas Heess, Charles Blundell

TL;DR
This paper introduces a retrieval-augmented reinforcement learning framework that enhances learning efficiency by accessing a dataset of past experiences, enabling faster and more effective policy learning across tasks.
Contribution
It proposes a novel retrieval-based approach integrated into RL agents, allowing direct access to experience datasets to improve learning speed and task performance.
Findings
Retrieval-augmented DQN avoids task interference and learns faster.
Retrieval-augmented R2D2 outperforms baseline in Atari, achieving higher scores.
Extensive ablations confirm the effectiveness of the retrieval components.
Abstract
Most deep reinforcement learning (RL) algorithms distill experience into parametric behavior policies or value functions via gradient updates. While effective, this approach has several disadvantages: (1) it is computationally expensive, (2) it can take many updates to integrate experiences into the parametric model, (3) experiences that are not fully integrated do not appropriately influence the agent's behavior, and (4) behavior is limited by the capacity of the model. In this paper we explore an alternative paradigm in which we train a network to map a dataset of past experiences to optimal behavior. Specifically, we augment an RL agent with a retrieval process (parameterized as a neural network) that has direct access to a dataset of experiences. This dataset can come from the agent's past experiences, expert demonstrations, or any other relevant source. The retrieval process is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Data Stream Mining Techniques
MethodsRecurrent Replay Distributed DQN · Q-Learning · Convolution · Dense Connections · Deep Q-Network
