Reincarnating Reinforcement Learning: Reusing Prior Computation to   Accelerate Progress

Rishabh Agarwal; Max Schwarzer; Pablo Samuel Castro; Aaron Courville,; Marc G. Bellemare

arXiv:2206.01626·cs.LG·October 5, 2022·6 cites

Reincarnating Reinforcement Learning: Reusing Prior Computation to Accelerate Progress

Rishabh Agarwal, Max Schwarzer, Pablo Samuel Castro, Aaron Courville,, Marc G. Bellemare

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces reincarnating RL, a framework for reusing prior computational work like learned policies to accelerate RL development, demonstrated through transfer algorithms and real-world applications.

Contribution

It proposes a novel approach for transferring sub-optimal policies to value-based RL agents, addressing limitations of existing methods and enabling more efficient RL workflows.

Findings

01

Reincarnating RL outperforms tabula rasa RL on Atari, locomotion, and balloon navigation tasks.

02

The proposed transfer algorithm effectively reuses prior policies in diverse environments.

03

Reincarnating RL can reduce training time and resource requirements.

Abstract

Learning tabula rasa, that is without any prior knowledge, is the prevalent workflow in reinforcement learning (RL) research. However, RL systems, when applied to large-scale settings, rarely operate tabula rasa. Such large-scale systems undergo multiple design or algorithmic changes during their development cycle and use ad hoc approaches for incorporating these changes without re-training from scratch, which would have been prohibitively expensive. Additionally, the inefficiency of deep RL typically excludes researchers without access to industrial-scale resources from tackling computationally-demanding problems. To address these issues, we present reincarnating RL as an alternative workflow or class of problem settings, where prior computational work (e.g., learned policies) is reused or transferred between design iterations of an RL agent, or from one RL agent to another. As a step…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

google-research/reincarnating_rl
jaxOfficial

Videos

Reincarnating Reinforcement Learning: Reusing Prior Computation to Accelerate Progress· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics · Software Engineering Research · Machine Learning and Data Classification