Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning
Daniel Guo, Bernardo Avila Pires, Bilal Piot, Jean-bastien Grill,, Florent Altch\'e, R\'emi Munos, Mohammad Gheshlaghi Azar

TL;DR
This paper introduces PBL, a self-supervised representation learning method for multitask deep reinforcement learning that predicts future latent embeddings to improve agent performance across various environments.
Contribution
The paper presents PBL, a novel latent-predictive representation learning algorithm that enhances multitask RL by capturing environment dynamics through bootstrap predictions in latent space.
Findings
PBL outperforms state-of-the-art RL agents on DMLab-30 and Atari-57 benchmarks.
PBL effectively handles multimodal observations including images and language.
The method improves learning efficiency and task performance in complex environments.
Abstract
Learning a good representation is an essential component for deep reinforcement learning (RL). Representation learning is especially important in multitask and partially observable settings where building a representation of the unknown environment is crucial to solve the tasks. Here we introduce Prediction of Bootstrap Latents (PBL), a simple and flexible self-supervised representation learning algorithm for multitask deep RL. PBL builds on multistep predictive representations of future observations, and focuses on capturing structured information about environment dynamics. Specifically, PBL trains its representation by predicting latent embeddings of future observations. These latent embeddings are themselves trained to be predictive of the aforementioned representations. These predictions form a bootstrapping effect, allowing the agent to learn more about the key aspects of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsReinforcement Learning in Robotics · Data Stream Mining Techniques · Evolutionary Algorithms and Applications
