Self-Supervised Reinforcement Learning that Transfers using Random   Features

Boyuan Chen; Chuning Zhu; Pulkit Agrawal; Kaiqing Zhang; Abhishek; Gupta

arXiv:2305.17250·cs.LG·May 30, 2023·1 cites

Self-Supervised Reinforcement Learning that Transfers using Random Features

Boyuan Chen, Chuning Zhu, Pulkit Agrawal, Kaiqing Zhang, Abhishek, Gupta

PDF

Open Access 1 Video

TL;DR

This paper introduces a self-supervised reinforcement learning approach that leverages random features for reward modeling, enabling transfer across tasks without explicit reward labels and facilitating rapid adaptation in complex environments.

Contribution

It proposes a novel self-supervised pre-training method for model-free RL using random features, allowing implicit environment modeling and efficient transfer to new tasks.

Findings

01

Enables transfer across manipulation and locomotion tasks in simulation.

02

Allows fast adaptation to new reward functions without additional training.

03

Operates effectively with offline datasets and no reward labels.

Abstract

Model-free reinforcement learning algorithms have exhibited great potential in solving single-task sequential decision-making problems with high-dimensional observations and long horizons, but are known to be hard to generalize across tasks. Model-based RL, on the other hand, learns task-agnostic models of the world that naturally enables transfer across different reward functions, but struggles to scale to complex environments due to the compounding error. To get the best of both worlds, we propose a self-supervised reinforcement learning method that enables the transfer of behaviors across tasks with different rewards, while circumventing the challenges of model-based RL. In particular, we show self-supervised pre-training of model-free reinforcement learning with a number of random features as rewards allows implicit modeling of long-horizon environment dynamics. Then, planning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Self-Supervised Reinforcement Learning that Transfers using Random Features· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics