Representation Matters: Offline Pretraining for Sequential Decision   Making

Mengjiao Yang; Ofir Nachum

arXiv:2102.05815·cs.LG·February 12, 2021·22 cites

Representation Matters: Offline Pretraining for Sequential Decision Making

Mengjiao Yang, Ofir Nachum

PDF

Open Access 1 Video

TL;DR

This paper explores how offline unsupervised pretraining of state representations can significantly enhance the performance of various downstream sequential decision-making tasks, including RL, imitation learning, and offline policy optimization.

Contribution

It introduces the use of offline unsupervised learning objectives for state representation pretraining to improve downstream decision-making tasks, a novel approach in offline RL.

Findings

01

Pretraining with unsupervised objectives boosts downstream policy performance.

02

Reward prediction and representation type are critical components.

03

Pretraining benefits are consistent across different offline datasets.

Abstract

The recent success of supervised learning methods on ever larger offline datasets has spurred interest in the reinforcement learning (RL) field to investigate whether the same paradigms can be translated to RL algorithms. This research area, known as offline RL, has largely focused on offline policy optimization, aiming to find a return-maximizing policy exclusively from offline data. In this paper, we consider a slightly different approach to incorporating offline data into sequential decision-making. We aim to answer the question, what unsupervised objectives applied to offline datasets are able to learn state representations which elevate performance on downstream tasks, whether those downstream tasks be online RL, imitation learning from expert demonstrations, or even offline policy optimization based on the same offline dataset? Through a variety of experiments utilizing standard…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Representation Matters: Offline Pretraining for Sequential Decision Making· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics · Data Stream Mining Techniques · Machine Learning and Data Classification