Unsupervised Curricula for Visual Meta-Reinforcement Learning

Allan Jabri; Kyle Hsu; Ben Eysenbach; Abhishek Gupta; Sergey Levine,; Chelsea Finn

arXiv:1912.04226·cs.AI·December 10, 2019·26 cites

Unsupervised Curricula for Visual Meta-Reinforcement Learning

Allan Jabri, Kyle Hsu, Ben Eysenbach, Abhishek Gupta, Sergey Levine,, Chelsea Finn

PDF

Open Access

TL;DR

This paper introduces an unsupervised method for creating adaptive curricula in visual meta-reinforcement learning, enabling automatic task discovery and improved transfer to downstream tasks without manual task design.

Contribution

It develops an unsupervised algorithm that models interaction in visual environments to automatically generate training curricula for meta-RL, using a parametric density model and information maximization.

Findings

01

Enables unsupervised meta-learning that transfers to downstream tasks.

02

Supports trajectory-level task acquisition through discriminative clustering.

03

Serves as effective pre-training for supervised meta-learning.

Abstract

In principle, meta-reinforcement learning algorithms leverage experience across many tasks to learn fast reinforcement learning (RL) strategies that transfer to similar tasks. However, current meta-RL approaches rely on manually-defined distributions of training tasks, and hand-crafting these task distributions can be challenging and time-consuming. Can "useful" pre-training tasks be discovered in an unsupervised manner? We develop an unsupervised algorithm for inducing an adaptive meta-training task distribution, i.e. an automatic curriculum, by modeling unsupervised interaction in a visual environment. The task distribution is scaffolded by a parametric density model of the meta-learner's trajectory distribution. We formulate unsupervised meta-RL as information maximization between a latent task variable and the meta-learner's data distribution, and describe a practical instantiation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Advanced Vision and Imaging · Domain Adaptation and Few-Shot Learning

MethodsTest