Meta Reinforcement Learning with Latent Variable Gaussian Processes

Steind\'or S{\ae}mundsson; Katja Hofmann; Marc Peter Deisenroth

arXiv:1803.07551·stat.ML·July 10, 2018·105 cites

Meta Reinforcement Learning with Latent Variable Gaussian Processes

Steind\'or S{\ae}mundsson, Katja Hofmann, Marc Peter Deisenroth

PDF

Open Access

TL;DR

This paper introduces a hierarchical latent variable model for meta reinforcement learning that automatically infers task relationships from data, enabling efficient generalization to new tasks with minimal data and reducing interaction time by up to 60%.

Contribution

It presents a novel latent variable framework for meta learning in reinforcement learning that eliminates the need for human-defined task relationships.

Findings

01

Achieves up to 60% reduction in interaction time for new tasks.

02

Effectively generalizes to unseen tasks using minimal data.

03

Automatically infers task relationships without human input.

Abstract

Learning from small data sets is critical in many practical applications where data collection is time consuming or expensive, e.g., robotics, animal experiments or drug design. Meta learning is one way to increase the data efficiency of learning algorithms by generalizing learned concepts from a set of training tasks to unseen, but related, tasks. Often, this relationship between tasks is hard coded or relies in some other way on human expertise. In this paper, we frame meta learning as a hierarchical latent variable model and infer the relationship between tasks automatically from data. We apply our framework in a model-based reinforcement learning setting and show that our meta-learning model effectively generalizes to novel tasks by identifying how new tasks relate to prior ones from minimal data. This results in up to a 60% reduction in the average interaction time needed to solve…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaussian Processes and Bayesian Inference · Advanced Multi-Objective Optimization Algorithms · Control Systems and Identification