Bayesian Optimization with Automatic Prior Selection for Data-Efficient   Direct Policy Search

R\'emi Pautrat; Konstantinos Chatzilygeroudis; Jean-Baptiste Mouret

arXiv:1709.06919·cs.RO·March 14, 2018

Bayesian Optimization with Automatic Prior Selection for Data-Efficient Direct Policy Search

R\'emi Pautrat, Konstantinos Chatzilygeroudis, Jean-Baptiste Mouret

PDF

2 Repos

TL;DR

This paper introduces MLEI, a new acquisition function for Bayesian optimization that automatically selects the most relevant prior, improving data-efficient policy learning for robots in transfer and damage scenarios.

Contribution

The paper proposes MLEI, a novel acquisition function that combines prior likelihood with expected improvement to automatically select the best prior in Bayesian optimization.

Findings

01

MLEI effectively identifies relevant priors in transfer learning tasks.

02

The method accelerates learning in robot damage and transfer scenarios.

03

Results demonstrate improved data efficiency over baseline methods.

Abstract

One of the most interesting features of Bayesian optimization for direct policy search is that it can leverage priors (e.g., from simulation or from previous tasks) to accelerate learning on a robot. In this paper, we are interested in situations for which several priors exist but we do not know in advance which one fits best the current situation. We tackle this problem by introducing a novel acquisition function, called Most Likely Expected Improvement (MLEI), that combines the likelihood of the priors and the expected improvement. We evaluate this new acquisition function on a transfer learning task for a 5-DOF planar arm and on a possibly damaged, 6-legged robot that has to learn to walk on flat ground and on stairs, with priors corresponding to different stairs and different kinds of damages. Our results show that MLEI effectively identifies and exploits the priors, even when there…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.