Cautious Bayesian Optimization for Efficient and Scalable Policy Search

Lukas P. Fr\"ohlich; Melanie N. Zeilinger; Edgar D. Klenske

arXiv:2011.09445·cs.RO·November 19, 2020·5 cites

Cautious Bayesian Optimization for Efficient and Scalable Policy Search

Lukas P. Fr\"ohlich, Melanie N. Zeilinger, Edgar D. Klenske

PDF

Open Access 1 Repo

TL;DR

This paper introduces a cautious Bayesian Optimization method that constrains the search space using the surrogate model's uncertainty, enabling efficient policy search in high-dimensional spaces and reducing system damage risk.

Contribution

The paper proposes a novel constraint on Bayesian Optimization based on the surrogate model's uncertainty, improving scalability and safety in policy search.

Findings

01

Effective in high-dimensional spaces (>100 dimensions)

02

Reduces risk of damaging the system during optimization

03

Demonstrates success on diverse tasks including motor skills and sim-to-real

Abstract

Sample efficiency is one of the key factors when applying policy search to real-world problems. In recent years, Bayesian Optimization (BO) has become prominent in the field of robotics due to its sample efficiency and little prior knowledge needed. However, one drawback of BO is its poor performance on high-dimensional search spaces as it focuses on global search. In the policy search setting, local optimization is typically sufficient as initial policies are often available, e.g., via meta-learning, kinesthetic demonstrations or sim-to-real approaches. In this paper, we propose to constrain the policy search space to a sublevel-set of the Bayesian surrogate model's predictive uncertainty. This simple yet effective way of constraining the policy update enables BO to scale to high-dimensional spaces (>100) as well as reduces the risk of damaging the system. We demonstrate the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

boschresearch/ConfidenceRegionBO
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Gaussian Processes and Bayesian Inference · Advanced Multi-Objective Optimization Algorithms