Gaussian Process Planning with Lipschitz Continuous Reward Functions:   Towards Unifying Bayesian Optimization, Active Learning, and Beyond

Chun Kai Ling; Kian Hsiang Low; Patrick Jaillet

arXiv:1511.06890·stat.ML·November 24, 2015

Gaussian Process Planning with Lipschitz Continuous Reward Functions: Towards Unifying Bayesian Optimization, Active Learning, and Beyond

Chun Kai Ling, Kian Hsiang Low, Patrick Jaillet

PDF

TL;DR

This paper introduces a flexible Gaussian process planning framework with Lipschitz continuous rewards, unifying active learning and Bayesian optimization, and proposes an efficient, real-time epsilon-GPP algorithm with performance guarantees.

Contribution

It develops a nonmyopic adaptive GPP framework leveraging Lipschitz continuity to solve complex decision problems and introduces an anytime branch-and-bound algorithm for real-time planning.

Findings

01

Effective in Bayesian optimization tasks

02

Demonstrated success in energy harvesting applications

03

Provides performance guarantees for the planning algorithm

Abstract

This paper presents a novel nonmyopic adaptive Gaussian process planning (GPP) framework endowed with a general class of Lipschitz continuous reward functions that can unify some active learning/sensing and Bayesian optimization criteria and offer practitioners some flexibility to specify their desired choices for defining new tasks/problems. In particular, it utilizes a principled Bayesian sequential decision problem framework for jointly and naturally optimizing the exploration-exploitation trade-off. In general, the resulting induced GPP policy cannot be derived exactly due to an uncountable set of candidate observations. A key contribution of our work here thus lies in exploiting the Lipschitz continuity of the reward functions to solve for a nonmyopic adaptive epsilon-optimal GPP (epsilon-GPP) policy. To plan in real time, we further propose an asymptotically optimal,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsGaussian Process