Nesterov-aided Stochastic Gradient Methods using Laplace Approximation   for Bayesian Design Optimization

Andre Gustavo Carlon; Ben Mansour Dia; Luis FR Espath; Rafael Holdorf; Lopez; and Raul Tempone

arXiv:1807.00653·math.NA·February 28, 2020

Nesterov-aided Stochastic Gradient Methods using Laplace Approximation for Bayesian Design Optimization

Andre Gustavo Carlon, Ben Mansour Dia, Luis FR Espath, Rafael Holdorf, Lopez, and Raul Tempone

PDF

TL;DR

This paper introduces Nesterov-aided stochastic gradient methods combined with Laplace approximation estimators to efficiently solve Bayesian experimental design problems, significantly reducing model evaluations needed for complex models.

Contribution

The paper develops a novel combination of accelerated stochastic gradient methods with Laplace-based estimators for Bayesian design optimization, enabling efficient handling of complex models.

Findings

01

Accelerated stochastic gradient descent with MCLA converges faster than traditional methods.

02

Laplace-based estimators enable the use of complex models like PDEs.

03

Significant reduction in model evaluations, up to five orders of magnitude.

Abstract

Finding the best setup for experiments is the primary concern for Optimal Experimental Design (OED). Here, we focus on the Bayesian experimental design problem of finding the setup that maximizes the Shannon expected information gain. We use the stochastic gradient descent and its accelerated counterpart, which employs Nesterov's method, to solve the optimization problem in OED. We adapt a restart technique, originally proposed for the acceleration in deterministic optimization, to improve stochastic optimization methods. We combine these optimization methods with three estimators of the objective function: the double-loop Monte Carlo estimator (DLMC), the Monte Carlo estimator using the Laplace approximation for the posterior distribution (MCLA) and the double-loop Monte Carlo estimator with Laplace-based importance sampling (DLMCIS). Using stochastic gradient methods and Laplace-based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.