Optimal Learning for Stochastic Optimization with Nonlinear Parametric   Belief Models

Xinyu He; Warren B. Powell

arXiv:1611.07161·stat.ML·November 23, 2016·SIAM J. Optim.

Optimal Learning for Stochastic Optimization with Nonlinear Parametric Belief Models

Xinyu He, Warren B. Powell

PDF

Open Access

TL;DR

This paper develops a sequential experimental design method for Bayesian learning with nonlinear belief models, using sampling and resampling techniques to efficiently estimate information gain and adapt to new data, with proven convergence and rapid empirical results.

Contribution

It introduces a novel resampling approach for adaptive Bayesian experimental design in nonlinear models, ensuring convergence and efficiency.

Findings

01

Method converges asymptotically to true parameters.

02

Achieves rapid convergence with few experiments.

03

Provides effective guidance for expensive sequential experiments.

Abstract

We consider the problem of estimating the expected value of information (the knowledge gradient) for Bayesian learning problems where the belief model is nonlinear in the parameters. Our goal is to maximize some metric, while simultaneously learning the unknown parameters of the nonlinear belief model, by guiding a sequential experimentation process which is expensive. We overcome the problem of computing the expected value of an experiment, which is computationally intractable, by using a sampled approximation, which helps to guide experiments but does not provide an accurate estimate of the unknown parameters. We then introduce a resampling process which allows the sampled model to adapt to new information, exploiting past experiments. We show theoretically that the method converges asymptotically to the true parameters, while simultaneously maximizing our metric. We show empirically…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Gaussian Processes and Bayesian Inference · Advanced Bandit Algorithms Research