Gaussian-Dirichlet Posterior Dominance in Sequential Learning

Ian Osband; Benjamin Van Roy

arXiv:1702.04126·stat.ML·February 12, 2018·2 cites

Gaussian-Dirichlet Posterior Dominance in Sequential Learning

Ian Osband, Benjamin Van Roy

PDF

Open Access

TL;DR

This paper demonstrates that in sequential learning with categorical data, the Dirichlet posterior's mean second-order stochastically dominates the Gaussian posterior's mean, offering a new analytical tool.

Contribution

It establishes a formal ordering between Dirichlet and Gaussian posteriors in sequential learning, enhancing analysis of categorical outcome models.

Findings

01

Dirichlet posterior mean second-order stochastically dominates Gaussian posterior mean.

02

Results apply when conditioned on at least two observations.

03

Provides a new analytical tool for sequential learning with categorical data.

Abstract

We consider the problem of sequential learning from categorical observations bounded in [0,1]. We establish an ordering between the Dirichlet posterior over categorical outcomes and a Gaussian posterior under observations with N(0,1) noise. We establish that, conditioned upon identical data with at least two observations, the posterior mean of the categorical distribution will always second-order stochastically dominate the posterior mean of the Gaussian distribution. These results provide a useful tool for the analysis of sequential learning under categorical outcomes.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGaussian Processes and Bayesian Inference · Statistical Methods and Inference · Advanced Bandit Algorithms Research