Gaussian Imagination in Bandit Learning

Yueyang Liu; Adithya M. Devraj; Benjamin Van Roy; Kuang Xu

arXiv:2201.01902·cs.LG·February 23, 2022·1 cites

Gaussian Imagination in Bandit Learning

Yueyang Liu, Adithya M. Devraj, Benjamin Van Roy, Kuang Xu

PDF

Open Access

TL;DR

This paper analyzes how Gaussian-based Bayesian agents perform on Bernoulli bandits, showing that with diffuse priors, the regret increase is minimal and diminishes over time, supporting the robustness of Gaussian assumptions.

Contribution

The paper provides theoretical bounds on the regret increase for Gaussian Bayesian agents applied to Bernoulli bandits, formalizing the robustness of Gaussian assumptions in misspecified settings.

Findings

01

Regret increase grows at most as the square root of time horizon

02

Per-timestep regret increase vanishes with diffuse priors

03

Gaussian agents remain effective under misspecification

Abstract

Assuming distributions are Gaussian often facilitates computations that are otherwise intractable. We study the performance of an agent that attains a bounded information ratio with respect to a bandit environment with a Gaussian prior distribution and a Gaussian likelihood function when applied instead to a Bernoulli bandit. Relative to an information-theoretic bound on the Bayesian regret the agent would incur when interacting with the Gaussian bandit, we bound the increase in regret when the agent interacts with the Bernoulli bandit. If the Gaussian prior distribution and likelihood function are sufficiently diffuse, this increase grows at a rate which is at most linear in the square-root of the time horizon, and thus the per-timestep increase vanishes. Our results formalize the folklore that so-called Bayesian agents remain effective when instantiated with diffuse misspecified…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Data Stream Mining Techniques · Gaussian Processes and Bayesian Inference