A Survey on Contextual Multi-armed Bandits

Li Zhou

arXiv:1508.03326·cs.LG·February 2, 2016·86 cites

TL;DR

This survey reviews several stochastic and adversarial contextual bandit algorithms and analyzes the assumptions and regret bound of each.

Contribution

It offers a systematic comparison of different algorithms, highlighting their theoretical guarantees and assumptions in the contextual bandit setting.

Findings

01. Different algorithms have varying regret bounds and assumptions.

02. The survey identifies key challenges and open problems in the field.

03. It provides a structured overview of the state-of-the-art methods.

Abstract

In this survey we cover a few stochastic and adversarial contextual bandit algorithms. We analyze each algorithm's assumption and regret bound.
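
To make the terms "contextual bandit" and "regret" concrete, here is a minimal sketch, not taken from the survey. It assumes a small finite set of contexts and runs the standard UCB1 rule separately for each one, which is a simple baseline rather than any particular algorithm the survey analyzes. The function name `run_per_context_ucb1`, the Bernoulli reward table `means`, and the uniform context distribution are all hypothetical choices made for illustration.

```python
import math
import random


def run_per_context_ucb1(means, horizon, seed=0):
    """Run UCB1 independently for each discrete context.

    means[c][a] is a hypothetical Bernoulli reward probability for arm a
    under context c. Returns the cumulative pseudo-regret after each round.
    """
    rng = random.Random(seed)
    n_contexts = len(means)
    n_arms = len(means[0])
    counts = [[0] * n_arms for _ in range(n_contexts)]  # pulls per (context, arm)
    sums = [[0.0] * n_arms for _ in range(n_contexts)]  # reward totals per (context, arm)
    seen = [0] * n_contexts                             # rounds in which each context appeared
    regret = 0.0
    history = []

    for _ in range(horizon):
        c = rng.randrange(n_contexts)  # assumption: contexts arrive uniformly at random
        seen[c] += 1

        untried = [a for a in range(n_arms) if counts[c][a] == 0]
        if untried:
            # Pull every arm once per context before using the index.
            arm = untried[0]
        else:
            # UCB1 index: empirical mean plus an exploration bonus.
            def index(a):
                mean = sums[c][a] / counts[c][a]
                bonus = math.sqrt(2.0 * math.log(seen[c]) / counts[c][a])
                return mean + bonus

            arm = max(range(n_arms), key=index)

        reward = 1.0 if rng.random() < means[c][arm] else 0.0
        counts[c][arm] += 1
        sums[c][arm] += reward

        # Pseudo-regret: gap between the best arm for this context and the chosen arm.
        regret += max(means[c]) - means[c][arm]
        history.append(regret)

    return history


if __name__ == "__main__":
    table = [[0.9, 0.1], [0.2, 0.8]]  # best arm differs by context
    curve = run_per_context_ucb1(table, horizon=5000, seed=0)
    print("cumulative pseudo-regret after 5000 rounds:", round(curve[-1], 2))
```

The regret curve compares the learner against the best arm for each observed context, which is why the best arm differs across the two rows of the example table. Treating every context as a separate bandit shares no information across contexts; algorithms that assume structure in the reward function (for example, linearity in the context) are designed to avoid that cost.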

Code & Models

Repositories

bgalbraith/bandits

Taxonomy

Topics: Advanced Bandit Algorithms Research · Reinforcement Learning in Robotics · Influenza Virus Research Studies