A Survey on Contextual Multi-armed Bandits
Li Zhou

TL;DR
This survey reviews various stochastic and adversarial contextual bandit algorithms, analyzing their assumptions and regret bounds to provide a comprehensive overview of the field.
Contribution
It offers a systematic comparison of different algorithms, highlighting their theoretical guarantees and assumptions in the contextual bandit setting.
Findings
Different algorithms have varying regret bounds and assumptions.
The survey identifies key challenges and open problems in the field.
It provides a structured overview of the state-of-the-art methods.
Abstract
In this survey we cover a few stochastic and adversarial contextual bandit algorithms. We analyze each algorithm's assumption and regret bound.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Reinforcement Learning in Robotics · Influenza Virus Research Studies
