No-Regret Linear Bandits under Gap-Adjusted Misspecification
Chong Liu, Dan Qiao, Ming Yin, Ilija Bogunovic, Yu-Xiang Wang

TL;DR
This paper introduces a new gap-adjusted model of misspecification in linear bandits, demonstrating that classical algorithms are robust and proposing a new phased elimination algorithm with optimal regret and efficient deployment.
Contribution
It proposes a natural gap-adjusted misspecification model and develops a novel phased elimination algorithm that achieves optimal regret without scaling with T.
Findings
Classical LinUCB is robust under gap-adjusted misspecification with diminishing parameter.
The phased elimination algorithm attains optimal O(√T) regret with only log T batches.
The new analysis introduces a self-bounding argument and inductive lemma for misspecification control.
Abstract
This work studies linear bandits under a new notion of gap-adjusted misspecification and is an extension of Liu et al. (2023). When the underlying reward function is not linear, existing linear bandits work usually relies on a uniform misspecification parameter that measures the sup-norm error of the best linear approximation. This results in an unavoidable linear regret whenever . We propose a more natural model of misspecification which only requires the approximation error at each input to be proportional to the suboptimality gap at . It captures the intuition that, for optimization problems, near-optimal regions should matter more and we can tolerate larger approximation errors in suboptimal regions. Quite surprisingly, we show that the classical LinUCB algorithm -- designed for the realizable case -- is automatically robust against such…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Auction Theory and Applications · Distributed Sensor Networks and Detection Algorithms
