Dealing With Misspecification In Fixed-Confidence Linear Top-m Identification
Cl\'emence R\'eda (UP, INSERM), Andrea Tirinzoni (Scool, CNRS), R\'emy, Degenne (Scool, CNRS)

TL;DR
This paper addresses the challenge of identifying the top m arms in linear bandit models under misspecification, proposing a new adaptive algorithm with theoretical guarantees and empirical validation.
Contribution
It introduces the first practical algorithm for fixed-confidence Top-m identification under misspecified linear models, with matching lower bounds and adaptive sample complexity.
Findings
The algorithm adapts to the level of misspecification.
Theoretical bounds match empirical performance.
Competitively performs on synthetic and real data.
Abstract
We study the problem of the identification of m arms with largest means under a fixed error rate (fixed-confidence Top-m identification), for misspecified linear bandit models. This problem is motivated by practical applications, especially in medicine and recommendation systems, where linear models are popular due to their simplicity and the existence of efficient algorithms, but in which data inevitably deviates from linearity. In this work, we first derive a tractable lower bound on the sample complexity of any -correct algorithm for the general Top-m identification problem. We show that knowing the scale of the deviation from linearity is necessary to exploit the structure of the problem. We then describe the first algorithm for this setting, which is both practical and adapts to the amount of misspecification. We derive an upper bound to its sample complexity which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Data Stream Mining Techniques
