Jointly Efficient and Optimal Algorithms for Logistic Bandits
Louis Faury, Marc Abeille, Kwang-Sung Jun, Cl\'ement Calauz\`enes

TL;DR
This paper introduces new algorithms for Logistic Bandits that are both statistically efficient, achieving near-optimal regret, and computationally efficient, with low per-round complexity, addressing a longstanding trade-off in the field.
Contribution
The authors propose a novel learning procedure that maintains confidence sets efficiently and combines it with planning methods to achieve both statistical and computational efficiency in Logistic Bandits.
Findings
Achieved exponential improvements in regret over previous strategies.
Developed algorithms with near-optimal problem-dependent regret bounds.
First to combine statistical and computational efficiency in Logistic Bandit algorithms.
Abstract
Logistic Bandits have recently undergone careful scrutiny by virtue of their combined theoretical and practical relevance. This research effort delivered statistically efficient algorithms, improving the regret of previous strategies by exponentially large factors. Such algorithms are however strikingly costly as they require operations at each round. On the other hand, a different line of research focused on computational efficiency ( per-round cost), but at the cost of letting go of the aforementioned exponential improvements. Obtaining the best of both world is unfortunately not a matter of marrying both approaches. Instead we introduce a new learning procedure for Logistic Bandits. It yields confidence sets which sufficient statistics can be easily maintained online without sacrificing statistical tightness. Combined with efficient planning mechanisms we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Forecasting Techniques and Applications · Machine Learning and Algorithms
