Pareto Optimal Model Selection in Linear Bandits

Yinglun Zhu; Robert Nowak

arXiv:2102.06593·stat.ML·March 17, 2022·1 cites

Pareto Optimal Model Selection in Linear Bandits

Yinglun Zhu, Robert Nowak

PDF

Open Access

TL;DR

This paper investigates the challenge of model selection in linear bandits, establishing fundamental lower bounds and proposing Pareto optimal algorithms that adapt to unknown model dimensions, with empirical results demonstrating superior performance.

Contribution

It provides the first lower bound for model selection in linear bandits and introduces Pareto optimal algorithms that adapt to unknown dimensions, matching the lower bounds.

Findings

01

Established the first lower bound for model selection in linear bandits.

02

Proposed Pareto optimal algorithms that adapt to the true model dimension.

03

Empirical results show superior performance of the proposed algorithms.

Abstract

We study model selection in linear bandits, where the learner must adapt to the dimension (denoted by $d_{⋆}$ ) of the smallest hypothesis class containing the true linear model while balancing exploration and exploitation. Previous papers provide various guarantees for this model selection problem, but have limitations; i.e., the analysis requires favorable conditions that allow for inexpensive statistical testing to locate the right hypothesis class or are based on the idea of "corralling" multiple base algorithms, which often performs relatively poorly in practice. These works also mainly focus on upper bounds. In this paper, we establish the first lower bound for the model selection problem. Our lower bound implies that, even with a fixed action set, adaptation to the unknown dimension $d_{⋆}$ comes at a cost: There is no algorithm that can achieve the regret bound…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Reinforcement Learning in Robotics