On the Complexity of Representation Learning in Contextual Linear   Bandits

Andrea Tirinzoni; Matteo Pirotta; Alessandro Lazaric

arXiv:2212.09429·cs.LG·December 20, 2022

On the Complexity of Representation Learning in Contextual Linear Bandits

Andrea Tirinzoni, Matteo Pirotta, Alessandro Lazaric

PDF

Open Access

TL;DR

This paper investigates the inherent complexity of learning representations in contextual linear bandits, revealing that it can be significantly more difficult than learning with a fixed representation, depending on the structure of the problem.

Contribution

It provides a systematic, instance-dependent analysis showing the potential complexity gap between representation learning and fixed representation learning in contextual linear bandits.

Findings

01

Representation learning can be arbitrarily harder than fixed representation learning.

02

Learning with a set of representations is at least as hard as the worst case in the set.

03

In some cases, sub-logarithmic regret is achievable despite the complexity.

Abstract

In contextual linear bandits, the reward function is assumed to be a linear combination of an unknown reward vector and a given embedding of context-arm pairs. In practice, the embedding is often learned at the same time as the reward vector, thus leading to an online representation learning problem. Existing approaches to representation learning in contextual bandits are either very generic (e.g., model-selection techniques or algorithms for learning with arbitrary function classes) or specialized to particular structures (e.g., nested features or representations with certain spectral properties). As a result, the understanding of the cost of representation learning in contextual linear bandit is still limited. In this paper, we take a systematic approach to the problem and provide a comprehensive study through an instance-dependent perspective. We show that representation learning is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReceptor Mechanisms and Signaling · Advanced Bandit Algorithms Research