The Real Price of Bandit Information in Multiclass Classification

Liad Erez; Alon Cohen; Tomer Koren; Yishay Mansour; Shay Moran

arXiv:2405.10027·cs.LG·June 21, 2024

The Real Price of Bandit Information in Multiclass Classification

Liad Erez, Alon Cohen, Tomer Koren, Yishay Mansour, Shay Moran

PDF

Open Access

TL;DR

This paper investigates the regret bounds in multiclass bandit classification, revealing a more nuanced dependency on the number of labels and hypothesis class size, and introduces an improved algorithm with tighter regret guarantees.

Contribution

The paper provides a new analysis of minimax regret in multiclass bandit classification and proposes an algorithm with improved regret bounds for certain hypothesis class sizes.

Findings

01

Regret bounds depend on both hypothesis class size and number of labels.

02

Proposed algorithm achieves regret of O(|H|+\u007frac{ ext{T}}{ ext{log} |H|}) for moderate-sized classes.

03

Matching lower bounds confirm the tightness of the regret bounds.

Abstract

We revisit the classical problem of multiclass classification with bandit feedback (Kakade, Shalev-Shwartz and Tewari, 2008), where each input classifies to one of $K$ possible labels and feedback is restricted to whether the predicted label is correct or not. Our primary inquiry is with regard to the dependency on the number of labels $K$ , and whether $T$ -step regret bounds in this setting can be improved beyond the $K T$ dependence exhibited by existing algorithms. Our main contribution is in showing that the minimax regret of bandit multiclass is in fact more nuanced, and is of the form $Θ (min {∣ H ∣ + T, K T lo g ∣ H ∣})$ , where $H$ is the underlying (finite) hypothesis class. In particular, we present a new bandit classification algorithm that guarantees regret $O (∣ H ∣ + T)$ ,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research