From Contextual Combinatorial Semi-Bandits to Bandit List Classification: Improved Sample Complexity with Sparse Rewards

Liad Erez; Tomer Koren

arXiv:2502.09257·cs.LG·February 24, 2026

TL;DR

This paper introduces a new algorithm for contextual combinatorial semi-bandits with sparse rewards, achieving improved sample complexity bounds and extending to list classification and adversarial settings.

Contribution

It provides a novel sample complexity bound for the $(ϵ, δ)$-PAC setting in sparse regimes, generalizes list multiclass classification, and extends regret bounds to adversarial data.

Findings

01. Sample complexity improves when the sparsity $s$ is much smaller than $K$.

02. The algorithm is computationally efficient given access to an ERM oracle.

03. The bounds extend to adversarial and list classification settings.

Abstract

We study the problem of contextual combinatorial semi-bandits, where input contexts are mapped into subsets of size $m$ of a collection of $K$ possible actions. In each round, the learner observes the realized reward of the predicted actions. Motivated by prototypical applications of contextual bandits, we focus on the $s$-sparse regime where we assume that the sum of rewards is bounded by some value $s ≪ K$. For example, in recommendation systems the number of products purchased by any customer is significantly smaller than the total number of available products. Our main result is for the $(ϵ, δ)$-PAC variant of the problem for which we design an algorithm that returns an $ϵ$-optimal policy with high probability using a sample complexity of $\tilde{O}\big((\mathrm{poly}(K/m) + sm/ϵ^{2}) \log(|Π|/δ)\big)$ where $Π$ is the underlying (finite) class and $s$ is the sparsity…
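The interaction protocol described in the abstract can be illustrated with a small simulation. This is a minimal sketch of the feedback model only, not the paper's algorithm: the binary rewards, the uniformly random reward generator, and the names `sparse_reward_vector`, `semi_bandit_round`, and `fixed_policy` are all assumptions introduced here for illustration.

```python
import random


def sparse_reward_vector(K, s, rng):
    """Return a binary reward vector over K actions whose sum is at most s.

    Assumption: rewards are binary and the rewarded actions are drawn
    uniformly at random; the abstract only requires the sum to be <= s.
    """
    rewards = [0] * K
    num_rewarded = rng.randint(0, s)  # inclusive on both ends
    for action in rng.sample(range(K), num_rewarded):
        rewards[action] = 1
    return rewards


def semi_bandit_round(policy, context, rewards, m):
    """Play one round with semi-bandit feedback.

    The policy maps the context to m distinct actions; only the rewards
    of those chosen actions are revealed to the learner.
    """
    chosen = policy(context)
    if len(set(chosen)) != m:
        raise ValueError("policy must return exactly m distinct actions")
    feedback = {action: rewards[action] for action in chosen}
    return feedback, sum(feedback.values())


K, m, s = 20, 3, 2  # hypothetical problem sizes with s much smaller than K
rng = random.Random(0)


def fixed_policy(context):
    # Hypothetical baseline policy: ignores the context, picks actions 0..m-1.
    return list(range(m))


round_gains = []
for _ in range(100):
    context = rng.random()  # placeholder context
    rewards = sparse_reward_vector(K, s, rng)
    feedback, gained = semi_bandit_round(fixed_policy, context, rewards, m)
    round_gains.append(gained)

print("total reward over 100 rounds:", sum(round_gains))
```

In this sketch the per-round reward can never exceed $\min(m, s)$, which is the structural property the $s$-sparse assumption captures: most of the $K$ actions carry no reward in any given round.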

Taxonomy

Topics: Data Stream Mining Techniques · Anomaly Detection Techniques and Applications

Methods: Sparse Evolutionary Training