Beyond Bandit Feedback in Online Multiclass Classification

Dirk van der Hoeven; Federico Fusco; Nicolò Cesa-Bianchi

arXiv:2106.03596 · cs.LG · February 20, 2024 · 1 citation

TL;DR

This paper introduces Gappletron, an online multiclass classification algorithm that works with arbitrary feedback graphs. It comes with surrogate regret bounds of order B√(ρKT) and is competitive with existing algorithms in synthetic experiments.

Contribution

We propose Gappletron, the first algorithm for online multiclass classification with arbitrary feedback graphs, and establish its theoretical regret bounds and practical competitiveness.

Findings

01 Surrogate regret bounds of order B√(ρKT), holding both in expectation and with high probability.

02 Constant surrogate regret of order B²K in the full-information setting.

03 Lower bound of order max{B²K, √T}, showing that the upper bounds are nearly optimal.

Abstract

We study the problem of online multiclass classification in a setting where the learner's feedback is determined by an arbitrary directed graph. While including bandit feedback as a special case, feedback graphs allow a much richer set of applications, including filtering and label efficient classification. We introduce Gappletron, the first online multiclass algorithm that works with arbitrary feedback graphs. For this new algorithm, we prove surrogate regret bounds that hold, both in expectation and with high probability, for a large class of surrogate losses. Our bounds are of order $B\sqrt{\rho K T}$, where $B$ is the diameter of the prediction space, $K$ is the number of classes, $T$ is the time horizon, and $\rho$ is the domination number (a graph-theoretic parameter affecting the amount of exploration). In the full information case, we show that Gappletron achieves a constant…
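To make the domination number concrete, here is a minimal brute-force sketch in Python. It is not taken from the paper. It assumes the standard definition for directed graphs (a set S is dominating if every node is in S or has an incoming edge from a node in S), and the function name `domination_number` and the edge-list encoding are hypothetical. Encoding bandit feedback as a graph with only self-loops and full information as a complete graph is also an illustrative assumption.

```python
from itertools import combinations


def domination_number(num_classes, edges):
    """Size of a smallest dominating set of a directed graph on 0..num_classes-1.

    Assumed definition: S is dominating if every node is in S or has an
    incoming edge from some node in S. Brute force, so only for small graphs.
    """
    # Each node covers itself plus everything it points to.
    covers = {v: {v} for v in range(num_classes)}
    for u, v in edges:
        covers[u].add(v)

    all_nodes = set(range(num_classes))
    # Try candidate sets in order of increasing size; the first hit is minimal.
    for size in range(1, num_classes + 1):
        for candidate in combinations(range(num_classes), size):
            covered = set().union(*(covers[u] for u in candidate))
            if covered == all_nodes:
                return size
    return num_classes  # only reached when num_classes == 0


K = 4
bandit = [(i, i) for i in range(K)]                    # self-loops only
full_info = [(i, j) for i in range(K) for j in range(K)]  # complete graph
cycle = [(i, (i + 1) % K) for i in range(K)]           # directed cycle

print(domination_number(K, bandit))     # every class must be in the set
print(domination_number(K, full_info))  # a single class reveals everything
print(domination_number(K, cycle))      # each class covers itself and its successor
```

Under these assumptions the self-loop graph gives ρ = K and the complete graph gives ρ = 1, so a bound of order $B\sqrt{\rho K T}$ would scale as $BK\sqrt{T}$ in the first case and $B\sqrt{KT}$ in the second.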

Videos

Beyond Bandit Feedback in Online Multiclass Classification · SlidesLive

Taxonomy

Topics: Advanced Bandit Algorithms Research · Machine Learning and Algorithms · Data Stream Mining Techniques