Delaytron: Efficient Learning of Multiclass Classifiers with Delayed   Bandit Feedbacks

Naresh Manwani; Mudit Agarwal

arXiv:2205.08234·cs.LG·May 18, 2022

Delaytron: Efficient Learning of Multiclass Classifiers with Delayed Bandit Feedbacks

Naresh Manwani, Mudit Agarwal

PDF

Open Access

TL;DR

Delaytron is an online algorithm designed for multiclass classification with delayed bandit feedbacks, achieving regret bounds that adapt to unknown delays and missing feedback, validated through experiments.

Contribution

We introduce Delaytron, an efficient online algorithm for multiclass classification with unknown delays and missing feedback, providing adaptive regret bounds and empirical validation.

Findings

01

Achieves regret bounds of order with known delays

02

Uses doubling trick for unknown delays and missing feedback

03

Demonstrates effectiveness through experiments on various datasets

Abstract

In this paper, we present online algorithm called {\it Delaytron} for learning multi class classifiers using delayed bandit feedbacks. The sequence of feedback delays ${d_{t}}_{t = 1}^{T}$ is unknown to the algorithm. At the $t$ -th round, the algorithm observes an example $x_{t}$ and predicts a label $\tilde{y}_{t}$ and receives the bandit feedback $I [\tilde{y}_{t} = y_{t}]$ only $d_{t}$ rounds later. When $t + d_{t} > T$ , we consider that the feedback for the $t$ -th round is missing. We show that the proposed algorithm achieves regret of $O (\frac{2 K}{γ} [\frac{T}{2} + (2 + \frac{L ^{2}}{R ^{2} ∥ \W ∥ _{F}^{2}}) \sum_{t = 1}^{T} d_{t}])$ when the loss for each missing sample is upper bounded by $L$ . In the case when the loss for missing samples is not upper bounded, the regret achieved by Delaytron is $\mathcal{O}\left(\sqrt{\frac{2…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Data Stream Mining Techniques · Machine Learning and Algorithms