The price of bandit information in multiclass online classification

Amit Daniely; Tom Helbertal

arXiv:1302.1043·cs.LG·July 10, 2013

TL;DR

This paper compares the error rates achievable in the full information and bandit scenarios of multiclass online learning, bounds their ratio up to a logarithmic factor, and applies the bounds to multiclass linear classifiers, essentially answering an open question from Daniely et al.

Contribution

It bounds the ratio between the error rates of the bandit and full information settings for multiclass online learning, tight up to a logarithmic factor, and applies these bounds to γ-margin multiclass linear classifiers.

Findings

1. The error ratio in the realizable case is at most 8·|Y|·log|Y|.

2. The error ratio in the agnostic case is Õ(√|Y|).

3. Bandit error rates for γ-margin multiclass linear classifiers are tightly characterized, up to logarithmic factors.
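
To get a feel for the scale of the realizable-case bound, the expression 8·|Y|·log|Y| can be evaluated for a few label-set sizes. This is only a sketch: the function name `realizable_ratio_bound` is hypothetical, and the natural logarithm is an assumption, since the base of the logarithm is not stated here.

```python
import math


def realizable_ratio_bound(num_labels: int) -> float:
    """Evaluate 8 * |Y| * log|Y| for a label set of size num_labels.

    Assumption: the logarithm is natural; the source does not give the base.
    """
    if num_labels < 2:
        raise ValueError("need at least two labels")
    return 8 * num_labels * math.log(num_labels)


if __name__ == "__main__":
    for k in (2, 10, 100):
        print(f"|Y| = {k:>3}: ratio bound = {realizable_ratio_bound(k):.1f}")
```

Read this way, a full information mistake bound of M in the realizable case would translate into a bandit mistake bound of at most M times this factor.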

Abstract

We consider two scenarios of multiclass online learning of a hypothesis class $H \subseteq Y^{X}$. In the full information scenario, the learner is exposed to instances together with their labels. In the bandit scenario, the true label is not exposed; the learner receives only an indication of whether its prediction is correct. We show that the ratio between the error rates in the two scenarios is at most $8 \cdot |Y| \cdot \log(|Y|)$ in the realizable case, and $\tilde{O}(\sqrt{|Y|})$ in the agnostic case. The results are tight up to a logarithmic factor and essentially answer an open question from (Daniely et al., Multiclass learnability and the ERM principle). We apply these results to the class of $\gamma$-margin multiclass linear classifiers in $\mathbb{R}^{d}$. We show that the bandit error rate of this class is $\tilde{\Theta}\left(\frac{|Y|}{\gamma^{2}}\right)$ in the realizable case and…
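
The difference between the two feedback models can be illustrated with a toy simulation. The sketch below is not the paper's algorithm: it assumes a finite hypothesis class, a realizable stream, and a simple plurality-vote learner over the version space. The names `run_online`, `hypotheses`, and `stream` are hypothetical.

```python
from collections import Counter
from typing import Callable, Iterable, List, Tuple

Hypothesis = Callable[[int], int]


def run_online(hypotheses: List[Hypothesis],
               stream: Iterable[Tuple[int, int]],
               bandit: bool) -> int:
    """Run a plurality-vote version-space learner and count its mistakes.

    Assumes the realizable case: some hypothesis labels every instance
    in the stream correctly, so the version space never becomes empty.
    """
    version_space = list(hypotheses)
    mistakes = 0
    for x, y in stream:
        # Predict the label that most surviving hypotheses agree on.
        votes = Counter(h(x) for h in version_space)
        prediction = votes.most_common(1)[0][0]
        correct = prediction == y
        if not correct:
            mistakes += 1
        if bandit:
            # Bandit feedback: only the bit `correct` is revealed.
            if correct:
                version_space = [h for h in version_space if h(x) == prediction]
            else:
                version_space = [h for h in version_space if h(x) != prediction]
        else:
            # Full information: the true label y is revealed.
            version_space = [h for h in version_space if h(x) == y]
    return mistakes


# Toy class over k labels: h_c(x) = (x + c) mod k, with target h_{k-1}.
k = 5
hypotheses = [lambda x, c=c: (x + c) % k for c in range(k)]
stream = [(x, (x + k - 1) % k) for x in range(10)]

full_mistakes = run_online(hypotheses, stream, bandit=False)
bandit_mistakes = run_online(hypotheses, stream, bandit=True)
print("full information mistakes:", full_mistakes)
print("bandit mistakes:", bandit_mistakes)
```

In this toy class every hypothesis predicts a different label on each instance, so a wrong guess under bandit feedback rules out a single hypothesis, whereas the revealed label under full information identifies the target immediately. The gap therefore grows with the number of labels.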

Taxonomy

Topics: Advanced Bandit Algorithms Research · Machine Learning and Algorithms · Auction Theory and Applications