The price of bandit information in multiclass online classification
Amit Daniely, Tom Helbertal

TL;DR
This paper compares error rates in full information and bandit scenarios for multiclass online learning, providing tight bounds and applying results to multiclass linear classifiers, thus answering open questions in the field.
Contribution
It establishes tight bounds on the ratio of error rates between bandit and full information settings for multiclass learning, and applies these to multiclass linear classifiers.
Findings
Error ratio in realizable case is at most 8|Y|log|Y|
Error ratio in agnostic case is O(\u221a{|Y|})
Bandit error rates for multiclass linear classifiers are tightly characterized
Abstract
We consider two scenarios of multiclass online learning of a hypothesis class . In the {\em full information} scenario, the learner is exposed to instances together with their labels. In the {\em bandit} scenario, the true label is not exposed, but rather an indication whether the learner's prediction is correct or not. We show that the ratio between the error rates in the two scenarios is at most in the realizable case, and in the agnostic case. The results are tight up to a logarithmic factor and essentially answer an open question from (Daniely et. al. - Multiclass learnability and the erm principle). We apply these results to the class of -margin multiclass linear classifiers in . We show that the bandit error rate of this class is in the realizable case and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Auction Theory and Applications
