Fast Rates for Online Prediction with Abstention
Gergely Neu, Nikita Zhivotovskiy

TL;DR
This paper demonstrates that allowing a small abstention cost in online binary sequence prediction enables regret bounds independent of the time horizon, with precise dependence on abstention cost and number of experts.
Contribution
It provides the first exact characterization of regret bounds with abstention, showing a significant improvement over traditional rates without abstention.
Findings
Regret bounds are independent of the time horizon when abstention cost is below 0.5.
Matching upper and lower bounds of order (log N)/(1-2c) are established.
Extensions include regret bounds for changing abstention costs over time.
Abstract
In the setting of sequential prediction of individual -sequences with expert advice, we show that by allowing the learner to abstain from the prediction by paying a cost marginally smaller than (say, ), it is possible to achieve expected regret bounds that are independent of the time horizon . We exactly characterize the dependence on the abstention cost and the number of experts by providing matching upper and lower bounds of order , which is to be contrasted with the best possible rate of that is available without the option to abstain. We also discuss various extensions of our model, including a setting where the sequence of abstention costs can change arbitrarily over time, where we show regret bounds interpolating between the slow and the fast rates mentioned above, under some natural assumptions on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Optimization and Search Problems
