Adaptation to Easy Data in Prediction with Limited Advice
Tobias Sommer Thune, Yevgeny Seldin

TL;DR
This paper introduces SODA, an online learning algorithm that exploits easy data scenarios with small loss ranges, achieving improved regret bounds without prior knowledge of the loss range, and works effectively in both stochastic and adversarial settings.
Contribution
The paper presents SODA, a novel algorithm that circumvents previous impossibility results by using one additional observation per round, achieving adaptive regret bounds for easy data sequences.
Findings
SODA achieves an $O( ext{effective range} imes ext{scaling factors})$ regret bound.
SODA matches the lower bound $ ilde{ ext{Omega}}( ext{effective range} imes ext{sqrt}(T K))$.
SODA performs well in both stochastic and adversarial settings, providing safe and improved regret guarantees.
Abstract
We derive an online learning algorithm with improved regret guarantees for `easy' loss sequences. We consider two types of `easiness': (a) stochastic loss sequences and (b) adversarial loss sequences with small effective range of the losses. While a number of algorithms have been proposed for exploiting small effective range in the full information setting, Gerchinovitz and Lattimore [2016] have shown the impossibility of regret scaling with the effective range of the losses in the bandit setting. We show that just one additional observation per round is sufficient to circumvent the impossibility result. The proposed Second Order Difference Adjustments (SODA) algorithm requires no prior knowledge of the effective range of the losses, , and achieves an expected regret guarantee, where is the time…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Adversarial Robustness in Machine Learning · Machine Learning and Algorithms
