Adaptation to Easy Data in Prediction with Limited Advice

Tobias Sommer Thune; Yevgeny Seldin

arXiv:1807.00636·cs.LG·August 28, 2019·1 cites

Adaptation to Easy Data in Prediction with Limited Advice

Tobias Sommer Thune, Yevgeny Seldin

PDF

Open Access

TL;DR

This paper introduces SODA, an online learning algorithm that exploits easy data scenarios with small loss ranges, achieving improved regret bounds without prior knowledge of the loss range, and works effectively in both stochastic and adversarial settings.

Contribution

The paper presents SODA, a novel algorithm that circumvents previous impossibility results by using one additional observation per round, achieving adaptive regret bounds for easy data sequences.

Findings

01

SODA achieves an $O( ext{effective range} imes ext{scaling factors})$ regret bound.

02

SODA matches the lower bound $ ilde{ ext{Omega}}( ext{effective range} imes ext{sqrt}(T K))$.

03

SODA performs well in both stochastic and adversarial settings, providing safe and improved regret guarantees.

Abstract

We derive an online learning algorithm with improved regret guarantees for `easy' loss sequences. We consider two types of `easiness': (a) stochastic loss sequences and (b) adversarial loss sequences with small effective range of the losses. While a number of algorithms have been proposed for exploiting small effective range in the full information setting, Gerchinovitz and Lattimore [2016] have shown the impossibility of regret scaling with the effective range of the losses in the bandit setting. We show that just one additional observation per round is sufficient to circumvent the impossibility result. The proposed Second Order Difference Adjustments (SODA) algorithm requires no prior knowledge of the effective range of the losses, $ε$ , and achieves an $O (ε K T ln K) + \tilde{O} (ε K 4 T)$ expected regret guarantee, where $T$ is the time…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Adversarial Robustness in Machine Learning · Machine Learning and Algorithms