Towards Optimal Algorithms for Prediction with Expert Advice

Nick Gravin; Yuval Peres; Balasubramanian Sivan

arXiv:1409.3040·cs.LG·July 12, 2016

Towards Optimal Algorithms for Prediction with Expert Advice

Nick Gravin, Yuval Peres, Balasubramanian Sivan

PDF

Open Access 1 Video

TL;DR

This paper develops optimal algorithms and adversaries for the prediction with expert advice problem, extending known results from 2 experts to 3 and beyond, using probabilistic and asymptotic analysis.

Contribution

It introduces the first optimal algorithms for 3 experts, generalizes to more experts, and connects probability matching to minimax optimality in adversarial settings.

Findings

01

Optimal algorithms for 3 experts are probability matching against a specific adversary.

02

The probability matching algorithm is both adversarial and minimax optimal.

03

Extended analysis provides improved algorithms and bounds for 4 or more experts.

Abstract

We study the classical problem of prediction with expert advice in the adversarial setting with a geometric stopping time. In 1965, Cover gave the optimal algorithm for the case of 2 experts. In this paper, we design the optimal algorithm, adversary and regret for the case of 3 experts. Further, we show that the optimal algorithm for $2$ and $3$ experts is a probability matching algorithm (analogous to Thompson sampling) against a particular randomized adversary. Remarkably, our proof shows that the probability matching algorithm is not only optimal against this particular randomized adversary, but also minimax optimal. Our analysis develops upper and lower bounds simultaneously, analogous to the primal-dual method. Our analysis of the optimal adversary goes through delicate asymptotics of the random walk of a particle between multiple walls. We use the connection we develop to random…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Towards optimal algorithms for prediction with expert advice· youtube

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Adversarial Robustness in Machine Learning