On Theorem 2.3 in "Prediction, Learning, and Games" by Cesa-Bianchi and   Lugosi

Alexey Chernov

arXiv:1011.5668·cs.LG·November 29, 2010

On Theorem 2.3 in "Prediction, Learning, and Games" by Cesa-Bianchi and Lugosi

Alexey Chernov

PDF

Open Access

TL;DR

This paper provides a modified proof of a loss bound for the exponentially weighted average forecaster with time-varying potential, showing the regret is bounded by sqrt{n ln(N)} across steps.

Contribution

It offers a new proof technique for the regret bound of the exponentially weighted average forecaster with time-varying potential.

Findings

01

Regret bound is upper-bounded by sqrt{n ln(N)}.

02

The proof applies uniformly across all steps n.

03

The result enhances understanding of the forecaster's performance.

Abstract

The note presents a modified proof of a loss bound for the exponentially weighted average forecaster with time-varying potential. The regret term of the algorithm is upper-bounded by sqrt{n ln(N)} (uniformly in n), where N is the number of experts and n is the number of steps.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Computability, Logic, AI Algorithms