On Theorem 2.3 in "Prediction, Learning, and Games" by Cesa-Bianchi and Lugosi
Alexey Chernov

TL;DR
This paper provides a modified proof of a loss bound for the exponentially weighted average forecaster with time-varying potential, showing the regret is bounded by sqrt{n ln(N)} across steps.
Contribution
It offers a new proof technique for the regret bound of the exponentially weighted average forecaster with time-varying potential.
Findings
Regret bound is upper-bounded by sqrt{n ln(N)}.
The proof applies uniformly across all steps n.
The result enhances understanding of the forecaster's performance.
Abstract
The note presents a modified proof of a loss bound for the exponentially weighted average forecaster with time-varying potential. The regret term of the algorithm is upper-bounded by sqrt{n ln(N)} (uniformly in n), where N is the number of experts and n is the number of steps.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Computability, Logic, AI Algorithms
