Prediction with Expert Advice in Games with Unbounded One-Step Gains

Vladimir V. V'yugin

arXiv:0806.4391·cs.LG·June 30, 2008

Prediction with Expert Advice in Games with Unbounded One-Step Gains

Vladimir V. V'yugin

PDF

Open Access

TL;DR

This paper extends algorithms for prediction with expert advice to scenarios with unbounded one-step gains, providing bounds and a universal algorithm that performs optimally even with large gains.

Contribution

It introduces modifications to existing algorithms for unbounded gains, establishes lower bounds, and constructs a universal algorithm with optimal performance.

Findings

01

Universal algorithm achieves optimal performance with unbounded gains.

02

Performance close to the best expert under limited deviation conditions.

03

Provides lower bounds for cumulative gain in the general case.

Abstract

The games of prediction with expert advice are considered in this paper. We present some modification of Kalai and Vempala algorithm of following the perturbed leader for the case of unrestrictedly large one-step gains. We show that in general case the cumulative gain of any probabilistic prediction algorithm can be much worse than the gain of some expert of the pool. Nevertheless, we give the lower bound for this cumulative gain in general case and construct a universal algorithm which has the optimal performance; we also prove that in case when one-step gains of experts of the pool have ``limited deviations'' the performance of our algorithm is close to the performance of the best expert.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Sports Analytics and Performance · Reinforcement Learning in Robotics