Optimal Tracking in Prediction with Expert Advice
Hakan Gokcesu, Suleyman S. Kozat

TL;DR
This paper introduces a new online algorithm for prediction with expert advice that achieves min-max optimal dynamic regret without prior information, handling time-varying, possibly unbounded expert losses.
Contribution
It presents the first universally optimal, adaptive, and truly online algorithm for prediction with expert advice, with regret guarantees against any competitor sequence.
Findings
Achieves min-max optimal dynamic regret.
Handles time-varying, unbounded expert losses.
Operates with logarithmic complexity and no prior knowledge.
Abstract
We study the prediction with expert advice setting, where the aim is to produce a decision by combining the decisions generated by a set of experts, e.g., independently running algorithms. We achieve the min-max optimal dynamic regret under the prediction with expert advice setting, i.e., we can compete against time-varying (not necessarily fixed) combinations of expert decisions in an optimal manner. Our end-algorithm is truly online with no prior information, such as the time horizon or loss ranges, which are commonly used by different algorithms in the literature. Both our regret guarantees and the min-max lower bounds are derived with the general consideration that the expert losses can have time-varying properties and are possibly unbounded. Our algorithm can be adapted for restrictive scenarios regarding both loss feedback and decision making. Our guarantees are universal, i.e.,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Auction Theory and Applications · Age of Information Optimization
