Efficient and Optimal Fixed-Time Regret with Two Experts
Laura Greenstreet, Nicholas J. A. Harvey, Victor Sanches Portella

TL;DR
This paper introduces an optimal, efficient algorithm for two-expert online prediction that achieves fixed-time regret bounds with constant per-round processing, improving upon previous methods.
Contribution
The paper presents a new algorithm for two-expert prediction with costs in [0,1], achieving optimal fixed-time regret with O(1) processing time per round, extending previous work beyond binary costs.
Findings
Achieves fixed-time regret bounds matching the theoretical optimum.
Operates with constant O(1) processing time per round.
Extends previous binary-cost algorithms to general costs in [0,1].
Abstract
Prediction with expert advice is a foundational problem in online learning. In instances with rounds and experts, the classical Multiplicative Weights Update method suffers at most regret when is known beforehand. Moreover, this is asymptotically optimal when both and grow to infinity. However, when the number of experts is small/fixed, algorithms with better regret guarantees exist. Cover showed in 1967 a dynamic programming algorithm for the two-experts problem restricted to costs that suffers at most regret with pre-processing time. In this work, we propose an optimal algorithm for prediction with two experts' advice that works even for costs in and with processing time per turn. Our algorithm builds up on recent work on the experts problem based on techniques and tools from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Optimization and Search Problems
