Efficient and Optimal Fixed-Time Regret with Two Experts

Laura Greenstreet; Nicholas J. A. Harvey; Victor Sanches Portella

arXiv:2203.07577·cs.LG·March 16, 2022

Efficient and Optimal Fixed-Time Regret with Two Experts

Laura Greenstreet, Nicholas J. A. Harvey, Victor Sanches Portella

PDF

Open Access

TL;DR

This paper introduces an optimal, efficient algorithm for two-expert online prediction that achieves fixed-time regret bounds with constant per-round processing, improving upon previous methods.

Contribution

The paper presents a new algorithm for two-expert prediction with costs in [0,1], achieving optimal fixed-time regret with O(1) processing time per round, extending previous work beyond binary costs.

Findings

01

Achieves fixed-time regret bounds matching the theoretical optimum.

02

Operates with constant O(1) processing time per round.

03

Extends previous binary-cost algorithms to general costs in [0,1].

Abstract

Prediction with expert advice is a foundational problem in online learning. In instances with $T$ rounds and $n$ experts, the classical Multiplicative Weights Update method suffers at most $(T /2) ln n$ regret when $T$ is known beforehand. Moreover, this is asymptotically optimal when both $T$ and $n$ grow to infinity. However, when the number of experts $n$ is small/fixed, algorithms with better regret guarantees exist. Cover showed in 1967 a dynamic programming algorithm for the two-experts problem restricted to ${0, 1}$ costs that suffers at most $T /2 π + O (1)$ regret with $O (T^{2})$ pre-processing time. In this work, we propose an optimal algorithm for prediction with two experts' advice that works even for costs in $[0, 1]$ and with $O (1)$ processing time per turn. Our algorithm builds up on recent work on the experts problem based on techniques and tools from…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Optimization and Search Problems