Cover meets Robbins while Betting on Bounded Data: $\ln n$ Regret and Almost Sure $\ln\ln n$ Regret
Shubhada Agrawal, Aaditya Ramdas

TL;DR
This paper introduces a novel mixture betting strategy that achieves near-optimal regret bounds, combining stochastic adaptivity with adversarial robustness, and demonstrates a sharp game-theoretic law of the iterated logarithm.
Contribution
It presents a new mixture betting approach that attains $O( ln n)$ regret on some paths and $O( ln ln n)$ on others, blending stochastic and adversarial guarantees.
Findings
Achieves $O( ln ln n)$ regret on almost all paths under certain conditions.
Demonstrates a sharp game-theoretic upper law of the iterated logarithm.
Contrasts with prior work by providing a best-of-both-worlds betting strategy.
Abstract
Consider betting against a sequence of data in , where one is allowed to make any bet that is fair if the data have a conditional mean . Cover's universal portfolio algorithm delivers a worst-case regret of compared to the best constant bet in hindsight, and this bound is unimprovable against adversarially generated data. In this work, we present a novel mixture betting strategy that combines insights from Robbins and Cover, and exhibits a different behavior: it eventually produces a regret of on almost all paths (a measure-one set of paths if each conditional mean equals and intrinsic variance increases to ), but has an regret on the complement (a measure zero set of paths). Our paper appears to be the first to point out the value in hedging two very different strategies to achieve a best-of-both-worlds…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
