Gradient-Variation Regret Bounds for Unconstrained Online Learning
Yuheng Zhao, Andrew Jacobsen, Nicol\`o Cesa-Bianchi, and Peng Zhao

TL;DR
This paper introduces parameter-free algorithms for unconstrained online learning that adapt to gradient variation, achieving regret bounds without prior knowledge of problem parameters.
Contribution
The authors develop fully adaptive, efficient algorithms with regret bounds based on gradient variation, extending to dynamic regret and improving SEA model results.
Findings
Achieve regret of order ext{O}( ext{ extbar}u ext{ extbar}\u00a0\u221aV_T(u)+L ext{ extbar}u ext{ extbar}^2+G^4)
No prior knowledge of comparator norm, Lipschitz constant, or smoothness needed
Efficient closed-form update in each round
Abstract
We develop parameter-free algorithms for unconstrained online learning with regret guarantees that scale with the gradient variation . For -smooth convex loss, we provide fully-adaptive algorithms achieving regret of order without requiring prior knowledge of comparator norm , Lipschitz constant , or smoothness . The update in each round can be computed efficiently via a closed-form expression. Our results extend to dynamic regret and find immediate implications to the stochastically-extended adversarial (SEA) model, which significantly improves upon the previous best-known result [Wang et al., 2025].
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
