Gradient-Variation Online Learning under Generalized Smoothness
Yan-Feng Xie, Peng Zhao, Zhi-Hua Zhou

TL;DR
This paper advances gradient-variation online learning by extending algorithms to generalized smoothness conditions, enabling adaptive, fast convergence in diverse optimization scenarios without prior curvature knowledge.
Contribution
It introduces a generalized smoothness framework, extends optimistic mirror descent, and designs a universal, adaptive algorithm with optimal regret bounds for convex and strongly convex functions.
Findings
Achieves gradient-variation regret bounds under generalized smoothness.
Develops a Lipschitz-adaptive meta-algorithm for unbounded gradients.
Provides applications for fast convergence in games and stochastic optimization.
Abstract
Gradient-variation online learning aims to achieve regret guarantees that scale with variations in the gradients of online functions, which has been shown to be crucial for attaining fast convergence in games and robustness in stochastic optimization, hence receiving increased attention. Existing results often require the smoothness condition by imposing a fixed bound on gradient Lipschitzness, which may be unrealistic in practice. Recent efforts in neural network optimization suggest a generalized smoothness condition, allowing smoothness to correlate with gradient norms. In this paper, we systematically study gradient-variation online learning under generalized smoothness. We extend the classic optimistic mirror descent algorithm to derive gradient-variation regret by analyzing stability over the optimization trajectory and exploiting smoothness locally. Then, we explore universal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition
