Gradient-Variation Online Learning under Generalized Smoothness

Yan-Feng Xie; Peng Zhao; Zhi-Hua Zhou

arXiv:2408.09074·cs.LG·November 5, 2024

Gradient-Variation Online Learning under Generalized Smoothness

Yan-Feng Xie, Peng Zhao, Zhi-Hua Zhou

PDF

Open Access

TL;DR

This paper advances gradient-variation online learning by extending algorithms to generalized smoothness conditions, enabling adaptive, fast convergence in diverse optimization scenarios without prior curvature knowledge.

Contribution

It introduces a generalized smoothness framework, extends optimistic mirror descent, and designs a universal, adaptive algorithm with optimal regret bounds for convex and strongly convex functions.

Findings

01

Achieves gradient-variation regret bounds under generalized smoothness.

02

Develops a Lipschitz-adaptive meta-algorithm for unbounded gradients.

03

Provides applications for fast convergence in games and stochastic optimization.

Abstract

Gradient-variation online learning aims to achieve regret guarantees that scale with variations in the gradients of online functions, which has been shown to be crucial for attaining fast convergence in games and robustness in stochastic optimization, hence receiving increased attention. Existing results often require the smoothness condition by imposing a fixed bound on gradient Lipschitzness, which may be unrealistic in practice. Recent efforts in neural network optimization suggest a generalized smoothness condition, allowing smoothness to correlate with gradient norms. In this paper, we systematically study gradient-variation online learning under generalized smoothness. We extend the classic optimistic mirror descent algorithm to derive gradient-variation regret by analyzing stability over the optimization trajectory and exploiting smoothness locally. Then, we explore universal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace and Expression Recognition