Online Convex Optimization with Heavy Tails: Old Algorithms, New Regrets, and Applications

Zijian Liu

arXiv:2508.07473·cs.LG·March 20, 2026

Online Convex Optimization with Heavy Tails: Old Algorithms, New Regrets, and Applications

Zijian Liu

PDF

Open Access

TL;DR

This paper investigates classical online convex optimization algorithms in heavy-tailed noise settings, establishing optimal regret bounds without modifications and applying these results to nonconvex optimization and broader scenarios.

Contribution

It provides the first optimal regret bounds for old algorithms in heavy-tailed stochastic gradients without needing gradient clipping.

Findings

01

Classical algorithms achieve optimal regret bounds in heavy-tailed settings.

02

First provable convergence results for nonconvex optimization under heavy-tailed noise.

03

Extensions to smooth OCO and optimistic algorithms for broader cases.

Abstract

In Online Convex Optimization (OCO), when the stochastic gradient has a finite variance, many algorithms provably work and guarantee a sublinear regret. However, limited results are known if the gradient estimate has a heavy tail, i.e., the stochastic gradient only admits a finite $p$ -th central moment for some $p \in (1, 2]$ . Motivated by it, this work examines different old algorithms for OCO (e.g., Online Gradient Descent) in the more challenging heavy-tailed setting. Under the standard bounded domain assumption, we establish new regrets for these classical methods without any algorithmic modification. Remarkably, these regret bounds are fully optimal in all parameters (can be achieved even without knowing $p$ ), suggesting that OCO with heavy tails can be solved effectively without any extra operation (e.g., gradient clipping). Our new results have…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Advanced Bandit Algorithms Research · Sparse and Compressive Sensing Techniques