Optimal Asynchronous Stochastic Nonconvex Optimization under Heavy-Tailed Noise

Yidong Wu; Luo Luo

arXiv:2601.19379 · math.OC · January 28, 2026

Open Access

TL;DR

This paper introduces an asynchronous normalized stochastic gradient descent algorithm with momentum for nonconvex optimization under heavy-tailed noise, achieving optimal time complexity and demonstrating effectiveness through experiments.

Contribution

The paper presents an asynchronous optimization method that handles heavy-tailed gradient noise and arbitrarily heterogeneous computation times across workers, with time complexity proven optimal under a bounded $p$th-order central moment assumption, $p \in (1, 2]$.

Findings

01

Achieves optimal time complexity for heavy-tailed noise scenarios.

02

Demonstrates effectiveness through numerical experiments.

03

Handles arbitrarily heterogeneous computation times.

Abstract

This paper considers the problem of asynchronous stochastic nonconvex optimization with heavy-tailed gradient noise and arbitrarily heterogeneous computation times across workers. We propose an asynchronous normalized stochastic gradient descent algorithm with momentum. The analysis shows that our method achieves the optimal time complexity under the assumption of a bounded $p$th-order central moment with $p \in (1, 2]$. We also provide numerical experiments to show the effectiveness of the proposed method.
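The abstract names the ingredients (asynchronous workers, normalized steps, momentum, heavy-tailed noise) but not the update rule itself. The following is a minimal single-process sketch of how those pieces might fit together; it is not the authors' algorithm. Everything in it is an assumption made for illustration: the function names, the toy nonconvex objective, the symmetric Pareto-type noise with tail index 1.5 (finite $p$th moment only for $p < 1.5$, infinite variance), the fixed per-worker compute times, and the constant step size and momentum parameter.

```python
import heapq
import math
import random


def grad(x):
    # Gradient of a toy nonconvex objective f(x) = sum_i x_i^2 / (1 + x_i^2).
    # (Hypothetical test function, not taken from the paper.)
    return [2.0 * xi / (1.0 + xi * xi) ** 2 for xi in x]


def heavy_tailed_noise(rng, dim, tail_index=1.5):
    # Symmetric Pareto-type noise: zero mean by symmetry, infinite variance
    # for tail_index <= 2, finite p-th moment only for p < tail_index.
    return [rng.choice((-1.0, 1.0)) * (rng.paretovariate(tail_index) - 1.0)
            for _ in range(dim)]


def async_normalized_sgd_momentum(x0, compute_times, num_updates=2000,
                                  lr=0.01, beta=0.9, seed=0):
    """Simulate a server that applies gradients as workers finish.

    compute_times[w] is the (assumed constant) time worker w needs per
    gradient, so slow workers return gradients evaluated at stale iterates.
    """
    rng = random.Random(seed)
    x = list(x0)
    m = [0.0] * len(x)
    # Event queue of (finish_time, worker_id, iterate the worker started from).
    events = [(t, w, list(x)) for w, t in enumerate(compute_times)]
    heapq.heapify(events)
    step_lengths = []
    for _ in range(num_updates):
        now, w, x_stale = heapq.heappop(events)
        # Stochastic gradient at the stale iterate, corrupted by heavy tails.
        noise = heavy_tailed_noise(rng, len(x))
        g = [gi + ni for gi, ni in zip(grad(x_stale), noise)]
        # Momentum averaging, then a step along the normalized momentum.
        m = [beta * mi + (1.0 - beta) * gi for mi, gi in zip(m, g)]
        norm = math.sqrt(sum(mi * mi for mi in m))
        if norm > 0.0:
            x_new = [xi - lr * mi / norm for xi, mi in zip(x, m)]
            step_lengths.append(math.dist(x, x_new))
            x = x_new
        # The worker immediately starts a new gradient at the current iterate.
        heapq.heappush(events, (now + compute_times[w], w, list(x)))
    return x, step_lengths


x_final, steps = async_normalized_sgd_momentum(
    [2.0, -1.5, 1.0], compute_times=[1.0, 2.5, 7.0], num_updates=500)
```

Because each step is taken along the normalized momentum, every update moves the iterate by exactly `lr` regardless of how large a single noisy gradient is; this is the usual motivation for normalization when the noise variance may be infinite. In this sketch a slower worker (larger entry in `compute_times`) simply contributes fewer and staler gradients.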

Taxonomy

Topics: Stochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Risk and Portfolio Optimization