Optimal Asynchronous Stochastic Nonconvex Optimization under Heavy-Tailed Noise
Yidong Wu, Luo Luo

TL;DR
This paper introduces an asynchronous normalized stochastic gradient descent algorithm with momentum for nonconvex optimization under heavy-tailed noise, achieving optimal time complexity and demonstrating effectiveness through experiments.
Contribution
It presents a novel asynchronous optimization method that handles heavy-tailed noise and heterogeneity, with proven optimal time complexity under certain moment conditions.
Findings
Achieves optimal time complexity for heavy-tailed noise scenarios.
Demonstrates effectiveness through numerical experiments.
Handles arbitrarily heterogeneous computation times.
Abstract
This paper considers the problem of asynchronous stochastic nonconvex optimization with heavy-tailed gradient noise and arbitrarily heterogeneous computation times across workers. We propose an asynchronous normalized stochastic gradient descent algorithm with momentum. The analysis show that our method achieves the optimal time complexity under the assumption of bounded th-order central moment with . We also provide numerical experiments to show the effectiveness of proposed method.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Risk and Portfolio Optimization
