Distributed stochastic inertial-accelerated methods with delayed derivatives for nonconvex problems
Yangyang Xu, Yibo Xu, Yonggui Yan, Jie Chen

TL;DR
This paper introduces an inertial proximal stochastic subgradient method with delayed derivatives for nonsmooth nonconvex optimization, providing convergence guarantees and demonstrating advantages in distributed asynchronous environments.
Contribution
It proposes a novel inertial accelerated SsGM capable of handling delayed derivatives in distributed settings with proven convergence rates.
Findings
Convergence rate of O(1/K^{1/2}) for three classes of nonconvex problems.
Delayed derivatives slow convergence, but the effect diminishes over iterations.
Higher parallel speed-up observed in asynchronous updates despite delays.
Abstract
Stochastic gradient methods (SGMs) are predominant approaches for solving stochastic optimization. On smooth nonconvex problems, a few acceleration techniques have been applied to improve the convergence rate of SGMs. However, little exploration has been made on applying a certain acceleration technique to a stochastic subgradient method (SsGM) for nonsmooth nonconvex problems. In addition, few efforts have been made to analyze an (accelerated) SsGM with delayed derivatives. The information delay naturally happens in a distributed system, where computing workers do not coordinate with each other. In this paper, we propose an inertial proximal SsGM for solving nonsmooth nonconvex stochastic optimization problems. The proposed method can have guaranteed convergence even with delayed derivative information in a distributed environment. Convergence rate results are established to three…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques
