Accelerated Stochastic Algorithms for Nonconvex Finite-sum and Multi-block Optimization
Guanghui Lan, Yu Yang

TL;DR
This paper introduces accelerated stochastic algorithms, RapGrad and RapDual, that significantly reduce gradient and projection computations for nonconvex finite-sum and multi-block problems, especially when the condition number is large.
Contribution
The paper proposes novel randomized accelerated methods, RapGrad and RapDual, with improved complexity bounds for nonconvex optimization, outperforming existing deterministic and randomized algorithms.
Findings
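RapGrad reduces the number of gradient computations by a factor of up to O(√m) compared with deterministic methods, where m is the number of component functions.
RapDual reduces the number of projection subproblems by a factor of up to O(√m) relative to its deterministic counterparts, where m is the number of blocks.
The complexity bounds are new and indicate efficiency gains for large-scale nonconvex problems.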
Abstract
In this paper, we present new stochastic methods for solving two important classes of nonconvex optimization problems. We first introduce a randomized accelerated proximal gradient (RapGrad) method for solving a class of nonconvex optimization problems consisting of the sum of component functions, and show that it can significantly reduce the number of gradient computations, especially when the condition number (i.e., the ratio between the Lipschitz constant and negative curvature) is large. More specifically, RapGrad can reduce the number of gradient computations by a factor of up to O(√m) compared with existing deterministic nonconvex accelerated gradient methods. Moreover, the number of gradient computations required by RapGrad can be smaller by a multiplicative factor than that of the best-known randomized nonconvex gradient…
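The finite-sum setting and the condition number mentioned in the abstract can be written out roughly as follows. This is a sketch under assumptions: the symbols f_i, X, L, μ and κ, and the 1/m averaging, are illustrative notation that this excerpt does not itself define.

```latex
% Assumed notation (not given in the excerpt): m component functions f_i
% and a feasible set X.
\min_{x \in X} \; f(x) := \frac{1}{m} \sum_{i=1}^{m} f_i(x)

% Assumption: each component has an L-Lipschitz gradient and curvature
% bounded below by -\mu (the "negative curvature" parameter), with 0 < \mu \le L.
-\frac{\mu}{2}\,\|y - x\|^2
  \;\le\; f_i(y) - f_i(x) - \langle \nabla f_i(x),\, y - x \rangle
  \;\le\; \frac{L}{2}\,\|y - x\|^2
  \qquad \forall\, x, y \in X

% Condition number referred to in the abstract (\kappa is a hypothetical symbol):
\kappa := \frac{L}{\mu}
```

Under this reading, one full gradient of f costs m component-gradient evaluations, while a randomized step that samples a single index i costs one. That is why complexity here is counted in component gradients.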
Taxonomy
Topics: Stochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Complexity and Algorithms in Graphs
