On Adapting Nesterov's Scheme to Accelerate Iterative Methods for Linear Problems
Tao Hong, Irad Yavneh

TL;DR
This paper adapts Nesterov's acceleration scheme to iterative methods for linear systems, providing explicit optimal parameters, demonstrating robustness over complex eigenvalues, and validating efficiency through numerical tests.
Contribution
It introduces a Nesterov-inspired acceleration scheme for linear solvers, with explicit optimal parameters and robustness analysis for complex eigenvalues.
Findings
The scheme requires more iterations than classical methods but is easy to implement.
Explicit formulas for optimal parameters are derived based on eigenvalues.
Numerical tests confirm the scheme's efficiency and robustness.
Abstract
Nesterov's well-known scheme for accelerating gradient descent in convex optimization problems is adapted to accelerating stationary iterative solvers for linear systems. Compared with classical Krylov subspace acceleration methods, the proposed scheme requires more iterations, but it is trivial to implement and retains essentially the same computational cost as the unaccelerated method. An explicit formula for a fixed optimal parameter is derived in the case where the stationary iteration matrix has only real eigenvalues, based only on the smallest and largest eigenvalues. The fixed parameter, and corresponding convergence factor, are shown to maintain their optimality when the iteration matrix also has complex eigenvalues that are contained within an explicitly defined disk in the complex plane. A comparison to Chebyshev acceleration based on the same information of the smallest and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Matrix Theory and Algorithms · Stochastic Gradient Optimization Techniques
