Gradient Descent and the Power Method: Exploiting their connection to find the leftmost eigen-pair and escape saddle points
Rachael Tappenden, Martin Tak\'a\v{c}

TL;DR
This paper reveals that gradient descent with fixed step size on quadratic functions is equivalent to the power method, enabling eigen-information extraction and proposing new step size strategies to improve nonconvex optimization.
Contribution
It establishes a connection between gradient descent and the power method, providing new insights and strategies for eigenvalue estimation and nonconvex optimization.
Findings
GD with fixed step size on quadratic functions reveals eigenvalues
Eigen-information can be used to accelerate convergence
Proposed new step size strategies improve practical performance
Abstract
This work shows that applying Gradient Descent (GD) with a fixed step size to minimize a (possibly nonconvex) quadratic function is equivalent to running the Power Method (PM) on the gradients. The connection between GD with a fixed step size and the PM, both with and without fixed momentum, is thus established. Consequently, valuable eigen-information is available via GD. Recent examples show that GD with a fixed step size, applied to locally quadratic nonconvex functions, can take exponential time to escape saddle points (Simon S. Du, Chi Jin, Jason D. Lee, Michael I. Jordan, Aarti Singh, and Barnabas Poczos: "Gradient descent can take exponential time to escape saddle points"; S. Paternain, A. Mokhtari, and A. Ribeiro: "A newton-based method for nonconvex optimization with fast evasion of saddle points"). Here, those examples are revisited and it is shown that eigenvalue…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Advanced Optimization Algorithms Research
Methods@15 Ways to Get Help || How do I speak to a live person at JetBlue?
