$k$-SVD with Gradient Descent
Yassir Jedra, Devavrat Shah

TL;DR
This paper introduces a simple gradient descent approach with a universal step-size rule for computing the top k singular values and vectors of any matrix, achieving global linear convergence and matching the performance of more complex methods.
Contribution
It presents a gradient descent method with a universal step-size for k-SVD that guarantees global convergence for matrices of any rank, improving over prior local or specialized results.
Findings
Gradient descent with a universal step-size converges globally for any k, d ≥ 1.
The method's behavior within an attractive region resembles Heron's method.
Empirical results validate the theoretical convergence guarantees.
Abstract
The emergence of modern compute infrastructure for iterative optimization has led to great interest in developing optimization-based approaches for a scalable computation of -SVD, i.e., the largest singular values and corresponding vectors of a matrix of rank . Despite lots of exciting recent works, all prior works fall short in this pursuit. Specifically, the existing results are either for the exact-parameterized (i.e., ) and over-parameterized (i.e., ) settings; or only establish local convergence guarantees; or use a step-size that requires problem-instance-specific oracle-provided information. In this work, we complete this pursuit by providing a gradient-descent method with a simple, universal rule for step-size selection (akin to pre-conditioning), that provably finds -SVD for a matrix of any rank . We establish that the gradient…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Numerical Analysis Techniques · Computational Geometry and Mesh Generation · Digital Image Processing Techniques
