Low-Rank Approximation with $1/\epsilon^{1/3}$ Matrix-Vector Products
Ainesh Bakshi, Kenneth L. Clarkson, David P. Woodruff

TL;DR
This paper introduces an improved algorithm for low-rank matrix approximation using Krylov methods that significantly reduces the number of matrix-vector products needed, achieving near-optimal complexity for Schatten-$p$ norms.
Contribution
The paper presents a novel Krylov-based algorithm that reduces matrix-vector product complexity for Schatten-$p$ norm low-rank approximation, extending results to all $p \, \geq 1$ and establishing tight lower bounds.
Findings
Uses only $\tilde{O}(k p^{1/6} / \epsilon^{1/3})$ matrix-vector products.
Improves previous bounds for $p=2$ from $\tilde{O}(k/\epsilon^{1/2})$ to $\tilde{O}(k/\epsilon^{1/3})$.
Establishes a lower bound of $\Omega(1/\epsilon^{1/3})$ for all fixed $p \geq 1$, showing optimality.
Abstract
We study iterative methods based on Krylov subspaces for low-rank approximation under any Schatten- norm. Here, given access to a matrix through matrix-vector products, an accuracy parameter , and a target rank , the goal is to find a rank- matrix with orthonormal columns such that , where denotes the norm of the the singular values of . For the special cases of (Frobenius norm) and (Spectral norm), Musco and Musco (NeurIPS 2015) obtained an algorithm based on Krylov methods that uses matrix-vector products, improving on the na\"ive dependence obtainable by the power method, where suppresses poly factors. Our main result is an algorithm…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Advanced Optimization Algorithms Research
