TL;DR
This paper introduces a randomized blocked algorithm for efficiently computing partial rank-revealing factorizations of matrices, achieving high accuracy and speed, especially on GPU architectures, by reducing communication and avoiding pivoting.
Contribution
The paper presents a novel randomized blocked method for low-rank matrix factorizations that improves computational efficiency and accuracy over traditional techniques.
Findings
Achieves accuracy comparable or better than column-pivoted QR.
Reduces communication and accelerates computations on GPUs.
Maintains asymptotic complexity of $O(mnk)$.
Abstract
This manuscript describes a technique for computing partial rank-revealing factorizations, such as, e.g, a partial QR factorization or a partial singular value decomposition. The method takes as input a tolerance and an matrix , and returns an approximate low rank factorization of that is accurate to within precision in the Frobenius norm (or some other easily computed norm). The rank of the computed factorization (which is an output of the algorithm) is in all examples we examined very close to the theoretically optimal -rank. The proposed method is inspired by the Gram-Schmidt algorithm, and has the same asymptotic flop count. However, the method relies on randomized sampling to avoid column pivoting, which allows it to be blocked, and hence accelerates practical computations by reducing communication. Numerical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
