Direct QR factorizations for tall-and-skinny matrices in MapReduce architectures
Austin R. Benson, David F. Gleich, James Demmel

TL;DR
This paper introduces a stable and efficient direct QR factorization method for tall-and-skinny matrices in MapReduce environments, improving stability and performance over existing approaches.
Contribution
It presents a new direct TSQR algorithm for MapReduce that is both numerically stable and requires only slightly more than two data passes, unlike previous unstable methods.
Findings
The new TSQR method is more stable than Cholesky QR in MapReduce.
It outperforms the Householder QR method in both theory and practice.
The approach enables efficient SVD computation with minimal data passes.
Abstract
The QR factorization and the SVD are two fundamental matrix decompositions with applications throughout scientific computing and data analysis. For matrices with many more rows than columns, so-called "tall-and-skinny matrices," there is a numerically stable, efficient, communication-avoiding algorithm for computing the QR factorization. It has been used in traditional high performance computing and grid computing environments. For MapReduce environments, existing methods to compute the QR decomposition use a numerically unstable approach that relies on indirectly computing the Q factor. In the best case, these methods require only two passes over the data. In this paper, we describe how to compute a stable tall-and-skinny QR factorization on a MapReduce architecture in only slightly more than 2 passes over the data. We can compute the SVD with only a small change and no difference in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
