Batched QR and SVD Algorithms on GPUs with Applications in Hierarchical   Matrix Compression

Wajih Halim Boukaram; George Turkiyyah; Hatem Ltaief; David E.; Keyes

arXiv:1707.05141·cs.MS·July 18, 2017

Batched QR and SVD Algorithms on GPUs with Applications in Hierarchical Matrix Compression

Wajih Halim Boukaram, George Turkiyyah, Hatem Ltaief, David E., Keyes

PDF

Open Access

TL;DR

This paper develops high-performance GPU algorithms for batched QR and SVD computations, enabling efficient hierarchical matrix compression with significant speedups over existing methods.

Contribution

It introduces GPU-optimized batched QR and SVD algorithms using the Jacobi method and randomized techniques, tailored for hierarchical matrix applications.

Findings

01

Substantial speedups over cuSOLVER SVDs achieved

02

Effective GPU kernels based on memory hierarchy levels implemented

03

Enables efficient H-matrix arithmetic on GPUs

Abstract

We present high performance implementations of the QR and the singular value decomposition of a batch of small matrices hosted on the GPU with applications in the compression of hierarchical matrices. The one-sided Jacobi algorithm is used for its simplicity and inherent parallelism as a building block for the SVD of low rank blocks using randomized methods. We implement multiple kernels based on the level of the GPU memory hierarchy in which the matrices can reside and show substantial speedups against streamed cuSOLVER SVDs. The resulting batched routine is a key component of hierarchical matrix compression, opening up opportunities to perform H-matrix arithmetic efficiently on GPUs.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMatrix Theory and Algorithms · Stochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques