Adaptive Sketching Based Construction of H2 Matrices on GPUs
Wajih Halim Boukaram, Yang Liu, Pieter Ghysels, Xiaoye Sherry Li

TL;DR
This paper introduces a GPU-accelerated, bottom-up sketching algorithm for constructing H2 matrices, achieving significant speedups over CPU and existing GPU methods, and is the first of its kind.
Contribution
It presents the first GPU implementation of bottom-up sketching-based H2 matrix construction with novel batched kernel design for variable data structures.
Findings
Up to 13x speedup over CPU implementation
Up to 1000x speedup over existing GPU H2 construction
Up to 660x speedup over ButterflyPACK algorithm
Abstract
We develop a novel linear-complexity bottom-up sketching-based algorithm for constructing a matrix, and present its high performance GPU implementation. The construction algorithm requires both a black-box sketching operator and an entry evaluation function. The novelty of our GPU approach centers around the design and implementation of the above two operations in batched mode on GPU with accommodation for variable-size data structures in a batch. The batch algorithms minimize the number of kernel launches and maximize the GPU throughput. When applied to covariance matrices, volume IE matrices and update operations, our proposed GPU implementation achieves up to speedup over our CPU implementation, and up to speedup over an existing GPU implementation of the top-down sketching-based algorithm from the H2Opus library. It also achieves a …
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Graph Theory and Algorithms · Parallel Computing and Optimization Techniques
