GPU Accelerated Sparse Cholesky Factorization

M. Ozan Karsavuran; Esmond G. Ng; Barry W. Peyton

arXiv:2409.14009·cs.DC·February 13, 2025

GPU Accelerated Sparse Cholesky Factorization

M. Ozan Karsavuran, Esmond G. Ng, Barry W. Peyton

PDF

Open Access

TL;DR

This paper explores GPU acceleration techniques for sparse Cholesky factorization, significantly reducing computation time for large-scale scientific problems by offloading dense matrix operations to GPUs.

Contribution

It introduces methods for offloading dense matrix operations in sparse Cholesky factorization to GPUs, achieving up to 4x speedup over CPU-only implementations.

Findings

01

Up to 4x speedup with GPU acceleration

02

Effective offloading of dense matrix operations

03

Potential for improved large-scale scientific computations

Abstract

The solution of sparse symmetric positive definite linear systems is an important computational kernel in large-scale scientific and engineering modeling and simulation. We will solve the linear systems using a direct method, in which a Cholesky factorization of the coefficient matrix is performed using a right-looking approach and the resulting triangular factors are used to compute the solution. Sparse Cholesky factorization is compute intensive. In this work we investigate techniques for reducing the factorization time in sparse Cholesky factorization by offloading some of the dense matrix operations on a GPU. We will describe the techniques we have considered. We achieved up to 4x speedup compared to the CPU-only version.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Advanced Data Compression Techniques · Image Retrieval and Classification Techniques