Analysis of A Splitting Approach for the Parallel Solution of Linear Systems on GPU Cards
Ang Li, Radu Serban, Dan Negrut

TL;DR
This paper introduces SaP::GPU, a GPU-based parallel solver for large banded linear systems that partitions matrices for independent factorization, demonstrating competitive efficiency against established solvers.
Contribution
The paper presents a novel split and parallelize approach for solving large linear systems on GPUs, with an open-source implementation that performs well against standard solvers.
Findings
SaP::GPU is robust and efficient for large systems.
It compares favorably with PARDISO, SuperLU, and MUMPS.
Performs well on dense banded systems near diagonal dominance.
Abstract
We discuss an approach for solving sparse or dense banded linear systems on a Graphics Processing Unit (GPU) card. The matrix is possibly nonsymmetric and moderately large; i.e., . The () approach seeks to partition the matrix into diagonal sub-blocks , , which are independently factored in parallel. The solution may choose to consider or to ignore the matrices that couple the diagonal sub-blocks . This approach, along with the Krylov subspace-based iterative method that it preconditions, are implemented in a solver called , which is compared in terms of efficiency with three commonly used sparse direct solvers: , , and . , which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMatrix Theory and Algorithms · Electromagnetic Scattering and Analysis · Numerical methods for differential equations
