PSelInv - A Distributed Memory Parallel Algorithm for Selected Inversion: the non-symmetric Case
Mathias Jacquelin, Lin Lin, Chao Yang

TL;DR
This paper introduces a parallel algorithm, PSelInv, for efficiently computing selected elements of the inverse of large sparse non-symmetric matrices on distributed memory systems, with scalable performance demonstrated up to 6,400 cores.
Contribution
It extends the selected inversion algorithm to non-symmetric matrices and implements a memory-efficient, scalable parallel version using asynchronous communication.
Findings
Scales efficiently to 6,400 cores for various matrices.
Uses tree-based asynchronous communication to optimize load balancing.
Does not assume symmetry, broadening applicability.
Abstract
This paper generalizes the parallel selected inversion algorithm called PSelInv to sparse non- symmetric matrices. We assume a general sparse matrix A has been decomposed as PAQ = LU on a distributed memory parallel machine, where L, U are lower and upper triangular matrices, and P, Q are permutation matrices, respectively. The PSelInv method computes selected elements of A-1. The selection is confined by the sparsity pattern of the matrix AT . Our algorithm does not assume any symmetry properties of A, and our parallel implementation is memory efficient, in the sense that the computed elements of A-T overwrites the sparse matrix L+U in situ. PSelInv involves a large number of collective data communication activities within different processor groups of various sizes. In order to minimize idle time and improve load balancing, tree-based asynchronous communication is used to coordinate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Stochastic Gradient Optimization Techniques · Distributed and Parallel Computing Systems
