Optimization of the Sparse Multi-Threaded Cholesky Factorization for A64FX
Valentin Le F\`evre (1), Tetsuzo Usui (2), Marc Casas (1) ((1), Barcelona Supercomputing Center, (2) Fujitsu Limited)

TL;DR
This paper introduces OPT-D-COST, a dynamic algorithm that optimizes task granularity in parallel sparse Cholesky factorization, significantly improving performance on A64FX processors.
Contribution
The paper presents a novel method called selective nesting and an automatic algorithm OPT-D-COST for optimizing parallel sparse Cholesky factorization based on matrix structure.
Findings
Achieves an average speedup of 1.46× over state-of-the-art methods.
Effectively leverages matrix sparsity for workload optimization.
Demonstrates improved performance on a diverse set of 60 matrices.
Abstract
Sparse linear algebra routines are fundamental building blocks of a large variety of scientific applications. Direct solvers, which are methods for solving linear systems via the factorization of matrices into products of triangular matrices, are commonly used in many contexts. The Cholesky factorization is the fastest direct method for symmetric and definite positive matrices. This paper presents selective nesting, a method to determine the optimal task granularity for the parallel Cholesky factorization based on the structure of sparse matrices. We propose the OPT-D-COST algorithm, which automatically and dynamically applies selective nesting. OPT-D-COST leverages matrix sparsity to drive complex task-based parallel workloads in the context of direct solvers. We run an extensive evaluation campaign considering a heterogeneous set of 60 sparse matrices and a parallel machine featuring…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMatrix Theory and Algorithms · Parallel Computing and Optimization Techniques · Cloud Computing and Resource Management
