sTiles: An Accelerated Computational Framework for Sparse Factorizations of Structured Matrices
Esmail Abdul Fattah, Hatem Ltaief, Havard Rue, and David Keyes

TL;DR
sTiles is a GPU-accelerated framework that efficiently factorizes sparse structured matrices, achieving significant speedups over existing solvers by leveraging tile algorithms and structure-aware task scheduling.
Contribution
It introduces a novel GPU-based framework with structure-aware task execution and a left-looking Cholesky variant for better parallelism in sparse matrix factorizations.
Findings
Achieves up to 11.08X speedup over PARDISO.
Provides a 5X speedup over a 32-core CPU on GPU.
Effectively handles arrowhead sparse matrices with variable bandwidths.
Abstract
This paper introduces sTiles, a GPU-accelerated framework for factorizing sparse structured symmetric matrices. By leveraging tile algorithms for fine-grained computations, sTiles uses a structure-aware task execution flow to handle challenging arrowhead sparse matrices with variable bandwidths, common in scientific and engineering fields. It minimizes fill-in during Cholesky factorization using permutation techniques and employs a static scheduler to manage tasks on shared-memory systems with GPU accelerators. sTiles balances tile size and parallelism, where larger tiles enhance algorithmic intensity but increase floating-point operations and memory usage, while parallelism is constrained by the arrowhead structure. To expose more parallelism, a left-looking Cholesky variant breaks sequential dependencies in trailing submatrix updates via tree reductions. Evaluations show sTiles…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMatrix Theory and Algorithms · Face and Expression Recognition
