Learning Sparse Approximate Inverse Preconditioners for Conjugate Gradient Solvers on GPUs
Zherui Yang, Zhehao Li, Kangbo Lyu, Yixuan Li, Tao Du, Ligang Liu

TL;DR
This paper introduces a GPU-friendly, learning-based method for constructing sparse approximate inverse preconditioners using GNNs, significantly accelerating conjugate gradient solutions for large linear systems.
Contribution
The authors propose a novel GNN-based approach to generate preconditioners that avoid triangular solves, improving GPU parallelization and convergence speed for CG methods.
Findings
Achieves 40%-53% reduction in GPU solution time.
Outperforms standard preconditioners and previous learning methods.
Demonstrates strong generalization across multiple datasets.
Abstract
The conjugate gradient solver (CG) is a prevalent method for solving symmetric and positive definite linear systems Ax=b, where effective preconditioners are crucial for fast convergence. Traditional preconditioners rely on prescribed algorithms to offer rigorous theoretical guarantees, while limiting their ability to exploit optimization from data. Existing learning-based methods often utilize Graph Neural Networks (GNNs) to improve the performance and speed up the construction. However, their reliance on incomplete factorization leads to significant challenges: the associated triangular solve hinders GPU parallelization in practice, and introduces long-range dependencies which are difficult for GNNs to model. To address these issues, we propose a learning-based method to generate GPU-friendly preconditioners, particularly using GNNs to construct Sparse Approximate Inverse (SPAI)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMatrix Theory and Algorithms · Tensor decomposition and applications · Model Reduction and Neural Networks
