Low-synchronization Arnoldi Methods for the Matrix Exponential with Application to Exponential Integrators
Tanya Tafolla, St\'ephane Gaudreault, Mayya Tokman

TL;DR
This paper introduces low-synchronization Arnoldi methods to improve the parallel scalability of exponential integrators by reducing global communication during orthogonalization, leading to faster computations for large matrices.
Contribution
It develops low-synchronization Gram-Schmidt algorithms that maintain accuracy while enhancing parallel efficiency in exponential integrators.
Findings
Reduced global communication improves parallel scalability.
Low-synchronization methods match the accuracy of classical Gram-Schmidt.
Experiments show faster time-to-solution in large-scale problems.
Abstract
High order exponential integrators require computing linear combination of exponential like -functions of large matrices times a vector . Krylov projection methods are the most general and remain an efficient choice for computing the matrix-function-vector-product evaluation when the matrix is is large and unable to be explicitly stored, or when obtaining information about the spectrum is expensive. The Krylov approximation relies on the Gram-Schmidt (GS) orthogonalization procedure to produce the orthonormal basis . In parallel, GS orthogonalization requires \textit{global synchronizations} for inner products and vector normalization in the orthogonalization process. Reducing the amount of global synchronizations is of paramount importance for the efficiency of a numerical algorithm in a massively parallel setting. We improve the parallel strong scaling…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMatrix Theory and Algorithms · Numerical methods for differential equations
