Low-Synch Gram-Schmidt with Delayed Reorthogonalization for Krylov Solvers
Daniel Bielich, Julien Langou, Stephen Thomas, Kasia Swirydowicz,, Ichitaro Yamazaki, Erik G. Boman

TL;DR
This paper introduces a novel orthogonalization scheme for Krylov subspace methods that reduces global reductions from three to one per iteration, enhancing parallel scalability in eigenvalue computations.
Contribution
A new variant of the classical Gram-Schmidt process that requires only one global reduction per iteration for Arnoldi-based Krylov methods, improving parallel efficiency.
Findings
Significant strong-scaling improvements demonstrated for eigenvalue problems.
The proposed method maintains numerical stability unlike previous approaches.
Empirical results show reduced communication overhead in parallel environments.
Abstract
The parallel strong-scaling of Krylov iterative methods is largely determined by the number of global reductions required at each iteration. The GMRES and Krylov-Schur algorithms employ the Arnoldi algorithm for nonsymmetric matrices. The underlying orthogonalization scheme is left-looking and processes one column at a time. Thus, at least one global reduction is required per iteration. The traditional algorithm for generating the orthogonal Krylov basis vectors for the Krylov-Schur algorithm is classical Gram Schmidt applied twice with reorthogonalization (CGS2), requiring three global reductions per step. A new variant of CGS2 that requires only one reduction per iteration is applied to the Arnoldi-QR iteration. Strong-scaling results are presented for finding eigenvalue-pairs of nonsymmetric matrices. A preliminary attempt to derive a similar algorithm (one reduction per Arnoldi…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMatrix Theory and Algorithms · Electromagnetic Scattering and Analysis · Optical Network Technologies
