Communication Lower Bounds and Optimal Algorithms for Symmetric Matrix   Computations

Hussam Al Daas (STFC; Scientific Computing Department; Rutherford; Appleton Laboratory; Didcot; UK); Grey Ballard (Wake Forest University,; Computer Science Department; Winston-Salem; NC; USA); Laura Grigori (EPFL,; Institute of Mathematics; Lausanne; Switzerland; PSI; Center for; Scientific Computing; Theory; Data; Villigen; Switzerland); Suraj Kumar; (Institut national de recherche en sciences et technologies du num\'erique,; Lyon; France); Kathryn Rouse (Inmar Intelligence; Winston-Salem; NC; USA); and Mathieu Verite (EPFL; Institute of Mathematics; Lausanne; Switzerland)

arXiv:2409.11304·cs.DC·September 18, 2024

Communication Lower Bounds and Optimal Algorithms for Symmetric Matrix Computations

Hussam Al Daas (STFC, Scientific Computing Department, Rutherford, Appleton Laboratory, Didcot, UK), Grey Ballard (Wake Forest University,, Computer Science Department, Winston-Salem, NC, USA), Laura Grigori (EPFL,, Institute of Mathematics, Lausanne, Switzerland, PSI

PDF

TL;DR

This paper establishes tight communication lower bounds and presents optimal algorithms for symmetric matrix computations like SYRK, SYR2K, and SYMM, crucial in linear algebra applications, using geometric and optimization techniques.

Contribution

It provides the first tight communication bounds for these symmetric matrix operations and designs algorithms that achieve these bounds in both sequential and parallel models.

Findings

01

Derived tight communication lower bounds for SYRK, SYR2K, and SYMM.

02

Developed communication-optimal algorithms matching the bounds.

03

Applied geometric inequalities and nonlinear optimization in proofs.

Abstract

In this article, we focus on the communication costs of three symmetric matrix computations: i) multiplying a matrix with its transpose, known as a symmetric rank-k update (SYRK) ii) adding the result of the multiplication of a matrix with the transpose of another matrix and the transpose of that result, known as a symmetric rank-2k update (SYR2K) iii) performing matrix multiplication with a symmetric input matrix (SYMM). All three computations appear in the Level 3 Basic Linear Algebra Subroutines (BLAS) and have wide use in applications involving symmetric matrices. We establish communication lower bounds for these kernels using sequential and distributed-memory parallel computational models, and we show that our bounds are tight by presenting communication-optimal algorithms for each setting. Our lower bound proofs rely on applying a geometric inequality for symmetric computations…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.