Graph Expansion and Communication Costs of Fast Matrix Multiplication
Grey Ballard, James Demmel, Olga Holtz, Oded Schwartz

TL;DR
This paper establishes fundamental lower bounds on the communication costs of fast matrix multiplication algorithms, linking these costs to the expansion properties of their computation graphs, and demonstrates these bounds are achievable.
Contribution
It introduces a novel graph-theoretic approach to derive tight lower bounds on communication costs for fast matrix multiplication algorithms, including Strassen's and others.
Findings
Derived lower bounds on communication costs for sequential algorithms.
Extended bounds to parallel algorithms with multiple processors.
Showed bounds are attainable by existing fast algorithms in linear algebra.
Abstract
The communication cost of algorithms (also known as I/O-complexity) is shown to be closely related to the expansion properties of the corresponding computation graphs. We demonstrate this on Strassen's and other fast matrix multiplication algorithms, and obtain first lower bounds on their communication costs. In the sequential case, where the processor has a fast memory of size , too small to store three -by- matrices, the lower bound on the number of words moved between fast and slow memory is, for many of the matrix multiplication algorithms, , where is the exponent in the arithmetic count (e.g., for Strassen, and for conventional matrix multiplication). With parallel processors, each with fast memory of size , the lower bound is times smaller. These bounds are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
