Highly Parallel Sparse Matrix-Matrix Multiplication
Ayd{\i}n Bulu\c{c}, John R. Gilbert

TL;DR
This paper introduces highly parallel algorithms for sparse matrix-matrix multiplication, enabling scalable performance on thousands of processors, which benefits high-performance graph algorithms and linear solvers.
Contribution
It presents the first parallel algorithms with increasing speedups for unbounded processors, utilizing a novel hypersparse kernel and 2D block distribution.
Findings
Achieves scalable performance up to thousands of processors
Demonstrates effectiveness on diverse test scenarios
Provides a state-of-the-art MPI implementation
Abstract
Generalized sparse matrix-matrix multiplication is a key primitive for many high performance graph algorithms as well as some linear solvers such as multigrid. We present the first parallel algorithms that achieve increasing speedups for an unbounded number of processors. Our algorithms are based on two-dimensional block distribution of sparse matrices where serial sections use a novel hypersparse kernel for scalability. We give a state-of-the-art MPI implementation of one of our algorithms. Our experiments show scaling up to thousands of processors on a variety of test scenarios.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Distributed and Parallel Computing Systems · Interconnection Networks and Systems
