Highly Parallel Sparse Matrix-Matrix Multiplication

Ayd{\i}n Bulu\c{c}; John R. Gilbert

arXiv:1006.2183·cs.DC·August 9, 2016·24 cites

Highly Parallel Sparse Matrix-Matrix Multiplication

Ayd{\i}n Bulu\c{c}, John R. Gilbert

PDF

Open Access

TL;DR

This paper introduces highly parallel algorithms for sparse matrix-matrix multiplication, enabling scalable performance on thousands of processors, which benefits high-performance graph algorithms and linear solvers.

Contribution

It presents the first parallel algorithms with increasing speedups for unbounded processors, utilizing a novel hypersparse kernel and 2D block distribution.

Findings

01

Achieves scalable performance up to thousands of processors

02

Demonstrates effectiveness on diverse test scenarios

03

Provides a state-of-the-art MPI implementation

Abstract

Generalized sparse matrix-matrix multiplication is a key primitive for many high performance graph algorithms as well as some linear solvers such as multigrid. We present the first parallel algorithms that achieve increasing speedups for an unbounded number of processors. Our algorithms are based on two-dimensional block distribution of sparse matrices where serial sections use a novel hypersparse kernel for scalability. We give a state-of-the-art MPI implementation of one of our algorithms. Our experiments show scaling up to thousands of processors on a variety of test scenarios.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsParallel Computing and Optimization Techniques · Distributed and Parallel Computing Systems · Interconnection Networks and Systems