Accelerating Sparse Matrix-Matrix Multiplication on GPUs with Processing Near HBMs
Shiju Li, Younghoon Min, Hane Yie, Hoshik Kim, Soohong Ahn, Joonseop Sim, Chul-Ho Lee, Jongryool Kim

TL;DR
This paper introduces a hardware-software co-designed framework with near-memory processing techniques to accelerate sparse matrix-matrix multiplication on GPUs, significantly improving performance for graph analytics and GNN workloads.
Contribution
It presents a novel Hash based Multi-phase SpGEMM and AIA technique, optimizing irregular memory access patterns on GPU HBM, outperforming existing methods.
Findings
Up to 17.3% time reduction in graph analytics applications
76.5% and 58.4% speedups for Graph Contraction and Markov Clustering
Average 1.43x speedup for GNN training across multiple datasets
Abstract
Sparse General Matrix-Matrix Multiplication (SpGEMM) is a fundamental operation in numerous scientific computing and data analytics applications, often bottlenecked by irregular memory access patterns. This paper presents Hash based Multi-phase SpGEMM on GPU and the Acceleration of Indirect Memory Access (AIA) technique, a novel custom near-memory processing approach to optimizing SpGEMM on GPU HBM. Our hardware-software co-designed framework for SpGEMM demonstrates significant performance improvements over state-of-the-art methods, particularly in handling complex, application-specific workloads. We evaluate our approach on various graph workloads, including graph contraction, Markov clustering, and Graph Neural Networks (GNNs), showcasing its practical applicability. For graph analytics applications, AIA demonstrates up to 17.3% time reduction from the software-only implementation,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGraph Theory and Algorithms · Advanced Graph Neural Networks · Cloud Computing and Resource Management
