ParamSpMM: Adaptive and Efficient Sparse Matrix-Matrix Multiplication on GPUs for GNNs

Lixing Zhang; Guanhua Ye; Hongzheng Li; Shigang Li; Yingxia Shao

arXiv:2605.15695·cs.DC·May 18, 2026

ParamSpMM: Adaptive and Efficient Sparse Matrix-Matrix Multiplication on GPUs for GNNs

Lixing Zhang, Guanhua Ye, Hongzheng Li, Shigang Li, Yingxia Shao

PDF

TL;DR

ParamSpMM is a novel adaptive GPU-based sparse matrix multiplication method for GNNs, utilizing a new data structure and ML-based decision system to optimize performance across diverse inputs.

Contribution

It introduces ParamSpMM with a new data structure and ML-based configuration predictor, enabling highly adaptive and efficient SpMM computations for GNNs.

Findings

01

Outperforms Nvidia cuSPARSE with an average speedup of 1.92x

02

Effectively adapts to diverse input characteristics in GNNs

03

Enhances GNN training efficiency significantly

Abstract

Fueled by the ability to mine real-world graph data, GNN applications have experienced phenomenal growth. Sparse Matrix-Matrix Multiplication (SpMM) is a critical operator in GNNs. However, existing SpMM designs for GNNs struggle to adapt to diverse input characteristics. In this paper, we first conduct a comprehensive analysis of existing SpMM optimizations, revealing their limitations through statistical and empirical evidence. Based on this analysis, we introduce ParamSpMM, a parametric approach for highly adaptive and efficient SpMM computation in GNNs. It incorporates a new data structure, the Parameterized Compressed Sparse Row (PCSR), to flexibly integrate existing optimization techniques. ParamSpMM enables the configuration of these optimization techniques according to various input characteristics. Furthermore, we complement ParamSpMM with an ML-based SpMM-decider that predicts…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.