Sgap: Towards Efficient Sparse Tensor Algebra Compilation for GPU
Genghan Zhang, Yuetong Zhao, Yanting Tao, Zhongming Yu, Guohao Dai,, Sitao Huang, Yuan Wen, Pavlos Petoumenos, Yu Wang

TL;DR
This paper introduces Sgap, a novel GPU sparse tensor algebra compiler framework that enhances reduction semantics and optimization space exploration, leading to significant performance improvements in sparse matrix multiplication.
Contribution
Sgap proposes segment group and atomic parallelism techniques to elevate reduction semantics and optimize sparse tensor algebra compilation on GPUs, surpassing existing methods.
Findings
Up to 1.2x speedup on TACO's SpMM kernels.
Achieved 1.6x - 2.3x speedup on dgSPARSE library.
Enhanced reduction semantics and optimization space exploration.
Abstract
Sparse compiler is a promising solution for sparse tensor algebra optimization. In compiler implementation, reduction in sparse-dense hybrid algebra plays a key role in performance. Though GPU provides various reduction semantics that can better utilize the parallel computing and memory bandwidth capacity, the central question is: how to elevate the flexible reduction semantics to sparse compilation theory that assumes serial execution. Specifically, we have to tackle two main challenges: (1) there are wasted parallelism by adopting static synchronization granularity (2) static reduction strategy limits optimization space exploration. We propose Sgap: segment group and atomic parallelism to solve these problems. Atomic parallelism captures the flexible reduction semantics to systematically analyze the optimization space of sparse-dense hybrid algebra on GPU. It is a new optimization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Advanced Data Storage Technologies · Cloud Computing and Resource Management
