Auto-SpMV: Automated Optimizing SpMV Kernels on GPU
Mina Ashoury, Mohammad Loni, Farshad Khunjush, Masoud, Daneshtalab

TL;DR
Auto-SpMV is a framework that optimizes sparse matrix-vector multiplication on GPUs for both low latency and energy efficiency by tuning compilation parameters and selecting optimal sparse formats, using extensive datasets and hyperparameter tuning.
Contribution
It introduces a novel framework that combines compile-time and run-time optimizations for SpMV on GPUs, including the largest dataset for training and hyperparameter fine-tuning for improved performance.
Findings
Auto-SpMV reduces latency by up to 51.9%.
Auto-SpMV decreases energy consumption by up to 52%.
Auto-SpMV improves energy efficiency by up to 99.7%.
Abstract
Sparse matrix-vector multiplication (SpMV) is an essential linear algebra operation that dominates the computing cost in many scientific applications. Due to providing massive parallelism and high memory bandwidth, GPUs are commonly used to accelerate SpMV kernels. Prior studies mainly focused on reducing the latency of SpMV kernels on GPU. However, few attempts have been made to improve the energy efficiency of SpMV kernels, resulting in GPUs being excluded from the range of low-power applications. Furthermore, prior work has primarily focused on optimizing the sparse format of SpMV kernels, the literature ignores evaluating the impact of tweaking compilation parameters. Lastly, Little attention has been paid to preparing a comprehensive training dataset of running SpMV kernels and fine-tuning the learning hyperparameters. To address these limitations, we present a novel framework,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Tensor decomposition and applications · Ferroelectric and Negative Capacitance Devices
