Copernicus: Characterizing the Performance Implications of Compression Formats Used in Sparse Workloads
Bahar Asgari, Ramyad Hadidi, Joshua Dierberger, Charlotte Steinichen,, Amaan Marfatia, Hyesoon Kim

TL;DR
This paper evaluates how different sparse matrix formats impact performance on domain-specific architectures, specifically using FPGA-based implementations of sparse matrix-vector multiplication, highlighting the trade-offs in decompression and processing efficiency.
Contribution
It provides a comprehensive analysis of seven common sparse formats' performance implications on FPGA-based DSAs, optimizing decompression for fair comparison and exploring multiple metrics.
Findings
Decompression overhead varies significantly across formats.
Certain formats achieve higher throughput but at increased resource cost.
Performance trade-offs depend on workload characteristics and format choice.
Abstract
Sparse matrices are the key ingredients of several application domains, from scientific computation to machine learning. The primary challenge with sparse matrices has been efficiently storing and transferring data, for which many sparse formats have been proposed to significantly eliminate zero entries. Such formats, essentially designed to optimize memory footprint, may not be as successful in performing faster processing. In other words, although they allow faster data transfer and improve memory bandwidth utilization -- the classic challenge of sparse problems -- their decompression mechanism can potentially create a computation bottleneck. Not only is this challenge not resolved, but also it becomes more serious with the advent of domain-specific architectures (DSAs), as they intend to more aggressively improve performance. The performance implications of using various formats…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Cloud Computing and Resource Management · Interconnection Networks and Systems
