CODAG: Characterizing and Optimizing Decompression Algorithms for GPUs
Jeongmin Park, Zaid Qureshi, Vikram Mailthody, Andrew Gacek, Shunfan, Shao, Mohammad AlMasri, Isaac Gelado, Jinjun Xiong, Chris Newburn, I-hsin, Chung, Michael Garland, Nikolay Sakharnykh, Wen-mei Hwu

TL;DR
This paper introduces CODAG, a new GPU decompression architecture that improves throughput by optimizing resource utilization and simplifying algorithm integration, outperforming existing methods across multiple compression techniques.
Contribution
Proposes CODAG, a simple, efficient GPU kernel architecture for high-throughput decompression that enhances resource utilization and ease of algorithm integration.
Findings
CODAG achieves up to 13.46x speedup over state-of-the-art decompressors.
It effectively supports multiple compression algorithms with improved performance.
The architecture reduces GPU resource underutilization during decompression.
Abstract
Data compression and decompression have become vital components of big-data applications to manage the exponential growth in the amount of data collected and stored. Furthermore, big-data applications have increasingly adopted GPUs due to their high compute throughput and memory bandwidth. Prior works presume that decompression is memory-bound and have dedicated most of the GPU's threads to data movement and adopted complex software techniques to hide memory latency for reading compressed data and writing uncompressed data. This paper shows that these techniques lead to poor GPU resource utilization as most threads end up waiting for the few decoding threads, exposing compute and synchronization latencies. Based on this observation, we propose CODAG, a novel and simple kernel architecture for high throughput decompression on GPUs. CODAG eliminates the use of specialized groups of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Parallel Computing and Optimization Techniques · Advanced Data Storage Technologies
