Revisiting Huffman Coding: Toward Extreme Performance on Modern GPU Architectures
Jiannan Tian, Cody Rivera, Sheng Di, Jieyang Chen, Xin Liang, Dingwen, Tao, Franck Cappello

TL;DR
This paper presents a GPU-optimized Huffman encoding method that significantly improves throughput for data compression in high-performance computing applications by parallelizing codebook construction and utilizing GPU memory bandwidth.
Contribution
It introduces novel GPU-based parallel algorithms for Huffman codebook construction and encoding, achieving substantial performance gains over existing GPU and CPU implementations.
Findings
Up to 6.8X throughput improvement on NVIDIA V100 GPU
Effective scaling of codebook construction with input symbols
Significant speedup over multi-threaded CPU Huffman encoder
Abstract
Today's high-performance computing (HPC) applications are producing vast volumes of data, which are challenging to store and transfer efficiently during the execution, such that data compression is becoming a critical technique to mitigate the storage burden and data movement cost. Huffman coding is arguably the most efficient Entropy coding algorithm in information theory, such that it could be found as a fundamental step in many modern compression algorithms such as DEFLATE. On the other hand, today's HPC applications are more and more relying on the accelerators such as GPU on supercomputers, while Huffman encoding suffers from low throughput on GPUs, resulting in a significant bottleneck in the entire data processing. In this paper, we propose and implement an efficient Huffman encoding approach based on modern GPU architectures, which addresses two key challenges: (1) how to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Advanced Data Compression Techniques · Advanced Image and Video Retrieval Techniques
