CEAZ: Accelerating Parallel I/O via Hardware-Algorithm Co-Designed Adaptive Lossy Compression
Chengming Zhang, Sian Jin, Tong Geng, Jiannan Tian, Ang Li, Dingwen, Tao

TL;DR
CEAZ is a hardware-algorithm co-designed lossy compression system for scientific data on FPGAs that achieves high compression ratios and throughput, significantly accelerating parallel I/O in HPC systems.
Contribution
This work introduces the first FPGA-based adaptive lossy compressor with a novel Huffman coding approach and precise ratio control, optimized for scientific data in HPC environments.
Findings
CEAZ achieves 2.3X higher throughput than previous FPGA compressors.
CEAZ attains 3.0X better compression ratio.
Parallel I/O throughput improves up to 37.8X.
Abstract
As HPC systems continue to grow to exascale, the amount of data that needs to be saved or transmitted is exploding. To this end, many previous works have studied using error-bounded lossy compressors to reduce the data size and improve the I/O performance. However, little work has been done for effectively offloading lossy compression onto FPGA-based SmartNICs to reduce the compression overhead. In this paper, we propose a hardware-algorithm codesign for an efficient and adaptive lossy compressor for scientific data on FPGAs (called CEAZ), which is the first lossy compressor that can achieve high compression ratios and throughputs simultaneously. Specifically, we propose an efficient Huffman coding approach that can adaptively update Huffman codewords online based on codewords generated offline, from a variety of representative scientific datasets. Moreover, we derive a theoretical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Storage Technologies · Algorithms and Data Compression · Parallel Computing and Optimization Techniques
