A High-Throughput GPU Framework for Adaptive Lossless Compression of Floating-Point Data
Zheng Li (Chongqing University), Weiyan Wang (Chongqing University), Ruiyuan Li (Chongqing University), Chao Chen (Chongqing University), Xianlei Long (Chongqing University), Linjiang Zheng (Chongqing University), Quanqing Xu (OceanBase, Ant Group), Chuanhui Yang (OceanBase

TL;DR
This paper presents a GPU-based adaptive lossless compression framework for floating-point data, achieving high throughput and better compression ratios by addressing data movement, conversion accuracy, and sparsity challenges.
Contribution
It introduces a novel GPU framework with asynchronous I/O, error-free float-to-integer conversion, and adaptive sparse encoding, improving compression performance over existing methods.
Findings
Achieves 9.1% better compression ratio than state-of-the-art.
Provides 2.4x higher compression throughput.
Offers 2.4x higher decompression throughput.
Abstract
The torrential influx of floating-point data from domains like IoT and HPC necessitates high-performance lossless compression to mitigate storage costs while preserving absolute data fidelity. Leveraging GPU parallelism for this task presents significant challenges, including bottlenecks in heterogeneous data movement, complexities in executing precision-preserving conversions, and performance degradation due to anomaly-induced sparsity. To address these challenges, this paper introduces a novel GPU-based framework for floating-point adaptive lossless compression. The proposed solution employs three key innovations: a lightweight asynchronous pipeline that effectively hides I/O latency during CPU-GPU data transfer; a fast and theoretically guaranteed float-to-integer conversion method that eliminates errors inherent in floating-point arithmetic; and an adaptive sparse bit-plane encoding…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNumerical Methods and Algorithms · Parallel Computing and Optimization Techniques · Advanced Data Storage Technologies
