Adaptive Encoding Strategies for Erasing-Based Lossless Floating-Point Compression
Ruiyuan Li, Zheng Li, Yi Wu, Chao Chen, Tong Liu, Yu Zheng

TL;DR
This paper introduces Elf*, an optimized adaptive encoding strategy for lossless floating-point time series compression, improving compression ratios and efficiency over existing methods, especially in streaming scenarios.
Contribution
It proposes Elf*, a set of adaptive encoding optimizations with a theoretical proof of optimality, and extends it to Streaming Elf* for high efficiency in streaming environments.
Findings
SElf* achieves 9.2% better compression ratio than competitors.
Elf* ranks among the most competitive batch compressors.
SElf* maintains high efficiency in streaming scenarios.
Abstract
Lossless floating-point time series compression is crucial for a wide range of critical scenarios. Nevertheless, it is a big challenge to compress time series losslessly due to the complex underlying layouts of floating-point values. The state-of-the-art erasing-based compression algorithm Elf demonstrates a rather impressive performance. We give an in-depth exploration of the encoding strategies of Elf, and find that there is still much room for improvement. In this paper, we propose Elf*, which employs a set of optimizations for leading zeros, center bits and sharing condition. Specifically, we develop a dynamic programming algorithm with a set of pruning strategies to compute the adaptive approximation rules efficiently. We theoretically prove that the adaptive approximation rules are globally optimal. We further extend Elf* to Streaming Elf*, i.e., SElf*, which achieves almost the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNumerical Methods and Algorithms · Algorithms and Data Compression · Advanced Data Storage Technologies
