Erasing-based lossless compression method for streaming floating-point time series
Ruiyuan Li, Zheng Li, Yi Wu, Chao Chen, Songtao Guo, Ming Zhang, Yu, Zheng

TL;DR
This paper introduces Elf and Elf+, innovative lossless compression algorithms for streaming floating-point time series data that leverage erasing bits to enhance compression ratios while maintaining efficiency.
Contribution
The paper presents a novel erasing-based lossless compression method, Elf, and its optimized version Elf+ for floating-point time series, with rigorous analysis and superior experimental performance.
Findings
Elf and Elf+ achieve higher compression ratios than existing methods.
Both algorithms operate in linear time and constant space.
Extensive experiments validate their effectiveness across multiple datasets.
Abstract
There are a prohibitively large number of floating-point time series data generated at an unprecedentedly high rate. An efficient, compact and lossless compression for time series data is of great importance for a wide range of scenarios. Most existing lossless floating-point compression methods are based on the XOR operation, but they do not fully exploit the trailing zeros, which usually results in an unsatisfactory compression ratio. This paper proposes an Erasing-based Lossless Floating-point compression algorithm, i.e., Elf. The main idea of Elf is to erase the last few bits (i.e., set them to zero) of floating-point values, so the XORed values are supposed to contain many trailing zeros. The challenges of the erasing-based method are three-fold. First, how to quickly determine the erased bits? Second, how to losslessly recover the original data from the erased ones? Third, how to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting · Advanced Data Storage Technologies · Parallel Computing and Optimization Techniques
