Exploring Autoencoder-based Error-bounded Compression for Scientific Data
Jinyang Liu, Sheng Di, Kai Zhao, Sian Jin, Dingwen Tao, Xin Liang,, Zizhong Chen, Franck Cappello

TL;DR
This paper introduces a convolutional autoencoder framework for error-bounded lossy compression of scientific data, achieving significantly better compression ratios while maintaining data fidelity, compared to existing methods.
Contribution
It develops an error-bounded autoencoder-based compression framework tailored for scientific data, optimizing model parameters and demonstrating superior performance over existing solutions.
Findings
Achieves 100% to 800% better compression ratio than SZ2.1 and ZFP at similar data distortion.
Provides an in-depth analysis of autoencoder characteristics for error-bounded compression.
Demonstrates competitive compression quality on five real-world scientific datasets.
Abstract
Error-bounded lossy compression is becoming an indispensable technique for the success of today's scientific projects with vast volumes of data produced during simulations or instrument data acquisitions. Not only can it significantly reduce data size, but it also can control the compression errors based on user-specified error bounds. Autoencoder (AE) models have been widely used in image compression, but few AE-based compression approaches support error-bounding features, which are highly required by scientific applications. To address this issue, we explore using convolutional autoencoders to improve error-bounded lossy compression for scientific data, with the following three key contributions. (1) We provide an in-depth investigation of the characteristics of various autoencoder models and develop an error-bounded autoencoder-based framework in terms of the SZ model. (2) We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Compression Techniques · Algorithms and Data Compression · Advanced Data Storage Technologies
