Sparse $L^1$-Autoencoders for Scientific Data Compression
Matthias Chung, Rick Archibald, Paul Atzberger, Jack Michael, Solomon

TL;DR
This paper introduces sparse $L^1$-autoencoders leveraging high-dimensional, $L^1$-regularized latent spaces to achieve efficient scientific data compression, significantly reducing data size while preserving accuracy and mitigating artifacts.
Contribution
The paper proposes a novel autoencoder architecture with $L^1$ regularization in high-dimensional latent spaces for improved scientific data compression, addressing artifact mitigation and achieving high compression ratios.
Findings
Achieved around 100x compression ratios on SAS datasets.
Demonstrated mitigation of blurring and artifacts in compressed data.
Showed potential for use in high-performance distributed computing environments.
Abstract
Scientific datasets present unique challenges for machine learning-driven compression methods, including more stringent requirements on accuracy and mitigation of potential invalidating artifacts. Drawing on results from compressed sensing and rate-distortion theory, we introduce effective data compression methods by developing autoencoders using high dimensional latent spaces that are -regularized to obtain sparse low dimensional representations. We show how these information-rich latent spaces can be used to mitigate blurring and other artifacts to obtain highly effective data compression methods for scientific data. We demonstrate our methods for short angle scattering (SAS) datasets showing they can achieve compression ratios around two orders of magnitude and in some cases better. Our compression methods show promise for use in addressing current bottlenecks in transmission,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Compression Techniques · Algorithms and Data Compression · Computational Physics and Python Applications
