Error bounded compression for weather and climate applications
Langwen Huang, Luigi Fusco, Florian Scheidl, Jan Zibell, Michael Armand Sprenger, Sebastian Schemm, Torsten Hoefler

TL;DR
EBCC is a novel error-bounded compression method tailored for weather and climate data, achieving high compression ratios while maintaining specified error bounds, thus enabling efficient storage and analysis of massive datasets.
Contribution
Introduces EBCC, a two-layer compression approach with feedback control, optimized for weather and climate data, outperforming existing methods in error control and compression ratio.
Findings
EBCC achieves compression ratios from 15x to over 300x.
It maintains errors within natural variability for climate data.
Outperforms other methods across multiple benchmarks.
Abstract
As the resolution of weather and climate simulations increases, the amount of data produced is growing rapidly from hundreds of terabytes to tens of petabytes. The huge size becomes a limiting factor for broader adoption, and its fast growth rate will soon exhaust all the available storage devices. To address these issues, we present EBCC (Error Bounded Climate-data Compressor). It follows a two-layer approach: a base compression layer using JPEG2000 to capture the bulk of the data with a high compression ratio, and a residual compression layer using wavelet transform and SPIHT encoding to efficiently eliminate long-tail extreme errors introduced by the base compression layer. It incorporates a feedback rate-control mechanism for both layers that adjusts compression ratios to achieve the specified maximum error target. We evaluate EBCC alongside other established compression methods on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
