Adaptive Configuration of In Situ Lossy Compression for Cosmology   Simulations via Fine-Grained Rate-Quality Modeling

Sian Jin; Jesus Pulido; Pascal Grosset; Jiannan Tian; Dingwen Tao,; James Ahrens

arXiv:2104.00178·cs.DC·April 22, 2021

Adaptive Configuration of In Situ Lossy Compression for Cosmology Simulations via Fine-Grained Rate-Quality Modeling

Sian Jin, Jesus Pulido, Pascal Grosset, Jiannan Tian, Dingwen Tao,, James Ahrens

PDF

TL;DR

This paper introduces an adaptive, in situ lossy compression method for cosmology simulations that optimizes error bounds per data partition to maximize compression ratio while preserving post-analysis accuracy.

Contribution

It presents a novel adaptive approach with models and optimization techniques for partition-wise lossy compression in cosmological simulations, achieving high compression with minimal overhead.

Findings

01

Achieves up to 73% higher compression ratio

02

Maintains post-analysis quality with only 1% overhead

03

Models are highly accurate and reliable

Abstract

Extreme-scale cosmological simulations have been widely used by today's researchers and scientists on leadership supercomputers. A new generation of error-bounded lossy compressors has been used in workflows to reduce storage requirements and minimize the impact of throughput limitations while saving large snapshots of high-fidelity data for post-hoc analysis. In this paper, we propose to adaptively provide compression configurations to compute partitions of cosmological simulations with newly designed post-analysis aware rate-quality modeling. The contribution is fourfold: (1) We propose a novel adaptive approach to select feasible error bounds for different partitions, showing the possibility and efficiency of adaptively configuring lossy compression for each partition individually. (2) We build models to estimate the overall loss of post-analysis result due to lossy compression and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.