Task-parallel in-situ temporal compression of large-scale computational fluid dynamics data
Heather Pacella, Alec Dunton, Alireza Doostan, Gianluca Iaccarino

TL;DR
This paper introduces a task-parallel in-situ data compression method using matrix ID for large-scale CFD simulations, achieving high compression ratios with minimal runtime impact.
Contribution
It presents a novel single-pass ID algorithm integrated with task-based parallelism for efficient in-situ compression in CFD applications.
Findings
Achieved compression factors over 100 with low error
Demonstrated scalability with negligible runtime increase
Validated on large-scale Navier-Stokes simulations
Abstract
Present day computational fluid dynamics simulations generate extremely large amounts of data, sometimes on the order of TB/s. Often, a significant fraction of this data is discarded because current storage systems are unable to keep pace. To address this, data compression algorithms can be applied to data arrays containing flow quantities of interest to reduce the overall amount of storage. Compression methods either exactly reconstruct the original dataset (lossless compression) or provide an approximate representation of the original dataset (lossy compression). The matrix column interpolative decomposition (ID) can be implemented as a type of lossy compression for data matrices that factors the original data matrix into a product of two smaller factor matrices. One of these matrices consists of a subset of the columns of the original data matrix, while the other is a coefficient…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Numerical Methods and Algorithms · Advanced Data Storage Technologies
