Scalable Multigrid-based Hierarchical Scientific Data Refactoring on GPUs
Jieyang Chen, Lipeng Wan, Xin Liang, Ben Whitney, Qing Liu, Qian Gong,, David Pugmire, Nicholas Thompson, Jong Youl Choi, Matthew Wolf, Todd Munson,, Ian Foster, Scott Klasky

TL;DR
This paper introduces highly optimized GPU kernels for multigrid-based hierarchical data refactoring, enabling scalable, high-throughput data reduction and manipulation for scientific workflows on supercomputers.
Contribution
It presents the first highly optimized GPU implementation of multigrid data refactoring kernels, achieving near-peak throughput and enabling scalable scientific data management.
Findings
Achieves up to 264 TB/s throughput, 92% of theoretical peak.
Enables efficient data creation, access, and reduction on GPUs.
Demonstrates effectiveness in scientific visualization and compression workflows.
Abstract
Rapid growth in scientific data and a widening gap between computational speed and I/O bandwidth makes it increasingly infeasible to store and share all data produced by scientific simulations. Instead, we need methods for reducing data volumes: ideally, methods that can scale data volumes adaptively so as to enable negotiation of performance and fidelity tradeoffs in different situations. Multigrid-based hierarchical data representations hold promise as a solution to this problem, allowing for flexible conversion between different fidelities so that, for example, data can be created at high fidelity and then transferred or stored at lower fidelity via logically simple and mathematically sound operations. However, the effective use of such representations has been hindered until now by the relatively high costs of creating, accessing, reducing, and otherwise operating on such…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed and Parallel Computing Systems · Advanced Data Storage Technologies · Parallel Computing and Optimization Techniques
