GALE: Leveraging Heterogeneous Systems for Efficient Unstructured Mesh Data Analysis
Guoxi Liu, Thomas Randall, Rong Ge, and Federico Iuricich

TL;DR
GALE introduces a GPU-accelerated data structure for unstructured mesh analysis, significantly improving performance by leveraging heterogeneous CPU-GPU systems and task parallelism.
Contribution
It presents the first CUDA-based data structure for mesh connectivity, offloading computation to GPU and enabling faster, memory-efficient analysis.
Findings
GALE achieves up to 2.7x speedup over existing methods.
It maintains memory efficiency while improving performance.
Demonstrates effectiveness on high-core CPU and GPU systems.
Abstract
Unstructured meshes present challenges in scientific data analysis due to irregular distribution and complex connectivity. Computing and storing connectivity information is a major bottleneck for visualization algorithms, affecting both time and memory performance. Recent task-parallel data structures address this by precomputing connectivity information at runtime while the analysis algorithm executes, effectively hiding computation costs and improving performance. However, existing approaches are CPU-bound, forcing the data structure and analysis algorithm to compete for the same computational resources, limiting potential speedups. To overcome this limitation, we introduce a novel task-parallel approach optimized for heterogeneous CPU-GPU systems. Specifically, we offload the computation of mesh connectivity information to GPU threads, enabling CPU threads to focus on executing the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
