Massively Parallel Computation of Accurate Densities for N-body Dark Matter Simulations using the Phase-Space-Element Method
Ralf Kaehler

TL;DR
This paper introduces a highly scalable, GPU-accelerated method for accurately computing densities in large-scale dark matter N-body simulations using phase-space tessellation, enabling efficient processing of massive datasets on heterogeneous clusters.
Contribution
The paper presents a novel GPU-based implementation of phase-space tessellation for dark matter simulations, with adaptive resolution and scalable load balancing for massive datasets.
Findings
Achieves nearly tenfold speedup over CPU implementations.
Successfully scales to 256 GPUs handling over 400 billion tetrahedra.
Demonstrates efficient density computation for TB-sized simulation snapshots.
Abstract
This paper presents an accurate density computation approach for large dark matter simulations, based on a recently introduced phase-space tessellation technique and designed for massively parallel, heterogeneous cluster architectures. We discuss a memory efficient construction of an oct-tree structure to sample the mass densities with locally adaptive resolution, according to the features of the underlying tetrahedral tessellation. We propose an efficient GPU implementation for the computationally intensive operation of intersecting the tetrahedra with the cubical cells of the deposit grid, that achieves a speedup of almost an order of magnitude compared to an optimized CPU version. We discuss two dynamic load balancing schemes - the first exchanges particle data between cluster nodes and deposits all tetrahedra for each block of the grid structure on single nodes, whereas the second…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
