TL;DR
sputniPIC is a multi-GPU implicit Particle-in-Cell (PIC) code that runs 200-800x faster than its CPU version, making large-scale 3D plasma simulations feasible on a single multi-GPU node.
Contribution
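The paper introduces sputniPIC, a three-dimensional implicit PIC code optimized for multi-GPU compute nodes. In place of the domain decomposition used by CPU-based implementations, it adopts a particle decomposition data layout that processes particles in batches so that communication overlaps with computation on the GPUs. It also natively supports reduced-precision representations.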
Findings
Achieves a 200-800x speedup over the CPU version.
Reduced precision improves performance by 45-80%.
Enables large-scale 3D plasma simulations on a single multi-GPU node.
Abstract
Large-scale simulations of plasmas are essential for advancing our understanding of fusion devices, space, and astrophysical systems. Particle-in-Cell (PIC) codes have demonstrated their success in simulating numerous plasma phenomena on HPC systems. Today, flagship supercomputers feature multiple GPUs per compute node to achieve unprecedented computing power at high power efficiency. PIC codes require new algorithm design and implementation for exploiting such accelerated platforms. In this work, we design and optimize a three-dimensional implicit PIC code, called sputniPIC, to run on a general multi-GPU compute node. We introduce a particle decomposition data layout, in contrast to domain decomposition on CPU-based implementations, to use particle batches for overlapping communication and computation on GPUs. sputniPIC also natively supports different precision representations to…
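The particle decomposition described in the abstract can be illustrated with a small sketch. This is not sputniPIC's code: it is a minimal one-dimensional Python model in which particles are split into batches by index rather than by spatial position, each hypothetical "device" holds a full copy of the grid, and the per-device grids are summed at the end. The names `Particle`, `split_into_batches`, `assign_batches`, `deposit`, and `deposit_decomposed`, the round-robin assignment, and the nearest-grid-point deposition are all assumptions chosen for illustration. The sketch does not model the asynchronous transfers that let batches overlap communication with computation on real GPUs.

```python
from dataclasses import dataclass


@dataclass
class Particle:
    x: float       # position in [0, length)
    weight: float  # charge weight carried by the particle


def split_into_batches(particles, batch_size):
    """Split particles by index, independent of where they sit in space."""
    return [particles[i:i + batch_size]
            for i in range(0, len(particles), batch_size)]


def assign_batches(batches, n_devices):
    """Assign batches to devices round-robin (an assumed policy)."""
    per_device = [[] for _ in range(n_devices)]
    for i, batch in enumerate(batches):
        per_device[i % n_devices].append(batch)
    return per_device


def deposit(batch, n_cells, length):
    """Nearest-grid-point deposition of one batch onto a full grid copy."""
    grid = [0.0] * n_cells
    dx = length / n_cells
    for p in batch:
        cell = int(p.x / dx) % n_cells
        grid[cell] += p.weight
    return grid


def deposit_decomposed(particles, n_devices, batch_size, n_cells, length):
    """Deposit per batch on each device, then reduce the device grids."""
    batches = split_into_batches(particles, batch_size)
    total = [0.0] * n_cells
    for device_batches in assign_batches(batches, n_devices):
        local = [0.0] * n_cells  # this device's full copy of the grid
        for batch in device_batches:
            partial = deposit(batch, n_cells, length)
            local = [a + b for a, b in zip(local, partial)]
        total = [a + b for a, b in zip(total, local)]  # cross-device sum
    return total


if __name__ == "__main__":
    particles = [Particle(x=(i % 4) + 0.5, weight=1.0) for i in range(10)]
    print(deposit_decomposed(particles, n_devices=2, batch_size=3,
                             n_cells=4, length=4.0))
```

Because every device holds the whole grid, a batch can contain particles from anywhere in the domain, so the work can be balanced by particle count alone. The cost is the final reduction over the per-device grids, which a domain decomposition avoids by keeping each region's particles local.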
