High Performance Direct Gravitational N-body Simulations on Graphics Processing Units -- II: An implementation in CUDA
Robert G. Belleman (1), Jeroen Bedorf (1), Simon Portegies Zwart (1), ((1) UvA)

TL;DR
This paper demonstrates that modern GPUs, specifically the NVIDIA GeForce 8800GTX, can effectively perform direct gravitational N-body simulations, offering a cost-effective alternative to specialized hardware like GRAPE-6Af, with competitive accuracy and performance for large N.
Contribution
The authors implement and evaluate gravitational N-body simulations on a GPU using CUDA, showing significant performance gains and comparable accuracy to specialized hardware.
Findings
GPU outperforms GRAPE-6Af for N > 512 with softening
Energy conservation is within one part in 10^6 on GPU
GPU achieves about 100x speedup over CPU for N ≥ 10^5
Abstract
We present the results of gravitational direct -body simulations using the Graphics Processing Unit (GPU) on a commercial NVIDIA GeForce 8800GTX designed for gaming computers. The force evaluation of the -body problem is implemented in ``Compute Unified Device Architecture'' (CUDA) using the GPU to speed-up the calculations. We tested the implementation on three different -body codes: two direct -body integration codes, using the 4th order predictor-corrector Hermite integrator with block time-steps, and one Barnes-Hut treecode, which uses a 2nd order leapfrog integration scheme. The integration of the equations of motions for all codes is performed on the host CPU. We find that for particles the GPU outperforms the GRAPE-6Af, if some softening in the force calculation is accepted. Without softening and for very small integration time steps the GRAPE still…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
