Distributed N-body Simulation on the Grid Using Dedicated Hardware
Derek Groen, Simon Portegies Zwart, Steve McMillan, Jun Makino

TL;DR
This paper evaluates the performance of large-scale gravitational N-body simulations on a global grid with specialized hardware, developing a model to predict performance and demonstrating feasibility for future large simulations.
Contribution
It introduces a performance model for N-body simulations on distributed grids with specialized hardware, enabling prediction of execution times across different infrastructures.
Findings
High network bandwidth dominates simulation performance over latency.
Adding GRAPE hardware significantly reduces computation time.
Predicted simulation of a few million particles can be completed in about ten hours.
Abstract
We present performance measurements of direct gravitational N -body simulation on the grid, with and without specialized (GRAPE-6) hardware. Our inter-continental virtual organization consists of three sites, one in Tokyo, one in Philadelphia and one in Amsterdam. We run simulations with up to 196608 particles for a variety of topologies. In many cases, high performance simulations over the entire planet are dominated by network bandwidth rather than latency. With this global grid of GRAPEs our calculation time remains dominated by communication over the entire range of N, which was limited due to the use of three sites. Increasing the number of particles will result in a more efficient execution. Based on these timings we construct and calibrate a model to predict the performance of our simulation on any grid infrastructure with or without GRAPE. We apply this model to predict the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed and Parallel Computing Systems · Parallel Computing and Optimization Techniques · Scientific Research and Discoveries
