Benchmarking with Supernovae: A Performance Study of the FLASH Code

Joshua Martin; Catherine Feldman; Eva Siegmann; Tony Curtis; David; Carlson; Firat Coskun; Daniel Wood; Raul Gonzalez; Robert J. Harrison; Alan; C. Calder

arXiv:2408.16084·cs.DC·August 30, 2024

Benchmarking with Supernovae: A Performance Study of the FLASH Code

Joshua Martin, Catherine Feldman, Eva Siegmann, Tony Curtis, David, Carlson, Firat Coskun, Daniel Wood, Raul Gonzalez, Robert J. Harrison, Alan, C. Calder

PDF

TL;DR

This paper evaluates the performance and energy efficiency of the FLASH astrophysics simulation code on new high-performance computing hardware, comparing Intel, AMD, and ARM-based systems for simulating supernovae.

Contribution

It provides a comprehensive benchmarking of FLASH on diverse modern hardware architectures, highlighting optimal configurations for performance and energy efficiency in supernova simulations.

Findings

01

Intel Sapphire Rapids CPUs offer high performance for FLASH.

02

ARM-based A64FX processors show competitive energy efficiency.

03

Optimal MPI configurations improve simulation scalability.

Abstract

Astrophysical simulations are computation, memory, and thus energy intensive, thereby requiring new hardware advances for progress. Stony Brook University recently expanded its computing cluster "SeaWulf" with an addition of 94 new nodes featuring Intel Sapphire Rapids Xeon Max series CPUs. We present a performance and power efficiency study of this hardware performed with FLASH: a multi-scale, multi-physics, adaptive mesh-based software instrument. We extend this study to compare performance to that of Stony Brook's Ookami testbed which features ARM-based A64FX-700 processors, and SeaWulf's AMD EPYC Milan and Intel Skylake nodes. Our application is a stellar explosion known as a thermonuclear (Type Ia) supernova and for this 3D problem, FLASH includes operators for hydrodynamics, gravity, and nuclear burning, in addition to routines for the material equation of state. We perform a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.