Radiation Hydrodynamics at Scale: Comparing MPI and Asynchronous Many-Task Runtimes with FleCSI
Alexander Strack, Hartmut Kaiser, Dirk Pfl\"uger

TL;DR
This paper compares MPI and asynchronous many-task runtimes within the FleCSI framework, demonstrating that AMTRs can achieve comparable or superior performance to MPI in scalable radiation hydrodynamics simulations.
Contribution
It provides a comprehensive benchmarking of MPI and AMTR backends in FleCSI, highlighting the performance and scalability differences across applications and runtime systems.
Findings
MPI backend achieves over 97% efficiency on large-scale Poisson solver
HPX backend outperforms MPI+Kokkos on small to medium scales in radiation hydrodynamics
Legion backend shows notable overheads and limited scalability
Abstract
Writing efficient distributed code remains a labor-intensive and complex endeavor. To simplify application development, the Flexible Computational Science Infrastructure (FleCSI) framework offers a user-oriented, high-level programming interface that is built upon a task-based runtime model. Internally, FleCSI integrates state-of-the-art parallelization backends, including MPI and the asynchronous many-task runtimes (AMTRs) Legion and HPX, enabling applications to fully leverage asynchronous parallelism. In this work, we benchmark two applications using FleCSI's three backends on up to 1024 nodes, intending to quantify the advantages and overheads introduced by the AMTR backends. As representative applications, we select a simple Poisson solver and the multidimensional radiation hydrodynamics code HARD. In the communication-focused Poisson solver benchmark, FleCSI achieves over 97%…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Distributed and Parallel Computing Systems · Scientific Computing and Data Management
