Evaluating Portable Parallelization Strategies for Heterogeneous Architectures in High Energy Physics
Mohammad Atif, Meghna Battacharya, Paolo Calafiura, Taylor Childers, Mark Dewing, Zhihua Dong, Oliver Gutsche, Salman Habib, Kyle Knoepfel, Matti Kortelainen, Ka Hei Martin Kwok, Charles Leggett, Meifeng Lin, Vincent Pascuzzi, Alexei Strelchenko, Vakhtang Tsulaia, Brett Viren

TL;DR
This paper evaluates portable parallelization strategies, such as Kokkos, SYCL, and OpenMP, for high-energy physics applications, with the aim of supporting multiple architectures without rewriting code for each one.
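To make the single-source idea concrete, here is a minimal sketch using OpenMP target offload, one of the approaches named above. It is an illustration only and is not taken from the paper: the `saxpy` function and its arguments are hypothetical. The assumption is that a compiler built with offload support runs the loop on an attached accelerator, while a compiler without OpenMP enabled ignores the pragma and runs the same loop serially on the CPU.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical kernel for illustration: y[i] = a * x[i] + y[i].
// The loop body is written once; the pragma asks the compiler to offload it.
void saxpy(float a, const std::vector<float>& x, std::vector<float>& y) {
    assert(x.size() == y.size());

    // Raw pointers are used because OpenMP map clauses describe array sections.
    const float* xp = x.data();
    float* yp = y.data();
    const std::size_t n = x.size();

    // Copy x to the device, copy y to the device and back.
    // If OpenMP is not enabled, this pragma is ignored and the loop runs serially.
    #pragma omp target teams distribute parallel for map(to: xp[0:n]) map(tofrom: yp[0:n])
    for (std::size_t i = 0; i < n; ++i) {
        yp[i] = a * xp[i] + yp[i];
    }
}
```

Calling `saxpy(2.0f, x, y)` updates `y` in place. The same source file can then be built for different targets by changing only compiler flags, which is the property the portability layers in this study are meant to provide.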
Contribution
It provides a comprehensive comparison of portability solutions using real HEP applications, guiding future software development in the field.
Findings
Performance metrics vary across solutions and architectures.
Porting challenges include API complexity and build system integration.
Some solutions show promising portability with acceptable performance.
Abstract
High-energy physics (HEP) experiments have developed millions of lines of code over decades that are optimized to run on traditional x86 CPU systems. However, we are seeing a rapidly increasing fraction of floating-point computing power in leadership-class computing facilities and traditional data centers coming from new accelerator architectures, such as GPUs. HEP experiments are now faced with the untenable prospect of rewriting millions of lines of x86 CPU code for the increasingly dominant architectures found in these computational accelerators. This task is made more challenging by the architecture-specific languages and APIs promoted by manufacturers such as NVIDIA, Intel and AMD. Producing multiple, architecture-specific implementations is not a viable scenario, given the available person power and code maintenance issues. The Portable Parallelization Strategies team of the…
Taxonomy
Topics: Distributed and Parallel Computing Systems · Advanced Data Storage Technologies · Parallel Computing and Optimization Techniques
