Evaluating Application Characteristics for GPU Portability Layer Selection
Mohammad Atif, Meghna Bhattacharya, Mark Dewing, Zhihua Dong, Julien Esseiva, Oliver Gutsche, Matti Kortelainen, Ka Hei Martin Kwok, Charles Leggett, Meifeng Lin, Aleksei Strelchenko, Vakhang Tsulaia, Brett Viren, Tianle Wang, Haiwang Yu

TL;DR
This paper analyzes how different GPU portability layers perform with various application characteristics, helping developers choose the most suitable technology for their heterogeneous computing tasks.
Contribution
It provides a detailed study of application and kernel characteristics affecting portability layer performance, guiding informed selection of GPU programming frameworks.
Findings
Different portability layers excel at different application types.
Application characteristics significantly influence layer performance.
The study offers criteria for choosing the appropriate portability technology.
Abstract
GPUs have become the dominant source of computing power for high performance computing and are increasingly being used across the High Energy Physics computing landscape for a wide variety of tasks. Though NVIDIA is currently the main provider of GPUs, AMD and Intel are rapidly increasing their market share. As a result, programming using a vendor-specific language such as CUDA can significantly reduce deployment choices. There are a number of portability layers such as Kokkos, Alpaka, SYCL, OpenMP and std::par that permit execution on a broad range of GPU and CPU architectures, significantly increasing the flexibility of application programmers. However, each of these portability layers has its own characteristics, performing better at some tasks and worse at others, or placing limitations on aspects of the application. In this presentation, we report on a study of application and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Advanced Data Storage Technologies · Cloud Computing and Resource Management
