Porting HPC Applications to AMD Instinct™ MI300A Using Unified Memory and OpenMP
Suyash Tandon, Leopold Grinberg, Gheorghe-Teodor Bercea, Carlo Bertolli, Mark Olesen, Simone Bnà, Nicholas Malaya

TL;DR
This paper demonstrates how to port HPC applications to AMD MI300A APUs using unified memory and OpenMP, simplifying development and improving performance for data center workloads.
Contribution
It provides a programming blueprint leveraging unified memory and OpenMP 5.2 for AMD MI300A APUs, highlighting differences from traditional GPU approaches.
Findings
Unified memory simplifies application development.
OpenFOAM was successfully ported to MI300A using OpenMP.
The approach enhances performance and maintainability.
Abstract
AMD Instinct MI300A is the world's first data center accelerated processing unit (APU) with memory shared between the AMD "Zen 4" EPYC cores and third generation CDNA compute units. A single memory space offers several advantages: i) it eliminates the need for data replication and costly data transfers, ii) it substantially simplifies application development and allows an incremental acceleration of applications, iii) it is easy to maintain, and iv) its potential can be well realized via the abstractions in the OpenMP 5.2 standard, where the host and the device data environments can be unified in a more performant way. In this article, we provide a blueprint of the APU programming model leveraging unified memory and highlight key distinctions compared to the conventional approach with discrete GPUs. OpenFOAM, an open-source C++ library for computational…
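As a rough illustration of the programming model the abstract describes, the sketch below offloads a simple loop with OpenMP while relying on a single address space. It is not taken from the paper or from OpenFOAM: the function name `axpy_unified` is hypothetical, and the sketch assumes a compiler with OpenMP offload support and a system where `requires unified_shared_memory` can be honored. Without such support the pragmas are ignored or the loop falls back to the host, and the result is the same.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Assumption: the toolchain supports OpenMP "requires". The directive states
// that host and device share one address space, so host pointers can be
// dereferenced inside target regions without map clauses.
#pragma omp requires unified_shared_memory

// Hypothetical helper (not from the paper): y[i] += a * x[i].
// On a discrete GPU, a loop like this would typically need
// map(to: ...) / map(tofrom: ...) clauses or explicit data regions to copy
// x and y between host and device memory.
void axpy_unified(double a, const std::vector<double>& x, std::vector<double>& y) {
    const double* xp = x.data();   // host allocation, used directly on the device
    double* yp = y.data();
    // Guard against mismatched lengths so the loop never reads past x.
    const std::size_t n = std::min(x.size(), y.size());

    // Offload the loop; no data-mapping clauses are needed because memory is shared.
    #pragma omp target teams distribute parallel for
    for (std::size_t i = 0; i < n; ++i) {
        yp[i] += a * xp[i];
    }
}
```

Because the containers stay ordinary host allocations, a loop like this can be accelerated on its own while the rest of the code is left untouched, which is the incremental porting style the abstract refers to.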
Taxonomy
Topics: Distributed and Parallel Computing Systems · Advanced Data Storage Technologies
