On the performance of a highly-scalable Computational Fluid Dynamics code on AMD, ARM and Intel processors
Pablo Ouro, Unai Lopez-Novoa, Martyn Guest

TL;DR
This paper evaluates the performance and scalability of a state-of-the-art Computational Fluid Dynamics code across AMD, ARM, and Intel HPC clusters, revealing that AMD EPYC offers superior performance, while ARM-based systems show promising scalability.
Contribution
It provides a comparative analysis of CFD code performance on AMD, ARM, and Intel processors, highlighting the strengths and scalability of each architecture in HPC environments.
Findings
EPYC cluster outperforms others in all benchmarks.
Intel Skylake has better strong scalability.
ARM-based TX2 shows promising scalability.
Abstract
No area of computing is hungrier for performance than High Performance Computing (HPC), the demands of which continue to be a major driver for processor performance and adoption of accelerators, and also advances in memory, storage, and networking technologies. A key feature of the Intel processor domination of the past decade has been the extensive adoption of GPUs as coprocessors, whilst more recent developments have seen the increased availability of a number of CPU processors, including the novel ARM-based chips. This paper analyses the performance and scalability of a state-of-the-art Computational Fluid Dynamics (CFD) code on three HPC cluster systems equipped with AMD EPYC-Rome (EPYC, 4096 cores), ARM-based Marvell ThunderX2 (TX2, 8192 cores) and Intel Skylake (SKL, 8000 cores) processors. Three benchmark cases are designed with increasing computation-to-communication ratio and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
