Parallel Performance of ARM ThunderX2 for Atomistic Simulation Algorithms
William Robert Saunders, James Grant, Eike Hermann M\"uller

TL;DR
This paper evaluates the parallel performance of key atomistic simulation algorithms on ARM ThunderX2 hardware, comparing it to traditional x86_64 systems to inform hardware suitability for scientific computing.
Contribution
It provides a performance assessment of atomistic simulation algorithms on ARM ThunderX2, demonstrating their efficiency and portability across diverse hardware architectures.
Findings
ARM ThunderX2 shows competitive performance for atomistic simulations.
Performance portability is achievable with the developed framework.
Insights into hardware suitability for scientific simulations.
Abstract
Atomistic simulation drives scientific advances in modern material science and accounts for a significant proportion of wall time on High Performance Computing facilities. It is important that algorithms are efficient and implementations are performant in a continuously diversifying hardware landscape. Furthermore, they have to be portable to make best use of the available computing resource. In this paper we assess the parallel performance of some key algorithms implemented in a performance portable framework developed by us. We consider Molecular Dynamics with short range interactions, the Fast Multipole Method and Kinetic Monte Carlo. To assess the performance of emerging architectures, we compare the Marvell ThunderX2 (ARM) architecture to traditional x86_64 hardware made available through the Azure cloud computing service.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParallel Computing and Optimization Techniques · Ion-surface interactions and analysis · Scientific Research and Discoveries
