Staggered fermions simulations on GPUs
Claudio Bonati, Guido Cossu, Massimo D'Elia, Adriano Di Giacomo

TL;DR
This paper details the implementation of the RHMC algorithm for staggered fermions on GPUs, performing the entire molecular dynamics trajectory on the GPU to improve performance in lattice QCD simulations.
Contribution
It introduces a GPU-based implementation of the full RHMC algorithm for staggered fermions, including optimization strategies and performance analysis.
Findings
Significant performance improvements over CPU implementations
Identification of bottlenecks in GPU-based fermion simulations
Effective strategies to circumvent computational bottlenecks
Abstract
We present our implementation of the RHMC algorithm for staggered fermions on Graphics Processing Units using the NVIDIA CUDA programming language. While previous studies exclusively deal with the Dirac matrix inversion problem, our code performs the complete MD trajectory on the GPU. After pointing out the main bottlenecks and how to circumvent them, we discuss the performance of our code.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
