Accelerating Lattice QCD Simulations using GPUs
Tilmann Matthaei

TL;DR
This paper explores GPU acceleration for lattice QCD simulations, specifically improving the DDalphaAMG solver, achieving a 3x speedup for large lattices, and discusses challenges and potential future improvements.
Contribution
It introduces a new memory access scheme and evaluates GPU acceleration for DDalphaAMG, demonstrating significant speedups for large-scale lattice QCD problems.
Findings
Achieved around 3x speedup for large lattices using GPUs.
Memory-bound implementation limits acceleration benefits.
Large-scale problems are necessary for GPU advantages over CPUs.
Abstract
Solving discretized versions of the Dirac equation represents a large share of execution time in lattice Quantum Chromodynamics (QCD) simulations. Many high-performance computing (HPC) clusters use graphics processing units (GPUs) to offer more computational resources. Our solver program, DDalphaAMG, previously was unable to fully take advantage of GPUs to accelerate its computations. Making use of GPUs for DDalphaAMG is an ongoing development, and we will present some current progress herein. Through a detailed description of our development, this thesis should offer valuable insights into using GPUs to accelerate a memory-bound CPU implementation. We developed a storage scheme for multiple tuples, which allows much more efficient memory access on GPUs, given that the element at the same index is read from multiple tuples simultaneously. Still, our implementation of a discrete Dirac…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParticle physics theoretical and experimental studies · Quantum Chromodynamics and Particle Interactions · High-Energy Particle Collisions Research
