A Framework for Lattice QCD Calculations on GPUs
F. T. Winter, M. A. Clark, R. G. Edwards, B. Jo\'o

TL;DR
This paper presents a high-level software framework enabling the efficient porting of lattice QCD calculations to GPUs, significantly accelerating computations while maintaining code portability and ease of development.
Contribution
It introduces a novel reimplementation of the low-level lattice QCD software layer using JIT compilation, allowing unaltered application execution on GPUs.
Findings
Achieved over tenfold acceleration of gauge-generation algorithms on GPU clusters.
Enabled unaltered execution of existing lattice QCD applications on GPU hardware.
Demonstrated effective software reuse and portability across different GPU architectures.
Abstract
Computing platforms equipped with accelerators like GPUs have proven to provide great computational power. However, exploiting such platforms for existing scientific applications is not a trivial task. Current GPU programming frameworks such as CUDA C/C++ require low-level programming from the developer in order to achieve high performance code. As a result porting of applications to GPUs is typically limited to time-dominant algorithms and routines, leaving the remainder not accelerated which can open a serious Amdahl's law issue. The lattice QCD application Chroma allows to explore a different porting strategy. The layered structure of the software architecture logically separates the data-parallel from the application layer. The QCD Data-Parallel software layer provides data types and expressions with stencil-like operations suitable for lattice field theory and Chroma implements…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
