cuLGT: Lattice Gauge Fixing on GPUs

Hannes Vogt; Mario Schr\"ock

arXiv:1412.3655·hep-lat·December 12, 2014

cuLGT: Lattice Gauge Fixing on GPUs

Hannes Vogt, Mario Schr\"ock

PDF

Open Access

TL;DR

This paper presents an optimized GPU implementation for lattice gauge fixing in SU(3) and SU(2) theories, achieving high performance and improved usability for large-scale simulations.

Contribution

The authors develop an enhanced CUDA-based code for gauge fixing that significantly increases performance and ease of integration compared to previous versions.

Findings

01

Achieves up to 470 GFlops on GTX 580 GPU.

02

Uses 4 or 8 threads per lattice site for parallelism.

03

Improves performance and usability of gauge fixing code.

Abstract

We adopt CUDA-capable Graphic Processing Units (GPUs) for Landau, Coulomb and maximally Abelian gauge fixing in 3+1 dimensional SU(3) and SU(2) lattice gauge field theories. A combination of simulated annealing and overrelaxation is used to aim for the global maximum of the gauge functional. We use a fine grained degree of parallelism to achieve the maximum performance: instead of the common 1 thread per site strategy we use 4 or 8 threads per lattice site. Here, we report on an improved version of our publicly available code (www.cuLGT.com and github.com/culgt) which again increases performance and is much easier to include in existing code. On the GeForce GTX 580 we achieve up to 470 GFlops (utilizing 80% of the theoretical peak bandwidth) for the Landau overrelaxation code.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSuperconducting Materials and Applications · Medical Imaging Techniques and Applications · Physics of Superconductivity and Magnetism