Study of Critical Slowing-Down in $SU(2)$ Landau Gauge Fixing
Attilio Cucchieri, Tereza Mendes (New York University)

TL;DR
This paper investigates the phenomenon of critical slowing-down in gauge-fixing algorithms for $SU(2)$ lattice gauge theory, analyzing five algorithms across 2D and 4D lattices through numerical and analytical methods.
Contribution
It provides a comparative analysis of five gauge-fixing algorithms and discusses their tuning, offering insights into their performance and behavior in $SU(2)$ Landau gauge fixing.
Findings
Critical slowing-down observed in all algorithms.
Performance varies significantly among the algorithms.
Analytical and numerical results agree on the slowing-down behavior.
Abstract
We study the problem of critical slowing-down for gauge-fixing algorithms (Landau gauge) in lattice gauge theory on and dimensional lattices, both numerically and analytically. We consider five such algorithms, and we measure four different observables. A detailed discussion and analysis of the tuning of these algorithms is also presented.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Study of Critical Slowing-Down in
Landau Gauge Fixing
Attilio Cucchieri
and Tereza Mendes Poster presented by T. Mendes. Department of Physics, New York University, 4 Washington Place, New York, NY 10003, USA
Abstract
We study the problem of critical slowing-down for gauge-fixing algorithms (Landau gauge) in lattice gauge theory on and dimensional lattices, both numerically and analytically. We consider five such algorithms, and we measure four different observables. A detailed discussion and analysis of the tuning of these algorithms is also presented.
1 INTRODUCTION
Here we report on the status of our study [1, 2] of Landau gauge-fixing algorithms. The efficiency of these algorithms is of key importance when gauge-dependent quantities — such as gluon and quark propagators — are evaluated, especially if the effect of Gribov copies is considered.
The main issue regarding the efficiency of these algorithms is the problem of critical slowing-down (CSD), which occurs when the relaxation time of an algorithm diverges as the lattice volume is increased (see for example [3]). Conventional local algorithms have dynamic critical exponent , namely , while global methods may succeed in eliminating CSD completely, i.e. . We consider five different algorithms: the Los Alamos method (a conventional local algorithm), the Cornell method, which is generally believed to have , the overrelaxation and stochastic overrelaxation methods (improved local algorithms, which are expected to show ), and the Fourier acceleration method, which is a global method. We confirm the predictions for (in both two and four dimensions) with the exception of the Cornell method. For this case we obtain that the dynamic critical exponent is actually z\mathrel{\hbox to0.0pt{\lower 3.0pt\hbox{\mathchar 536\relax}\hss}\raise 2.0pt\hbox{\mathchar 316\relax}}1, a result that can be understood by a comparative analysis between the Cornell and the overrelaxation methods. Besides the problem of CSD, we are also interested in understanding which quantities should be used to test the convergence of the gauge fixing, and in finding prescriptions for the tuning of parameters, when needed.
We consider the Standard Wilson action in dimensions:
[TABLE]
In order to fix the Landau gauge, we look for a local minimum of the function [4]
[TABLE]
where
[TABLE]
and we start from g(\mbox{\boldmathx})=1\!\!\!\bot for all , keeping the thermalized configuration \{U_{\mu}(\mbox{\boldmathx})\} fixed. [For the matrices we employ the usual parametrization U\equiv u_{0}\,1\!\!\!\bot+i\,\vec{\mbox{u}}\cdot\vec{\mbox{\sigma}}, where the components of \vec{\mbox{\sigma}} are the three Pauli matrices.]
If is a stationary point of then we have the well-known result
[TABLE]
namely the lattice divergence of each color component of the gauge field
[TABLE]
is null. From equation (4) it follows that
[TABLE]
is constant, namely it is independent of .
The algorithms we consider are all (with the exception of the Fourier acceleration) based on updating a single-site variable g(\mbox{\boldmathy}) at a time. As a function of g(\mbox{\boldmathy}) only, the minimizing function becomes
[TABLE]
where
[TABLE]
is the single-site effective magnetic field. Note that h(\mbox{\boldmathy}) is proportional to an matrix, i.e. we can write
[TABLE]
We also define {\cal T}(\mbox{\boldmathy})\,\equiv\,\mathop{\rm tr}\nolimits[\,g(\mbox{\boldmathy})\,{\widetilde{h}}(\mbox{\boldmathy})\,].
The single-site (multiplicative) update can be written as
[TABLE]
and for the methods we consider we have (see [1] and references therein):
Los Alamos Method: in this case we have g^{(new)}(\mbox{\boldmathy})\,=\,{\widetilde{h}}^{\dagger}(\mbox{\boldmathy}), i.e. this update brings the single-site function {\widetilde{\cal E}}\left[g(\mbox{\boldmathy})\right] to its unique absolute minimum;
Overrelaxation: here the matrix R^{(update)}(\mbox{\boldmathy}) is given by [{\widetilde{h}}^{\dagger}(\mbox{\boldmathy})\,g^{\dagger}(\mbox{\boldmathy})]^{\omega}, with ; notice that the Los Alamos method corresponds to the case in which is equal to one, and that for the value of {\widetilde{\cal E}}\left[g(\mbox{\boldmathy})\right] does not change;
Stochastic Overrelaxation: in this case the update g^{(new)}(\mbox{\boldmathy}) is given by
[TABLE]
with probability , and by {\widetilde{h}}^{\dagger}(\mbox{\boldmathy}) with probability ;
Cornell Method: here R^{(update)}(\mbox{\boldmathy}) is proportional to \left[1\!\!\!\bot-\alpha\,a^{2}\,g_{0}\,\left(\nabla\cdot A^{\left(g\right)}\right)(\mbox{\boldmathy})\right]; in [1] we prove that this can also be written as
[TABLE]
Fourier Acceleration: in this case the matrix R^{(update)}(\mbox{\boldmathy}) is proportional to
[TABLE]
where is the Fourier transform, is defined as , and takes the values . (Of course, to reduce the number of times the Fourier transform is evaluated, a checkerboard update should be employed.)
In order to determine the convergence of the gauge fixing, we measure the relaxation times for the following quantities [1]:
[TABLE]
Here indicates the number of sweeps of the lattice and . We expect to observe that
[TABLE]
and also that all the quantities above have the same relaxation time , and hence the same , .
2 RESULTS FOR
For the four local methods we used [1] lattice sizes , while in the Fourier acceleration case we considered . In all cases we have chosen the constant physics , namely . We stopped the gauge fixing when the condition was satisfied.
From our data it is clear that the four quantities have the same for each given algorithm, as expected. However, for all the local updates — with the exception of the stochastic overrelaxation — the quantity was hardest to relax, namely its value was several orders of magnitude larger than the value of the other quantities. On the contrary, the Fourier acceleration method seems to be very efficient in relaxing . Basing on these results, we think that the quantity allows a very sensible check of the gauge fixing and, in our opinion, it should always be used.
We evaluate the dynamic critical exponents from the weighted least-squares fit for , using lattice sizes , and obtain [1]:
[TABLE]
Clearly, the Fourier acceleration method is the most successful in reducing CSD, and it should be the method of choice when large lattice sizes are employed. However, the method is more costly per iteration than the local methods, and its CPU-time/site increases logarithmically with the lattice size. When this is taken into account, the total time for gauge fixing a configuration is smaller for Fourier acceleration than for the local methods if the lattice size is larger than about 300 sites, as can be seen in Figure 1. [Of course this result is very machine- and code-dependent. In any case, it seems unlikely [1] that the Fourier acceleration method would become the method of choice in two dimensions at lattice sizes smaller than around 100 sites.]
3 TUNING
The need for tuning the parameters of an algorithm in order to get optimal efficiency is, of course, a potential disadvantage. Of the five algorithms we consider, all (but the Los Alamos method) require tuning. We did a careful analysis of this problem and we were able to verify a simple analytic expression for the optimal choice of (overrelaxation method), and to relate to the optimal choice for the parameters of the other local methods. More specifically:
Overrelaxation: We tested the Ansatz [5]
[TABLE]
and obtained very good agreement for lattice sizes greater than 12 with . Note that, as the number of iterations increases, the matrix should approach the identity matrix . In this limit, the overrelaxation update can be written as [1]
[TABLE]
Cornell Method: In this case, in the limit of large , we can write [1]
[TABLE]
By comparing this equation with (12), we made the conjecture , with given by . This Ansatz is very well fitted by our data [1].
Stochastic Overrelaxation: In this case it is not clear how to write a formula for the update as the number of iterations increases. A possibility [1] is to consider
[TABLE]
which suggests the relation . This conjecture seems to be satisfied reasonably well for lattice sizes larger than 30.
Fourier Acceleration: By analogy with the continuum case, we expect [6]
[TABLE]
and therefore with the choice we should obtain . This gives, in , a value . This result is only in qualitative agreement with our data, which seem to indicate that for large lattice sizes.
4 EXTENSION TO
We study (in two and in four dimensions) the case , namely we fix U_{\mu}(\mbox{\boldmathx})=1\!\!\!\bot in (3) for all and , and we look for a local minimum of the function (2) starting from a random configuration \{g(\mbox{\boldmathx})\}. In both the two and four dimensional cases, we essentially confirm the results obtained for at finite and in two dimensions [2].
It is interesting to notice that, in the case , we can also write the minimizing function as
[TABLE]
Here the matrix g=g_{0}1\!\!\!\bot+i\,\vec{\mbox{\sigma}}\cdot\vec{\mbox{g}} is considered as a four-dimensional unit vector (g_{0}\mbox{,}\,\vec{\mbox{g}}) and \|g\|^{2}\equiv g_{0}^{2}+\vec{\mbox{g}}\cdot\vec{\mbox{g}}. When we are close to a minimum we can write
[TABLE]
where \vec{\mbox{f}}(\mbox{\boldmathx}) is a three-vector field. This gives
[TABLE]
and
[TABLE]
namely (up to order ) we have the action of a three-vector massless free field \vec{\mbox{f}}(\mbox{\boldmathx}). Therefore we can use standard analytic methods [7] in order to study the problem of minimizing this quadratic form, and to compare the results for , and the optimal choice of the parameters with our numerical data. Indeed, we find good agreement for all the quantities and methods considered [2].
Finally, we extend our simulations to the case (again in two and in four dimensions) [2]. All our results at are confirmed except for the Cornell method, for which we get , and for the Fourier method, which shows . In the Cornell case this result can be easily understood, since at its single-site update does not decrease the minimizing function at each step. In particular, the value of increases if \alpha{\cal N}(\mbox{\boldmathy}) is large than two. [This is obvious if we consider the analogy between the Cornell method and the overrelaxation method, and the relation \alpha{\cal N}(\mbox{\boldmathy})\sim\omega. Moreover, it is plausible that only at very small values of we can have smaller than two and, at the same time, \alpha{\cal N}(\mbox{\boldmathy})>2 for a large set of lattice sites .] However, we can recover the value by redefining the Cornell single-site update in the following way [2]
[TABLE]
with \tilde{\alpha}(\mbox{\boldmathy})\equiv\min\left(\,\alpha\,{\cal N}(\mbox{\boldmathy})\mbox{,}\;2\,\right). In the Fourier acceleration case we have the same problem: the update can increase the value of the minimizing function , and this happens most likely for small values of . However, in this case we did not find a redefinition of the Fourier update [2] which could recover the value .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. Cucchieri and T. Mendes, Nucl. Phys. B 471 (1996) 263.
- 2[2] A. Cucchieri and T. Mendes, Critical Slowing-Down in Four-Dimensional S U ( 2 ) 𝑆 𝑈 2 SU(2) Landau Gauge Fixing Algorithms , in preparation.
- 3[3] L. Adler, Nucl. Phys. B (Proc. Suppl.) 9 (1989) 437; U. Wolff, Nucl. Phys. B (Proc. Suppl.) 17 (1990) 93; A. D. Sokal, Nucl. Phys. B (Proc. Suppl.) 20 (1991) 55.
- 4[4] K. G. Wilson, Recent Developments in Gauge Theories Proc. NATO Advanced Study Institute (Carges̀e, 1979), eds. G. ’t Hooft et al. (Plenum Press, New York–London, 1980).
- 5[5] J. E. Mandula and M. Ogilvie, Phys. Lett. B 248 (1990) 156.
- 6[6] C. T. H. Davies et al., Phys. Rev. D 37 (1988) 1581.
- 7[7] H. Neuberger, Phys. Rev. Lett. 59 (1987) 1877; U. Wolff, Phys. Lett. B 288 (1992) 166.
