Studying Effective String Theory using deep generative models
Michele Caselle, Elia Cellini, Alessandro Nada

TL;DR
This paper explores Effective String Theory for confinement in Yang-Mills theory using deep generative models, providing numerical insights into flux tube properties that are analytically challenging.
Contribution
It introduces a novel numerical approach employing deep generative models to study EST, specifically applying it to compute the flux tube width in the Nambu-Goto model.
Findings
Numerical estimation of flux tube width in EST.
Validation of deep learning methods for non-perturbative QCD phenomena.
Potential for extending numerical EST studies to other observables.
Abstract
Effective String Theory (EST) offers a robust non-perturbative framework for describing confinement in Yang-Mills theory by treating the confining flux tube between a static quark-antiquark pair as a thin, vibrating string. While EST calculations are typically carried out using zeta-function regularization, certain problems-such as determining the flux tube width-are too complex to solve analytically. However, recent studies have demonstrated that EST can be explored numerically by employing deep learning techniques based on generative algorithms. In this work, we provide a brief introduction to EST and this novel numerical approach. Finally, we present results for the width of the Nambu-Got\"o EST.
| 0.991(2) | 0.55(5) | 0.25007(4) | -0.032(1) | 0.1579(5) | 1.02 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Physics and Python Applications · Algorithms and Data Compression
[a]Elia Cellini
Studying Effective String Theory using deep generative models
Michele Caselle
Alessandro Nada
Abstract
Effective String Theory (EST) offers a robust non-perturbative framework for describing confinement in Yang-Mills theory by treating the confining flux tube between a static quark-antiquark pair as a thin, vibrating string. While EST calculations are typically carried out using zeta-function regularization, certain problems—such as determining the flux tube width—are too complex to solve analytically. However, recent studies have demonstrated that EST can be explored numerically by employing deep learning techniques based on generative algorithms. In this work, we provide a brief introduction to EST and this novel numerical approach. Finally, we present results for the width of the Nambu-Gotō EST.
1 Introduction
The study of confinement is essential for advancing our understanding of the non-perturbative regime of Quantum Chromodynamics (QCD), particularly in explaining the absence of isolated color charges and the emergence of hadronic bound states. Among the various theoretical approaches to study confinement in pure Yang-Mills theories, such as the study of monopoles and center of vortex [1, 2, 3, 4], a powerful framework emerged in the past few years is represented by Effective String Theory (EST). In the latter, the chromoelectric flux tube joining couples of quark-antiquark is modeled as a vibrating string [5, 6, 7, 8, 9]. In particular, the Polyakov loops correlators at distance are associated with the full partition function of an EST action:
[TABLE]
The most straightforward choice for the EST action is the Nambu-Gotō string. While it exhibits anomalies at the quantum level, these disappear in the large-distance limit. Various analytical investigations of the Nambu-Gotō partition function, along with lattice computations of pure gauge theories, have demonstrated that EST serves as a highly predictive effective model [10, 11, 12, 13, 14, 15]. However, certain aspects—such as analyzing the flux tube width—pose challenges that analytical methods alone cannot resolve. A different numerical strategy for studying EST consists of discretizing the EST functional on the lattice and subsequently applying Markov Chain Monte Carlo (MCMC) techniques to sample from the associated probability distribution. Nevertheless, due to the strong non-linearity inherent in EST actions, conventional MCMC approaches experience critical slowing down when simulating complex EST regimes [16].
In order to address the numerical problems of lattice regularized EST, refs. [16, 17] presented a new numerical method based on Deep Learning (DL) sampling techniques. DL, together with quantum computing, provide a novel and promising route to overcome the problems of standard lattice simulations. The former, in particular through a class of algorithms called Normalizing Flows (NFs) [18], have been proposed to tackle critical slowing down while the latter allows for real-time sign problem free simulations of lattice gauge theories through the Hamiltonian formalism [19, 20].
NFs are a class of deep generative models that can generate uncorrelated samples from a given target distribution [21]. Additionally, they provide a means to compute observables, including partition functions, which remain inaccessible to conventional MCMC methods [22, 23].
Although flow-based samplers offer significant advantages, they encounter difficulties when scaling to state-of-the-art applications [24, 25, 26]. These challenges can be addressed by integrating Normalizing Flows (NFs) with non-equilibrium simulations rooted in out-of-equilibrium statistical mechanics [27, 28, 29, 30, 31, 32, 33], leading to the development of Stochastic Normalizing Flows (SNFs) [34, 35]. SNFs have shown remarkable scalability across both scalar and gauge theories on the lattice, positioning them among the most promising samplers in lattice field theory [17, 36, 37]. Notably, in ref. [17], SNFs successfully overcame the scaling limitations encountered in the first study of lattice EST [16].
In this proceeding, we first briefly review the concepts of the lattice Nambu-Gotō EST and NFs; afterward, we present numerical results for the width of the EST, a correlation function related to the density of the chromoelectric flux tube.
2 Lattice Nambu-Gotō String
For a target Yang-Mills theory, the corresponding Nambu-Gotō action, when regularized on a two-dimensional lattice using a “physical gauge,” takes the form [16]:
[TABLE]
where denotes a square lattice of dimensions with index describing the worldsheet of the string, and a lattice spacing set to . The field represents the transverse degrees of freedom of the string, while corresponds to the string tension, which serves as the coupling parameter of the theory. We impose periodic boundary conditions along the temporal direction of length , and Dirichlet boundary conditions along the spatial direction of length , representing the Polyakov loops and . With this setup, the physical temperature can be identified as the inverse of .
The width of the regularized Nambu-Gotō string can be computed as:
[TABLE]
where the expectation value is evaluated also over the temporal extension (because of the translation invariance imposed by the periodic boundary conditions). The only analytical solution for this observable is a perturbative calculation up to the next-to-leading order [38, 39, 40]. In the high-temperature regime the expected behaviour is
[TABLE]
where is a new scale which emerges from the regularization of the correlator defining the string width and represents the limit under which the EST prediction are not valid.
In the high-temperature limit it has been conjectured, using the Svetitsky-Yaffe mapping of the lattice gauge theory into a suitable two-dimensional spin model, that the terms multiplying the factor can be resummed [41]:
[TABLE]
where:
[TABLE]
encodes the dependence of the string tension on the physical temperature. Thus, the behaviour of the linear term in of the Nambu-Gotō string width is expected to be:
[TABLE]
A reliable numerical test of the next-the-leading order and of the conjecture is the main goal of this contribution.
3 Flow-based samplers
In recent years, flow-based samplers have emerged as one of the most promising deep learning-based approach to overcome the problems of standard lattice simulations [21, 23, 42]. Normalizing Flows (NFs) [18], are a class of deep generative models that allows for the exact evaluation of the density , of the generated samples. The main idea behind NFs is to optimize the variational density to approximate a given target Boltzmann distribution .
The training procedure in done by minimizing with respect to the parameters of the NFs the so-called reverse Kullback-Leibler divercence:
[TABLE]
Once the training is done, one can compute asymptotically unbiased estimator for the vacuum expectation values over the target using a re-weighting procedure generally called Importance Sampling (IS) in the machine learning field [22, 23]:
[TABLE]
Among the various NF architectures, two remarkable examples in lattice field theory are Continuous NFs (CNFs) [43, 44] and Stochastic NFs (SNFs) [34, 35, 37]. The former generate the samples using Neural Ordinary Differential Equation [43] while the latter combine standard NFs with Non-Equilibrium Markov Chain Monte Carlo algorithms [27, 28, 29, 30, 31, 32, 33]. An intriguing aspect of SNFs is that, in this setup, the IS described above generalizes to the Jarzynski’s equality [45], a milestone in the field of non-equilibrium thermodynamics.
In EST, CNFs have been used in the first work on this subject to provide a proof-of-concept for the feasibility of the application of flow-based samplers to EST lattice calculations [16]. On the other hand, SNFs represent right now the most promising class of flow-based samplers and they have been used in ref. [17] to obtain state-of-the-art numerical results for the width and shape of the effective string. In the following, we show numerical results for the width obtained with both, CNFs and SNFs.
4 Numerical Results
In this section, we study the width of the Nambu-Gotō effective string using CNFs and SNFs in the high-temperature regime: . With the CNFs, we used same architecture, as mention in ref. [16]. However, we found poor scaling in the string tension , and we could study only values . Nevertheless, as we shall see below, in this -regime we could observe the linear broadening with of the width (see fig. 1) and, furthermore, we could observe the next-to-leading order correction computed in refs. [38, 39, 40]. We fit the results using the expression:
[TABLE]
where the expected values for the leading term are and , while for the next-to-leading correction, we have . The remaining terms correspond to non-universal contributions that emerge in the lattice-regularized EST. Results are reported in table 1.
To isolate the next-to-leading contribution, we examined the following quantity:
[TABLE]
where we utilized the best-fit values for the coefficients , , , and , averaging over different values of , as indicated by the notation . Specifically, we assume that the entire dependence of the width on is captured by the term in the denominator. Consequently, is expected to follow the behavior [38, 39, 40]. In fig. 2, we present this quantity as a function of for three different values of , alongside the expected behavior derived from the two-loop calculation [38, 39, 40]. The plots clearly illustrate the agreement with theoretical predictions.
On the other hand, using SNFs, we could overcome the scaling issues of the CNFs, and reach small values of . With this setup, we performed a study with , we fitted the dependence of our results with the following expression:
[TABLE]
Then, we fit the coefficient using
[TABLE]
where is an undetermined bulk constant and , should take the conjectured value . Results of the fit are reported in table 2. The results remarkably agrees with the conjectured prediction . This finding can be clearly observed in fig. 3 as well, where the numerical prediction , is compared with the conjecture behaviour .
5 Conclusion
In this contribution, using a rather unusual but totally new approach to EST calculations based on deep generative models, we show numerical results for the width of the Nambu-Gotō EST in the high-temperature regime, a challenging calculation for standard analytical method. Using a CNF architecture we provide a numerical test for the next-to-leading order of the width computed analytical in ref. [38, 39, 40]. Afterward, using a SNF, we reached small and provided a numerical proof for the conjecture provided in ref. [41].
The method used in this contribution could be extended to investigate other observables, as the shape of the flux tube [17, 46] as well as other models. Namely, an intriguing direction for future application is the extension of our approach to the case to study four-dimensional lattice gauge theories and the so-called axionic string [47, 48]. Another interesting candidate for further numerical studies are the so-called higher-order corrections to the Nambu-Gotō theory, a more general class of EST that provide a better description of the confining flux tube close to the critical point [12, 49, 17].
Acknowledgments
We thank Andrea Bulgarelli, Marco Panero, Dario Panfalone and Lorenzo Verzichelli for several insightful discussions. We acknowledge support from the SFT Scientific Initiative of INFN. This work was partially supported by the Simons Foundation grant 994300 (Simons Collaboration on Confinement and QCD Strings) and by the European Union - Next Generation EU, Mission 4 Component 1, CUPD53D23002970006, under the Italian PRIN “Progetti di Ricerca di Rilevante Interesse Nazionale – Bando 2022” prot. 2022ZTPK4E.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] J. A. Mickley, C. Allton, R. Bignell and D. B. Leinweber, Center vortex evidence for a second finite-temperature QCD transition , Phys. Rev. D 111 (2025) 034508 [ 2411.19446 ]. · doi ↗
- 2[2] X. Crean, J. Giansiracusa and B. Lucini, Topological data analysis of monopole current networks in U ( 1 ) U(1) lattice gauge theory , Sci Post Phys. 17 (2024) 100 [ 2403.07739 ]. · doi ↗
- 3[3] D. R. Junior and L. E. Oxman, Ensembles of center vortices and chains: Insights from a natural lattice framework , 2411.04325 .
- 4[4] J. A. Mickley, D. B. Leinweber and D. Nogradi, Center vortices and the SU ( 3 ) \mathrm{SU}(3) conformal window , 2501.11279 .
- 5[5] Y. Nambu, Strings, Monopoles and Gauge Fields , Phys. Rev. D 10 (1974) 4262 . · doi ↗
- 6[6] T. Goto, Relativistic quantum mechanics of one-dimensional mechanical continuum and subsidiary condition of dual resonance model , Prog. Theor. Phys. 46 (1971) 1560 . · doi ↗
- 7[7] M. Luscher, Symmetry Breaking Aspects of the Roughening Transition in Gauge Theories , Nucl. Phys. B 180 (1981) 317 . · doi ↗
- 8[8] M. Luscher, K. Symanzik and P. Weisz, Anomalies of the Free Loop Wave Equation in the WKB Approximation , Nucl. Phys. B 173 (1980) 365 . · doi ↗
