Non-local non-linear sigma models
Steven S. Gubser, Christian B. Jepsen, Ziming Ji, Brian Trundy, and, Amos Yarom

TL;DR
This paper investigates non-local non-linear sigma models across dimensions, revealing unique renormalization properties and potential links to M2-brane dynamics, with implications for both Archimedean and ultrametric spaces.
Contribution
It introduces a scale-invariant non-local sigma model framework, analyzing one-loop divergences and renormalization without metric renormalization, and discusses possible applications to M2-branes.
Findings
One-loop divergences are canceled by bi-local terms.
Metric renormalization is absent in the non-local case.
Potential connections to M2-brane dynamics are proposed.
Abstract
We study non-local non-linear sigma models in arbitrary dimension, focusing on the scale invariant limit in which the scalar fields naturally have scaling dimension zero, so that the free propagator is logarithmic. The classical action is a bi-local integral of the square of the arc length between points on the target manifold. One-loop divergences can be canceled by introducing an additional bi-local term in the action, proportional to the target space laplacian of the square of the arc length. The metric renormalization that one encounters in the two-derivative non-linear sigma model is absent in the non-local case. In our analysis, the target space manifold is assumed to be smooth and Archimedean; however, the base space may be either Archimedean or ultrametric. We comment on the relation to higher derivative non-linear sigma models and speculate on a possible application to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Contents
1 Introduction
Scalar field theories over the reals with bi-local kinetic terms were introduced in [1], and the recent work [2] provides a useful point of entry into the extensive literature. Similar field theories over the -adic numbers were considered in [3] as a continuum description of Dyson’s hierarchical model [4]. A unifying point of view on the bi-local vector model was provided in [5], showing that the standard large development can be framed in terms that are largely independent of whether the theory is formulated over the reals or the -adics. The present work extends the study of bi-local theories to bi-local non-linear sigma models, starting with the action
[TABLE]
where is the distance function on the -dimensional base space and is the distance function on the target manifold. In the limit , where the theory (1) becomes classically scale invariant, we find logarithmic divergences in one-loop diagrams which can be canceled by counterterms that can be expressed in terms of the target space laplacian of the square of the distance function, together with field redefinitions.111An exception, as we will see, is when is an even integer and the base space . Through a procedure we will outline in section 12, one recovers in this case a local non-linear sigma model, and at least for we can use our results to check the standard analysis [6] of the one-loop beta function.
Ricci flatness suppresses the one-loop divergences that we encounter, so in a sense (and with significant caveats) we may claim that we are deriving the vacuum Einstein equations from conformal invariance, as in [6]. Our work was partly motivated by the more recent results of [7], which were derived for a nearest neighbor arc length model on the Bruhat-Tits tree—in other words, on the other side of the -adic AdS/CFT duality [8, 9] from our results for field theories over the -adic numbers. However, the particular structure of counterterm we find suggests that renormalization of our theories have less to do with renormalization of the local metric as normally understood (i.e. Ricci flow) than with an augmentation of the action (1) to include the target space laplacian of .
A conservative expectation is that once non-local terms are allowed in a field theory, they proliferate and the theory becomes non-renormalizable. Theories with purely quadratic bi-local kinetic terms, as studied in [1, 3] (as well as many subsequent works) avoid such problems through a non-renormalization theorem: If we write
[TABLE]
then the claim is that the quadratic bi-local term is never renormalized (at least perturbatively), though the purely local term certainly is—and depending on details, derivative terms might be radiatively generated. Non-local interaction terms vitiate this non-renormalization theorem, and one’s suspicions could be renewed that there is no sensible theory. We will not be able in this work entirely to allay such concerns, because we do not give a demonstration parallel to the one in [6] that Ward identities based on diffeomorphism invariance guarantee that loop divergences can only modify the original form of the action. Indeed, the counterterms we generate at one loop do modify the bi-local action in an unexpected way, but one which appears to be controlled in a derivative expansion, so that higher derivative terms can be radiatively generated at each new order without spoiling results from lower orders. We will revisit the question of renormalizability in section 13.
The organization of the rest of this paper is as follows. In section 2 we present the main results in Fourier analysis that we need, both over the reals and the -adics. In section 3 we explain how double integrals such as the one in (1) can be regulated if divergences arise as . In section 4 we introduce the classical action for the bi-local non-linear sigma model. In section 5 we discuss loop divergences in general terms, including an introductory account of the non-renormalization property of the kinetic term in (2). In sections 6-10 we investigate the simplest one-loop divergences of the bi-local non-linear sigma model, and then in section 11 we argue that all these divergences can be canceled by a laplacian counterterm in place of renormalization of the local metric, together with field redefinitions. As a byproduct of our analysis, we recover in section 12 the usual beta function for the two-derivative theory in two dimensions. We conclude in section 13 with a summary of possible future directions.
2 Fourier transforms
In loop calculations we will often need to go back and forth between momentum space expressions similar to the ones presented in (2) and their real space counterparts, using the Fourier transforms
[TABLE]
The relevant results are fairly similar between real and -adic cases, so we present them together. When , the definitions (3) are entirely standard, and can be understood as the ordinary dot product. Likewise, in this case, is understood as the standard norm on . We refer to the real case as Archimedean because the norm has the property that if , then there is some such that .
The simplest -dimensional -adic construction is based on letting be the (unique) unramified -dimensional extension of . Let and be the field norm and field trace with respect to the extension . Then we define where is the usual -adic norm. We will refer to the -adic case as ultrametric because the norm just defined has the property . Next we define . Note that , so to give meaning to we now only need to define for . To this end we find the unique -adic integer such that , and we understand that by we really mean .
We are particularly interested in the Fourier transform of powers of :
[TABLE]
Here is a meromorphic function of which can be evaluated as
[TABLE]
where we set in the Archimedean case and in the ultrametric case, with
[TABLE]
Intuitively, is a variant of the Euler gamma, specific to the choice of , and constructed so as to be the coefficient of the term in (4). In the remainder of our discussion, integrals are over unless otherwise indicated.
The contact terms in (4) are somewhat delicate and dependent on detail. When , the integral in (4) is convergent, and no contact terms are needed. One can easily check that as , so when the power law term goes away and we recover the obvious result
[TABLE]
For , the integral in (4) diverges, and we need a more careful approach. A good first step is to understand (4) in terms of its action on a test function :
[TABLE]
where is some linear map on functions . A suitable class of test functions are so-called Schwartz-Bruhat functions. When , we require that is locally constant with compact support. For example, the characteristic function of the -adic integers is a Schwartz-Bruhat function on . When , the test functions are more appropriately called Schwartz functions, and their defining property is that they go to [math] faster than any power of , as do all their derivatives. An example is a Gaussian. Both in the real and ultrametric cases, the Fourier transform of a Schwartz-Bruhat function is again a Schwartz-Bruhat function.
With (8) taken as the definition of , our task is to find a representation of entirely in position space. In the ultrametric case for arbitrarily positive , one finds
[TABLE]
This is the Vladimirov derivative. In the Archimedean case, the same expression (9) is valid for . There is one more easy case to dispose of: even positive integer for Archimedean . Then , which makes sense in (4) because the right hand should be purely distributional, on account of being analytic in . Explicitly,
[TABLE]
where .
We are left with the task of defining for Archimedean and for but not an even integer. Heuristically, the contact terms in (4) are a sum of terms of the form , where , with divergent coefficients. To state this more precisely, we write
[TABLE]
where a regulated integral
[TABLE]
is rendered finite (if possible) by allowing the subtraction from of a finite sum of smooth functions of either of the following types:
- I.
Pure powers: more precisely, any function whose dependence comes solely through a factor where is a real number. This is meant to include, through the case , functions which have no dependence. 2. II.
Higher partial waves: more precisely, any function of the form where is a spherical harmonic on other than the -wave.
Type I functions are never integrable, whereas type II functions may or may not be; so at best there is a unique choice of type I functions that will work, whereas many choices of type II functions are possible. An alternative approach, generalizing the principle value prescription, is to eschew modifications of the integrand and instead carry out integration in polar coordinates centered around , as follows. One first performs the angular integrals. Then the radial integral is restricted to run from to . One next allows the subtraction of an arbitrary finite sum of negative powers of and/or positive powers of , chosen (if possible) so that the limits and , taken independently, lead to a finite result. Doing the angular integration first obviates the need for type II functions, while the ultraviolet and infrared cutoffs, and , obviate the need for type I.222The alert reader may notice that the alternative approach using cutoffs is not quite equivalent to adjusting by pure powers of : For example, if is a positive even integer and , then we get a logarithmic divergence that would obviously be canceled using an appropriate type I function but cannot be cured using powers of and/or after a cutoff integration. Because we avoid even integer as well as functions which grow as positive powers of large separation , we do not need to specify a resolution to this inequivalence.
While the subtractions described can in principle cure either ultraviolet (UV) or infrared (IR) divergences, we will be interested only in applications where UV divergences matter: that is, divergences arising when (with held fixed). Type II subtractions are relatively innocuous because they follow automatically from performing angular integrations first; therefore we will use the notation to indicate a integration with type II subtractions which we usually omit to write explicitly.
Although we have stated our integration prescriptions in the abstract, it is easy to see how to apply them to (11) when is a Schwartz function. Consider the case , and set for simplicity. Then (11) becomes
[TABLE]
The extra terms in square brackets on the right hand side of (13) evidently render the integral convergent near for . The term linear in is clearly a type II function, and the term quadratic in is a sum of a type II function proportional to (a -wave term) and a type I function proportional to . If , then we would need one additional term in the Taylor series expansion of around , and this additional term is a type II function. In summary, for , and omitting type II subtractions,
[TABLE]
Evidently, if , a simpler subtraction scheme would work, resulting in (14) with the laplacian term omitted, in agreement with (9).
For general (other than positive even integers) and Archimedean ,
[TABLE]
where
[TABLE]
In principle, one may derive (15) by subtracting an appropriate number of terms in the Taylor series expansion of and then finding appropriate type II subtractions to bring the result into the form (15).
A more efficient way to determine the coefficients is to start from (15) and Fourier transform:
[TABLE]
In the second equality of (17), we have partially carried out the integral in polar coordinates around the point , introducing a radial variable . In the third equality, we have carried out the angular integral and introduced a new radial variable, . The integral in the last line of (17) converges, provided is positive but not an even integer, and provided the coefficients are coefficients in the Taylor series expansion of the Bessel function around . These coefficients are well known, and from them one can recover the expression (16) for the .
3 Bi-local integrals
We are particularly interested in double integrals of the form
[TABLE]
where and is piecewise constant if is ultrametric and smooth if is Archimedean. Unless otherwise noted, all double integrals over and will by taken over all of . In the ultrametric case, for any , following (9) we define
[TABLE]
In the Archimedean case, we define
[TABLE]
by performing the integration first and allowing the subtraction of type I and type II functions to in order to achieve a finite result (if possible). As in the previous section, type II subtractions are deemed relatively inconsequential, so even unprimed integration over and means to perform the integration first, allowing the subtraction of type II functions in order to achieve a finite result (if possible). Explicitly, for not a positive even integer,
[TABLE]
where the coefficients are as given in (16). We avoid positive even integer when is Archimedean because in this case we expect that our constructions will lead instead to purely local theories; also, precisely in this case, the subtleties pointed out in footnote 2 regarding logarithmic divergences come into play.
Our computational strategy will turn on converting bi-local position space integrals into Fourier space integrals. Let’s start with the simplest example of that calculation, valid for ultrametric and any , and also for Archimedean and . Let be a Schwartz-Bruhat function. Then
[TABLE]
The first step is actually the trickiest, because it is not clear from the rules of integration set forth following (20) that we are allowed to add a function like to the integrand. To justify this step, we denote , and we argue that
[TABLE]
The second integral in (23) is the component of the Fourier transform of . But this Fourier transform is , and since the component indeed vanishes.
Let’s now pursue the same computation for the Archimedean case with . On one hand, using (21),
[TABLE]
On the other hand, using (14),
[TABLE]
In order to conclude
[TABLE]
we must therefore argue that the final integrals in (24) and (25) agree. Subtracting (25) from (24) and simplifying slightly with the definition , we arrive at
[TABLE]
The first equality in (27) follows from (14), and the second is by the same argument used following (23). To summarize, for Archimedean and for ,
[TABLE]
for , and so without the on the left hand side of (28) we would have a sign problem. The equality (26) can be checked in a similar manner for . A key relation is
[TABLE]
Two take-away lessons are:
- •
When we write simple kinetic terms in momentum space, in position space we are combining non-local position space terms and local terms involving derivatives in a precisely tuned ratio.
- •
There is some freedom in the precise structure of the position space form, as exemplified by the equality of the last integrals in (24) and (25) due to a manipulation which is the non-local version of integration by parts.
4 The bi-local non-linear sigma model
We are now in a position to present the action for the bi-local non-linear sigma model. Let be a smooth -dimensional manifold with a Riemannian metric , whose Riemann and Ricci tensors are
[TABLE]
Given any two points and on , let
[TABLE]
be the square of the shortest distance between and . Clearly, is a smooth function of and , provided and are not too far apart. For smooth functions whose range is sufficiently localized, we consider the action functional
[TABLE]
where indicates a regulated double integral of the type discussed around (19)-(21).333One may wonder whether the primed integral, as defined following (20), spoils coordinate invariance of the integrand. For instance, if is sufficiently large we may, in light of (21), be required to include a term to the integrand, which if written only in terms of partial derivatives does not appear to be coordinate invariant. In fact, it is easy to convince oneself that, e.g., can be constructed from covariant quantities: . Note that this discussion requires us to avoid positive even integer when and are valued in (as opposed to ). In the Archimedean setting, when , there are derivative terms like implicitly built into (32), with coefficients tuned so as to ensure convergence of the integral. The parameter has dimensions of energy so that we can regard and as dimensionless. The factor is a loop-counting parameter: Classical effects are , one-loop amplitudes are , two loop amplitudes are , and so forth. In other words, plays the role of .
A close cousin of the action (32) was considered in [7]:
[TABLE]
where now and are vertices of a graph and the sum is over undirected edges. The formula actually appears earlier in [6], though it was intended there to be considered on a square lattice, as a regulator for the local non-linear sigma model, rather than on the Bruhat-Tits tree as in [7].
We require the range of the maps to be sufficiently localized in order to ensure that we do not encounter any failures of smoothness in , and in order to ensure that we can use a single system of Riemann normal coordinates for throughout. One can now solve the geodesic equation perturbatively in the curvature and use that to evaluate and expand the action (32),
[TABLE]
where
[TABLE]
and
[TABLE]
Here and are evaluated at the origin of the Riemann normal coordinates, which we assume is at . Often, the definition of Riemann normal coordinates includes the requirement , but this is not necessary for our calculations, and we find it more convenient to retain explicit factors of and the inverse metric . Put differently, we are choosing a coordinate system so that all geodesics passing through the origin are linear in the affine parameter:
[TABLE]
The ellipsis in (34) indicates higher order interactions, involving five or more powers of and/or , as well as derivatives and powers of the curvature. We will consider up to six point interactions in section 10.
5 Loop divergences in momentum space
Our aim in this section is to introduce the main concepts related to divergent loop diagrams that we will need in our analysis of the non-local non-linear sigma model. As a warmup, we first exhibit the simplest manifestation of the non-renormalization theorem of the non-local quadratic kinetic term in the action (2), where or , with
[TABLE]
The purely cubic theory is unstable, but it serves our purpose because we are only interested in analyzing the behavior of the one-loop correction to the propagator. Using the diagram shown in figure 1a, we obtain the one-loop contribution to the quadratic part of the one-particle irreducible (1PI) effective action:
[TABLE]
We continue the convention of integrating over all of except as otherwise indicated. Let’s assume , so is UV divergent (and IR convergent). To regulate the divergence, we introduce a hard cutoff: . If , then can be any positive real number. If , then we will require that is an integer power of .
The ultrametric case is easy to analyze, because when we have exactly. So, except in the compact region where , the integrand has no dependence at all. Therefore, any UV divergences are entirely independent of , and to evaluate them we can set :
[TABLE]
The last equality comes from splitting the integration region into shells with fixed ; then the integral becomes a geometric sum. Because the divergent part of has no -dependence, the counterterm required to cancel it is proportional to . In other words, it is a mass term. This argument is easy to generalize to the statement that only purely local terms (powers of ) can be radiatively generated starting from the action (2) over . An essentially equivalent argument was made in a Wilsonian picture in [3].
The Archimedean case is more subtle because of the possibility of subleading divergences. A straightforward approach is to expand
[TABLE]
in powers of . Terms with an odd number of powers of vanish by parity, leaving only terms analytic in . Of these, only terms proportional to with are UV divergent. In short, the divergent part of is a polynomial in whose order is . A divergent term proportional to requires a counterterm proportional to . These are the radiatively generated derivative terms mentioned in section 1.
We should note a troublesome feature of the hard momentum cutoff for Archimedean theories: The coefficients one finds for sub-leading divergences depend on how one implements the cutoff. For example, it is easy to check that the coefficient of the term in changes if instead of requiring we impose the more democratic condition . However, the feature that we care about, namely the fact that the divergent terms have only polynomial dependence on , doesn’t depend on the details of the cutoff. It is perhaps instructive to consider one other alternative, namely dimensional regularization, in which one first computes
[TABLE]
by continuing to a domain of in which the integral is convergent. (In the current example, is such a domain.) The only divergences one then tracks are poles of the right hand side of (42) as a function of . These occur precisely when is a non-negative integer. It is characteristic of dimensional regularization that there is (at most) one divergent term for a given , corresponding to a logarithmic divergence in the original integral.
The loop diagrams we will need to consider in our analysis of the non-local non-linear sigma model are simpler than (39) in one regard: The loop is a single propagator starting and ending at the same vertex. This matters because there is then only one internal momentum , and imposing the hard cutoff is a privileged choice because it corresponds to integrating over an -invariant region. An example is the diagram shown in figure 1b, which is proportional to
[TABLE]
assuming that whatever vertex factor is needed to fully evaluate the diagram doesn’t depend on . We also assume so that is UV divergent but IR convergent. We straightforwardly find
[TABLE]
There are obviously no subleading divergences in .
For convenience we introduce
[TABLE]
We are interested in divergences proportional to that arise when . As a technical trick to isolate these divergences, we make small and positive, and we look for divergences of the form , which in the limit give rise to terms. To characterize this limit precisely, given and , we set and and then take the limit with and held fixed. (Clearly then we are allowing non-integer , in the spirit of [6].) For the most part, our final results are independent of . When is sufficiently small, we may replace (44) with
[TABLE]
If we lift the requirement that is an integer power of when , then (46) is unaltered, because , and differs from at most by a factor of . The important point is that in the limit , includes a logarithmic term , and isolating this term is our stated objective.
We will encounter one other loop integral:
[TABLE]
It comes from graphs similar to the one in (43), but with a vertex prefactor . Using the same reasoning that led to (40), we see that when a hard cutoff is imposed, one obtains in the ultrametric case
[TABLE]
If and is positive but not an even integer, then
[TABLE]
for some coefficients . If we choose positive but not an even integer and fix any finite value of , then for sufficiently small , (49) applies, and the least singular power of appearing in it is . As , this power remains positive and finite, tending to . So there is no behavior, even in the limit. If instead we make a positive even integer, then by choosing the very particular value , so that exactly, we find (for sufficiently small ) that the least singular term in (49) is , which does contribute a divergence in the limit; moreover, in this case, by calculation, .444Although we have argued that the hard cutoff prescription is the natural one to use, it is interesting to note that if instead we impose , then still when and is sufficiently small.
We can summarize the situation, both for the Archimedean and ultrametric cases, by stating that for positive but sufficiently small, then subject to the restriction that cannot be a positive even integer when ,
[TABLE]
where
[TABLE]
The higher powers of in (50) are accompanied by non-negative integer powers of , and they correspond to operators which remain relevant in the limit. The only behavior arising from , in the limits described above, is the term coming from the term shown in (50). We are not concerned about terms to and because they drop out of the behavior in the limit. In the following sections, therefore, we will drop terms from (46) and (51), and we will evaluate and in terms of rather than .
6 The propagator through one loop
To derive the tree-level propagator, we use an obvious generalization of (26) to multi-component scalar fields to rewrite the free action in momentum space:
[TABLE]
where we recall that , and for notational convenience we have introduced555A point worthy of remark is that while and have the same sign in the ultrametric case, and also in the Archimedean case for , for they have the opposite sign. The integral in (52) is well-defined and positive, so to make our theory sensible we should always choose . This means that for . As explained in (28) for a single real scalar, the regulated position space integral used to define the action (32) includes a term that enters with the opposite sign of the non-local term, so positivity conditions are difficult to judge in position space.
[TABLE]
We immediately extract from (52) the propagator
[TABLE]
We are primarily interested in small, so that is nearly logarithmic.
Informally, we can understand the one-loop correction to the propagator as a contribution to the 1PI effective action coming from all possible Wick contractions of the two of the four factors of in . The calculation is done most straightforwardly in momentum space, where we can express
[TABLE]
where . As usual, derivative terms are implied in in the Archimedean case when . Symbolically, the Wick-contracted quartic action is
[TABLE]
