Gauge and gravitational instantons: From 3-forms and fermions to Weak Gravity and flat axion potentials
Arthur Hebecker, Philipp Henkenjohann

TL;DR
This paper explores how gauge and gravitational instantons influence global symmetry breaking and axion potentials within quantum gravity, proposing bounds and analyzing the role of fermions and spacetime topology.
Contribution
It provides a detailed description of instanton effects via 3-form gauge theories and conjectures a cutoff-dependent lower bound on axion potentials due to quantum gravity constraints.
Findings
Decoupling of instantons explained through 3-form gauge theory.
Fermionic operators induced by K3 instantons may be phenomenologically relevant.
A conjectured lower bound on axion potential from quantum gravity considerations.
Abstract
We investigate the role of gauge and gravitational instantons in the context of the Swampland program. Our focus is on the global symmetry breaking they induce, especially in the presence of fermions. We first recall and make more precise the description of the dilute instanton gas through a 3-form gauge theory. In this language, the familiar suppression of instanton effects by light fermions can be understood as the decoupling of the 3-form. Even if all fermions remain massive, such decoupling may occur on the basis of an explicitly unbroken but anomalous global symmetry in the fermionic sector. This should be forbidden by quantum gravity, which leads us to conjecture a related, cutoff-dependent lower bound on the induced axion potential. Finally, we note that the gravitational counterpart of the above are K3 instantons. These are small fluctuations of Euclidean spacetime with K3…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
**Gauge and gravitational instantons:
** **From 3-forms and fermions to
** **Weak Gravity and flat axion potentials
** Arthur Hebecker and Philipp Henkenjohann
Institut für Theoretische Physik, Universität Heidelberg,
Philosophenweg 19, 69120 Heidelberg, Germany
June 18, 2019
**Abstract
**
We investigate the role of gauge and gravitational instantons in the context of the Swampland program. Our focus is on the global symmetry breaking they induce, especially in the presence of fermions. We first recall and make more precise the description of the dilute instanton gas through a 3-form gauge theory. In this language, the familiar suppression of instanton effects by light fermions can be understood as the decoupling of the 3-form. Even if all fermions remain massive, such decoupling may occur on the basis of an explicitly unbroken but anomalous global symmetry in the fermionic sector. This should be forbidden by quantum gravity, which leads us to conjecture a related, cutoff-dependent lower bound on the induced axion potential. Finally, we note that the gravitational counterpart of the above are K3 instantons. These are small fluctuations of Euclidean spacetime with K3 topology, which induce fermionic operators analogous to the ’t Hooft vertex in gauge theories. Although Planck-suppressed, they may be phenomenologically relevant if accompanied by other higher-dimension fermion operators or if the K3 carries appropriate gauge fluxes.
1 Introduction and overview
1.1 Fermions in the 3-form description
In this paper, we investigate the effective description of gauge or gravitational instantons by 3-form gauge theories, focusing in particular on the effect of fermions on this effective theory. Furthermore, we discuss how the interplay of fermionic operators and instantons affects axionic shift symmetries. If exact, such global symmetries should be in the swampland and we propose a lower bound on axion masses to quantify the minimal strength of symmetry breaking. Such a bound can then be used to derive constraints on the fermionic operators mentioned above.
Gauge theories of 3-form potentials have been discussed since a long time [1]. It is also well-known that the Chern-Simons 3-form of a non-Abelian gauge theory transforms under the non-Abelian gauge transformation exactly like a fundamental 3-form gauge potential [2]. A similar argument can be made for gravity. Taking this seriously, one can use the 3-form gauge theory as an effective description of Yang-Mills (YM) theory at low energies. This has been discussed in [3] in the context of chiral perturbation theory of QCD and more generally in [4, 5, 6, 7].
Our first, simple, technical point in this paper is to apply this description to the case of a Higgsed YM theory at energies below the symmetry breaking scale. In this regime, the instanton gas is dilute and a quantitatively controlled analysis is possible. As a result, the effective 3-form gauge theory description can be rigorously established as long as the source term (used for probing the theory through the coupling tr) is small. We will rely on this controlled model of a dilute instanton gas and its 3-form description in what follows.
We are particularly interested in the 3-form description of YM theories with instantons and massless fermions [4, 5, 6, 7]. Especially the idea that this may imply fermion condensates independently of confinement and that small fermion masses may be generated through gravitational instantons are intriguing.
A crucial assumption for this line of reasoning is that the effective 3-form description of YM theory with massless fermions includes a massless pseudoscalar. Indeed, in QCD this is the familiar meson. This scalar may be dualized into a 2-form which then gauges the effective 3-form, making it massive [8]. As a result, one obtains a very reasonable effective description for how fermions remove instanton effects. However, as we point out, a different option arises in our case of a Higgsed YM system: Light fermions may suppress the gauge coupling of the effective 3-form, completely decoupling it in the massless limit. This interpretation is supported, in our calculable setting, by the fact that no evidence of the massless scalar can be found. Thus, the effective description of how fermions remove instanton effects may change depending on the diluteness of the gas.
1.2 Fermions and axion potentials
We go on to study Higgsed YM theories which are coupled to an axion. Gauge instantons then generically induce an axion potential where denotes the instanton action. According to the Weak Gravity Conjecture (WGC) for axions, is bounded from above as , with the axion decay constant [9].111 To be precise, most naively the WGC constrains the mass (action) of a charged object to be small at weak coupling. For axions, this is the large- regime, . We assume here that the bound is (or is also) valid at small , which is our main case of interest in this paper.
However, a priori the WGC does not make a claim about the overall size of the axion potential. Thus, the potential could be small because comes with a small prefactor [10, 11, 12, 13]. In the following we want to take the bound on seriously and focus on the prefactor.
The smallness of the instanton prefactor can, in particular, be due to the presence of light fermions with mass . Indeed, the prefactor scales as , with the Higgs scale and the number of flavors. Obviously, for the potential vanishes identically and a global symmetry involving a shift in the axion and anomalous U(1) rotations of fermions emerges. Unless that symmetry is broken, for example by additional fermion interactions, this is inconsistent with quantum gravity expectations [14, 15, 16, 17].
Intriguingly, a similar phenomenon can be observed in a model in which all fermions remain heavy. As we will explain, this is achieved by shift-symmetry-preserving Yukawa interactions, which provide effective mass operators in addition to the hard mass terms. In fact, such a structure arises in the Standard Model if an axion coupling to tr is introduced [18]. Again, once the hard masses are taken to zero, the axion potential vanishes due to the emergence of a global symmetry. This shows that the problematic feature of the theory is not massless fermions but rather the presence of a global symmetry. Note also that in this class of models one can apparently take the massless-axion limit without tempering with any other IR degrees of freedom.
One could restore consistency with quantum gravity by simply claiming that flat axion potentials are in the swampland, thereby excluding such models. However, there are many counterexample in string theory: Calabi-Yau compactifications of type II strings lead to 4d supergravity models with perfectly flat moduli spaces, including axionic directions. Of course, in this case we expect that the global axionic symmetries are broken by higher-dimension operators involving fermions. The latter stay massless due to supersymmetry and a low-energy theory with a global shift symmetry is not realized even at arbitrarily low energy.
We note that much more work related to the WGC, especially the WGC for axions, has recently been done (see e.g. [19, 10, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43])222See [44] for a recent review on the WGC and, more generally, on the swampland paradigm.. A lot of this is in the context of trying to avoid or defend the bound on . An imortant ingredient is trying to break the microscopic connection between and [45], with recent concrete realizations including in particular [24, 33, 34, 40]. Moreover, it has been argued that wormholes generate a potential for axions [46] (see also [21, 47, 11]) which, however, we will not discuss in the following. Phenomenological consequences of axion potentials for small are discussed in [47] and [11]. In spirit close to our discussion is a proposal in [48] which constrains the relative size of any corrections (not just instantons) to axion potentials. The strong form of this constraint requires that there always exists a dominant contribution to the axion potential that has a sub-Planckian periodicity (corresponding to ). Still, this constraint does not make a claim about the absolute size of axion potentials.
1.3 Flat axionic potentials and the Swampland
Ultimately we are interested in finding a general lower bound on axion potentials, thereby quantifying to which extent an approximate global axionic shift symmetry is compatible with quantum gravity. To do so, we try to invoke the WGC for the effective 3-form theory encoding the crucial instanton effects. The magnetic version of the WGC for 3-forms bounds the cutoff of the 3-form theory according to , where denotes the 3-form gauge coupling. Unfortunately, the restriction of our effective 3-form theory to small translates to a restriction to energy scales below . Thus, we never enter the interesting regime above where WGC constraints would apply.
To improve on this, we argue for the existence of an extended, highly non-linear 3-form description which is valid beyond . This enhanced range of validity allows for a cutoff which can be larger than and is given by the mass scale of the lightest massive degrees of freedom in the UV, for example the mass of light fermions or the Higgs scale. In contrast to the original 3-form description, this new 3-form theory is severely constrained by the WGC. For example, it would parametrically constrain the Higgs scale of Higgsed YM models with gauge coupling according to . This would put weakly coupled YM theories which are Higgsed at a high scale into the swampland. We feel that this is too strong a statement to be taken seriously and conclude that the WGC for 3-forms is probably only valid for the canonical quadratic 3-form theory.
Thus, while the 3-form-WGC-approach fails, we still expect some lower bound on generic axion potentials to exist. We are inspired by the axionic WGC, but we also know from the SUGRA example that the bound should disappear as the cutoff is taken to zero. Guided by this as well as by simplicity and consistency with known examples we hence propose that
[TABLE]
where and parametrize our ignorance of the exact bound, denotes the axion decay constant and is the cutoff of the low energy theory that exclusively describes the axion. From this one can easily derive a bound on the axion mass :
[TABLE]
We have already mentioned that we expect global axionic shift symmetries to be broken by appropriate fermion operators. If the corresponding fermions are not massless, they can be integrated out, thereby inducing an axion potential. Our proposed bound on such potentials then translates to a lower bound on fermion operators. We apply this approach to fermion masses in Higgsed YM theory with and without additional mass contributions from Yukawa couplings. In both cases we find that fermion masses are parametrically bounded by
[TABLE]
for with the caveat that in the case without Yukawa couplings this holds only for more than four fermion flavors.
1.4 Gravitational instantons
Finally, we argue for the possibility of gravity-induced fermion interactions via gravitational instantons. To appreciate our argument, recall that the axial U(1) symmetry of gauged massless fermions is anomalously broken according to , where is the axial U(1) current. In the presence of gauge instantons, the spacetime integral of the topological density is non-trivial. This implies the existence of an effective fermion interaction that explicitly breaks the axial U(1). This instanton-induced fermion interaction, also called ’t Hooft interaction, can be determined explicitly in the dilute gas approximation [49, 50].
The same logic can be applied to pure gravity. There we have with the right hand side being a gravitational topological term. To the best of our knowledge, K3 is the only compact manifold non-trivially contributing to this term [51, 52]. We describe how a K3 manifold can be glued into flat , such that it can be considered a local fluctuation of it. This is analogous to the localized field strength of a gauge instanton and also implies the gravitational analogues of ’t Hooft fermion interactions. Using the dilute gas approximation we naively estimate the strength of these interactions and find that the associated energy scale is, in the case of the SM, of the order of GeV. We therefore conclude that they are phenomenologically irrelevant. However, we point out that this may be circumvented by combining the K3-instanton-induced interactions with other higher-dimension operators.
The rest of the paper is organized as follows: In Sect. 2, we recall and develop some basic ideas about 3-form gauge theories. We then use these theories as effective descriptions of gauge instantons (with and without fermions) in Sect. 3. This setting and other approaches are employed in Sect 4 to discuss possible Swampland constraints on the achievable flatness of axion potentials. Section 5 analyzes analogous effects of gravitational instantons and some Conclusions are drawn in Sect. 6.
2 The physics of massless and massive 3-forms
In this section we collect some results about 3-form gauge theories [1, 2, 3, 4] (see Appendix A for details and derivations). In particular, we argue for the equivalence with the dual -form description. We also show how the force between domain walls can be used to follow the transition between Coulomb and Higgs phase. This quantifies features discussed in [8].
The free theory is defined by the Euclidean action
[TABLE]
where is the field strength, is the gauge coupling, and is the 4d Riemannian manifold on which the theory lives. If is compact, the -flux on it is quantized and one can dualize the partition function based on (4) in terms of a sum over discrete values of (see Appendix A.1). Explicitly,
[TABLE]
where is a normalization constant. This partition function is invariant under the shift and we can hence view as a periodic variable which takes values in the range . Using this we conclude that for constant the term with corresponds to the lowest energy state. In the limit this term dominates:
[TABLE]
We also note that, for compact space and non-compact (Euclidean) time, i.e. for , the theory clearly represents a non-trivial quantum mechanical system: The fundamental degree of freedom can be characterized by . This system corresponds to a quantum particle on a circle. From now on we will, however, take for simplicity.
Let us now introduce Cartesian coordinates and choose the source such that it represents two parallel domain walls localized at and :
[TABLE]
They are subject to a force per unit area,
[TABLE]
which is independent of the distance . This is expected since one has no propagating degrees of freedom. Instead, the force is due to a constant background field strength which is different between the walls and outside.
Next we consider the coupling of a dynamical scalar with mass to the 3-form:
[TABLE]
Here determines the normalization of . If , one may take the scalar to be periodic such that becomes its axion decay constant. The action above is then dual to the situation where a 2-form is gauged by a 3-form as discussed in [8].333 This is the gauge-field-theoretic description [53, 54] of axion monodromy inflation [55, 56], recently revived in the context of -term axion monodromy [57, 58, 59].
For us, is a convenient parameter to switch this gauging on and off. Indeed, if the field disappears in the IR and is not Higgsed.444 Note that a non-zero may even be made consistent with a fundamentally axionic nature of : All one needs is to interpret the effect of the gauging of by a further 3-form (which has been integrated out) as an effective monodromy or mass parameter.
Integrating out gives
[TABLE]
which, upon carrying out the integration, simplifies to
[TABLE]
with . Also the force per area is altered:
[TABLE]
The additional term exponentially decays with the distance of the two domain walls and indicates the presence of a propagating degree of freedom with mass . The effect of the constant background field strength is also still present, but it is now suppressed by . At , the 3-form theory is Higgsed and this long-distance effect disappears.
3 3-form gauge theory as effective field theory of instantons
In the late 70s it has been noted that YM theory always contains a 3-form that inherits a corresponding gauge transformation, , from the original non-Abelian gauge symmetry. It therefore can be considered a proper gauge 3-form [2]. More recently it was argued that this 3-form may provide an alternative description of the Peccei-Quinn solution of the strong CP problem in terms of a 3-form which gauges the 2-form dual to the axion [8]. This logic and its implications have been developed further in [4, 5].
In this section we want to analyze the relation between YM theory and 3-form gauge theory more systematically. To do so we focus on the calculable case of a weakly coupled Higgsed YM theory such that we can employ the dilute instanton gas approximation in our computations. We find that the instanton induced correction to the vacuum energy can be effectively described by a pure 3-form gauge theory at small -angle and below the Higgs scale. In the presence of light fermions this effective description remains a good approximation below the fermion mass.
3.1 Pure Yang-Mills theory
Let us first consider pure YM theory with Euclidean action
[TABLE]
where is the Lie-algebra-valued gauge potential and . As is well known, is a total derivative, i.e. it can be written as the exterior derivative of a 3-form . This is the proper 3-form gauge potential mentioned in the introduction to this section [2]. denotes the gauge coupling and is again an arbitrary external source. For we may identify it with the usual -parameter of YM theory. In order to be able to deal with this theory computationally we assume the gauge symmetry to be broken spontaneously at a scale and take the running gauge coupling to be small at this scale: . All instantons larger than are then cut off and the dilute instanton gas approximation is valid. In this case one can integrate out the gauge field and obtains the partition function
[TABLE]
where denotes the instanton action and (see Appendix B for details). For small this reduces to
[TABLE]
which is exactly the same as (6) for the pure 3-form gauge theory if we set
[TABLE]
By comparing the actions (4) and (13) we see that generates the same correlation functions for in the 3-form gauge theory as for in the gauge theory. Also the forces on domain walls will obviously be the same. Hence we have established the pure 3-form gauge theory as an EFT of Higgsed YM theory at energies below the symmetry breaking scale .
3.2 Yang-Mills theory with fermions
3.2.1 Comparison at the 1-instanton level
Next we add fermions with mass to the Higgsed YM theory. For simplicity we add only one fermion field in the fundamental representation of the gauge group:555For the sake of simplicity we have not included the Higgs sector in the action which is, nevertheless, always implicitly assumed to be present.
[TABLE]
If , we can first integrate out the fermions finding an effective gauge theory action with additional terms suppressed by powers of [60]. Ignoring these small corrections at and below the Higgsing scale we can continue to use the analysis of Subsection 3.1.
By contrast, light fermions (with mass ) have a significant effect. To see this, we integrate out the gauge field first and find the effective action for the fermions in a background of a dilute instanton gas. The corresponding calculation has been done by ’t Hooft [49, 50] and leads to the following partition function:
[TABLE]
is some constant and is the left- and right-handed projection operator, respectively. Note that the instanton induced 2-fermion interaction corresponds to the well-known ’t Hooft determinant for one flavor and is suppressed by the instanton action via . In the process of integrating out the fermions these interactions will give rise to loop corrections to the effective action which are suppressed by powers of and correspond to multi-instanton effects. Therefore it is possible to view the effective action as a power series in this suppression factor.
For now let us ignore these loop corrections and calculate the partition function to leading order in , i.e. at the 1-instanton level, which can be done exactly [49]. The result is
[TABLE]
and reduces to
[TABLE]
for small . Fortunately, this exactly coincides with (15) up to a suppression factor and therefore we can once again apply the logic of Subsection 3.1 to conclude that, at leading order in , Higgsed YM theory with a light fermion is, at energies below the fermion mass , effectively described by a 3-form gauge theory (4) with
[TABLE]
For the sake of completeness let us let us also give the corresponding result for fermion flavors with mass (cf. Appendix B):
[TABLE]
This shows that in the limit of massless fermions, , the 3-form gauge coupling vanishes or, in other words, the 3-form becomes non-dynamical. At the same time, the cutoff of the effective 3-form theory goes to zero of course. This consistently reproduces the fact that the -parameter of YM theory becomes unphysical and instantons are suppressed in the presence of massless fermions.
Let us give an intermediate summary and make an observation which we find interesting: We are considering a YM theory that is Higgsed at a scale and contains light fermions of mass below . We may assume , such that an EFT at scale with can be defined. In this EFT, the massive gauge bosons have been integrated out such that we are dealing with a purely fermionic theory. In addition to the kinetic and mass term, these fermions are subject to the famous, instanton-induced ’t Hooft interaction. Next, we may also integrate out the fermions (at the 1-instanton level) to obtain the EFT relevant at scales below . We argued that this is a massless 3-form gauge theory. The only assumptions were small and that higher-order corrections in do not modify (20) significantly. The interesting implication of this is that the low-energy limit of a fermion theory with ’t Hooft interactions is provided by a 3-form theory. Note that this 3-form has a priori nothing to do with the 3-form present in the original YM theory.
3.2.2 Multi-instanton effects
Let us now consider the next-to-leading order corrections due to the instanton induced fermion interaction. We want to clarify whether they can significantly affect our 3-form EFT in the relevant energy range . To do so, the corrections to the force between two domain walls are calculated in Appendix B.2. The diagrams contributing to the energy are shown in Figure 1. Upon differentiation of this energy with respect to the distance between the domain walls, , we find the force density
[TABLE]
The dots denote the leading order contribution (8) with (21) which is reproduced by the leading diagram Figure 1(a) as expected. The first term in the brackets corrects the leading order result and hence contributes to the gauge coupling of the effective 3-form theory. We expect that, at the intuitive level, this corresponds to a renormalization of the fermion mass , possibly due to non-perturbative effects, such that (21) remains true when used with the appropriately renormalized mass. The second term, however, is finite and contains a non-trivial dependence on the distance between the domain walls.
We can rewrite this second term using the modified Bessel function of the second kind:
[TABLE]
For very small distances, , as well as very large distances, , this can be approximated as
[TABLE]
Note that in the limit the force becomes exactly proportional to .666In this limit one must not forget about the factor in front of the bracket in (23). Furthermore, our loop calculation indicates that it is due to the exchange of two fermions. Indeed, if the force were due to the exchange of one massless scalar, it should be independent of , consistent with the Coulomb law in co-dimension one. By contrast, our observed -behavior is consistent with the faster decay of a force originating from multi-particle exchange.
In [61] it has been argued that the exchange of two massless fermions leads to a force that falls off as which, at first sight, seems to be in contradiction with the behavior found above. However, in our scenario the fermions interact with a classical source while in [61] they mediate a force between two fermions via four-fermion interactions. This ultimately explains the difference in the force laws. It is instructive to ‘derive’ the law using dimensional analysis. First of all, since we are interested only in the contribution to the force due to the exchange of fermions between different points in spacetime, only the second order diagram in Figure 1(b) is relevant for us. We will work in Euclidean space and regularize the volume by considering a finite 4-dimensional box with edges of length . In the end we will take the limit . Using that the fermion propagator in position space behaves like , its contribution to the vacuum energy (see (110)777In the case at hand we have .) takes the form
[TABLE]
where the source is chosen according to (72) and can be represented by with being the Heaviside step function. In order to avoid confusion we have changed notation compared to (72) such that denotes the first component of the Euclidean four-vector corresponding to the direction orthogonal to the domain walls. By appropriately shifting the integration variable we can bring the integral into the form
[TABLE]
for some function , which demonstrates that the final result can only depend on and . In general, there will be infinite contributions to the energy in the limit . A quantity that has a chance of being finite is the energy per unit domain wall area, , whose finite part must be for dimensional reasons. Upon differentiation this reproduces the behavior of the force density which was obtained in the exact analysis.
Let us now go back to our original question whether the NLO corrections to the force density found above spoil the validity of the 3-form effective description and the 1-instanton approximation. Naively, i.e. ignoring the NLO force corrections, these approximations are valid in the energy regime which corresponds to distances with . As long as the ratio of the NLO contribution in (23) (without the mass renormalization term and using the approximation (25)) to the leading contribution (8) is much smaller than one this remains also true if we take the force corrections into account. We find
[TABLE]
where we used the formulae (106) and (103) for and , respectively. For the 3-form description is still valid. Since we are in the regime , and , the first three factors are much smaller than unity. However, the last term exhibits a factorial growth with . While this may spoil the condition and hence the validity our 3-form EFT in principle, we can always avoid this by keeping sufficiently small.
Finally we are in a position to address the question, which was raised in the Introduction, whether there exists an emergent bosonic degree of freedom in Higgsed YM theory with massless fermions. If this would be the case, we expected either a constant contribution to the force between the domain walls for a massless boson or a purely exponential contribution in the case of a massive boson. Neither of this is the case for the force in (23). Hence, we conclude that there is no sign for the presence of some emergent bosonic degree of freedom and the sub-leading corrections to the force are really due to the exchange of multiple fermions according to the ’t Hooft interaction in (18).
There is, however, a loophole to our conclusion that no bosonic degree of freedom is present. In the general case of flavors the ’t Hooft interaction is a -fermion interaction and we therefore have a Nambu-Jona-Lasinio type effective theory for the fermions [62, 63]. For such theories it has been shown that non-perturbatively generated masses for the fermions and bosonic bound states of fermions are present at large enough coupling [64, 65]. In particular this is what is thought to be happening in QCD, leading to chiral symmetry breaking. However, we have limited our analysis to the small coupling regime, , so we expect that this is not relevant in our case.
3.3 Yang-Mills theory with fermions and Yukawa couplings
So far we have seen that Higgsed YM theory with light fermions, i.e. lighter than the Higgs scale, can effectively be described by a 3-form theory,
[TABLE]
at scales below the fermion mass and with . This means in particular that this description completely breaks down in the limit .
It is possible write down a model where fermions enforce the vanishing of without becoming massless themselves. To see this,888 We owe this idea and its simple model realization to Gia Dvali.
consider a Higgsed SU(2) gauge theory with gauge coupling and which is Higgsed by the vacuum expectation value of a scalar doublet with . Furthermore, we add Weyl fermions in the following representations: two SU(2) doublets and four singlets with . We generate masses via Yukawa couplings
[TABLE]
where are SU(2) indices. For simplicity we choose the Yukawa coupling constants such that the fermions obtain masses of the order of the Higgs scale . Finally we add an explicit mass term
[TABLE]
It is crucial to note that whatever value has, the fermion masses will always be at least due to the Yukawa couplings. Consider a U(1) transformation according to which and . This U(1) is anomalous with respect to SU(2) and explicitly broken by the mass term . Thus the SU(2) -parameter is physical and below the Higgs scale the theory is effectively described by a 3-form theory like (29). The cutoff of this effective theory is given by .
Let us consider how the effective 3-form description is affected by the parameter . For the anomalous U(1) can be used to rotate away the SU(2) -parameter. It is hence unphysical. However, in the effective 3-form description is still physical which seems to be a contradiction. Furthermore, for all there is no reason for the 3-form description to break down. In particular its cutoff is independent of the value of . So how can the two points and be smoothly connected to each other in the effective 3-form description?
A reasonable and simple answer is that the 3-form decouples in the limit . By this we mean that such that . In this way the 3-form is unable to generate a potential for the -parameter and makes it effectively unphysical. Hence consistency with the UV theory is restored. We expect that the coupling constant of the 3-form theory must be proportional to some positive power of . Its role is analogous to that of the fermion masses in our original model without Yukawa couplings. In the following we will assume that this analogy can be taken literally and is given by (21) with . Note also that the decoupling of the 3-form is due to the change of a parameter of the UV theory and takes place without changing the degrees of freedom in the IR.
4 Swampland constraints on axions and fermions
4.1 Global symmetries and fermion operators
Axions have by definition a perturbative global shift symmetry. If this symmetry were exact also at the non-perturbative level, it would violate the quantum gravity censorship of global symmetries. We therefore expect this symmetry to be broken by non-perturbative effects in a consistent theory.
Indeed, this is realized if the axion couples to a (Higgsed) YM theory with massive gauged Dirac fermions. In the following we will refer to such a model as the light-fermion-scenario in contrast to the heavy-fermion-scenario explained in Subsection 3.3. This terminology is supposed to stress the fact that the fermions in the model presented in Subsection 3.3 remain always massive due to the Yukawa couplings. The Lagrangian of the light-fermion-scenario reads
[TABLE]
where denotes the axion and its decay constant. Now the shift symmetry is non-perturbatively broken by instantons which induce an effective potential for the axion at low energies (below the Higgs scale).
If at least one of the fermions becomes massless the Lagrangian becomes invariant under the transformation and , where is the massless fermion. Since this symmetry contains a shift in the axion and is exact at the quantum level, we conclude that the axion potential must vanish in the presence of a massless fermion.101010Strictly speaking we do not have an exact global symmetry here because of the chiral gravitational anomaly. However, this effect is severely suppressed, as we will discuss in the next section, and can be eliminated by adding an appropriate number of ungauged massless fermions. This can be viewed as a simple symmetry argument for a technical result of the instanton calculus. However, as we will discuss later on, we expect such a theory to be constrained due to the quantum gravity censorship of global symmetries [14, 15, 16, 17].
Let us now consider the heavy-fermion-scenario of Subsection 3.3. It consists of a Higgsed SU(2) YM theory with two Weyl fermion doublets and four Weyl singlets . These eight Weyl fermions are given a mass via the Higgs mechanism such that we end up with four massive Dirac fermions. Furthermore, we add an explicit mass term . If we couple an axion to this theory, instantons will generate an effective axion potential. In the limit the theory becomes invariant under an exact global symmetry with the transformation law , and . Similarly to the light-fermion-scenario we can conclude from this that the axion potential must vanish in the limit .111111 As already noted in the Introduction, a closely related situation arises for an SU(2)L axion extending the Standard Model. In this case, the anomalous symmetry is U(1)B+L and its possible explicit breaking by higher-dimension operators has been used to argue for a ultra-light axion in [18].
The existence of this global symmetry is again in conflict with quantum gravity expectations.
At first sight the two examples given above may look very similar. Here we would like to point out an important difference. In both theories the axion potential becomes zero in the limit of a vanishing mass parameter. While in the light-fermion-scenario this results in the presence of a truly massless fermion, in the heavy-fermion-scenario all fermions remain massive in the limit due to the Yukawa couplings. In particular, in the former case the axion potential vanishes only at the expense of changing the IR degrees of freedom by introducing massless fermions while in the latter case those degrees of freedom are fixed for all values of .
We have seen that both examples have a global symmetry which eventually allows for the presence of an exactly massless axion. One may argue that those two theories are simply incompatible with quantum gravity and hence reside in the swampland. However, a similar situation arises in SUGRA which contains an axion with an exactly flat potential [66]. Nevertheless, we expect higher fermion interaction terms to break this symmetry explicitly and thereby make the theory consistent with quantum gravity again. Similarly, such additional fermion interactions could be used to make our two scenarios consistent with quantum gravity, too.
Motivated by this we conjecture that such additional fermion interactions are mandatory for fermions without a hard mass term in the presence of axions in order to prevent the existence of a global symmetry. This is a non-trivial statement and we find it interesting how the exclusion of a global axionic shift symmetry imposes constraints on fermion interactions. One could ask whether a minimal strength of these interactions can be inferred on general grounds and in the next two subsections we attempt to do so by conjecturing a constraint on axion potentials.
4.2 (Too strong) a constraint on axions from the WGC for 3-forms
Let us now bring back in the 3-form description of our two models. So far we have discussed the effective 3-form description of Higgsed YM theory without axions in Section 3. However, according to (10) the axion is easily accommodated by replacing the source by the axion field and adding a corresponding kinetic term for it. Then we immediately see that the axion mass is given by . That means, whenever a global symmetry of the full UV theory forbids an effective axion potential, the effective 3-form description must decouple as (cf. Subsection 3.3) is required for a vanishing potential. This is exactly what we observe in the case of a Higgsed YM theory in the limit of massless fermions and what we still expect to happen once Yukawa couplings have been introduced as in the heavy-fermion-scenario. More generally, the 3-form gauge coupling should always be proportional to a symmetry-breaking parameter of the UV theory.
The idea is now to apply the WGC to the effective 3-form description of YM theory and thereby derive a constraint on the 3-form gauge coupling and hence also on the axion potential. Let us start by stating the two versions of the WGC for 3-forms. The electric WGC for 3-forms requires the existence of domain walls which naturally couple to the 3-form and whose tension is bounded from above according to
[TABLE]
where is the 3-form gauge coupling [9]. As long as the cutoff of the 3-form theory is below this bound has no consequences because the theory breaks down before the presence of such heavy domain walls is required by the WGC. On the other hand, if the cutoff is larger than and domain walls are not part of the theory, such a theory is forbidden by the WGC. The magnetic version of the WGC for 3-forms simply bounds the cutoff from above according to
[TABLE]
which is exactly the condition for the electric WGC to be satisfied without the presence of a light domain wall.
In order to apply the WGC to our effective 3-form theory of Higgsed YM theories we need to discuss the cutoff of this effective theory in some detail. In Section 3 we have found that the effective 3-form description of Higgsed YM theory is naively valid up to the Higgs scale while the presence of light fermions with masses reduce this cutoff down to . If these fermions acquire their masses via order one Yukawa couplings, the cutoff stays at . However, we have ignored a caveat in the corresponding argument which we would like to point out now. Recall from Section 2 that the energy density of the 3-form theory is given by . Furthermore, the effective 3-form description is only valid for and thus breaks down at energy densities corresponding to , i.e. . Therefore the actual cutoff of the EFT is . Using this new cutoff we find which means that the theory always perfectly satisfies both the magnetic and electric WGC.
The last paragraph showed that the effective 3-form description of instantons breaks down at a scale set by the 3-form gauge coupling . In particular this cutoff can be much lower than naively expected. Consider for example the heavy-fermion-scenario. In this theory the 3-form gauge coupling can be made arbitrarily small by choosing the parameter appropriately. In particular we can choose it such that we have a cutoff , where is the Higgs scale as usual. On the other hand, there are no new degrees of freedom in the theory below the Higgs scale. Hence, there should be an effective theory that is valid in the energy range between and and contains the same degrees of freedom as the original effective 3-form theory.
In order to get rid of the constraint we would like to find a 3-form theory with action such that it reproduces the full partition function (14), i.e.
[TABLE]
with satisfying (16) or (21), depending on whether we include fermions or not. Although we cannot determine the explicit form of , the above formula implicitly defines it and thereby also defines the effective 3-form theory we are looking for. In general can be a very complicated functional. For example, from the instanton calculus we expect to have non-trivial support only at -configurations which are -function-localized at certain points, each contributing one unit to .
Now we can use this improved effective 3-form theory and apply the WGC to it. According to (34) the 3-form gauge coupling must obey . This turns out to be an extremely strong statement. To see this consider for example a simple Higgsed YM theory with Higgs scale given by . As usual the corresponding effective 3-form theory has a gauge coupling and cutoff . (34) then implies
[TABLE]
where is the gauge coupling constant of the YM theory evaluated at the scale and we have used the relation . This would imply that weakly coupled YM theories can only be spontaneously broken at exponentially low scales. Even though this is a valid result we think that it is too strong as there is naively no good reason why such a scenario should not be realizable in string theory.
Therefore we discard (34) and conclude that the application of the WGC to the effective 3-form description of Higgsed YM theory leads to peculiar results. On the other hand, as we have discussed above, the WGC applied to the 3-form theory with the conservative estimate for the cutoff is satisfied. From this we conclude that, if the WGC for 3-forms has any regime of validity at all, it can only be applied to canonical 3-form theories with the standard action given by (4).
4.3 A conjecture on axion potentials and implications for fermions
Given our failed attempt to use the WGC for 3-forms to constrain axion potentials we now instead try to find a reasonable conjecture for a bound on the axion potential in the following. In general we expect the non-perturbative axion potential to be of the form
[TABLE]
We would like to find a lower bound on the amplitude of this potential. A first step into this direction is the WGC for axions which constrains the action according to , i.e. [9]. However, as long as is completely free this does not provide a hard bound on the potential. Very naively one could conjecture that but this, again, is too strong since the axion potential in SUGRA vanishes exactly and therefore provides a counterexample in the landscape.
How can we reconcile a vanishing axion potential in SUSY moduli space and, at the same time, a lower bound on it? A possible answer is that the bound depends on the cutoff of the effective axion theory and vanishes for zero cutoff. We therefore propose the following general form of a bound on axion potentials:
[TABLE]
Here is the UV cutoff of the low energy theory that exclusively contains the axion while and parametrize our ignorance of the exact form of the bound.121212In principle, is fixed by the precise form of the WGC for axions: . But an exact analogue of extremal black holes, which normally define the bound, does not exist. Natural candidates might be axionic wormholes [67, 11], suggesting the value . Alternatively, if wormholes are absent, extremal gravitational instantons may define the bound [29]. This, however, may not be universal since different stringy models have different saxion couplings leading to slightly different values for [68]. Let us also sketch a supergravity example (à la KKLT) hinting towards . Consider a superfield with shift-symmetric Kähler potential and constant superpotential . In principle both objects can receive additive instanton corrections of the form . If only is corrected and if , the leading contribution to the axion potential for is , where is the gravitino mass. It appears plausible that is (of the order of) the mass of the lightest fermion and provides the cutoff relevant for the pure-axion theory. This suggests the bound on axionic potentials. The corresponding bound on the axion mass is
[TABLE]
In order for this to be consistent with the fact that is the cutoff of the pure axion theory, this lower bound must lie below . For this is generically true due to the exponential suppression by . It is also interesting to consider the case , even though this is not the regime we focused on. In this case such that
[TABLE]
This bound has a chance of being smaller than only if which, interestingly, seems to be marginally satisfied in the supergravity example discussed in Footnote 12.
Now let us discuss the implications of this bound for the two scenarios discussed so far. For simplicity we use and set in the following. Let us start by considering the light-fermion-scenario with fermions of mass and an axion as defined by (32). The cutoff of the EFT of the axion is given by . In order to satisfy (38) the following inequality must hold
[TABLE]
Here we have used (22) to determine in (38). Taking the WGC into account, this is trivially satisfied for any value of for while implies the bound
[TABLE]
Consequently the fermion mass is only restricted for . For an exactly flat axion potential is possible at the expense of massless fermions. In this case the degrees of freedom in the IR change and there exists no low energy theory that contains only the axion. As already discussed in Subsection 4.1, such a theory has a global symmetry consisting of a shift in the axion and an anomalous U(1) rotation of fermions. We therefore expect additional fermion interactions to be present that break this symmetry. The constraint (39) on the axion mass reads
[TABLE]
Next consider the heavy-fermion-scenario. To be more general consider the -fold duplicated version of the model we have discussed so far, such that we have explicit mass terms with mass . With (38) implies
[TABLE]
which is very similar but stronger than what we have found in the last paragraph. Finally (39) reads in this case
[TABLE]
This procedure could also be used to constrain other fermion operators that break a global symmetry which protects the axion potential.
5 Gravitational instantons and fermion interactions
Consider YM theory with massless Dirac fermions , , in the fundamental representation of the gauge group. This theory has an axial symmetry on the classical level which is anomalously broken by instantons [60]. As a result, the corresponding current of the symmetry is not conserved:
[TABLE]
with and
[TABLE]
By recalling that instantons are topologically non-trivial field configurations, obeying
[TABLE]
we can conclude that
[TABLE]
This shows that the axial charge must change by along a single instanton event. Since counts the number of right-handed minus the number of left-handed fermions, an instanton must convert right-handed fermions into left-handed fermions.131313To conclude this we have to use the fact that the sum of right- and left-handed fermions is conserved. Hence we conclude that instantons induce a -fermion interaction that explicitly breaks the axial symmetry. These are the well-known ’t Hooft interactions and they can be explicitly calculated. In particular, it turns out that the vectorial and the chiral flavor symmetry are left unbroken by the interaction. Simply by using this symmetry breaking pattern we can construct the flavor structure of the ’t Hooft vertex. To do so we define the -invariant matrix with . With this quantity we can construct exactly one interaction that is invariant under the chiral flavor symmetry, namely . We can not fix the color index structure by this line of reasoning but it will not be relevant for us anyway.
Having discussed the YM case let us now turn to gravity. For this it is most convenient to use Weyl fermions instead of Dirac fermions. Hence let us consider massless Weyl fermions with which live on a manifold with metric . In terms of these the current reads
[TABLE]
and there exists a corresponding gravitational chiral anomaly [69, 70, 71, 72] given by
[TABLE]
where is the Riemann tensor and . There exist a number of different gravitational instantons for which the integral of the right hand side of (51) is non-zero [73] and hence induces a change in the charge associated with the current . Among them are, for example, the well-known Eguchi-Hanson instanton or the K3 manifold [74, 75]. In the following we would like to treat gravitational instantons as fluctuations of flat spacetime . This can be done by cutting a 4-dimensional ball out of so that the boundary of the resulting hole is which then can be connected to the boundary of the gravitational instanton via a wormhole-like throat. Sections of this throat are topologically . In the case of the Eguchi-Hanson instanton this procedure is not possible since the topology of its boundary is and hence does not match the of the hole in .141414See, however, [76, 77, 78] and [79] for a discussion of possible physical effects of Eguchi-Hanson instantons and of gravitational instantons in general, respectively.
A more promising candidate for a quantum fluctuation of is the K3 manifold[51, 80]. Since it is Ricci-flat, it solves the vacuum Einstein equations and has a vanishing action. Furthermore, it is the only compact 4-dimensional manifold with self-dual curvature such that the right hand side of (51) is non-zero [75, 51, 52, 73], namely
[TABLE]
As in the case of gauge instantons this result depends exclusively on the topology of K3 and is related to the Atiyah-Singer index theorem [73]. The topological nature of this relation will be important for us in the following. Since K3 has no boundary, we have to cut out a 4-dimensional ball in order to glue it into flat spacetime according to the general procedure described in the previous paragraph (see Fig. 2). The resulting manifold is of course not a K3 anymore and has changed its topology. It also does not solve the Einstein equations anymore. Therefore one may be worried whether the topological relation (52) still holds for the new manifold. In the following we argue why there is no problem.
Let us start with K3 that is glued onto an instead of as described above. This manifold has no boundary and is topologically still a K3. Hence (52) remains true. In the next step we split the anomaly integral into three parts, corresponding to the , the original K3 contribution and the wormhole connecting them:
[TABLE]
But the metrics on and on the wormhole have locally and hence also . Now delete a point from the which gives but certainly does not change the integral . Hence we can conclude that a K3 glued into a flat region indeed gives a contribution to the anomaly integral.
Finally, this allows us to conclude, similarly to the gauge instanton case, that a K3 fluctuation is, according to (51) and (52), accompanied by a change of the axial charge . Hence K3 must induce an effective fermion interaction which is a product of fermion fields. Since fermionic fields anti-commute and our theory contains exactly of them (two components for each of the Weyl fermions) this information uniquely fixes the structure of this effective fermion interaction. It is simply the product of all fermion fields:
[TABLE]
where and denote the first and second component of the Weyl spinor , respectively. The anti-commutativity of the fields implies that this product is totally antisymmetric in all of its indices, i.e. in both flavor and spinor indices. Thus it is invariant under any subgroup of the full SU() flavor symmetry as well as under any non-anomalous U(1) transformation of the fermions. This ensures in particular that the interaction is not in conflict with any pattern of consistent gauge symmetries that could act on the fermion fields.
Now we would like to estimate the strength of the effective interaction (54) by evaluating the contribution of K3 to the path integral of Euclidean quantum gravity with an insertion of this operator. To do so we need to integrate over all metrics of an asymptotically flat spacetime that contains one K3 fluctuation. Each such metric is suppressed by the action according to with
[TABLE]
and Ricci scalar which depends on the metric . While the asymptotic and the K3 have vanishing curvature and hence do not contribute to the action, the wormhole part is non-trivial. Let be the typical size and curvature scale of the wormhole.151515It may be interesting to analyze whether the controversial negative mode issue [81, 82, 83, 84, 47, 11, 85] of Giddings Strominger wormholes affects this wormhole region and hence K3 instantons as well. Then, by dimensional analysis, its contribution to the action must be of the form161616Note that wormholes are negatively curved which leads to the positive sign for the wormhole action. . The effective fermion interaction receives contributions from instantons of all sizes so that we have to integrate over it. Since we do not integrate out the fermions, no factor of the fermion mass can arise in front of the effective interaction. Therefore, we can use dimensional analysis to find
[TABLE]
where the last equality defines the typical scale of this interaction and parametrizes our ignorance about the instanton determinant. The integral over is dominated by a critical value . If it is reasonable to assume as well since there is no large dimensionless number in the problem. Then one finds . By contrast, if and remains , one obtains . In this case, the calculation remains controlled since the integral is dominated in the large-radius/weakly-curved regime. (However, might depend non-trivially on , so we can not be certain about this.) For the Standard Model one would find , i.e. an almost Planck-suppressed interaction.
Note that for (57) is a Majorana mass term. This raises the interesting question whether the K3 instantons generate fermion masses also for more than one flavor. A general mass term has the form
[TABLE]
and can only be generated by (57) if it respects the symmetries of this interaction (this is analogous to the fact that chiral symmetry prevents fermion masses from being generated in perturbation theory). In the following we show that for we can always construct a non-anomalous U(1) transformation of the fermion fields which is a symmetry of (57) but not of (58) for any non-trivial mass matrix . Hence no masses are generated by (57) for . To see this consider any non-vanishing term in the sum (58). Neither this term nor (57) is invariant under the anomalous U(1) transformation . However, we can render it non-anomalous, and hence a symmetry of (57), by letting a second field transform according to . For we can choose such that is still not invariant under this transformation. Consequently, such mass terms can not be generated by (57). So far this argument does not exclude a Dirac mass term for . However, this possibility is also readily excluded by a symmetry argument. To do so note that a Dirac mass term of the form breaks the discrete transformation while (57) remains invariant. To summarize, the K3 induced interaction does not generate masses for fermions except for . 171717Euclidean wormholes may provide a different mechanism by which gravity breaks global symmetries, in particular inducing fermion interactions (see e.g. [86, 87, 88, 14, 89]). However, wormholes also introduce deep conceptual problems, most notably with AdS/CFT [90], and may hence have to be excluded. This would enhance the role of K3 instantons in breaking global symmetries.
In the following we show how the K3-induced interaction can nevertheless contribute to fermion mass generation under certain circumstances. Let us illustrate the idea with a simple example. Consider the case of , i.e. we have two Weyl fermions and . Now let have a mass term while remains massless. Furthermore, the K3 induced interaction is a vertex with four external fermion lines, two associated with and two associated with . Making use of the Feynman rules for Weyl fermions (see e.g. [91]) we see that the two lines can be connected to each other via the propagator to give a diagram which generates a mass for . Note that is crucial for this to give a non-zero contribution. Using (57) the full diagram can be estimated as
[TABLE]
where we have cut off the quadratically divergent momentum integral at the scale . Since we are treating the K3 instantons as point-like and their typical size is , we have to impose for consistency. It is then natural to assume , such that .
This result can be readily generalized to the case of an arbitrary number of Weyl fermions of which all have a mass except for one. For this massless fermion a mass is generated by the K3 induced interaction if, again, all external lines of the vertex are connected to each other pairwise via the massive propagator. In this way a mass of order for the originally massless fermion is generated. However, in the phenomenologically interesting case of low-scale masses we expect this to be miniscule. Indeed, for SM-like scales , and one finds .
More generally it is possible to combine the K3 interactions also with higher-dimension operators to generate fermion masses.181818In [18] a similar mechanism has been used to generate an axion potential in the context of SU(2) gauge instantons. This is even possible if all fermions are massless at tree level. Consider for example three massless Weyl fermions with K3 interaction and some additional interaction . This operator can be connected to the K3 vertex via four propagators to generate a mass for . In contrast to the previous discussion here we connect external -lines with external -lines which results in a propagator that does not vanish for zero mass, . Note that the operator and its complex conjugate alone do not generate a mass for any of the fermions. We want to emphasize that the effect of such combinations of K3 induced interactions and higher-dimension operators can in principle unsuppressed and hence large. It would therefore be interesting to analyze in more detail the effect of such operators in the SM. Experimental information, such as bounds on the proton lifetime, may then be used to constrain these operators.
So far we have seen that K3 instantons alone in general do not generate fermion masses but can be combined with higher-dimension operators to do so. At this point we want to describe a mechanism by which one could possibly get rid of the need for such operators to obtain fermion masses. The idea is to have fermions charged under a U(1) gauge symmetry and to consider the effect of K3 instantons which have cycles carrying non-trivial U(1) flux.191919The possibility of a K3 instantons with non-trivial U(1) flux has been noted in [79] In this case the chiral gravitational anomaly (51) would obtain corrections from the U(1) flux on the K3 according to
[TABLE]
where denotes the field strength of the U(1) gauge symmetry and for simplicity we have assumed the charge of all fermions to have absolute value one. Upon integrating this equation over spacetime the second term on the right hand side will be proportional to the U(1) flux squared. The latter can be chosen to make the change in the charge associated with along such a K3 instanton small. This would then reduce the number of fermions which occur in the induced fermion interaction. If this number could be chosen to be two, this interaction would correspond to simple mass terms.
Furthermore, it would be interesting to determine whether these interactions may contribute to effective potentials of axions. If this axion couples to gravity via the topological term (53), this does not seem to be the case since a shift in the axion could be undone by an appropriate anomalous fermion transformation. However, a detailed discussion is needed to answer this question properly.
6 Conclusions
In the first part of this paper we have studied the effective 3-form description of instantons. To do so we coupled both theories to an external source and calculated the respective partition function and forces on domain walls which we modeled by a spatially varying . While this calculation can be done exactly in the 3-form theory, for the gauge instantons one needs to employ the dilute gas approximation. This restricts the range of applicability to weakly coupled Higgsed YM theories. We found that the partition functions and forces agree for small values of the external source and an appropriately chosen 3-form gauge coupling constant . This shows that 3-form theories indeed are EFTs of Higgsed YM theories for small . We expect this correspondence to hold also for gravitational instantons. It would be interesting to see how the 3-form description can be improved such that the restriction to small can be relaxed.
With the same method we analyzed the effect of gauged fermions on the effective 3-form description. It turned out that they simply alter the expression for the gauge coupling of the effective 3-form theory by a factor proportional to their mass. This implies that massless fermions decouple the effective 3-form theory which is consistent with the fact that they completely suppress isolated gauge instantons. Recently, it has been argued that the effective 3-form description of YM theory with massless fermions could potentially contain a massless bosonic degree of freedom [4, 5, 7]. While this is the case in a confining theory like QCD with an pseudoscalar, we are not able to find evidence for this in the case of a Higgsed YM theory.
After having discussed the effective 3-form description of instantons, we considered axionic shift symmetries in the second part of the paper. Ultimately, we expect such global symmetries to be broken due to quantum gravitational effects. Our intuition is then that this should manifest itself in terms of a non-vanishing potential and mass for the axion. Now, an interesting question is whether there is a quantitative bound on how small axion masses can be.
For YM theories coupled to fermions and an axion, the massless axion is protected by an exact global symmetry that involves a shift in the axion and an anomalous U(1) transformation of the fermions. This symmetry is generically broken by fermion operators which explicitly break the anomalous U(1) rotation of the fermions. Since quantum gravity censors global symmetries, it requires the presence of such operators, thereby also generating a mass for the axion. Conversely, if one has some a priori knowledge about a lower bound on axion masses, the coefficients of the relevant fermion operators can be constrained from below.
However, we can not simply propose a general lower bound on axion masses in terms of and . The reason is that SUSY compactifications provide examples with exactly massless axions in the landscape. It is then natural to expect that any lower bound on axion masses must depend on the cutoff, . Here is the scale below which the effective theory contains exclusively the axion. If the -dependence of the axion mass bound is such that , then consistency with SUGRA is maintained. Indeed, these theories always have further massless degrees of freedom, such that the cutoff is zero.
Based on simplicity, the examples we have considered so far, and the WGC for axions we then propose the following bound on axion masses :
[TABLE]
Here is the axion decay constant, is the cutoff of the low-energy axion theory, and is an unknown parameter. Since in YM theory instantons and fermions determine the non-perturbatively generated axion mass, one may then use this bound to constrain fermion masses . The precise results are somewhat model dependent, but the structure is of the type
[TABLE]
where denotes the Higgs scale of the YM theory.
There are at least two promising directions to make progress with this conjecture in the future. First of all, it is important to test it in stringy constructions. In particular, it would be interesting to understand more precisely how SUSY or weakly broken SUSY protect massless and very light axions respectively. Second, one can take our bound for granted and explore possible phenomenological implications. Especially constraints on fermion operators which break axionic shift symmetries could be studied in a variety of models.
Finally, in the last section we discussed the possibility of fermion operators which are generated by gravitational instantons. These operators are the analogues of the so-called ’t Hooft interactions which are generated by gauge instantons. We argued that K3 instantons seem to be the only gravitational instantons capable of generating those interactions which, in a theory containing Weyl fermions , take the form . A rough estimate of the strength of this operator showed that it is severely suppressed and not relevant for phenomenology. However, if combined with other higher-dimension fermion operators, it can give unsuppressed contributions to fermion masses. We also note that further interactions with different structure are possible if the fermions are charged under a U(1) gauge symmetry and K3 instantons with non-trivial U(1) flux are considered. In particular, this may allow for fermion mass terms which are directly induced by such gravitational instantons.
Acknowledgments
We are very grateful to Gia Dvali for collaboration during part of this project and for many extremely useful discussions. We would also like to thank Paolo Di Vecchia, Archil Kobakhidze and Pablo Soler for helpful discussions. This work is supported by Deutsche Forschungsgemeinschaft (DFG) under Germany’s Excellence Strategy EXC-2181/1 - 390900948 (the Heidelberg STRUCTURES Excellence Cluster).
Appendix A 3-form gauge theory
A.1 Pure 3-form gauge theory
The free theory of a 3-form gauge potential is defined by the Euclidean action
[TABLE]
where is the field strength associated to and is an external source. corresponds to the coupling constant and is the 4-dimensional Riemannian manifold on which the gauge theory lives. In the following we take with having finite volume and no boundary unless otherwise stated. The corresponding (thermal) partition function is
[TABLE]
We have normalized such that the Dirac quantization condition reads . Making use of this quantization condition we can rewrite the partition function as
[TABLE]
where now we view as an independent integration variable and treat the action as a functional of . After rewriting , performing the -integral and using the identity we find for the partition function
[TABLE]
with being a possibly infinite constant.
In order to understand this theory physically let us consider the partition function in the limit of constant . Then
[TABLE]
where is the circumference of and denotes the volume of . This is the partition function of a theory with infinitely many orthogonal energy eigenstates labeled by all integers and with energy given by . From the form of it is clear that the theory is invariant under the shift . Hence it is sufficient to consider only .202020Note that for there are two degenerate energy eigenstates. We will ignore this subtlety in the following. For this choice the vacuum energy is given by . For only the vacuum state remains while all other energy eigenstates disappear due to their exponential suppression relative to the vacuum. In the following we will only keep the vacuum state as we are primarily interested in the limit .
The partition function for infinite volume and arbitrary reads
[TABLE]
From this we easily read off the energy density
[TABLE]
and calculate the vacuum expectation value of :
[TABLE]
We also find for the correlator of
[TABLE]
The appearance of the -function in the correlator and the fact that the vacuum expectation value exactly follows the external source shows that the field strength is a purely local object that does not propagate any degree of freedom through spacetime.
So far we have seen that the 3-form theory in the large volume limit has only one energy eigenstate, the vacuum, and lacks any propagating degrees of freedom. Therefore, one may be tempted to conclude that the theory does not contain any dynamics. This is not true as we will explain now. As is well known, the gauge potential naturally couples to the world-sheet (WS) of a domain wall via . Alternatively we may write this as where is the conserved current of the domain wall. After integration by parts we see that the source term is exactly of this form with . In the following we will choose an appropriate that describes two domain walls and calculate the force that acts on them. It turns out that this force is not zero and therefore the theory is not trivial.
For simplicity we will choose to be with coordinates . We would like to describe two domain walls defined by and . Assuming this is realized by the choice
[TABLE]
with . The force on the domain wall is simply the negative derivative of the energy associated to this configuration with respect to the position of the domain wall. However, this force is going to be infinite due to the domain walls being infinitely extended. Hence, the proper quantity to determine is the force per area. To do so we consider an infinite cylinder with a base area that is parallel to the domain walls. Now we calculate the change in the energy residing in the cylinder due to a small change in the position of the first domain wall. Using (69) we find . Since this expression is linear in and we can immediately read off the force per area acting on the domain walls at and , respectively,
[TABLE]
Note that the forces do not depend on the distance between the domain walls. The force on one of the walls remains non-zero even if we push the other domain wall out to infinity. The situation is somewhat analogous to that with a charged membrane positioned orthogonally to a homogeneous electric field.
A.2 3-form gauge theory coupled to a scalar field
Now we extend the 3-form theory by introducing a scalar field with mass that couples to according to the new action
[TABLE]
where determines the normalization of . This theory is special for since in that case it is dual to a 2-form theory that is gauged by . The dual action reads
[TABLE]
which can be easily checked by dualization under the path integral. This action is invariant under the simultaneous transformations and for an arbitrary 2-form . It realizes the Stückelberg mechanism for , i.e. the gauge symmetry is spontaneously broken by the vacuum such that only a massive is left. In the following we want to argue that on the -side of the duality this symmetry breaking can be possibly understood as an effect of the (quantum) dynamics of the massless .
Let us make this argument for a more familiar example. Consider the following action:
[TABLE]
This action realizes the Stückelberg mechanism for a 1-form gauge potential . If we embedded this theory in a Higgs theory, would be the vacuum expectation value of the Higgs field. Hence we expect the gauge symmetry to be restored in the vacuum for which indeed is the case as is clear by inspection of the action.
Next let us have a look at the dual action which reads
[TABLE]
Since does not transform under the gauge symmetry of its classical vacuum configuration does not break it. Let us inspect the case for which the spontaneous symmetry breaking is turned off. In this case the dynamics of the field is frozen and it effectively acts as a source for . This observation suggests that the dynamics of is ultimately responsible for the spontaneous symmetry breaking. Note also that the duality of (76) and (77) breaks down when is massive. Hence this property of seems to be crucial for the dynamics behind the spontaneous symmetry breaking. All of these observations carry over to the theories defined by (74) and (75).
Let us go back to the generic case with arbitrary and calculate the partition function of the theory. The -integration can be carried out as before which leads to
[TABLE]
This path integral is Gaussian in and we can therefore simply use the classical equation of motion,
[TABLE]
with , to find the formal result
[TABLE]
For , i.e. , this reduces, up to constant factors, to (68) as it should be. The vacuum expectation value and correlator of are now calculated to be
[TABLE]
and
[TABLE]
From the pole structure of the correlator we infer the presence of a massive degree of freedom with mass which continues to exist even for . In fact this is not surprising as we have seen that for a Stückelberg mechanism is at work in the dual description.
Instead of using the formal expression (80) to determine the force on domain walls we explicitly use a solution to the equation of motion with as defined in (72). This solution can be written as
[TABLE]
Matching the solutions in the different regimes to each other at the boundary and demanding to be constant at fixes all six integration constants uniquely. Upon using the equation of motion (79) the action in (80) can be rewritten as
[TABLE]
The integrand of this action is the energy density in the presence of the two domain walls. In order to appreciate its structure it is helpful to explicitly calculate it:
[TABLE]
We clearly see that the first term equals the energy density (69) of the pure 3-form theory corrected by a factor . The effect of this part of the energy density on the force per area is hence exactly as we have calculated in (73) but with the additional factor . Now consider the second term in (85). We would like to repeat the computation of the change in energy within a given cylinder as we have done in Subsection A.1. However, this time the energy density changes at arbitrarily large distances from the domain walls if we move them around. Hence, we have to use an infinitely extended cylinder. The total energy within such a cylinder with base area , ignoring the first term in (85) we have discussed already, is
[TABLE]
Taking the negative derivative with respect to , dividing by and combing with the contribution from the first term in (85) gives for the total force density
[TABLE]
Let us compare this result with (73). We have again a constant contribution in (12) which is suppressed by the factor compared to (73) and a new second term that exponentially falls off with the distance between the domain walls. Note that this exponential fall-off is exactly what we could have anticipated from the presence of a massive degree of freedom with mass . While the first contribution to the force is due to the interaction of the domain walls with the background field strength, the second exponential term represents an interaction between the two domain walls due to a massive scalar field. In the limit , i.e. in the decoupling limit of , (87) reduces to (73) as it should be. For the constant part of the force disappears while the second essentially remains unaffected. This can be intuitively understood by observing from (81) that the background field strength vanishes for and constant . Hence there is no field strength the domain walls can interact with anymore and the corresponding force becomes zero. On the other hand, as already explained above, even though there is a massive scalar present which is why the second contribution to the force remains.
Appendix B Instantons in Yang-Mills theory
B.1 Review of the instanton calculus
In this appendix we collect some well known results about gauge instantons. A good reference is for example [60] but see also [92, 93, 94]. An SU() gauge theory is described by the Euclidean action
[TABLE]
where is the Lie-algebra-valued gauge potential and . denotes the gauge coupling and can in principle be an external source that depends on space. Here we assume the topology of space to be simply . An instanton corresponds to a topologically non-trivial field configuration which minimizes the action and has the properties
[TABLE]
The instanton configuration has moduli, four for the instanton location, one for its size and the rest for the orientation in group space. The contribution of an instanton with a given size and location to the partition function reads
[TABLE]
where
[TABLE]
and
[TABLE]
takes into account the running of the coupling with the instanton size . is an arbitrary reference scale. Furthermore, we have
[TABLE]
with and being order one numerical constants. Besides the instanton there is also an anti-instanton configuration with
[TABLE]
and the corresponding contribution to the partition function is
[TABLE]
In the following we will use the abbreviation .
Next we would like to determine the full contribution of instantons to the partition function. This can be done in the dilute gas approximation in which all instantons are considered point-like. Such an approximation is only valid if the density of instantons in space is small compared to their maximal size, i.e. if there is no overlap between them. However, in principle we have to integrate (90) over all and hence take into account instantons of all sizes. In fact the contribution of large instantons, which are problematic for the dilute gas approximation, diverges. Indeed, inserting (92) into (90) reveals that the integrand of the -integration is given by . The exponent is positive for any which renders the integral IR divergent. Hence, the dilute gas approximation is not applicable in a pure non-Abelian gauge theory.
Fortunately, this problem can be avoided by introducing a scalar field that breaks the gauge symmetry spontaneously with its vacuum expectation value and gives a mass to the gauge field. In this case the contribution to the partition function involves an additional factor so that large instantons are exponentially suppressed and the -integral becomes finite. Now, performing the -integration in (90) with the exponential suppression factor, dividing by , and ignoring the phase gives for the instanton density at leading order
[TABLE]
where we have chosen and
[TABLE]
The size of the instantons is now effectively cut off at and therefore, as long as 212121This can always be achieved by choosing the gauge coupling small at the symmetry breaking scale., the dilute gas approximation is valid. In particular, the cutoff of the resulting effective theory is given by as we will treat everything (in particular instantons) smaller than as point-like.
Now we are in a position to sum the contribution of all possible ways to place instantons and anti-instantons in spacetime and find
[TABLE]
In the next step we consider YM theory with Dirac fermions of mass in the fundamental representation of SU(). The corresponding action is222222For the sake of simplicity we have not included the Higgs sector in the action which is, nevertheless, always implicitly assumed to be present.
[TABLE]
where is the Euclidean covariant derivative and are matrices in spinor space satisfying the Euclidean version of the Clifford algebra, .
Once again we would like to obtain an effective action that is valid below the scale . If , we can first integrate out the fermions in an instanton background which will give an effective action for the gauge field that has additional terms suppressed by powers of . Ignoring these small corrections we are left with a pure gauge theory, i.e. for we can simply ignore the fermions at scales below and the analysis presented at the beginning of this section applies.
For fermions which are light, i.e. , we can no longer simply integrate them out but have to take them into account properly. For later use let us define the operator-valued matrix where denotes the left-handed and right-handed chirality projector, respectively. We would like to integrate out the gauge field and find the effective action for the fermions in a background of a dilute instanton gas. The corresponding calculation has been done by ’t Hooft [49] (see also [50]). In particular he showed that the partition function corresponding to the action (99) in an instanton background gives rise to the same fermion propagators as the partition function
[TABLE]
where and denote position and size of the instanton, depends on and and, most importantly, the running coupling now includes a contribution due to the fermions:
[TABLE]
Note that the color-structure of the operator is in general non-trivial but has been suppressed by our simplified notation. In the following the details of this will not be relevant for us but they can be found for example in [49, 95].
The contribution of an anti-instanton at the same position and of the same size is the complex conjugate of (100). Now we can perform the -integration in (100) and sum the instanton and anti-instanton contribution as in (98) to get the following partition function for the fermions
[TABLE]
with
[TABLE]
From this we see explicitly that integrating out the gauge fields yields effective fermion interactions, also called ’t Hooft interactions, which in the case of one flavor reduce to a simple mass term that explicitly reads
[TABLE]
Next we would like to determine the vacuum expectation value of . To do so we need to integrate out the fermions in (102) to obtain an explicit expression for the partition function as a functional of . The result can be organized in a series expansion in the small quantity where terms of order correspond to -instanton contributions. If one is only interested in the leading term of this expansion, one can skip the derivation of the effective action (102) and instead directly integrate out gauge fields and fermions in (99) in one step.
This calculation has also been done by ’t Hooft [49] and the result differs from that without fermions only by an additional factor in the instanton contribution (90) and the proper running coupling as stated in (101). Furthermore, the constant in (90) is changed by a factor and hence also depends on now. Note that the negative contribution from the running coupling to the exponent of is over-compensated by the factor so that the -integration remains UV finite for all values of .
After repeating the familiar steps of summing all instanton contributions we find
[TABLE]
with
[TABLE]
Remember that the exponent of this formula is only exact up to order as we have ignored multi-instanton contributions. Compared to the theory without or with heavy fermions, (98), we essentially get a suppression factor . With this formula it is easy to calculate
[TABLE]
for small and formally infinite volume of the 4-dimensional Euclidean space. We observe that the vacuum expectation value vanishes for . This is of course the well-known result that massless fermions screen the topological susceptibility of non-Abelian gauge theories. Ultimately, this is due to the fact that massless fermions render unphysical.
B.2 Forces on domain walls in a dilute instanton gas
Similarly to the discussion of the 3-form gauge theory in Appendix A we now want to introduce domain walls via the external source and calculate the forces acting on them. From the effective theory (102) we have seen that instantons induce -fermion interactions. This implies that fermions can be exchanged between two distinct instantons and hence induce interactions between them. This effect should contribute to the force between domain walls.
In order to calculate this effect we can determine the vacuum energy in the presence of two domain walls, as defined by (72), in perturbation theory and then differentiate it with respect to the distance between the domain walls. For simplicity we consider the case for which the instanton-induced fermion interaction is just a correction to the mass term. We organize our calculation as an expansion in two small parameters. First we assume to be small and keep only terms up to quadratic order in the interaction term in (102) which gives the following interaction Lagrangian
[TABLE]
where we have introduced . Our final result, the force on the domain walls, will be given up to quadratic order in as well. Second, there is a factor in front of the instanton induced interaction term in (108) which is a measure for the interaction strength. This is a small quantity and therefore we use it as our second expansion parameter. Recall that each instanton comes with this factor and hence we can view terms of order as an -instanton effect. We will include contributions up to order .
To calculate , consider the vacuum to vacuum transition amplitude
[TABLE]
For and allowing also for a non-zero, static source (), equation (109) may be viewed as defining a partition function . Thus, is given by
[TABLE]
i.e. by the sum of all connected vacuum diagrams in the presence of the source as claimed in the last paragraph.
Using this result we are in a position to calculate up to second order in and . The two relevant diagrams are shown in Figure 1. Making use of the interaction vertex (108) and the profile of (72) one finds
[TABLE]
where the dots denote -independent terms which therefore are irrelevant for the discussion of domain wall effects. Here, as usual, and is the formally infinite spatial volume which we assume to be larger than any other scale in the problem. The term linear in is due to the leading order diagram (cf. Figure 1(a)) which is essentially a propagator in position space evaluated at the origin and integrated over the whole space. All other terms are due to the next-to-leading order diagram (cf. Figure 1(b)) which has a more complicated structure. It consists of two position space propagators evaluated at the difference of two space points and over which we have to integrate. Writing the propagators in momentum space gives us two 4-dimensional momentum integrals. We can perform three position integrals which correspond to the directions parallel to the domain walls to obtain a -function in momentum space. Consequently, three momentum integrals are trivially performed. The left-over 1-dimensional momentum integral can be performed using the residue theorem. Now we are left with one 4-dimensional position and one 4-dimensional momentum integral as well as the integral over the direction orthogonal to the domain walls. Once again we can perform a 1-dimensional momentum integration using the residue theorem. Although tedious, the leftover position integrals are straightforwardly carried out using (72).
Note that, except for the last integral, all momentum integrals are divergent. However, since the effective theory (102) is valid only up to the scale , those integrals are naturally cutoff at this scale. In this case we expect the leading terms, i.e. those linear in 232323Note that there are two terms linear in of which only one is displayed in (111) since the other term does not depend on ., to parametrically reproduce the vacuum energy that is obtained from the partition function (105) derived by ’t Hooft.242424We expect a parametric match only because ’t Hooft does not cut off his calculation at but takes into account all momenta. In contrast, the EFT defined by (102) does not know about physics above . Since the relevant integral is quadratically divergent this is indeed the case.
Differentiating (111) with respect to , dividing by and multiplying by gives the force density acting on the domain wall at :
[TABLE]
The second term in the first line simply provides a higher order correction to the leading contribution which we could have alternatively obtained from (105). However, we also find a completely new contribution to the force at the 2-instanton level proportional to . This new term is exponentially suppressed in the distance of the two domain walls.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. Aurilia, The Problem of Confinement: From Two-dimensions to Four-dimensions , Phys. Lett. 81B (1979) 203–206 . · doi ↗
- 2[2] M. Luscher, The Secret Long Range Force in Quantum Field Theories With Instantons , Phys. Lett. 78B (1978) 465–467 . · doi ↗
- 3[3] P. Di Vecchia and G. Veneziano, Chiral Dynamics in the Large n Limit , Nucl. Phys. B 171 (1980) 253–272 . · doi ↗
- 4[4] G. Dvali, S. Folkerts and A. Franca, How neutrino protects the axion , Phys. Rev. D 89 (2014) 105025 , [ 1312.7273 ]. · doi ↗
- 5[5] G. Dvali and L. Funcke, Small neutrino masses from gravitational θ 𝜃 \theta -term , Phys. Rev. D 93 (2016) 113002 , [ 1602.03191 ]. · doi ↗
- 6[6] G. Dvali and L. Funcke, Domestic Axion , 1608.08969 .
- 7[7] G. Dvali, Topological Origin of Chiral Symmetry Breaking in QCD and in Gravity , 1705.06317 .
- 8[8] G. Dvali, Three-form gauging of axion symmetries and gravity , hep-th/0507215 .
