Non-derivative Axionic Couplings to Nucleons at large and small N
Francesco Bigazzi, Aldo L. Cotrone, Matti J\"arvinen, Elias Kiritsis

TL;DR
This paper investigates non-derivative CP-odd axion-nucleon couplings using effective field theory, Skyrme model, and holographic QCD, providing model-independent expressions and quantitative estimates relevant for phenomenology.
Contribution
It introduces a comprehensive analysis of axion-nucleon couplings across three theoretical frameworks, deriving model-independent formulas and detailed quantitative estimates.
Findings
Derived expressions for CP-odd axion-nucleon couplings in various models.
Provided quantitative estimates for these couplings.
Connected couplings to low-lying mesons and the $ ext{η'}$.
Abstract
Among the possible CP-odd couplings of the axion to ordinary matter, the most relevant ones for phenomenology are the Yukawa couplings to nucleons. We analyze such non-derivative couplings within three different approaches: standard effective field theory, the Skyrme model and holographic QCD. In all the cases, the couplings can be related to the CP-odd non-derivative couplings to nucleons of the low-lying mesons and the . Using the effective field theory approach we discuss how to derive the expressions for the CP-odd interaction terms as functions of the parameters of the effective Lagrangian at generic number of colors and flavors . Then, we compute the CP-odd couplings to nucleons of the axion, the and the pseudo-Goldstone mesons in both the Skyrme and the holographic QCD model with . We present model-independent expressions for the coefficients of…
| CL + lattice | CL + pheno | Skyrme | Skyrme alt. | WSS | WSS alt. | |
|---|---|---|---|---|---|---|
| 9.4(9) | 13.0(1.1) | 18.7(4) | 27(5) | 16.3(4) | 21(4) | |
| 8.8(9) | 12.4(1.1) | 18.7(4) | 27(5) | 16.3(4) | 20(4) |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Non-derivative Axionic Couplings to Nucleons at large and small N
Francesco Bigazzia, Aldo L. Cotronea,b, Matti Järvinenc,d and Elias Kiritsise,f
a INFN, Sezione di Firenze; Via G. Sansone 1; I-50019 Sesto Fiorentino (Firenze), Italy
b Dipartimento di Fisica e Astronomia, Universitá di Firenze; Via G. Sansone 1; I-50019 Sesto Fiorentino (Firenze), Italy
c Institute for Theoretical Physics, Utrecht University, Princetonplein 5,
3584 CC Utrecht, The Netherlands
d Department of Physics and Helsinki Institute of Physics, P.O. Box 64, FI-00014 University of Helsinki, Finland
e Crete Center for Theoretical Physics, Department of Physics, University of Crete, 71003 Heraklion, Greece
f APC, Université Paris 7, CNRS/IN2P3, CEA/IRFU, Obs. de Paris, Sorbonne Paris Cité, Bâtiment Condorcet, F-75205, Paris Cedex 13, France (UMR du CNRS 7164).
Abstract:
Among the possible CP-odd couplings of the axion to ordinary matter, the most relevant ones for phenomenology are the Yukawa couplings to nucleons. We analyze such non-derivative couplings within three different approaches: standard effective field theory, the Skyrme model and holographic QCD. In all the cases, the couplings can be related to the CP-odd non-derivative couplings to nucleons of the low-lying mesons and the . Using the effective field theory approach we discuss how to derive the expressions for the CP-odd interaction terms as functions of the parameters of the effective Lagrangian at generic number of colors and flavors . Then, we compute the CP-odd couplings to nucleons of the axion, the and the pseudo-Goldstone mesons in both the Skyrme and the holographic QCD model with . We present model-independent expressions for the coefficients of the non-derivative axion-nucleon couplings. This allows us to provide quantitative estimates of these couplings.
Axions, axion-nucleon coupling, eta’, chiral Lagrangian, Skyrme model, holographic baryons
††preprint: CCTP-2019-6
ITCP-IPP 2019/6
1 Introduction and results
The Peccei-Quinn proposal [1] for a natural solution of the strong CP problem - the unnaturally small value of the QCD angle - implies the existence of a new light neutral pseudoscalar boson, the axion [2, 3]. The original theory, severely constrained by data, was not renormalizable because of the axion coupling to the QCD instanton density [4, 5]. In renormalizable axion models, successively proposed in [6, 7, 8], the axions were also made very weakly interacting with ordinary matter, as required by experimental constraints. These “invisible axions”, whose couplings to matter have been deduced using anomalies and the chiral Lagrangian [9]-[14], are nowadays considered among the most promising candidates as dark matter constituents, and also as possible realizations of the inflaton [15]-[20].
As of today, axion-like particles (ALPs) are ubiquitous and serve various purposes. The nomenclature has also evolved but still remains sometimes murky. Axions that solve the strong CP problem are typically called QCD axions. The term “axion-like particle” is often used to refer (only) to other types of axions.
Generically, axions display a perturbative shift symmetry and couple to instanton densities. In any reliable quantum field theory (QFT) realization, the symmetry is broken (at best) to a discrete symmetry due to non-perturbative effects. Such effects, related to instantons in weakly-coupled models, induce a mass and, more generally, a potential term, for the QCD axion (see e.g. [13]).
ALPs arise very commonly in string theory [21], the simplest example being the Ramond-Ramond (RR) axion of type IIB string theory. They can also arise, after compactification, from internal components of antisymmetric form gauge fields as well as from off-diagonal components of the metric. In these cases, ALPs are related to generalized gauge fields [22, 23]. The corresponding gauge symmetries provide the perturbative Peccei-Quinn (PQ) symmetries in string theory [24].
Continuous shift symmetries in string theory can be broken by non-perturbative effects. The argument is general:111In fact it also applies to NS-NS axions that couple to world-sheet or NS5-brane instantons. RR axions couple to the world volume of D-branes [25]. The same D-branes, wrapped around some appropriate Euclidean cycle, provide instanton effects in string theory [26]-[29]. The nature of these effects depends on the amount of supersymmetry. In the case of maximal supersymmetry they do not generate a potential, but affect higher derivative terms like the corrections, [30, 31]. In all cases, the end result is that the original shift symmetry is broken to a discrete subgroup.222There can be subtleties that arise when axionic symmetries are coupled to anomalous U(1)’s [32, 33], but the final result is analogous.
In the above descriptions (and in other QFT constructions), ALPs are fundamental fields in the theory. However this is not the only possibility. ALPs can also be composite or emergent. In its most common realizations [34]-[40], [15], the composite axion is identified with a neutral pseudo-Goldstone boson of an extra, strongly coupled, confining gauge theory, whose dynamical scale determines the axion coupling . In these models, the axion can be rendered “invisible” only if , [15]. As a common feature, the Peccei-Quinn symmetry in these models acts on some extra fermion fields. Moreover, as a difference with fundamental axions, in simple composite axion models there is no direct coupling between the axion and the lepton sector.
In a more general setup, recently considered in [41] and motivated in [42], ALPs emerge as composites of a hidden sector with a large number of colors (), which couples to the Standard Model (SM) via interactions that are irrelevant in the IR. This fact ensures that one can have an invisible emergent axion even if the strong coupling scale of the hidden gauge theory is much smaller than . These emergent/composite axions do not rely necessarily on fermionic Peccei-Quinn like symmetries, but rather on the approximate Peccei-Quinn symmetry associated to instanton densities in gauge theories.
The holographic correspondence [43] provides a paradigm for such a type of composite/emergent axions. In the holographic context, in fact, composite operators in a QFT are mapped into fundamental fields in the dual string/gravity theory. Hence, standard string theory axions can be related to composite operators in QFT. This is the case for the QCD topological term [44]-[51].
The holographic QCD axion model recently proposed in [52] provides a further different way of realizing a composite axion. In the model there is no hidden gauge theory sector, hence no extra dynamical scale. The Peccei-Quinn symmetry acts on an extra massless quark flavor with a (non-local) Nambu-Jona-Lasinio (NJL) quartic interaction. The latter has a suitable higher dimensional UV completion in the holographic model and it induces the condensation of the extra quark at a scale , of the order of , set by the NJL coupling. If is taken to be much higher than the QCD chiral symmetry breaking scale, the extra quarks and the related massive hadrons essentially decouple from the Standard Model. The only exception is the pseudo-Goldstone boson of the extra chiral symmetry breaking. This is identified with the composite QCD axion. Its interactions with the low energy QCD modes are precisely described by the axion-dressed chiral Lagrangian [14].
The couplings of the QCD axion and other ALPs to ordinary matter are crucial in axion searches. They typically receive contributions both from the UV realization of the ALP theory and from the QCD chiral Lagrangian and the structure of the axial anomaly [10, 9, 11, 13]. These are combined with some input from QCD matrix elements that can be extracted from experiments or, more recently, from lattice data [53]. State of the art derivative axion couplings to nucleons are obtained in this way.
There exist also non-derivative couplings to nucleons that so far have not received much attention (see however [54]). There are two reasons for this. The first is that such couplings, when they arise from the CP-preserving effective Lagrangian for the nucleons, are effectively derivative couplings. This is the case when the nucleons are near the mass-shell, as it happens at low enough energies. If the couplings arise in the CP-violating sector of the effective Lagrangian, then they are suppressed by the small CP violation due to a residual angle and/or to SM effects.333The CP-violation due the -term is constrained to be tiny by the neutron electric dipole moment measurements. The CP-violation appearing in the CKM matrix trickles also down to the hadronic sector at high loop order, [55, 56, 57]. Similar remarks apply to the leptonic CKM matrix, [58]. The second reason is that such non-derivative couplings to nucleons depend on non-trivial and non-perturbative QCD physics, and chiral symmetry considerations (and the chiral Lagrangian) are not helpful enough to pin them down.
In this paper we will undertake an analysis of such couplings. There are two main motivations behind our analysis. The first is that ALPs are generically speaking light particles and the range of allowed masses towards the lower end has expanded substantially in recent years, especially as candidates for dark matter, [17, 18, 20, 59]. In such cases, non-derivative couplings, even if small, can become important in coherent collections of ALPs. The second is that we now have a new theoretical tool to address non-perturbative nucleon physics and this is the holographic correspondence.
In the holographic approach, QCD is modeled by a large strongly coupled gauge theory, which can be studied by means of a dual weakly coupled string model on a classical gravity background. The top-down holographic model which better reproduces the main features of low energy QCD is due to Witten, Sakai and Sugimoto (WSS) [60, 61]. Nucleons in this theory have been first studied in [62, 63, 64]. They are realized in a way which resembles the traditional Skyrme picture [65], where nucleons arise, at large , as solitons of the chiral Lagrangian [66]. In the WSS model they are instanton solutions of a five dimensional gauge theory ( being the number of flavors) on a certain curved background. After reduction to four dimensions this theory encodes not only the chiral Lagrangian (including the Skyrme term) for the QCD-like pseudo-Goldstone bosons and the , but also the effective Lagrangian for the massive axial (vector) mesons. Moreover, couplings and masses for the mesons are determined by the few parameters of the model. The holographic QCD axion in [52] is embedded in the WSS model.
There are moreover bottom up models for holographic meson and baryon physics that have been analyzed in the past. The first and crudest of them named AdS/QCD, [67, 68], reproduced the physics of the chiral Lagrangian to leading order and has given rather good predictions of parameters for the chiral Lagrangian beyond the two-derivative level. There are more sophisticated bottom-up models for QCD, like V-QCD, [69, 47] that improves importantly on AdS/QCD.
In this paper we embark in an effort to determine the non-derivative couplings of the axion to nucleons under three different perspectives: standard effective field theory, Skyrme model and holographic WSS model.444Analysing the same effects in the more sophisticated but also more difficult theory, namely V-QCD is beyond the scope of the present work. In our analysis of the Skyrme and holographic model, the large limit is taken by construction. In all the three approaches, the axion couplings can be related to those of the and the pions to the nucleons, after considering the mixing between the axion and these mesons. One loop corrections in the effective theory of nucleons and mesons can contribute too. The mixing arises diagonalizing the mass matrix in the low energy effective action. It is worth emphasizing that meson-nucleon couplings have been largely studied in chiral perturbation theory. For instance, the CP-odd pion-nucleon coupling has been computed in e.g. [70, 71, 72, 73] for at various orders in the perturbative expansion. We will show how our results in the Skyrme and holographic model compare with those findings.
A word of caution is in order. The Skyrme and holographic models, while reproducing many features of low-energy QCD, are certainly not suitable for precision physics at percent level as there is no formal parameter that controls their accuracy. Nevertheless, their use alongside the standard Chiral Lagrangian approach is justified by the following considerations. To begin with, the CP-odd sector of the axion physics is difficult to be analyzed directly within any first-principle model (e.g. one has a sign problem on the lattice). Moreover, while in the original Skyrme model the nuclear binding energies are notoriously overestimated, its extension including vector mesons improves significantly this issue [74]. Finally, bottom-up holographic models for meson physics have been surprisingly successful in reproducing experimental data, and organizing (subleading) parameters that are theoretically unconstrained in the chiral Lagrangian. The holographic WSS model is a top-down model and has a proven discrepancy with experimental results of less than 30% (in average) when dealing with static properties of nucleons, including couplings similar to the ones computed in this paper [75]. It is also worth mentioning that if, in principle, Chiral Lagrangian calculations (combined with large arguments) can be systematically improved, in practice they are limited by unknown low-energy constants. Large models like Skyrme and WSS can provide model-dependent estimates of such unknowns.555Moreover, in principle, the top-down holographic QCD approach can also be systematically improved, up to the point of providing a string dual of real world QCD: however, in practice, this reveals to be a very hard task.
Our main results are those for the CP-odd non-derivative couplings of the pseudoscalar mesons and the axion to the nucleons. They can be summarized as follows.
- •
Using chiral perturbation theory, we derive the expressions for the CP-odd interaction terms, at any number of colors and flavors , between or the axion and nucleons (transforming under generic spin-half representations of the gauge group) in terms of the parameters of the effective Lagrangian. The main results are the Lagrangian terms in equations (3.41)–(3.42) (for contributions arising through a “residual” -angle, possible even in the presence of the axion666There are many possible contributions to in our formulae, and include uncanceled contributions from the strong sector or induced by the CKM CP violation.).
- •
We estimate, in turn, the one-loop contributions to the non-derivative couplings in effective field theory using previous results, [76] and extending them to general .
- •
In chiral perturbation theory, we write the CP-odd non-derivative couplings of the axion to the nucleons both in terms of the CP-odd couplings of the pions and , equation (4.80), and, alternatively, in terms of the pion-nucleon sigma term and the strong contribution to the neutron-proton mass splitting, equation (4.87), extending the formula in [54].
- •
We compute the CP-odd couplings (due to the effective angle and other sources of CP-violation in the QCD sector) of the axion, the meson and pions to nucleons in the Skyrme model. The main results for are given in equations (5.106) and (5.107), and for in equations (E.226), (E.229) and (E.230). To the best of our knowledge, these couplings have not been computed in the literature so far.
- •
We compute in turn novel results for the same couplings in the WSS model. The results for are given in equations (6.131), (6.132), and (6.133). The generalization to is analogous to the computation in the Skyrme model.
In section 7 we will compare the numerical estimates of the axion couplings to nucleons obtained in model-independent expressions from chiral perturbation theory plus lattice and phenomenological inputs, with the ones obtained in the Skyrme and holographic WSS models. The first ones are affected by some uncertainties on the value of the sigma term, while the latter depend on the specific models. Although there is a certain discrepancy in the results, a reasonable range of the value of the couplings appears to be
[TABLE]
having in mind an effective Lagrangian term of the form , where is the nucleon doublet and is the axion. Here we do not distinguish the proton and neutron couplings, whose difference is smaller than the proposed window of values. Also, we have taken into account the one loop corrections in the effective theory, discussed in section 3.3. They are somewhat smaller compared to the numbers in (1.1) for , .
The paper is organized as follows. In section 2 we review the construction of the QCD chiral Lagrangian, containing the meson and the pions, coupled with the axion. In particular we write down the field redefinitions suitable for diagonalizing the related mass matrix. In section 3 we discuss the effective theory describing the coupling of mesons and axions to nucleons and deduce the general structure of the non-derivative axion-nucleon couplings. One loop corrections are discussed here too. These CP-odd non-derivative axion couplings are expressed in terms of the pion and couplings to nucleons, and in turn of the pion-nucleon sigma term and proton-neutron mass difference in section 4. In section 5 we compute the couplings in the Skyrme model with QCD flavors. In section 6, after a short review of the description of nucleons in the holographic setup, we perform analogous computations in the WSS model with QCD flavors. A summary and an analysis of the results obtained in sections 4, 5 and 6 is given in section 7. Review material, further technical details and the results for QCD flavors in the Skyrme and WSS models are provided in the appendices.
2 The effective Lagrangian couplings of the axion to the
The general structure of axion models and their UV completions are summarized in appendix A. Here, we examine the effective Lagrangian (containing the pions and the meson) for QCD coupled to the axion. Our aim is to write the effective theory for generic axion couplings rather than choosing a specific “frame”, which allows us to check that the final results, in term of physical states, are frame independent. This will be useful in particular when considering the couplings of the and the axion to the nucleons as we will do in Sec. 3.
The QCD Lagrangian with generic renormalizable axion couplings777There may also be non-renormalizable axion couplings if the theory that generates the axion is an effective theory. This is the case in the context of composite or emergent axions, [41] but may also be relevant for the case of fundamental axion fields that appear at intermediate scales. is given by
[TABLE]
Here are the quark fields, we suppressed flavor indices, used mostly plus convention for the Minkowski metric, and also included the axion dependence in the mass terms as well as the axion kinetic term. The -angle can be absorbed in the constant term of so we set it to zero – effects due to finite will be considered later on. We also included the current term here, but as we argue in Appendix B, it does not affect those mixing terms between the axion and mesons which we are interested in. Therefore we will set the current to zero in the following.
We shall then derive the low energy effective action for pions and the mesons which contains the axion couplings determined by (2). Axion couplings in effective theories for QCD have been considered in earlier works [77, 11], see for example the fairly recent detailed analysis [13]. In our analysis, it is essential to also include the effect of the meson for two main reasons. First, we are interested in the couplings of the axion both at small and large , and the axial anomaly of QCD is suppressed at large so that the meson is as light as the pions. Second, as we shall see, the mixing between the and the axion is particularly important for the non-derivative couplings of the axion to the nucleons which we will analyze below. Earlier work considering the effect of the in the axion-meson mixing include [12, 78]. Since we are mostly interested in the mixing of the axion with the mesons, we will concentrate on the terms giving rise to quadratic terms in the action.
The first interaction term in (2) couples, in the effective theory, the axion to the pseudoscalar glueball field [77, 79]. The glueball can be integrated out, giving rise to the “standard” effective Lagrangian with the -angle replaced by the axion field. The meson fields are included in
[TABLE]
where . The matrix transforms as
[TABLE]
where are matrices. We normalize the group generators as
[TABLE]
Notice that is not a mass eigenstate but will mix with the other fields, as we will discuss below.
With these conventions, the effective Lagrangian that is the low-energy theory of (2) (with the axion derivative/current term omitted) can be written as,888As usual this is based on symmetries only. See Appendix B for a more detailed discussion.
[TABLE]
where is the pion mass matrix, and is the Yang-Mills topological susceptibility.
We allowed the to have a different decay constant than the pion [80] by including the last term in (2). Requiring the standard kinetic term normalization fixes
[TABLE]
The meson mass matrix is proportional to the quark mass matrix, up to higher order chiral corrections.
We assume that it is possible to write the mass matrix in the form
[TABLE]
(possibly after vectorial axion-valued transformations) where999We do not need an explicit formula for this matrix, but a convenient choice would be . is Hermitean and independent of . It is then straightforward to solve explicitly for the mixing between the mesons and the axion. Effects due to an anti-Hermitean component in the mass matrix are discussed in Appendix B. We define the topological susceptibility of full QCD, , and the “gauge invariant” coupling of the axion, , as
[TABLE]
In order to diagonalize the Lagrangian, we first carry out the following field redefinitions (see also [12]):101010The p stands for “physical”.
[TABLE]
which remove the mixing of the pions and with the axion up to terms suppressed by higher powers of . More precisely, the mixing terms in Eqs. (2.10) and (2.11) are determined by the non-diagonal elements of the mass matrix in (2). Notice that the terms involving are (up to nonlinear terms in the pions) equivalent to a chiral rotation which removes the dependence of . Then the other terms are sourced by the gauge invariant coupling . After applying (2.10) and (2.11), the transformation (2) removes the non-diagonal kinetic terms.
Inserting these relations in the Lagrangian (2), the quadratic terms (with terms suppressed by dropped) are included in
[TABLE]
where
[TABLE]
and the axion mass squared is given by
[TABLE]
The fields and are however not mass eigenstates (for a generic quark mass matrix) because they are subject to the “usual” mixing of neutral states imposed by the quark masses. The final physical states and are found by diagonalizing the mass matrix arising from (2.13) which in general needs to be done numerically.
CP-odd couplings and a finite -angle (which may be induced by linear couplings of the axion to other sectors not considered here) introduce small VEVs for the mesons. Such VEVs will be important (apart from the mixing between the axion and the mesons) when considering their couplings to the nucleons in Sec. 3. We discuss these VEVs in Appendix B.1. We also include results for the CP-violating vertices of three pseudoscalar mesons in Appendix B.2.
Finally, the scaling of the QCD parameters at large and/or for our conventions is the following. The decay constants scale as
[TABLE]
while the term , which induces the splitting of the pion and decay constants, is suppressed by a power of since it is the coefficient of a double trace term. We normalized this coefficient such that it scales as
[TABLE]
In this case, the difference of the decay constants also obeys
[TABLE]
We also note that
[TABLE]
3 The effective couplings of and the axion to nucleons
In this section we proceed further and discuss the effective field theory that couples the mesons and the axion to the nucleons. In particular we analyze the CP-violating non-derivative couplings of the and the axion to the nucleons.111111The lowest dimension CP-preserving couplings for on-shell nucleons reduce to derivative couplings.
3.1 Generic and
We shall discuss here the effective theory for the nucleons and their couplings to the pions, and the axion for generic values of and . Specific results for will be given in Sec. 3.2. We start by writing down the couplings in the absence of the axion and the -angle, and then discuss how they are introduced in a generic frame. Working in a generic frame will help us to check that the results behave correctly under chiral transformations. We shall work at any odd value of , and for simplicity only consider spin baryons but in arbitrary representations of the flavor group.
In order to describe the couplings of the baryons to the mesons and the axion, the standard method, [81], is to first define a new matrix through . Because the square root of a matrix is ambiguous, this is not enough to make the definition precise; we can take
[TABLE]
This matrix transforms as
[TABLE]
under the chiral transformations (see, e.g., [82]). Here is a unitary matrix depending on the pion fields and defined121212For a precise definition and discussion, see, e.g., [81, 82]. through the equation in (3.21). The baryons belong to the -index representations131313One should keep in mind that in the limit of large there would be more relations than what we impose here: the nucleon states are arranged according to the spin-flavor symmetry , with the ground state being the fully symmetric -index representation in this limit. These representations break into towers of states classified according to their spin and flavor representations separately [83, 84, 85, 86]. of (the vectorial) and transform according to [87]
[TABLE]
The baryon bilinears may therefore couple to meson operators transforming as . Such operators include
[TABLE]
where is the quark mass matrix and was defined in (3.20).
The derivative couplings of the pions and the arise from the terms
[TABLE]
where “” stands for all possible (invariant) contractions of the flavor indices of the two nucleons and the meson matrix. We have also scaled out the leading expected behavior of the coupling constants on and so that the expectation is
[TABLE]
Notice however that we may still have subleading dependence involving terms and . We will follow similar conventions for other couplings below, unless stated otherwise.
In the second term of (3.24) we include a sum over various contractions (with couplings , ,…) of the flavor indices of the baryons and before taking the trace, which are consistent with chiral symmetry. Not all of the possible terms are independent, but their relations are nontrivial for baryons transforming under generic representations of the chiral symmetry. The nontrivial piece amounts to combining the baryon, the part of the meson term transforming under the adjoint representation, and the anti-baryon in the conjugate representation into a singlet. We will explain the structure in more detail in Appendix C.3, considering and as examples.
The precise knowledge of the structure of these terms is not needed for most of our results. In the limit of large it has been solved in [86]. Moreover, for the coupling of the flavor singlet meson, these terms reduce to the standard trace, i.e., they have the same form as the first term, but are leading by a factor of . The suppression of the double trace term is consistent with the expected enhancement of the chiral symmetry141414In the limit with finite the U(1)A anomaly vanishes. to in the limit of large . In other words, (3.24) may be replaced by
[TABLE]
where we projected to the adjoint representation in the second term by rearranging the singlet contribution. Therefore
[TABLE]
and according to (3.25)
[TABLE]
In this article, however, we are interested in the non-derivative couplings to the nucleons, which we will discuss next.
3.1.1 Non-derivative couplings
We go on to discuss the non-derivative couplings of the nucleons to the , the pions, and the axion. Such couplings are absent when CP is conserved, but they can arise through a nonzero -angle which in turn can result from additional UV terms in the action, which drive the expectation value of the axion away from zero. Other CP-violating contributions than those appearing effectively through the -angle are also possible, we analyze them in Appendix C.
We first write down the relevant terms in the effective Lagrangian in the absence of the axion and the -angle, and then discuss how the axion and are included. A class of non-derivative couplings arises from the mass terms involving (that were defined in (3.23)):
[TABLE]
Several comments are in order.
The terms in (3.29) give rise to corrections to the nucleon masses, and they are , in agreement with the scaling of the nucleon mass. Notice that we separated again a piece from the flavor singlet contributions in order to write the more complicated non-singlet terms in terms of the combinations which vanishes (up to higher order interaction terms in the pions) for flavor independent quark masses.
The terms containing an extra are suppressed by a power of . Considering the singlet term as an example, this can be seen from the identity
[TABLE]
where the first equality is exact when the nucleons are on-shell and we used the fact that the baryon mass (see also Appendix D).
From (3.31) we also observe that the terms involving are, moreover, derivative terms in disguise and they are suppressed in the non-relativistic limit. This is not entirely true for the single trace “contraction” terms in (3.30), because of the flavor structure: the single trace terms of (3.29) give rise to flavor dependence of the nucleon mass which leads to a non-derivative contribution from (3.30). These contributions are of the second order in the differences . The single trace term in (3.30) will not, however, contribute to the linear axion couplings because the axion dependence also cancels at linear order due to the subtraction of the singlet piece after substituting (2.10) and (2.11) as we shall show below. This holds true also for the mixing effects between the pions (arising from expanding the ) and the axion. Therefore we shall not discuss the terms in (3.30) further.
Another interesting class of terms are the flavor singlet terms which couple the to the nucleon singlets. Possible terms are
[TABLE]
The scaling with was determined as follows: every factor of brings a power of with respect to the nucleon mass term and the terms in (3.29) and in (3.30) [88]. Recall that the term is in addition suppressed by due to the factor.
In order to extract the non-derivative couplings from these Lagrangian terms, we consider the axion couplings induced by the axion terms in the QCD Lagrangian (2) and turn on a finite -angle. We replace151515One can also introduce the -angle by shifting (after these transformations) or by adding the -angle in the quark mass matrix: all these will lead to the same results due to chiral symmetry.
[TABLE]
and
[TABLE]
where is assumed to be Hermitean. In terms of the axion, the -angle is therefore introduced effectively by replacing
[TABLE]
Notice that also the meson mixing in (2.10) and in (2.11) will be affected: a finite -angle leads to a small displacement of vacuum expectation values (VEVs) of the pions and the from zero.161616Our convention is that after the shift, the VEV of the axion is zero so that the constant mode of the axion is included in . This is equivalent to introducing through an extra term in the Lagrangian which is linear in the axion and independent of the QCD fields. Other sources of CP-violation than the -angle can also lead to such VEVs. We analyze them in Appendix B.1 and as it turns out, they do not lead to terms with essentially different structures than those terms which we derive in this section. In Appendix B we also analyze the effects of non-hermiticity of the quark mass matrix and show that they can be absorbed in the -angle.
In order to extract the contributions involving , we therefore need to study the nucleon couplings which are quadratic in the pion fields, the axion, and (at vanishing ). First we notice that
[TABLE]
where , the anticommutator is normalized as , and we already inserted the relations (2.10)–(2) but so far kept . We remark that in (3.39) the axion dependence is proportional to the identity matrix in flavor space.171717In Appendix B we point out that this proportionality is broken by higher order terms in the chiral Lagrangian. This breaking is however only present for terms which are quadratic or higher in the quark masses, which we are neglecting in this article. This verifies the fact that it is indeed absent in the second term of (3.30) (which we have already omitted). Notice that we discuss here only the trace term , see Appendix C.2 for the analysis of the second term in (3.29).
The terms which give rise to quadratic couplings of and the axion to the nucleons are those in (3.29), the second term in (3.32), and the term in (3.33). Inserting (3.37), (3.39) and (3.40) in these expressions, and adding through the replacement (3.36), we find the following couplings:
[TABLE]
where we neglected second-order terms in the quark masses. Since the axion coupling is proportional to , which also controls the nucleon mass corrections, there is a relation between the two (in the absence of other sources of CP-violation than the -angle). We will discuss this in more detail in Sec. 4. Notice also that the couplings on the second line (3.42) are suppressed by with respect to the couplings on the first line. Therefore the couplings on the first line should be compared to the Skyrme and WSS model analyses carried out in sections 5 and 6, which employ the ’t Hooft limit.
Contributions from the single trace term in (3.29) are analyzed in Appendix C.2: these include the CP-odd nucleon-pion couplings (C.203), and a correction to the nucleon-axion coupling (C.204), which is nonzero if quark masses depend on flavor. The CP-odd nucleon- coupling is independent of these terms. Recall also that the mixing of and the pions due to flavor dependent quark masses is not included in these results.
3.2 Results for
We now discuss the structure of the couplings at finite, fixed , taking (and any odd ) as an example. This will be useful in order to compare to the results of sections 4, 5 and 6. In addition, results for are given in Appendix C. First, we analyze the derivative couplings in (3.24). The three pions and the 181818In the case , what we call is, as usual, a “mathematical” isosinglet pseudoscalar which does not have any component, see e.g. [89]. This at does not correspond to the physical one, but it can be related to the physical and fields. In [89] for instance, taking into account the pseudoscalar mixing angle, it is taken as , so its mass is around 549 MeV. form a four-dimensional representation of which breaks into the iso-singlet and the iso-triplet of pions at finite . The ground state, spin- fermions, are in the fundamental isospin- representation:
[TABLE]
In this case the derivative couplings of (3.24) and (3.26) read
[TABLE]
where , and are the Pauli matrices.191919In this action we neglected interactions with other nucleon representations like the . There is only one independent term in the sums over contractions in (3.24) and (3.26). The flavor structures of the mass terms (3.29)–(3.30) are analogous to the derivative terms.
We note that in the limit of large , the derivative couplings of the and the pions become identical since in this limit, signaling the enhancement of the chiral symmetry from to .
When , it is natural to take the light quark masses to be equal, , so that isospin is unbroken and does not mix with the pions. In this case the couplings of the previous subsection apply directly for the physical mass eigenstates , , and . The couplings of the axion and the mesons to the nucleons therefore read
[TABLE]
The terms on the first line arise from (3.41) and (3.42). We also included the pion coupling on the second line, which depends on the single trace coupling in (3.29); see eq. (C.203) in Appendix C.2. Recall that in this case the pion mass is given by in the chiral limit.
We note that the same coupling also controls the QCD contributions to the mass splitting between the proton and the neutron, so that there is a relation between the mass splitting and the CP-odd pion-nucleon-nucleon coupling (see, e.g., [72]). We will discuss this relation in more detail below.
Similarly to the derivative couplings above, equals at large up to corrections suppressed by . Since also in this limit, the couplings of the pions and that of the in (3.47) to the nucleons become equal, signaling the enhanced chiral symmetry.
3.3 The one-loop corrections
The meson-nucleon couplings of the effective Lagrangian described above give rise to further contributions to the couplings of interest via quantum effects. These quantum effects are determined by the effective meson and nucleon fields. In the ’t Hooft limit, they are suppressed by powers of , while in the Veneziano limit, they may be unsuppressed.
We will discuss such effects below. In the case of , , they were calculated in [76] but we will extend here this calculation to arbitrary and .
The starting Lagrangian for the pion-nucleon interactions was given already in the previous subsections, but we will abstract here the couplings that are of interest. The and couplings to pions from the chiral Lagrangian are
[TABLE]
where the couplings behave as
[TABLE]
in the chiral and large limits (see Appendix B.2). Similarly the coupling to the axion can be written as
[TABLE]
where scales as . The couplings to the nucleons are202020The contribution from (3.41) scales as .
[TABLE]
We also need the kinetic terms of the spin 1/2 nucleons and the spin-3/2 , which in the conventional case are
[TABLE]
above stands for the spin-1/2 nucleon iso-doublet while for the iso-quartet. The covariant derivatives and were defined in (3.23) and the matrix in (3.20). are the Clebsch-Gordan matrices that couple the doublet, the quartet and the triplet of . From (3.45) we have
[TABLE]
where the last equality above with the coupling of the nucleons to the axial current being valid in the chiral limit.
At arbitrary and , the field represents the lowest-lying flavor representation of spin 1/2, that is the in Dynkin notation. The spin 3/2 representation is the flavor representation and so on. In the general case, there may be also higher-spin baryons with spin larger than 3/2 and up to . The parameters , and scale as . The matrix is a unitary matrix while the pion current is a Hermitean matrix.
The flavor structure of the pion-nucleon interaction involves where is an adjoint index of and are indices in the flavor representations of the nucleons. At general , these can be any representation in the tensor product . We will be assuming here exact flavor symmetry so that all quark masses are the same.
We discuss first the one-loop couplings of the and mesons to the nucleons and consider the axion coupling below. The diagrams that contribute at one loop are shown in figure 1. Diagram (a) has ground-state nucleons as intermediate states while diagram (b) has higher spin nucleons. Diagram (c) gives a vanishing contribution due to isospin symmetry as charge conjugate pions give opposite sign contributions. Notice that the CP-violation must arise from the three-meson vertex because this vertex vanishes if CP is conserved.
For the rest, we will call the flavor representation of the ground-state nucleons and all the others that give rise to the analogues of . The flavor couplings that enter in in (3.53), , are the Clebsch-Gordan coefficients linking the adjoint, and of the flavor group. The index is that of the ground-state nucleon while is that of the . Of course for higher spin nucleons, is a tensorial spinor, and the space-time factors in the couplings in (3.53) will change appropriately. These are changes though and we will not worry about them at the moment.
For the computation of the loop diagrams we follow Ref. [76]. The covariant extended on-mass shell scheme is used for renormalization [90, 91]. The diagram (a) gives at , [76]
[TABLE]
[TABLE]
where the renormalization scale was set to the nucleon mass and the external nucleons were put on-shell. In the general , case, where the coupling is between the nucleon states and we must multiply (3.55) by of the flavor group where is the standard quadratic Casimir of the relevant representations. The axial coupling in (3.55) scales as at large while the scaling of is in (3.50). From (3.55) we deduce that212121This estimate is valid if we take first the chiral limit and then the large limit. It is known that the chiral limit and the large do not commute for generic observables.
[TABLE]
as and .
The -mediated diagram has the contribution
[TABLE]
where the dimensionless function can be found in [76]. In the large limit it scales as while . We obtain
[TABLE]
At general and , the diagrams of type (b) corresponding to the coupling of external nucleon states and are multiplied by
[TABLE]
where are the Clebsch-Gordan coefficients linking the adjoint ( index), the ground-state flavor representation ( index) and the higher spin intermediate flavor representation (index ). There are also differences due to the different spin structure of the intermediate states. The conclusion is that all such individual contributions scale at large as in (3.56).222222Notice that at large both the nucleons and the (as well as the higher spin states) are actually packed into the completely symmetric representation of the combined spin-flavor group [85, 86].
For , from [76] we obtain
[TABLE]
[TABLE]
The results in the extended on-mass shell scheme are suppressed by about 30% with respect to the heavy-baryon limit [76].
The one-loop couplings to the axion can also be extracted from the above results: this amounts to replacing by in (3.55) and in (3.57). We find that
[TABLE]
Finally, we can estimate these loop contributions as a function of the angle as follows. By using the expressions for the three meson couplings in chiral perturbation theory, (B.188) and (B.189), we find that
[TABLE]
where we took the limit , and used MeV, MeV, MeV, and for the mixing angle defined in (B.178). We obtain for the CP-violating one-loop couplings
[TABLE]
Similarly, from (B.191) (setting and taking the chiral limit with two light quarks) we find that
[TABLE]
so that the one-loop axion couplings can be estimated as
[TABLE]
4 General IR relations for the couplings
Before considering large models in detail in the following sections, let us consider model-independent low energy relations between the couplings valid also at small (partially recollecting what we have learned in the previous sections).
We consider the chiral Lagrangian (2) choosing the scheme where the axion is entirely contained in the topological term [14]
[TABLE]
where is defined in (2.3). We concentrate on flavors, where the quark mass matrix reads and are the Pauli matrices. The GMOR relation sets
[TABLE]
As we have seen in section 2, (4) generates a mixing of the with the axion, and in turn the latter with the pions. Neglecting the mixing terms between and pions, the mass eigenstates at leading order in are now (see equations (2.10)-(2))
[TABLE]
where
[TABLE]
is the full topological susceptibility of the theory. The previous expressions show that the mixing of the axion with pions only occurs if the flavors are non-degenerate in mass.
The couplings of the and the pions to nucleons are defined via the following effective Lagrangian terms
[TABLE]
where is the two component vector for the spin nucleons , the first two couplings are CP-even, while the last two are CP-odd and arise in the presence of any effective non zero -angle or any other CP-violating source.
The axion couplings to nucleons, to leading order in , can be obtained from the ones above, simply replacing and with the respective physical fields defined in (2.10), (2.11). Notice that in the non-relativistic limit, which is justified at energies much smaller than the nucleon mass , the non-derivative CP-even couplings can be traded for derivative ones since, for any scalar field
[TABLE]
These meson-nucleon couplings were already analyzed using effective field theory in Sec. 3; the results for derivative couplings are given in equation (3.46) and for the non-derivative CP-violating couplings in equations (3.47) and (3.48).
If we define the derivative axion-nucleon couplings as
[TABLE]
we find, using (4.72), (4.71) and (4)
[TABLE]
[TABLE]
To leading order in the chiral limit, where the mass of the pion is much smaller than that of the (i.e. when ), the expressions above can be rewritten as
[TABLE]
[TABLE]
where the isoscalar and isovector axial couplings, obtained from the matrix elements of the axial current between baryon states (the axial form factors), are related to those of the and the pion by means of the generalized Goldberger-Treiman relations
[TABLE]
The couplings (4.76), (4.77) are, for the case , precisely the “universal” ones obtained in any axion model in the KSVZ class [6, 7].
The non-derivative axion-nucleon couplings
[TABLE]
are present whenever the Peccei-Quinn mechanism is not perfect (or if weak interaction CP-breaking effects are taken into account). As already recalled, explicit breaking of the Peccei-Quinn symmetry by higher-dimensional operators shifts the VEV of the axion. These operators are generically present in most UV completions of axion models. Even if suppressed by powers of the Planck mass, up to dimension ten their coefficient must be very small, in order to cope with the experimental bounds (see e.g. [92, 93]) setting , [94, 95]. The extent of fine-tuning needed for these coefficients determines the so-called “quality” of the axion. Therefore, on general grounds one can expect that the axion does not cancel completely the -angle. In such a case, we can study linear CP-breaking couplings of the axion and to nucleons, with a (non-vanishing) -dependent coefficient.
Taking the axion-meson mixing (4) into account, we can write
[TABLE]
Therefore, to leading order in the above mentioned chiral limit
[TABLE]
Let us introduce the shorthand notation for the first and second term in each line of (4)
[TABLE]
In [54] a relation between the common term in (4) and the pion-nucleon sigma term in the chiral limit has been derived. We will re-derive it in the degenerate case in the Skyrme and WSS models in sections 5, 6. We denote by the nucleon mass in the isospin-conserving limit, where the two light quarks have a common mass equal to . By virtue of the Feynman-Hellmann theorem, the sigma term is then
[TABLE]
The relation of the sigma term with the part of the axion coupling232323Note that this implies a relation between the coupling and the sigma term. reads [54]242424In [54] the formula is obtained by considering the nucleon matrix element of the interaction operator . The latter is extracted, in the limit, from the expansion of the quark mass term in the Hamiltonian, once the -term has been rotated in the mass sector and the energy has been minimized.
[TABLE]
where
[TABLE]
Moreover, working again in the chiral limit, the pion coupling (and so the term defined above) is known in chiral perturbation theory to be related at tree level to the strong contribution to the neutron-proton mass splitting, which we denote as (see e. g. [71] and sections 5, 6), as252525In the case an analogous relation holds involving the mass difference of and baryons [70].
[TABLE]
Note that the term is of .
Thus, using (4), (4.84) and (4.86), we can write universal low energy relations, valid for flavors in the chiral limit () at tree level in chiral perturbation theory and up to third order in the quark mass difference , generalizing the expression in [54],
[TABLE]
In summary, in these relations the CP-odd couplings of the axion to neutrons and protons are expressed in terms of the pion-nucleon sigma term and the strong contribution to the neutron-proton mass splitting. The latter can be estimated with phenomenological extractions or lattice calculations. We are going to comment on the numerical evaluation of these couplings in section 7.
Notice that what we have previously denoted by is the value of the couplings in the quark mass degenerate case, which corresponds to and . We will keep this definition in the rest of the paper.
5 The Skyrme model picture
In this section we consider the couplings discussed above in the context of the Skyrme model, where the nucleons can be seen explicitly as solutions in the low-energy Lagrangian. In the Skyrme model, one treats baryons as solitons of the chiral Lagrangian, stabilized with the aid of a quartic term in , the so-called Skyrme term [65]. The axion-dressed Lagrangian (4) thus contains now a further term
[TABLE]
The coupling in the Skyrme term scales with as . Moreover, we work at large and finite , so that and double trace terms are discarded in (4). Hence, for instance, in the mass-degenerate case , the relations (4) reduce to
[TABLE]
where
[TABLE]
Up to corrections of order , the masses of the physical states are
[TABLE]
where in the first line we have introduced the Witten-Veneziano mass .
The Skyrme model without the axion allows to compute the meson-nucleon couplings defined in eq. (4.71). From these, using the general formulae collected in the previous section, the axion-nucleon couplings can be deduced.
Notice that in the Skyrme model, both [66] and [89] have been computed (the latter one in a suitable extension of the Skyrme model including some vector mesons). To the best of our knowledge, the CP-odd couplings and have not been computed in the Skyrme model so far. The coupling in the Skyrme model with a finite -angle has been argued to be suppressed in the large limit [96]. In the chiral limit (which in general does not commute with the large one) it has been computed in e.g. [70, 71, 72, 73].
In the following subsection, considering the non-degenerate case, we will directly extract the non-derivative axion-nucleon couplings and, in turn, and , showing in particular that the relations (4.84), (4.86) and (4.87) are indeed reproduced.
5.1 Nucleon mass terms and CP-odd couplings for
The starting point of our analysis is the computation of the quark mass contribution to the nucleon mass in the non-degenerate case. It can be extracted from the mass term of (4), set on-shell on the Skyrme solution, once we subtract the vacuum configuration and using the GMOR relation . This gives
[TABLE]
where
[TABLE]
describe the ansatz for the extended Skyrmion solution of [97, 98] which we truncate to the -pion sector. The generalized pseudoscalar-vector meson Lagrangian considered in [98] contains in fact various mixing and isospin-breaking terms involving the pion, the and other vector mesons. These terms contribute to the Skyrmion equation of motion and to the nucleon mass terms. However they are expected to provide subleading corrections (see e.g. comments in [99]) and we will neglect their effect here.
In eq. (5.94) is the angular momentum and the moment of inertia of the nucleon. Moreover , , () are the Pauli matrices and is the two-dimensional identity matrix. To leading order in and in the small quark mass limit, the function is the solution of the equation [66]
[TABLE]
where with , and the conditions for having baryonic number equal to one are , . The equation can be solved numerically. The solution asymptotes as at large .
The static Skyrmion solution is described by the matrix , and the quantization is achieved in a non-relativistic limit passing to a slowly rotating time-dependent solution where the matrix accounts for the time-dependent (for ) moduli. Some detail on the instanton moduli quantization, valid mutatis mutandis for both the Skyrme and the holographic WSS model, can be found in appendix F.
In the extended Skyrme model of [97, 98], where the Skyrmion emerges as a soliton solution of an effective Lagrangian including and vector mesons, the quantization includes a time-dependent solution for the soliton component [97], whose form is given in eq. (5.94) above. The function is extracted from the equations of motion. The latter include (at least) two parameters, related to the vector-vector-pseudoscalar coupling and the vector-pseudoscalar3 coupling . These have been extracted in the model at hand in [89].
The soliton component is proportional to and thus to the soliton angular velocity Tr which clearly vanishes in the static limit. Time derivatives of the moduli are usually suppressed in the model and therefore the contribution of this term will be generically suppressed too. However, this term provides the leading order contribution to the CP-even coupling. The explicit result can be found in section E of [89] for the case. By standard parameter fitting the authors of [89] obtain .
Since , we can expand the Lagrangian term (5.93) around . To first order in we get
[TABLE]
The corresponding Hamiltonian term is given by
[TABLE]
where is the third component of the isospin operator and we have used the relation
[TABLE]
From (5.97), recalling that (resp. ) on the proton (resp. neutron) state, we get the pion-nucleon sigma term and the quark mass contribution to the neutron proton mass splitting [100, 98]
[TABLE]
Notice that the sigma term can be given explicitly as
[TABLE]
where the constant is determined by the numerical solution for through
[TABLE]
Moreover, the neutron-proton mass difference can be obtained by numerically solving the equation for [98]. Notice that in the more general case of degenerate quark masses, one would get the same expression as in (5.100), as a result of the standard embedding of the Skyrmion solution into .
Taking the previous results into account, there is a quick way to get the CP-odd axion-nucleon couplings.
Following the procedure outlined in section 2, we can remove the axion-meson mixing terms in the vacuum solution in such a way that the net effect is just a redefinition of the quark mass matrix. In the “chiral limit” (with defined in (5.91)), treating the axion as an external field, a convenient choice is262626Following the notations in section 2, a natural choice of frame which indeed reproduces the nucleon Hamiltonian mass term in eq. (5.105), consists in taking and , so that .
[TABLE]
where . Hence, in the Skyrme plus axion case, after shifting the axion by to account for a residual angle, the nucleon mass term arises from the Lagrangian term
[TABLE]
where and are given in (5.94). Working to first order in , expanding for small and keeping only the terms in , we get
[TABLE]
Treating the axion as an external field, the corresponding Hamiltonian term reads
[TABLE]
which can be expressed in terms of the Skyrme model parameters using (5.99) and (5.100). This expression gives the Lagrangian CP-odd axion couplings precisely as in eq. (4.87). In terms of the model parameters
[TABLE]
For the more general case of degenerate quark flavors, the common factor in the denominator is replaced by .
Taking into account the axion-meson mixing, and the general IR relations (4), the previous expressions allow us to immediately find and . In the chiral limit they read
[TABLE]
where we have used eqns. (5.99) and (5.100). As a countercheck, we have controlled that the expressions above can be also obtained directly starting from the -dependent Skyrme Lagrangian272727See e.g. [101, 102] and improve their results with the extensions in [97, 98]. and considering the equations of motion for the and pion fluctuations around the Skyrmion solution.
If the Peccei-Quinn mechanism works perfectly, so that no residual term is left over, the results in (5.106) can be interpreted as providing the quartic axion-axion-nucleon couplings and those in (5.107) the meson-axion-nucleon couplings: it just suffices to trade back for . In practice in the chiral limit this gives
[TABLE]
In appendix E we will present related results in the non-degenerate case.
6 Large estimates in the WSS holographic model
In this section we consider a second model which allows to estimate the non-derivative couplings to nucleons of axion, and pions. It is the most successful top-down holographic cousin of planar QCD, know as the Witten-Sakai-Sugimoto model (WSS) [60, 61]. Although it has a higher-dimensional UV completion, it gives reasonable quantitative results for many QCD observables. Similarly to the Skyrmions, baryons are solitons of this theory. Moreover, the model allows to include systematically the contribution of the (axial) vector mesons.
In the next subsection, we will briefly review the model for the reader who is unfamiliar with it. In the following subsections we will compute the above mentioned CP-odd couplings in analogy to what we have done for the Skyrme model.
6.1 The WSS model
The WSS model is based on a brane setup in type IIA string theory. There are -branes wrapped on a circle of radius where antiperiodic boundary conditions for the fermions are imposed. Then -anti--branes are placed at antipodal points on that circle.282828This is not the only possibility. More general non-antipodal configurations can be also considered. However, it is only in the antipodal case that the glueball and meson mass scales coincide. At energies much smaller than , the dynamics on such branes is given by a dimensional large gauge theory with massless quarks. The massive sector contains scalars and fermions (with mass scaling as ) in the adjoint representation of the gauge group. In the limit where a simple holographic gravitational dual description of the model can be given, this massive sector cannot be decoupled from the QCD-like one. The classical gravity regime, in fact, amounts on taking and , where the ’t Hooft coupling sets the ratio between the confining string tension and .
The flavor sector in the model is described by the low energy modes of the -branes. When the latter can be treated as probes of the background sourced by the ’s: this corresponds to the quenched approximation for the quarks.292929See [103] for an account of the flavor backreaction to first order in . In this limit, the -anti--branes actually join in the background, realizing geometrically chiral symmetry breaking. Their effective action reduces to a Yang-Mills theory with Chern-Simons terms on a curved space-time in five dimensions
[TABLE]
where (in units303030The correct dimensions in the formulae of this section are recovered by inserting powers of where needed. with )
[TABLE]
Here is the radial holographic direction and we have omitted the wedge product symbol “” in the CS terms. We shall mostly focus on such an action in the case.
The matrix for the Goldstone modes is given by the path ordered holonomy matrix
[TABLE]
The effective action for precisely reduces to the chiral Lagrangian with the Skyrme term [61], with the pion decay constant and the coupling given by
[TABLE]
The pseudoscalar meson acquires a mass according to the Witten-Veneziano relation
[TABLE]
where is the topological susceptibility of the pure theory, given by
[TABLE]
This implies that in the WSS model the Witten-Veneziano mass,
[TABLE]
is much smaller than .
Baryons in the model are instanton solutions [63] with baryon number given by
[TABLE]
where is the space spanned by and is the field strength. Nucleons correspond to solutions.
In the case with massless flavors, using an notation for the gauge field
[TABLE]
a simple static instanton solution can be given around where the curvature of the background can be neglected. The related solution corresponds to a charged BPST instanton
[TABLE]
where and
[TABLE]
This solution depends on eight parameters: the instanton center of mass position , the instanton size and implicitly, three “angles” related to the fact that the solution can be rotated by means of a global gauge transformation. Precisely as in the Skyrme case of section 5.1, they are encoded in the matrix
[TABLE]
Substituting the solution (6.118) into the action (6.109) one finds
[TABLE]
where, up to corrections
[TABLE]
where gives the baryon mass in the , limit.
This implies that and are not genuine moduli; in fact they are classically fixed by minimizing as
[TABLE]
These relations imply that the instanton size is very small (but not zero) in the regime, and that the center of the instanton is classically localized at . The full expression for the static instanton solution valid for any value of in the limit can be found in [64].
In [48, 49], the solution above has been modified by including a (small) mass term for the flavor fields (taken to be degenerate in mass) as well as a (small) non-zero topological term. The deformation is driven (in units ) by the (small) parameter
[TABLE]
In the mass-and--deformed instanton solution, two new gauge field components are turned on, to first order in the above parameter. One is the Abelian component , which gives a non trivial vacuum expectation value to the meson, while the other is the non-Abelian component which can be interpreted as a dipole electric potential in the bulk. It is the latter component which is responsible for the nucleon electric dipole moment to be different from zero [48, 49], but it is the former which will be useful for our purposes below.
In appendix F we review the instanton moduli quantization, which will be necessary for the computation of the CP-odd couplings in section 6.3.
6.2 The derivative axion-nucleon couplings
As we have pointed out in section 5 the derivative axion coupling to nucleons, for the case, can be simply deduced once the CP-even couplings and are known. In the WSS model in the chiral limit these couplings have been already computed in [64]. They read
[TABLE]
clearly satisfying the generalized Goldberger-Treiman relations. Using (4.76), (4.77) the derivative axion-nucleon couplings follow.
It is worth noticing that the leading contribution to the isoscalar coupling comes from the time-dependent term, proportional to the angular velocity , in the solution for (F.235).
6.3 The non-derivative axion-nucleon couplings for
In the holographic WSS model, the nucleons correspond to leading order to the static instanton solutions (6.118) for the five-dimensional gauge field on the flavor branes. Moreover, the quantization of the solution proceeds in analogy with the Skyrme case and, as we review in appendix F, it generates (among others) a non-trivial time-dependent profile for the Abelian component which is related to the field. Furthermore, the quark mass term in the model, in the small quark mass limit, turns out to have precisely the same form as in the Skyrme case [104, 105, 48, 49]. Finally, and crucially, the WSS model allows for the explicit introduction of an axion [52], giving the standard modification of the low energy chiral Lagrangian as written in (4). The holographic set-up provides a UV origin of the axion model, but is an essentially free parameter.
All in all, focusing on the pseudoscalar sector, one can extract the quark mass contribution to the baryon Hamiltonian and the CP-odd couplings precisely as in the Skyrme model in section 5.
Let us consider the “chiral limit” with non-degenerate flavors. The starting point is thus the same Lagrangian mass term as in (5.93) where now
[TABLE]
and, from the time-dependent solution (F.235),
[TABLE]
where is the momentum conjugate to and is the instanton angular velocity defined in eq. (F.237). We shall not need the explicit expression for since, when computing the VEV of the above expression on the nucleon ground state, this will not contribute. We thus set in the following. Notice that scales like , just as in the Skyrme case.
The above ingredients are sufficient to extract the pion-nucleon sigma term and the neutron-proton mass splitting following precisely the same steps as in the Skyrme model case. To leading order in the holographic limit they are given by [106, 107]
[TABLE]
where
[TABLE]
and
[TABLE]
Since at low energy the effective axion-pseudoscalar Lagrangian in the WSS model [52] has precisely the same form as in the Skyrme case, the CP-odd axion-nucleon couplings are expressed in terms of the nucleon mass properties (6.128) precisely as in (4.87). In terms of the model parameters we thus get
[TABLE]
In the “chiral limit”, the CP-odd meson-nucleon couplings are given in turn by precisely the same relations as in (5.107). In terms of the WSS model parameters, they are given by
[TABLE]
and by
[TABLE]
Note that
[TABLE]
which is precisely the minimal scaling argued in [96] to occur in the Skyrme model.
It is worth noticing that the above expressions can be equivalently obtained by considering the instanton solution for the gauge field components corresponding to the pseudoscalar mesons in the WSS model at finite angle [48, 49].
Let us notice that, as in the Skyrme model case, the above results have been obtained by neglecting the effects of vector mesons which are expected to be subleading.313131This expectation appears to be confirmed by explicit computations in the complete model. We thank Lorenzo Bartolini for discussions on this point and for communication of his preliminary results.
In appendix G we will collect analogous results for the case.
7 Discussion and numerical estimates of the couplings
Let us summarize what we have learned in the previous sections. As a first result, we have observed that the non-derivative CP-odd axion couplings with protons and neutrons can be expressed, purely from low energy considerations valid for any number of colors , in two equivalent ways. The first one is in terms of the CP-odd couplings and of the and pions to the nucleons, as in formula (4.80) or its chiral limit (4). Then, by exploiting known results, the axion couplings can also be expressed in terms of the pion-nucleon sigma term and the strong contribution to the neutron-proton mass splitting, as in formula (4.87).
These have to be thought of as tree level relations in the language of chiral perturbation theory. Their relevance is that the axion-nucleon couplings are expressed in a model-independent way in terms of nucleon properties which, if not directly measurable from experiments, can be computed on the lattice or can be deduced from phenomenology. There is a vast literature on lattice QCD results for the pion-nucleon sigma term. The most recent papers, from e.g. the Wuppertal-Marseille [108] and the ETMC collaborations [109] report values of of and MeV, respectively. These lattice results exhibit a certain tension with phenomenological derivations from pionic atom data (see e.g. [110]) which essentially point towards a value of the sigma term which is around MeV.323232Further discussions on the sigma term issue can be found in e.g. [112, 113]. Lattice QCD results on the strong force contribution to the neutron-proton mass splitting can be found in e.g. [111], giving . Using this result, the value (see e.g. [13]) and the most recent ETMC computation of the sigma term we get, from our formulae (4.87), the estimates in the first column of table 1. The values are expressed in MeV times . Instead, by employing the phenomenological extraction of the sigma term in the second paper in [110], i.e. MeV, we obtain the values in the second column of table 1.
Moreover, we estimated the contributions for mass degenerate flavors to () arising through CP-violating coupling at one-loop order in chiral perturbation theory in section 3.3, equation (3.66).333333A similar estimate of for mass degenerate flavors is given in eq. (3.64). These contributions are smaller than the estimates for the “direct” contributions which we discussed so far, but they affect the results in the first two columns of table 1 by increasing their values by MeV times .
In view of the tension between lattice and phenomenological data for the sigma term, we find it interesting to write down other possible expressions for the CP-odd axion couplings. This is one of the reasons why it is interesting to have found explicit expressions for those couplings, in terms of the model parameters, in Skyrme and WSS models. Moreover, trying to improve the leading order relations we have found in chiral perturbation theory is in practice problematic, due to unknown low energy constants. In this situation, one could view the Skyrme and WSS results as providing model-dependent estimates of the latter.
Note that our computations of the CP-odd meson-nucleon couplings in both the Skyrme and WSS model have shown agreement, to leading order in the large limit, with formulae (4.84) and (4.86) already known from chiral effective field theory. But below we want to avoid the explicit use of the value of the sigma term and the neutron-proton mass difference. Instead, we use the Skyrme and holographic models for direct estimates of the couplings. Although, as mentioned in the Introduction, we cannot expect these models to have an accuracy at percent level (e.g. in WSS it is expected to be around 30% on average in our type of observables), at the moment we cannot claim to have a better precision with other means. A-posteriori, the numerical values obtained in these models appear to be comparable with the Chiral Lagrangian ones, once we include the one-loop corrections in the latter.
Let us begin with the Skyrme model. In the case the coefficient of the Skyrme term is usually fixed to by fitting the masses of baryons and the pion; the fit also sets MeV [100]. With these values of parameters we obtain the estimate for the magnitude of the axion couplings (5.106) in the third column of table 1. Up to the first decimal digit, isospin breaking corrections are not visible.
One can also proceed in an alternative way as follows. Let us first notice that the numerical value of in (H.254) is so small, as compared to (see equation (H.251)), that we can safely discard its contribution.343434We do not expect an order of magnitude correction to these couplings by using slightly different values of the parameters. Therefore, since
[TABLE]
[66], we can rewrite the CP-odd -nucleon coupling in (5.107), with given in (5.100), to leading order in the limit, as
[TABLE]
and
[TABLE]
The advantage of expressions (7.137) with respect to the form presented in the previous section is that they do not explicitly depend on the parameters of the Skyrme model anymore, so they are suitable to provide a straightforward estimate for the couplings.
We conform the values of the physical quantities to the ones in [13], where are calculated353535The error on is irrelevant.
[TABLE]
Remembering that (see eq. (5.101)), using relations (7.138) and (7.137), we get the numerical values in the fourth column of table 1. Working in the “chiral limit”, with the leading order mass term and at first order in increases the error in table 1, from MeV to MeV.
Coming to the Holographic WSS model, fitting the model parameters to the values for and reported in (7.138) and to the -meson mass implies setting [61]
[TABLE]
The axion non-derivative coupling (6.131), including the known quantum corrections (see appendix H), is reported in the fifth column of table 1.
Moreover we can obtain an alternative estimate of the couplings in analogy with what we have done for the Skyrme model above. As we have seen, the WSS model allows for the computation of the four couplings - see formulae (6.2), (6.132), (6.133) for the case. At leading order in , such that the modulus is treated as classical, and to leading order in the limit, these couplings satisfy the relations
[TABLE]
where was defined in (6.129) and in (6.130).
Therefore, from relations (4.76), (4.77), (4) we can express the non-derivative axion-nucleon couplings in (6.131) in terms of the derivative ones
[TABLE]
Taking into account the values reported in (7.138) we obtain the results in the sixth column of table 1.
On top of the errors reported in table 1, we can give an estimate of the errors implied by our approximations. The estimate is going to be rough, since lacking an explicit calculation of the discarded contributions, we do not control their coefficients. Nevertheless we can get an idea of the magnitude of these errors for order one coefficients. Working in the “chiral limit” implied discarding corrections of order . Moreover, the mass term in the Lagrangian is a valid approximation at leading order in . Finally, we have discarded corrections in .363636The other higher order quantities, e.g. in or in , are small and can be discarded. The three effects mentioned above increase the error of from MeV to MeV.373737The three effects alone cause errors of MeV and MeV to and respectively. These errors are dominated by the effects, e.g. for they are of order MeV.
Acknowledgments
We thank Asimina Arvanitaki, Lorenzo Bartolini, Panayiotis Betzios, Yuta Hamada, Maria Paola Lombardo, Francesco Nitti, Maxim Pospelov, Gabriele Veneziano, Giovanni Villadoro and Lukas Witkowski for useful discussions. We also thank the anonymous referee whose suggestions and criticism helped us improve this presentation.
This work was supported in part by the Advanced ERC grant SM-grav, No 669288, by the Netherlands Organisation for Scientific Research (NWO) under the VIDI grant 680-47-518, and the Delta-Institute for Theoretical Physics (-ITP), both funded by the Dutch Ministry of Education, Culture and Science (OCW).
We thank the Galileo Galileo Institute for Theoretical Physics (Florence) and the organizers of the workshop “String Theory from a Worldsheet Perspective” during which part of this work has been completed.
APPENDIX
Appendix A The structure of axion models
In this appendix we review the general structure of axion models.
An axion field may have several meanings in the literature, but for our purposes it has two properties: it has a perturbative shift symmetry and couples linearly to an instanton density (here the QCD density) of the Standard Model (SM).
Such a field was first introduced by Peccei and Quinn to solve the strong CP-problem and render the -angle of the strong interactions a dynamical variable.
There are several realizations of axions in QFT, with two extreme classes: the fundamental axions, [6]-[11], and the emergent or composite axions, [41]. We will produce here a typical simple model of a fundamental axions. A general prescription of theories of composite axions and further references can be found in [41].
A.1 A typical UV model for elementary axions
A simple class of UV complete examples of axions can be constructed as follows. A new sector that realizes two global symmetries is used and has two scalar field transforming under them. An anomaly free combination of the two s is gauged. The scalars obtain a vev so that they break the gauge symmetry but leave the orthogonal global symmetry unbroken. This will play the role of the PQ symmetry when coupled to the SM.
We therefore consider a set of new chiral fermions (we take them to be all left-handed) that are charged under two symmetries, and with charges and two Higgs-like scalars, , with charges () and .
We gauge with a corresponding gauge boson and we assume that the full spectrum is anomaly free.
This implies that
[TABLE]
that guarantees the absence of the gauge anomaly and mixed anomaly in four dimensions.
The bosonic part of the action is383838Strictly speaking the simple model we describe is not UV complete as the couplings are marginally irrelevant in the UV. However, the gauge theory can be embedded in a non-Abelian group, [6]-[11].
[TABLE]
We can obtain symmetry breaking where both and condense with the same vacuum expectation value for simplicity
[TABLE]
We will assume that the remaining scalar degrees of freedom, beyond the phases, have high masses and we will neglect them in the low energy dynamics.
The bosonic part of the action now becomes
[TABLE]
Changing basis to
[TABLE]
[TABLE]
we may rewrite the action as
[TABLE]
where
[TABLE]
The field will become the axion in the low energy theory and as we defined it, it has mass-dimension 1. We may normalize it as
[TABLE]
Consider now the Yukawa couplings of the fermions
[TABLE]
with the selection rules
[TABLE]
We would like to have where is a typical EW scale. For finite and small couplings , it is clear that we can take , and that we can arrange the charges so that all the fermions acquire masses that are also well above the EW scale.
The field is the longitudinal component of the very massive gauge boson, and can be set to zero by a gauge transformation. The only particle from this sector that remains in the IR theory (the SM) is the massless PQ scalar .
The scalar will have direct couplings to the SM fermions (quarks and leptons) if they are charged under the two UV symmetries and if the high energy Higgs couple to them. The remaining global symmetry shifts by constants.
We have also assumed that the gauged symmetry is anomaly free.393939It may however have mixed anomalies with the SM hypercharge. This is generic in string theory realizations, [114]. However the remaining global symmetry associated with may have mixed anomalies with the standard model. We denote the relevant charge traces as
[TABLE]
where is the charge generator of , are the gauge generators of the SM and the trace is on the SM fermions. If such traces are non-zero, then in the effective field theory we have the following axionic couplings
[TABLE]
where is the Yang-Mills field strength and is the electromagnetic one. We have ignored the coupling to the W-instanton density, as its effects are heavily suppressed.
Therefore we can parametrize the effective couplings of to the SM matter as in [13]
[TABLE]
[TABLE]
Here plays the role of the PQ axion for the SM. is a non-derivative interaction coupling the axions to mass terms of fermions and bosons. The ellipsis in (A.157) stands for higher order and higher derivative interactions that have been omitted.
We have defined the axion to be dimensionless, and its effective interaction strength is set by the axion scale that is assumed to be much larger than the SM scales. Therefore all axion interactions are weak. This is the reason why higher non-linear interactions are not so important in the effective action.
does not contain quantum corrections associated with the SM. Due to the protected nature of the axion such corrections are very special. At weak coupling they arise because of YM instantons, that will generate a potential for the axion, [13]. This is the analogue of the anomaly for the PQ symmetry.
The couplings of the axion to leptons and or photons are straight-forward as they can be treated in perturbation theory. However, the couplings to the strong interaction operators are trickier.
With a chiral rotation of quarks, the coupling to the instanton density can be transferred to the mass matrix of the quarks. Therefore, an invariant collection of couplings involve the quark masses and the derivative coupling to . The quark component of generates out of the QCD vacuum various vector mesons. Their content is axion-model dependent. The axion dependence of the quark couplings generates axion couplings to scalar mesons. Their content is also axion-model dependent.
There is however a subset of model independent couplings, if we want the axion to solve the strong CP problem. Namely and then there is a universal coupling of the axion to all the quark states, that amounts to a coupling to the so-called singlet meson. It is this class of couplings that we address in this work.
A.2 Other axion models
Generically, the origin of the axion coupled to the SM may vary. In the fundamental axion case, like the simple model described in the previous subsection, the axion involves a high energy degree of freedom protected by the PQ symmetry.
The effective Lagrangian of the axion coupling it at low energies to the SM degrees of freedom is the same as in (A.157) as it is dictated by (anomalous) symmetry alone. Therefore different fundamental theories yielding axions will have a similar effective action, albeit with different charge assignments and coefficients, [10, 9].
The situation is somewhat different when the axion is composite/emergent. In such cases the axion is a composite generated by the instanton density.404040Due to anomalies an analogue of the will be as good. In general, at finite and these two mix strongly, [47]. At weak mixing it is usually the (mostly) that is lighter in QCD-like theories but in more general theories it could be otherwise. Which of the two is lighter depends of a hidden QFT coupled at high energies to the SM. The general quadratic effective action of the emergent axion was derived in [41]. In that case there is also a mass contribution to the axion effective action originating in the UV theory. In a QCD-like UV theory this is controlled by the topological susceptibility of the hidden sector. The axion in that case has a mass with two contributions that are a priori independent: one from the hidden sector and the standard one from QCD.
Appendix B Current terms and -axion mixing in chiral Lagrangians
In this Appendix we address the mixing between the axion and mesons in a somewhat more general case than that described in section 2.
In general, the derivative of the axion couples to a current involving contributions from leptons, vectorial quark current and the axial quark current as . We write the axial current already appearing in (2) as
[TABLE]
with a Hermitean matrix in flavor space. The vectorial current is conserved and therefore couplings to such current play no role in our analysis: we can neglect them without loss of generality.
It is however simpler to start with the action where the derivative coupling to the current is absent (i.e., the case already considered in Sec. 2). Therefore we first perform a (flavor dependent) axion valued chiral rotation
[TABLE]
which removes the derivative coupling. It will be reinstated using a chiral rotation in the effective theory below.
After this transformation, the Lagrangian obtained from (2) is without the current term
[TABLE]
but with the effect of the current included via and .
The effective field theory Lagrangian corresponding to (B) is the one in (2) of section 2, but with the understanding that and are replaced by and , respectively. It can be generalized to include explicitly the effect of the derivative coupling of the axion to the current as follows. We perform the inverse of (B.159), which in the chiral effective theory means
[TABLE]
After this transformation, the final form of the chiral Lagrangian, corresponding to the QCD Lagrangian of (2) with the current term included, is
[TABLE]
where the last term is suppressed by both the flavor masses and [88, 115]. We show below that its effect on the axion mixing can be absorbed into redefinitions of other parameters up to quadratic terms in . We also observe that the chiral rotation gives rise to kinetic mixing between the axion and the pions. Comparing the derivatives of the QCD Lagrangian and the effective Lagrangian with respect to at we obtain
[TABLE]
We then discuss how the mass eigenstates are computed to leading nontrivial order in . First we make an observation about the kinetic mixing. We can apply a shift
[TABLE]
so that the change of the axion kinetic term exactly cancels the kinetic mixing and the remaining terms in (B) are unchanged up to corrections suppressed by . The outcome is that at leading nontrivial order in the only effect due to the current terms, is the mixing of the physical axion with the pions implied by the above shift. Because the non-derivative couplings involving the axion in (B) are already suppressed by , the non-derivative mixing terms introduced by this shift are , i.e., negligibly small. Therefore the effect of the kinetic mixing will be irrelevant for the analysis of the axion-nucleon couplings in Sec. 3. Higher order terms in the effective action also include a derivative coupling between the axion and which is present even in the absence of the current (B.158) but suppressed by [115]. The same applies to such a coupling as the kinetic mixing in (B): its effect is irrelevant for our studies.
We then discuss the diagonalization of the quadratic terms in (B). We assume that where is independent from the axion and Hermitean. We consider the effects due to a non-Hermitean matrix below. In order to eliminate the non-derivative mixing terms with the axion (up to higher order corrections in ), we first carry out a field redefinition
[TABLE]
Inserting this in the action (B) and requiring that the non-diagonal mass terms involving the axion vanish, fixes
[TABLE]
The nondiagonal kinetic terms are then removed by a subsequent redefinition
[TABLE]
If is treated as a small correction and quadratic terms in this parameter are neglected, we may write the result for the coefficients in a compact way in terms of the topological susceptibility (which now includes a correction):
[TABLE]
with
[TABLE]
Within this approximation, the rules (2.10)–(2.11) therefore generalize to
[TABLE]
Notice that the last (fourth) term in (B.171) is suppressed with respect to the third term by the quark mass. The mass eigenstates are then found by diagonalizing the pion mass matrix as explained in Sec. 2. The axion mass squared is given by
[TABLE]
We then discuss the effect of for the axion-nucleon couplings studied in Sec. 3. It is sufficient to include it in the expressions (3.37) and (3.39) using the approximation in (B.169). For the former expression we find that
[TABLE]
Since the correction due to is subleading in . For the latter expression we find
[TABLE]
In this case, the correction due to is suppressed by the quark masses. Since also contains a factor proportional to the quark masses, this correction is actually of quadratic order in the quark masses and can be safely ignored.
Notice that the last term in (B) also affects the pion mixing and masses even in the absence of the axion. For example, for a flavor independent quark mass , the masses of the pions and the are given by
[TABLE]
Finally, we recall that for , the diagonalization of the pion mass matrix boils down to the “standard” analysis in the literature (see, e.g. [116, 77]). The meson, the meson, the pions and the kaons form a nonet of the vectorial , which is broken to various subgroups both by effects and by finite quark masses. Taking the light quark masses to be the same, , only the and mesons mix. The relevant mass matrix is given as
[TABLE]
where we dropped the terms for simplicity. We adopt a convention where the physical states are given by
[TABLE]
B.1 VEVs induced by CP-odd couplings
CP-odd couplings in the meson Lagrangian also give rise to VEVs of the pions and the which need to be taken into account. First, the quark mass matrix may include CP-odd contributions: we write where both and the CP-violating matrix are Hermitean. We treat all CP-odd couplings as linear perturbations, and also consider the possibility of finite -angle (arising from the VEV of the axion) which is obtained by shifting in the Lagrangian. The terms driving the VEVs are then
[TABLE]
Notice that we chose the normalization of similarly as that of the -angle: . This term could arise from the CP-violation in the CKM matrix, [55, 56, 57] or BSM sources. In fact, one immediately sees that can be absorbed by a shift in the -angle (up to highly suppressed contributions) so we can set it to zero without loss of generality.
We then find for the VEVs of the pions after a straightforward computation
[TABLE]
where is the solution to and satisfies therefore . If and commute, . Notice that the VEV of the is given by
[TABLE]
Moreover, the VEV in the combination appearing in the flavor singlet terms after the inclusion of the -angle
[TABLE]
only depends on the effective angle . In order to analyze the couplings to nucleons, we also need the VEVs in . For consistency with chiral symmetry, in these couplings we also need to include the contributions from . We find that
[TABLE]
We also consider the terms arising from which are linear in the mesons and the axion and are present due to the VEVs. First notice that
[TABLE]
The linear contributions are given by the second term on the second line and the terms arising from the third line after inserting the VEVs. After some cancellations, we obtain
[TABLE]
Therefore, as expected, all contributions due to can be absorbed in the -angle, so it is enough to consider the couplings of the mesons and the axion arising (only) from the VEVs due to the -angle as we do in Sec. 3.
B.2 Three-meson couplings
Having extracted the VEVs of the mesons due to a finite -angle, we can also extract the (CP-violating) couplings of (non-derivative) vertices with three mesons. Up to corrections suppressed at large , these arise from the mass term
[TABLE]
When computing these couplings it is important to note that the mixing (2.10) and in (2.11) is (for generic quark mass matrices) only consistent with chiral symmetry if cubic and higher interactions are included. That is, we write the field redefinitions of the pions and the in general as
[TABLE]
where was defined in (2.14). This definition differs from (2.10) and (2.11) if , and do not commute. Notice that the transformation (B.187) corresponds to an axion-valued axial transformation of the pion fields, which ensures that the field redefinition does not generate undesired higher order interactions that would be inconsistent with chiral symmetry.
After inserting the mixing with the axion from (B.187) and introducing the -angle through (3.36) we obtain the three pion couplings [70]:
[TABLE]
where . The coupling of the to the pions is
[TABLE]
In the limit of large and in the chiral limit with flavor independent quark masses the -pion-pion coupling simplifies to
[TABLE]
where we inserted . Notice that the expressions of these couplings are identical in the WSS model and in chiral perturbation theory at large , so (B.190) also holds in the WSS model. The coupling between the axion and the pions is found to be
[TABLE]
Appendix C Details on the nucleon couplings in chiral perturbation theory
C.1 Non-derivative couplings from external CP-violation
In this Appendix we discuss the non-derivative couplings of nucleons to the mesons and the axion which arise due to such sources of CP-violation that are not included in the -angle, such as the CKM matrix in the quark or lepton sector. For example, the mass terms (3.29) and (3.30) can be generalized to read
[TABLE]
The coefficients with a bar are CP violating and in the absence of the -angle should arise from non-QCD CP-violating sources. We only consider here CP-odd couplings which are singlets under the chiral transformations.414141In Appendix B.1 we discuss CP-odd contributions arising through the quark mass matrix.
Similarly, an extra term should be included in the Lagrangian (3.32) containing flavor singlet couplings:
[TABLE]
For the CP-violating term we assumed a scaling which is analogous to the CP conserving terms, as above.
We then consider the axion couplings induced by the axion terms in the QCD Lagrangian (2). The important terms are the first term in (C.195) and the second term in (C.1).We use notation defined in Sec. 2 and include the axion through the replacements (3.34) and (3.35).
After this, the terms of interest become
[TABLE]
where . Inserting here the definition of , and using the relations (2.10)–(2), we may extract the non-derivative couplings (i.e., the terms involving ) of the fields and to the nucleons.
The terms linear in and are424242Notice that when the quark masses break the vectorial , the non-singlet terms in (3.30) also contribute to the coupling of (but not to the coupling of the axion). Therefore, for generic quark masses the result for here is only an estimate. When all quark masses are equal, it is the precise result for the physical field . then
[TABLE]
Notice that since , both axion terms are of the same order in . The term in (C.199) is suppressed by a factor of with respect to the term in (C.198) so it is negligible in the ’t Hooft limit but should be included in the Veneziano limit.
C.2 CP-odd nucleon-meson couplings
Here we extend the discussion of the nucleon couplings to mesons in Sec. 3 to include in particular the contributions from the single trace terms in (C.192). As it turns out, for this it is convenient to also redefine the nucleon fields. First we define the transition matrix through (see (3.21))
[TABLE]
with the solution
[TABLE]
Then the redefined baryon fields are obtained by acting on with as in (3.22).
Only the couplings involving in (C.192) are affected by the baryon redefinition at the order we are interested in. The combination coupling to the redefined baryons is
[TABLE]
where we also noted that up to subleading corrections in which are irrelevant here. After introducing the -angle through (3.36), we obtain the CP-odd couplings
[TABLE]
C.3 Group theoretic structure of the nucleon couplings to the mesons
We discuss here the structure of the single trace terms including sum over contractions, which appear in Eqs. (3.26), (3.29), and (3.30). We start by considering the structure at as an example. The number of terms in the sums can be understood in the following way, which can be generalized to higher . First, we classify the pions and the according to the underlying quark representations434343For conjugation acts trivially on the representations but we keep the conjugates since this will not be the case for the generalizations to higher . . All possible flavor structures of the and couplings are then obtained by adding the nucleons, and consequently by identifying the singlets in the product state. We write
[TABLE]
When expanding the last expression, only products of a representation with its conjugate contain singlets (and exactly one each),
[TABLE]
These two singlets map to the two terms in (3.45) and in (3.46).
We proceed by commenting on the couplings at . We also first fix for simplicity. Then, in addition to the nonet of light mesons, the ground state of spin- baryons transforms under the adjoint representation of . The usual way [82] to write the couplings corresponding to (3.24) and to (3.26) is
[TABLE]
where and with the baryon octet included in the traceless matrix transforming as . We renamed the couplings and as and following standard conventions.
In order to analyze the number of terms, we may again pack the mesons into and consider the product of the nucleon and quark representations: . When multiplied with the similar antiquark representations each of the representations , , and again gives rise to a singlet. Therefore we obtain three singlets which map to the three terms of (C.207) and (C.208). The same argument actually works for the ground state spin- baryons for any and any odd . In this case the baryons belong to a representation defined through a Young diagram having exactly two rows, one with boxes and the other with boxes [85, 86]. For excited states of baryons in generic representations one obtains independent terms.
Appendix D On-shell nucleon vertices in the limit of large mass
We first consider the bilinears of Dirac spinors in the limit of large nucleon mass . For leading contributions in this limit we expect that the exchanged three-momenta are and the variations in the nucleon energies are . In general the Dirac propagator can be decomposed as
[TABLE]
The second term corresponds to a nucleon propagating backward in time and is suppressed in the limit of large (assuming that we are discussing a nucleon state rather than an anti-nucleon).
This suggests that up to corrections suppressed by the nucleon-meson interactions are determined by the (on-shell) amplitudes where , , , and . We take the incoming nucleon to be in the rest frame with momentum and write the outgoing nucleon momentum as where we dropped terms in the energies. We use a representation where the spinors are given explicitly by
[TABLE]
where a possible basis for the spin wave functions is given by the spinors and , but we leave the basis unspecified. By direct computation we find that
[TABLE]
where . We notice the following identities
[TABLE]
which hold for any value of . Notice also that because the nucleon creation and annihilation amplitudes immediately follow from the above results.
These identities suggest that the operators , and , where is the nucleon spin operator, are identical at least to leading order in the limit of large . The spin is fixed at least to leading order as the amplitudes do not flip spin in a basis aligned with : the transverse components of the amplitude (D.214) contain also potentially spin-flipping elements but these vanish when contracted with the only available momentum .
We also notice that at the couplings involving above are . This seems to be due to parity: when , the only available parity-odd scalar is .444444There are two independent momenta, but invariance of the amplitudes under boosts requires that at linear order only momentum differences (i.e. ) appear. Notice also that is only affected by corrections under such boosts. Notice however that e.g. the nucleon pair creation amplitude is not proportional to .
Appendix E CP-odd couplings in the Skyrme model with
In this section we will provide a derivation of the CP-odd couplings of the axion and of the mesons to the nucleons, in the Skyrme model with non-degenerate flavors. We use the notation in [101], where the Lagrangian, once the substitution is performed, is already in a diagonal basis for the axion. Hence, for the rest of the discussion we drop the subscript “p” of the fields for notational convenience. Moreover, in this section we work in the “chiral limit” ( is defined in (5.91) and is the kaon mass).
Actually, the Lagrangian in [101] is written at linear order in . Generalizing that expression to the non-linear case, the mass term reads
[TABLE]
where
[TABLE]
In this expression and (called in [101]) is a certain combination of the masses and the topological susceptibility which will be irrelevant in the “chiral limit”. Finally , proportional to , will be given in a few lines. The leading effect of isospin breaking will be introduced later on.
The Skyrmion is just the embedding of the one into , so the configuration we need is454545In this expression we have dropped a factor Exp, where is connected to the vacuum value of the singlet, because it gives a subleading correction in to our result.
[TABLE]
where with we denote the vector composed by the first three Gell-Mann matrices. The function is the same as for the model reported in section 5.1. One can calculate the quark contribution to the nucleon mass by setting , which corresponds to setting to zero the axion field, and putting (E.219) on-shell on the Skyrmion solution. The result is
[TABLE]
where is given in (5.101). Note that in this formula we have not taken the chiral limit. In the degenerate mass case, eq. (E.222) exactly reduces to the expression (5.100).
In order to extract the axion couplings we expand (E.219), on-shell on the Skyrmion, up to second order in . In the “chiral limit”, the result turns out to be very simple
[TABLE]
In the same limit, we have [101]
[TABLE]
where
[TABLE]
At leading order the only effect of symmetry breaking is encoded in the parameter . Therefore, from (E.223) we obtain the couplings
[TABLE]
Apart from the factor accounting for isospin symmetry breaking, the result is the same, in form, of the case. In the present case the relation with the mass correction (E.222) is
[TABLE]
From the results in [101] one can as well extract the CP-breaking and couplings to the nucleons. In fact, the relevant Lagrangian term in the “chiral limit” reads464646Note the different convention on the sign of with respect to [101].
[TABLE]
At the order we are working, it is sufficient to consider the mixing in (B.178), where the mixing angle is (see e.g. [101]). Using these expressions in (E.228) we obtain
[TABLE]
Therefore, apart from the last result, which takes into account the mixing, the case in the “chiral limit” gives the same formal outcome of the model for the axion couplings. This suggests that, not unexpectedly, the corrections due to the inclusion of the strange quark are small.
Appendix F Instanton moduli quantization
We remind the reader that the quantum description of the instanton solution in the WSS model, is obtained in analogy with what is done for the Skyrmion [66]. Since , a non relativistic limit can be taken in which the system reduces to a quantum mechanical model for the time-dependent instanton (pseudo) moduli. The slowly rotating instanton solution around is given by
[TABLE]
where, setting we have
[TABLE]
Here,
[TABLE]
with .
Time-dependent rotations act non-trivially on the non-Abelian components of the gauge field. Consistency with the equations of motion, in turn, induces a non trivial time dependence on the Abelian components too, except that on [64].
In the region, the time dependent non-Abelian field components read
[TABLE]
The Abelian field components are given in turn by
[TABLE]
and
[TABLE]
The quantum Hamiltonian for the baryon states is obtained by substituting the above time-dependent solution into the WSS action.
Notice that the structure of the instanton solution above recalls that of the Skyrme model with vector mesons examined in [89]. There, the time-dependent soliton solution includes non-trivial expressions for the time component of the iso-triplet (), the space components of the iso-singlet vector () and the -type meson. The contribution of the latter, which turns out to be proportional to the angular velocity of the spinning soliton, turns out to be crucial for many instances. In [107] it has been used to compute the mass splitting between neutron and proton in the WSS model with non-degenerate quark masses. It is crucial for providing non zero values for both the CP even coupling and the CP-odd one. In turn, these couplings are related, using the mixing terms, to further axion-nucleon couplings.
In the present setup, the angular velocity is given by
[TABLE]
where is the spin operator
[TABLE]
with and .
In turn, the isospin operator
[TABLE]
can also be rewritten as
[TABLE]
and it is such that only states with appear in the spectrum.
Instantons are identified with fermions via an anti-periodicity condition to be implemented on the baryon wave functions. The related states have with positive odd integers.
The quantum state for a baryon, , depends on the (iso)spin and on further quantum numbers and which describe excited states or resonances; the case , corresponds to the unexcited proton (with isospin component ) and neutron () and the corresponding wave functions are
[TABLE]
with
[TABLE]
Generalizations to larger values of can be found in [63].
Appendix G CP-odd couplings in the WSS model with
For the case, we can proceed as in the Skyrme model.474747The quark-mass correction to baryon masses has been computed in [117]. Starting from the general Lagrangian (4) with , which as stated before is found also in the holographic case, one can diagonalize for the physical modes as in (4). In this case the “chiral limit” gives
[TABLE]
for the susceptibility and
[TABLE]
for the axion content of the . Here and in the following, we have made use of the GMOR relations [117]
[TABLE]
and
[TABLE]
We now put the mass part of the Lagrangian (4) on-shell on the nucleon instanton solution and use eq. (G.244) to obtain
[TABLE]
At the order we are working, the instanton solution is the same as the (two-degenerate flavors) one
[TABLE]
where includes the first three Gell-Mann matrices. By expanding (G.247) to quadratic order in and using the first relation in (G.245) we obtain
[TABLE]
where the dots stand for a term in which is vanishing once we integrate the result in space. Up to the latter term, the result (G.249) is exactly the same one of the computation, so the couplings are still the ones in section 6.3. This is consistent with what we found in the Skyrme model. As it happened there, the “chiral limit” renders subleading the corrections due to the third quark flavor.
Appendix H Alternative numerical estimates of the couplings
In this Appendix we collect some alternative numerical estimates of the couplings calculated in the main body of the paper in the Skyrme and WSS models, using a different parameterization with respect to the one employed in section 7. Here we plug in the formulae for the couplings the standard values of the parameters usually employed in the two theories we have considered.
Let us begin from the Skyrme model. For the case the coefficient of the Skyrme term is usually fixed to by fitting the masses of baryons and the pion; the fit also sets MeV [100]. Therefore, with these values of parameters the estimate for the magnitude of the axion couplings (5.106) in the mass degenerate case of the Skyrme model is
[TABLE]
For the coupling of the in (5.107), we obtain that
[TABLE]
For what concerns , the standard choice of parameters in the extended model of [89, 98], to which we refer for the notations, is
[TABLE]
from which [98]
[TABLE]
and finally in (5.107)
[TABLE]
It is worth noticing that a first estimate of the same coefficient, in chiral perturbation theory with , gave [70].
The parameters extracted from hadron mass fittings in the case are instead [101]
[TABLE]
so, remembering that and using [13], the numerical estimate of the axion coupling (E.226) in the Skyrme model is
[TABLE]
The sizable numerical difference between (H.250) and (H.256) is due to the difference in the two cases of the values of the parameters usually employed to fit nuclear data. This is a known drawback of the Skyrme model. For and in (E.230), (E.229) we obtain
[TABLE]
Coming to the holographic WSS model, fitting the model parameters to the experimental values
[TABLE]
implies setting [61]
[TABLE]
In the classical limit where and (6.123), we obtain for the axion derivative couplings (6.2)
[TABLE]
A class of important corrections to the results for the couplings in the WSS model at large comes from the quantization of the instanton size. This implies that the nucleon wave function contains a -dependent factor [63]
[TABLE]
Therefore, in the formulae we replace with
[TABLE]
Setting and fixing the parameters as above one finds for the quantum corrected couplings
[TABLE]
Concerning the axion non-derivative coupling (6.131) in the mass-degenerate case, by keeping to its classical value we obtain the estimate
[TABLE]
Including the quantum corrections as above we get instead
[TABLE]
Analogously, we get for the coupling (6.132)
[TABLE]
in the classical case and
[TABLE]
with the quantum corrections. We finally find for the pion coupling (6.133)
[TABLE]
in the classical limit and
[TABLE]
taking into account the quantization of the instanton moduli.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] R. D. Peccei and H. R. Quinn, “CP Conservation in the Presence of Instantons,” Phys. Rev. Lett. 38 (1977) 1440 . · doi ↗
- 2[2] F. Wilczek, “Problem of Strong P 𝑃 P and T 𝑇 T Invariance in the Presence of Instantons,” Phys. Rev. Lett. 40 (1978) 279 .
- 3[3] S. Weinberg, “A New Light Boson?,” Phys. Rev. Lett. 40 (1978) 223 . · doi ↗
- 4[4] E. Witten, “Current Algebra Theorems for the U(1) Goldstone Boson,” Nucl. Phys. B 156 (1979) 269 . · doi ↗
- 5[5] G. Veneziano, “U(1) Without Instantons,” Nucl. Phys. B 159 (1979) 213 . · doi ↗
- 6[6] J. E. Kim, “Weak Interaction Singlet and Strong CP Invariance,” Phys. Rev. Lett. 43 (1979) 103 . · doi ↗
- 7[7] M. A. Shifman, A. I. Vainshtein and V. I. Zakharov, “Can Confinement Ensure Natural CP Invariance of Strong Interactions?,” Nucl. Phys. B 166 (1980) 493 . · doi ↗
- 8[8] M. Dine, W. Fischler and M. Srednicki, “A Simple Solution to the Strong CP Problem with a Harmless Axion,” Phys. Lett. 104B (1981) 199 . · doi ↗
