Axions are blind to anomalies
J\'er\'emie Quevillon, Christopher Smith

TL;DR
This paper demonstrates that axion couplings to gauge bosons originate from non-anomalous fermion loops, challenging the traditional anomaly-based understanding, especially impacting chiral gauge theories and axion-like particle searches.
Contribution
It proves that axion couplings are independent of anomaly structures and provides explicit calculations showing deviations from expected anomaly patterns in chiral gauge theories.
Findings
Axion couplings arise from non-anomalous fermion loops.
Couplings to electroweak gauge bosons do not follow anomaly patterns.
Implications for axion-like particle searches are significant.
Abstract
The axion couplings to SM gauge bosons are derived in various models, and shown to always arise entirely from non-anomalous fermion loops. They are thus independent of the anomaly structure of the model. This fact is without consequence for vector gauge interactions like QCD and QED, but has a major impact for chiral gauge theories. For example, in the DFSZ axion model, the couplings of axions to electroweak gauge bosons do not follow the pattern expected from chiral anomalies, as we prove by an explicit calculation. The reason for this mismatch is traced back to triangle Feynman diagrams sensitive to the anomalous breaking of the vector Ward identity, and is ultimately related to the conservation of baryon and lepton numbers. Though our analyses are entirely done for true axion models, this observation could have important consequences for axion-like particle searches.
| Linear | Polar | |||
|---|---|---|---|---|
| Anomalous | ||||
| interactions | ||||
| 0 | ||||
| 0 | ||||
| 0 | ||||
| 0 | 0 | |||
| 0 | 0 | |||
| 0 | 0 | 0 | ||
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
March 2019
Axions are blind to anomalies
Jérémie Quevillon1 and Christopher Smith2
Laboratoire de Physique Subatomique et de Cosmologie, Université Grenoble-Alpes, CNRS/IN2P3, Grenoble INP, 38000 Grenoble, France.
Abstract
The axion couplings to SM gauge bosons are derived in various models, and shown to always arise entirely from non-anomalous fermion loops. They are thus independent of the anomaly structure of the model. This fact is without consequence for vector gauge interactions like QCD and QED, but has a major impact for chiral gauge theories. For example, in the DFSZ axion model, the couplings of axions to electroweak gauge bosons do not follow the pattern expected from chiral anomalies, as we prove by an explicit calculation. The reason for this mismatch is traced back to triangle Feynman diagrams sensitive to the anomalous breaking of the vector Ward identity, and is ultimately related to the conservation of baryon and lepton numbers. Though our analyses are entirely done for true axion models, this observation could have important consequences for axion-like particle searches.
††footnotetext: [email protected]††footnotetext: [email protected]
1 Introduction
Axions have been around for more than 40 years, and even if the simplest models have been ruled out, they still remain probably the best solution for the strong problem of the Standard Model. They come under many guises, but the basic principle is always the same: design a spontaneously broken global symmetry and assign chiral charges to some colored fermions [1]. This ensures the associated Goldstone boson , the axion [2, 3], has a direct coupling to gluons, and possibly also to photons, of the form
[TABLE]
where is the vacuum expectation value (VEV) of the scalar breaking the symmetry, and some constants related to the fermion and gauge charges. Naively, the -violating QCD coupling can then be rotated away no matter the value of since the Goldstone boson is shifted as under a transformation of parameter . In reality, what happens is that non-perturbative QCD effects dressing create an effective potential for the axion field [1] (see also e.g. Ref. [5]), whose minimum is attained precisely when . In the process, the axion acquires a small QCD-induced mass, typically well below the eV scale [6, 7]. For a recent review, see for example Ref. [8].
Though this picture is correct, the nature of the couplings in Eq. (1) is often wrongly ascribed to the anomaly of the fermionic current. This idea comes from Noether’s theorem: the Goldstone boson is coupled to its symmetry current, , and the current is anomalous, , hence it seems Eq. (1) is immediately recovered. It has been known for a long time that this derivation of the axion couplings is not correct but sufficient for practical purpose [9, 10]. It permits to identify the couplings of axions to gluons and photons as induced by heavy fermions. So, by a common abuse of language, the and processes are said to be induced by the anomaly in the current.
However, as we will explicitly demonstrate in this paper, using this same procedure to derive the couplings of the axion to electroweak gauge bosons fails whenever chiral fermions are charged under . In that case, the true gauge couplings do not follow the pattern expected from the anomalies in the current. More precisely, for the gauge group, one expects from chiral anomalies [9] (see also Ref. [11])
[TABLE]
where are derived from the charges and gauge quantum numbers of the SM fermions. While for QCD and QED, which are vector theories, the coefficients and indeed tune the and decays, we will prove that the , , and decay amplitudes are independent of and ! The true processes driving these decays are not anomalous and their relative strengths do not follow the pattern expected from Eq. (2). The reason for this mismatch between the anomalous couplings and the true axion couplings will be traced to the existence of additional anomalies, occurring only for chiral theories. These additional anomalies cancel exactly the contributions in Eq. (2), leaving as remainder only non-local, non-anomalous processes.
The paper is organized as follow. To set the stage, we start in the next section by presenting in details an axion toy model [1]. Though simple, this model illustrates many important physical features of more realistic axion models. In particular, it will be clear that the axion couplings to gauge bosons are not anomalous, but that this has no quantitative consequence for photons or gluons. Then, before turning our attention to full-fledged axion models, we derive a number of important results for anomalies in Section 3. Specifically, to be able to treat chiral gauge theories, the Ward identities applied to or triangle graphs have to be properly calculated, where and denote the vector and axial currents. Indeed, it is necessary to go beyond simple regularization procedures to be able to locate the anomaly of the triangle in one of the vector Ward identities or to break explicitly the Bose symmetry of the triangle. Equipped with these results, we turn to the Peccei-Quinn axion model in Section 4, derive the correct axion couplings to gauge bosons, and identify precisely where the naive procedure leading to Eq. (2) fails. This analysis is then trivially extended to invisible axion models. Finally, our results are summarized in Section 5, along with their implications for axion-like particle searches.
2 An axionic toy model
Let us consider the following simple extension of QED. Starting with a massless Dirac fermion , charged under a local symmetry , we add a scalar field charged under a global symmetry. The complete Lagrangian is
[TABLE]
where and is the electric charge of , and . When the Yukawa coupling is non-zero, , the fermion fields have to be charged under also. If we normalize the scalar charge to , then under a transformation of parameter ,
[TABLE]
At the classical level, this clearly leaves the Lagrangian invariant whatever the free parameter . Actually, this parameter corresponds to the conservation of the global fermion number, which is aligned with the electric charge in this simple one-fermion model.
When the global symmetry is broken spontaneously, a Goldstone boson is left behind and will be identified with the axion [2, 3]. So, we choose the potential as with . Let us analyze the resulting theory using two different representations for the scalar field.
2.1 Linear representation
Shifted to the true vacuum, the usual linear representation for the scalar field is
[TABLE]
with . The Goldstone boson remains massless and is the axion, while the field acquires a mass . Assuming is real and denoting the fermion mass as , the whole Lagrangian when is expended around the true vacuum becomes
[TABLE]
Since is not charged under , does not directly couple to photons. However, this coupling arises through one-loop triangle graphs, see Fig. 1. Adopting the Pauli-Villars procedure to regulate the loop amplitude in intermediate steps, we compute
[TABLE]
The three-point scalar function obeys111We use the notations and conventions of Ref. [4]
[TABLE]
The regulator can thus safely be sent to infinity, and the decay amplitude in the linear representation for the scalar field is
[TABLE]
In the limit, this amplitude corresponds to the local interaction
[TABLE]
It is interesting to remark that this term is not decoupling 222This non-decoupling is to be understood in the formal sense, since strictly speaking the limit is not compatible with our perturbative treatment of the Yukawa couplings., even though as , thanks to the factor coming from the coupling to the fermion. Still, at the level of the linearly-realized theory, anomalies do not show up and the loop amplitude can be safely computed using a naive regularization procedure, even if we know from the fermion charges in Eq. (4) that is anomalous.
2.2 Polar representation
Instead of the linear representation for the scalar field, there is another representation more suited to the circular geometry of the vacuum333Note that the polar representation still allows the pseudoscalar of having a small mass which could be treated as a perturbation as this is usually done in axion phenomenology.. The polar or exponential representation is
[TABLE]
Then, under the action of a rotation of parameter , the component fields have simple (though non-linear) transformation properties: the field is constant, , while the axion is shifted by a constant, . Importantly, and even if we are using the same notations and , these fields are not the same ones as in Eq. (5). In particular, the couples differently to fermions. Clearly, expanding the exponential, couplings of arbitrarily high dimensions are present, and the theory is no longer manifestly renormalizable.
As a next step, after plugging the polar representation of in the Lagrangian, we perform a reparametrization of the fermion fields to make them invariant under :
[TABLE]
This has three effects. First, the Goldstone field disappears from the Yukawa coupling. Second, the fermion kinetic terms induce derivative interactions,
[TABLE]
where one can recognize the fermionic current associated to the symmetry. Third, the fermionic path integral measure is not invariant, and the Jacobian of the transformation has to be included as a new local interaction:
[TABLE]
The whole non-linear Lagrangian is thus
[TABLE]
Under this form, the Lagrangian is manifestly -symmetric but for the local Jacobian term since , , and are invariant but is shifted by a constant. This permits to get rid of any pre-existing term in the Lagrangian, by shifting . Of course, for an abelian theory, is a total derivative that can be safely discarded, but this toy model can easily be adapted to the non-abelian case, and closely resembles the KSVZ invisible axion model [12] when applied to QCD.
Let us now compute within the non-linearly realized theory. Together with the local amplitude from , we must include the triangle graphs arising from , which have both a vector and axial component. The vector current does not contribute since the photon couplings are also vectorial, and the corresponding triangle graph vanishes thanks to Furry’s theorem. The axial current , on the other hand, gives a non-zero contribution. We recognize the well-known triangle graph:
[TABLE]
Adopting a Pauli-Villars regularization, we add to this a second term with , and proceed with the calculation in dimensions. With the derivative from amounting to taking the divergence, we find
[TABLE]
The regulator term does not decouple in the limit but gives a finite local contribution. Interestingly, this contribution precisely cancel with the local term from , so altogether, we recover precisely the result obtained in the linear case, see Eq. (9):
[TABLE]
This exercise illustrates a number of important features:
- •
In the polar representation, the appearance of the anomalous local term is spurious. When computing specific amplitudes, it only serves to cancel out the anomalous term arising from the derivative interaction . At the end, this is nothing but an application of the well-known axial current Ward identity:
[TABLE]
where , . The right-hand side corresponds to the amplitude in the linear representation, and the left-hand side to the two contributions arising in the polar representation. It is thus clear that it would be wrong to understand the coupling as induced by the anomaly. The anomalous interaction of Eq. (14) only arises because of the reparametrization in Eq. (12), and necessarily comes together with the appropriate derivative interactions.
- •
In the limit, the amplitude vanishes exactly,
[TABLE]
In the linear representation, this trivially follows from the vanishing of the coupling of to fermions, see Eq. (6). In the polar representation, it requires an exact cancellation of the local anomalous contribution with the non-local triangle amplitudes. Again, this can be understood in terms of the axial current Ward identity. In this limit, the spurious nature of the contact interaction is manifest.
- •
The reason why is often misinterpreted as induced by the anomaly can be understood looking at the limit. Indeed, the contribution of the derivative term vanishes,
[TABLE]
as can be trivially seen in Eq. (17). Since then all that remains in the non-linear theory is the local term from , it necessarily corresponds to the contribution surviving in the limit in the linear theory, Eq. (10). Yet, this is only a parametric correspondence, and certainly not a physical identification. The anomaly still cancels in , and the only surviving contribution is actually the first term of Eq. (17). Ultimately, the local anomalous term in Eq. (14) is no more than a convenient book-keeping device tracking all the fields that have been integrated out [9, 10].
- •
Furry’s theorem together with the vector current Ward identity ensures the absence of any dependence on the free parameter in the non-linear theory, in agreement with the absence of this parameter in the corresponding linear representation.
- •
As an aside, it should be clear that though the anomalous coupling in Eq. (14) is not affected by radiative corrections [13], neither is the anomalous part of the triangle graph in Eq. (17). So, the two exactly cancel at all orders, but the amplitude does get corrected by higher order effects since it is actually not induced by the anomaly. Further, note that the theory in Eq. (3) is obviously renormalizable, so radiative corrections can be calculated perturbatively using standard techniques. In this respect, if the photons are also coupled to some other fermions , the two-loop process is even UV finite in the linear representation [14]. This fact would clearly be difficult to guess using the polar representation, where the local vertex from leads to a UV divergent diagram.
- •
As said earlier, the present toy model can be generalized to more complicated, and more realistic KSVZ-like axion models [12]. Consider for instance the SM, to which a gauge-singlet scalar and a set of vector-like fermions are added:
[TABLE]
with the same scalar potential as in Eq. (3). A priori, the covariant derivative can include all three SM interactions,
[TABLE]
where and are the and generators in the representation carried by , and its hypercharge. Then, if and , with the electroweak vacuum expectation value, the effect of the heavy fermion is to induce (see Eq. (10))
[TABLE]
with the quadratic invariants , , and , and the corresponding dimensions of the and representations. Yet, several points must be clear: (1) These couplings are not anomalous, but result from the appropriately generalized Eq. (9) in the limit. (2) The integrated fermion must have vector couplings to gauge fields, otherwise the free parameter corresponding to fermion number does not cancel in Eq. (14). We will see later on how to deal with chiral theories. (3) Even if vector-like, no mass term is allowed for because it would explicitly break the symmetry. (4) The couplings in are not protected from radiative corrections since they are not anomalous. (5) Finally, the nature of the divergence arising when using this effective Lagrangian to compute e.g. the couplings of to SM fermions is clear. In the UV complete theory, Eq. (23), the , , and vertices are never local. Instead, the pseudoscalar triangle of Eq. (9) acts as a form factor and is sufficient to regulate the UV behavior of the vector boson loops, making them finite.
3 On the consistent use of anomalies
In the course of the previous section, the anomaly appeared several times, and there is a number of peculiarities that are worth detailing. First, consider the calculation of the local term , see Eq. (14). The simplest way to compute the Jacobian of the transformation is to compute a triangle graph, this time with three left-(or right-)handed currents (the momentum flow is defined as in Fig. 1):
[TABLE]
with . Following Ref. [15], the divergences of this amplitude are calculated by first keeping track of the ambiguities in the loop momentum routing, and then from the surviving surface terms, and one finds:
[TABLE]
Yet, it is worth stressing that this choice is not the one usually adopted, as it breaks the Bose symmetry of the amplitude under the exchange of the three currents. The consistent anomaly is that with , which restores the Bose symmetry as
[TABLE]
Importantly, this choice is not always explicit. For example, using the Pauli-Villars regularization, one immediately arrives at the Bose symmetric result (see Ref. [16]).
In terms of vector and axial currents, the and triangle amplitudes cancel out because of Furry’s theorem. This leaves the and diagrams, for which we obtain using the same method as in Ref. [15]:
[TABLE]
These results are valid in the massless limit for the fermions. As for the consistent anomaly, the usual conventions are for the triangle to keep the vector currents conserved, and for the triangle to enforce the Bose symmetry. The former case is the basis for Eq. (17). These conventions are automatically enforced when using simple regularization procedures like Pauli-Villars or dimensional regularization. However, for the following, the freedom to assign the anomaly in one of the vector currents or to explicitly break the Bose symmetry will be essential, so these regularization procedures are inadequate.
At the level of the and triangles, there is no obstruction to consider massive fermions. These contributions can be separated from the anomalous pieces, and do not depend on the chosen momentum routing. Actually, a simple calculation leads to
[TABLE]
and
[TABLE]
with the pseudoscalar triangle amplitudes (see Eq. (7))
[TABLE]
where .
In some sense, the usual conventions are automatic for the mass terms: the vector currents are always conserved and Bose symmetry must be manifest for the triangle. Indeed, once the anomalous terms are separated, the rest obeys the naive Ward identities and , with the pseudoscalar current. This means that the mass-dependent term must be absent from and , and it must be equal for all three divergences of . This picture is trivially confirmed using dispersion relations [17], see Fig. 2: cutting out a vector current necessarily gives zero since the intermediate fermions are on their mass shell and , while cutting out any one of the axial currents, the identity must hold.
Finally, in the limit, using and , the various divergences are
[TABLE]
and
[TABLE]
Crucially for the following, the vanishing of all the divergences in Eq. (33) in the limit is only true when the anomaly is carried by the axial current in the triangle, which implies the parametric choice . Similarly, the vanishing of all the divergences in Eq. (34) in the limit is only true when the anomaly is carried equally by the three axial currents in the triangle, which implies the choice . But evidently, there is no reason for this to hold in general and we will see in the next section how this plays a role in the derivation of the axion couplings to electroweak gauge bosons.
4 The Peccei-Quinn axion and its couplings
Historically, the first axion model was proposed by Peccei and Quinn [1]. It is based on a Two Higgs Doublet Model (THDM), but with an enlarged symmetry. Denoting the Higgs doublets as
[TABLE]
the scalar potential is assumed to allow for the independent rephasing
[TABLE]
In the standard notation (see Ref. [18]), this forces the parameters , to vanish. One combination of these s can be identified with the hypercharge, since both doublets have the same gauge quantum numbers, while another combination is denoted . The vacuum expectation values
[TABLE]
break spontaneously, where and . The symmetry is local and its Goldstone boson is eaten up by the boson, but the is only global and the massless left over state is the axion. Indeed, assuming Yukawa couplings of Type II,
[TABLE]
the symmetry is necessarily chiral and anomalous. Note that at this level, there is a lot of freedom in how to assign charges for the fermions, and we will see later on how to (partially) fix them. Remark also that we did not include a phase difference between and in Eq. (37) because it can be removed by a transformation. Actually, the symmetry being anomalous, this rather moves the phase difference between and into the strong and electroweak terms. This is precisely the interplay that permits to solve the strong puzzle once the symmetry is spontaneously broken, since it entangles the phase difference with the axion field.
To further explore this simple model, we follow the same strategy as for the toy model of Section 2, and perform the analysis using either a linear or a polar representation for the scalar fields .
4.1 Linear representation
The most convenient basis for the scalar states is obtained by performing a rotation of angle ,
[TABLE]
so that the VEV is carried by one doublet only
[TABLE]
where only the SM Goldstone bosons have been exponentiated. The pseudoscalar is the massless Goldstone boson corresponding to the breaking, and it is parametrized linearly. The scalar states are related to the mass eigenstates through a rotation of angle which depends on the scalar potential parameters, and is the charged scalar boson. We note in passing that the charge assignments corresponding to thus require orthogonality of the Goldstone states and , and cannot be guessed a priori from the charges of the doublets in Eq. (36). Actually, since is a -dependent combination of and , the charges of the doublets are also function of . We will come back to this point later, and for now, we simply proceed by plugging Eq. (40) in the THDM Lagrangian.
Because has no vacuum expectation value, the boson does not couple to pairs of gauge bosons at tree level. On the other hand, it does couple to fermions. From Eq. (38), we derive
[TABLE]
To reach this form, the mass terms are identified as and and the fermions are rotated to their mass basis. At the one-loop level, the then couples to pairs of gauge bosons through the and triangle graphs. Denoting the fermion currents coupled to gauge bosons as with, using the short-hand :
[TABLE]
where generators are normalized such that , it is a simple exercise to find (see Eq. (32)):
[TABLE]
where and are the polarizations of and , carrying respectively the momentum and . To account for the absence of crossed diagrams for , we also define for , for . Also, the trace over color is understood for quarks, bringing a factor for , and for . Finally, the loop functions in and are the same as in Eq. (32) for , and generalized as with the partner of for . In that case, the CKM entries sum up to one when the three generations are included. The expression Eq. (43) for the decay amplitudes agrees with the standard result of Ref. [19].
A few limiting cases are interesting. First, remark that
[TABLE]
since the coupling to fermions is proportional to their masses. Second, in the opposite limit of infinite fermion masses, the loop functions behave as and , so
[TABLE]
The only special case is that of the lepton contribution to , since neutrinos remain massless. In that case, and in the limit, and cancels out. Plugging all these limits in the amplitudes, they take the simple form
[TABLE]
for some mass-independent coefficients . Working out their values, to these amplitudes correspond the effective interactions
[TABLE]
with the coefficients (to simplify the comparison, we adopt the standard convention ):
[TABLE]
Often, one interprets the non-decoupling of these amplitudes when as an indirect manifestation of the underlying anomalies, but it is important to understand this statement correctly. The pseudoscalar triangles do decouple in that limit, and when , but this is compensated by the mass-dependent couplings of the to fermions. The fact that this compensation works, leaving a constant remainder in the , is guaranteed by the anomalous axial Ward identity of Eq. (20). Yet, there is no anomalous contribution of any kind to the amplitudes, and those are driven entirely by non-local and well-behaved fermion loops. The situation is quite similar for the loop-induced Higgs decay in the SM, whose non-decoupling behavior in the limit is interpreted as a manifestation of the trace anomaly [20]. Yet, there is of course no direct contribution of that anomaly to , which can be safely computed perturbatively.
4.2 Polar representation
The Higgs boson fields are written in the polar representation555Note that one cannot start from the rotated basis Eq. (40) to exponentiate the Goldstone field , because such a representation would not be canonical for since it has no vacuum expectation value [21]. as (see Eq. (11)):
[TABLE]
where , . The and states are the Goldstone bosons associated to the and symmetries. They are related to the physical Goldstone bosons and by the same rotation as in Eq. (39):
[TABLE]
Note that the components of and are the same since they have the same hypercharge, and the and states are orthogonal to each other. Now, the can be gauged away via a rotation, leaving
[TABLE]
where we introduce to stick to common conventions. The remaining neutral Goldstone field, here denoted instead of , is the axion. Under this form, the true charges of the Higgs doublets are finally apparent, since under such a transformation, . Then, the Yukawa Lagrangian can also be made invariant under provided the fermion charges are set as
[TABLE]
with
[TABLE]
where and are free parameters, corresponding to the conservation of baryon and lepton numbers and .
Following the same steps as in Section 2, we now perform a reparametrization of the fermion fields to force them to be invariant under . This is achieved by the field-dependent rotations . The axion field disappears from the Yukawa Lagrangian, but reappears in derivative interactions from the fermion kinetic terms
[TABLE]
with
[TABLE]
The vector current couplings are and -dependent, since those parameters reflect the conservation of and , which are vectorial [9]. The dependence of is related to the peculiar nature of neutrinos, which are kept massless. As we will see in Section 4.3.4, it disappears if right handed neutrinos together with the -invariant Yukawa interaction are added.
In addition, given the charges in Eq. (52), the fermion reparametrization is anomalous and its Jacobian has to be included
[TABLE]
Given the SM fermion quantum numbers, the coefficients are
[TABLE]
where , are the and dimensions and quadratic Casimir invariant of the representation carried by the field , respectively, and by extension, with . These expressions correspond to Eq. (27), choosing to reflect the absence of anomalies for the SM gauge symmetries (so no Bose symmetry here).
The parameters and enter in a combination reflecting the anomaly in the current. Interestingly, neither the QED nor the QCD coefficients depend on them, but those to electroweak gauge bosons do. To see this, it sufficies to plug , and in Eq. (56):
[TABLE]
and the free parameters cancel out in
[TABLE]
Concerning the strong puzzle, the polar representation makes the shift symmetry explicit. Under a transformation of parameter , all the fields are invariant but for . So, if there is a term in the gauge sector, it can be eliminated by an adequate shift of the field. As said in the introduction, what happens in reality is that one is forced by non-perturbative QCD effects to shift the field by precisely that amount [1]. Once the shift is set by QCD, the electroweak can still be removed by an appropriate choice of . All that remains then is the abelian , which is an inoccuous total derivative. We thus recover the well-known fact that the electroweak term can be rotated away thanks to the and invariance of the model.
4.3 Matching the polar and linear representations
The physics must not depend on the representation chosen for the scalar fields or on the parametrization of the fermion fields. So, the one-loop amplitudes of Eq. (43) must match the amplitudes computed using the local terms together with the triangle graphs arising from the derivative interactions in (these two contributions are of the same order in the coupling constants). At this stage, it is immediately clear that the triangle graphs will play a crucial role. Indeed, taken alone, the contributions from the anomalous local terms of match those found in the linear case in the limit, Eq. (47), only for the and final states. For the others, there is no dependence on and in Eq. (47), but there is one through in . Even worse, no choice of these parameters could make the two results compatible because the relative strengths of the , , , and decays are irremediably different. As we will now detail, it is only once the anomalous contributions from precisely cancel out with those hidden in the triangle graphs that the amplitudes match the amplitudes.
Throughout this section, the momentum flow is defined as , with , and neither the final gauge bosons nor the initial axion are necessarily on-shell. The final amplitudes thus depend on , , and , and are to be compared to those obtained in the linear case in Eq. (43).
4.3.1 The and decays
For the decays into photons or gluons, only the triangle contributes since and are vector symmetries. Then, the situation is exactly the same as in the toy model of Section 2. Setting in Eq. (30), we find
[TABLE]
and
[TABLE]
where , , is the electric charge of , and , . The amplitudes in the polar representation thus trivially match that in the linear case, see Fig. 3,
[TABLE]
In other words, the anomalous contact interactions do cancel out systematically with the anomalous part of the triangle graphs.
As for the toy model, the polar representation is thus interesting only to make the shift symmetry manifest, and because the contact and interactions read off are reliable book-keeping of the effects of heavy fermions. Specifically, implies that . Finally, remark that the cancellation of the local anomalous terms ensures in the limit. So, though interpreting the axion coupling to photons or gluons as induced by the anomaly is incorrect, this misidentification does not lead to serious consequences for those final states. For heavy fermions, the coupling to gluons is tuned by , and that to photons by , and their ratio, when restricted to quarks, give back the usual . However, as we will see in the next subsection, interpreting the axion coupling involving at least one electroweak gauge boson as induced by the anomaly is not only wrong in principle but also leads to incorrect couplings.
4.3.2 The decay
For electroweak gauge bosons in the final state, the situation is less simple. Consider first the final state. The derivative interactions induce again only the triangle graphs since the photon coupling is vectorial. However, this time we have two possible contributions, depending on which current is carrying the anomaly, see Fig. 4. First, there are the usual triangles, for which the anomaly is in the axial current. Using Eq. (30) with , we find
[TABLE]
where . But contrary to the and final state, there are now new contributions from the triangles, with the axion vector couplings of Eq. (54). Using again Eq. (30) but this time with to preserve , and noting that ,
[TABLE]
As explained in Section 3, this contribution is free of any mass-dependent term because the naive vector current conservation must hold, up to the anomaly.
These two contributions from the derivative interactions combine with the Jacobian term from Eq. (60),
[TABLE]
to give the total decay amplitude in the polar representation:
[TABLE]
as it should. We can now understand why does not match in the limit. Indeed, while vanishes in the limit, obviously does not since it is independent of . In that limit, we should write
[TABLE]
in which the parameters and cancel out. So, for a chiral gauge theory, the local terms coming from are no longer reliable book-keeping of the effect of heavy fermions, because part of the anomaly is hidden in the triangle.
Actually, the reason why the triangle plays a role can be understood directly from the fermion reparametrization, given the charges in Eq. (52). When and/or are different from zero, the fermionic current associated to has a component aligned with baryon and/or lepton number, respectively. The fermionic reparametrization thus generates the anomalous interactions: those correspond to the terms of and in Eq. (56). But as for the toy model, these interactions are spurious and must cancel with the anomalies present in the triangle graphs induced by the derivative interactions . This must necessarily come from a breakdown of the vector Ward identity for the triangle since and are purely vectorial symmetries [22].
4.3.3 The and decays
Turning first to the amplitude, the derivative interactions induce new types of diagrams: the triangles and graphs with neutrinos circulating in the loop, see Fig. 5.
Proceeding with the calculation, the three contributions from the derivative terms are
[TABLE]
where is the part of Eq. (43) proportional to . The neutrino contribution is absent from that term since the loop function in Eq. (32) vanishes for massless fermions. The next piece comes from the divergence of the axionic vector current,
[TABLE]
There is no mass-dependent terms here because of the vector Ward identity. Finally, the triangle gives, using Eq. (31) with
[TABLE]
where is the part of Eq. (43) proportional to . Again, the neutrino contribution to that part vanishes. Combining the three contributions of the derivative interactions with that of the Jacobian
[TABLE]
all the local anomalous contributions cancel exactly, leaving:
[TABLE]
Though the anomalous terms again sum up to zero, they are now completely entangled in the three types of triangle graphs, which all depend on the free parameters and . As a result, none of the derivative amplitudes vanish in the limit. Specifically, because neutrinos remain massless,
[TABLE]
where comes from the neutrino coupling in Eq. (55). Also,
[TABLE]
because in addition to massless neutrinos, the mass term respects Bose symmetry but not the anomaly which resides only in the axion axial current (there is a relative factor of 1/3, see Eq. (34) with ). Finally, does not vanish since it is mass-independent. Because of all this, is clearly not identical to in the limit, and cannot be used as an estimate of the electroweak couplings of the axion.
The amplitude shares many similarities with that for , so we will be very brief. Following the same steps, the various contributions to the decay in the polar representation are
[TABLE]
Note well that it is that occurs in these amplitudes, and not the coefficients to which tends to in the limit. Still, the sum of these four contributions reproduces again the linear result, . As none of the triangle amplitudes vanish in the limit, is clearly not identical to in the limit.
4.3.4 Impact of heavy neutrinos
The effective interactions in Eq. (47) involve five coefficients, but this is partly due to the non-decoupling of neutrinos. To see this, let us add right handed neutrinos together with the Yukawa interaction . Then, the effective amplitude in the linear representation is trivially obtained by extending the sum in Eq. (43) to . If, for the sake of the argument, we now take the infinite mass limit for all fermions, , the effective interactions in Eq. (47) are recovered but with
[TABLE]
By contrast, it is amusing to remark that in the polar representation, the anomalous interactions of Eq. (56) are not modified by the presence of right-handed neutrinos since they are neutral under the whole SM gauge group. Yet, right-handed neutrinos do modify the derivative interactions in Eq. (55) to and . The various triangle amplitudes are then obtained from those quoted in the previous section by replacing and discarding the terms. While now vanish in the limit, this does not change our conclusions and remain different from in that limit.
As a final note, it is interesting to remark that once certain that and disappear from the physical amplitudes, one can chose to fix them as one wishes. In particular, as said before, they can be set to eliminate the electroweak term. Alternatively, one can set , thereby getting rid of the coupling altogether. As we have seen, this does not forbids since the amplitude is independent of and . Finally, this freedom can also be used to allow for the presence of a lepton number violating Majorana mass term, as well as an effective dimension-five operator. We will not explore such settings here, see Ref. [23] for studies involving axions together with neutrino Majorana mass terms.
4.4 Application to the DFSZ axion model
The Peccei-Quinn model is ruled out experimentally because the coupling of the axion to fermions, tuned by the electroweak vacuum expectation value, is too large. One way to extend the model and render the axion invisible was proposed not long after the original idea and is called the DFSZ axion model [24]. Our goal here is to derive the electroweak couplings for that model.
The starting point is to extend the THDM by adding a complex scalar field , singlet under the SM gauge symmetries, and constrain the scalar potential to
[TABLE]
In this way, the whole Lagrangian remains invariant under independent rephasing of the doublet provided the singlet has the charge:
[TABLE]
The Type II Yukawa couplings of Eq. (38) then imply chiral charges for the fermions when , and part of the symmetry is anomalous.
Let us take the linear representation (compare with Eq. (40))
[TABLE]
where are given in terms of in Eq. (39). Plugging these representations in the potential, and extracting the quadratic terms, the field is immediately massless and corresponds to the Would-be Goldstone of the boson. On the other hand, the and fields mix,
[TABLE]
with and . The field is the axion, and the field is a heavy pseudoscalar Higgs boson. While when , reproducing the THDM discussed previously, the opposite holds when [24]:
[TABLE]
Then, since is not coupled to fermions, it is easy to see that the amplitudes are simply rescaled by , i.e.,
[TABLE]
with the same coefficients as in Eq. (48). In particular, the axion couplings to , , and are not aligned with the purely anomalous interactions one derives looking only at the charges of the fermion fields.
4.5 Anomaly cancellation in generic axion models
In the previous sections, we showed how the anomalies present in the non-linearly realized axion models necessarily cancel out, since they are spuriously generated by reparametrizing an anomaly-free model. Here, we want to follow the opposite route. Starting from a Non Renormalizable (NR) effective Lagrangian valid above the electroweak scale:
[TABLE]
where is some high scale, as well as some a priori free Wilson coefficients, we want to show that whenever the anomalies cancel, this model collapses to a simple renormalizable theory.
To do this, starting from , we require the anomalous interactions to cancel with the local anomalous terms present in the , , and triangle graphs. Imposing this for the five channels , , four independent constraints are derived:
[TABLE]
together with
[TABLE]
Once these conditions are imposed, all the processes are entirely driven by the non-anomalous parts of the and triangle graphs (see Figs. 3, 4, and 5). The triangles disappear since they have no mass-dependent terms. But then, these non-anomalous amplitudes satisfy the naive Ward identities, and are equivalently obtained from and triangles, provided
[TABLE]
Phenomenologically, and are indistinguishable once anomaly cancellation is enforced, even if has far fewer physical parameters. This can be understood as follows. Out of the initial ten parameters in for massless neutrino (), four are eliminated by the anomaly cancellation constraints. Then, the naive Ward identities are valid again. The three parameters disappear thanks to , and leaves only the three parameters as truly physical.
Now, starting from , all the decay amplitudes are easily computed, and given by Eq. (43). In the limit, keeping neutrino massless, these amplitudes match onto the effective couplings in Eq. (47), with
[TABLE]
In this limit, we thus do not recover the manifestly invariant pattern of anomalous interactions of . Finally, note that if one takes , , the results in Eq. (48) are recovered. This shows that is the most generic implementation of the PQ (or DFSZ) axion model, where the charges of the SM fermions can take any value.
5 Conclusion and perspective
In this paper, we derived the couplings of axions to gauge bosons. We started from a simplified toy model, and then extended the discussions to the KSVZ axion [12], the original PQ axion [1], and the DFSZ axion [24]. In each model, our strategy has been to match the axion decay modes computed using either a linear or a polar representation for the scalar field breaking the symmetry. In this way, we were able to unravel the physics at play, and uncovered a number of interesting features:
- •
In the linear representation, the axion couplings to gauge bosons are not induced by the anomaly, but by non-anomalous pseudoscalar triangle graphs with fermions circulating in the loop. Though this has to our knowledge not been exploited before, these amplitudes can be identified with those of the pseudoscalar Higgs to gauge bosons calculated in the THDM, which have been known for a long time [19]. In the limit, they match onto the effective interactions
[TABLE]
with the coefficients shown in the first column of Table 1. In the opposite limit , all these amplitudes vanish since the axion couplings to fermions are proportional to the fermion masses.
- •
In the context of axion models, it is customary to adopt a polar representation for the scalar fields. The pseudoscalar axion couplings to fermions are replaced by contact anomalous interactions to the gauge bosons and axion derivative interactions to the fermions. All these interactions are entirely fixed in terms of the assigned charges of the SM fermions. As shown in Table 1, this parametrization is particularly convenient for and because the derivative interactions do not contribute in the limit [9, 10], and thus the strengths of the and processes can be read off the anomalous couplings and . By contrast, and contrary to what is conjectured in Ref. [9], the central result of this paper is that for chiral gauge theories in which chiral fermions have non-trivial charges, the strengths of the , , and processes do not match the anomalous couplings, even in the limit, as evident comparing the first and second columns of Table 1. They thus cannot be encoded into the invariant effective interactions a^{0}B_{\mu\nu}\tilde{B}^{\mu\nu}\and .
- •
The main reason for this mismatch is due to the presence of triangle graphs arising from the derivative interactions and , which do not vanish in the limit for chiral gauge theories. Importantly, even the vector couplings play a role since the anomalous breaking of the axionic vector current conservation enters through the triangle graphs (see Figs. 4 and 5). One important result of this paper is the proof that once all these triangle contributions (last three columns of Table 1) are summed with the local anomalous amplitudes (second column of Table 1), the THDM results are recovered. Without surprise, physical observables do not depend on the chosen parametrization, and this further confirms that the THDM calculation is correct.
- •
Our analysis also proves that the axion to gauge boson processes are not induced by the anomaly. This is obvious in the linear representation since the fermion loops driving these processes are not anomalous. In the polar representation, what happens is that the local anomalous interactions precisely cancel with the anomalies present in triangle graphs induced by the vector and axial derivative interactions, leaving the non-anomalous pseudoscalar triangles of the THDM as the only surviving mechanism driving all the axion to gauge bosons processes. Ultimately, this precise cancellation is ensured by the anomalous vector and axial Ward identities. It is required because all the amplitudes must vanish in the limit, but the anomalous contributions are independent of the fermion mass666This interpretation is compatible with the matching observed in Table 1 in the limit for . Indeed, in the polar representation, these amplitudes receive two contributions, from the local anomalous term, and from the triangle amplitudes built from . But in this case, the axial Ward identity translates as , with the THDM amplitude. So, even if parametrically, and when , are not induced by the anomaly since it cancels out in ..
- •
In all axion models where chiral fermions have charges but conserve baryon and lepton numbers, both the vector couplings and the , anomalous contact interactions depend on some free parameters [9]. In Table 1, these free parameters enter into , and are related to the freedom to choose the charge of left-handed fermions (there are also explicit occurrence of the free parameter , but this is an artifact of keeping the neutrino massless). It is only once the failure of the naive vector Ward identity in the triangle graphs is properly accounted for that these free parameters cancel out, as they should [22]. To understand how this cancellation is connected with the conservation of baryon and lepton numbers and , notice first that the freedom to choose the charges of the fermions is actually due to the invariance under and . In general, the symmetry has two arbitrary components aligned with and . But then, these components translate into derivative couplings of the axion to the and fermionic vectorial currents together with a local coupling from the anomaly. Clearly, all these couplings are spurious when the axion is a Goldstone boson living in a and -invariant vacuum. Since and are not spontaneously broken, must systematically drop out of physical observables, and Table 1 shows that this indeed occurs.
- •
In the linear representation, the calculation is done in the electroweak broken phase throughout. In the limit, the amplitudes do not match onto invariant operators, as shown in Table 1. Though this is somewhat expected, what is not obvious is the very peculiar way in which the symmetry breaking seeps in. Indeed, in the non-linear representation, the anomalous interactions and the derivative interactions are manifestly invariant once the SM fermions are assigned charges in an invariant way. Further, evidently, the anomalies present in the triangle graphs built on the derivative interactions do not break since that would put the SM gauge invariance itself in jeopardy. So, in that representation, the fact that the chiral symmetry is spontaneously broken enters only and entirely through the fermion masses. The triangle graph explicitly break the symmetry because the limit is incompatible with that symmetry. In the opposite limit , the symmetry is recovered but in a very trivial way: the triangle graphs precisely cancel the local anomalous term and all the amplitudes simply vanish.
- •
We derived generalized forms for all the triangle anomalies, including the terms proportional to the fermion masses, see Eqs. (30) and (31). This requires going beyond usual regularization procedures. To our knowledge, and except those based on dispersion relations [17], only that described by Weinberg in Ref. [15] offers sufficient freedom to choose which current in the and triangles is to carry the anomaly. This is crucial to deal with chiral gauge theories, in which some axial currents have to remain anomaly-free to preserve gauge invariance, as well as for treating the and vectorial currents [22].
In the present paper, we consider only true axion models, those designed to solve the strong puzzle and for which the axion ends up extremely light and extremely weakly coupled to SM fields. However, our results could have consequences for so-called Axion-Like Particle (ALP) searches (see e.g. Refs. [25, 26, 27, 28, 29, 30, 31]). Specifically, our analysis shows that an adequate effective parametrization is
[TABLE]
with from Eq. (93),
[TABLE]
for some free parameters , and from Eq. (25)
[TABLE]
for some free parameters . Typically, encodes the effects of heavy vector-like fermions, while encode the anomaly-free impact of charging the SM fermions under some global symmetry whose current is coupled to the field. The impact of some new heavy chiral fermions (like a sequential fourth generation or a heavy neutrino) is not included, but could be by extending the sum in to those states also. This parametrization of ensures is truly axion-like: no or violating effects are implicit, only physical free parameters are introduced, and UV divergences should be under control. Still, it is important to stress that the , , and amplitudes induced by are entirely tuned by triangle graphs, and in the limit, they do not match the pattern of but rather that in Eq. (47).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] R. D. Peccei and H. R. Quinn, Phys. Rev. Lett. 38 (1977) 1440; R. D. Peccei and H. R. Quinn, Phys. Rev. D 16 (1977) 1791.
- 2[2] S. Weinberg, Phys. Rev. Lett. 40 (1978) 223.
- 3[3] F. Wilczek, Phys. Rev. Lett. 40 (1978) 279.
- 4[4] V. Shtabovenko, R. Mertig and F. Orellana, Comput. Phys. Commun. 207 (2016) 432 [ar Xiv:1601.01167 [hep-ph]].
- 5[5] G. Grilli di Cortona, E. Hardy, J. Pardo Vega and G. Villadoro, JHEP 1601 (2016) 034 [ar Xiv:1511.02867 [hep-ph]].
- 6[6] W. A. Bardeen and S.-H. H. Tye, Phys. Lett. 74B (1978) 229.
- 7[7] J. E. Kim, Phys. Rept. 150 (1987) 1.
- 8[8] D. J. E. Marsh, Phys. Rept. 643 (2016) 1 [ar Xiv:1510.07633 [astro-ph.CO]].
