TL;DR
This paper derives and automates constraints on 3- and 4-loop $eta$-functions in general four-dimensional quantum field theories, providing new consistency conditions and the complete 3-loop gauge $eta$-function including kinetic mixing.
Contribution
It extends the Osborn equation approach to general gauge theories and automates the derivation of high-loop $eta$-function constraints, including the first complete 3-loop gauge $eta$-function with kinetic mixing.
Findings
Derived extensive non-trivial constraints on $eta$-functions.
Automated the computational procedure for high-loop order calculations.
Presented the complete 3-loop gauge $eta$-function for general QFTs.
Abstract
The -functions of marginal couplings are known to be closely related to the -function through Osborn's equation, derived using the local renormalization group. It is possible to derive strong constraints on the -functions by parametrizing the terms in Osborn's equation as polynomials in the couplings, then eliminating unknown and coefficients. In this paper we extend this program to completely general gauge theories with arbitrarily many Abelian and non-Abelian factors. We detail the computational strategy used to extract consistency conditions on -functions, and discuss our automation of the procedure. Finally, we implement the procedure up to 4-, 3-, and 2-loops for the gauge, Yukawa and quartic couplings respectively, corresponding to the present forefront of general -function computations. We find an extensive collection of…
| No. of coefficients | ||||||||
| TS basis | CCs | |||||||
| 1 | 4 | 1 | 3 | 4 | ||||
| 2 | 14 | 4 | 7 | 5 | 16 | 1 | ||
| 3 | 49 | 27 | 33 | 33 | 5 | 91 | 26 | |
| 4 | 257 | 260 | 9 | 198 | 303 | 33 | 703 | 265 |
| 4 () | 4 | 4 | 5 | 9 | 5 | |||
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
\setsecnumdepth
subsection
\makepagestylemyruled \makeoddheadmyruled0 \makeheadrulemyruled\normalrulethickness \makepsmarksmyruled \nouppercaseheads\createmarkchapterbothshownumber . \createmarksectionbothshownumber.
\maxtocdepthsubsection
CP3-Origins-2019-23 DNRF90
Constraints on 3- and 4-loop -functions in a general four-dimensional Quantum Field Theory
**Colin Poole111 [email protected] and Anders Eller Thomsen222 [email protected]
** -Origins, University of Southern Denmark,
Campusvej 55, DK-5230 Odense M, Denmark
Abstract
The -functions of marginal couplings are known to be closely related to the -function through Osborn’s equation, derived using the local renormalization group. It is possible to derive strong constraints on the -functions by parametrizing the terms in Osborn’s equation as polynomials in the couplings, then eliminating unknown and coefficients. In this paper we extend this program to completely general gauge theories with arbitrarily many Abelian and non-Abelian factors. We detail the computational strategy used to extract consistency conditions on -functions, and discuss our automation of the procedure. Finally, we implement the procedure up to 4-, 3-, and 2-loops for the gauge, Yukawa and quartic couplings respectively, corresponding to the present forefront of general -function computations. We find an extensive collection of highly non-trivial constraints, and argue that they constitute an useful supplement to traditional perturbative computations; as a corollary, we present the complete 3-loop gauge -function of a general QFT in the scheme, including kinetic mixing.
Contents
-
3.2 Perturbative expansion of quantities in Osborn’s equation
-
A.5 Comparison of 3–2–1 consistency conditions with previous work
1 Introduction
This paper represents the culmination of a rather sporadic research program, investigating constraints on the -functions of a general Quantum Field Theory (QFT), given their relation to a particular scalar function of the couplings. In our experience, attempting to explain how such constraints are derived has proven difficult, not least due to the use of an intricate diagrammatic representation for all quantities involved. Therefore, we have attempted to make our paper as comprehensive and pedagogical as possible, starting with the introduction itself, which we have split into two parts: the first part is shorter, and seeks to contextualize our results with applications in mind; the second part is the introduction proper, written from the underlying theoretical perspective.
1.1 The short version
The Renormalization Group (RG) formalism implies a change in the couplings of a renormalizable Lagrangian density as one varies the energy scale [1]. The change in a coupling, i.e. its RG flow, is described by its -function, thus -functions are of fundamental importance in QFT. At a given order in perturbation theory, the -functions are simply polynomials in the couplings, with the precise form depending on the type of interaction. Given a general Lagrangian with tensor couplings, each -function can be parametrized in terms of Tensor Structures (TSs); that is, a basis of contractions between the various couplings and gauge generators, in the style of Machacek and Vaughn [2, 3, 4]. This method essentially “factors out” the actual field content of the theory, so that the coefficient multiplying each TS depends only on the renormalization scheme, allowing one to derive the -functions for a completely general theory.
Once an adequate representation of the general -functions has been found, investigating non-trivial constraints on possible RG flows becomes much more tractable; furthermore, by using a parametrization with TSs and unknown coefficients, constraints can be investigated in a manifestly scheme-independent manner. The constraints in which we are interested are those derived from Osborn’s equation (see Eq. (1.1) below), which relates coefficients of each -function at different loop orders. This is known as the “3–2–1” ordering, since the coefficients of the gauge -function are related to the Yukawa coefficients one loop lower, and the scalar coefficients two loops lower. Previous attempts at investigating these constraints have been limited in different ways: some analyses of Osborn’s equation have been done using the -functions for a general scalar-fermion theory without gauge interactions [5], or a general theory with a simple gauge group [6]; another analysis included the three gauge groups of the Standard Model (SM), but considered only the top Yukawa and Higgs self-interaction couplings, and limited its investigations to constraints that do not involve higher-order corrections [7].
The power of Osborn’s equation comes with the realization that one can investigate the associated constraints algebraically, without actually knowing the -function coefficients. Phrased this way, Osborn’s equation becomes a way to constrain (or even derive) some higher-order coefficients using only lower-order coefficients. In this paper, we first extend the Machacek and Vaughn construction to manifestly include multiple gauge groups and kinetic mixing, thus creating a representation of the general gauge -function in line with the Yukawa and quartic -functions. We then perform a complete analysis of Osborn’s equation, deriving constraints up to order 4–3–2; not only does this allow us to determine, and for the first time present, the completely general 3-loop gauge -function with kinetic mixing, it also gives us the first predictions for certain coefficients of the general 4-loop gauge -function, and fixes the known ambiguities that arise at four loops when dimensionally-regularizing integrals that involve [8, 9, 10].
1.2 The long version
In 1974, shortly after the formulation of the Renormalization Group (RG) itself [1], Wallace and Zia suggested that the RG flow of a general QFT could in fact be a gradient flow, with the -functions of the theory related to derivatives of some scalar quantity via a positive-definite metric [11, 12]. They were able to show that such a gradient formula holds for the -function of a general theory up to two loops, but that modifications at three loops spoil the manifest positive-definiteness of their metric. In addition, they remarked that allowing additional interactions creates similar issues, as a single term in the scalar function may then contribute to multiple -functions at different loop orders. Eventually, as a consequence of work done on the Local Renormalization Group (LRG) by Jack and Osborn [13, 14, 15], these features were subsumed into a substantially more general framework, valid for completely general theories.
The LRG is a powerful extension of usual RG methods: not only does it allow one to include the effects of composite operators in a straightforward manner, but it also leads to additional restrictions on the allowed RG behavior of general renormalizable QFTs. The LRG was used in [13, 14, 15] to re-derive the -theorem [16], and to provide a perturbative proof of Cardy’s conjectured -theorem [17]. While a weaker formulation has been shown to hold non-perturbatively [18] subject to certain assumptions [19], the stronger statement, concerning the existence of a function of the couplings that satisfies a gradient-flow equation and decreases monotonically under RG flow, is still lacking a non-perturbative proof. Nevertheless, for even-dimensional QFTs, there exists a function of the couplings that satisfies a gradient-flow-like equation, and reduces to the coefficient of the Euler density in the trace anomaly at RG fixed points, .
By considering general QFTs as points on a manifold, specified by position-dependent couplings , one may derive relations between RG functions by imposing the Abelian nature of Weyl rescalings. These relations are known as Weyl Consistency Conditions (Weyl CCs), and include Osborn’s equation
[TABLE]
which forms the basis of our analysis333The strong -theorem is equivalent to the existence of a choice of local counter terms, such that the symmetric part of the tensor appearing in (1.1) is positive-definite. An even stronger version is that there exists a choice of local counter terms such that is itself symmetric and positive-definite; would then act as a metric on the space of couplings, and the -functions would satisfy the gradient-flow equation .. If the kinetic terms of the theory possess a global symmetry, then in (1.1) is replaced by a generalized function , which begins to differ at three loops [5], and is related to the possible existence of theories with limit cycles [20]. While the effects of this shift have been included for the simple case of six-dimensional theory [21] and the more complicated four-dimensional scalar-fermion theory [5], equivalent analyses for general gauge theories have not yet reached the point where considering this shift is necessary [6].
All quantities in Osborn’s equation are given as functions of the couplings and group structures of the theory. Thus, it is possible to parametrize Eq. (1.1) in terms of a finite basis of TSs loop order by loop order. The existence of a function satisfying Eq. (1.1) then implies relations between the coefficients of the -functions of the theory at different loop orders444The order at which the gauge-, Yukawa- and quartic- functions are related is known as the “3–2–1” ordering; for topological reasons that will later become clear, this ordering is manifestly preserved as one calculates at higher loops.. These relations may be formulated in a scheme-independent manner, and are sufficient to predict some higher-order -function coefficients, the precise magnitude of the shift [21], and even the singular parts of certain Feynman integrals [22, 23]. Most recently, it has been shown that (1.1) implies the existence of a constraint on the 4-loop strong-coupling -function of the SM, involving Feynman integrals rendered ambiguous by the treatment of in dimensional regularization; the constraint fixes the ambiguity by relating it to unambiguous terms in the general 3-loop Yukawa -function [10]. In these papers, the construction of was carried out by hand, making use of a diagrammatic representation of the tensor couplings in the corresponding theory. This method was sufficient to render the first few loop orders tractable, but as with any perturbative calculation, the increase in complexity at higher loop orders limits its reliability and practicality. Consequently, the construction of for a general gauge theory (with a simple gauge group) has only been carried out to four loops, falling one loop short of being able to derive predictive constraints on cutting-edge perturbative calculations.
In order to overcome this restriction, we have endeavored to automate the diagrammatic construction of , plus extraction of the associated consistency conditions (CCs), through the use of a custom Mathematica code, GRAFER 555As with all programs in theoretical physics, we too needed a punchy-sounding name with an associated acronym. We have settled on ‘GRAdient Flow Equation Reducer’, which we think is marginally better than the original name, ‘GRAdient Flow Equation Rrrrrrrsomething’.. We have also extended the previous diagrammatic representation of the gauge sector to allow for multiple gauge groups and kinetic mixing of the Abelian factors; combined with a parametrization of the modification, GRAFER allows us to extract the CCs for arbitrary renormalizable four-dimensional QFTs, based on a compact gauge group with any number of Abelian factors. Since -functions determine the change in couplings at different energy scales (hence controlling the strength of particle interactions), the ability to deduce or relate progressively-higher-order -function coefficients has obvious phenomenological relevance. Whether through direct employment or as an independent cross-check of standard perturbative methods, we believe that our approach can be useful in the program of perturbative -function computations.
A key motivation for our work is attempting to demonstrate just how restrictive Weyl CCs can be. Ever since the development of the RG and subsequent discovery of asymptotic freedom in perturbative QFTs [24, 25], -function computations have been carried out for a huge range of models, with applications ranging from critical exponents, phase transitions and universality, to the existence of asymptotically-safe theories described by interacting RG fixed points [26]. Despite this ubiquity, comparatively little attention has been given to the idea of calculating -functions for completely general theories; whereas the QCD -function is now known to five loops [27], and the -functions of the full Standard Model are known to three loops [28, 29, 30], the -functions for completely general theories have been stuck at two loops for over 35 years [2, 3, 4] (modulo an extension in the early ’00s to theories with kinetic mixing [31]). The highest-loop general -function of which we are aware is the 3-loop gauge -function for a general theory with a simple gauge group [32]; we shall show that one can combine this result with the constraints imposed by Weyl CCs to infer the 3-loop gauge -function for any theory with a compact gauge group.
The paper is organized as follows: in section 2, we give a brief review of the LRG, the derivation of Osborn’s equation (plus modifications), and the associated tensors; we also outline a conjecture, in which we speculate that it may always be possible to enforce symmetry of the tensor in Osborn’s equation by adding suitable local counter terms. In section 3, we formulate a general, renormalizable, four-dimensional QFT with an arbitrary compact gauge group, indicating how all fields and couplings of a particular type can be assembled into field multiplets and tensor couplings. In section 4, we introduce the graphical notation employed in [6], extend the gauge sector to the general case, and describe the overall strategy for deriving constraints on the -functions as employed in GRAFER. In section 5, we then present our main results: an extension to the general case of previously-derived CCs at orders 2–1–0 and 3–2–1, followed by a sample and discussion of the conditions at order 4–3–2666Given the sheer quantity of CCs, it would be impractical to include them all in the main text; instead, we have provided ancillary files detailing the GRAFER results with all tensors and CCs., chosen to highlight certain non-trivial features of Osborn’s equation at sufficiently-high order. Section 6 concludes the paper, with comments on how our program can be extended to derive analogous constraints for theories in different spacetime dimensions, and some indications of previous results for supersymmetric theories. The appendix includes the fully general gauge, Yukawa, and quartic -functions for a general four-dimensional QFT, in the scheme, up to order 3–2–2. To facilitate understanding of the diagrammatic notation, we have also included the explicit tensor contractions represented by each diagram in the -functions, and diagrams of all and contributions at order 3–2–1. We have also included direct comparison with previous work on kinetic mixing and Weyl CCs.
Finally, we would like to comment on the results presented in section 5: due to the ever-increasing complexity of constraints at higher orders, we have opted to give only -specific results, thus losing the manifest scheme-independence of the complete constraints derived from Osborn’s equation, but gaining -specific predictive power. Throughout the paper, we have attempted to comment on such scheme-dependent matters when relevant, and will repeatedly emphasize how the underlying method of deriving CCs is completely scheme-independent.
2 Local Renormalization Group
Most of the material in this section is essentially yet another retelling of the LRG, the consequences of Weyl CCs, and the derivation of Osborn’s equation, based on the papers by Jack and Osborn [14, 15, 5]; the uninterested reader may choose to skip to section 3. We have attempted to keep only those features relevant for the eventual extraction of CCs between different -functions in a general four-dimensional theory; for a substantially more in-depth and pedagogical discussion of the LRG in various dimensions, including (but not limited to) its relation to the proof of the two-dimensional -theorem and four-dimensional weak -theorem, the question of scale- vs. conformal-invariance, and the existence of QFTs with limit cycles, we highly recommend two reviews of the subject, given by Shore [19] and Nakayama [33].
2.1 Weyl consistency conditions and Osborn’s equation
Osborn’s equation, (1.1), is derived from the LRG, which posits that the RG flow of a general QFT on a curved spacetime background can be reinterpreted as the response of a theory with “local couplings” to a local Weyl rescaling , accompanied by the appropriate field transformations. By promoting the couplings to spacetime-dependent fields with associated -functions , one can define an action of the form
[TABLE]
where is classically conformally-invariant (in our case it is a sum over conformally-coupled, non-interacting fields on curved spacetime), is a set of marginal operators, is the appropriate set of all -dimensional operators containing curvature terms and spacetime-derivatives of the couplings, and is the set of associated coefficients defined on the space of couplings; may simply be interpreted as the additional counter terms required to ensure finiteness of the associated quantum theory.
The response of the theory to a Weyl rescaling can be offset by a corresponding change of the couplings along their RG trajectory, leaving only an anomaly stemming from the added counter terms. We define the functional differential operator
[TABLE]
that will therefore act on a suitably-renormalized vacuum generating functional777In general, the Schwinger Action Principle implies that -point correlation functions of a local QFT in a general curved spacetime are obtained by corresponding functional derivatives of the vacuum expectation value, so that
and so on. The phrase “suitably renormalized” indicates the notion that all such -point functions are finite; indeed, this finiteness criterion is what first led to the introduction of additional terms containing spacetime-derivatives of the couplings. according to
[TABLE]
is again some combination of curvature scalars, coupling-derivatives, and -derivatives. This is nothing more than an integrated version of the Trace anomaly with additional contributions from local couplings,
[TABLE]
where is the standard gravitational contribution888While all labels are ultimately arbitrary, we have opted for the notation in the two-dimensional Trace anomaly rather than the more conventional central-charge notation , as we wish to emphasize that can be interpreted as the coefficient of , the two-dimensional Euler density. Using this interpretation, the idea that a general even-dimensional analogue of the -theorem is related to the coefficient of the associated Euler density becomes much less mysterious, and was indeed argued to be the case in [34, 35].,
[TABLE]
with coefficients
[TABLE]
for a theory with real scalars, Weyl fermions, and gauge fields999The coefficient , while non-zero for general theories, must vanish at RG fixed points [36]. There are additional four-dimensional contributions to involving and the Pontryagin density: the former can be removed by addition of a local counter term; the latter we shall address briefly at the end of Section 2.2.. Since and are marginal, one can simply define an analogous -function, , for the local counter terms, , and rewrite (2.3) with a global Weyl rescaling as a standard RG equation,
[TABLE]
subject to the constraint
[TABLE]
i.e. plays the rôle of an RG scale. For this reason, (2.3) is taken to be the definition of an RG equation for a theory in curved spacetime with local couplings.
Having maintained the renormalizability of the theory with local couplings (thus establishing consistency of the LRG), Osborn showed that one can derive restrictions on the form of some of the terms that appear in (2.3) by applying the Wess–Zumino CCs for the Trace anomaly. Since a local Weyl rescaling is generated by the operator (2.2), the Abelian nature of Weyl rescalings dictates that the commutator of two such operators, acting on the renormalized action, must vanish:
[TABLE]
This criterion leads to a vast number of intricate identities relating different tensors; such constraints are referred to as Weyl CCs. In four dimensions, we may parametrize the possible terms according to
[TABLE]
where , , and are the Weyl curvature tensor, Euler density, and Einstein tensor respectively. Eq. (2.3) then takes the form
[TABLE]
with
[TABLE]
Imposing (2.9) for arbitrary Weyl rescalings then requires that the contributions proportional to , , and must vanish separately; furthermore, requiring that these identities hold for arbitrary position-dependent couplings eventually leads to the two crucial conditions
[TABLE]
The first condition shows that the function satisfies (1.1),
[TABLE]
while the second relates the symmetric tensor to two- and three-point functions of [14]. Since reduces to the Euler coefficient of the Trace anomaly at RG fixed points () and contracting (1.1) with gives
[TABLE]
the function would satisfy the strong -theorem if were positive-definite. Unfortunately, since there is no manifest positivity constraint on , the second condition is not enough to guarantee positive-definiteness [14]; nevertheless, the existence of the function for a general four-dimensional QFT is sufficient to imply CCs on the -functions, and it is these conditions that we wish to derive in this work. By a slight abuse of terminology, the -function conditions are also referred to as Weyl CCs, as they are ultimately derived from Osborn’s equation.
2.2 Modifications of Osborn’s equation
There are two necessary modifications that must be made to the derivation of (1.1), one of which is highly non-trivial. The first is simply the observation that we may always add additional local counter terms to the theory, and that these counter terms can take the same generic form as (2.10); that is, we are free to change the finite part of the counter terms in Eq. (2.1) by , i.e. terms whose coefficients are lower-case versions of the tensors in (2.10). Doing so induces various shifts in the terms of , the most immediately relevant of which are
[TABLE]
From this, it is easy to see that
[TABLE]
and so Osborn’s equation is invariant under the addition of local counter terms—away from RG fixed points, therefore possesses an arbitrariness that is quadratic in .
The second modification is based on the inclusion of relevant operators, in addition to the usual marginal operators. For a four-dimensional theory containing scalars and fermions, one can include vector operators, i.e. spin-1 composite operators with dimension 3. The effect of these operators on the LRG can be investigated by exploiting the continuous part, , of the global symmetry present in the kinetic terms of a general theory with multiple scalars or fermions101010This continuous part of the global symmetry is simply the freedom to rotate the fields, hence the associated global symmetry is for the scalars, and for the Weyl fermions.. If a generic field has a global symmetry transformation , then one must include a compensating transformation for the local couplings, such that the action remains invariant. The classical action then has an associated current , being the aforementioned spin-1 operator; if is sourced by a background gauge field associated to the symmetry, renormalizability of the quantized theory is guaranteed. Thus, all partial derivatives are replaced by gauge-covariant derivatives , and the following transformation laws are obeyed:
[TABLE]
for local transformation , where is the Lie algebra of . The background gauge field then behaves as a source term for the local vector current of the corresponding quantum theory,
[TABLE]
allowing the inclusion of dimension-3 operators. If one introduces an operator that generates these local transformations,
[TABLE]
with “” denoting the appropriate inner product on , manifest background gauge invariance immediately implies the corresponding Ward identity
[TABLE]
similar to how the Weyl rescaling generates the Trace anomaly. Eq. (2.3) is now extended by replacing the partial derivatives of local couplings with gauge-covariant derivatives , and by introducing the new Weyl variation component
[TABLE]
with the assumption of manifest -covariance,
[TABLE]
so that the Trace anomaly is still obtained from the action of a local Weyl rescaling on the vacuum generating functional,
[TABLE]
Now though, the anomalous variation includes additional contributions related to the local current and the background field-strength tensor , stemming from new counter terms involving the background gauge field. Since the -functions are now only determined up to transformations, there is an inherent ambiguity in the Trace anomaly as defined by (2.25)—this ambiguity may be removed by using the Ward identity (2.22) to express (2.23) in terms of -invariant RG quantities
[TABLE]
using which one can rewrite the Trace anomaly derived from Eq. (2.27) in the unambiguous, -invariant form
[TABLE]
with additional contributions
[TABLE]
This alone is a rather striking result, as it demonstrates that conformal invariance of a theory is decided by the vanishing of , not , even for a standard QFT in flat spacetime with global couplings. However, even more interesting is what happens if one re-derives the Weyl CCs using the new Weyl rescaling operator in (2.25). The action of this new operator on the vacuum functional is
[TABLE]
i.e. (2.11) with partial derivatives replaced by covariant derivatives, plus new terms related to the background field-strength, as expected. Imposing
[TABLE]
then leads to generalized versions of the Weyl CCs, including
[TABLE]
subject to the constraint
[TABLE]
Hence, satisfies an equation of the same form as (1.1), with replaced by , and the tensor extended by an additional antisymmetric component:
[TABLE]
By adding appropriate local counter terms, one can again verify that the new Weyl CCs are still invariant under such additions, and that has the expected arbitrariness quadratic in . The new function , valid for theories with additional vector operators, therefore satisfies
[TABLE]
thus establishing the perturbative -theorem for leading-order positive-definite . For RG flows that include limit cycles, the interpretation of this is that has the potential to act as a metric on the quotient space , where couplings related by transformations are identified, such that define fixed points on , and the associated QFT is conformal [5, 19]. A final extension of the LRG formalism includes the introduction of dimension-2 operators. However this bears no influence on the -function constraints that we wish to explore in the present paper, so we will not address this extension further.
The quantity , appearing in the definition of , stands out as the only symbolic modification to Osborn’s equation that is of relevance to us. As shown in [20], it effectively parametrizes any possible difference between a scale-invariant theory () and a conformally-invariant theory (). Since is an element of the Lie algebra , it is necessarily an antisymmetric tensor, and is directly related to the additional counter terms required when one renormalizes field multiplets with position-dependent couplings111111Since the wave-function-renormalization matrices are functions of , is non-trivial, thus one requires new counter terms. For example, expanding a scalar kinetic term with leads to a counter term of the form .. The spacetime-dependent couplings of the theory necessitate a new kind of counter term,
[TABLE]
where is the current associated with the global symmetry. is related to the simple pole of the counter term coefficient, , by . can therefore be calculated by considering self-energy diagrams with local couplings playing the rôle of spectator fields, then simply extracting any possible antisymmetric combinations of tensors.
Strictly speaking, the derivation and modifications outlined above make use of two non-trivial assumptions. First, the LRG formalism itself is predicated on the Trace anomaly conserving parity, thus (2.10) contains no -tensor contributions such as the Pontryagin density, although the coefficient of this term typically vanishes.121212Indeed, it is still unknown whether any unitary QFT can actually produce the Pontryagin term in its associated Trace anomaly: see [37, 38, 39] and many references therein.. Second, by Eq. (2.24), the Ward identities for any global flavor symmetries are assumed to be non-anomalous, and in particular that there is no associated chiral anomaly. These assumptions were relaxed in [40], where it was shown that any such anomalous behavior can be “consistently factorized out” of the LRG formalism; the inclusion of -tensors in (2.10) leads to no additional constraints on the RG flow, and the effects of a chiral anomaly can be absorbed into a further redefinition of . The end result is that Osborn’s equation, in its form, is effectively unaltered by the presence of these anomalies.
2.3 Imposing symmetry on
Beyond the modifications of the formalism discussed in the previous section, we would also like to comment on an additional feature of Osborn’s equation, which has (so far) been shown to hold in all perturbative calculations, but which we have not seen explicitly addressed in the current literature; namely, whether one can always choose local counter terms to make symmetric. As we have seen, the addition of local counter terms reveals an arbitrariness in , but we have not explicitly discussed their effect on .
In the case where one includes the effects of relevant operators, we have already seen that acquires an additional antisymmetric contribution (again neglecting the effects of dimension-2 operators). Additionally, there exist new local counter terms, defined by lower-case versions of the tensors in (2.29), so the corresponding shifts of the anomalies in (2.17) become more involved. Making use of various non-trivial RG identities implied by manifest -covariance, the end result is that the transformations of Eq. (2.17) are simply extended to their -covariant form with additional transformations for the other new tensors. Since only appears in , we only consider the shifts
[TABLE]
with the local counter term coefficient defined via Eq. (2.29). The total shift of the full , as specified in Eq. (2.33), under addition of the finite counter terms is thus
[TABLE]
We can now raise the question of whether there exist local counter terms that make the anti-symmetric part of vanish. A sufficient condition for this to be the case, is if there exist a symmetric , two scalars , and an antisymmetric , such that
[TABLE]
In this case one can simply add counter terms and , to remove the anti-symmetric part of . Up to two loops, it is possible to show directly that such a decomposition of and exists, and that is indeed symmetric, but it is unclear how the program proceeds at higher loop orders. This is perhaps not surprising, as it has already been shown that the first off-diagonal contributions to in a general theory occur at three loops [6].
When deriving constraints on the -functions, imposing symmetry on would correspond to identifying the coefficients of each pair of tensors related by the interchange of open indices and , and setting them equal—in six-dimensional theory at sufficiently high loop order, it has been shown that this does indeed force an additional CC, which (rather amazingly) turn out to be satisfied by the coefficients, and renormalization-scheme independent [21]. Effectively, we conjecture that it is always possible to choose local counter terms such that is symmetric, and that an analogous construction can be made in the extension to all even dimensions.
Finally, before concluding our discussion of the LRG, we should draw attention to the fact that Osborn’s equation has been generalized to an arbitrary even-dimensional spacetime [34], hence the procedure of using Weyl CCs to derive constraints on the -functions will always work, as long as there exists a corresponding even-dimensional, renormalizable QFT with marginal interactions. For example, such constraints have been thoroughly investigated for the case of a general six-dimensional theory [21], where was constructed up to five loops, and all non-trivial features (a non-zero shift , the ability to impose symmetry of ) were shown to hold for arbitrary renormalization schemes.
3 Formalism
This section introduces a compact notation for general four-dimensional renormalizable theories, and sketches the perturbative computation of -functions and in such theories. We have used the terminology “compact gauge group” throughout: given a general QFT, renormalizability constrains the gauge group to be compact, hence it can be any product of simple and Abelian Lie groups.
3.1 General QFT in four dimensions
We begin with a completely general, four-dimensional, renormalizable QFT, allowing for all marginal operators. None of the relevant couplings in the form of mass terms or trilinear scalar couplings impact the -functions of the marginal couplings, so we can safely ignore them for our purpose. With this in mind, the Lagrangian is typically given on the form131313Matching one’s favorite Lagrangian onto the general form used here can be surprisingly difficult. Explicit representations of the couplings and generators for a single SM generation are given in appendix D of [28]. More generally, one can derive the expressions in any theory, by using the “structure-delta” method of [41].:
[TABLE]
All scalars and fermions have been gathered into two multiplets, which may generally be reducible representations of the gauge group. We work with real scalars and Weyl spinors , as any complex scalar or Dirac fermion can be put into this form. For the gauge symmetry, we consider a compact gauge group with any number of Abelian and non-Abelian factors, . Since may contain more than one Abelian factor, we must allow for kinetic mixing terms in the Lagrangian; we shall return to this complication shortly. As per usual, the covariant derivatives are taken to be
[TABLE]
respectively. For Abelian factors the generators are just charge matrices—diagonal for the complex Weyl spinors and antisymmetric for the real scalar fields.
With an eye on future simplifications, we will take several steps to make the notation more compact, in the sense that all fields of a given spin are gathered into just one multiplet. At the risk of adding an additional layer of abstraction, this will help us keep our expressions more manageable later on. Following [6], we define a new multiplet of Majorana spinors , taking advantage of the unitary equivalence between Weyl and Majorana fermions141414An in-depth exposition of the connections between Dirac, Majorana and Weyl spinors in two- and four-component notation may be found in [42]: our conventions match those spelt out in Appendix G.. This allows us to assemble the Yukawa couplings and fermionic generators into larger matrices,
[TABLE]
employing matrix notation for the fermion indices to reduce the clutter. The spinor part of the Lagrangian thus reads
[TABLE]
Gathering the fermions like this removes the distinction between left- and right-handed fermions (in the sense of their representations under the Lorentz group), and ; that is, for any left-handed fermion loop appearing in a given Feynman diagram there is an equivalent diagram with right-handed fermion running in that loop. By using and , both contributions are automatically included in any quantity constructed from the couplings. Of course, this construction neglects the chiral nature of theories such as the SM, where there is a distinction between left-handed and right-handed fermions. The effects of such a difference do not appear in any -functions up to order 3–2–1, but at order 4–3–2 one encounters the first non-trivial occurrences of . In such diagrams, left- and right-handed fermions contribute with opposite rather than same signs, thus we must slightly extend our tensor notation to accommodate this. We will return to the question of as it becomes relevant for our discussion in section 5.4.
For the gauge interactions, we absorb the gauge couplings into the gauge field , so that the gauge kinetic term becomes and the covariant derivatives come without any gauge coupling. Rather than dealing with every product of the gauge group individually, we again employ a compact notation to account for all of them at the same time. From a computational standpoint, all gauge field have identical propagators and Lorentz structure; only the group factors differ depending on the group. Thus, all gauge fields can be incorporated in each diagram using sufficiently general couplings in the vertices, similar to the way all scalar and fermion fields are incorporated into just one multiplet each.
We gather the gauge fields of the separate product groups into one multiplet taking values in the Lie Algebra . The group generators and structure constants are correspondingly gathered by introducing a general adjoint index taking values in
[TABLE]
The gauge couplings are arranged in a diagonal matrix,
[TABLE]
in this notation. It is a positive definite matrix and works as a metric on ; it is still subject to gauge symmetry, so it stays diagonal and is proportional to the identity on each subalgebra . Gauge invariance ensures that this remains the case along the RG flow. For the group structure, we similarly define the generators
[TABLE]
A simple consequence of this construction is that the structure constants satisfy . The benefit of this construction is that it allows us to ignore much of the inconvenience of having multiple gauge groups in the theory. We may treat all gauge couplings simultaneously by considering directly.
When including multiple Abelian factors in the gauge group, there is an extra complication in the form of kinetic mixing [43]. Invariance of the Abelian field-strength tensors under gauge transformations implies that terms such as
[TABLE]
for Abelian gauge groups are allowed in the Lagrangian by symmetry. In fact, they are needed to renormalize the theory and are generated by the RG evolution, even if set to vanish at a particular scale. The inclusion of these terms in the general 2-loop -function has been discussed in Refs. [31, 44]. We wish to emphasize that the kinetic mixing of multiple Abelian groups fit into the formalism employed here in a very natural way. The gauge couplings have been relegated to the coupling matrix in the kinetic term, thus kinetic mixing is simply the observation that when several of the product groups are Abelian, gauge invariance allow for non-diagonal terms in between the corresponding groups.
To summarize, we consider the general case where the gauge group is taken to be
[TABLE]
such that the factors are non-Abelian for . The first instances of the adjoint index, , thus label the groups, while takes values in the adjoint indices of the non-Abelian groups. For the coupling matrix in particular we have
[TABLE]
where is a symmetric coupling matrix. The off-diagonal terms in ensure that the renormalized Lagrangian contain all possible kinetic terms allowed by gauge symmetry. The generators of the individual groups are still just the associated charges and require no change.
Altogether, in our notation, the Lagrangian (3.1) becomes
[TABLE]
and the couplings of the theory are . Note that rather than is used for the gauge coupling in the general framework. This is done for two reasons: first, only appear in the Lagrangian; second, for the purpose of including multiple Abelian couplings, it is more convenient to work with for the counter terms, as will pick up off-diagonal elements. We have verified that our formalism is indeed capable of reproducing the previously known 2-loop results for the kinetic mixing of the Abelian factors [31]—for details of the comparison and matching to previously used notation see App. A.4. Later, we will deduce the full 3-loop gauge -function for the general theory (3.1), including kinetic mixing; although the result for a semi-simple gauge group has in fact been calculated (but not published) previously [45], as far as we know, this will be the first time that the 3-loop running of kinetic mixing terms has been presented.
3.2 Perturbative expansion of quantities in Osborn’s equation
Presently we will not concern ourselves with the finer details of perturbative -function computation; on the contrary, we aim to demonstrate (and emphasize) that constraints derived from (1.1) require no knowledge of the exact perturbative results in any scheme. Instead, we shall work with a parametrization of all the TSs that may appear in each -function at a particular loop order, where each possible tensor carries an associated coefficient, the value of which then depends on the chosen renormalization scheme151515This method of parameterizing the -functions is essentially implicit in the classic 1-loop calculations of Cheng et al. [46], and the 2-loop calculations of Machacek and Vaughn [3], and Jack and Osborn [47].. The possible TSs that can appear in each -function are completely determined by the form of the interactions in the Lagrangian density, with the Feynman rules providing an exact correspondence between the Feynman diagrams employed in perturbation theory and the associated tensor-coupling contractions. Gauge invariance of a theory with additional Yukawa and scalar interactions requires that certain non-trivial identities involving tensor couplings and gauge generators must hold, which then lead to relations between certain TSs, reducing the number of unique terms. In certain renormalization schemes such as , the number of contributing TSs will be further reduced on general grounds. We will take advantage of this simplification, but we wish to stress that the derivation of CCs works in arbitrary schemes if one includes the additional TSs not present in .
In dimensional regularization the bare couplings are renormalized by setting
[TABLE]
and are counter terms necessary for removing the overall divergences of 1PI vertex functions. In they are singular in and contain no finite part. The field strength normalizations, and , are needed to renormalize 1PI 2-point functions of the fermion and scalar fields respectively. They are given by and similarly for the scalars.
The -function associated with a coupling is then given by
[TABLE]
with being the dimension of the bare coupling in dimensions. The simple pole of the bare coupling must come from either the vertex counter term or from the wave function renormalization of one of the external fields. Hence, the only TSs that can appear in the -functions are 1PI vertex corrections, and 1PI 2-point structures attached to each leg of a vertex.
The situation for the gauge couplings is a priori a little less straightforward, as multiple interaction vertices involve the gauge field. However, using the background field method, gauge invariance is preserved through renormalization [48, 49], and so only one counter term is needed:
[TABLE]
from which one finds
[TABLE]
where the neglected terms only contribute to higher order poles. The counter term is in turn determined by the divergence of the 2-point function of the background field and is given by
[TABLE]
Thus the only TSs that can appear in are those needed to renormalize the amputated 1PI 2-point function of the gauge field. Whereas one must take ghost fields and gauge fixing into account when doing the perturbative computation, we need not concern ourselves with them here: ghost fields introduce no new couplings, and the -functions are independent of gauge-fixing parameters.
We will also need to understand how the tensor in Osborn’s equation comes about in perturbation theory. At leading order, is related to the counter terms of 2-point functions between marginal operators, [14]. Thus, it contains the insertions of the couplings found in . With the couplings of the general theory, Eq. (3.1), we have
[TABLE]
where the ellipses denote higher-order contributions arising as a consequence of renormalization. Each open index in thus has an insertion of one of the tensors (we resolve the collective indices down to their constituent indices, depending on the coupling in question)
[TABLE]
depending on whether the index corresponds to a gauge, a Yukawa, or a quartic coupling respectively; this also holds at higher orders. Borrowing terminology from Feynman diagrams, the TSs appearing in are then simply all possible 1PI contractions with two such open indices and no tadpole substructures (to be precise, the tensors should be 1PI only after contracting e.g. couplings on the two open indices).
Having specified the form of and , we will need to include the change from to , c.f. Eq. (2.26), albeit is our ultimate interest in the present work. Thus, we need to determine the structure of . Given the continuous symmetry of the scalar and fermion kinetic terms, takes values in the associated Lie algebra, . Since the scalars and fermions are charged under this symmetry, it acts non-trivially on scalar and fermion indices. Separating out the scalar and fermion groups and Lie algebras we may write . The real scalars being in a real representation of implies that can be written in terms of anti-symmetric TSs with two open scalar indices. Similarly, the collective fermion index running over both left- and right-handed fermions transform as if in a real representation and is parameterizable in terms of TSs antisymmetric in two open fermion indices. Consequently, to identify possible contributions to , one needs only consider the two-point TSs at a given loop order, determine which are not symmetric under exchange of indices, and define an associated term as an antisymmetric combination of the tensor and its transpose. As seen in [20] and [5], the first such antisymmetric TSs may appear at 3-loop order, giving non-zero contributions to the 3-loop Yukawa -function; when expanding Osborn’s equation to construct the -function, failing to allow for these shifts leads to an inconsistent system of equations.
For the last component of Osborn’s equation, we can infer the form of directly, since it has no open indices in coupling space. Hence, it must be parameterizable by fully contracted 1PI TSs. There is one more constraint on the TSs that may appear in , following from Osborn’s equation: no TSs that can be split into two disconnected parts by removing any one tensor (coupling or generator) will appear in . Suppose for contradiction that one or more such TS appears in : will contain at least one TS with disconnected parts. It would then follow from Osborn’s equation that the same TS appears in , but both and are connected, and so is their contraction. Hence, does not contain any TSs that can be disconnected by eliminating just one tensor.
The discussion in this section mostly apply to other renormalization schemes, but some modifications may be necessary. In more general renormalization schemes, one is free to define the finite part of each renormalization constant however one wishes, hence there would exist schemes in which the restrictions above no longer apply (although the restriction of contributions to 2-point functions will always hold, as it follows from renormalizability). If such a scheme is chosen, one need only include extra TSs representing any additional non-zero contributions to each -function. In addition, the relation between the -functions in two different schemes may be deduced by calculating the effects of a coupling-redefinition. Given a redefinition , there exist two relations between the original function of couplings and the new function of couplings :
[TABLE]
Expanding by loop order, the change may then be inferred order-by-order, giving
[TABLE]
and so on. may then be parametrized as a sum over a gauge basis of all TSs, both 1PI and 1PR, each with its own coefficient. One will then find that the changes in some of the -function coefficients are zero, establishing scheme-independence; furthermore, one can easily see whether a given scheme requires 1PR contributions, as the necessary choice of will lead to non-zero 1PR coefficients. By extension, if one simply imposes the desired relations between coefficients, it is possible to define a class of -like renormalization schemes in which 1PR contributions are absent161616It turns out that there is a subtlety in these calculations, which only occurs for general theories containing field multiplets. Attempting to ensure that all 1PR terms cancel forces certain relations between the coefficients of a general redefinition , but these relations may leave behind certain contributions, formed by attaching antisymmetric combinations of 1PR 2-point TSs to a tensor coupling, similar to the terms generated by in the -functions. In [21], the coupling redefinitions required to relate to momentum subtraction (MOM) produced non-zero coefficients for these 1PR terms, despite the explicit MOM calculation giving zero as expected. The resolution to this apparent paradox is described in [50], whereby such 1PR terms can be absorbed into a redefinition of the otherwise-arbitrary antisymmetric part of the anomalous dimension matrix for the scalar multiplet.. By calculating the effects of the most general possible coupling redefinition, it can also explicitly be shown that constraints derived using Osborn’s equation are scheme-independent, as all associated variations in the CCs cancel.
4 Procedure for extracting -function constraints
We will now explain the procedures employed in GRAFER to identify unique TSs and to determine the CCs on the coefficients of the -functions.
4.1 Tensor-graph identification
Part of constructing the Weyl CCs comes down to identifying the unique ways of contracting the indices of the couplings. For suitably large combinations of the couplings, it will quickly become unmanageable to manually (or automatically) identify unique combinations using regular dummy-index notation. One way of dealing with this problem is to map the TSs into graphs, in which case identifying identical TSs becomes a problem of identifying graph isomorphisms.
The rules for mapping between TSs and graphs are a further development of the notation developed in Ref. [6]; they essentially come down to associating a graph with the tensor part of a Feynman diagram. The starting point is that the edges of the graph come in three varieties,
[TABLE]
and are to be understood as contractions of field indices of the corresponding kind: gauge (adjoint), fermion, and scalar respectively. Remember here that all internal gauge edges carry the gauge coupling matrix by definition.
There are five kinds of vertices in the graph notation, each corresponding to a coupling or a group generator:
[TABLE]
As discussed previously, the factor may be “commuted through” the triple-gauge vertex. The graph vertices roughly correspond to the interaction vertices of a QFT, except for the interactions between two scalars and two gauge bosons, or between four gauge bosons. The couplings associated to those interactions can be reduced to combinations of the vertices above.
The Feynman rules for the fermions in the Lagrangian (3.1) introduce a in the subspace spanned by and on every fermion propagator and in any gauge-fermion vertex171717The difference between and vanishes in every fermion loop, with the exception of the contributions discussed in Sec. 5.4.. We introduce the notation
[TABLE]
to account for the corresponding chirality flips. To see how this works out on a fermion line, consider the following example:
[TABLE]
In practice, it becomes unnecessary to explicitly worry about including the s on the fermion lines: the effect of them is merely the unique choice that ensures that is always contracted with between two vertices. At the level of the tensors it is sufficient to remember the following rules, reading left to right along a fermion line:
is always followed by a regardless of the number of gauge generators between the two. 2. 2.
All generators following and before are of the kind . 3. 3.
All generators before and following are of the kind .
Any fermion line satisfying these rules makes for a valid tensor. On the other hand, the first rule cannot be satisfied for a fermion loop with an odd number of Yukawa insertions. These are invalid because a fermion loop must have an even number of chirality flips.
As an example of how the graph representation works in practice, take the 1-loop gauge -function. In graph notation this is given by
[TABLE]
This can be compared to the traditional notation for the case of a simple gauge group [51]:
[TABLE]
where is the quadratic Casimir of the adjoint and denotes the trace normalization of the fermion [scalar] representation. The graph notation accurately incorporates the appropriate group invariants for any choice of indices selecting a specific gauge group. The factor of two difference in the coefficient of the second term between our notation and the traditional result is due to our notation automatically including the complex conjugate of every fermion loop. In this simple—but well known—example, our notation unfortunately comes across as slightly cumbersome, which is the price to pay for complete generality. At higher loop orders or in theories with more gauge groups, however, the notation comes into its own. As a more advanced example of the notation, consider the following TS from the 3-loop gauge -function:
[TABLE]
Note the factor coming from the internal gauge line. This line contains contributions from all the gauge groups, and will give a sum of the quadratic Casimirs weighted with the respective gauge couplings.
There is a potential pitfall with the diagram notation, which must be addressed. Since the gauge generators are antisymmetric, there is an ambiguity in which leg is associated with each index. For scalar generators, we will impose the rule that all generators on the same scalar line (beginning and ending at Yukawa or quartic interactions or open indices) have the same orientation, that is, they join in combinations such as . For an even number of generators on a line, as is almost exclusively the case in the TSs we work with, this resolves the ambiguity. In other cases the orientation will have to be specified explicitly. On fermion loops the ambiguity comes down to what way one should go around the loop. This results in a sign ambiguity when there is an odd number of gauge generators on said loop, but that is not the case for any of the TSs in our basis.
A final point regarding the graph notation is that we will use a small blob to denote an open index, , such as in e.g. , with corresponding tensors
[TABLE]
as per Eq. (3.19). To distinguish between different open indices we will use a diamond instead where it is required, though the interpretation wrt. tensors is the same.
At this point, we can finally address the underlying topological reasoning for the 3–2–1 ordering. By construction, , and are sums over two-, three-, and four-point TSs respectively. Similarly then, the open indices in involve two, three, and four internal lines. It then becomes clear that the leading contribution to , , and occur at loop orders , , and respectively. This is basically a consequence of the characteristic formula for graphs: , where is the number of loops, the number of internal lines, and the number of vertices. Since in the loop expansion of Osborn’s equation the number of loops of the -function and must add to that of , it follows that always enter at one loop order less than , which itself is one order less than . Hence the ordering receives contributions from the 3-loop gauge, 2-loop Yukawa and 1-loop quartic -functions, and relates the various coefficients via (1.1). Clearly, this pattern extends to all higher orders where simply relates the -functions at 4-, 3-, and 2-loops.
The graph notation developed in this section is similar to that of Feynman diagrams. This is no accident and the similarities makes it easy to visualize a typical Feynman diagram with a similar contraction of the couplings. However, the graphs are not to be thought of as diagrams, as they contain no information about loops or momentum structure, and several Feynman diagrams can share coupling dependence due to gauge symmetry.
4.2 Gauge basis
In the Lagrangian (3.1), the tensor couplings and are contracted with field multiplets, which have non-trivial gauge transformations. Clearly, if (3.1) is to be gauge-invariant, there must be non-trivial relations involving the tensor couplings and the gauge generators; by demanding invariance under an infinitesimal gauge variation, one finds that
[TABLE]
In addition to these identities, the gauge generators still satisfy the Lie algebra, which in our notation reads the usual way:
[TABLE]
This follows from the structure constants vanishing whenever two of the gauge indices belong to distinct product groups.
The gauge identities in the tensor space introduce redundancy between all the unique graphs that can be constructed. Since Osborn’s equation involves derivatives of the TSs associated with each graph, there will exist degeneracies in the resulting system of equations, unless one attempts to express all TSs in terms of a basis; one must therefore account for this arbitrariness before trying to extract the Weyl CCs. At higher loop orders, removing this redundancy also greatly reduces the number of arbitrary coefficients introduced into the system. For instance, GRAFER produces 2890 unique graphs in the 4-loop order gauge -function, but after accounting for the gauge identities we are left with a basis consisting of just 198 TSs.
To construct a suitable gauge basis of TSs for each -function, we follow the steps outlined here, using the -order contribution to the Yukawa -function, as an example:
Determine all unique -loop graphs with one scalar and two fermion legs (indices) open, removing the isomorphic graphs. 2. 2.
Construct the full set of all gauge relations involving the graphs. 3. 3.
In , only receives contributions from the 1PI leg and 1PI vertex graphs. Starting from this set, use the gauge identities to reduce the set of -loop graphs to a basis. 4. 4.
If possible, choose a basis that maximizes the number of group Casimirs and Dynkin indices.
The last step may seem a bit strange and deserves a comment. In principle, one is free to construct using any of the large number of equivalent gauge bases. However, for practical purposes it is convenient to work with TSs that, in so far as is possible, can be written in terms of group invariants, as explicit -function calculations for gauge theories are usually expressed in terms of such invariants. Such TSs have very useful commutativity properties, and the correspondence between our graph notation and explicit calculations becomes more straightforward.
What we find using this procedure is a basis of TSs that can appear in , plus any scheme related to by a coupling redefinition that preserves the form of the TSs181818If one wishes to work with a completely general scheme, one need only remove the -like restriction in step 3 above and repeat the procedure, taking all possible graphs into account.. Once the Feynman diagrams have been evaluated, it may turn out that some coefficients vanish, as was found in previous general calculations. As there is no a priori way to know if a particular coefficient will be zero, or whether such a zero is (at sufficiently-high order) merely an artefact of using , the tensor should still be included.
4.3 Overall strategy
To extract the CCs from Osborn’s equation, one must first parameterize each of the quantities in terms of a basis of TSs with unknown coefficients. The external indices are then contracted in the combination prescribed by the equation, before the two sides are set equal TS by TS in the basis. The result is a system of equations between all the unknown coefficients.
It is typically convenient to think of Osborn’s equation in differential form,
[TABLE]
In this form the differentiated -function can be constructed by marking couplings in each graph one by one, and tallying up the result. Working to loop order , GRAFER performs the following steps for each side of Osborn’s equation:
For the LHS:
Generate all unique closed graphs at ’th order, using the five kinds of vertices of Eq. (4.2). Each graph corresponds to a TS. 2. 2.
Use the gauge identities to determine all relations between the TSs. Eliminate tensors using linear dependence to achieve a basis. These are the unique TSs parameterizing the -function, and each is given an unknown coefficient. 3. 3.
The differential operator is applied to the -function, acting on each TS. In each TS new TSs are made by marking a Yukawa or quartic vertex or gauge edge signifying a derivative, and equivalent TSs are gathered and matched to a new basis using gauge identities (counting multiplicity). The unique, marked TSs are all the terms that can show on the LHS.
For the RHS:
For each coupling, the TSs showing up in the -function are determined from the unique closed contractions with a corresponding coupling removed, viz. removing a vertex in the case of Yukawa or quartic -functions, or breaking a gauge line for the gauge -function. 2. 2.
All TSs that do not correspond to 1PI vertex or field-strength graphs are eliminated, as they do not appear in -like schemes. The remaining TSs are thinned to a basis using gauge identities and each is given a coefficient191919The diagrams chosen for the basis itself are not necessarily 1PI.. At sufficiently-high loop order, the shift is taken into account by identifying any possible antisymmetric combinations of -point TSs, attaching to a leg, and assigning a new arbitrary coefficient. 3. 3.
The TSs in the tensor are constructed from all unique closed contractions appearing in the -function at order . In each structure two couplings are marked, one as a derivative and the other as a point where a -function can be inserted. Each unique TS is given a coefficient. 4. 4.
The -loop -function tensors are inserted into the -loop tensors at the relevant positions, such that the total loop order is . The resulting TSs are then matched onto the same basis as the LHS.
Once both sides of Osborn’s equation are constructed and written in terms of the same gauge basis, equality must apply for the coefficients of each TS individually. In the resulting set of equations, all the unknown coefficients from and are eliminated. This leaves a system of equations involving only the coefficients of the -functions, which we refer to as the CCs.
Before proceeding to our analysis we would like to make a brief comment on the inclusion of multiple Abelian groups in the computations. Whereas the -function TSs automatically include the all contributions from multiple Abelian factors once they are set up to work with a semi-simple gauge group, this is not the case for . The reason for this being that in the semi-simple case all 2-point subtensors on a gauge line are diagonal in the gauge indices. Thus they commute with e.g. or gauge -functions that could be contracted into the open indices. This is not the case with multiple Abelian groups where matter loops, e.g. fermion-loops like , do not have to be diagonal. If one were to perform the analysis with multiple Abelian factors, it would therefore require more TSs in the basis of and in the final basis for Osborn’s equation. Whereas the former introduce more unknowns into the system, the latter provides additional constraints as equality must hold for each TS in the basis. What we found was that the inclusion of multiple Abelian factors in the analysis did not contribute extra CCs, nor did it produce fewer (which would have been a red flag, as all the semi-simple information is included in the analysis with multiple Abelian factors). Since no extra information was gained by the extra complication, we present the analysis of the next section in the case with at most one Abelian factor. We stress that the CCs are nevertheless valid for -functions of theories with multiple such factors.
5 Weyl consistency conditions
We have now—finally—arrived at the point where we can present the constraints on the -functions. In this section we will discuss the CCs order by order in the loop expansion and discuss new complications as they occur.
5.1 Tensor basis for the -functions
Before discussing the CCs we will begin by expanding the -functions by loop order, as
[TABLE]
The sum over permutations are over the 2 permutations of the gauge indices in , the 2 permutations of the fermion indices in , and the 24 permutations of the scalar indices in . Symmetrizing the -functions in this way ensures that they respect the same symmetries as the couplings themselves; in the case of the gauge -function, symmetrization is only required in the event of multiple Abelian factors. The symmetrization is performed at the level of the -functions rather than for individual TSs, to keep notation to a minimum beyond this point.
At each loop order is in turn parametrized in terms of a basis of TSs, so we write
[TABLE]
where the blobs denote TSs with the appropriate external indices and the sums run over the full basis at each loop order. The gauge -function stands out, as we have factored out two common gauge-coupling matrices. In this way the TSs in correspond to 2-point structures of the background field, c.f. Eq. (3.17).
We can now present a basis of TSs for the general -functions, or indeed any scheme in which 1PR contributions do not appear. Our choice of basis defines the coefficients that will eventually be constrained by the CCs. The gauge basis we have used for the -function is chosen such that there are as few gauge generators as possible that appear outside (possibly nested) 2-point subgraphs of group invariants. Here we have presented the TSs by mapping them onto graphs using the notation of Sec. 4.1, as we believe this to be the most intuitive way of representing them, in the same manner that Feynman diagrams are usually the easiest way of thinking about loop amplitudes. If the reader is unsure as to the interpretation of the graphs we refer to App. A.1, where TSs are written out explicitly.
The basis of TSs for the general gauge -function, up to three loops, is shown in Figs. 1 and 2. The 1-loop basis consists of the three well-known contributions from fermion, scalar, and gauge/ghost loops respectively. At 2-loop order there are another 7 TSs, whereas typical result quote just 6 contributions [2, 3, 4, 51]. The discrepancy is due to , which can only be determined from computation of the contributing Feynman diagrams. At 3-loop order we find another 33 TSs in . In the limiting case where the gauge group is simple, the tensors with coefficients and coincide, as do those with coefficients and (a point which we will elaborate further on in Sec. 5.3), reducing our basis to the 31 TSs of Pickering et al. [32]. For the general Yukawa -function, the Tensor basis include 5 and 33 TSs for 1- and 2-loop orders respectively, shown in Figs. 3 and 4. This basis is more extensive than the one of Luo et al. [51], as 6 of the coefficients vanish, c.f. appendix A.2. In Fig. 4 we also include the 5 TSs parameterizing the general 1-loop quartic -function; Figs. 1-4 therefore comprise the full list of TSs contributing to in Osborn’s equation, up to .
As has been mentioned, all of the coefficients multiplying these TSs have been computed previously, though the determination of is unpublished. We list the coefficients in appendix A.2; in section 5.3, we detail how Weyl CCs augment the results of [32] to fix the remaining four coefficients. Since the 2-loop quartic coefficients are also known, we have included the results for , along with explicit representations of our basis of TSs. Appendices A.1 and A.2 therefore contains the full -functions for a general QFT based on a compact gauge group up to order 3–2–2.
5.2 Consistency condition at order 2–1–0
Now, using tensor notation, we shall begin using Osborn’s equation to parametrize loop order by loop order. The first-order calculation of is precisely the trace anomaly, thus contributions to are simply in one-to-one correspondence with the field content of the theory. These terms are effectively the constant of integration left undetermined by (1.1), and hence are not related to the -functions.
The first order at which is related to the -functions is order 1–0–0, corresponding to the 2-loop -function. contains 4 fully contracted TSs and thus 4 unknown coefficients shown in Fig. 5. In addition there is a single coefficient in and 3 in . Osborn’s equation reduces down to just 4 equations, and it would be impossible to extract information on without knowing terms in and/or (though one finds that ). This is in some sense due to the TSs of being so simple that there are the same number of TSs in the basis before and after differentiation wrt. the couplings.
The first non-trivial CC in a general theory occurs at loop order 2–1–0, or equivalently , and was in fact missed by Jack and Poole [6], as they substituted in all (scheme-independent) 1-loop coefficients. The main difference in our approach is that we have generalized the notation to allow for any compact gauge group. Nevertheless, at this order, the structure of the calculation is identical, hence we do not obtain any additional CCs beyond this one.
At order 2–1–0 there are 14 new TSs in the -function, illustrated in Fig. 5, and 4 new TSs in . The non-trivial constraints on the -function coefficients stem from the terms in the -function with coefficients and . When differentiating the corresponding TSs wrt. the couplings to obtain the LHS of Osborn’s equation, the derivative can act either on a gauge or a Yukawa coupling, giving rise to two distinct TSs. The part of Osborn’s equation involving these terms read
[TABLE]
The leading-order coefficients of —in the sense that there are no contributions involving the respective coupling at lower order—were calculated in Ref. [14], and are given by
[TABLE]
The knowledge that is sufficient to eliminate the coefficients from the resulting system of equations, leaving the CC
[TABLE]
This condition is indeed satisfied, as it is well known that . No other CCs occur at this order, since no other term in the -function gives multiple unique TSs under coupling differentiation. By considering the contributions to listed in Fig. 6, our graphical notation should also make it obvious that symmetry of at this order is in fact trivial, and hence imposes no extra CCs.
It is worth noting that this CC can be used in another way: if we did not know , the same set of equations would imply that
[TABLE]
assuming that at least one of the ratios were well-defined. Since we know that
[TABLE]
Eq. (1.1) would automatically imply the ratio
[TABLE]
between the leading-order coefficients, revealing a hitherto-unknown restriction on . While the impact of this relation is again somewhat diminished by the explicit calculations, we hope it is clear that similar relations will exist at higher orders, where the corresponding coefficients are not known.
At the next loop-order we find that not only does the number of CCs increase dramatically, but so too does the ratio between the number of CCs and the number of -function coefficients, making the whole endeavour much more worthwhile.
5.3 Consistency conditions at order 3–2–1
Proceeding to Osborn’s equation at order 3–2–1, or , the list of possible TSs starts becoming overwhelming: the number of -function tensors alone rises from 14 to 49 (down from 147 before eliminating redundant TSs with gauge identities), and most receive multiple contributions from the various -functions and tensors at each loop order. All of these are shown in App. A.3, as we will not make any further use of them here. At this order we also introduce 71 new -function coefficients. The upshot of the increased complexity is that we obtain 26 new CCs, compared to the single CC at order 2–1–0.
At this loop-order, once one starts eliminating the coefficients of and , the CCs in full generality start becoming rather involved. To keep things tractable and (almost) linear, we input the known coefficients at order 2–1–0202020Recall that the gauge -function is scheme-independent to two loops, and the Yukawa -function to one loop, thus these conditions are still fully general, at least with respect to schemes preserving the structure of the -function.. Expressed this way, the CCs reflect only the new information gained at the highest loop order. We find
[TABLE]
As remarked previously, the construction at order 3–2–1 has been done for a single gauge group by Jack and Poole [6]. Our construction for a completely general theory has produced an additional 7 CCs, including some relations between tensors with different group Casimirs and Dynkin indices, which was not resolved in that work. Since our general construction must reduce to the special case, we would expect that the conditions obtained by Jack and Poole can be expressed as linear combinations of our new conditions, and we have indeed verified that this is the case (see App. A.5 for further details).
We may now demonstrate the power of these CCs, by showing how one can obtain the complete 3-loop gauge -function for a general theory. In the case where the gauge group is simple, there is a one-to-one correspondence between 29 of the TSs in our expression for and the TSs described in Ref. [32]. As mentioned in the previous subsection, the last 4 of our TSs, with , reduce to the two remaining TSs of Pickering et al., as the TSs coincide in this simple case. Direct comparison with their result then translates into the requirements that
[TABLE]
Meanwhile, utilizing the known results for all the remaining coefficients at order 3–2–1, the CCs translate into just the two conditions (from Eqs. (5.9b) and (5.9g))
[TABLE]
These four constraints are sufficient to uniquely determine the four coefficients, which are found to be
[TABLE]
With this result, the full gauge -function is known to 3-loop order (c.f. the appendix). It is thus possible to verify that with our result, the general -function can reproduce the explicit 3-loop computation for the SM212121Incidentally, it was pointed out to us by Esben Mølgaard [45] that comparison with the SM provides an alternative route to extract the semi-simple result, which in turn is structurally identical to the compact gauge group result (at least in our notation). [30, 52].
Seeing as the difficulty of the perturbative computation increase rapidly with increasing loop order, an interesting prospect is seeing how much information can be extracted for the coefficients given only the coefficients of , , and . Between the first 16 coefficients we are left with just 3 CCs:
[TABLE]
These 16 coefficients are those that determine in a general gauge theory without Yukawa or quartic interactions. By itself, (5.13) is insufficient to make headway in any computation, and would have to serve as a cross check for a regular perturbative calculation. This is not a surprising result; knowing the Yukawa and quartic -functions does not provide much information about the -function of a purely gauge theory. However, the picture changes dramatically once we consider the remaining 17 coefficients of , all of which involve Yukawa or quartic couplings. We see that the CCs with the known results for and predict the remaining coefficients up to an ambiguity involving just two parameters:
[TABLE]
Amongst these, stands out like a sore thumb, but it is predicted in terms of the purely gauge terms. What all of this shows is that Weyl CCs may be used to postdict the 3-loop -function in a general theory, if one knows the following:
- i)
The coefficients of , , and
- ii)
The part of that does not involve Yukawa or quartic interactions
- iii)
The coefficient of two of the remaining tensors, e.g. and .
Having achieved such a large amount of additional information on the 3-loop gauge -function, our hope is that constructing the -function at the next order and repeating the approach will be sufficient to infer the 4-loop gauge -function. As shown by the above analysis, this would almost certainly require the full 3-loop Yukawa -function, and even then the purely-gauge terms are most likely under-determined. However, various purely-gauge calculations (such as Yang-Mills with matter) have already been done at 4 loops [53], so it is plausible that the general 4-loop result can be almost completely determined by this approach.
Finally, we shall comment on an augmentation of Eq. (5.8) that follows from a similar subset of equations involving . If one considers the -function tensors with coefficients and (see App. A.3), one finds
[TABLE]
implying that
[TABLE]
As with (5.8), this can be used in two ways: eliminating the coefficients gives a linear combination of CCs (5.9y) and (5.9z); alternatively, using the known -function coefficients produces another (scheme-independent) ratio between and . Consequently, combining this result with (5.8), we find that all three leading-order contributions to for a general theory must be proportional:
[TABLE]
This relation is indeed satisfied by the known results, c.f. Eq. (5.4). Again, despite being somewhat redundant due to the existence of explicit calculations, Eq. (5.17) is a powerful, hitherto-unknown restriction on the allowed RG flow of a completely general QFT: the mere existence of Osborn’s equation automatically implies that leading-order positive-definiteness of (and thus a perturbative proof of the -theorem) actually follows from leading-order positivity of for a pure-gauge (), pure-Yukawa (), or pure-scalar () theory.
At this order, we have not yet explicitly mentioned the conjecture regarding symmetry of , nor any possible contributions to . The new TSs appearing in at this order are all given in App A.3—it is again clear that all diagonal contributions are manifestly symmetric, thus we need only investigate the new mixed terms. Our conjecture requires that we deliberately set
[TABLE]
then re-derive the CCs. As it happens, no additional CCs are obtained, and so imposing symmetry of is still trivial, at least in this sense. This clarifies earlier assertions made in two papers: in [5], the authors speculated on the potential for imposing such symmetry, concluding that “this need not be true in general renormalization schemes”; in [6], the authors mistakenly specify values for two coefficients, as a consequence of using explicit coefficient for the rest of the computation.
5.4 Treatment of
So far we have ignored possible contributions to the -functions from Feynman diagrams with insertions. To obtain the full -function at order 4–3–2 one must include such contributions, and their corresponding TSs. In dimensional regularization the treatment of , or equivalently , is ambiguous222222This ambiguity has been known since the original ’t Hooft–Veltman paper [54]. A full analysis of the difficulties encountered when attempting to treat is given in [55]., leading to an undetermined parameter in the 4-loop -function of the SM strong coupling [8, 9]. This ambiguity is not present in the 3-loop Yukawa -function, where a “semi-naïve” treatment of is sufficient [56, 57, 52]. As we previously pointed out [10], the use of Weyl CCs relates the coefficients of the ambiguous terms in the 4-loop gauge -function to those of the unambiguous (and known) terms in the Yukawa -function, uniquely settling the treatment of in the corresponding 4-loop diagrams.
Contributions from Feynman diagrams with non-vanishing contributions are treated separately in our framework, because contributions in a fermion loop give opposite signs to each fermion chirality. TSs constructed just from and give same-sign contributions to the two chiralities by construction. Thus, we include on fermion lines at the level of TSs to account for the sign difference. The introduction of on the fermion lines introduces the need for a specific interpretation of the use of tensors with or without tilde, see Eq. (4.3). We use the graph notation
[TABLE]
that is, a blob on a Yukawa vertex corresponds to a on the fermion line. One should use if the Yukawa is taken to be and if the Yukawa is taken to be . In this way we preserve the symmetry of the fermion lines under replacements
[TABLE]
We should not blindly go ahead and try to build all possible TSs with on the fermion lines, as there are only a few select Feynman diagrams that can give non-vanishing contributions. For a fermion loop to give rise to an contribution in the Feynman diagram there needs to be at least four independent Lorentz indices on the loop, which can either come from momenta or from Lorentz indices on the gauge lines. Denoting by a fermion line with gauge insertions and Yukawa insertions, then for a line to contribute a Levi-Cevita symbol it must be of type at least , , , or . Furthermore, the Feynman diagrams must have at least two distinct fermion lines contributing an for the contribution to be non-vanishing, so there must be two such lines (recall that loops must have an even number of Yukawa couplings). First we note that lines of type and cannot appear at the present loop-order. Additionally, a Feynman diagram with a line of type leads to a contribution containing
[TABLE]
Such terms vanish in sound gauge theories that are anomaly free; a similar argument eliminates diagrams with a line. This leaves only TSs from contributions stemming from diagrams with 2 fermion lines of type or higher.
From our considerations we arrive at just 4 kinds of Feynman diagrams with insertions that can contribute to , and 5 that can contribute to . In neither case do gauge identities lead to a redundancy between the TSs resulting frome these diagrams. Though it is possible to rewrite the basis of -relevant TSs to one with more simple sub-tensors (such as 2-point functions and Casimirs), we have deviated slightly from our usual modus operandi here, and have kept the basis directly corresponding to the relevant Feynman diagrams232323A on a fermion line does not influence the gauge identities, as .. In this basis, the correspondence between each Feynman diagram and its TS becomes much more explicit. The extra contributions to the -functions are shown in Fig. 7.
Since our chosen basis for terms involves only primitive graphs, we may use the topological argument indicated in [10], which guarantees that the only relevant contributions to Osborn’s equation come from inserting these TSs into the leading-order . Alternatively one can argue that no terms can enter in the RHS of the Osborn’s equation without it either entering in in or . No terms at present order can include contributions, so we have identified all relevant terms below. Thus, the –relevant TSs with on the fermion lines separate neatly into their own subsystem of Osborn’s equation. There are four new terms in the -function, shown in Fig. 7. Plugging the new -function and -function terms into Osborn’s equation yields 9 equations, corresponding to the 9 distinct derivatives of :
[TABLE]
Eliminating the coefficients from the system, yields
[TABLE]
and since the ratio between and is fixed by (5.17), we find242424Through direct comparison with the SM results of [52], we extracted explicit values of the Yukawa coefficients:
[TABLE]
These conditions are what allowed us to uniquely determine the –relevant terms in in Ref. [10].
5.5 Consistency conditions at order 4–3–2
We will now move on to discuss, for the first time, the Weyl Consistency Conditions for the -functions at loop order 4–3–2, or . A summary of our results are shown in Table 1.
We have derived the CCs at order 4–3–2 using the GRAFER code that we have developed for this purpose. This naturally begs the question of validation of the program to ensure the validity of the results. As described in the previous section, we have used the GRAFER routines to derive the CCs at order 3–2–1, where all -function coefficients are known. In that case all the conditions are fulfilled, providing a strong check. As a further point regarding validation, we will point to the fact that all the CCs we have recovered at order 4–3–2 are internally consistent. This would not a priori be the case if there were to be any errors in the program implementation. With the large number of overlapping equations, this is not a trivial check.
Order 4–3–2 is the first occurrence of 2-point TSs for scalar and fermion lines that are not inherently symmetric. There are a total of 6 such structures with fermion indices, , and 3 with scalar indices . As discussed in Sec. 2.1, these must be included in , otherwise the system of equations will be inconsistent. These extra terms are included in the 3-loop Yukawa -function, by setting
[TABLE]
No changes are needed for the quartic -function, as this only enters at two loops. The tensor is parametrized by
[TABLE]
and the individual TSs are given in Fig. 8. Eliminating from Osborn’s equation predict all the and in terms of the standard -function coefficients. Between the remaining four terms, , there is one undetermined parameter252525Explicit results for the and coefficients are included in the ancillary file..
As the reader may already have seen in Table 1, the number of TSs at order 4–3–2 increase dramatically compared to previous orders, as does the number of CCs that can be inferred from Osborn’s equation. For this reason we find ourselves unable to present the full result in the present paper. For interested parties we have instead included our findings in an ancillary file to the arXiv version of this paper. This includes a full list of diagrams for the TSs appearing at 4–3–2 and a comprehensive list of all the CCs; here, we will content ourselves with presenting a few highlights. Once again we substitute the known values of the -function coefficients at order 3–2–1 into the system of equations, before eliminating the coefficients of and the -function. Without this step, even Mathematica struggles with what would be a system of 814 non-linear equations.
Four of the CCs involve only the coefficients, which are known, and read
[TABLE]
We see that they are indeed satisfied by the coefficients of App. A.2, providing one more check of the implementation of the routines in GRAFER. Turning now to the unknown coefficients, it turns out that 4 of the 3-loop Yukawa coefficients are predicted outright:
[TABLE]
If we take the additional step of using the long-known results for the 2-loop quartic -function, we get an additional 10 predicted coefficients, including 4-loop gauge coefficients:
[TABLE]
These provide some easy predictions right out of the box, but by themselves the 14 now-known coefficients are too few to do much more than check eventual direct loop computations. From a more practical point of view there are 133 CCs involving only the coefficients (after substituting in the known ). Conceivably this might be enough to extend the explicit 3-loop Yukawa computations for the SM and 2HDM [30, 52] to the general case. Just as the 2-loop Yukawa -function was computed before the 3-loop gauge -function, it stands to reason that it is easier to obtain the 3-loop Yukawa than the 4-loop gauge; once the 3-loop Yukawa is known, we may substitute the coefficients into the CCs and obtain 128 CCs for the 4-loop gauge -function coefficients. Given our success at three loops, we therefore expect that these CCs will prove a valuable asset to the general 4-loop computation. Indeed, as discussed in the last section, the corresponding CCs for TSs from Feynman diagrams with non-trivial contributions from are enough to resolve the corresponding ambiguity in the computation of the 4-loop gauge -function.
We now return to the conjecture that can be symmetrized (see Sec. 2.3). Imposing symmetry on up to terms relevant for , we have re-derived the Weyl CCs. Whereas symmetry of did not provide any new CCs at order 3–2–1, it turns out that this is not the case at 4–3–2: we find an additional 10 new CCs that appear only after enforcing symmetry. Of particular interest is that these extra conditions are enough to give solutions for 3 of the coefficients in . We find
[TABLE]
whereas without symmetrization there was one unknown parameter parameterizing all three coefficients. Since only the 2-loop quartic -function is known to this order, these are the only predictions we can check. Indeed we find agreement between the predictions found with the symmetrized , and the result (see App. A.2), providing a hint that can be made symmetric.
6 Conclusions and outlook
In this paper, we have attempted to provide a comprehensive analysis of the constraints imposed by Weyl Consistency Conditions on the -functions of a completely general, renormalizable, four-dimensional QFT. These constraints are a direct consequence of Osborn’s equation: their existence was noted as early as [13, 14], yet the first investigations of these constraints for their own sake did not occur for over two decades [5, 7]. Furthermore, despite steady increases in generality (in particular, higher loops and QFTs in other spacetime dimensions), a full derivation of the constraints on QFTs of phenomenological relevance had still not been done. Quite frankly, we find this astonishing, as the work done in non-phenomenological cases has led to multiple, non-trivial implications262626This list is not exhaustive—as mentioned earlier, it has also been shown that CCs imply the existence of 1PR contributions to the -functions in commonly-used non-minimal schemes such as momentum subtraction, despite explicit calculations giving zero for the coefficients of such terms. The resolution of this paradox relies on absorbing such terms into the antisymmetric part of the anomalous dimension [50], but the effect of this on Osborn’s equation (and in particular the antisymmetric tensor ) is still unknown.:
- •
Due to the CCs relating -function coefficients at different loop-orders, i.e. the 3–2–1 ordering, one can use lower-order coefficients to predict higher-order coefficients;
- •
Using coupling-redefinitions to parametrize perturbative renormalization schemes, the scheme-independence of all CCs can be directly shown, hence they form a rare example of physical information along RG flows;
- •
The antisymmetric tensor , parameterizing the extension of Osborn’s equation in the presence of relevant operators, can in fact be related to terms in the standard -functions using CCs;
- •
By parameterizing the contribution of an unknown Feynman integral to the -functions, one can use CCs to determine its value, assuming that sufficiently-many other coefficients are already known.
The first and last implications are of most obvious interest to phenomenologists, as they provide a potential alternative to directly computing tens of millions of Feynman diagrams, or master integrals of ever-increasing complexity.
A derivation of the CCs for a four-dimensional QFT with a simple gauge group was pursued in [6], albeit only up to order 3–2–1, and the relations were verified in using known -function coefficients. Consequently, the only requirement to extend this analysis to a general gauge theory was a suitable notation for multiple gauge groups, in line with the tensor-coupling notation used to represent the Yukawa and quartic interactions. We have developed such a notation (both tensorial and graphical), which treats the general, unified gauge -function as a rank-two tensors in coupling-space, with the corresponding gauge couplings, group Casimirs and Dynkin indices arranged into a (block-)diagonal form. With identities from gauge invariance (and, of course, the Jacobi identity for gauge theories), the possible tensors that depend on the gauge coupling in each -function are reduced to a basis, and we have attempted to further simplify the tensors by maximizing the number of Casimirs and Dynkin indices, in line with the preferences of the community. Essentially, our notation is a formalized and generally applicable version of the substitution rules described by Machacek and Vaughn [2], capable of including multiple groups via a coupling-matrix. That being said, the notation is highly efficient, and indeed allows us to derive CCs for a completely general four-dimensional QFT in direct analogy with the Yukawa and quartic notation employed in [6]; furthermore, the 3–2–1 ordering is manifestly a simple topological consequence of attempting to form scalars of a given loop order out of two-, three-, and four-index tensors, and is thus trivially preserved at higher loops.
Having established the notation, we developed our custom code, GRAFER, in order to automate the derivation of CCs. Beginning at order 2–1–0, we have shown how the results for a single gauge group extend trivially to the general case with a compact gauge group, and derived two results that have so far been missed in previous analyses: a CC involving 2-loop gauge and 1-loop Yukawa coefficients, where the calculated values happen to be 0; and a simple linear relation between leading-order coefficients, which is indeed satisfied by the known values [14]. Next, we generated the results at order 3–2–1, again extending the results to arbitrary gauge groups and resolving the group structures. As a cross-check of GRAFER’s output, we have applied various checks:
- •
The CCs obtained are self consistent;
- •
The conditions are satisfied by all known coefficients for a general theory;
- •
The 19 old CCs for a theory with a simple gauge group are linear combinations of our 26 new, more general CCs for a theory with a compact gauge group.
Armed with this, we have shown how the 3-loop gauge -function for a single gauge group can be matched to our general notation, up to two free parameters, and how this algebraic subsystem is then uniquely solved by the inclusion of our CCs, thereby deriving the complete 3-loop gauge -function for an arbitrary, renormalizable, four-dimensional QFT. As referenced in section 5, this result has in fact been calculated previously via other methods [45]; nevertheless, our method provides the first novel application of Weyl CCs as a tool for deriving general results from a known special case, demonstrating its viability as a technique at higher loop orders. To our knowledge this also provides the first 3-loop result for the kinetic mixing of the Abelian group factors. We have also demonstrated that the linear relation between coefficients is extended to include the quartic term, thus all three leading-order coefficients appear in a fixed ratio. Again, while this is somewhat irrelevant in light of the explicit calculations, it is interesting that the leading-order positive-definiteness of Osborn’s -function for a completely general theory actually follows from its own self-consistency, plus positive-definiteness of a pure-gauge, pure-Yukawa, or pure-quartic theory.
Finally, we have used GRAFER to generate the CCs at order 4–3–2, extending the previous forefront of -function constraints, and setting up a framework with which one can eventually predict much of the 4-loop gauge -function for a completely general, renormalizable, four-dimensional QFT. Were one to reconstruct the general 3-loop Yukawa -function, and augment the total system with a subset of the 4-loop gauge calculations, it should be possible to construct the general 4-loop gauge -function. The complete -functions for all three SM gauge couplings in the unbroken phase, including all matter content, could then be extracted by simply substituting in the SM coupling matrices and generators for the unbroken phase. Given that the current 4-loop SM calculations [8, 9] have been done only for the strong-coupling, incorporating just the top-Yukawa coupling and the Higgs self-coupling, this would be a monumental advance. Additionally, we have drawn attention to a particularly simple subset of CCs that arises at this order and resolves the explicitly-demonstrated ambiguity related to the treatment of in the 4-loop strong-coupling -function by relating it to unambiguous 3-loop Yukawa coefficients; in our previous paper [10], we fixed this value by simply reconstructing the general coefficients for the relevant terms in , demonstrating that the RG information contained within these CCs may eventually lead to an unambiguous treatment of in dimensionally-regularized chiral gauge theories.
We have also reached the point at which we may again test various non-trivial modifications to Osborn’s equation, namely the -tensor parameterizing the shift , and our conjectured possibility of imposing symmetry on . By rearranging the 4–3–2 CCs, we have derived explicit expressions for all but one of the nine possible coefficients in a general theory; unfortunately, the undetermined coefficients all contribute to the fermionic shift , for which we cannot find any explicit calculation in current literature that would allow us to fix the final coefficient. Imposing symmetry of at order 4–3–2 leads to ten additional CCs, which we have extracted by simple comparison with the CCs when symmetry is not imposed. Our intent was to isolate easily-identifiable criteria for which our conjecture could be falsified. Of the ten conditions, the exact predictions272727Of course, these are actually post-dictions, but as with the derivation of a ratio between leading-order coefficients, the method is such that these exact values arise without any reference to explicit calculations, and clearly demonstrate the potential for similar results at higher orders. listed in (5.30) are indeed the correct values, providing some corroboration of the conjecture.
Presenting explicit results in this paper has been challenging, mostly due to the sheer quantity of explicit tensors and CCs as one goes to higher orders, as demonstrated in Table 1. We have opted to sacrifice the scheme-independence of written results at 3–2–1 and above, in favour of using lower-order coefficients to present relations valid in only; the full list of -function tensors, -function parametrizations, contributions, and CCs are given in the ancillary file. Consequently, we feel that we should again emphasize the scheme-independent nature of -function constraints derived from Weyl CCs: all one needs to do is refrain from inserting the lower-order coefficients and include possible 1PR contributions to the -functions. Previous general investigations into -function constraints have maintained this scheme-independent approach; the diagrammatic representation for the -functions is well-suited to deriving the effects of a scheme change on the general coefficients, allowing one to directly verify scheme-independence of the constraints282828One can also easily reproduce well-known results regarding scheme-independence of the -functions, namely that the quartic and Yukawa -functions are scheme-independent at one loop, while the gauge -function is scheme-independent at one and two loops.. Once certain coefficients are known to be scheme-independent, one can include their explicit values without sacrificing overall scheme-independence of the system, while still leading to simplified CCs; this argument also guarantees that the fixed ratio between leading-order coefficients is a completely general result.
In principle, now that we have an adequate diagrammatic representation of the general gauge -function, our procedure is capable of extracting the CCs for an arbitrary renormalizable four-dimensional QFT at any order. If one requests a parametrization of Osborn’s -function at some loop order , GRAFER derives the complete set of CCs at order ()–()–(), with the only limit being available computing power; our hope is that the CCs will therefore play a rôle in future -function calculations, whenever attempts at such calculations are eventually made292929The 3–2–1 ordering refers only to the maximum order of the -functions appearing in the CCs. It is worth emphasizing again that the utility of our procedure is not restricted to these maximal loop orders: there exist relations not only between the different -functions, but between different orders of each individual -function..
While this work might seem to suggest an end to any further theoretical work on the CCs themselves (beyond seeking results for specific higher-loop diagrams, such as those containing ambiguities), this paper only covers the case of four-dimensional theories. As mentioned in Section 2.1, equivalent versions of Osborn’s equation have been argued to hold in any even-dimensional spacetime [34]. To obtain the equivalent CCs, we need only change the marginal couplings; for example, a general scalar theory in six dimensions has a marginal interaction , hence one can represent the tensor coupling as a three-point vertex with identical lines, and the analogous constructions of , and follow as in four dimensions [21]. Because of this, all consequences of the four-dimensional version immediately carry over to other dimensions: one can again predict higher-order -function coefficients and evaluate undetermined Feynman integrals. Furthermore, a series of exploratory calculations was performed for general 3-dimensional theories [58, 22], where there is no proven analogue of Osborn’s equation, yet the associated -functions still appear to satisfy an equation of the same form for some as-yet-unknown odd-dimensional function 303030Conjectures have been advanced that the three-dimensional function being constructed is in fact the -function [59, 60], which at RG fixed points reduces to the finite part of , the free-energy of a CFT compactified on a 3-sphere. At leading-order, the -function indeed appears to have the same properties as the -function.. The associated marginal interaction term for a three-dimensional scalar-fermion theory is (using both real scalars and fermions), thus all one needs to do is define a graphical representation for the marginal couplings and . Attempts to include gauge interactions have also been made, successfully extending the leading-order construction to three-dimensional theories with a single gauge group, in exact analogy with a four-dimensional theory. In all cases, the procedure is the same, and one can further test our conjecture that symmetry of the analogous may always be imposed.
Another topic that has not been considered is supersymmetry. Compared to general theories, our understanding of in supersymmetric theories is considerably more advanced, ranging from an almost immediate proof of the weak -theorem using -maximization [61]313131The full proof required an extension of -maximization [62, 63], which holds away from RG fixed points., to a proposed non-perturbative expression for the -function [64] that satisfies the strong -theorem subject to a constraint on the chiral superfield anomalous dimension [5, 6]. From our perspective, supersymmetric QFTs are nothing more than special cases of the general four-dimensional QFT, with supersymmetry determined by specific relations between fields, charges, and couplings, and supersymmetry determined by further relations. Consequently, one can test the exact expression at some order of perturbation theory by defining the required relations as in Section 5 of [6], suitably extended to multiple gauge groups, and substituting into the solution provided by GRAFER. While there are technicalities to this procedure, such as having to evaluate the non-supersymmetric -functions using dimensional reduction rather than the more conventional dimensional regularization323232A full discussion of the challenges faced when using dimensional reduction in non-supersymmetric theories, plus its relation to the standard dimensional regularization procedure via a coupling-redefinition, can be found in [65, 66]., the method has been shown to work up to for theories with a single gauge group [6]. Conversely, supersymmetric reductions augment the use of Weyl CCs to reconstruct the -functions of a general theory, since there exist non-renormalization theorems limiting the form of supersymmetric -functions, and even exact results for certain schemes (such as the NSVZ -function [67, 68, 69, 70]), which place similar restrictions on the -function coefficients.
We appreciate that our paper is rather long, so if you’ve skipped to the end looking for a quick summary of the results, think of Grothendieck’s nut-cracking analogy for the two styles of mathematics [71]. Direct -function calculations are the “hammer and chisel” approach—you will get the answer for your particular theory, but it might be a colossal pain to actually set up the calculation, or take too long to obtain the results with the available computing resources. Weyl CCs are more of a “soak it in water” approach333333Grothendieck also came up with a “rising sea” analogy for this latter option, which is delightfully apocalyptic, and adds a much-needed artistic touch to our mundane linear-algebraic labors: given an island surrounded by water, a slowly rising sea appears to have no effect on its structural soundness… until the island is eventually submerged and dismantled by the tides.—eventually, through sheer overwhelming consistency, the -functions will be solved in general, and you can simply extract the results for your special case.
Acknowledgments
We would like to thank Esben Mølgaard for pointing out to us that the 3-loop semi-simple gauge -function can be extracted from [32] by comparison with the known SM results, and for independently cross-checking the calculation. We would also like to thank Florian Herren for helping us understand the nature of contributions in general theories, allowing us to clarify our previous work. CP would like to thank Ian Jack for his careful reading of the manuscript and invaluable feedback on our summary of the LRG, and Hugh Osborn for helpful comments. AET would like to thank Fermilab for hosting him during part of this work, and gratefully acknowledges financial support from the Danish Ministry of Higher Education and Science through an EliteForsk Travel Grant. This work is partially supported by the Danish National Research Foundation grant DNRF90. All diagrams have been drawn using TikZ-Feynman [72].
Appendix A Appendices
These appendices contain the full parametrization of the -function for the couplings of a completely general theory, with Lagrangian density (3.1), up to order 3–2–2. We include the full TS basis for and . Finally we provide a comparison of our results for kinetic mixing with those of [31], and of our 3–2–1 CCs with those derived in [6].
A.1 Tensor contractions in the -functions at order 3–2–2
Here we give our parametrization of the -functions using regular dummy-index notation. We reiterate that fermion indices are always treated in matrix notation and never made explicit. Tildes on more advanced objects are always defined in the usual manner: in the space of fermion indices. Several 1- and 2-loop 2-point function occurring as substructures in the -functions are used throughout. They are:
1-loop gauge 343434Be advised that is twice the trace normalization of the fermions, due to both fermions and their complex conjugated automatically being included in the notation.:
[TABLE]
1-loop fermion:
[TABLE]
1-loop scalar:
[TABLE]
2-loop gauge:
[TABLE]
2-loop fermion:
[TABLE]
2-loop scalar:
[TABLE]
Having defined the 2-point structures, we can proceed with the parametrization of the -functions.
A.1 Gauge -function
The first three loop orders of the gauge -functions are given by
[TABLE]
and
[TABLE]
and
[TABLE]
A.1 Yukawa -function
For the 1-loop Yukawa -function we have
[TABLE]
while at 2-loop orders we have
[TABLE]
A.1 Quartic -function
For the quartic -function we have
[TABLE]
at 1-loop order. For completeness we include the 2-loop quartic -function as this too is known:
[TABLE]
At this point the reader may begin to appreciate the use of the graphical notation employed throughout the paper.
A.2 Coefficients of the -functions at order 3–2–2
The basis we have employed for the -functions in this work deviates from previous works in some places. Part of this is simply due to -specific computations not including terms with vanishing coefficients. It requires some work to match our basis onto [2], [51] and [32], the latter of which has (to our knowledge) never before been extended to a compact gauge group in published literature. We will therefore list all the known coefficients in our basis here.
1-loop gauge coefficients:
[TABLE]
2-loop gauge coefficients:
[TABLE]
3-loop gauge coefficients:
[TABLE]
1-loop Yukawa coefficients:
[TABLE]
2-loop Yukawa coefficients:
[TABLE]
1-loop quartic coefficients:
[TABLE]
2-loop quartic coefficients:
[TABLE]
A.3 -function and tensors at order 3–2–1
The structure of can be inferred from the 3-loop -function, by considering all different ways of replacing couplings with open indices. The 27 resulting TSs are shown in Fig. 9. The 49 TSs making up the 4-loop -function are shown in Figs. 10 and 11.
A.4 Comparison of kinetic mixing results with previous work
We may consider two different -functions related to the gauge coupling matrix . With the generic formula for -functions (3.14), one finds
[TABLE]
using for the RG time, and the inverse (normal -function)
[TABLE]
from Eq. (3.16).
With our -functions under control we can compare to Luo and Xiao [31], where they parametrize the kinetic term of their Lagrangian by
[TABLE]
That is, they keep their gauge couplings in the interaction terms, and let include the effects of the kinetic mixing in off-diagonal terms. Gathering all their gauge couplings in a diagonal matrix we may match their notation to the one developed in the present paper353535We can keep the non-Abelian entires in , but all explicit indices in this section refer to factors.:
[TABLE]
Luo and Xiao define their -functions as
[TABLE]
so that . We then find that
[TABLE]
for . Meanwhile the regular -functions are determined from
[TABLE]
Finally for direct comparison we typically arrange the -function by loop order. It follows from Eqs. (A.22) and (5.1) that
[TABLE]
Thus,
[TABLE]
With the known 2-loop results from appendices A.1 and A.2, we reproduce the results of Luo and Xiao [31]; using a similar extraction procedure, our general 3-loop result encompasses the analogous 3-loop kinetic mixing results.
A.5 Comparison of 3–2–1 consistency conditions with previous work
While the calculation method used in this paper is conceptually identical to [6], there are various differences in the intermediate steps that render direct comparison difficult; furthermore, although implicit in their results, the authors did not explicitly include CCs involving the coefficients of , instead opting to phrase the information as a consistency check on the calculation of [32]. Not only does [6] use rather than our (which simply leads to our coefficients being larger by a factor 2), but they use a different basis for the gauge-dependent tensors in , which does not directly correspond to the single-gauge case of our own basis. Thus, to facilitate comparison, we have translated the 19 conditions obtained (implicitly) in [6] into our notation and conventions, arranged such that each condition is equal to zero:
[TABLE]
Having done the translation, we can verify that these conditions are indeed linear combinations of our more general conditions. Defining conditions as the corresponding equations (5.9) with all terms moved to the LHS, we find that
[TABLE]
From this, we can isolate the new information contained in our calculation. Clearly, equations (5.9a), (5.9b), (5.9e), (5.9f), (5.9g) and (5.9h) are completely new, as they do not appear in the linear combinations. The final piece of information fixes an under-determination involving equations : attempting to solve in terms of leaves one free parameter, hence knowing any one of these nine conditions determines the rest.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Wilson and Kogut [1974] K. G. Wilson and J. B. Kogut, Phys. Rept. 12 , 75 (1974) . · doi ↗
- 2Machacek and Vaughn [1983] M. E. Machacek and M. T. Vaughn, Nucl. Phys. B 222 , 83 (1983) . · doi ↗
- 3Machacek and Vaughn [1984] M. E. Machacek and M. T. Vaughn, Nucl. Phys. B 236 , 221 (1984) . · doi ↗
- 4Machacek and Vaughn [1985] M. E. Machacek and M. T. Vaughn, Nucl. Phys. B 249 , 70 (1985) . · doi ↗
- 5Jack and Osborn [2014] I. Jack and H. Osborn, Nucl. Phys. B 883 , 425 (2014) , ar Xiv:1312.0428 [hep-th] . · doi ↗
- 6Jack and Poole [2015] I. Jack and C. Poole, JHEP 01 , 138 (2015) , ar Xiv:1411.1301 [hep-th] . · doi ↗
- 7Antipin et al. [2013] O. Antipin, M. Gillioz, J. Krog, E. Mølgaard, and F. Sannino, JHEP 08 , 034 (2013) , ar Xiv:1306.3234 [hep-ph] . · doi ↗
- 8Bednyakov and Pikelner [2016] A. V. Bednyakov and A. F. Pikelner, Phys. Lett. B 762 , 151 (2016) , ar Xiv:1508.02680 [hep-ph] . · doi ↗
