Optimization for factorized quantities in perturbative QCD

P. M. Stevenson

arXiv:1904.07159·hep-ph·June 26, 2019

Optimization for factorized quantities in perturbative QCD

P. M. Stevenson

PDF

TL;DR

This paper revisits the optimization of scheme choices in perturbative QCD calculations, correcting previous deficiencies and simplifying the process by identifying proper scheme variables and invariants.

Contribution

It corrects and clarifies the application of the principle of minimal sensitivity in optimizing factorized quantities in perturbative QCD, simplifying the optimization procedure.

Findings

01

Recovered earlier results of Nakkagawa and Niegawa.

02

Showed that optimized coefficient C^opt=1, simplifying calculations.

03

Identified proper scheme variables, RG equations, and invariants.

Abstract

Perturbative calculations of factorized physical quantities, such as moments of structure functions, suffer from renormalization- and factorization-scheme dependence. The application of the principle of minimal sensitivity to "optimize" the scheme choices is reconsidered, correcting deficiencies in the earlier literature. The proper scheme variables, RG equations, and invariants are identified. Earlier results of Nakkagawa and Niegawa are recovered, even though their starting point is, at best, unnecessarily complicated. In particular, the optimized coefficients of the coefficient function C are shown to vanish, so that C^opt=1. The resulting simplifications mean that the optimization procedure is as simple as that for purely-perturbative physical quantities.

Equations207

F_{n} (Q) = ⟨ O_{n} (M)⟩ C_{n} (Q, M),

F_{n} (Q) = ⟨ O_{n} (M)⟩ C_{n} (Q, M),

\frac{M}{⟨ O ⟩} \frac{d ⟨ O ⟩}{d M} \equiv γ_{O} .

\frac{M}{⟨ O ⟩} \frac{d ⟨ O ⟩}{d M} \equiv γ_{O} .

γ_{O} (a) = - b g a (1 + g_{1} a + g_{2} a^{2} + \dots) .

γ_{O} (a) = - b g a (1 + g_{1} a + g_{2} a^{2} + \dots) .

M \frac{\partial a}{\partial M} = β (a) = - b a^{2} (1 + c a + c_{2} a^{2} + \dots) .

M \frac{\partial a}{\partial M} = β (a) = - b a^{2} (1 + c a + c_{2} a^{2} + \dots) .

C (Q, M) = 1 + r_{1} \tilde{a} + r_{2} \tilde{a}^{2} + \dots,

C (Q, M) = 1 + r_{1} \tilde{a} + r_{2} \tilde{a}^{2} + \dots,

τ \equiv b ln (M /Λ), \tilde{τ} \equiv b ln (\tilde{M} /Λ) .

τ \equiv b ln (M /Λ), \tilde{τ} \equiv b ln (\tilde{M} /Λ) .

⟨ O ⟩ = (\mbox const.) exp (\int^{a} d x \frac{γ _{O} ( x )}{β ( x )}) .

⟨ O ⟩ = (\mbox const.) exp (\int^{a} d x \frac{γ _{O} ( x )}{β ( x )}) .

⟨ O ⟩ = A exp (\int_{0}^{a} d x \frac{γ _{O} ( x )}{β ( x )} - \int_{0}^{\infty} d x \frac{g x}{x ^{2} ( 1 + c x )}),

⟨ O ⟩ = A exp (\int_{0}^{a} d x \frac{γ _{O} ( x )}{β ( x )} - \int_{0}^{\infty} d x \frac{g x}{x ^{2} ( 1 + c x )}),

\int_{0}^{a} d x \frac{- b g x ( 1 + g _{1} x )}{- b x ^{2} ( 1 + c x )} - \int_{0}^{\infty} d x \frac{g x}{x ^{2} ( 1 + c x )}

\int_{0}^{a} d x \frac{- b g x ( 1 + g _{1} x )}{- b x ^{2} ( 1 + c x )} - \int_{0}^{\infty} d x \frac{g x}{x ^{2} ( 1 + c x )}

(c a)^{g} (1 + c a)^{- g (1 - g_{1} / c)} .

(c a)^{g} (1 + c a)^{- g (1 - g_{1} / c)} .

F^{(2)} = A (c a)^{g} (1 + c a)^{- g (1 - g_{1} / c)} (1 + r_{1} \tilde{a}) .

F^{(2)} = A (c a)^{g} (1 + c a)^{- g (1 - g_{1} / c)} (1 + r_{1} \tilde{a}) .

\frac{1}{F ^{(2)}} \frac{\partial F ^{(2)}}{\partial τ ~}

\frac{1}{F ^{(2)}} \frac{\partial F ^{(2)}}{\partial τ ~}

\frac{1}{F ^{(2)}} \frac{\partial F ^{(2)}}{\partial τ}

\frac{1}{F ^{(2)}} \frac{\partial F ^{(2)}}{\partial g _{1}}

\frac{\partial r _{1}}{\partial τ ~} = 0, \frac{\partial r _{1}}{\partial τ} = g, \frac{\partial r _{1}}{\partial g _{1}} = - g,

\frac{\partial r _{1}}{\partial τ ~} = 0, \frac{\partial r _{1}}{\partial τ} = g, \frac{\partial r _{1}}{\partial g _{1}} = - g,

r_{1} = g (τ - g_{1} - σ_{1} (Q)),

r_{1} = g (τ - g_{1} - σ_{1} (Q)),

r_{1}^{opt} = 0.

r_{1}^{opt} = 0.

\tilde{a} = a (1 + g_{1} a),

\tilde{a} = a (1 + g_{1} a),

ln (1 + c a) = c \tilde{a} .

ln (1 + c a) = c \tilde{a} .

g_{1}^{opt} = \frac{ln ( 1 + c a ) - c a}{c a ^{2}} .

g_{1}^{opt} = \frac{ln ( 1 + c a ) - c a}{c a ^{2}} .

τ = \frac{1}{a} + c ln \frac{c a}{1 + c a} .

τ = \frac{1}{a} + c ln \frac{c a}{1 + c a} .

ln (1 + c a) - (c a)^{2} ln \frac{c a}{1 + c a} = c a (2 - a σ_{1} (Q)),

ln (1 + c a) - (c a)^{2} ln \frac{c a}{1 + c a} = c a (2 - a σ_{1} (Q)),

F_{opt}^{(2)} = A (c a)^{g} (1 + c a)^{- g (1 - g_{1}^{opt} / c)} .

F_{opt}^{(2)} = A (c a)^{g} (1 + c a)^{- g (1 - g_{1}^{opt} / c)} .

\frac{1}{F} \frac{\partial F}{\partial X} = 0,

\frac{1}{F} \frac{\partial F}{\partial X} = 0,

\frac{1}{F} \frac{\partial F}{\partial τ ~} = \frac{1}{C} \frac{\partial C}{\partial τ ~} .

\frac{1}{F} \frac{\partial F}{\partial τ ~} = \frac{1}{C} \frac{\partial C}{\partial τ ~} .

(\frac{\partial}{\partial τ ~}_{\tilde{a}} + \frac{β ~ ( a ~ )}{b} \frac{d}{d a ~}) C = 0,

(\frac{\partial}{\partial τ ~}_{\tilde{a}} + \frac{β ~ ( a ~ )}{b} \frac{d}{d a ~}) C = 0,

(\frac{\partial}{\partial c ~ _{j}}_{\tilde{a}} + \tilde{β}_{j} (\tilde{a}) \frac{d}{d a ~}) C = 0,

\frac{1}{C} \frac{\partial C}{\partial X} + \frac{1}{⟨ O ⟩} \frac{\partial ⟨ O ⟩}{\partial X} = 0,

\frac{1}{C} \frac{\partial C}{\partial X} + \frac{1}{⟨ O ⟩} \frac{\partial ⟨ O ⟩}{\partial X} = 0,

\frac{1}{⟨ O ⟩} \frac{\partial ⟨ O ⟩}{\partial τ} = \frac{γ _{O}}{b} .

\frac{1}{⟨ O ⟩} \frac{\partial ⟨ O ⟩}{\partial τ} = \frac{γ _{O}}{b} .

\frac{1}{⟨ O ⟩} \frac{\partial ⟨ O ⟩}{\partial c _{j}} = \frac{1}{⟨ O ⟩} \frac{\partial ⟨ O ⟩}{\partial c _{j}}_{a} + \frac{1}{⟨ O ⟩} \frac{d ⟨ O ⟩}{d a} \frac{\partial a}{\partial c _{j}},

\frac{1}{⟨ O ⟩} \frac{\partial ⟨ O ⟩}{\partial c _{j}} = \frac{1}{⟨ O ⟩} \frac{\partial ⟨ O ⟩}{\partial c _{j}}_{a} + \frac{1}{⟨ O ⟩} \frac{d ⟨ O ⟩}{d a} \frac{\partial a}{\partial c _{j}},

\frac{1}{⟨ O ⟩} \frac{\partial ⟨ O ⟩}{\partial c _{j}} = \int_{0}^{a} d x \frac{γ _{O} ( x )}{β ( x ) ^{2}} b x^{j + 2} + \frac{γ _{O} ( a )}{β ( a )} β_{j} (a) .

\frac{1}{⟨ O ⟩} \frac{\partial ⟨ O ⟩}{\partial c _{j}} = \int_{0}^{a} d x \frac{γ _{O} ( x )}{β ( x ) ^{2}} b x^{j + 2} + \frac{γ _{O} ( a )}{β ( a )} β_{j} (a) .

\frac{1}{⟨ O ⟩} \frac{\partial ⟨ O ⟩}{\partial c _{j}} = \int_{0}^{a} d x \frac{β _{j} ( x )}{β ( x )} γ_{O}^{'} (x),

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Optimization for factorized quantities

in perturbative QCD

P. M. Stevenson

*T.W. Bonner Laboratory, Department of Physics and Astronomy,

Rice University, Houston, TX 77251, USA*

Abstract:

Perturbative calculations of factorized physical quantities, such as moments of structure functions, suffer from renormalization- and factorization-scheme dependence. The application of the principle of minimal sensitivity to “optimize” the scheme choices is reconsidered, correcting deficiencies in the earlier literature. The proper scheme variables, RG equations, and invariants are identified. Earlier results of Nakkagawa and Niégawa are recovered, even though their starting point is, at best, unnecessarily complicated. In particular, the optimized coefficients of the coefficient function $C$ are shown to vanish, so that $C^{\rm opt}=1$ . The resulting simplifications mean that the optimization procedure is as simple as that for purely-perturbative physical quantities.

1 Introduction

The application of the principle of minimal sensitivity [1] to the problem of factorization-scheme dependence has had a rather unfortunate history. The present author shares some of the blame, and this paper aims to make amends. The pioneering work by Politzer [2], which showed the way, was marred by a trivial algebraic error, seemingly showing that the optimization equations had no solution. The error was belatedly corrected in Ref. [3]. However, Ref. [3] is, in retrospect, insufficiently general beyond second order. The formulation of Nakkagawa and Niégawa (NN) in a series of papers [4]-[7] is, at best, unnecessarily complicated and creates spurious difficulties. However, NN’s optimization equations are actually equivalent to those we derive below. We discuss their work in Appendix A. Note that in Refs. [2]-[7] “ $b$ ” has the opposite sign to ours.111 Our notation follows Ref. [8], except that we now omit tilde’s on $\Lambda$ and $\rho_{j}$ , which had merely emphasized a difference in definition from previous conventions. Tildes will be needed here for another purpose.

The prototypical factorization problem is in deep-inelastic leptoproduction, where a high-energy lepton collides with a proton, or other hadron, exchanging a virtual photon of large virtuality $Q^{2}$ . Neglecting power-suppressed terms, the $n$ th moment, $\int_{0}^{1}\!\frac{dx}{x}\,x^{n}F(x,Q)$ , of the non-singlet proton structure function can be factorized into the form

[TABLE]

where $\langle{\cal O}_{n}(M)\rangle$ is an operator matrix element, $C_{n}$ is a coefficient function, and $M$ is some arbitrary “factorization scale.” (From now on the moment index $n$ will be suppressed.)

The operator matrix element $\langle{\cal O}(M)\rangle$ has an $M$ dependence given by its anomalous dimension

[TABLE]

While $\langle{\cal O}(M)\rangle$ itself cannot be calculated perturbatively, its anomalous dimension, $\gamma_{{\scriptscriptstyle\cal O}}$ , has a calculable perturbation series of the form

[TABLE]

The leading-order coefficient is written as $-bg$ for later convenience. While $g$ is invariant the other coefficients, $g_{1},g_{2},\ldots$ are scheme-dependent. The expansion parameter, $a=a(M)$ , is the couplant in some arbitrary renormalization scheme (RS) with renormalization scale $M$ . Its $M$ dependence is given by the $\beta$ function:

[TABLE]

The scheme-dependent coefficients $c_{2},\ldots$ can be regarded as RS labels [1, 8].

The coefficient function $C$ can be calculated as a perturbation series:

[TABLE]

where $\tilde{a}$ is the couplant of some other arbitrary RS – which can be different from the RS used to define $a$ . It can have a different renormalization scale $\tilde{M}$ , and different RS labels $\tilde{c}_{2},\ldots$ . (In the latter respect we differ from Ref. [3].) Perhaps the easiest way to understand that the RS’s for $a$ and $\tilde{a}$ can be distinct, without inconsistency, is to imagine that first both $\langle{\cal O}\rangle$ and $C$ are calculated in the same RS and then a substitution $\tilde{a}=a(1+v_{1}a+v_{2}a^{2}+\ldots)$ , with arbitrary $v_{1},v_{2},\ldots$ , is made in the result for $C$ . In terms of renormalization constants, the $Z_{{\scriptscriptstyle\cal O}}$ constant needed for the renormalization of the operator ${\cal O}$ (which is genuinely an infinite change of normalization) must be consistent between the calculations of $C$ and $\gamma_{{\scriptscriptstyle\cal O}}$ , but the reparametrization step – the substitution of $a=Z_{a}a_{\rm bare}$ and $\tilde{a}=\tilde{Z_{a}}a_{\rm bare}$ in the bare forms of $\gamma_{{\scriptscriptstyle\cal O}}$ and $C$ , respectively – can involve distinct $Z_{a}$ and $\tilde{Z_{a}}$ renormalization constants.

Thus, what we shall call “RS/FS dependence” involves a choice of factorization scheme (FS), parametrized by $g_{1},g_{2},\ldots$ , and two, independent, choices of RS for $a$ and $\tilde{a}$ that are labelled, respectively, by $\tau$ , $c_{2},c_{3},\ldots$ and by $\tilde{\tau}$ , $\tilde{c}_{2},\tilde{c}_{3},\ldots$ , where

[TABLE]

(See Appendix B for the definition of $\Lambda$ . Without loss of generality, we may assume that the two renormalization prescriptions for $a$ and $\tilde{a}$ are defined so that their $\Lambda$ parameters are the same.)

Integrating Eq. (1.2), utilizing the $\beta$ -function equation, gives

[TABLE]

Note that the $M$ dependence of $\langle{\cal O}\rangle$ comes solely from $a$ (whereas the $M$ dependence of $C$ comes solely from the $r_{i}$ coefficients). The constant of integration may be written as a constant $A$ defined by

[TABLE]

where, as with the definition of $\Lambda$ , the lower limit of $x\to 0$ in each integral produces a divergence that cancels between the two integrals. The normalization constant $A$ is not calculable from perturbation theory, but is RS/FS invariant, as shown in Ref [3].

2 Second-order approximation

We first discuss second order, where all authors are in agreement. A second-order approximation corresponds to truncating the series for $\gamma_{{\scriptscriptstyle\cal O}}$ , $C$ , and $\beta$ after two terms. The integrals in Eq. (1.8) become

[TABLE]

which exponentiates to

[TABLE]

Substituting in Eq. (1.1), one obtains the second-order approximation to $F$ as

[TABLE]

This approximant depends on RS/FS choices through three variables, $\tau$ , $\tilde{\tau}$ , and $g_{1}$ . Partial differentiations of Eq. (2.3) yield

[TABLE]

Self-consistency of perturbation theory requires these variations to be of order $a^{2}$ . Noting that $\tilde{a}=a(1+O(a))$ , we see that

[TABLE]

so that $r_{1}$ has the form

[TABLE]

where $\boldsymbol{\sigma}_{1}(Q)$ is an invariant.222 The earlier literature is a bit sloppy at this point, as we discuss in section 4.

Substituting Eq. (2.7) back into Eqs. (2.4–2.6) and equating to zero produces the optimization conditions. Since $\partial r_{1}/\partial\tilde{\tau}$ vanishes, the solution to the optimization equation (2.4) is simply

[TABLE]

The second optimization equation, from (2.5), then reduces to

[TABLE]

and (2.6) gives

[TABLE]

Eliminating $\tilde{a}$ between these last two equations gives us the optimal $g_{1}$ in terms of $a$ :

[TABLE]

Also, from the integrated $\beta$ -function (“int- $\beta$ ”) equation (see Appendix B), at second order, we have

[TABLE]

Substituting for $\tau$ and for $g_{1}$ in Eq. (2.8) and equating to zero, since $r_{1}^{\rm opt}=0$ , we find

[TABLE]

which determines the optimized $a$ in terms of the invariant quantities $c$ and $\boldsymbol{\sigma}_{1}(Q)$ . Substituting back in Eq. (2.12) then fixes $g_{1}^{\rm opt}$ . The final optimized result, from Eq. (2.3), is

[TABLE]

Note that the optimization condition $r_{1}^{\rm opt}=0$ means that $C_{\rm opt}=1$ , so that all perturbative corrections are effectively exponentiated and re-absorbed into the anomalous dimension by the optimization procedure. As we shall see later, this property holds at any order, as first noted by NN [5].

Also note that while the value of $\tilde{a}$ (and hence $\tilde{\tau}$ ) is determined, it is not needed to obtain the result for $F^{(2)}_{\rm opt}$ .

3 RG equations

As discussed above the RS/FS variables are $\tau$ , $c_{j}$ , $\tilde{\tau}$ , $\tilde{c}_{j}$ , and the $g_{i}$ coefficients. We now write down the RG equations expressing the fact that the physical quantity $F$ is independent of all these variables. Symbolically, we have

[TABLE]

where $X$ stands for any of the set of variables $\{\tau,c_{j},\tilde{\tau},\tilde{c}_{j},g_{j}\}$ .

Recalling the factorized form $F=\langle{\cal O}\rangle C$ of Eq. (1.1), and noting that $\langle{\cal O}\rangle$ is manifestly independent of $\tilde{M}$ , we see that

[TABLE]

The same argument applies to the $\tilde{c}_{j}$ derivatives, since $\langle{\cal O}\rangle$ , while it depends on $a$ and its RS variables $\tau,c_{j}$ , is manifestly independent of $\tilde{a}$ and its RS variables $\tilde{\tau},\tilde{c}_{j}$ . Thus, the first two RG equations have the familiar form

[TABLE]

where the first term collects dependence from the $r_{i}$ coefficients of $C$ , while the second term collects the compensating dependence via $\tilde{a}$ . (See Appendix B for the definition of the $\beta_{j}(a)$ functions.)

The other RG equations all take the form

[TABLE]

where $X$ is any of the variables $\tau,c_{j}$ or $g_{j}$ . The first term only involves dependence via the $r_{i}$ coefficients – indeed we are tempted to add “ $|_{\tilde{a}}$ ” (meaning “with $\tilde{a}$ held constant”) to the notation, to match Eqs. (3.3), (3.4), but it is unnecessary since $\tilde{a}$ is manifestly independent of $\tau,c_{j}$ and $g_{j}$ . The second term can be evaluated as follows. In the case $X\to\tau$ , we may simply use the definition of $\gamma_{{\scriptscriptstyle\cal O}}$ , Eq. (1.2), to get

[TABLE]

For $X\to c_{j}$ we can first write

[TABLE]

and then use Eq. (1.8) to obtain

[TABLE]

Although we return to this form later, for the present we follow NN and re-write it as

[TABLE]

where $\gamma_{{\scriptscriptstyle\cal O}}^{\prime}(x)\equiv d\gamma_{{\scriptscriptstyle\cal O}}/dx$ . The equivalence to Eq. (3.8) can be shown by integrating by parts and then using the differential equation satisfied by the $\beta_{j}$ functions (see Appendix B). Finally, for $X\to g_{j}$ we find, from Eq. (1.8),

[TABLE]

Thus, the RG equations, in addition to Eqs. (3.3,3.4), are

[TABLE]

As usual, the RG equations determine how the coefficients $r_{i}$ must depend on the RS/FS variables. We now re-write the RG equations to facilitate finding these dependences. First, we use the series for $\gamma_{{\scriptscriptstyle\cal O}}$ and $C$ :

[TABLE]

with $r_{0}\equiv g_{0}\equiv 1$ . Second, we convert the $\beta,\beta_{j}$ functions to the $B,B_{j}$ functions of Appendix B (whose series begin $1+\ldots$ ). A third simplification, concerning the lower limit of the $i$ summations, is discussed below. We obtain

[TABLE]

The $i$ summations of the $\partial r_{i}/\partial X$ terms inherently begin with $i=1$ , but in the $c_{j}$ and $g_{j}$ equations, where the second term starts only at order $a^{j}$ , it is immediately evident that $r_{i}$ cannot depend on $c_{j}$ or $g_{j}$ for $i<j$ . Thus, we may begin those $i$ summations at $i=j$ . For the $\tilde{c}_{j}$ equation a stronger result holds, since $\partial r_{i}/\partial\tilde{c}_{j}$ must vanish for $i=j$ as well as for $i<j$ . This observation is crucial for the “exponentiation theorem” proved in Sect. 5.

In $(k+1)$ -th order all the sums would go up to $i=k$ only and the equations would only be satisfied, in an arbitrary RS/FS, up to remainder terms of order $a^{k+1}$ . The vanishing of all terms up to and including $a^{k}$ fixes the RS/FS dependence of the $r_{i}$ coefficients, and leads us to identify a set of invariants, $\sigma_{j}$ , as discussed in the next section.

4 Invariants

The scheme dependences of $r_{1}$ were already found in Eq. (2.7) and led us to the first invariant

[TABLE]

It is $Q$ dependent because $r_{1}$ , when calculated from Feynman diagrams, will contain a term $-bg\ln(Q/M)$ . One can view $\boldsymbol{\sigma}_{1}(Q)$ as $b\ln(Q/\Lambda_{F})$ , where $\Lambda_{F}$ is a scale specific to the quantity $F$ , but related, in an exactly calculable way, to the $\Lambda$ of some universal, reference RS. The earlier literature used an “invariant” $\kappa_{1}$ given by

[TABLE]

It is true that $\kappa_{1}$ is invariant under changes of FS and renormalization scale, with the explicit $g_{1}$ and $M$ dependences cancelling the implicit $g_{1}$ and $M$ dependences of $r_{1}$ . Where $\kappa_{1}$ fails to be invariant is under a change of RS that leaves the renormalization scale $M$ unchanged, but changes the renormalization prescription, so that $a^{\prime}=a(1+v_{1}a+\ldots)$ , with some arbitrary $v_{1}$ . Under such a transformation the $a^{g}$ factor in $\langle{\cal O}\rangle$ , see Eq. (2.2), becomes $(a^{\prime})^{g}=a^{g}(1+gv_{1}a+\ldots)$ , so the coefficient $r_{1}$ must become $r_{1}^{\prime}=r_{1}-gv_{1}$ to leave $F=\langle{\cal O}\rangle C$ invariant. Thus, $\kappa_{1}^{\prime}=\kappa_{1}-gv_{1}$ . Since our $\boldsymbol{\sigma}_{1}(Q)$ is

[TABLE]

this change in $\kappa_{1}$ cancels with the change from $\Lambda$ to $\Lambda^{\prime}$ , by the Celmaster-Gonsalves [9] relation.

The higher invariants, $\sigma_{2},\sigma_{3},\ldots$ , can be defined to be $Q$ -independent. As with the $\rho_{j}$ invariants, it is convenient to define the $\sigma_{j}$ ’s so that they reduce to the $\beta$ -function coefficients $c_{j}$ in “effective charge” schemes, defined by the RS/FS choices $g_{j}=0$ , $r_{i}=0$ . The invariants, so defined, depend on $\tau$ and $\tilde{\tau}$ only via the difference $\tilde{\tau}-\tau$ and have no dependence on $Q$ or $\Lambda$ .

To find the invariants we will need the conversion between $\tilde{a}$ and $a$ ; either $\tilde{a}=a(1+V_{1}a+V_{2}a^{2}+\ldots)$ or its inverse

[TABLE]

The $\tilde{V}_{i}$ coefficients can most easily be found from the relation between the $\beta$ functions: $\tilde{\beta}(\tilde{a})=(d\tilde{a}/da)\beta(a)$ . (In fact, the calculation mirrors that for the $\rho_{i}$ invariants in Ref. [8].) The first three coefficients are

[TABLE]

Note that the $\tilde{V}_{i}$ ’s do not only involve differences $c_{j}-\tilde{c}_{j}$ . It is true, though, that the $V_{i}$ coefficients of the inverse relationship are obtained by exchanging all plain and tilde variables.

We now turn to a calculation of the invariant $\sigma_{2}$ . Expanding Eqs. (3.15–3.19) in powers of $a$ and $\tilde{a}$ and using the above result for $\tilde{V}_{1}$ , we can extract the self-consistency conditions. From the lowest-order terms we recover Eqs. (2.7) for $r_{1}$ ’s derivatives, plus confirmation that $r_{1}$ does not depend on the other RS/FS variables ( $c_{2}$ , $\tilde{c}_{2}$ , $g_{2}$ ). From the next-order terms we find

[TABLE]

Integrating each of these equations individually is easy, but combining the results consistently is a little tricky. However, it is straightforward to check our result that $r_{2}$ has the form:

[TABLE]

where the constant is independent of all the RS/FS variables. The constant can be conveniently written as $\frac{g}{2}\sigma_{2}$ so that the invariant $\sigma_{2}$ is given by

[TABLE]

An easier and more systematic way to calculate the $\sigma_{i}$ invariants is to find them as the $\rho_{i}$ invariants associated with the physical quantity

[TABLE]

The perturbation series for ${\cal D}$ can be found in terms of the $C$ and $\gamma_{{\scriptscriptstyle\cal O}}$ series in various ways. Perhaps the simplest is the following. First, note that all the $Q$ dependence of $F$ resides in the $r_{i}$ coefficients of $C$ . For dimensional reasons such $Q$ dependence can come only via the ratios $Q/M$ and $Q/\tilde{M}$ . Thus,

[TABLE]

The $M$ dependence of $C$ must cancel out with that of $\langle{\cal O}\rangle$ in the product $F=\langle{\cal O}\rangle C$ , so that

[TABLE]

while $C$ is independent of $\tilde{M}$ , so that

[TABLE]

From these observations we see that

[TABLE]

Thus, ${\cal D}$ is, in a sense, a “physicalized” version of $\gamma_{{\scriptscriptstyle\cal O}}$ .

Substituting in the above formula we find

[TABLE]

We could now expand out in terms of $\tilde{a}$ , converting $a$ to $\tilde{a}$ using Eq. (4.4). Alternatively, we can eliminate $\tilde{a}$ and find the series expansion in terms of $a$ . The results are more compact in the $a$ scheme:

[TABLE]

with

[TABLE]

and so on. Note that these coefficients are independent of the FS and independent of the tilde RS variables, with the explicit $g_{i}$ and $\tilde{\tau},\tilde{c}_{j}$ dependences exactly cancelling with the implicit dependences from the $r_{i}$ coefficients; see Eqs. (2.7), (4.6). Thus, the $r^{{\cal D}}_{i}$ coefficients only depend, in the usual way, on the RS variables $\tau,c_{j}$ associated with $a$ .

As usual, we can construct the $\rho_{j}$ invariants for the quantity ${\cal D}$ :

[TABLE]

and these coincide with the $\sigma$ ’s. Indeed, it is easy to see that the “effective-charge-type” RS/FS used in the definition of the $\sigma$ ’s corresponds to the usual effective-charge scheme for ${\cal D}$ , so the equivalence of $\rho_{j}^{{\cal D}}$ to $\sigma_{j}$ is true for all $j$ .

The calculation can be straightforwardly extended to higher orders. Defining

[TABLE]

the first three invariants are

[TABLE]

Using these formulas the values of the invariants can be found from Feynman-diagram calculations performed in any convenient RS/FS.

5 The exponentiation theorem

The $(k+1)$ -th order approximation is defined by truncating the series for $C$ , $\gamma_{{\scriptscriptstyle\cal O}}$ , $B$ , and $\tilde{B}$ . The resulting approximant, in general, will have a residual RS/FS dependence that is formally of order $a^{k+1}$ . The optimization conditions correspond to requiring the RG equations to be exactly satisfied, with no remainder. (To avoid notational clutter, we leave it understood that, henceforth, any RS/FS-dependent symbol ( $a,\tilde{a},r_{i},$ etc.) stands for the optimized value of that quantity.)

At second order we saw that the $\tilde{\tau}$ optimization equation gave $r_{1}=0$ . In third order $(k=2$ ) the $\tilde{\tau}$ equation (3.15), in which $\partial r_{2}/\partial\tilde{\tau}=r_{1}$ , reduces to

[TABLE]

Also, the $\tilde{c}_{2}$ equation (3.16), in which the $\tilde{B}_{2}(\tilde{a})$ factor cancels out because $\partial r_{2}/\partial\tilde{c}_{2}=0$ , becomes just

[TABLE]

Substituting this back into the previous equation gives $r_{1}=0$ . Substituting $r_{1}=0$ back into Eq. (5.2) then gives $r_{2}=0$ . The result generalizes to all orders, as first noted by NN.

Theorem (Nakkagawa and Niégawa [5])

The solution to the $\tilde{\tau}$ and $\tilde{c}_{j}$ optimization equations is

[TABLE]

Thus, $C=1$ in the optimal scheme, so that all perturbative corrections are effectively exponentiated and re-absorbed into the anomalous dimension $\gamma_{{\scriptscriptstyle\cal O}}$ .

Proof: The $\tilde{c}_{j}$ optimization equation follows from Eq. (3.16):

[TABLE]

where $dC/d\tilde{a}=\sum_{i=1}^{k}ir_{i}\tilde{a}^{i-1}$ . Recall that all terms up to and including $\tilde{a}^{k}$ must cancel in any RS, thus determining $\partial r_{i}/\partial\tilde{c}_{j}$ . By starting the sum at $i=j+1$ we have already used the fact that $\partial r_{i}/\partial\tilde{c}_{j}$ must vanish for $i<j$ and for $i=j$ , as noted at the end of Sect. 3.

We begin by considering the case $j=k$ . The first term vanishes, as there are no terms in the sum, so we find that in the optimal scheme

[TABLE]

Next, consider the case $j=k-1$ . In any scheme, cancellation of the $\tilde{a}^{k}$ terms requires

[TABLE]

In the optimal scheme the left-hand side must vanish, since $dC/d\tilde{a}$ vanishes in the optimization equation (5.4). Thus, in the optimal scheme, $r_{1}=0$ . Proceeding to the case $j=k-2$ we can find $\partial r_{k}/\partial\tilde{c}_{k-2}$ as a sum of $r_{1}c$ and $r_{2}$ terms. In the optimal scheme this must vanish, and since we already have $r_{1}=0$ , we now find that $r_{2}=0$ , too. We may then proceed to successively lower $j$ cases to see that other $r_{i}$ ’s vanish. Finally, we reach $j=1$ , where we are dealing with the $\tilde{\tau}$ equation, which gives us $r_{k-1}=0$ . Substituting back into $dC/d\tilde{a}=\sum_{i=1}^{k}ir_{i}\tilde{a}^{i-1}=0$ then shows that $r_{k}=0$ .

6 The optimization equations

The fact that $C=1$ in the optimal scheme allows us to simplify the remaining optimization equations, which follow from Eqs. (3.17–3.19) with the $i$ summations truncated at $i=k$ .

Also, recalling that the $B_{j}(a)$ functions are related to the $I_{j}(a)$ integrals, one sees that the $c_{j}$ equation involves

[TABLE]

This can be simplified by interchanging the order of the two integrations:

[TABLE]

to give

[TABLE]

which corresponds to going back to the form in Eq. (3.8) for $\frac{1}{\langle{\cal O}\rangle}\frac{\partial\langle{\cal O}\rangle}{\partial c_{j}}$ . Also note that the $g_{j}$ optimization equations involve a related set of integrals

[TABLE]

Thus, the $\tau$ , $c_{j}$ , and $g_{j}$ optimization equations can be written as

[TABLE]

In each of these equations the first term is a polynomial in $\tilde{a}$ that must precisely cancel out the terms up to and including $\tilde{a}^{k}$ present in the second term, if it were expanded out in a power series in $\tilde{a}$ . In Ref [8] we used the notation $\mathbb{T}_{n}[G(a)]$ to mean “truncate the series for $G(a)=G_{0}+G_{1}a+\ldots$ immediately after the $a^{n}$ term” (i.e., $\mathbb{T}_{n}[G(a)]\equiv G_{0}+G_{1}a+\ldots+G_{n}a^{n}$ ). Here we will need $\tilde{\mathbb{T}}_{n}$ as the equivalent operation in the expansion parameter $\tilde{a}$ . Thus, we may re-write the equations (swapping the order of the two terms and dividing out a $g$ factor) as

[TABLE]

However, note that the arguments of the $\tilde{\mathbb{T}}_{k}$ ’s are all functions of $a$ , rather than $\tilde{a}$ , so it is best to think of the $\tilde{\mathbb{T}}_{k}[G]$ operation in three stages (i) expand $G$ as series in $a$ up to $a^{k}$ , (ii) convert $a$ to $\tilde{a}$ using Eq. (4.4), and (iii) re-expand as a series in $\tilde{a}$ , and truncate after the $\tilde{a}^{k}$ term.

A further simplification results from the realization that, since $C=1$ , we do not need to know the optimized value of $\tilde{a}$ ; nor do we need to know the $\tilde{c}_{j}$ ’s or $\tilde{\tau}$ : they do not enter into the optimized result for $F$ , which just involves evaluating $\langle{\cal O}\rangle$ in the optimal scheme. Thus, what we need to do is to take combinations of the optimization equations in which $\tilde{a}$ and the $\tilde{V}_{i}$ ’s cancel out. From the resulting equation combinations we can solve for the $g_{j}$ coefficients in terms of the “principal variables” $a,c_{2},\ldots c_{k}$ . (Note that the $I$ and $J$ integrals are functions of these principal variables.) Finally, we can use the invariants, $\sigma_{i}$ and $\boldsymbol{\sigma}_{1}(Q)$ , and the int- $\beta$ equation to determine the optimized result. Note that when $r_{i}$ =0 the $\sigma_{j}$ ’s have exactly the same form as the usual $\rho_{j}$ invariants with $g_{i}$ ’s in place of $r_{i}$ ’s.

In the next section we illustrate the above observations in the case of third order.

7 Third-order approximation

In third order ( $k=2$ ) we have four remaining optimization equations, in the variables $\tau$ , $c_{2}$ , $g_{1}$ , and $g_{2}$ . From Eqs.(6.8)–(6.10) these are

[TABLE]

Taking the $g_{1}$ equation minus the $\tau$ equation cancels the $\tilde{a}$ terms and, not coincidentally, the $\tilde{V}_{1}$ terms, leaving

[TABLE]

An $\tilde{a}^{2}$ term remains, but we can substitute from the $g_{2}$ equation to obtain

[TABLE]

Taking the $g_{2}$ equation minus the $c_{2}$ equation cancels the $\tilde{a}^{2}$ terms, giving

[TABLE]

We may solve these last two equations for $g_{1},g_{2}$ in terms of the principal variables $a,c_{2}$ .

From the four original equations we have extracted just two equations that give us the $g_{1},g_{2}$ coefficients that we need. There are effectively two other equations that we can just ignore; they would determine $\tilde{a}$ and $\tilde{V}_{1}$ (which gives $\tilde{\tau}$ and, combined with the int- $\tilde{\beta}$ equation of the tilde scheme, would then fix $\tilde{c}_{2}$ ), but we have no need to obtain values for these variables.

To relate the principal variables to $Q$ and the invariants, we substitute the optimal-scheme quantities into the expressions for $\sigma_{2}$ and $\boldsymbol{\sigma}_{1}(Q)$ , combining the latter with the int- $\beta$ equation to eliminate $\tau$ . In the optimal scheme, since $r_{i}=0$ , the formula for $\sigma_{2}$ reduces to

[TABLE]

which is the familiar form of a $\rho_{2}$ invariant, but with $g_{i}$ ’s as the coefficients. Similarly, in the optimal scheme

[TABLE]

where $K^{(3)}(a)$ is the third-order approximation to the $K(a)$ function of the int- $\beta$ equation.

8 A simpler approach

In fact, there is a simpler approach that allows us to get directly to the equations determining the optimal $g_{i}$ ’s. Consider the physical quantity ${\cal D}$ defined in Eq. (4.9), which we showed is given by Eq. (4.13), so that ${\cal D}=\gamma_{{\scriptscriptstyle\cal O}}$ when $C=1$ . That suggests that we consider $F$ in the form:

[TABLE]

where “ $[0]$ ’ is a shorthand for the same “lower limit of [math] with subtraction of the suitable infinite scheme-independent constant,” as in Eq. (1.8). Formally, this expression for $F$ is valid quite generally, and is independent of the RS used, so it satisfies RG equations saying that the total dependences on $\tau$ and $c_{j}$ all vanish. What we are doing in RS/FS optimization is equivalent to a normal RS optimization applied to $F$ , except that the approximants being optimized are not truncations of the perturbation series for $F$ , but are approximants formed by truncating the perturbation series for ${\cal D}$ and $\beta$ . That is, the $(k+1)$ -th approximant to $F$ is given by substituting

[TABLE]

into Eq. (8.1). The optimization equations follow from requiring the $\tau$ and $c_{j}$ derivatives to vanish. (Note that when we take such derivatives the infinite constant plays no role and the “ $[0]$ ” lower limit can safely be replaced by [math], since the resulting integrals converge.) For $\tau$ we have

[TABLE]

while for $c_{j}$

[TABLE]

Substituting the series form for ${\cal D}(a)$ leads to

[TABLE]

where $I_{j,i}(a)=a^{i+1}I_{j}(a)-I_{i+j+1}(a)$ arises from the first and third terms of Eq. (8.4).

The derivatives $\partial r_{i}^{{\cal D}}/\partial\tau$ and $\partial r_{i}^{{\cal D}}/\partial c_{j}$ are the usual RS dependences of perturbative coefficients [1, 8], and can be quickly found from the expressions for the $\rho_{i}^{{\cal D}}$ invariants. Thus,

[TABLE]

Using these results, and recalling that in the FS/RS optimal scheme the optimized $r_{i}^{{\cal D}}$ ’s equal the optimized $g_{i}$ ’s, the reader can quickly check that at 3rd order ( $k=2$ ) Eqs. (8.5) and (8.6) lead directly to Eqs. (7.6) and (7.7).

At 4th order ( $k=3$ ) the $\tau,c_{2},c_{3}$ equations reduce to

[TABLE]

We have explicitly checked that these are indeed the equations one would obtain from appropriate combinations of Eqs. (6.8), (6.9), (6.10).

9 Conclusions and outlook

The optimization approach to the problem of RS/FS dependence is now, we believe, on a firm footing. It is far less daunting than it might appear at first sight. There are $3k$ scheme variables at $(k+1)$ -th order and $k$ coefficients, $r_{i}$ . However, $k$ of the optimization equations lead to $r_{1}=\ldots=r_{k}=0$ , so that $C=1$ ; another $k$ variables ( $\tilde{\tau},\tilde{c}_{2},\ldots,\tilde{c}_{k}$ ) then need not be solved for. That leaves $k$ combinations of optimization equations that can be solved for $g_{1},\ldots,g_{k}$ in terms of the “principal variables” $a,c_{2},\ldots,c_{k}$ . In fact, these equations can be obtained more directly by the approach in the last section. By substituting in the expressions for the invariants, one can then solve for all the needed quantities. The last step will require an iterative algorithm, as in ordinary optimization [8].

Our results have applications to various quantities, such as charmonium decays to hadrons, $B$ decays to charmonium, or Higgs boson decay to hadrons: These quantities have a factorized form involving the wavefunction at the origin or, in the last case, the quark masses. For applications involving parton distribution functions and fragmentation functions there is more work to be done. We have only considered the non-singlet case; the flavour-singlet case involves matrices describing quark-gluon mixing. Also, our analysis has used the language of structure-function moments, which is convenient theoretically since it reduces a convolution integral to a simple product. However, phenomenologically, it seems preferable to deal directly with the parton distributions using parton-evolution (DGLAP) equations. It would be valuable to see if our moments-based approach can be reformulated in that language and put into practice.

We end with a plea to recognize of the importance of this effort. When QCD was young, the use of phenomenological, ad hoc choices was excusable, perhaps even necessary to make progress. Now that the theory is mature we cannot go on using arbitrary renormalization prescriptions and blind guesses at the “right” renormalization and factorization scales (which don’t even exist, since it is only the ratios of $M$ and $\tilde{M}$ to the prescription-dependent $\Lambda$ that matter). If “precision QCD” is to be a valid scientific enterprise, it must be based on a systematic treatment of RS/FS ambiguities, with a respect for RG invariance at its core.

Appendix A: Discussion of the work of NN

In this appendix we critique the work of Nakkagawa and Niégawa (NN) [4]-[7] and outline why, nevertheless, their optimization equations are equivalent to ours. Note that their “ $\mu$ ” corresponds to our $\tilde{M}$ (and their “ $b$ ” is the opposite sign to ours). Their $\tilde{a}$ is the same as ours, but their $a$ is somehow supposed to explicitly depend on both $M$ and $\tilde{M}$ . They write $a=a(\mu,\xi)$ where $\xi=M/\mu$ . It is never clear quite how this object is defined. Because of its supposed dependence on two scales, NN associate it with two $\beta$ functions, whose coefficients are supposed to depend on $\xi$ . We find this rather odd; it might not be wrong, but it certainly creates difficulties without gaining any generality. In our approach the couplant $a$ is a normal couplant, with a renormalization scale $M$ , in a RS labelled by $\tau\equiv b\ln(M/\Lambda),c_{2},c_{3},\ldots$ . This RS is distinct from, and independent of, the tilde RS used for $\tilde{a}$ , whose scale is $\tilde{M}$ and whose scheme labels are $\tilde{\tau},\tilde{c}_{2},\tilde{c}_{3},\ldots$ . Along with FS labels $g_{1},g_{2},\ldots$ these form the complete set of RS/FS labels, and variation of any one label, in a partial derivative, is made holding the other labels constant. Thus, there is no question of $c_{j}$ ’s “depending” on $M$ or $\tilde{M}$ or their ratio.

For NN the integration of their two $\beta$ -function equations for “ $a(\mu,\xi)$ ” is problematic [5, 6], because of a dependence on the integration path. Later [7] they claimed to have resolved this problem, and made the $\xi$ dependence of their $c_{j}$ ’s go away. In our view, this dependence and the integration-path problem should never have been there in the first place!

NN’s analysis involves a somewhat mysterious variable $\Phi$ , which it seems must actually be, in their notation, $b\ln(M/\mu)$ . In our notation that means $\Phi=-b\ln(M/\tilde{M})=\tilde{\tau}-\tau$ . Provided that we make this identification, we find that their equations (Eqs. (18a-e) of Ref. [5]) are equivalent to ours. Apart from straightforward conversion of notation we need to recognize that they work with variables $\mu$ and $\Phi$ , etc., while we work with $\tilde{M}=\mu$ and $M$ (related to $\tilde{\tau}$ and $\tau$ , respectively). Thus their $\partial/\partial\Phi$ is at constant $\mu$ and coincides with our $-(1/b)M\partial/\partial M=-\partial/\partial\tau$ : However, their $\mu\partial/\partial\mu$ is at constant $\Phi$ and so corresponds to our $\tilde{M}\partial/\partial\tilde{M}+M\partial/\partial M=b(\partial/\partial\tilde{\tau}+\partial/\partial\tau$ ). Hence, their optimization equation associated with $\mu$ is a sum of our $\tilde{\tau}$ and $\tau$ optimization equations.

Notwithstanding our criticisms, NN deserve praise for arriving at the correct optimization equations, and they were correct to criticize Refs. [2, 3]’s formulation as insufficiently general. The applications of their results, pursued with Yokota [10], are valid and important. In particular, they show how optimization naturally resolves the issue that, in a naïvely fixed scheme, the perturbative coefficients for the $n$ th moment would grow like $\ln n^{2}$ .

Appendix B: $\beta(a)$ and $\beta_{j}(a)$ functions

For the reader’s convenience we list here some key formulas from Refs. [1, 8]. The integrated form of the $\beta$ -function equation, referred to as the “int- $\beta$ ” equation, is

[TABLE]

with

[TABLE]

The $\beta_{j}$ functions, defined as $\partial a/\partial c_{j}$ , are given by

[TABLE]

Their series expansions begin at order $a^{j+1}$ so it is convenient to define $B_{j}(a)$ functions which begin $1+O(a)$ :

[TABLE]

For $j=1$ it is natural to define

[TABLE]

with the convention that $c_{0}\equiv 1$ and $c_{1}\equiv c$ . Equation (B.3) can then be re-written as

[TABLE]

where

[TABLE]

(Note that this formula for $B_{j}(a)$ even holds for $j=1$ if the r.h.s. is interpreted as the limit $j\to 1$ from above.)

Differentiating Eq. (B.3) leads to

[TABLE]

where here the prime indicates differentiation with respect to $a$ , regarding the coefficients $c_{j}$ as fixed.

Bibliography10

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] P. M. Stevenson, Phys. Rev. D 23 , 2916 (1981).
2[2] H. D. Politzer, Nucl. Phys. B 194 , 493 (1982).
3[3] P. M. Stevenson and H. D. Politzer, Nucl. Phys. B 277 , 758 (1986).
4[4] H. Nakkagawa and A. Niégawa, Phys. Lett. B 119 , 415 (1982).
5[5] H. Nakkagawa and A. Niégawa, Prog. Theor. Phys. 70 , 511 (1983).
6[6] H. Nakkagawa and A. Niégawa, Prog. Theor. Phys. 71 , 339 (1984).
7[7] H. Nakkagawa and A. Niégawa, Prog. Theor. Phys. 71 , 816 (1984).
8[8] P. M. Stevenson, Nucl. Phys. B 868 , 38 (2013).

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

1 Introduction

2 Second-order approximation

3 RG equations

4 Invariants

5 The exponentiation theorem

6 The optimization equations

7 Third-order approximation

8 A simpler approach

9 Conclusions and outlook

Appendix A: Discussion of the work of NN

Appendix B: β(a)\beta(a)β(a) and βj(a)\beta_{j}(a)βj​(a) functions

Appendix B: $\beta(a)$ and $\beta_{j}(a)$ functions