Spin connection formulations of real Lorentzian General Relativity
Ermis Mitsou

TL;DR
This paper develops real Lorentzian formulations of General Relativity using spin connection and BF approaches, avoiding complex constraints and analyzing their viability and extensions.
Contribution
It introduces pure spin connection and constraint-free BF formulations of real Lorentzian GR, simplifying quantization and exploring extensions with matter fields.
Findings
Real Lorentzian formulations avoid reality constraints.
Modified theories contain ghosts and are not classically viable.
Extra dynamical fields can incorporate matter sectors.
Abstract
We derive the pure spin connection and constraint-free BF formulations of real four-dimensional Lorentzian vacuum General Relativity. In contrast to the existing complex formulations, an important advantage is that they do not require the reality constraints that complicate quantization. We also consider the corresponding modified gravity theories and point out that, contrary to their self-dual analogues, they are not viable because they necessarily contain ghosts. In particular, this constrains the set of potentially viable unified theories one can build by extending the gauge group to the ones with the action structure of General Relativity. We find, however, that the resulting theories do not admit classical solutions. This issue can be solved by introducing extra dynamical fields which, incidentally, could also provide a way to include a matter sector.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Spin connection formulations of real Lorentzian General Relativity
Ermis Mitsou
Abstract
We derive the pure spin connection and constraint-free BF formulations of real four-dimensional Lorentzian vacuum General Relativity. In contrast to the existing complex formulations, an important advantage is that they do not require the reality constraints that complicate quantization. We also consider the corresponding modified gravity theories and point out that, contrary to their self-dual analogues, they are not viable because they necessarily contain ghosts. In particular, this constrains the set of potentially viable unified theories one can build by extending the gauge group to the ones with the action structure of General Relativity. We find, however, that the resulting theories do not admit classical solutions. This issue can be solved by introducing extra dynamical fields which, incidentally, could also provide a way to include a matter sector.
1 Introduction & summary
Alternative formulations of classical General Relativity (GR) provide us with important insights about the theory, new perspectives for potential modifications or extensions and also genuinely new starting points for approaching the quantum theory. One line of research in this direction are the so-called “pure connection” formulations, where the usual gravitational field that is a metric or a vierbein, is replaced by a connection associated with some group, in close analogy with the mathematical description we have for the rest of the known forces of nature. In fact, this suggestive resemblance is an important motivation for this approach, because enlarging the gauge group leads to extra connection components and therefore constitutes an elegant potential path towards unification (see the review [1] and references therein). If one considers the metric formalism, then the only available group is the diffeomorphism group, so extending it necessarily introduces extra dimensions. In the vierbein formalism, however, we have the internal action of the Lorentz group, so the latter can be extended without altering the dimensionality of space-time.
Nevertheless, it is instructive to quickly expose the metric case, whose development dates back to Eddington [2] and Schrödinger [3]. The starting point is the Hilbert-Palatini action for a metric and an independent symmetric affine connection
[TABLE]
where
[TABLE]
Integrating out , i.e. replacing it with the solution to its equation of motion, yields the Einstein-Hilbert action. On the other hand, integrating out leads to the pure connection action
[TABLE]
Note that this manipulation requires , which fits within the present concordance viewpoint on cosmology and leads in particular to a tiny dimensionless “coupling constant” in units.
In the vierbein formalism, the situation is more involved. If one starts with the analogous Hilbert-Palatini formulation, with the independent connection being now associated with the Lorentz group (the “spin” connection), then it is easy to see that the vierbein cannot be isolated in its equation of motion in an algebraically simple manner. A more fruitful starting point is rather the Plebanski action [4], which has led to a formulation involving the spin connection and an auxiliary scalar density by Capovilla, Jacobson and Dell [5, 6, 7] (see also [8, 9]) and more recently to a formulation involving the spin connection alone for the case by Krasnov [10, 11] (see also [12, 13, 14, 15] for the corresponding quantum field theory). One common feature of these formulations is that they depend on the self-dual component of the spin connection alone. They can therefore be seen as gauge theories of the complexified group SO, commonly referred to as “complex GR”, and thus require “reality” constraints on the curvature field in order to select the sector corresponding to real Lorentzian GR. These constraints depend on the action and field content and are especially difficult to handle in the quantum theory. We will refer to these formulations as the “self-dual” ones (SD).
In the case of real GR, where the full spin connection is considered, the algebraic structure is more complicated and this fact has prevented the derivation of the pure connection action so far. In this paper we will show how to overcome this difficulty and will therefore obtain the pure spin connection formulation of vacuum GR with . As in the SD case [10, 11], the action involves a matrix square root
[TABLE]
where
[TABLE]
are seen as a matrices in antisymmetric pairs of Lorentz indices , are the curvature 2-forms, is the Immirzi parameter, is the Levi-Civita symbol and the indices are displaced with the Minkowski metric . Note that in the case (1.4) can be expressed non-singularly with the help of an extra Lagrange multiplier density, but the corresponding dynamics are not well defined, contrary to the SD formulation [6].
Another useful approach to gauge theories is through the so-called “BF” or “covariant Hamiltonian” formulation, where one integrates in an auxiliary set of 2-forms to make the equations of motion first-order in derivatives. The best known BF formulation of GR is the Plebanski action, in both complex [4] and real [16] forms, in which case one requires an extra set of Lagrange multipliers in order to impose some conditions on known as the “simplicity” or “metricity” constraints. Recently, however, it was noted by Herfray and Krasnov [17, 18] that there exists a constraint-free BF formulation of GR, at least in the real case, since the complex one still requires the reality constraints. This description is therefore closer to the structure of standard gauge theory, which is again an important feature if one is interested in unified theories. Due to the same kind of algebraic complications that prevented the derivation of the real pure connection action, the real version of this BF formulation has not yet been given either. Here we derive this action too, with the result being
[TABLE]
where now
[TABLE]
and , are two additional free parameters. In contrast to the pure connection formulation (1.4), here we do have access to the case.
By expressing (1.4) and (1.6) in their SL form, one can recover the SD formulations [10, 11, 17, 18] either by simply setting , or by imposing the appropriate reality constraints. In doing so, we will clarify how an implicit sign ambiguity in the scalar reality constraint, for the pure connection case, is related to the two sectors of the underlying Plebanski theory. Another interesting computation in this paper is the linearization of the pure connection theory around (anti-)de-Sitter space-time ((A)dS). We perform it for generic values and find agreement with the earlier perturbative constructions for the non-chiral case [19, 20]. A special aspect of the quadratic action is that it only depends on the fully traceless part of the curvature tensor and we show that this is a peculiarity of the GR action.
Finally, we will also consider the known generalizations of (1.4) and (1.6) where the matrix function is replaced by an arbitrary homogeneous function of degree one [21, 22, 23, 24, 25, 26, 27, 28, 29]. We will argue that, contrary to the SD case, these modified gravity theories have an unavoidable pathology in that one of their two gravitons is necessarily a ghost. This fact, which does not seem to be noted in the literature to our knowledge, makes it all the more important to know the precise matrix functions (1.4) and (1.6) that correspond to GR. This is also relevant for the corresponding unified theories, i.e. those obtained by extending the gauge group , because then only the GR Lagrangian function has a chance of yielding viable results. We will show, however, that the corresponding equations of motion admit no solution as soon as the dimension of exceeds six. This situation could be avoided by introducing extra dynamical fields which, incidentally, could also allow one to describe the matter sector.
The paper is organized as follows. In section 2 we define a compact matrix notation along with an algebraic toolkit that will make our computations much more straightforward and transparent. In section 3 we point out the equivalence between a nine-parameter family of real Plebanski formulations, which is useful because some of them can be more suited than others, depending on the problem at hand. We then derive the pure-connection action in section 4 and the constraint-free BF action in section 5. In section 6 we show the relation with the SD formulations and in section 7 we derive the linearized pure connection theory. Finally, in section 8 we discuss the ghost problem of the modified gravity theories and in section 9 we consider some aspects of the group extension of the GR action.
2 Notation and useful identities
We will work with tensors in the Lorentz algebra that are therefore indexed by antisymmetric pairs of Lorentz indices forming a six-dimensional index. More precisely, there will be vectors and 2-tensors and the metric displacing these pairs of indices is the Killing form of the algebra
[TABLE]
We can therefore introduce a compact notation in which is a vector and is a matrix. The product and trace definitions must then include factors for each contraction
[TABLE]
to avoid counting every independent component twice. The Lorentz-compatible transposition operation is
[TABLE]
which therefore yields covectors when acting on vectors , and vice-versa, and exchanges the pairs of indices for tensors . We will refer to it simply as “transposition” and note that it has all the algebraic properties of the usual matrix transposition. In particular, we can define the corresponding “symmetric” and “antisymmetric” matrices as the ones satisfying
[TABLE]
respectively, and any matrix can be decomposed as
[TABLE]
We can also define two Lorentz-invariant inner products and an outer one
[TABLE]
with the inner ones being invariant under transposition, while . There are two Lorentz-invariant matrices, the identity and the dual operator (or “complex structure”)
[TABLE]
respectively, which are symmetric and satisfy
[TABLE]
With the identity operator one can define the inverse of a matrix
[TABLE]
Observe that the real matrices of the form have the algebraic behavior of complex numbers among themselves, e.g.
[TABLE]
with the only important difference being that they do not commute with other matrices in general. We will refer to such matrices as the “invariant” ones. Every symmetric matrix can be decomposed into an invariant part and a covariant one
[TABLE]
or, equivalently, into scalar and pseudo-scalar traces and a traceless part. We next note that the presence of a complex structure provides a “conjugation” operation on matrices
[TABLE]
which is also a symmetry of the matrix inner product (2.6) and of the invariant matrices and . Moreover, it commutes with transposition , it is an involution and also a similarity transformation , meaning that it commutes with any analytic matrix function . We can then decompose any matrix into its “even” and “odd” components
[TABLE]
which can be alternatively stated as
[TABLE]
respectively. We also note that, as in the case of antisymmetric matrices , the odd ones have no invariant component . Expressing this conjugation in terms of Lorentz indices we find
[TABLE]
where
[TABLE]
is identically traceless. Therefore, for a symmetric matrix
[TABLE]
is the fully traceless component with the symmetries of the Weyl tensor, while
[TABLE]
is only partially traceless , therefore capturing the 2-tensor part of . These roles are switched for antisymmetric matrices, i.e. the fully traceless part is the odd component
[TABLE]
while the even component is the partially traceless one
[TABLE]
Thus, a generic matrix can be uniquely decomposed as
[TABLE]
with every term having a definite behavior under both transposition and conjugation and being normal to all others with respect to the matrix inner product (2.6). Moreover, notice that each of these components corresponds to an irreducible representation of the Lorentz group. In contrast to the usual definitions involving index symmetries, we see that in four dimensions the presence of allows one to define these components in a more algebraic way, namely, by distinguishing among all the possible behaviors under matrix trace, transposition and conjugation.
Finally, the (co-)vectors and matrices described above will always be differential forms of even degree, and it is understood that their products will always be wedge products, so their matrix/vector components commute. Also, when manipulating the Lagrangian, we will sometimes consider rational powers of 4-forms, as in (1.4) for instance. These should therefore be interpreted as powers of the corresponding weight one densities, with the factors always appearing with the correct power at the end to make the result coordinate-independent.
3 Plebanski actions
Here by “Plebanski Lagrangian” we will mean a member of the following family
[TABLE]
where
[TABLE]
are the curvature 2-forms of a real spin connection 1-form , while , and are real auxiliary 0, 2, and 4-forms, respectively, and is symmetric. Moreover, the are real invariant matrices with constant coefficients. Out of the nine parameters present in (3.1), only two are relevant, because seven of them can be eliminated by linear/affine redefinitions of the auxiliary fields
[TABLE]
and these relate all the possible formulations found in the literature [16, 30, 31, 32, 33]. The two relevant parameters are ultimately and that were mentioned in the introduction. Observe, however, that the equivalence of all these Lagrangians is a feature of the Lorentzian case only, because in the Euclidean case, where , not all real are invertible. Note also that the constraint imposed by on is sometimes considered implicitly, i.e. without including the corresponding term in the Lagrangian.
4 Derivation of the pure connection action
We begin with the vierbein Hilbert-Palatini-Holst Lagrangian 4-form of vacuum GR with mostly-pluses signature
[TABLE]
where
[TABLE]
are the vierbein 1-forms and are the (independent) spin connection 1-forms. Keep in mind that it is also possible to include the topological terms , where is some constant invariant matrix, which will only affect the quantum theory, but we neglect these here.
The case corresponds to the standard Hilbert-Palatini action, which is parity-symmetric (“non-chiral”), while a finite leads to the inclusion of the parity-violating (“chiral”) Holst term [34]. As in the metric case, by integrating out one recovers the Einstein-Hilbert action for the metric . In this process, the relation between the vierbein and the spin connection is independent of and the Holst term vanishes identically, so this extension does not influence the classical dynamics. In fact, if the theory is quantized in a Lorentz-covariant manner, then the physical observables seem to be independent of in the quantum theory too [35, 36]. This was definitely settled in [37], where it was shown that with the appropriate parametrization of phase space the dependence drops already at the level of the canonical action.
The theory (4.1) can be obtained as a sector of the Plebanski theory (3.1) and here a convenient choice will be
[TABLE]
where
[TABLE]
The relation to (4.1) is obtained through the equation of motion of , i.e. the “simplicity” constraints
[TABLE]
which are solved by either
[TABLE]
or
[TABLE]
for some vierbein , where we have used
[TABLE]
The solution (4.6) with the plus sign reproduces (4.1), while the one with the minus sign corresponds to ghost-like gravity, i.e. gravitons with negative kinetic energy. On the other hand, (4.7) leads to the same dynamics (i.e. neglecting the Holst term) as (4.6) for finite , but with rescaled constants
[TABLE]
In the limit the kinetic term in the Lagrangian becomes pure Holst , thus corresponding to a topological theory. For later reference, we will refer to (4.6) as the “right” sector and to (4.7) as the “wrong” sector, since our starting point is (4.1). The pure connection formulation being based on (4.3), it will necessarily contain all of these four solutions, i.e. the right and wrong sectors along with their respective sign ambiguities, and we will see how to distinguish among these later on. Note also that the relations (4.6) and (4.7) depend on the choice of Plebanski Lagrangian, but the relation (4.9) between the two sectors does not.
In complete analogy with the treatment of the SD case [5, 6, 7, 8, 9, 10, 11], we will now integrate out , then and, if , also . In that procedure, the non-trivial step which was resisting completion so far [18] is finding the solution of . Plugging the solution to the equation of motion of
[TABLE]
back inside the Lagrangian (4.3), we get
[TABLE]
where
[TABLE]
are symmetric matrices of 4-forms. Next, given the inverse matrix variation rule
[TABLE]
and the cyclicity of the trace, the equation of motion of is
[TABLE]
whose solution is therefore simply
[TABLE]
with the choice of sign being irrelevant for the final action. Had we considered a different Plebanski formulation (3.1), as was the case in earlier works [18], the equation of motion of would have taken the form
[TABLE]
for two symmetric matrices , which is therefore not solved by simply taking a square root. For instance, we could have chosen (5.1), which is obtained from (4.3) by the redefinitions (5.2), and leads to and . Therefore, we see that by taking full advantage of the freedom in redefining the auxiliary fields of the Plebanski Lagrangian, we can considerably simplify some algebraic manipulations. Nevertheless, it is instructive to note that one can actually proceed in the general case too, i.e. that (4.16) also admits a closed-form symmetric solution
[TABLE]
In order to manipulate such expressions back inside the Lagrangian, however, it is useful to have in mind that (at least for invertible matrices) the cyclicity of the trace holds even in the presence of a matrix square root, because analytic matrix functions commute with similarity transformations
[TABLE]
Whichever the approach we choose, the resulting Lagrangian reads
[TABLE]
since it cannot depend on redefinitions of fields that have been integrated out. There are now two options: either or . In the former case cannot be integrated out because it is not present in its equation of motion, i.e. it acts as a Lagrange multiplier enforcing the constraint
[TABLE]
Moreover, the equation of motion of involves the inverse matrix , meaning that it requires to be invertible. However, in the absence of a cosmological constant and sources, the maximally symmetric solution is Minkowski space-time , meaning , so one cannot describe fluctuations around that solution. At the level of the action, this issue manifests itself as the fact that the square root is not differentiable at zero. In conclusion, contrary to the SD case111In the SD case with one can avoid this problem thanks to the lower dimensionality of the involved matrices [6]. The Lagrangian is formally (4.11), but the corresponding matrices are complex symmetric. Thanks to this, by manipulating the characteristic equation of one can show that , which makes (4.11) quadratic in , after a redefinition of to absorb the determinant. Thus, integrating out leads to a Lagrangian of the form (4.19) with , but with the analogue of entering quadratically. Unfortunately, in the real case this trick does not work because the characteristic polynomial is of order six instead of three, meaning that can be expressed in terms of , where goes up to five. This alternative constraint can therefore only make the equation of motion of worst, i.e. of higher order than the quadratic one (4.14)., the real pure connection theory with is not well-defined in general.
Let us next consider the case, where now the field can be integrated out of (4.19). Plugging the solution to its equation of motion
[TABLE]
back inside (4.19), we find the pure spin connection Lagrangian
[TABLE]
hence the action (1.4). We can obtain an alternative form by using (4.18)
[TABLE]
which is simpler, although the matrix argument is no longer symmetric in general. In the non-chiral case we have
[TABLE]
so the combines with the Levi-Civita symbol hiding in the wedge product to yield a parity-symmetric Lagrangian indeed. To obtain the equations of motion of (4.22), we note that the variation of an analytic matrix function inside a trace is simply , which is shown using the Taylor expansion and the cyclicity of the trace, and find
[TABLE]
where is the exterior covariant derivative, we have used and the Bianchi identity . Now the presence of the inverse of is no longer a problem, because the maximally symmetric solution is (A)dS, for which is invertible, so we do have access to the dynamics of the corresponding fluctuations. More generally, we can describe any space-time for which is invertible everywhere, since is also invertible for real .
Let us finally come back to the issue of selecting the set of solutions corresponding to the original theory (4.1). As we will see in sections 6 and 7, it will sometimes be possible to distinguish between the “right” and “wrong” sectors, but the action will be insensitive to the choice of sign within each sector. This is because the two options are related by the sign flip at the level of the Hilbert-Palatini-Holst Lagrangian (4.1), and the pure connection one (4.22) only depends on the quadratic combination , which is invariant. Nevertheless, this will not be a problem for practical purposes. Indeed, given the relation (4.10) between and , we see that for to change sign must go through zero, meaning that must become non-invertible, in which case the whole construct breaks down anyway. Thus, restricting to solutions where is invertible everywhere automatically selects a definite sign within each sector.
5 Derivation of the constraint-free BF action
Here we find it more convenient to consider as our starting point the following Plebanski Lagrangian
[TABLE]
which is obtained from (4.3) with the redefinitions
[TABLE]
Following [18], we consider a redefinition of the form
[TABLE]
where are such that the resulting Lagrangian maintains its “canonical” BF form, i.e. the term is simply and the term arises only through the topological combinations , for some constant invariant matrix , which we neglect anyway. We find it actually more convenient to proceed in two steps. First, we demand that the resulting matrices in and be the same invariant matrix , so that the corresponding conditions read
[TABLE]
and admit the symmetric solutions
[TABLE]
where
[TABLE]
The resulting Lagrangian reads
[TABLE]
so we perform one more redefinition
[TABLE]
to obtain the canonical BF form
[TABLE]
The Plebanski Lagrangian now corresponds to the limit. For finite , however, (5.9) is no longer linear in , so the latter can be integrated out without constraining other fields. Its equation of motion can be put in the form (4.16)
[TABLE]
and is therefore solved using (4.17)
[TABLE]
leading to
[TABLE]
Finally, integrating out , whose solution reads
[TABLE]
we get the constraint-free BF Lagrangian
[TABLE]
hence the action (1.6). As in the pure connection case, we can use (4.18) to write this as
[TABLE]
at the expense of having non-symmetric matrix arguments. Note that, contrary to the pure connection case, here we do have access to the theory. We still have a matrix square root leading to inverse matrices in the equations of motion, but now this concerns , which is invertible on the Minkowski solution because is essentially the vierbein.
As a last check of the pure connection Lagrangian (4.22), we can now also integrate out . Its equation of motion reads
[TABLE]
so we must express in the square bracket in terms of in order to invert this. We first take the outer product of each side with itself, multiply by from right and left to form a total square and then take the square root
[TABLE]
Next, taking the trace of this
[TABLE]
and plugging the result back inside (5.17), we obtain
[TABLE]
which, once plugged inside (5.16), allows us to isolate
[TABLE]
Finally, plugging this back inside (5.14), and using also the shortcut (5.19), we recover the pure connection Lagrangian (4.22), up to topological terms . The latter are , thus precisely canceling the term that was introduced by the redefinition (5.3). The resulting pure connection Lagrangian is therefore consistently independent of , since the latter controls the redefinition of a field that has been integrated out.
6 The SL and SD formulations
Let us now express the Lagrangians (4.22) and (5.14) in terms of the universal covering group SL, in which case the independent variables are the self-dual components
[TABLE]
and their complex conjugates, and we use to denote the spatial Lorentz indices that are displaced with . In terms of these quantities the Lagrangians are formally the same as (4.22) and (5.14), only now
[TABLE]
which are complex symmetric matrices. In particular, a real invariant matrix becomes
[TABLE]
To prove (6.4) one must simply check that all the traces coincide, because these numbers fully determine the characteristic polynomial of , and thus its eigenvalues, which in turn fully determine . The same then holds for the functions of . With the SL forms at hand, we can now recover the SD Lagrangians in their pure-connection and BF forms. We start with the former, which is given by [10, 11]
[TABLE]
and is supplemented by the reality constraints [38]
[TABLE]
with the latter making the Lagrangian real. Note that the complex metric defined by [39, 38]
[TABLE]
is the one satisfying the Einstein equations when is a solution [38], and the ten reality constraints (6.7) are equivalent to the statement that this metric is real Lorentzian. Since this property is preserved under evolution, these constraints are compatible with the dynamics from the canonical viewpoint.
The easiest way to obtain (6.6) from the real Lagrangian is to simply set in (4.23) with (6.4), thus projecting out the anti-self-dual component
[TABLE]
Another option is to impose the reality constraints (6.7) on (4.23) using (6.5) for generic . The first constraint of (6.7) makes block-diagonal, so
[TABLE]
Then, rewriting the second constraint of (6.7) as
[TABLE]
we find, depending on the choice of sign,
[TABLE]
Thus, with the plus sign we recover (6.6) indeed, while with the minus sign the constants are rescaled as in (4.9), meaning that this corresponds to the wrong sector of the theory. It is therefore more accurate to state (6.11) with “” as the scalar constraint leading to GR, rather than the one in (6.7).
We next want to derive the BF form without simplicity constraints in the SD case. Note that, contrary to the SD Plebanski Lagrangian, where the reality constraints are expressed exclusively in terms of , here this is no longer the case, because that field has been mixed with in the redefinition
[TABLE]
For this reason, here we will only consider the easier approach which is to simply set . As in the pure-connection Lagrangian, the resulting projects out all dependencies in the matrix square root term of (5.15), while in the other two terms the anti-self-dual components are already decoupled. We can therefore collect the SD sector to find
[TABLE]
where is the complex number in according to the relation (6.5). The above Lagrangian is indeed the Lorentzian version of the one found in [17, 18] which is obtained by sending .
7 The linearized pure connection action over (A)dS
Let us now consider the (A)dS solution , in which case there exists some vierbein such that
[TABLE]
where for the solution in the right sector and in the wrong one, given the relation (4.9) between the two. In particular, the matrix entering the Lagrangian (4.22) is
[TABLE]
We next consider a perturbation around that solution
[TABLE]
and compute the part of the Lagrangian that is second order in , i.e. the linearized theory. In order to expand the Lagrangian in powers of , we need to expand the trace of a matrix square root around . However, the latter is not a multiple of the identity when is finite, so we cannot simply Taylor expand without caring about the order of the matrices. We therefore require a bit more algebra before we can begin. We first decompose into irreducible pieces (2.21)
[TABLE]
and note that the last two are first order in , since the background (7.2) is an invariant matrix, so we can expand the Lagrangian to second order in and . The desired expression is found by taking the most general symmetric ansatz containing the involved matrices
[TABLE]
squaring both sides and solving for the coefficients, which therefore depend on and . Taking the trace of the result and then the square, only four combinations survive
[TABLE]
Indeed, all first order terms drop thanks to the tracelessness of , and are odd, so their trace vanishes too, and we have used the cyclicity of the trace to also get
[TABLE]
The coefficients , and in (7.6) are determined up to a sign ambiguity, e.g.
[TABLE]
which, in fact, distinguishes the right and wrong sectors. To determine which is which, we first evaluate the Lagrangian on the (A)dS solution (7.2) to find
[TABLE]
and compare with the original Lagrangian (4.1) evaluated on the same solution (7.1)
[TABLE]
We see that the right sector corresponds to the plus sign in (7.9), whereas the wrong sector , which is related to (7.10) by the replacement (4.9), is consistently reproduced by taking the minus sign in (7.9). Focusing on the right sector from now on, the coefficients in (7.6) are given by
[TABLE]
As far as the part of the Lagrangian is concerned, and can be directly evaluated on the background solution (7.2)
[TABLE]
since they multiply quantities that are already . As for , the most general form of the second-order part in is
[TABLE]
but the terms do not contribute to the equations of motion, because they correspond to the second-order part of the topological terms which we neglect here. It then turns out that for this particular function (7.11) the terms vanish as well, leaving us with
[TABLE]
We thus need to compute , and to linear order in . In particular, we want to express these in terms of the components of the curvature 2-form in the background space-time basis
[TABLE]
so that we can now treat as a non-symmetric matrix and in particular
[TABLE]
Perturbing this expression, and decomposing into irreducible components (2.21)
[TABLE]
we find
[TABLE]
and thus
[TABLE]
The quantities of interest are therefore
[TABLE]
so plugging these inside (7.6) and using (7.12) and (7.14) gives
[TABLE]
and we must finally compute , and in terms of through (7.17). We find
[TABLE]
where
[TABLE]
is traceless, having therefore zero background, but without the symmetries of the Weyl tensor. The final result can then be put in the following form
[TABLE]
where
[TABLE]
satisfy the symmetries of the Weyl and Riemann tensor, respectively.222Indeed, is shown by contracting both sides with and is shown by contracting with . Note, however, that is not traceless, although it is effectively made so in the action by being contracted with a traceless tensor. As one could expect, a finite Immirzi parameter leads to a parity violating term , while in the parity-symmetric case we recover the result of [19, 20]
[TABLE]
On the other hand, given the symmetries of and the tracelessness of , in the (anti-)self-dual cases the action depends exclusively on the irreducible component
[TABLE]
and only through the combination
[TABLE]
Finally, the fact that the action depends solely on the traceless component , and not on , is a peculiarity of GR. To see this, let us consider the generalization of (4.22) to an arbitrary Lagrangian function
[TABLE]
that is a homogeneous function of of degree one
[TABLE]
This requirement is essentially the condition of being a 4-form, or equivalently of having a Lagrangian density of weight one, i.e. it is needed for invariance under the full diffeomorphism group. Repeating the above procedure for (7.29), we first note that we can again focus on squares of , because the contributions correspond to topological terms. Then (7.18) implies that the result can only depend on the , , and components of the perturbation . As we just saw, all of them depend exclusively on the fully traceless component , except for . Thus, it is the fact that the function is independent of that makes the quadratic GR action special.
8 No go for real modified actions
A remarkable fact about the SD formulation is that it admits an infinite parameter family of modifications with the same degrees of freedom as GR [21, 22, 23, 24, 25, 26, 27, 28, 29]. In its pure connection formulation this is simply the SD analogue of (7.29), that is, all the Lagrangians of the form
[TABLE]
where is homogeneous of degree one, thus generalizing (6.6). As for the reality constraints, a natural generalization of (6.7) would be [38]
[TABLE]
where the latter makes again the action real. Unlike in the case of GR, however, these reality constraints (8.2) are not guaranteed to be compatible with the dynamics of (8.1) [38] and, in fact, it has been claimed that they are not [40]. If this is the case, then the analogous metric to (6.8)
[TABLE]
where is some mass scale, cannot be real Lorentzian throughout the evolution. There might exist different reality constraints than (8.2) that are compatible, and a different metric than (8.3) that these constraints would make real Lorentzian, but if not, then these modified theories admit no real Lorentzian metric formulation and therefore would not count as theories of gravity. Nevertheless, prior to imposing the reality constraints, the Lagrangian (8.1) has the same degrees of freedom as (6.6). Consequently, in the Euclidean case where is real, (8.3) is real Euclidean and no constraints are required, these theories are genuine modified (Euclidean) gravity theories.
Putting aside this yet undetermined aspect for the Lorentzian case, the existence of this family of modified massless spin-2 self-interactions is possible because the standard uniqueness theorem of GR assumes symmetry under parity transformations, which is not the case of theories based on the self-dual component alone [41]. In contrast, therefore, the analogous modification in the real non-chiral case has eight degrees of freedom in general [42], namely, one massless graviton (2), one massive graviton (5) and a scalar (1) [43, 44], meaning this is actually a “bigravity” theory. This degree of freedom count remains unchanged if we allow for real parity-violating terms, i.e. the general Lagrangian (7.29), because in the absence of imaginary factors both the self-dual and the anti-self-dual components will be present and thus generically active. Note that this is not true in the Euclidean case, because the corresponding irreducible representations of SO(4) are real, so one can project out one of the two gravitons using only real parameters [44]. Note also that, at the level of the BF formulation, the corresponding generalization of (5.14) takes the form
[TABLE]
for some “potential” that is also homogeneous of degree one. Integrating out then leads to a Lagrangian of the form (7.29).
Here we would like to point out that, in the real Lorentzian formulation of these modified theories, one of the two gravitons is necessarily a “ghost”, i.e. it has negative-definite kinetic energy. The corresponding QFT is therefore either unstable or non-unitary and, more generally, such a pathology casts doubt on whether the fully non-linear theory can lead to sensible dynamics, both at the classical and quantum levels. Note that in [44] the focus was on the scalar, which is notoriously ghost-like in the case of standard bigravity [45], and on ways to eliminate it from the spectrum, in analogy with standard bigravity where this is possible [46, 47, 48, 49, 50]. However, if one of the two gravitons is a ghost too, then the only viable theory is the one with no scalar and no second graviton, that is, GR. Our argumentation will also show why the second graviton was not identified as a ghost in [43, 44].
In order to prove our assertion, we first need to lay down some quick facts. Let us consider the linearized theory for the fluctuations around some background solution with associated torsionless vierbein . Focusing on spin-2 excitations (gravitons), we note that these lie in symmetric 3-tensors and that one can form exactly two such fields out of
[TABLE]
The equations of motion of the pure connection theory being second-order in derivatives, there are at most two gravitons. In the case of GR, the simplicity constraints imposed by on , and therefore ultimately on the dynamics of the connection, neutralize a combination of and , thus leaving us with a single graviton. From the viewpoint of the canonical formulation, this corresponds to the presence of second-class constraints [42, 35] which are therefore due to the particular structure of the action, not to its gauge symmetries. The necessity of such constraints can be understood from the fact that the gauge parameters of the theory do not contain symmetric 3-tensors, so the corresponding first-class constraints and possible gauge choices cannot completely eliminate components of the form (8.5). As already noted in the previous section, at the level of the linearized action (7.24), the special structure of GR manifests itself in the fact that the dynamics only depend on part of the curvature components , namely, the fully traceless tensor .
Thus, in the non-GR cases where the Lagrangian does not have the specific structure (4.22) or (5.14), one should generically expect both and to contain degrees of freedom, as shown at the fully non-perturbative level in [42]. In contrast, these extra degrees of freedom cannot occur in the SD case, simply because one can only form a single symmetric 3-tensor out of the self-dual component of the connection alone . From the canonical viewpoint, SO GR only contains first-class constraints that are related to the gauge symmetries. Since the latter are the same for all Lagrangians (8.1) and the number of reality constraints (8.2) is also the same, the degree of freedom count is the same as in GR.
Now note that and are even and odd under parity, respectively. Thus, if the action is parity-symmetric, then its free part will be diagonal in and , while if it is parity-violating, the two gravitons will lie in two independent linear combinations of these two fields. In any case, however, the fact that and correspond to a boost and a rotation component of , respectively, implies that the kinetic terms of the two independent combinations, along with their associated momenta, will enter with opposite signs in the free Hamiltonian. This is because the Killing form of the Lorentz algebra has split signature so, whatever the independent combinations of and are, they cannot enter through a positive definite scalar product. As a result, even after taking into account all available constraints and all possible gauge fixings, the free Hamiltonian of the modified theories cannot be positive-definite. In particular, if one graviton has positive-definite kinetic energy, then the other one will have negative-definite kinetic energy. This does not occur in standard bigravity, because in that case one has full control over the kinetic terms of the two metric/vierbein fields. In contrast, here both spin-2 excitations are forced to enter through a single spin connection field, which incidentally forces them to have the opposite behavior under parity transformations, contrary to the two gravitons of standard bigravity. The reason why this important point was missed in the literature is probably the fact that the explicit studies of the linearized theories performed so far [43, 44] were always carried out in Euclidean signature, where the Killing form has definite sign , and that the Lorentzian case is usually approached through complex GR.
The fact that the modifications (7.29) are pathological in the real case is important for two reasons. First, it highlights in yet another way the special place GR holds in describing interacting spin-2 dynamics. Second, such modified theories have been considered as a starting point for unified gauge theories where the gauge group is extended to include other forces on top of gravity [51, 52]. Since the purely gravitational theory is non-viable to begin with, such generalizations are bound to fail too, i.e. they lack the action structure that is required to eliminate the ghost graviton. Consequently, if one wishes to extend the group using a modified gravity theory, then only the SD theories (8.1) would seem appropriate, as considered in [53, 54] for instance. In that case, however, there remains the difficulty of finding reality constraints that are compatible and that allow the construction of some real Lorentzian metric.
9 Group extensions and unification
The fact that only a particular choice of Lagrangian function leads to sensible dynamics at the purely gravitational level constraints severely the set of potentially viable actions for the corresponding unified theories, i.e. those obtained through extension of the gauge group . Indeed, if the case has ghosts, then so will the one. Conversely, extending the group with a Lagrangian that has the GR form has a chance of yielding a viable theory. So let us discuss some aspects of these theories.
We first observe that the presence of the operator precludes us from extending the gauge group in general. The only exceptions are groups that also have such a complex structure, e.g. groups that are the complexification of some other group. This is also clear from the SL form of the Lagrangian, i.e. (4.22) with (6.4), which is straightforwardly generalizable for groups with complex curvature coefficients. Note, however, that for the specific chiral choices we have , so the pure connection Lagrangian (4.23) is independent of , and so is the BF one (5.15) if we restrict to . Observe also that the difference between the two choices amounts to flipping the sign of , so from now on we consider the case for definiteness. Moreover, the right and wrong sectors too are now related by a simple sign flip of (see (4.9)).
With the GR actions can be generalized straightforwardly to the ones of arbitrary group by simply keeping the same matrix/vector notation, only now the implicit indices are of the adjoint representation of , with structure constants , and are displaced using the Killing form
[TABLE]
Interestingly, with the choice , or equivalently SU(2), we obtain the pure-connection Lagrangian for Euclidean four-dimensional GR derived by Krasnov [10, 11] as a stepping stone for the SD Lorentzian one. We now consider the relevant case for unification , meaning in particular that , and work with the -dependent BF formulation (5.9), now reading
[TABLE]
The equations of motion of and read
[TABLE]
respectively, and with the former we can rewrite the latter as
[TABLE]
Invoking some arbitrary non-singular vierbein field , defining
[TABLE]
and using a matrix/vector notation only for the indices of that quantity, we can express (9.4) as
[TABLE]
Now if we assume that is semi-simple, then there exists a generator basis such that is diagonal with entries. In that case (9.6) admits no solutions, because it states that there exist six-dimensional vectors that are linearly independent with respect to the scalar product . The gauge group of the Standard Model is semi-simple, so if it is to be unified with SO(1,3) within some larger group, one would expect the Jordan form of the corresponding to contain at least one diagonal block with dimension larger than six, thus making again (9.6) impossible to solve. At the level of the pure-connection (4.22) and constraint-free BF (5.14) formulations, this problem manifests itself as the impossibility of and to be invertible matrices for . They both take the respective forms and , but the vectors forming the columns of these matrices cannot be linearly independent for . Note that the absence of classical solutions does not necessarily mean that the quantum theory does not exist, it only means that the path integral has no saddle point. Without a notion of “on-shell”, however, most of the perturbative mathematical methods fail, thus making the manipulation of such theories considerably more involved.
This situation can be avoided by introducing an extra set of dynamical 0-forms , forming some representation of , along with their corresponding covariant momenta, i.e. a set of 3-forms , given that we are working with first-order Lagrangians. Using the generators in this representation , one could then form a symmetric matrix of 0-forms and consider for example Lagrangians that follow the GR structure (5.9)
[TABLE]
The analogous equation to (9.4) would then be
[TABLE]
and would therefore admit solutions where has rank six at most. In particular, the vacuum solution of physical interest, where space-time exists and the extra forces are trivial, would correspond to the VEV of being some pure-Lorentz invariant matrix, thus providing the desired dynamical symmetry breaking mechanism . Observe also that it is necessary to make dynamical, because otherwise it would alter the -dependence of the action, thus introducing ghosts. In fact, although we have maintained the GR structure in this extension, it is not at all guaranteed that a Lagrangian of the form (9.7) maintains the desired second-class constraints, or that it is devoid of pathologies in general, and a dedicated study should be carried out. We leave this to future work.
Finally, note that the presence of fields like and is also welcome for another reason, namely, because they allow us to include the type of fields one encounters in the matter sector of the Standard Model. Indeed, on the one hand the Higgs field is a set of 0-forms, while 3-forms with non-zero VEV are needed in order to form first-order Lagrangians out of spinor 0-forms .
Acknowledgments
The author is grateful to the referees for useful comments and suggestions. This work is supported by a Consolidator Grant of the European Research Council (ERC-2015-CoG grant 680886).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] K. Krasnov and R. Percacci, Class. Quant. Grav. 35 (2018) no.14, 143001, 1712.03061 .
- 2[2] A. S. Eddington, The Mathematical Theory of Relativity (Cambridge University Press, 1920), Chap. 7.
- 3[3] E. Schrödinger, Space-Time Structure (Cambridge University Press, 1950), Chap. 12.
- 4[4] J. F. Plebanski, J. Math. Phys. 18 (1977) 2511.
- 5[5] R. Capovilla, T. Jacobson and J. Dell, Phys. Rev. Lett. 63 (1989) 2325.
- 6[6] R. Capovilla, T. Jacobson and J. Dell, Class. Quant. Grav. 8 (1991) 59.
- 7[7] R. Capovilla, T. Jacobson and J. Dell, Class. Quant. Grav. 9 (1992) 1839.
- 8[8] P. Peldan, Class. Quant. Grav. 11 (1994) 1087, gr-qc/9305011 .
