Obtaining consistent Lorentz gauging for a gravitationally coupled fermion
John Fredsted

TL;DR
This paper develops a formalism for gravitationally coupled fermions that maintains the consistency of local gauge transformations, avoiding the usual violations of the equivalence principle in the vierbein formalism.
Contribution
It introduces a new approach where the fermion field carries a world index instead of Lorentz indices, ensuring gauge transformation commutativity in gravitational coupling.
Findings
Achieves gauge invariance consistency for fermions in gravity
Maintains Einstein-Hilbert action for gravity
Simplifies fermion coupling by removing Lorentz indices
Abstract
For internal gauge forces, the result of locally gauging, i.e., of performing the substitution , is physically the same whether performed on the action or on the corresponding Euler-Lagrange equations of motion. Rather unsettling, though, such commutativity fails for the standard way of coupling a Dirac fermion to the gravitational field in the setting of a local Lorentz gauge theory of general relativity in the vierbein formalism, the equivalence principle thus seemingly being here violated. This paper will present a formalism in which commutativity holds for the gravitational force as well, the action for the gravitational field itself being still the Einstein-Hilbert one. Notably, in this formalism, the spinor field will carry a world/coordinate index, rather than a Lorentz spinor index as it does standardly. More generally, no Lorentz indices will figure,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Obtaining consistent Lorentz gauging for a gravitationally coupled
fermion
John Fredsted
Rømøvænget 32B, 8381 Tilst, Denmark [email protected]
Abstract
For internal gauge forces, the result of locally gauging, i.e., of performing the substitution , is physically the same whether performed on the action or on the corresponding Euler-Lagrange equations of motion. Rather unsettling, though, such commutativity fails for the standard way of coupling a Dirac fermion to the gravitational field in the setting of a local Lorentz gauge theory of general relativity in the vierbein formalism, the equivalence principle thus seemingly being here violated. This paper will present a formalism in which commutativity holds for the gravitational force as well, the action for the gravitational field itself being still the Einstein-Hilbert one. Notably, in this formalism, the spinor field will carry a world/coordinate index, rather than a Lorentz spinor index as it does standardly. More generally, no Lorentz indices will figure, neither vector indices nor spinor indices, which from a parsimonious point of view seems quite satisfactory.
1 Introduction
Consider in global Minkowski spacetime, with metric in Euclidian coordinates , the following free Dirac action (written in explicitly hermitian form):
[TABLE]
Here, the factor , although trivial, has been included for completeness. The corresponding Euler-Lagrange equations of motion are given by . Let have an electric charge , say. Then in the presence of an external electromagnetic field , the Lagrangian is augmented to
[TABLE]
where . The corresponding Euler-Lagrange equations of motion are now given by . Reassuringly, results whether the substitution is performed on or on ; the substitution procedure may thus be said to commute with the Euler-Lagrange variational procedure. This commutativity property holds not only for an electromagnetically coupled fermion; it holds as well for a weakly coupled fermionic doublet, and for a strongly coupled fermionic triplet, the reason being that the generators for the weak and strong forces, respectively, commute with and , where means the unit matrix. Generally, it holds for any internal gauge force with generators commuting with (for appropiate values of ).
All this would be pretty uninteresting, though, was it not for the following fact: such commutativity fails for a standardly gravitationally coupled fermion. Proof: To switch on gravitational and/or inertial forces, in the realm of general relativity recasted as a local Lorentz gauge theory, the standard procedure, compare [1, Sec. 31.A] and [2, Sec. 12.1], is 1.) to introduce a vierbein, , and an associated minimal (i.e., torsionless) spin connection, , and 2.) to perform in conjunction the substitutions and and
[TABLE]
where are the generators of the spinor representation of the Lorentz group, being here defined in the ’mathematicians way’ without an explicit . Applied to the free action previously given, the result is
[TABLE]
for a Dirac fermion in an external gravitational field. The corresponding Euler-Lagrange equations of motion are given by
[TABLE]
using 1.) the identity , where is the Levi-Civita connection, and 2.) the minimality of the spin connection. But if the substitution is instead applied to the Euler-Lagrange equations of motion previously given, the result is
[TABLE]
End of proof. This nonequality of and seems to the author rather unsettling as it seems to imply that the equivalence principle is violated (for a Dirac fermion): Whereas seems to be the correct way of implementing the equivalence principle, derived from an action by Euler-Lagrange variation must necessarily take precedence over it, thus resulting in an inconsistency.
The main purpose of this paper is to present a formalism for the coupling of a fermion to the gravitational field in which no such ambiguity arises, i.e., in which the substitution procedure , now with a different of course, commute with the Euler-Lagrange variational procedure. The formalism will contain only world indices, with neither Lorentz vector indices nor Lorentz spinor indices figuring; contrary to what appears to be standard wisdom, it will prove possible to have the spinor field carry a world index rather than a Lorentz spinor index.
2 Preliminaries, I: Geometry
Let be a Riemannian manifold equipped with a metric of signature and corresponding Levi-Civita connection . Introduce on this manifold one timelike- and three spacelike vector fields, and , respectively, subject to the following conditions:
[TABLE]
[TABLE]
introducing the three-vector of four-vectors by , the dot product being performed over the Latin indices. Here, and below, a bar will denote a three-vector quantity. As for the standard vierbein, this expression for the metric in terms of four vector fields introduces excess local degrees of freedom: the metric is invariant under the following local Lorentz transformations:
[TABLE]
[TABLE]
i.e., they are the generators of the vector representation of the Lorentz group.
A remark: Strictly speaking, the above transformation is not a local Lorentz transformation, as it operates on world indices, rather than on Lorentz vector indices. But it may, nonetheless, by a mild abuse of terminology (which will be adhered to in the rest of the paper), be called so for the following reason: A genuine (infinitesimal) local Lorentz transformation, not acting on any world indices, is given by
[TABLE]
[TABLE]
The overall minus sign in this relationship is due to the fact that the Lorentz transformations of Eqs. (4a)-(4b) act on contravariant world indices, whereas the Lorentz transformations of Eqs. (6a)-(6b) act on covariant Lorentz indices (the of , remembering the previously mentioned possible identifications and ). End of remark.
As for the standard vierbein formulation of general relativity, compare again [1, Sec. 31.A] and [2, Sec. 12.1], these excess Lorentz degrees of freedom should be killed in order to avoid augmenting the standard content of general relativity. As standardly, this is done by requirering that the local Lorentz frame field consisting of and in conjunction be covariantly constant:
[TABLE]
for some connection to be introduced. The unique solution to these conditions is
[TABLE]
[TABLE]
[A remark: The presence of the Levi-Civita covariant derivative in Eq. (10) makes a type world tensor. By a continuing mild abuse of terminology, compare previous remark, it is also seen to be a type Lorentz tensor in the indices , just as the expression
[TABLE]
for the standard spin connection is a type Lorentz tensor in the indices . End of remark.] It is readily established that
[TABLE]
as is appropiate for a proper covariant derivative. These relations say that and are each type Lorentz tensors (in the index ). Therefore
[TABLE]
from which it follows that
[TABLE]
where
[TABLE]
introducing . Here, is of course the standard Riemann curvature tensor in terms of the Levi-Civita connection. [A remark: Although and have closely analogous structure, the following difference should be noted: Whereas the Levi-Civita symbols transform only as a tensor in the index , the spin connection transforms as a tensor in all its indices. This explains the appearance of Levi-Civita covariant derivatives in the definition of , as opposed to only the partial derivatives in the expression for . End of remark.] But then
[TABLE]
using , from which it follows that . As the Riemann tensor is locally Lorentz invariant, because the metric is so, this immediately implies that is as well. The Einstein-Hilbert action is obviously proportional to .
3 Preliminaries, II: Algebra
It will prove useful to define a transposition operator , say, by
[TABLE]
for any four-column vector , and any matrix . Its action would become that of the standard transposition operator if . Note that is, as it should be, a row vector, carrying a lower/covariant index. [A remark: For any matrix, the row index will always be an upper/contravariant index, and the column index will always be a lower/covariant index, with matrix multiplication thus being given by , as usual, for any two matrices . End of remark.] It is readily proved that it shares with the properties and , for any matrices , and any four-column vector . Naturally associated with is defined by
[TABLE]
A matrix for which will be called hat-(anti)symmetric, and a matrix for which will be called hat-(anti)hermitian.
3.1 Concerning Klein-Gordon compatibility
By the notion ’Klein-Gordon compatibility’ is generally meant the requirement that all solutions to some given Euler-Lagrange equations of motion are on mass-shell. Consider the following tensorial quantities:
[TABLE]
where is the Levi-Civita tensor in the notation of [4, Eq. (8.10a)]. Note that constitutes three rank two (world) tensors, one for each value of . Define the matrices and by
[TABLE]
They satisfy the following algebra (note that there is no complex conjugation of in the second relation):
[TABLE]
where and are respectively the identity matrix and the zero matrix. Furthermore, the matrices satisfy the following algebra:
[TABLE]
The sign in front of the Levi-Civita symbol depends on whether the three spacelike form a right-handed basis (plus sign) or left-handed basis (minus sign) when considered as three-vectors in the subspace they span. Note that for the right-handed case, thus obey the same algebra as do the Pauli matrices. Eqs. (19)-(21) are relevant for the proof of Klein-Gordon compatibility of some Euler-Lagrange equations of motion to be derived in Sec. 4 below. In particular, Eq. (19) will in the present formalism play a role analogous to the Dirac algebra (of the gamma matrices) in the standard Dirac algebra. The matrices and are respectively hat-hermitian and hat-antisymmetric:
[TABLE]
These two relations are relevant for the proof of hermiticity (complex self-conjugacy) of the Lagrangian to be studied in Sec. 4 below.
3.2 Concerning Lorentz invariance
Introduce the matrices by
[TABLE]
where is given by Eq. (17). They satisfy the following relations (note that there is no complex conjugation of in the second relation):
[TABLE]
These relations are readily proved using respectively Eq. (19) and Eq. (20). The proof of Eq. (25), in particular, is structurally analogous to the proof, using the Dirac algebra of gamma matrices, of the identity in the standard Dirac formalism, the only difference being the appearence of complex conjugations here and there. These matrices constitute the spinor representation of the Lorentz algebra in the sense that they satisfy
[TABLE]
i.e., they are the generators of the spinor representation of the Lorentz group. This algebra is readily proved using Eq. (25), the proof being structurally analogous to the proof, using , of the fact that in the standard Dirac formalism constitute the spinor representation of the Lorentz algebra, the only difference being, as before, the appearence of complex conjugations here and there. These generators are related to the previously introduced vector representation , compare Sec. 2, as
[TABLE]
from which it follows that is self-dual, , and that
[TABLE]
using and the identity . Using Eq. (30), to switch into (plus some ), Eqs. (25)-(26) may be rewritten as
[TABLE]
which using Eq. (29) and the definition of , Eq. (14), may also be written as
[TABLE]
As the metric, and thus as well the Levi-Civita tensor, is invariant under local Lorentz transformations of and , Eqs. (4a)-(4b), these transformations of and induce via Eqs. (15)-(16) the following relations:
[TABLE]
i.e., and transform as type and Lorentz tensors, respectively; or, equivalently, by raising various indices appropiately:
[TABLE]
i.e., and transform as type and Lorentz tensors, respectively. Using the almost trivial identity , they may be written in matrix form as
[TABLE]
Eq. (35) may be compared with the relation , where , in the standard Dirac (vierbein)formalism. The ’extra’ commutator term in Eq. (35) is due to the fact that the row and column indices of transform as well under Lorentz transformations, in compliance with Eq. (33). This is of course also the reason for the commutator term in Eq. (36). In conjunction with Eqs. (31)-(32), these relations then finally imply that
[TABLE]
These two relations are relevant for the proof of (local) Lorentz invariance of some specific action to be introduced in Sec. 4 below. From Eqs. (33)-(34) it follows that the Lorentz covariant derivatives of and are necessarily given by
[TABLE]
the identically vanishing of which is due to Eqs. (3), (7)-(8), and (15)-(16). Using the almost trivial identity , they may be written in matrix form as
[TABLE]
where and (boldfaced nablas) mean, respectively, type and type world tensors with components and . [Notational remark: A boldfaced and/or will be used whenever the covariant derivative acts on a matrix/vector-valued quantity, to remind the reader that the Levi-Civita covariant derivative will have to act also on the hidden row and/or column world indices, thus producing one or two extra Christoffel terms when fully expanded in tensor components. For the case just given, and (no boldface) could be mistaken to mean and . End of remark.]
A final note: The vector and spinor representations and , which both depend only on the metric, are both (locally) Lorentz invariant, i.e., . This is reassuring, as the opposite case, i.e., having Lorentz generators that were not Lorentz invariant, would be somewhat of a conceptual quagmire.
4 Action and Euler-Lagrange equations of motion
This is the main section of the paper in which the pieces laid out in the previous section (on preliminaries) come together.
Consider in global Minkowski spacetime, in Cartesian coordinates , endowed with spacetime-independent and for which , the following action:
[TABLE]
where for some constants obeying . Here, and are given by Eqs. (15)-(16). The action will be considered at the classical level using (complex) Grassmann-valued . The most distinctive property of is that the spinor field carries a world index (the letter referring to this), as advertised in the Introduction, rather than a standard (Lorentz) spinor index. Using Eqs. (13)-(14), the Lagrangian may also be written in matrix notation as
[TABLE]
where and are determined by Eqs. (17)-(18), and where is the four-column vector with components , obviously. The hat-hermiticity of , Eq. (22), guarantees that the kinetic part of the Lagrangian is complex self-conjugate; and the hat-antisymmetry of , Eq. (23), guarantees that the Majorana-like mass term is both complex self-conjugate and nontrivial. Eqs. (37)-(38) guarantee that the Lagrangian is globally invariant under the following Lorentz transformation (of the fundamental fields):
[TABLE]
[TABLE]
using the spacetime-independency of and , due to the assumed spacetime-independency of and ; or, equivalently, in matrix notation (by raising the -index):
[TABLE]
As they should be, these Euler-Lagrange equations of motion are Klein-Gordon compatible, i.e., any plane wave solution is on mass shell, because
[TABLE]
using 1.) in the form with the operator of complex conjugation, 2.) Eqs. (19)-(20), and 3.) due to Eq. (21) and the constraint , compare previous.
Now, in analogy with the standard procedure for switching on gravitational and/or inertial forces, compare the Introduction, 1.) let the coordinates be arbitrary, 2.) let and be subject only to the orthonormality conditions given by Eq. (2), the metric itself thus becoming arbitrary, and 3.) perform the substitution
[TABLE]
with determined by Eq. (9), and determined by Eq. (24). Note that as the spinor field now carries a world index, the explicit appearence of is mandatory, in contrast to Eq. (1), where carries only a spinor index. In matrix notation, Eq. (44) may be written as
[TABLE]
where means a type world tensor field with components , compare previous remark concerning boldfaced nabla. As shown in the Appendix at the end of this paper, is a proper Lorentz covariant derivative in the sense that it transforms as under Eq. (42) with arbitrary. The action above then becomes the following coordinate invariant and locally Lorentz invariant action:
[TABLE]
where now and are generally spacetime-dependent. Explicitly expanding the covariant derivatives, the Lagrangian is given by
[TABLE]
from which it readily follows that
[TABLE]
from which in turn it follows that the Euler-Lagrange equations of motion for are given by
[TABLE]
using the identity . Expressing in terms of (and three Christoffel terms), these equations may be rewritten as
[TABLE]
Being now manifestly tensorial, there being no explicit Christoffel symbols present, these equations can be rewritten, by unproblematically raising/lowering various indices, in matrix form as follows (note boldfaced nabla, compare previous remark):
[TABLE]
using the identity , and Eqs. (31) and (39). But , as thus defined, is simply , Eq. (43), subjected to the substitution Eq. (45), in conjunction with letting and become generically spacetime-dependent. Therefore, in the present formalism, the Lorentz gauging procedure commute with the Euler-Lagrange variational procedure, as asserted in the Introduction.
Conclusion
The formalism, as presented in this paper, for the coupling of a spinor field to the gravitational field has the following two distinguishing properties:
- •
Commutativity of the Lorentz gauging procedure and the Euler-Lagrange variational procedure, respectively, compare Sec. 4. In contrast, compare the Introduction, this property is not satisfied by the standard vierbein formalism for the coupling of a Dirac fermion to the gravitational field, the equivalence principle thus seemingly being here violated, quite unsettlingly.
- •
The use of world indices only, there being present neither Lorentz vector indices nor Lorentz spinor indices. Although this property is regarded by the author to be much less important than the commutativity property, previous bullet item, it nonetheless endows the formalism with the following two attractive properties: 1.) concerning indices, it is more parsimonious than the standard formalism is, and 2.) it seems to treat all types of fields, tensorial and spinorial ones, on an equal footing, contrary to the standard formalism.
There remains of course much to be investigated. Possible subjects are the following ones, just to mention some:
- •
Internal symmetry of the Lagrangian, and the coupling of to the electromagnetic field.
- •
Discrete symmetries: , , and .
- •
Free-particle solutions of the equations of motion for .
- •
Hamiltonian and the possible quantization of the theory.
The author intends to return to some or all of these issues in one or more future papers while at the same time of course most warmly welcoming anyone interested in contributing.
5 Appendix
The derivative , as defined by Eq. (45), is a proper Lorentz covariant derivative in the sense that it transforms as under Eq. (42) with arbitrary. Proof: The left-hand side is given by (note various boldface nablas)
[TABLE]
using and due to , and (boldface nabla) due to ; and the right-hand side is given by (note boldface nabla)
[TABLE]
Using Eq. (27), these two expressions will be equal if and only if
[TABLE]
which is in fact so due to Eq. (10). End of proof.
This proof is structurally analogous to the proof in the standard formalism for the coupling of a Dirac spinor field to the gravitational field, compare again [1, Sec. 31.A] and [2, Sec. 12.1], that the Lorentz covariant derivative of a Dirac spinor field transforms properly under local Lorentz transformations. It is given here, nonetheless, for the benefit of the reader.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] S. Weinberg, The Quantum Theory of Fields , Vol. 1-3 (Cambridge University Press, Cambridge, 2002).
- 2[2] M. B. Green, J. H. Schwarz, and E. Witten, Superstring Theory (Cambridge University Press, Cambridge, 1987).
- 3[3] S. Weinberg, Gravitation and Cosmology (John Wiley & Sons, 1972).
- 4[4] C. W. Misner, K. S. Thorne, and J. A. Wheeler, Gravitation (W. H. Freeman and Company, New York, 1973).
