Canonical Transformation Path to Gauge Theories of Gravity
J. Struckmeier, J. Muench, D. Vasak, J. Kirsch, M. Hanauske, H. Stoecker

TL;DR
This paper derives a gauge theory of gravity using canonical transformations and the action principle, providing a framework that extends beyond Einstein's General Relativity by incorporating a Hamiltonian quadratic in conjugate momenta.
Contribution
It introduces a canonical transformation approach to formulate geometrodynamics as a gauge theory, ensuring form-invariance and coupling of fields to spacetime geometry.
Findings
The Hamiltonian is at least quadratic in conjugate momenta, unlike Einstein-Hilbert.
The approach maintains the form of the action functional under spacetime transformations.
It unambiguously describes how spin-0 and spin-1 fields couple to spacetime.
Abstract
In this paper, the generic part of the gauge theory of gravity is derived, based merely on the action principle and on the general principle of relativity. We apply the canonical transformation framework to formulate geometrodynamics as a gauge theory. The starting point of our paper is constituted by the general De Donder-Weyl Hamiltonian of a system of scalar and vector fields, which is supposed to be form-invariant under (global) Lorentz transformations. Following the reasoning of gauge theories, the corresponding locally form-invariant system is worked out by means of canonical transformations. The canonical transformation approach ensures by construction that the form of the action functional is maintained. We thus encounter amended Hamiltonian systems which are form-invariant under arbitrary spacetime transformations. This amended system complies with the general principle of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Canonical transformation path to gauge theories of gravity
J. Struckmeier
Frankfurt Institute for Advanced Studies (FIAS), Ruth-Moufang-Strasse 1, 60438 Frankfurt am Main, Germany
Goethe Universität, Max-von-Laue-Strasse 1, 60438 Frankfurt am Main, Germany
GSI Helmholtzzentrum für Schwerionenforschung GmbH, Planckstrasse 1, 64291 Darmstadt, Germany
J. Muench
Frankfurt Institute for Advanced Studies (FIAS), Ruth-Moufang-Strasse 1, 60438 Frankfurt am Main, Germany
Goethe Universität, Max-von-Laue-Strasse 1, 60438 Frankfurt am Main, Germany
D. Vasak
J. Kirsch
Frankfurt Institute for Advanced Studies (FIAS), Ruth-Moufang-Strasse 1, 60438 Frankfurt am Main, Germany
M. Hanauske
Frankfurt Institute for Advanced Studies (FIAS), Ruth-Moufang-Strasse 1, 60438 Frankfurt am Main, Germany
Goethe Universität, Max-von-Laue-Strasse 1, 60438 Frankfurt am Main, Germany
H. Stoecker
Frankfurt Institute for Advanced Studies (FIAS), Ruth-Moufang-Strasse 1, 60438 Frankfurt am Main, Germany
Goethe Universität, Max-von-Laue-Strasse 1, 60438 Frankfurt am Main, Germany
GSI Helmholtzzentrum für Schwerionenforschung GmbH, Planckstrasse 1, 64291 Darmstadt, Germany
(25 April 2017; 27 June 2017)
Abstract
In this paper, the generic part of the gauge theory of gravity is derived, based merely on the action principle and on the general principle of relativity. We apply the canonical transformation framework to formulate geometrodynamics as a gauge theory. The starting point of our paper is constituted by the general De Donder-Weyl Hamiltonian of a system of scalar and vector fields, which is supposed to be form-invariant under (global) Lorentz transformations. Following the reasoning of gauge theories, the corresponding locally form-invariant system is worked out by means of canonical transformations. The canonical transformation approach ensures by construction that the form of the action functional is maintained. We thus encounter amended Hamiltonian systems which are form-invariant under arbitrary spacetime transformations. This amended system complies with the general principle of relativity and describes both, the dynamics of the given physical system’s fields and their coupling to those quantities which describe the dynamics of the spacetime geometry. In this way, it is unambiguously determined how spin-[math] and spin- fields couple to the dynamics of spacetime.
A term that describes the dynamics of the free gauge fields must finally be added to the amended Hamiltonian, as common to all gauge theories, to allow for a dynamic spacetime geometry. The choice of this “dynamics Hamiltonian” is outside of the scope of gauge theory as presented in this paper. It accounts for the remaining indefiniteness of any gauge theory of gravity and must be chosen “by hand” on the basis of physical reasoning. The final Hamiltonian of the gauge theory of gravity is shown to be at least quadratic in the conjugate momenta of the gauge fields—this is beyond the Einstein-Hilbert theory of general relativity.
DOI: 10.1103/PhysRevD.95.124048
I Introduction
The theory of general relativity, as proposed by A. Einstein in 1915 [1]—in conjunction with the vacuum solution of K. Schwarzschild [2]—has provided a stunningly accurate description of the dynamics of celestial bodies. This fact becomes even more surprising as Einstein’s approach was in fact an “educated guess”, or—in H. Weyl’s words—“a purely speculative theory” [3]. The modern comprehension of general relativity as a gauge theory arises from requiring a given Lorentz-invariant theory to be invariant as well under local Lorentz transformations. This approach was pioneered by Utiyama [4] in 1956. On the other hand, a rigorous derivation of the theory that describes the interaction of matter/energy with the spacetime fabric on the basis of the action principle and the requirement that the description of any system should be form-invariant under general spacetime transformations has not yet been delivered.
Moreover, Einstein’s theory has severe limitations:
- (i)
The theory is not scale-invariant as the pertaining coupling constant is not dimensionless.
- (ii)
The underlying coarse-grained energy-momentum balance equation of the theory appears to be inaccessible to quantization. A more detailed and quantizable theory would describe the direct interaction of individual elementary particle fields with the gravitational field, the latter described by the (uncontracted) Riemann tensor—similarly to the form of the Maxwell equation.
- (iii)
The observed dynamics of clusters of galaxies and stars in galaxies led to postulates of the existence of “dark matter”, in order to fit into the solutions of the Einstein equations.
In our previous attempts [5, 6], we have advocated a strategy by which a formalism of extended canonical transformations is constructed in the realm of covariant Hamiltonian field theory [7, 8], which enables a description of canonical transformations of fields under general mappings of the spacetime geometry. Any theory derived from an action principle must maintain the general form of the action principle under transformations of its dynamic quantities. Consequently, those mappings are most naturally formulated as canonical transformations, hence as transformations whose rules are derived from generating functions.
Any theory which conforms to the general principle of relativity—hence, which respects the requirement of form-invariance under general mappings of spacetime geometry—must then be formulated as a canonical transformation along the well-established procedures of gauge transformations, dating back to H. Weyl [9] and W. Fock [10]. The particular transformation rule for the Hamiltonian then “automatically” provides the structure of the “gauge Hamiltonian” which renders the original system locally form-invariant. The form-invariance of the action functional is achieved by simultaneously defining both, the appropriate transformation rules for the fields, the conjugate momentum fields, and the transformation rule for the Hamiltonian. In the present context, the particular gauge Hamiltonian is to be isolated which renders a given (globally) Lorentz-invariant Hamiltonian system form-invariant under local Lorentz transformations. The gauge Hamiltonian thus describes uniquely the coupling between the dynamic quantities of the given physical system with those describing the spacetime dynamics. This conforms to the procedure generally pursued in gauge theories of gravity [11] and is in stark contrast with a postulation of a particular Lagrangian—the latter procedure was first presented by D. Hilbert [12] with the appropriate Lagrangian that led to the postulated Einstein equation. For the reader’s convenience, an outline of the gauge procedure in given in Sec. VII.
The canonical transformation approach is presented starting in Sec. II with a brief review of the extended canonical formalism in the realm of covariant Hamiltonian field theory. The theory of canonical transformations then isolates exactly those transformations of the dynamic variables which maintain the general form of the action principle—and hence the general form of the canonical field equations.
The formalism is then applied to a system of scalar and vector fields, in general curvilinear spacetime in Sec. III. The Hamiltonian is required to be form-invariant under general spacetime transformations. This enforces the introduction of the affine connections as the appropriate “gauge fields” which must obey their particular transformation rules. The affine gauge coefficients are not necessarily symmetric in their lower index pair. Hence, the torsion of spacetime is included in this theory explicitly. Introducing the affine connections as the gauge quantities promotes, in the language of gauge theories, the global Lorentz-symmetry of the given system into a local, general relativistic symmetry. This renders the action integral invariant under arbitrary mappings of the reference frame.
Yet, at this stage the affine connection coefficients constitute external gauge fields [4], with their dynamics being left open. The connection coefficients are then treated as internal dynamic quantities by letting their transformation rules follow from a particularly crafted generating function. The subsequent transformation rule for the Hamiltonian then yields the particular gauge Hamiltonian that amends the given Lorentz-invariant Hamiltonian to become a generally covariant Hamiltonian. The gauge Hamiltonian is then inserted back into the initial action integral in Sec. V. Remarkably, the Riemann-Cartan tensor is in no way introduced “by hand” as in classical GR theories, but emerges at this point of the canonical gauge formalism as the particular function of the gauge field and its partial derivatives. This constitutes the main result of our paper: the obtained generally covariant Hamiltonian represents the generic Hamiltonian that is common to any particular theory of gravity. Moreover, in order to encounter a closed set of field equations, we show that the final Hamiltonian with dynamic space-time must be at least quadratic in the conjugate momenta of the gauge fields. This contrasts with the Einstein-Hilbert theory of general relativity. The Hamiltonian describing the dynamics of the “free” gauge fields is to be inserted “by hand”, as is common to all gauge theories. This Hamiltonian must be chosen on the basis of physical reasoning. A particular “free gravity” Hamiltonian is presented in Sec. VIII.7. Concluding remarks are given in Sec. IX.
II Canonical transformation rules under a dynamic spacetime geometry
The formalism of canonical transformations, in the realm of classical field theory under dynamic spacetime, was presented earlier [5, 6]. Here it is reformulated and thereby simplified considerably. The extended canonical formalism of field theory involves the description how dynamic quantities transform under the transition from one reference frame to another, . To achieve form-invariance of the action integral, the transformation of the volume form must be taken into account. denotes the determinant of the Jacobi matrix of the transformation
[TABLE]
The volume form transforms as a relative scalar of weight
[TABLE]
A general covariant second rank tensor transforms as
[TABLE]
and hence its determinant transforms according to
[TABLE]
In the following, is supposed to denote the covariant representation of the metric tensor. Then has maximum rank and . The transformation rule for the determinant of the covariant representation of the metric tensor, , follows as:
[TABLE]
thus represents a relative scalar of weight , i.e. a scalar density. The product transforms as a scalar of weight , hence, in conjunction with Eq. (3), as an absolute scalar:
[TABLE]
is thus referred to as the invariant volume form.
The variation of the action functional for a dynamical system of a scalar field and a vector field transforms in conjunction with their respective conjugate momentum field densities and as
[TABLE]
where denotes the Hamiltonian scalar density pertaining to the scalar \mathcal{H}_{0}\big{(}\phi,\tilde{\pi}^{\nu},a_{\mu},\tilde{p}^{\,\mu\nu},g_{\mu\eta}\big{)}, while , , and denote the respective transformed quantities. The terms and thus are Lorentz scalar densities. Additional gauge quantities must be introduced for general spacetime transformations, in order to ensure that the integrands in Eq. (5) are world scalar densities, thus maintaining their form under general local spacetime transformations. This corresponds to replacing the partial derivatives of the fields in (5) by covariant derivatives.
In other words, the differences of partial and covariant derivatives define the gauge quantities. This important result is worked out in Sec. III.
The action integral is to be varied, therefore Eq. (5) implies that the integrands may differ by the ordinary divergence of a vector density , whose variation vanishes on the boundary of the integration region in spacetime. For a relative tensor of weight , a generalized version of Stokes’ theorem writes: (see [13], Eq. (E.14), for instance):
[TABLE]
The variation of the action integral (5) is not modified by adding a term to the integrand which can be converted into a surface integral according to Eq. (6)—commonly denoted briefly as a surface term: the integrand is only determined up to the divergence of the functions .
On the other hand, we are not restricted to this case here. We may confine ourselves to canonical transformations which merely provide form-invariance of the action. For the actual task of promoting a Lorentz-invariant to a diffeomorphism-invariant system by means of incorporating dynamical gauge fields, we encounter couplings of the given fields to the gauge fields. We thus modify the action functional by finally deriving an additional gauge Hamiltonian that converts the initial action integrand into a world scalar density. The crucial point is that these gauge Hamiltonians must (and will) have the same functional form in the original and in the transformed system in order to yield the required preservation of the canonical field equations, and hence to implement the postulate of diffeomorphism-invariance of the gauged system.
The integrand condition for a canonical transformation writes
[TABLE]
with the transformation rule of the volume form from Eq. (2) and to be taken at . The transformation rules are obtained by comparing the coefficients
[TABLE]
Obviously, can be devised to generate specific transformation rules of the involved fields and their conjugates—this is the reason that is called a generating function. The generating function can be Legendre-transformed into the equivalent generating function according to
[TABLE]
The transformation rules for to be taken at the spacetime event are:
[TABLE]
While the Hamiltonians and do not necessarily represent scalar densities, the total integrands in the action integrals (5) must be world scalars in order to keep their form under general spacetime transformations. This ensures that the canonical field equations emerge as tensor equations.
III General spacetime transformation of systems of scalar-,
vector-, and tensor fields
A Hamiltonian is now considered which describes the dynamics of distinct classical fields, namely a scalar field , and a vector field . The quantity in the argument list of the Hamiltonian is interpreted as the covariant representation of the (symmetric) metric tensor. The contravariant vector field denotes the canonical conjugate of in the context of covariant Hamiltonian field theory [14]. Hence, is dual quantity of the covariant vector of spacetime derivatives of the scalar field . Likewise, the tensor stands for the canonical conjugate of the covariant field vector , hence, for the dual quantity of the derivatives . A tensor , accordingly, represents the canonical conjugate of the metric tensor and hence is the dual quantity of the partial derivatives . The tensor will be introduced later, in the action functional, in order to describe the metric as an internal dynamic quantity of an amended Hamiltonian system, rather than as the external field variable in . The Hamiltonian is assumed to be form-invariant under global spacetime transformations, hence, constitutes a Lorentz scalar. The scalar field, the vector field, and the metric tensor transform under local coordinate transitions, i.e. if the transformation of the spacetime event is applied, according to
[TABLE]
These transformations are generated, in the context of the extended canonical transformation formalism of covariant Hamiltonian field theory [15], by
[TABLE]
Here and denote the independent variables in the two distinct reference frames. With Eqs. (1) and (3), the tensor density denotes the canonical conjugate of the transformed scalar fields .
In analogy, the stand for the corresponding conjugates of the transformed vector fields, , and denote the momenta of the transformed tensor field .
The particular generating function (11) embodies a contravariant vector of weight and, hence, a vector density. But this need not in general be the case: if transformations of nontensorial quantities are defined—such as the connection coefficients—then the corresponding cannot represent a vector density.
The crucial requirement is that the total integrand of the action functional represents a world scalar density.
Equation (10) constitutes a global symmetry transformation if its coefficients do not depend on the spacetime event and if is form-invariant under this transformation. In contrast, the transformation is referred to as being local if the coefficients do depend on spacetime. The Hamiltonian is then no longer form-invariant under the corresponding canonical transformation rule. Appropriate dynamic gauge quantities must then be introduced to restore the form-invariance of a then amended Hamiltonian system—as is usual for all gauge theories.
Explicitly, the new canonical transformation rules, which emerge from \tilde{\mathcal{F}}_{2}^{\mu}\big{(}\phi,\tilde{\Pi}^{\nu},a_{\mu},\tilde{P}^{\mu\nu},g_{\mu\eta},\tilde{K}^{\mu\eta\nu}\big{)} of Eq. (11), are
[TABLE]
Obviously, the required transformation rules (10) of the fields are reproduced. By virtue of Eq. (1), the momentum fields obey the rules required for relative tensors of weight . Hence, for tensor densities
[TABLE]
The general transformation rule for the Hamiltonian densities is given by Eq. (9). For the actual generating function, Eq. (11), the divergence of the explicit -dependent terms of follows as (see Appendix A for the vanishing first term)
[TABLE]
The divergence (13) can be expressed completely in terms of the original dynamic quantities by inserting the transformation rules for the momentum fields from Eq. (12):
[TABLE]
The divergence of vanishes exactly only if the second derivatives of the do all vanish:
[TABLE]
In this case, the Hamiltonian is kept unchanged. The transformation then does not depend on the spacetime location and is, therefore, referred to as global. Otherwise, for the case of a local transformation, the Hamiltonians are not form-invariant. A “gauge Hamiltonian” must be defined which matches in its dependencies on the fields those of Eq. (14). This finally yields the particular amended Hamiltonian which is form-invariant under the local transformation:
[TABLE]
Herein, the formally denote the gauge quantities, whose physical meaning will be clarified below setting up their transformation rule. Of course, the “gauge Hamiltonian” of the transformed system must have the same form in order to work out the locally form-invariant amended Hamiltonian
[TABLE]
The transformation rule of the gauge quantities and is determined by expressing the transformed gauge Hamiltonian (17) in terms of the original fields according to the canonical transformation rules (10) and (12):
[TABLE]
The demanded correlation of the formally introduced gauge quantities and is obtained by inserting the transformed gauge Hamiltonian in the representation of Eq. (18) and the original gauge Hamiltonian (16) with (14) into Eq. (9)
[TABLE]
The coefficients are compared to yield the condition
[TABLE]
The transformation rule for the “gauge fields” follows, after solving for the , as
[TABLE]
This transformation rule corresponds to the transformation rule of the affine connection coefficients. In the following, we identify the gauge fields —formally introduced in Eq. (16)—with the affine connection coefficients. In this aspect, we follow the approach of Palatini [16], who first treated the metric and the connection coefficients as separate dynamic quantities, which entails an additional equation of motion providing their mutual correlation.
A Hamiltonian system, , which is supposed to be invariant under Lorentz transformations as the global symmetry group, is rendered form-invariant under the local diffeomorphism group if and only if it is amended according to Eq. (16), provided that the gauge quantities transform according to Eq. (19):
[TABLE]
The requirement of form-invariance of the given Hamiltonian under local spacetime transformations thus induces a coupling term of the vector- and the tensor fields and their conjugates via the gauge coefficients in . The —interpreted as connection coefficients—act in a way to convert the partial derivatives in the action functional (5) into covariant derivatives. Furthermore, the metric is promoted by the gauge procedure from an external quantity in to an internal spacetime-dependent quantity, whose coupling to the vector and tensor fields is described by :
[TABLE]
Generally speaking, the form of the coupling term in gauge theories is uniquely determined by the particular global symmetry property of the system, which is rendered local. The in Eq. (19) need not be symmetric in the lower indices and [17]. Yet, restricting the theory to only the symmetric part of —which is equivalent to postulating a vanishing torsion of spacetime—greatly simplifies the field equations for the spacetime dynamics.
We have now successfully eliminated the arbitrary coefficients and their derivatives by replacing them with the physical gauge fields , i.e., the connection coefficients. Yet, these coefficients constitute external gauge fields whose dynamics are not determined by the amended Hamiltonian . In order to include the dynamics of the gauge fields into the description of the dynamical system, the generating function (11) must be amended accordingly to also define the transformation rule (19). The set of canonical transformation rules then also yields the rules for the conjugates of the gauge fields and the rule for a second amended Hamiltonian . In other words, the gauge fields are now treated as internal fields such that the system described by is closed. The set of canonical equations then establishes a closed set of coupled field equations, hence, no further gauge quantities need to be introduced. This “miracle,” as an important and welcome surprise, will be the topic of the next section.
IV Include the dynamics of the gauge fields
The extended generating function from Eq. (11) will now be amended to define the transformation law (19), i.e., the canonical transformation which maps reference frame to frame
[TABLE]
The quantities denote in this definition of an extended generating function of type , formally the canonical conjugates of the of the transformed system and, hence, the dual quantities to the -derivatives of the . As the stand for the gauge coefficients of the original system, the quantities denote, accordingly, the dual quantities of the -derivatives of the gauge coefficients of the original system. No prediction with respect to the physical meaning of and is made, at this point. Rather, their physical meaning will be determined in a later paper by setting up the canonical field equations of the final, locally form-invariant Hamiltonian.
The amended generating function (22) entails the following additional transformation rules
[TABLE]
and
[TABLE]
hence, solved for ,
[TABLE]
We observe that the inhomogeneous transformation rule for the gauge coefficients from Eq. (19) is recovered. Furthermore, the canonical conjugates of the gauge coefficients, introduced formally in the generating function (22), now transform as tensor densities.
The transformation rule for the Hamiltonians is again obtained by taking the divergence of from Eq. (22)
[TABLE]
The first term on the right-hand side of Eq. (24) is given by Eq. (20). The transformation rule for the Hamiltonians shall be expressed completely in terms of the dynamic variables: All derivatives of the functions and in (24) are to be expressed in terms of the original and transformed gauge coefficients and , and their conjugates, and , by making use of the respective canonical transformation rules (19) and (23).
This calculation was worked out earlier [6] and is rewritten in Appendix A in a notation adapted to the actual context. In summary, the coefficients in the transformation rule (24) can completely and symmetrically be expressed in terms of the physical fields of the original and of the transformed system as
[TABLE]
The terms on the right-hand side can be regarded as amendments to the given system Hamiltonians, and its transformed counterpart , which promote the given globally form-invariant Hamiltonians and to locally form-invariant Hamiltonians and . The Hamiltonians amended accordingly are form-invariant under the extended canonical transformation generated by Eq. (22).
Amending the Hamiltonian from Eq. (16) further on according to (25) yields a second amended Hamiltonian
[TABLE]
In the first stage of Sec. III, the metric was rendered an internal system variable. The gauge coefficients had to be introduced to restore form-invariance of the system. In contrast, no further gauge fields had to be introduced in the actual second stage, where the gauge coefficients are promoted to internal dynamic quantities. Rather, the gauge fields now interact with themselves, which induces the terms quadratic in to finally yield the locally form-invariant Hamiltonian .
Observe that the gauge terms occurring in Eq. (26)—hence the terms that must be added to the given Lorentz-invariant system Hamiltonian —have exactly the same structure as those of the SU (Yang-Mills) gauge theory, in the canonical formalism [18]
[TABLE]
The set of fermionic fields for the Yang-Mills case corresponds to the vector field of Eq. (26), whereas the hermitian matrix of bosonic Yang-Mills -vector gauge fields now reappear anew as the connection coefficients . In contrast the gauge fields of the SU theory, the connection coefficients are no tensors. Yet, this is a necessary property here: the non-tensorial quantities in (27) will be shown in the next section to exactly complement the non-tensorial terms in the initial action functional to render the final integrand a world scalar density.
V Insert the amended Hamiltonian into the action integral
The derivative of the nontensorial dynamic quantity now additionally appears in the amended action integral (for the notation see Sec. VII)
[TABLE]
as the affine connection , in conjunction with their conjugates, , are now internal dynamic variables of the system.
Now it is obvious why the second amended cannot represent a world scalar density by itself: the terms in must complement the nontensorial terms in (28), such that the total integrand is rendered a world scalar density. Expressing in Eq. (28) according to Eqs. (26) and (27) in terms of and yields:
[TABLE]
The terms linear in convert the the partial derivatives of and into covariant derivatives. Remarkably, the terms proportional to sum up to the Riemann-Cartan curvature tensor :
[TABLE]
Hence, the Riemann-Cartan tensor is not postulated at this point, but actually emerges from the gauge procedure: the gauge fields —introduced in Eqs. (16) and (17)—come about in the action exactly in the arrangement of Eq. (30). It is the total contraction of with the tensor density which actually yields a world scalar density. As the Riemann-Cartan tensor (30) is skew-symmetric in its last index pair, and , only the skew-symmetric part in and of the—as yet undetermined—conjugate field contributes to the action integral. Therefore, can be assumed to be skew-symmetric in its last index pair and as well
[TABLE]
The action (29) now directly reveals how the gauge procedure works. To set up a generally form-invariant action integral on the basis of a Lorentz-invariant one, the partial derivatives of the fields and are amended by the linear terms in the connection coefficients to yield covariant derivatives. In contrast, the partial derivative of the connection coefficient —which cannot be converted into a covariant derivative due to its nontensorial character—is amended by the quadratic terms in to yield the Riemann-Cartan tensor. In both cases, the gauge procedure provides tensor quantities in place of partial derivatives. This result of the canonical derivation of the gauge theory of gravity is new and is not encountered in a Lagrangian formalism.
With semicolons denoting covariant derivatives and the Riemann-Cartan tensor (30) constituted by the terms in the third line of Eq. (29), this action takes on the concise form:
[TABLE]
The given system Hamiltonian is a scalar density by presupposition, hence, the entire integrand consists of contracted tensor quantities. It thus makes up a world scalar density, as required for a generally relativistic form-invariant action integral. The action (32) is not postulated, but emerges from the gauge principle, which here means to amend a given (globally) Lorentz-invariant system Hamiltonian in a way to render it invariant under local, i.e. spacetime dependent, Lorentz transformations.
VI Add the “free-field” Hamiltonian
As usual, the gauge formalism fixes the coupling of the gauge fields with the fields described by the given system Hamiltonian but does not provide the dynamics of the “free” gauge fields, i.e. their dynamics in the absence of any coupling. If the respective gauge fields are considered dynamic (propagating) quantities, the obtained generally form-invariant Hamiltonian must be further amended in order for the canonical equations to yield nonstatic solutions for the gauge fields. Hence, the above form-invariant Hamiltonians (25) must be further amended by “free field” Hamiltonians and that obey the transformation rule
[TABLE]
in order for the final extended Hamiltonians to describe the dynamics of the gauge fields and to maintain the general invariance of the action integral (32).
In conjunction with the precondition (15) of a globally, i.e., Lorentz-invariant Hamiltonian , the final extended Hamiltonian , which is form-invariant under the corresponding local transformation generated by (22), now reads
[TABLE]
with given by Eq. (27). The correlation of the derivatives of the fields with their momenta will be identified in Sec. VIII by setting up the respective canonical field equations.
VII Summary of the gauge procedure
The starting point is a classical Hamiltonian system of a real or complex scalar field and a vector field , with the metric assumed to be of Minkowski type, . The particular Hamiltonian of this system needs not to be specified except for its property to be form-invariant under global coordinate transformations with
[TABLE]
The globally form-invariant action is to be amended to yield a locally form-invariant action that describes in addition to the scalar and vector fields the dynamics of the gauge quantities. In the first step, the metric in Eq. (35) is redefined as an arbitrary spacetime-dependent function, . Consequently, the invariant volume form is now given by . In the following, the factor is absorbed into defining the canonical momenta as tensor densities rather than absolute tensors, hence and . Correspondingly, the scalar in Eq. (35) is converted into a scalar density . Moreover, the conjugate momentum as the dual of the derivative of the metric must be introduced into the action functional
[TABLE]
The metric is now formally treated as as an internal quantity in the action functional (36) of the system. As the partial derivatives of tensors do not transform as tensors, the integrand of (36) is now no longer a scalar. The scalar property of the integrand is restored by adding the gauge Hamiltonian to the integrand
[TABLE]
with
[TABLE]
This amounts to introducing the connection coefficients as external gauge quantities and then promoting the partial derivatives in (37) to covariant derivatives. The action (37) is thus equivalently expressed as
[TABLE]
In the next step, the so far external gauge fields are treated as internal fields, which means that the description of their dynamics is to be included in a further amended action functional. This requires to define the canonical momentum as the dual of the partial -derivative of the gauge field and to add the contraction of both terms to the action
[TABLE]
As the connection is no tensor, neither is its partial derivative. Again, the invariance property of the integrand (39) is restored by supplementing the appropriate term of the gauge formalism—hence the term quadratic in —to the integrand of Eq. (39)
[TABLE]
By virtue of the skew-symmetry of in its last index pair—which follows from the gauge formalism—the terms proportional to can be merged to yield the Riemann curvature tensor (30)
[TABLE]
As a result of the gauge procedure, the action is obtained completely in terms of tensor quantities and thus represents a world scalar density. We observe that the gauge term for the partial derivative of the connection coefficients in Eq. (40) is given by the term quadratic in . This is a consequence of the fact that the coefficients in the transformation rule for the Hamiltonian from Eq. (66) can completely be expressed in term of the connection coefficients. Therefore, no new gauge fields were needed—which is the reason why the procedure to promote the global Lorentz symmetry into a local one truncates here and does not produce an infinite hierarchy of gauge fields with their pertaining transformation conditions.
In the last step, a Hamiltonian of the “free” momenta and must be introduced “by hand” in order for the corresponding fields and to be dynamic
[TABLE]
One option for defining will be discussed in Sec. VIII.7. The canonical field equations, summarized in Sec. VIII.5, follow from the action principle .
VIII Canonical field equations:
arbitrary
VIII.1 Field equations for and
As the locally form-invariant extended Hamiltonian (34) does not contain additional terms involving and , the dynamics of these fields is determined by the globally form-invariant Hamiltonian only. The respective field equations are
[TABLE]
For a vector density , the ordinary divergence of can be expressed in terms of the covariant divergence as
[TABLE]
wherein denotes the contraction of the torsion tensor
[TABLE]
Thus, for connection coefficients that are symmetric in their lower index pair, the torsion tensor vanishes identically. The second field equation follows as the tensor equation
[TABLE]
Both field equations thus emerge as tensor equations.
VIII.2 Field equations for and
Due to the coupling term in the extended Hamiltonian (34) the field equations for and acquire an additional term. The respective field equations are
[TABLE]
The partial derivatives of the fields and the terms proportional to the affine connections can be combined to yield covariant derivatives
[TABLE]
which yields the tensor equations
[TABLE]
The coupling term in the extended Hamiltonian thus converts the nontensor equations for and which emerge from the system’s Hamiltonian into tensor equations which hold in any reference frame.
VIII.3 Field equations for and
The canonical equation for the metric is
[TABLE]
hence
[TABLE]
The field equation thus means that the covariant derivative of the metric is the dual of the nonmetricity tensor—which describes the length change of a vector under parallel transport.
The canonical equation for the conjugate of the metric follows as
[TABLE]
hence
[TABLE]
The coupling terms and in the gauge-invariant extended Hamiltonian from Eq. (34) thus convert the nontensor equations for and into tensor equations. As is symmetric, is induced to be symmetric in its first index pair, .
The given Lorentz-invariant system Hamiltonian describes the dynamics in a static spacetime background. For this reason, is supposed not to depend on the conjugate of the metric, . The derivative of the Hamiltonian density with respect to the metric then represents the symmetric energy-momentum tensor density of
[TABLE]
Thus, does not describe the energy-momentum contributed by a gravitational field and a dynamic spacetime.
VIII.4 Field equations for and
The canonical equation that provides the correlation of the -derivative of the with their duals, , follows as
[TABLE]
Solved for , one finds
[TABLE]
Thus, by virtue of the skew-symmetry of in and
[TABLE]
On the right-hand side, the connection coefficients and their derivatives sum up to the combination which represents the Riemann curvature tensor (30). The field equation (52) thus states that the Riemann tensor vanishes identically everywhere—and thus any curvature of spacetime—if there is no “free-field” Hamiltonian . Therefore, it must then be added “by hand” to the derived gauge Hamiltonian in order to allow for a consistent spacetime dynamics 111A Hamiltonian that is of even power in the momenta ensures the convergence of path integrals and hence satisfies a necessary condition for quantization. A Hamiltonian quadratic and linear in the momenta has in addition the right dimension and allows for a dimensionless coupling constant which is crucial for renormalization—in complete analogy to all SU gauge theories. Otherwise, the Hamiltonian does not provide a closed set of field equations. Hence, the final Hamiltonian of geometrodynamics is at least quadratic in the conjugate momenta of the gauge fields—which is beyond the Einstein-Hilbert theory of General Relativity [see K.S. Stelle and P.C. West, Phys. Rev. D 21, 1466 (1980)]..
The divergence of is given by the derivative of the gauge Hamiltonian from Eq. (27) with respect to the
[TABLE]
This equation does not depend on the particular choice of as the latter is supposed to not depend on the gauge fields . With the gauge Hamiltonian from Eq. (27), we find
[TABLE]
In order to express Eq. (53) manifestly as a tensor equation, we write the covariant divergence of the tensor density
[TABLE]
As is skew-symmetric in and , the first term in the second line can be expressed as well in terms of the torsion tensor
[TABLE]
The field equation (53) thus actually represents the tensor density equation
[TABLE]
We thus found that all field equations emerging from are tensor equations, hence, their forms are the same in any reference frame.
VIII.5 Summary of the coupled set of field equations
With the abbreviations (30) and (43) for particular combinations of the gauge fields and their partial derivatives, the complete set of coupled field equations is summarized as
[TABLE]
The canonical equations with respect to the momenta in the left column of Eqs. (55) show that for a dynamic spacetime the covariant derivatives of the fields actually represent the canonical conjugates of the momenta. The surprising fact in the last line of Eqs. (55) is that the naïvely expected covariant derivatives of the gauge fields, i.e., the covariant derivatives of the connection coefficients —which do not exist—are replaced by the Riemann tensor as a result of the canonical gauge procedure—and thus again establish a tensor equation, as is required for a generally covariant theory.
Only with given, the entire set of eight canonical field equations for the fields , and and their respective conjugates , and is closed and can then be integrated to yield the combined dynamics of fields and spacetime geometry. As does not emerge from the gauge formalism, it must be chosen on the basis of physical reasoning. The field equations (55) thus depend on this choice. A particular choice of will be discussed in Sec. VIII.7.
VIII.6 Consistency relation
Similar to U and SU gauge theories, the set of field equations brings about a consistency condition. Differentiating Eq. (53) with respect to , the left-hand side vanishes due to the skew-symmetry of in its last index pair, as stated in Eq. (31). Accordingly, the right-hand side of (53) yields the condition
[TABLE]
The partial derivative representations of the field equations (44), (46), (48), (51), and (53) from Secs. VIII.2, VIII.3, and VIII.4 can now be inserted to yield the consistency condition (see Appendix B)
[TABLE]
This is a second rank tensor equation. In conjunction with Eq. (52), it relates the source terms of the right-hand side to the Riemann tensor terms on the left-hand side. Note that Eq. (57) holds as well for the cases of nonvanishing torsion () and nonmetricity (). Equation (57) represents a generic Einstein equation that holds for any given system of scalar and vector fields described by and the particular model for the dynamics of the free gravitational fields, as described by .
At this point, it would be interesting for the reader to find out whether our theory makes observational predictions similar to general relativity. As general relativity predicts successfully observations made in the solar system (weak field), is the gauge theory given by Eqs. (55) and (57) capable of giving something like general relativity plus small corrections? We will address these issues briefly in the following section.
VIII.7 Sample
As an example, we postulate as a linear combination of quadratic and linear terms in
[TABLE]
In contrast to the dimensionless coupling constant , the coupling constant has the natural dimension . Note that the sample Hamiltonian (58) does not depend on and thus directly induces the metric compatibility condition according to Eqs. (55). In a subsequent paper, we will discuss the more general case of a which also depends quadratically on .
The correlation of the canonical momentum to the Riemann tensor then follows from Eq. (52) as
[TABLE]
with
[TABLE]
the Riemann tensor for a maximally symmetric -dimensional manifold with constant Ricci curvature . The derivatives of with respect to in Eq. (57) then cancel for a symmetric Ricci tensor, hence for vanishing torsion , as
[TABLE]
The derivative of with respect to follows as
[TABLE]
Substituting the -terms according to Eq. (59) and writing the derivative of with respect to the metric as the given system’s metric energy-momentum tensor according to Eq. (50)
[TABLE]
the consistency relation (57) for the Hamiltonian (58) emerges as
[TABLE]
and thus represents a generalized Einstein equation. The scalar (spin-[math]) field contributes to the source merely via its metric energy-momentum tensor terms, whereas the vector (spin-) field contributes in addition with the first two terms on the right-hand side of Eq. (60), which sum up to the canonical energy-momentum tensor . So, for systems with only scalar fields, the right-hand side of Eq. (60) reduces to that of the Einstein equation.
Without the term proportional to the coupling constant , the left-hand side of Eq. (60) reduces to the Einstein tensor. The coupling constants and contained in (58) can be expressed in terms of the cosmological constant and the gravitational constant as 222 As a consequence of the particular definition of from Eq. (58), the coupling constant in front of the quadratic Riemann tensor terms in Eq. (60) is fixed. In our forthcoming paper, we investigate a generalization of , which introduces a third additional coupling constant and hence allows more flexibility in the relations between , , and .
[TABLE]
Solutions of the field equation for particular systems , namely for Klein-Gordon, Maxwell, and Proca systems, will be discussed in detail in our subsequent paper [21].
Whether or not our theory can provide new insights with respect to the dark matter issue remains to be clarified.
Remarkably, the metrics obtained from Eq. (60) for the exterior regions of nonrotating black holes or rotating black holes coincide with those solving the Einstein equation with cosmological constant. In other words, the vacuum field equation
[TABLE]
is likewise satisfied not only by the Schwarzschild metric [22], but also by the more general Schwarzschild-De Sitter and the Kerr-De Sitter metrics. Thus, for a vanishing right-hand side of Eq. (60), both parts of the left-hand side, the quadratic part (62) and the “Einstein part” are satisfied by the same metrics. As a consequence, the classical tests of general relativity, namely, the bending of light, the perihelion shift, and the Newtonian limit are equally passed by the solutions of the field equation (62). However, the metrics obtained from Eq. (60) for cases where matter/fields are present will be shown to be different from those emerging from the Einstein equation. This changes, for instance, the prediction of measurable observables of neutron stars.
IX Conclusions
By means of the framework of canonical transformations, we have demonstrated that any (globally) Lorentz-invariant Lagrangian/Hamiltonian system can be converted into an amended Lagrangian/Hamiltonian which is form-invariant under a general local transformation of the reference frame, following the well-established lines of reasoning of gauge theories. No assumptions or postulates were incorporated into the theory. Remarkably, the Riemann-Cartan tensor is not worked in by hand, but crops up from the gauge formalism as the specific combination of the gauge fields, i.e., the connection coefficients, and their partial derivatives. In particular, our approach includes a nonvanishing torsion of spacetime and is not restricted to the usual assumption of metric compatibility.
Thus, the description of the spacetime dynamics emerges from basic principles only, namely the action principle and the requirement of the form-invariance of the action integral under general spacetime transformations—which ensures the general principle of relativity to be satisfied. The ensuing coupling of spacetime dynamics with matter fields involve the affine connection coefficients, which thus act as gauge quantities. The derivation was worked out in the Hamiltonian framework making use of the canonical transformation formalism—which by construction ensures the action principle to be maintained in its form. The integrand (32) of the final action integral was shown to represent a world scalar density and thereby meets the requirement to be form-invariant under general spacetime transformations.
The reader might wonder about the constraints that arise in conventional Hamiltonian formulations of gauge theories. To address this issue, we must recall a general feature of the covariant (DeDonder-Weyl) Hamiltonian formalism. Generally, if a Hamiltonian in point dynamics does not depend on a dynamical quantity, then the canonical conjugate quantity is a constant of motion. The analogue applies in covariant Hamiltonian field theories. So, in our case of a diffeomorphism invariance, the divergence of the th column (or row) of the total system’s energy-momentum tensor vanishes if from Eq. (34) does not explicitly depend on —which is the case for a background-independent system. In this regard, our formalism differs from the standard -split Hamiltonian description (see, e.g. [23, 24]).
For the closed description of the spacetime dynamics, a Hamiltonian which describes the dynamics of the “free” gauge fields must be postulated. This is a common feature of all gauge theories and reflects here the residual indeterminacy of any gauge theory of gravity. In this sense, we have derived the generic part of the description of geometrodynamics which is common to all specific theories described by a Hamiltonian that are based on a particular .
Most importantly, we found that in any case spin- fields contribute with additional source terms to the equation of motion for the metric—which do not occur for spin-[math] fields. Furthermore, the canonical formulation of the gauge theory of gravity requires a term quadratic in the canonical momenta of the gauge fields in order for the set of field equations to be closed. This contrasts with the Einstein approach, which is restricted—in its Hamiltonian formulation—to a linear momentum term.
Acknowledgements.
The authors are deeply indebted to the “Walter Greiner-Gesellschaft zur Förderung der physikalischen Grundlagenforschung e.V.” in Frankfurt for its support. They thank E. Guendelman (Ben Gurion University, Israel), A. Redelbach (University Würzburg, Germany), and A. Koenigstein (University Frankfurt, Germany) for valuable discussions.
Appendix A EXPLICIT CALCULATION OF THE TRANSFORMATION RULE (25)
First, we show that the second term on the right-hand side of Eq. (24) vanishes identically. According to the chain rule, we have
[TABLE]
By virtue of the general identity for the derivative of the determinant of a matrix with respect to a matrix element
[TABLE]
the -derivative of in Eq. (63) is converted into
[TABLE]
Inserting Eq. (LABEL:eq:H-deri-expr) into (63) then yields
[TABLE]
In order to express the third term on the right-hand side of Eq. (24), the partial derivatives are first of all written in expanded form
[TABLE]
This expression is now split into a skew-symmetric and a symmetric part of in the indices and according to
[TABLE]
For the skew-symmetric part, , the two terms in (66) which are symmetric in and vanish, leaving
[TABLE]
The two mixed terms in and cancel each other due to the skew-symmetry of in and .
The contribution of (24) emerging from the symmetric part can be expressed in terms of the derivatives of the connection coefficients, whose transformation rule is
[TABLE]
Thus
[TABLE]
The total transformation rule (66) expressed in terms of connection coefficients is then
[TABLE]
Appendix B EXPLICIT CALCULATION OF THE CONSISTENCY EQUATION (57)
Equation (56) reads, in explicit form:
[TABLE]
The partial derivative representations (44), (46), (48), (51), and (53) of the canonical field equations can now be inserted to replace all derivatives with respect to , which yields
[TABLE]
All terms depending on the affine connections cancel, as can be seen after rearranging and relabeling some running indices:
[TABLE]
The remaining terms constitute the second rank tensor equation (57)
[TABLE]
Appendix C LAGRANGIAN DESCRIPTION
In order to derive the Lagrangian representation of the consistency equation (67), we start again with Eq. (56), which writes in explicit form:
[TABLE]
Inserting the canonical field equation (53) into the rightmost two terms yields
[TABLE]
Regrouping the terms and relabelling some running indices leads to
[TABLE]
wherein the term is added an subtracted in the -dependent terms. As is skew-symmetric in and , the last two terms in parentheses can be abbreviated by onehalf the Riemmann-Cartan tensor, defined in Eq. (30). All other sums in parentheses can be expressed as covariant derivatives and torsion tensor terms:
[TABLE]
From the definition of the Legendre transformation
[TABLE]
one finds inserting the respective canonical field equations from the coupled set (55):
[TABLE]
With the momentum fields as the duals of the corresponding canonical equations,
[TABLE]
the consistency equation (57), resp. (67) finally acquires the equivalent Lagrangian representation:
[TABLE]
Note that the covariant derivatives of the vector field and the metric actually constitute the dual quantities of the respective canonical momenta, and . Remarkably, minus one-half the Riemann-Cartan tensor turns up here as the replacement of the non-existing covariant derivative of the connection field , with the pertaining dual field.
The consistency equation in the form of Eq. (68) exhibits a clear interpretation of its components. The two derivatives with respect to the metric represent the classical Einstein approach to vary the Lagrangians of gravity and matter fields with respect to the metric. The derivatives of with respect to the vector field and its covariant derivative emerge from the generalization of the theory to also include vector fields in place of the restriction to scalar matter fields. On the side of the gravitational Lagrangian , the term proportional to the covariant derivative of the metric stands for the generalization to “non-metricity”, hence a not covariantly conserved metric. Finally, the difference of the terms proportional to the Riemann tensor can be shown on the basis of (58) to vanish in the case of a torsion-free spacetime, and hence serves to include torsion effects into the generalized description of gravity.
By virtue of the identities for scalar density valued functions of tensors and the metric [25], one encounters the following identities from the dependencies of the Lagrangians and :
[TABLE]
The consistency equation (68) thus has the equivalent representation (after dividing by ):
[TABLE]
The left-hand side of Eq. (69) is minus the covariant version of the canonical energy-momentum tensor of the Lagrangian system of scalar and vector fields. In a flat spacetime this tensor is given by:
[TABLE]
Yet, in a curved spacetime this is not a tensor! Remarkably, the canonical gauge formalism yields this expression with the partial derivatives of the vector field replaced by covariant derivatives. The consistency equation (69) thus requires to define the generalized energy-momentum tensor of as
[TABLE]
which maintains its tensor property in a curved spacetime. The covariant derivative of the vector field induces the connection in addition to the metric to act as a coupling quantity.
The right-hand side of Eq. (69) can be interpreted as the canonical energy-momentum tensor associated with the Lagrangian of the free gravitational field, hence
[TABLE]
It represents the generic energy-momentum balance equation
[TABLE]
which conforms with the conjecture of a zero-energy universe [26, 27, 28, 29] and holds for non-zero torsion as well as non-metricity. For metric compatibility, hence for , Eq. (70) reduces to the Einstein-type equation:
[TABLE]
For the Hilbert Lagrangian as a particular model Lagrangian for the free gravitational field, one immediately encounters the form of the classical Einstein equation
[TABLE]
which here also holds for non-zero torsion with a non-symmetric Ricci tensor in the case of a non-symmetric energy-momentum tensor .
The Lagrangian that corresponds to the sample Hamiltonian , defined in Eq. (58) is obtained via the Legendre transformation
[TABLE]
To obtain the proper Lagrangian, the momenta must be replaced by the Riemann tensor according to Eq. (59), which finally yields:
[TABLE]
Inserting this Lagrangian into the generic Einstein-type equation (71) yields the pertaining field equation:
[TABLE]
which coincides with Eq. (60) for the definitions (61) of the coupling constants and .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Einstein [1915] A. Einstein, Sitzungsberichte der Königlich-Preussischen Akademie der Wissenschaften 1915 , 844 (1915).
- 2Schwarzschild [1916] K. Schwarzschild, Sitzungsberichte der Königlich-Preussischen Akademie der Wissenschaften 1916 , 189 (1916) . · doi ↗
- 3Weyl [1921] H. Weyl, Raum, Zeit, Materie (Springer, 1921). · doi ↗
- 4Utiyama [1956] R. Utiyama, Phys Rev 101 , 1597 (1956).
- 5Struckmeier [2013] J. Struckmeier, J Phys G: Nucl Phys 40 , 015007 (2013).
- 6Struckmeier et al. [2015] J. Struckmeier, D. Vasak, and H. Stoecker, Astron Nachr/AN 336 , 731 (2015) . · doi ↗
- 7De Donder [1930] T. De Donder, Théorie Invariantive Du Calcul des Variations (Gaulthier-Villars & Cie., Paris, 1930).
- 8Weyl [1935] H. Weyl, Ann Math 36 , 607 (1935) . · doi ↗
