Quantum gravity in timeless configuration space
Henrique Gomes

TL;DR
This paper proposes a novel approach to quantum gravity by encoding it in a timeless configuration space, leading to emergent time and a Schrödinger-like evolution that avoids common issues in traditional quantum gravity formalisms.
Contribution
It introduces a framework where boundary conditions are derived from fundamental principles, resulting in a unique reduced configuration space and emergent time, applicable to theories like shape dynamics.
Findings
Derives a Schrödinger equation in the classical limit for weakly coupled fields.
Provides a method to calculate gravitational semi-classical probabilities.
Avoids the multiple solution problem of standard WKB in quantum gravity.
Abstract
On the path towards quantum gravity, we find friction between temporal relations in quantum mechanics (QM) (where they are fixed and field-independent), and in general relativity (where they are field-dependent and dynamic). This paper aims to attenuate that friction, by encoding gravity in the timeless configuration space of spatial fields with dynamics given by a path integral. The framework demands that boundary conditions for this path integral be uniquely given, but unlike other approaches where they are prescribed --- such as the no-boundary and the tunneling proposals --- here I postulate basic principles to identify boundary conditions in a large class of theories. Uniqueness arises only if a reduced configuration space can be defined and if it has a profoundly asymmetric fundamental structure. These requirements place strong restrictions on the field and symmetry content of…
Click any figure to enlarge with its caption.
Figure 1
Figure 2Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Quantum gravity in timeless configuration space
**Henrique Gomes111[email protected]
***Perimeter Institute for Theoretical Physics
31 Caroline Street, ON, N2L 2Y5, Canada*
Abstract
On the path towards quantum gravity we find friction between temporal relations in quantum mechanics (QM) (where they are fixed and field-independent), and in general relativity (where they are field-dependent and dynamic). This paper aims to attenuate that friction, by encoding gravity in the timeless configuration space of spatial fields with dynamics given by a path integral. The framework demands that boundary conditions for this path integral be uniquely given, but unlike other approaches where they are prescribed — such as the no-boundary and the tunneling proposals — here I postulate basic principles to identify boundary conditions in a large class of theories. Uniqueness arises only if a reduced configuration space can be defined and if it has a profoundly asymmetric fundamental structure. These requirements place strong restrictions on the field and symmetry content of theories encompassed here; shape dynamics is one such theory. When these constraints are met, any emerging theory will have a Born rule given merely by a particular volume element built from the path integral in (reduced) configuration space. Also as in other boundary proposals, Time, including space-time, emerges as an effective concept; valid for certain curves in configuration space but not assumed from the start. When some such notion of time becomes available, conservation of (positive) probability currents ensues. I show that, in the appropriate limits, a Schroedinger equation dictates the evolution of weakly coupled source fields on a classical gravitational background. Due to the asymmetry of reduced configuration space, these probabilities and currents avoid a known difficulty of standard WKB approximations for Wheeler DeWitt in minisuperspace: the selection of a unique Hamilton-Jacobi solution to serve as background. I illustrate these constructions with a simple example of a full quantum gravitational theory (i.e. not in minisuperspace) for which the formalism is applicable, and give a formula for calculating gravitational semi-classical relative probabilities in it.
Contents
1 Introduction.
1.1 Motivation
Conventional approaches to quantum gravity suffer from serious conceptual and technical problems. On the technical side, most focus has been concentrated on the perturbative regime and on extending the theory to higher energy domains while maintaining some semblance of unitarity. Conceptual problems on the other hand, are starker when trying to make sense of the non-perturbative theory. There we witness a violent clash between, on the one hand, the standard quantum mechanical properties of:
- 1a)
the quantum superposition principle
- 2a)
the non-locality of instantaneous measurements;
and, on the other, the general relativistic properties of
- 1b)
a fixed causal structure
- 2b)
space-time covariance
These two pairs still leave out difficulties quantum cosmology faces arising directly from the measurement problem in the foundations of quantum mechanics, which the present work also aims to address. Of course, the non-perturbative regime poses infinitely more challenging obstacles to quantitative treatment. Nonetheless, resolving a conceptual incongruence between full quantum mechanics and general relativity might point us to different approaches, friendlier to quantum gravity. The aim of this paper is indeed to point out a framework in which such incongruences are resolved. The framework suggests the adoption of alternative formulations of gravity at a fundamental level, such as shape dynamics [1] (see also [2] and references therein).
The first clash, between 1a and 1b, arises because, intuitively, a quantum superposition makes sense for space-like separated components of a system – i.e. belonging to a single constant-time hypersurface. But, to be meaningful, the label ‘space-like’ already requires a fixed, unique,222Unique up to conformal transformations. gravitational field.
Accounting for back-reaction, it becomes obvious a problem lurks here. We understand how non-back-reacting matter degrees of freedom can be in states of superposition in a background spacetime, e.g. what we usually mean when we write something of the sort in the position space representation. But due to the universality of the gravitational interactions, back-reaction would necessarily put the gravitational field in a state of superposition as well. How does a superposed state gravitate – and therefore affect the causal structure? The problem is also present in covariant axiomatic QFT language, where one demands that operators with space-like separated support, commute, . Again, ‘space-like’ presumes a definite property for the very field we would like to be able to superpose.333For QFT in curved spacetimes, different “time” problems are present. There, the issue rears its head in the absence of Killing vector fields. The issue is that one should find solutions of the dynamical equations , which are also eigenfunctions of the time derivative (i.e. along the Killing vector), . One then uses this eigenfunction split into positive and negative frequencies to construct associated raising and lowering operators, a Fock space, etc. In the absence of a preferred time-like direction in space-time, raising and lowering operators become less meaningful (see [3] for a more complete treatment of this point). What is the meaning of this demand when the operators are acting on states representing superpositions of gravitational fields? It is hard to tell, and it seems to me there are very few ways forward in thinking about these problems.444See [4] for a notable exception, using the formalism of partial and complete observables [5] (see [6, 7] for reviews).
Further tension lies between 2a and 2b. It stems from distinct roles of (global) time in quantum theory and in general relativity. In quantum mechanics, all measurements are made at “instants of time”, and in this sense only quantities referring to the instantaneous state of a system have physical meaning. Dynamics is most naturally understood as an evolution from one instantaneous state to another. On the other hand, in general relativity, a global “time” is an arbitrary label assigned to spacelike hypersurfaces, and physically meaningful quantities are required to be independent of such labels. In other words, in GR, only the spacetime geometry is measurable, and thus only the histories of the Universe have physical meaning. The dynamics of the gravitational field must be carved out of the theory by subjecting it to an initial value formulation, an unnatural procedure from the covariant standpoint.
When (naively) merging quantum theory and general relativity, it should cause no surprise that the only physically meaningful surviving quantities are those which are both: 1) instantaneously measurable (i.e. refer to quantities defined on a spacelike hypersurface) and 2) depend only on the spacetime geometry (i.e. are independent of a choice of spacelike hypersurface) (see e.g. [8, 9]). In a full geometrodynamical theory, this creates great constraints on what constitute reasonable quantum observables [10, 11].
Lastly lies a further distinction (but not necessarily a clash) between the notions of local time, or duration, in the two theories. In classical GR, there exists one notion of local time inherent in the theory, independently of the space-time metric: proper time. Proper time gives an approximate notion of duration along a world-line, it is a dimensionful quantity that is not given in relation to anything else. It has this role whether a space-time satisfies the Einstein equations or not, and is therefore a kinematical, general proxy for elapsed time. On the other hand, in modern relational approaches to quantum mechanics [12, 13], one can parse two notions of time: time as measured by relations between subsystems and just an external evolution time. One of these subsystems is usually called the “clock”; its emerging notion of local time is akin to duration, but only becomes available through relational dynamics. The other, global evolution time, is indeed not dynamic, but it also has no meaningful parallel in GR.
These issues surface on the technical front as well; efforts to obtain a viable notion of quantum evolution in quantum gravity – even at the semi-classical level – founder for a variety of reasons [14].555At least outside of drastic truncations of field space, such as minisuperspace. In many cases, such truncations do not commute with quantization [15]. My claim is that these reasons have a common birthplace. They arise from the distinction between unitary evolution and reduction processes in quantum mechanics, and from the confluence of dynamics and kinematics in general relativity.
1.2 A diplomatic resolution
This paper outlines a proposal to address such questions. Although it is still of largely qualitative character, the aim of this first paper is to discuss the mechanisms by which one could implement said proposal.
My present attempt to untangle both the problem of the superposition of causal structures and the problems of time consists in basing gravitational theories, at the kinematical level, on fields for which there is no notion of causality. Causality emerges, but only dynamically. In this way, physics should be encoded in a path integral operating in timeless configuration space, embodying only compatible spatial symmetries and with prescribed boundary conditions. Such symmetries allow us to define and exploit a reduced space of physical configurations. No such reduced configuration space exists for theories with local refoliation invariance — such as GR — even abstractly.
The boundary conditions required for the constructions here are extremely special. They are based on a fundamental asymmetry of the reduced configuration space (configuration space modulo gauge-symmetries). Namely a reduced configuration space must not only exist, but also have a unique, most homogeneous element. This requirement places extreme constraints on the field content, on the symmetries, and on the topology of the manifold, but its satisfaction is what will allow most of the constructions here to be defined unambiguously. Anchoring the transition amplitude on this unique boundary,666Or more precisely, this unique ‘corner’, as we will see below. the present framework provides a ‘first principles’, non-covariant alternative to the Hartle-Hawking prescription of the global wavefunction of the Universe [16].
It also addresses questions that need to be answered in any fully relational, timeless theory referring to instantaneous states, such as Hartle-Hawking and Vilenkin’s tunneling proposals. These proposals yield a wave-function whose argument is an instantaneous field content, e.g. , where is the 3-metric and some spatial matter field. In this respect, such theories thus share many of the features of the framework constructed here. The difference, again, is that in such approaches, the wave-functions still need to correspond to covariant (i.e. space-time) quantities. Outside of minisuperpsace approximations, the requirement of covariance (at a fundamental level) does not allow such theories to be describable in a physical configuration space, even abstractly, and thus they do not assuage the conflicts between QM and GR I have outlined in the previous section.
As I will show, the present setup yields a single static wavefunction and a corresponding volume-form in physical configuration space. The passage of time is then abstracted from a particular notion of ‘records’ – subsets of configuration space where the volume form concentrates. Causality emerges thus as an approximate pattern encoded in the volume-form when it is well-described semi-classically. In this sense, no separate causal structures ‘superpose’ — the frozen patterns we associate with them can at most be said to mesh and interfere in configuration space. Unlike previous work in which one attempts to emulate the effects of wave-function collapse within standard unitary evolution, here both evolution and reduction should be emergent from a fundamentally timeless description of the entire Universe.
Here, space-times don’t exist a priori, but require a relational construction of duration – or definitions of clocks – on classical solutions. I.e. in general one must define clocks and an experienced duration through the classical evolution of relational observables; I call such an emergent notion of time duration-time. Indeed, the price to pay in the present, purely relational framework, is apparent in trying to recover, in the appropriate limits, the local, experienced time of GR. In general relativity, experienced time is directly related to proper time, but the gravitational theories compatible with the framework have no universal replacement for proper time. This feature might not be so damaging; it is also the case in relational approaches to quantum mechanics that one constructs clock-time from relations between subsystems [12], as discussed in the previous section.
The challenge to the sort of theories encompassed by the present work thus shifts, from the standard fundamental problems mentioned above — superposition of causal structures, untangling evolution and symmetries, and reduction vs unitary evolution — to one of recovering a smooth space-time description. While it may indeed be difficult to recognize classical relativity of simultaneity from the global evolution-time — i.e. the global time parametrization that will follow from the construction of records — it can be gleaned when using duration-time, as we will see.
In sum, the claim here is that, although unorthodox, this approach resolves issues both with the superposition of causal structures and the ‘evolution vs reduction’ dichotomy, while bringing the notion of duration in gravity closer to the non-unique versions of ‘clock systems’ in quantum mechanics. The constructions advocated in this paper can similarly work with other types of timeless configuration space; they do not limit themselves to the geometrodynamic setting in which they are mostly pursued here. Even the consequences of this approach to the construction of gravitational models will be only touched on here, being further pursued in an upcoming publication, as well as in [17].
Roadmap
I will now describe the main constructions of this paper and its relevance for quantum mechanics and quantum gravity. I will start by making clear what are the main assumptions, or axioms, of the work. This is done in section 2. The non-relativistic setting will in many ways resemble standard particle quantum mechanics. Leaving at most a single reparametrization constraint, the assumptions are then perfectly compatible with past work on relational dynamics and quantum mechanics, which I very briefly review in section 3. At the end of this section, I include a proof that the principles single out the Born rule as a measure in configuration space. Finally, I discuss another problem that is not fully addressed in relational quantum mechanical approaches within the context of path integrals: a replacement for the role of the epistemological updating of probabilities in the timeless context, through the notion of “record-holding submanifolds”, in section 4. Lastly, I sketch a gravitational dynamical toy model based on strong gravity, to illustrate the structures introduced here.
2 Axioms
I would like to examine the conceptual picture that emerges from theories for which time plays no role at the kinematical level. I will first clarify the axioms I am led to adopt in order to have a geometric theory of gravity without such kinematical causal relations.
The five given structures that the present work is based on are the following:
A closed topological manifold, . Prior to defining our fields, we require some weak notion of locality, which I will take to mean ‘open neighborhoods’. In the gravitational case, I will take this to be provided by the closed topological manifold (i.e. compact without boundary), of dimension .777 The demand of being closed is a consequence of relationalism. It does not imply that through evolution we couldn’t get causal horizons, or that there are no meaningful definitions of effectively open subsystems within . If one wanted to apply the following constructions to a fundamentally discrete theory, could be replaced by a lattice, or piecewise linear manifold, on top of which the physical degrees of freedom live. 2. 2.
The kinematic field space . In the field theory case, this is the infinite-dimensional space of field configurations over . Each configuration – the ‘instantaneous’ field content of an entire Universe– is represented by a point of , which we denote by . These should correspond to relational data, on which the dynamical laws act. For defiteness, I will take field configurations to be sections of tensor bundles over . The example for pure gravity would be , the space of positive smooth sections on the symmetrized (0,2)-covariant tensor bundle. In principle the constructions here should apply for non-causally related observables of any kind, such as e.g. field values on a lattice. 3. 3.
“The Past Hypothesis” – boundary (or ‘initial’) conditions for the Universe: The requirement of such boundary conditions for the wave-function goes in line with the fact that, if time plays no fundamental role, the wave-function of the Universe must be given uniquely, and only once. Thus I require unique boundary conditions in order to anchor the construction of a unique transition amplitude. Unlike what is the case with the ‘no-boundary’ proposal [16], this will be a choice of ‘the most homogeneous’ configuration, , with respect to which the wave-function is defined, , see equation (1), below. As I will show, furthermore plays a fundamental part in the definition of ‘records’, and for ‘records’ to function in the capacity their name suggests, the boundary conditions should correspond to field configurations which are as ‘structureless as possible’. Below, I give a criterion that can, in some circumstances, select a unique such boundary state. 4. 4.
An action functional on curves on . I.e. , for , invariant wrt the gauge group acting on , , and (up to boundary terms) depending only on first derivatives in the time parametrization. The action should be invariant with respect to the given gauge symmetry group , defining , for . For this definition, the gauge-symmetry group needs to have a pointwise action on . Such an action functional will be used for the transition amplitude between two physical configurations, for , and :
[TABLE]
To avoid cluttered notation, I will drop the square brackets in most of the paper, calling attention when the distinction between full configuration space and the reduced one becomes material. 5. 5.
A positive scalar function . I will call this function the pre-probability density, , as it will give a measure in configuration space. To serve our purposes, must preserve the multiplicative group structure,
[TABLE]
For future reasons I will term (2) the factorization property of the density. It is a required condition for the definition of record that I introduce here to have physical significance.888And for it to have cluster decomposition properties [18]). It is the specific form of this obeying (2) which encodes the Born rule.
These axioms are quite standard implicit choices in most work related to field theory, here I am only making these choices explicit. Axiom 5 is implicit in the Born rule, and 3 is usually explicitly stated in the literature, either as boundary conditions, or as some initial condition.
The only structure required for doing physics, arising from the five premises above, is the density over , given by
[TABLE]
where is restricted by axiom 4, is given by axiom 5, and is defined from the action functional given in axiom 3, and (12) and (14). This measure gives a volume form on configuration space. It gives a way to “count” configurations, , and thereby the likelihood of finding certain relational observables within . It is assumed to be a positive functional of the only non-trivial function we have defined pointwise on , namely, . I should stress that roughly the same or similar assumptions are implicit in both Hartle-Hawking and Vilenkin’s tunneling proposals.
Here, we have a pre-conceived notion of space, or rather, “of things which are not causally related”, but not necessarily of time, in either its absolute or relativistic forms. We have the space of relational objects on which dynamical laws should act.
At the end of the day, we have at our disposal the density over field space, . But without space-time, and without any notion of absolute time, can we still extract some physics from the formalism? I would like to show that there can still be enough structure in the timeless path integral in timeless configuration space for doing just that.999Moreover, this formalism could apply to any relational system with a specified configuration space, action over curves on it, and unique maximally homogeneous point. For example, certain types of tensor network models. This is what I will focus on for most of the paper.
2.1 Relationalism and the symmetry group .
Intimately related to our clash 2a and 2b is what is known as the ‘problem of time’. The GR Hamiltonian mingles local gauge symmetries and evolution [14, 8].101010I note that while a relativistic particle also has a Hamiltonian which generates a single reparametrization invariance, it does not mix this symmetry with local gauge transformations [14]. Moreover, as a subsystem, it is easily describable by relational observables. I give a broader analysis of the time problems in appendix A, and related ones in their formulation of a transition amplitude in superspace, in appendix A.2. At least without the use of matter fields there is no preferred manner to split the Hamiltonian constraints such that all but one are fixed by a “definition of simultaneity”– a partial gauge-fixing of the Hamiltonian well defined everywhere in phase space.
Although the main constructions of the paper should be applicable in a more general setting, let’s investigate the standard case for gravity in closer detail.
The standard ADM action and symmetries.
Upon a Legendre transformation, the vacuum Einstein-Hilbert action yields primary and then secondary first-class constraints, . The action can then be put in the form
[TABLE]
where are Lagrange multipliers and
[TABLE]
with some algebra given by
[TABLE]
It has been shown that the local constraints above are essentially unique if the algebra they generate is required to mimic the commutation algebra of vector fields orthogonally decomposed along a hypersurface of space-time (the hypersurface deformation algebra). Thus the Hamiltonian constraint, and therefore the local Wheeler-DeWitt equation, are intimately tied to a covariant picture of space-time.
Under the transformation generated by the flow of the constraints,
[TABLE]
and under the more ad hoc
[TABLE]
the action transforms by a boundary term:
[TABLE]
which clearly vanishes for constraints linear in the momenta, or if the generator of refoliations, , vanishes at the initial and final surfaces.
Whereas,
[TABLE]
has pointwise dependence in , the transformation
[TABLE]
depends not only on the metric, but also on the momenta. We could also have obtained a similar result directly from the ADM decomposition (with no Legendre transformation): decomposing a vector field along the time direction and tangential components to the hypersurface, , one obtains (assuming zero shift):
[TABLE]
where is the Lie derivative along the vector field tangential to the hypersurface. This part of the action is easy to make sense of: it is the infinitesimal action of a spatial diffeomorphism, acting on the spatial metric through pull-back: Diff.
On the other hand, equation (10) shows us that refoliations act on the metric in a manner also depending on its velocity.111111In fact, the strict relationship between the Hamiltonian constraint and refoliations holds only on-shell. It is the action of refoliations that makes it difficult to meaningfully define surfaces and points in superspace, the quotient space Diff. For suppose one has two different curves intersecting at , , , for a positive-definite symmetric tensor (they intersect at ). Since the action of the spatial diffeomorphisms depends solely on the metric, the two curves will still intersect after the action of (or ), as they will be jointly moved. But with the action of , the curves will be shifted: , and will not intersect anymore. Taking the quotient by (spatial) diffeomorphisms will not help since it is easy to find such that for all Diff.121212That this must be so is easy to ascertain from a simple degree of freedom count: the space of symmetric tensor has 6 degrees of freedom per space point, whereas the generators of spatial diffeomorphisms can only account for three.
Even though is an infinite-dimensional space, there is at least one situation in which such curves can still intersect at different . If the transformation corresponds merely to a reparametrization along each curve, , the transformed curves merely shift their time-parameters:131313This would happen order by order for higher contributions in powers of . , and thus the curves will still intersect, but now at . However, if the refoliation is not spatially constant, generically the curves will miss each other, also in superspace.
This fact makes it doubtful that one can implement boundary conditions on superspace for the path integral that are physically meaningful from a covariant space-time perspective.
The present requirement on symmetries.
The alternative explored here requires its fundamental symmetries to be generated by local constraints linear in the momenta. The treatment of such linear symmetries, including their quantization, is then much more straightforward than for local constraints quadratic in the momenta. I will show how this non-fundamentally-covariant setting still allows the emergence of an on-shell refoliation invariance.141414I should note that in the ADM Hamiltonian formalism for GR [19], refoliations only generate a symmetry on-shell in any case [20].
Although the principle of having a pointwise action in configuration space is generically applicable, in the case of gravity in metric variables, i.e. Riem(M), the symmetries which satisfy this criterion are found by requiring a phase space representation of first-class constraints linear in the momenta,
[TABLE]
where is a constant in , and are appropriate (tensor) smearings of the local constraint densities, , e.g.
[TABLE]
the conjugate momentum to the metric is , and curly brackets denote the Poisson bracket wrt this canonical relation. Linearity in the momenta implies that the symmetries have an intrinsic action on configuration space, as required, i.e.
[TABLE]
i.e. its action on depends only on and the smearing. The first class property implies that these constraints correspond to symmetries. It turns out that, under some assumptions, that the allowed non-trivial symmetries that act as a group in the metric configuration space — and thus fulfill our axioms — are diffeomorphisms and scale transformations (see [17] for more details).
In the field theoretic framework proposed here, relational observables are those that live in the quotient – e.g.: those whose spatial position and scale are only defined relationally – and thus I will bypass a more complicated relational phrasing of the amplitudes by just assuming that all statements are suitably translated when necessary, from to . In the presence of this structure, boundary conditions for the path integral will have physical meaning.
The demands of our axioms thus severely restrict the form of the infinite-dimensional groups , and imply that configuration space form a principal fiber bundle ,151515This is not quite true, for the base space may not form a manifold, as it doesn’t for the spatial diffeomorphisms. However, insofar as I will use this structure for a field-space connection 1-form – which I associate with an abstract observer – the group can be restricted by a further condition: only allow the diffeomorphisms that maintain a given (observer location) and a observer frame at , fixed. Calling this restricted set of diffeomorphisms Diff, then the space Diff is indeed a manifold [21] and one can use its principal fiber bundle structure in the usual ways. making the quantum treatment of gauge-symmetries straightforward. For instance, unlike what is the case for ADM [19], in which one requires the more complicated use of the FBV formalism [22, 23], standard Fadeev-Popov is sufficient to give rise to a well-defined BRST charge; the Fadeev Popov determinant is a true determinant, which implements gauge-covariance of the gauge-fixed path integral. It is the principal fiber bundle structure that allows us to unambiguously consider individual points and dynamics in the quotient space , an impossibility when gauge symmetries become tangled with dynamics (as is the case with ADM).
In the 3+1 path integral setting here, one can make use of the principal fiber bundle structure for a straightforward treatment of gauge-symmetries, with the use of a a field-space connection 1-form, , as described in appendix B. This field-space connection 1-form selects a way to horizontally lift to a given curve in . For instance, take Diff and Riem(M), , the vector fields with the algebra being just the commutator algebra, acting on the metric with the Lie derivative. The horizontal projection of a field-space vector, is given by . I.e. it acts as a field-dependent shift in the ADM formalism.161616The connection 1-form itself can be related to a choice of equilocality and associated to abstract, non-back-reacting, observers [24] (see also footnote 15). In a completely gauge-covarint way, it selects a manner in which an observer dynamically distinguishes physical transformations from pure gauge transformations, along time. This is a generalization of Barbour’s concept of “best-matching coordinates’ [2]. According to (88), under a time-dependent diffeomorphism, Diff, generated by the vector field ,171717Strictly speaking, we are only considering diffeomorphisms connected to the identity. Outside of this domain, many problems appear. For instance, there are always diffeomorphisms arbitrarily close to the identity which are not connected to it through the exponential flow of some vector field. This poses problems for the path integral. we have the following transformation of :
[TABLE]
which is precisely the transformation required of the shift from (8) under time-dependent spatial diffeomorphisms.
Summary of this section.
In this subsection I have sketched how the local symmetry groups can be more or less uniquely selected in such a manner as to have a pointwise representation in configurational field space. In this manner, a well-defined reduced configuration space exists, and I guarantee that local symmetries will be distinguished from dynamics. Moreover, a gauge-invariant path integral may be constructed making use of the connection-form, possible from the emerging principal fiber bundle structure (see also [24, 17] for a more complete account).
2.2 Uniqueness of preferred boundary conditions
The principle I will use to select boundary conditions is based, roughly, on information content. I would like to set the anchor of the transition amplitude (1) to be the configuration (or reduced configuration) that carries the least amount of information. What I mean by that is that it carries the most amount of symmetries, under my group of transformations. I.e. it is the most “homogeneous” configuration.
As mentioned in footnote 15, it is not true, for groups which don’t act freely on configuration space, that the quotient forms a manifold. Indeed, if there are elements of that remain fixed under the action of subgroups of , such as is the case with and Diff for instance, then the quotient can be at most a stratified manifold [25]. A stratified manifold is basically a union of manifolds of different dimensions, with concatenated boundaries of boundaries. The standard example is a cube, with its bulk being of dimension 3, and its boundary being composed of a further union of manifolds; a face of dimension two, with its boundaries composed of a further union of manifolds; edges of dimension one, with its boundaries composed of a further union of (zero-dimensional) manifolds. The points with the highest isotropy group will correspond to the lowest dimensional boundaries (of all the other boundaries). In other words,
[TABLE]
we have:
[TABLE]
where each is a manifold with boundaries, such that for .
Given , let be the set composed of all the most homogeneous field configurations, i.e. the subset of elements with the largest dimensional isotropy subgroup of . This will stand in for the configurations being ‘as structureless as possible’. For example, let us take and , acting through pull-back . Then let
[TABLE]
Allowing for degenerate metrics, we have a unique such point, , the completely degenerate metric, since it has the full Diff(M) as an isotropy group.
In accordance with the symmetry principles of the theory, as described above, one could extend the group Diff to the one given by the ‘maximum geometric group’, Diff, with the Weyl group of conformal transformations (through pointwise multiplication of the metric by positive scalar functions , and a semi-direct product between the two groups) [26, 17].
In fact, the case of Diff does not require further specification of the field space boundary at all (as is required, for example in Hartle-Hawking [16]). This property can be seen either by parametrizing physical space with unimodular metrics, or using the horizontal lifts of the previous section. In the extended case, the orbit in corresponding to the completely degenerate metric is not continuously path-connected to the rest of . For , this leaves only , the round metric, as generating the unique allowed past orbit.
In the first case, using unimodular metrics as the conformal section of the principal fiber bundle ,181818Note that since this group is Abelian, there is no Gribov problem [27]. for a curve of metrics in to change signature, the determinant must become degenerate, which disconnects the physical spaces of positive definite signatures from those with other signatures. In other words, the ‘cone’191919Riem is a cone inside the affine space of sections of symmetric (0,2)-covariant tensors, , in that the sum of two metrics is still inside Riem, but not their difference [25]. which makes up the boundary of Riem inside the affine space becomes unreachable coming from .
In the horizontal lift picture, the orbits are defined by vectors of the form . For any ultralocal supermetric in configuration space of the form
[TABLE]
where where , and is a function of the metric and its (spatial) derivatives, the orthogonal vectors to the orbits are going to be of traceless form. Thus the standard horizontal lifts orthogonal to the Weyl orbits (through any standard canonical supermetric in Riem) imply a traceless velocity, , which does not change the volume-form . Thus such curves cannot reach metrics with different signatures than the initial ones, as this would require going through a zero in the determinant of the metric. We can thus formulate the path integral (see section below) in terms of horizontal lifts and without stipulating further boundary conditions.
Summary of this section.
In either the cases of Diff or Diff, the quotient space is only a stratified manifold, and the orbits corresponding to are the lowest strata, the ‘ultimate boundaries’, or the lowest dimensional corners, of , and the natural place to set up boundary conditions. When there is a unique least structured configuration, which is also the corner of corners in reduced configuration space, the boundary conditions can be uniquely specified. This is the case for Diff on . Moreover, no further boundary conditions for the path integral on field space need to be specified.
In other words, the specification of such a unique initial point in the amplitude kernel is sufficient to fully determine the entire wave-function of the Universe. Moreover, as mentioned above, the existence of such a preferred point will be fundamental in defining a ‘record’, which, in its turn, will replace standard notions of time.
3 Path integrals in configuration space
There are different ways of relating the standard relational setting for quantum mechanics to the path integral formalism. In the end, the constructed wave-function should obey some form of the reparametrization constraint. In appendix C I report on work of Chiou showing, for a relational particle model, how a rigorously defined timeless path integral regains this constraint, and how it reduces to the standard transition amplitude in the presence of a subsystems that behaves like a clock. The mechanisms used in this proof (e.g. the Riemann-Stieltjes integral in terms of a mesh) can be recycled for building the path integral with only global reparametrization constraints, as we have here.
Equation (94) guarantees that in the presence of a reliable clock in a given portion of configuration space, one can recover the standard quantum mechanics transition amplitude purely relationally. But it is silent in what regards the existence of such subsystems. As we will see, the asymmetry of reduced configuration space will give us enough structure to build such time functions in the appropriate approximations, even for the (infinite-dimensional) gravitational case.
3.1 The basic definitions
Given an action functional as above, a connection-form , and a preferred ‘in’ configuration (see The Past Hypothesis, below), the (timeless) transition amplitude (or propagator) to the orbit of the configuration is given by a timeless Feynman path integral in configuration space:202020Rigorously, I should start with a phase space action, and only if the momenta can be integrated out of the path integral – which up to the measure amounts to a Legendre transformation – move onto a configuration space action. Here I overlook these issues.
[TABLE]
where here acts on an arbitrary representative of the final point of the transition amplitude, , and is integrated over with some measure. A Haar measure is not required here; unlike the standard case, we are not doing a group averaging procedure, each path on the base space corresponds to at most one . If the curvature of the field-space connection form is zero, there is no relative holonomy on the fiber for two paths , interpolating between and the orbit of . Thus all the lifts for the paths will end up in a single height of the orbit, let’s say . Then the path integral will acquire a functional delta: , cancelling the integral over .
In the case of gravity, we would have:
[TABLE]
The class of paths under which this is integrated over are the horizontal lifts of paths in , as explained in the previous section. I.e. is a path that has a horizontal velocity (i.e. it is a horizontal lift through ), ending at . The implementation of horizontality will in general incur a Jacobian, which substitutes the standard Fadeev-Popov determinant. For the purposes of this paper, the specification is not necessary.212121In the interest of completeness, horizontality by being orthogonal to the fibers generated by the group of conformal diffeomorphisms, Diff, as discussed below, for the standard supermetric in , will implement transverse and traceless conditions on the metric velocities, . The appropriate Jacobian is calculated in appendix F. We note that in the case of odd-dimensional Weyl symmetries, there is no conformal anomaly, and thus the measure can be suitably made Weyl-invariant in conjunction with the action. This is also true in the Hamiltonian setting if the anomalies have a local representation [28]. This procedure eliminates the degenerate directions of the action functional — and makes the projection of the Liouville measure non-degenerate — and does not suffer from Gribov ambiguities (see [17] for more details).
Regarding the measure , Barvinsky (see [29], Sec II) has shown how to split such functional measures into a lower dimensional functional integral over spatial fields, and then another integral over parametrizations. The first order part is just the projected Liouville measure:
[TABLE]
where we are using DeWitt notation, so that here runs over both the tensor indexes and the spatial continuous ones, and
[TABLE]
and is the functional partial derivative. This is nothing but the invertibility matrix between velocity and momenta of course (and it is only non-degenerate in the absence of gauge symmetries). Of course, by using the horizontality conditions, here one would replace . Suppose some gravitational action is given (again) by (11)
[TABLE]
where does not depend on the velocities, and , according to [24, 30]. Then
[TABLE]
which has non-zero determinant. As mentioned above, a choice of a connection form is largely equivalent to a choice of gauge, but more suited to the context we are applying here. For a very simple connection form for the diffeomorphisms, one would get that horizontal vectors are those for which , i.e. the transverse ones [30].
Going back to the more general case, when combining the measure with the one-loop determinant integrated over time parametrizations, one finds the amplitude: , which is only a determinant over the spatial fields, with no time integration, but for the on-shell action (equation 3.24 in [29]). The semi-classical approximation will be recapitulated in section 3.
The measure for the spatial diffeomorphisms (connected to the identity), are given by first replacing the diffeomorphisms by the path-ordered exponential of vector fields. I.e. by vector fields such that (here on the rhs actually indicates the coordinates of the image of under ), and such that , i.e. . Calling again the structure constants of the commutator algebra of diffeomorphisms, given implicitly in e.g. (8), and introducing the continuous matrix , Teitelboim has shown that [31]: . Finally, if the Weyl group is part of , as it is in the example given, Diff(M), the integration over the conformal factor cancels out with a functional delta, because the conformal field-space connection-form is flat; all lifted paths end up at the same height in the conformal orbit, as mentioned above (between eqs (12)-(13)).
For more on the folding properties of the path integral, and other technical details, we can follow Teitelboim in the procedure given in [31], where only the conditions on the shift are local. These details, for a specific conformally invariant model, are made explicit in [17].
The horizontal lift and integration over the final point guarantees that the action functional is invariant wrt to gauge transformations everywhere but the initial point (which can be fixed by boundary conditions). Note also that there is no redundancy in this integral. Each path in the reduced configuration space is lifted uniquely. If the field-space connection had zero associated curvature, no integration over would be necessary, for all horizontal lifts would end up at the same height in the fiber over the final point .
Summary of this subsection:
In this subsection, we have briefly and formally sketched how the field space path integral can be defined in the timeless context. We have explored the fact that the local gauge groups allow the construction of a reduced configuration space, to formulate a natural “particle-like” path integral formally defining a gauge-invariant wave-function. We have largely used the example of gravity with spatial diffeomorphisms and Weyl transformations. From now, unless otherwise specified, I will keep a more abstract approach that ignores the presence of gauge-symmetry.
3.2 The semi-classical transition amplitude
Now we move back to the more general field configuration space specified above. First, I should note the ubiquity of interference experiments relying solely on multiple extremal paths; double slit and all sorts of interferometers rely on no dynamical information besides a semi-classical approximation with multiple extremal paths interpolating between initial and final configurations.
Indeed, here I will mostly study transition amplitudes between configurations that have at least one extremal (classical) path interpolating between them, and only briefly touch on more general cases in the accompanying [32].222222In the usual quantum mechanics setting, if the energy of the particle is lower than a potential barrier, then the fixed energy transition amplitude from one side to the other is exponentially decaying in the phenomenon known as tunneling. One can model the transition using imaginary time, or an Euclidean version of the path integral. Having said this, much progress has been made in explaining tunneling in the usual (or real time) path integral [33, 34]. Moreover, we can separate fields, as I will discuss in section 4.2. One field can be in a semi-classical approximation and serve as background, while the other undergoes evolution through the Schroedinger equation in this background. In the context of path integrals in configuration space, I will be in the semi-classical (or WKB, or saddle point), approximation (in the oscillatory domain).
Explicitly, the setting is given by a path integral in configuration space, (12), for (locally) extremal paths parametrized by the set , , between an initial and a final field configuration , where here stands for both the tensorial and continuous indices. I will denote the on-shell action for these paths as . The expansion, which is accurate for in arc-length parametrization, is then:
[TABLE]
where is field-independent, and the Van Vleck determinant has been defined as
[TABLE]
We here write the action as a functional of its initial and final points along . To clarify the notation, if we wanted to use continuous indices explicitly, we would have e.g.: the on-shell momenta defined as
[TABLE]
where we used DeWitt’s mixed functional/local dependence notation . For a proof of (16) in finite dimensions in this context, see [35], and in the Euclidean field theory setting, see [29], Sec III.
I do not concern myself with the normalization factor for the moment, nor with the effect of the Maslov index (which emerges after focusing points only).232323Equation (16) is valid up to the point where the first eigenvalue of reaches an isolated zero, which is a focal point of the classical paths. At such points the approximation momentarily breaks down. However, it becomes again valid after the focal point, acquiring the phase factor known as the Maslov index , which is basically the Morse index (given by the signature of the Hessian) of the action. The field determinant contained in the Van Vleck factor is the only element that would require regularization. As previously mentioned, this Van-Vleck determinant arises from the combination of the projected Liouville measure and the one-loop determinant. In more generality, these semi-classical prefactors can be translated into each other also in the field theory setting using what are known as “reduction methods for functional determinants” [29], for which standard regularization methods can be applied.
It is fruitful to present this object in the context of Riemannian geometry, for later purposes. There, its given by a time integral of the expansion scalar along a geodesic congruence. I.e. let the Lagrangian be just the infinitesimal line element of a curve, in a -dimensional (semi) Riemannian manifold, and the action is the length of the curve, . Let then be a geodesic congruence, defining the expansion as , the Van-Vleck can be written as:
[TABLE]
where is the proper distance along between and , and is the arc-length parameter along . The geometric version of the Van-Vleck can be seen as quantifying the focussing or defocussing of classical trajectories interpolating between and and around . It provides a useful analogy to dynamical systems in configuration space, and a bridge between classical and quantum behavior, which we will explore later in section 5.
Indeed, the standard interpretation of (17) is as an indicator of the spread of classical trajectories in configuration space. That is, the Van Vleck determinant measures how the dynamics expand or contract extremal paths along configuration space, i.e. how densely the initial configurations are transported by the equations of motion to the final configurations.242424The interpretation of this fact is quite ubiquitous, for instance, the Raychaudhuri equation – which describes the spreading of geodesic congruences – has a compact formulation in terms of the Van Vleck determinant [36]. The same is true of the Jacobi fields used to perform the semi-classical expansion of the path integral [37]. As in the particle case, I assume that the classical field history maps an infinitesimal configuration space density to an infinitesimal configuration space density , and the Van Vleck determinant gives a ratio of these densities as propagated by paths around :
[TABLE]
Under the assumption that there exists at least one classical (locally extremal) path between the two configurations, we find for the absolute value squared of the amplitude:
[TABLE]
Here, to unclutter notation, I have omitted the dependence of the Van-Vleck on the configurations. Interference terms can be clearly identified from (20).
The most appropriate way of extending equation (16)-(20) to higher order of approximations was described in [37]. This is still based on extremal paths, and it can be incorporated with a piecewise approximation as well. Although I will not technically require or use these higher approximations, they provide – at least in principle – a way of extending the analysis done here to a more general context.
Summary of this subsection.
I have here summarized necessary concepts regarding semi-classical approximations for oscillatory path integrals in configuration space. Most important is the appearance in second order of the Van-Vleck determinant; it has the interpretation of a ‘focussing’ of extremal trajectories.
3.3 The Born rule
In the absence of a separation into system and apparatus, the meaning of measurements and of the Born rule become more nebulous. Since my ultimate aim is to apply the constructions here to the whole Universe and to cosmology, I need to address the emergence and meaning of the Born amplitude in the present context.
I have left the form of the function , which “counts” the number of configurations in small regions undetermined. We can associate the Born rule with a relative ‘density of observers’ if we interpret the likelihood of finding oneself e.g. in a region around configuration relative to configuration as the relative volume of these two regions. I now turn to this.
Decoherence is a form of dynamical diagonalization of the (reduced) density matrix. I will use a form of diagonalization appropriate to timeless configuration space, assuming there is no significant interference between different extremal trajectories.252525This is the description of decoherence used extensively in the consistent histories formulation [38]. In this regime, using (20), we can compare the density function to the classically propagated volume elements in configuration space.
From (19),
[TABLE]
Where there is only one extremal path connecting and , namely . More generally, one would have multiple extremal paths between and . This fact forbid us to take to define the densities, since it is path dependent, giving only , not – which needs to take into account interference from other paths. Nonetheless, for the semi-classical kernel, from the no-interference terms in (20), we also have
[TABLE]
Thus if we demand that our density functional gives the classically propagated densities in the no-interference limit, i.e.
[TABLE]
we find that from (22)
[TABLE]
Since we will have a unique choice of , given by the preferred ‘vacuum’, or in-state , we can absorb into the normalization . This proves that, semi-classically, the Born rule emerges from the classical propagation of volumes in configuration space.
But even outside of the semi-classical regime, demanding positivity of , it is easy to see that if one expands the function into polynomials of , i.e. , one would obtain:
[TABLE]
and, on the other hand
[TABLE]
Only the diagonal terms of (25) can match the polynomials of (24). Therefore, only one , say for can survive, and only if . By the semi-classical limit above, (22), .
Alternatively, taking , by differentiating both sides wrt and setting we obtain:
[TABLE]
and a similar equation for the complex conjugates, , which means that the function is homogeneous: , and demanding positivity, .262626I thank W. Wieland for these remarks. By the above classical limit, we must finally have that .
Therefore, extending (23) to the full volume form, from the static wave-function over configuration space ,272727Alternatively, we could have defined a “vacuum state” existing over a trivial (one complex dimension) Hilbert space with the usual complex space inner product, over , and then taking as an operator such that . gives
[TABLE]
Equation (26) – i.e. the Born rule – is the extension of to arbitrary points .
Note that no mention of “measurements”, or “observations” need to be made. Volumes in configuration space suffice.
Summary of this subsection.
The decomposition property of the function is necessary if we want to translate certain statements about the amplitude to statements about probability. Here I have shown that given just this property and implementing a semi-classical limit, one can uniquely derive the Born rule as giving the volume-form in configuration space.
4 Records
Given the Past Hypothesis, even allowing for an objective meaning to a relational transition amplitude, , we should be able to do science from assumptions about relative number of configurations in . It is true that while examining the results of an experiment, all we have access to for comparison are the memories, or records, of the setup of the experiment.
Since we are in a context that cannot rely on absolute time, the meaning of measurements and the updating of probabilities needs to be significantly modified. Here, I will give a semi-classical definition of records. This definition should be seen as a mathematical pre-requisite for any functioning role for records; they do not, however, pinpoint the physical encoding of information in physical structures.
4.1 Semi-classical records
I will denote a recorded configuration as , and the manifold whose elements have as a record, as . If these represent experiments, coexisting with the given record, each copy of “the experimenter” will find itself in one specific configuration, . The manifold , consisting of all those configurations with the same records, then represents the setup of the experiment. In the cosmological setting this could be, for instance, all of the configurations that contain a ‘record’ (as defined below) of the surface of last scattering. The post-selection is locating your configuration within .
Loosely, a configuration will be defined to hold a record of (the recorded) configuration if, for a given ‘in’ configuration , all extremal paths from to go through . This definition is meant to embody the idea that ’s “happening” is encoded in the state . At least semi-classically, ‘branches’ of the wavefunction contributing to the amplitude of have “gone through” .
One note of caution: in the oscillatory semi-classical regime, when one speaks of the major contribution to the amplitude as coming from extremal paths, what is really meant is that there is a coarse-graining of paths, seeded by the extremal ones, in which the paths close to the extremal ones enjoy constructive interference, whereas ones that deviate much have their interference wash out. To be rigorous, in the companion paper [39] I have formulated the constructions here directly from specific types of coarse-grainings of paths. The results below go through without problems.282828 In that more rigorous setting, for the semi-classical record to be of order , the length of the extremal paths that seed the coarse-graining, , need to obey , and, up to this order of approximation, the preferred extremal coarse-graining defined in [39] cannot resolve if lies along the actual extremal paths, or just around them. Moreover, to extend this notion to that of piece-wise extremal paths, I need to use that notion of extremal coarse-grainings as well.
Definition 1
Given , the action on curves in configuration space, and the collection of parametrized extremal paths such that , , then is said to have a semi-classical record of if for each , there exists a , such that and .
And with this definition we can prove the following
Theorem 1
Given a configuration with a semi-classical record of , then
[TABLE]
Combined with property (2) for the density functional, this means that the equation for the probability of automatically becomes an equation for conditional probability on ,
[TABLE]
where .
Using the semi-classical approximation (16),
[TABLE]
If the system is deparametrizable, this means that one of the configuration variables can be used as “time”, and extremal trajectories can be monotonically parametrized by this variable. Then we could use (94), and the semi-classical composition law for the semi-classical transition amplitude (see [40])292929Here we are crucially assuming the ‘folding property’ for the path integral. See [31]. to write:
[TABLE]
for a given intermediary time . Choosing , there is a unique configuration through which all of the paths go through, . The integral gains a , since extremal paths pass only through that point at , and thus:
[TABLE]
In the more general case however, there is more work to be done.
Using (16), we first write the rhs of (27),
[TABLE]
where are the sets of extremal paths interpolating between and and those between and . What we want to show is that to order , the contributing paths will be those that are continuous at . After this is done, we need to show that the Van Vleck determinants have the right composition law. This is done in appendix D.
Strings of records
For more than one recorded configurations, say , definition 1 demands that each extremal path go through in one order or another. Let contain multiple semi-classical records , i.e. . Then an ordering of the records must exist for each extremal curve. For each , the set is ordered: .
In the presence of a single element ,303030And remembering that the on-shell action of between and must be much larger than for records to be defined. we can concatenate . The decomposition of the density follows:
[TABLE]
In other words, if there is one single (no interference), and the semi-classical limit holds (i.e. the action spacing between the records is larger than ), then the records in yield a coarse-grained classical history of the field. In this case we recover a (granular) notion of classical Time.313131In a context of consistent histories, the separation of order greater than between records will avoid the argument of Halliwell concerning the quantum Zeno effect [41].
We can still obtain a consistent string of records for many interfering branches if the ordering given for each , coincide, i.e. if for all and any , we have . Then kernel still can be decomposed:
[TABLE]
This matches the analysis performed by Halliwell, in which he recovers “time” from a simple Hamiltonian Mott bubble chamber ansatz,323232Where spontaneous emission of -particles from a source ionize bubbles in a chamber of water vapor [42]. finding that the amplitude for -bubbles to be excited, with the -th bubble configuration being , is given by:
[TABLE]
where are Green’s functions (for the free Hamiltonian), are projections onto small regions of configuration space surrounding the -th configuration, and is the dimension of configuration space. For this derivation, Halliwell notes that it is essential that there is some asymmetry in configuration space, marked by the source of the -particles. It is the same here: we require axiom 5 of an’origin’ of configuration space (see also [32] for an extended version of this relationship).
The definition implies that a configuration can hold many records, and an ordering among these, with earlier recorded configurations being themselves recorded in later recorded configurations. Through this ordering, a semblance of global time emerges from a fundamentally timeless theory. Again, this is in line with Halliwell’s reconstruction of time in the Mott chamber context [43]. Configurations that are far from the given initial configuration – but still connected to it by extremal paths – will in general have concentrated amplitude, and hold more records.
Relative probabilities of regions in with the same records.
The setup of an experiment implies the fixing of a submanifold in configuration space, , characterized by its points all possessing the same records. In this abstract simple example, the submanifold is characterized by a single configuration , which could represent for instance, “the whole laboratory setup of a double slit experiment and the firing of the electron gun”, or, “the cosmological surface of last scattering”.
We can compare transition amplitudes for configurations with the same records in the following way, given , for any initial :
[TABLE]
This ratio gets rid of a common quantity to both density functions.
In most experimental settings then, equation (31) justifies the practical use of the record configuration as the effective initial point in the kernel. It is as if a measurement occurred, determining . While it is true that the theory is timeless, we can still give a meaning to active verbs such as “updating” (e.g. of our confidence level): to the extent that humans are classical systems, extremal paths in configuration will reflect anything that the equations of motion predict, including rational (and irrational) “updating” of our theories.
Summary of this subsection.
I have here defined records. These are structures that can arise given our volume-form on , and which have many interesting properties. They yield conditional probabilities (in much the same way that the Mott bubbles do); as correlated volumes in configuration space that emulate a notion of “causation”. I have only discussed these structures in the semi-classical regime, whence one obtains at most a ‘granular’ history 333333With records separated by action of order greater than . In the arc-length parametrization of the following sections, this is a separation in superspace.
It is important to note here that our arguments from section 2.1, regarding the gauge-dependent character of intersecting curves in superspace if refoliations are allowed as a local symmetry. I.e. two curves might intersect before, but not after the action of a refoliation. This is not the case if one allows only the sort of local symmetries we have considered here, and/or global reparametrizations. This is the reason why the relativistic particle and minisuperspace models would not present such a problem. But it shows a disconnect, in so far as the concept of records explored here is concerned, between minisuperspace and the full theory with local refoliations.
4.2 Records and conservation of probability
The generic case should be one of redundancy of records in .343434 I mean this in two ways: first, that many different subsystems will have redundant records of the same “event” (subset of a configuration, in the sense of [18]). For example, all the photons released by an electron hitting a fluorescent screen. By comparing the consistency of these records, one can formulate theories about the configuration space action and the probability amplitude. This “consistency of records” approach, has been recently emphasized in the context of selecting the basis for branch selection in decoherence [44]. Since in this paper I’m not explicitly dealing with subsystems – a topic I have dealt with in [18] – I will ignore this type of redundancy; although it is very important for doing science. In other words, within one could have a polygamy of record relations; e.g.: with . This happens for example in the case of strings of records, discussed above. Ideally, when discussing conservation of probabilities, we would be able to discard such redundancy.
A choice of subset of for which there is no such redundancy will be called a screen, written as . In other words,
[TABLE]
Screens are important when we want to discuss conservation of probability. As I will now show, we can expect probabilities to be conserved between records and the total amplitude of its screen.
According to (94), whenever we have a good ‘clock’ subsystem, we know that the propagator will obey a Schroedinger equation with respect to clock time. It follows that suitably defined currents and probabilities will obey conservation laws. However, part of the challenge of quantum cosmology is precisely to inform us of physics when such clock systems are not available. Therefore, here I will show how a stand in for conservation of probability can be extracted from records.
Now, in standard quantum gravity, there is no fixed causal structure, and thus there is difficulty in defining concepts such as conservation of probability. In our case it is clear that e.g. for actions that are of geodesic type in configuration space, at the semi-classical (WKB) level probability conservation laws should hold, since the form of the equations resemble that of a particle in a constant potential in a zero energy eigenstate.
For a screen as defined through (32), by the definition of semi-classical records 1, if extremal trajectories go through a small volume , the total semi-classical probability flux for any screen will not exceed that around (or of a previous record-screen). In other words:
[TABLE]
where the extremal paths that leave and intersect the screen are parametrized by . In this particular discussion, it is assumed that there is no interference at the screen; to each point on the screen corresponds a single . Moreover, the flux is equal to the Born volume of a (infinitesimally) thickened region. In particular, since
[TABLE]
this means that
[TABLE]
If every extremal path that crosses also intersects the screen, then, up to higher orders of , the inequality of (34) is saturated.
Application to timeless configuration space
Suppose then that we were able to find a metric in configuration space for which extremal trajectories of the action are given by geodesics.353535These are called Jacobi metrics, and there is wide class of dynamical systems that can be put into this form [45]. From the Jacobi procedure, this does not imply that there is no potential for the dynamical system, just that it can be reabsorbed into an effective metric in configuration space, with the use of specific conformal factors.
Using DeWitt functional notation, let the index represent the totality of both continuous and discrete indices of the spatial field in question (thus contraction includes spatial integration). Then, for a configuration-dependent supermetric , and configuration space curves ,
[TABLE]
Where is the conformal Jacobi factor. Reparametrization invariance implies a reparametrization constraint, since
[TABLE]
(where represents the functional partial derivative). We thus get:
[TABLE]
Note that (37) is not a local equation like the usual scalar ADM constraint [19], since there is a hidden integration on all contracted indices. It is a bona fide global conservation law, and that makes all the difference. Although we are not here at a homogeneous (minisuperspace) approximation, we can still use many of its standard techniques and results. Of course, in the standard superspace approximation of homogeneous fields, equation (35) would no longer contain the local (integrated over) index, and only the actual indices appearing in (37).
For the reparametrizations generated by (37), the algebra is a bit more laborious, but in the end one obtains, in (9), for a global (in space) parameter, just and . Thus, from (78), it follows that the wave-function constructed from the path integral satisfies the constraint:
[TABLE]
where I have chosen the non-self-adjoint factor ordering, with a possibly non-ultralocal supermetric symmetric in .363636Non ultralocal, means that and is not necessarily proportional to a Dirac delta. Such generalization is useful in order to facilitate point-splitting regularization methods, even if one takes the ultralocal limit later. With the polar form for a wavefunction (assuming that there is at most one extremal solution connecting to ),
[TABLE]
with (the on-shell action from to ) and real functionals, we obtain
[TABLE]
with . To , it implements the Hamilton-Jacobi equation (at order 0) and the standard current conservation law (at order ),
[TABLE]
with . This will hold for any such Jacobi action, and without approximations other than the semi-classical one.
Recovering Schroedinger.
But we would like to recover an approximation of the Schroedinger equation, for a weak coupling to gravitational degrees of freedom. We will now roughly go over a similar derivation by Banks [46], here adapted to our setting; namely a simple geometrodynamical model.
Suppose that one separates two kinds of fields, , i.e. should be seen as some sort of source matter field for the metric. Suppose moreover that the gravitational field is mostly unaffected by the source field, since the coupling is assumed to be very weak. In this approximation, for the analogue of Hamiltonian (37), we would have (reinstating and ):
[TABLE]
where is the Planck mass (which will give us an ordering), basically establishing the separation of scales between the gravitational and source Hamiltonians, and where, to avoid ambiguity now that we will explicitly refer to local and functional dependence, I used square brackets to denote functional dependence. The gravitational functional Laplacian is:
[TABLE]
where the simplest example of a supermetric is , and the potential is left completely arbitrary.
Assuming moreover that the WKB state now takes the form
[TABLE]
Here I am assuming that the WKB part of the wave-function only holds for the gravitational field. In other words, we are at the no interference limit for gravity.373737In the path integral context, the perturbations due to the source fields should remain “small”, i.e. within the extremal coarse-graining (defined in [32]), which define a tubular bundle around the extremal paths. The two conditions can be shown to be identical. Note moreover that the split between and is largely arbitrary. This allows us to set as satisfying the functional differential equation (implementing a conservation law along the direction of the momenta):383838The difference between self-adjoint factor ordering and the one we chose here, amounts to a difference in the above definition; whether we put the supermetric inside or outside – our choice – of the functional derivatives. Had we chosen the self-adjoint one, with the supermetric inside the derivatives, this would have amounted to using a covariant divergence in superspace, instead of the simple one we used. I decided to use the non-covariant one for simplicity of the formulas, but everything here is translatable to the covariant (self-adjoint) context. Note moreover, that there are two covariances at work here. One is the one given by the principal fiber bundle structure; it ensures that the relevant structures live in the reduced configuration space. The other, which I have not given much attention to, ensures that quantities don’t depend on the coordinates in reduced configuration space.
[TABLE]
with the components of this (bare) current being:
[TABLE]
and still leaving the semi-classical wave-function general.
Expanding (41), at order we get the Hamilton-Jacobi equation, as expected:
[TABLE]
and next order () we obtain:
[TABLE]
and finally, using (44), one obtains that the state functional satisfies the first order functional variational equation:
[TABLE]
For a given region in , one might choose a smooth screen by finding a functional time such that:
[TABLE]
which is a standard choice for time functions for Wheeler-DeWitt in minisuperspace (see eq 10.25, [14]).404040This is almost the integrable arc-length parametrization (which need not exist for large regions of configuration space), but which in general should exist for given small region (around a record, for example). It is almost, but not quite, because we have not used the Jacobi metric. Had we done so, then it is trivially checked that, from the Hamilton-Jacobi equation, one would get . In the Jacobi metric, the standard way of building such surfaces is the following: particular solutions to the Hamilton-Jacobi equations define a congruence of classical trajectories. By choosing an initial arbitrary surface and Lie dragging along the (arc-length parametrized) trajectories, one builds a foliation. In our case, we can Lie drag from the (small region around the)393939As made explicit in [32], records, and the coarse-grainings around extremal paths, are not infinitesimal, but form finite regions in configuration space, determined by the degree of distinguishability of said regions, a distinguishibility which in its turn is determined by the accuracy of the semi-classical transition amplitude. record configuration, and thus there is little arbitrary choice of the initial surface. The remaining directions would be the ones orthogonal to the gradient of the time function (they would span the screen).
Using the choice of time (47) on (46), we obtain the Schroedinger equation (on a given classical background),
[TABLE]
It is easy to show that the probability currents and densities of gravity and sources also factorize, and the Hamiltonian is Hermitean with respect to the usual Schroedinger inner product for the source fields. I show this in appendix E. And thus we obtain the standard Schroedinger interpretation in the given timeless background satisfying the Hamilton-Jacobi equation.414141 We could also have records obeying the approximations for the different fields. This naturally allows for tunneling of the source fields on a given background.
For example, if we want to find the partial infinitesimal flux of the bare probability current (45), through our smooth screen as defined by (47), we just need to take the inner product between the vectors and the respective current (i.e. ). In components, we have:
[TABLE]
since then, from the Hamilton-Jacobi equation, .
Infinitesimally, in this approximation, the flux of current is given by the probability:
[TABLE]
where we used the Hamilton-Jacobi equation. This equation is approximately conserved along the flow of , as per (44), and it represents the infinitesimal Born volume of the region corresponding to the infinitesimal thickening of the screen area element.
Differences from Wheeler-DeWitt.
This approach has profound advantages in comparison to the analogous Wheeler-DeWitt calculation – which has many significant flaws. Here I will list only two of these flaws, which I believe are the most relevant (for a complete list see [14], pgs 54-62), and explain how the present approach overcomes them.
- •
Superposition problem. The previous calculations, but done in the standard – minisuperpsace WdW – context, are bedeviled by a lack of uniqueness of the Hamilton-Jacobi functional. In our WKB ansatz, the only assumption we required was that we were in the no interference domain for the gravitational part (interference for the source terms was still allowed by our form of (43)). For most gravitational systems, this just means that records are sufficiently far away (in terms of arc-length distance) from the region being considered so that gravitational decoherence has ensued (see section 5 below).424242For an instantiation of this fact, and a rationale based on chaos for which such non-linear classical interactions should quickly (in terms of arc-length time along extremal paths in reduced configuration space) should suffice for establishing decoherence (in the consistent histories sense). See also [32]. With our preferred initial configuration , there is no further choice that goes into determining the solutions to the Hamilton-Jacobi equation.
This is not the case in general, and in particular it is not the case for WdW. Due to the linearity of the equations, one could in principle have a solution in the form:
[TABLE]
The Hamilton-Jacobi equation is not linear in , and thus two solutions don’t necessarily yield another solution.434343Not to mention they might have different boundary conditions.
In the presence of more than one Hamilton-Jacobi solution, one loses almost all the nice features of the semi-classical approximation above. For instance, the derivation of the Schroedinger equation (48) no longer holds. Even if one assumes that and separately satisfy the Hamilton-Jacobi equation, and that and separately satisfy the conservation equation, one still only obtains a sum, for :
[TABLE]
which provides many more solutions than just the individual Schroedinger equations. Moreover, such a superposition would allow for solutions with negative norm. In the WdW case, since the long-distance dynamics is silent about initial conditions (and their Hamilton-Jacobi associated solution), unless the short-distance dynamics somehow select a unique solution, the wave-function will not have a desired semi-classical interpretation.Moreover, it seems that, no matter what the short-distance dynamics is, if is a solution, so is . It is easy to find solutions of the form (51) which then have zero (Klein-Gordon) norm.
The reason we have avoided this fate is that due to our fundamental asymmetry of configuration space, we have naturally restricted the space of solutions to ones corresponding to a single Hamilton-Jacobi principal function, , and by choosing a smooth screen (corresponding to a time function) transverse to the classical trajectories and with the orientation coming from the classical trajectories themselves (away from the record). The point is that these are not choices in our framework, but consequences of the given approximation and our axioms.
- •
Many-fingered time problem. This is the problem of time, in the semi-classical approximation of minisuperspace. Outside of minisuperspace for standard Wheeler-deWitt, is not a global time, it is space-dependent. Fluxes, conservation laws, and the like become ill defined concepts in the full superspace, partly because of the problems outlined in section 2.1 (after equation (10)): equal time surfaces in superspace will not be gauge-invariant quantities. Moving on to minisuperspace, consequences first appear in that the time function given in (47) “should correspond to a foliation in which geometry is homogeneous. It would be a consistent choice if we found that gravity had such homogeneity down to the Planck scale; which it doesn’t” [14]. Moreover, one should note that within gravity no space-time scalar can be built just from the spatial metric, as should be the case if (47) was to have meaning in a fully covariant theory. Thus an ontological barrier between the semi-classical approximation in a homogeneous (minisuperspace) approximation and the full theory is erected, isolating the two; the approximation becomes its own consistent theory, without a strong connection to the full theory [14].
As mentioned in the beginning of this section, the standard covariant interpretation of gravity has no place for (reduced) configuration space structures, such as records and screens. We do, because we have different symmetry principles and only global conservation laws associated to (global) time. In the context of timeless theories in configuration space as I have presented them here, the results of this section still of course arise from an approximation; but only one of convenience, supported by the hierarchy of the interactions and dynamical behavior of gravity.
Summary of this subsection.
Although the Born rule gives a volume in configuration space, in the absence of external time we need to define what “conservation” means. For this, I defined screens: sets of configurations with the same record, so that no element of this set is a record of another. That is, there is no redundancy of records within this set. I then showed that for smooth screens – defined by the flow of the extremal paths – one can indeed find conservation of probability (for the associated foliation, where it exists). Moreover, I showed that, for weak matter couplings, one can recover the Schroedinger equation for matter fields in the classical background (of each extremal path). Unlike what is the case for minisuperspace WKB approximations of WdW, records and screens define the time functions and unique Hamilton-Jacobi functionals – each in its domain of existence – which allow the construction of positive currents associated to the probability densities, which can be at least formally extended to the full theory.
5 Applications to quantum gravity
I started the paper discussing fault lines between the foundations of quantum mechanics and those of gravity. I went on to explain some of the problems for defining a reduced configuration space in the presence of local refoliations. Most of these problems are not visible from minisuperspace. Thus, to highlight this aspect of the distinction between approaches, I now briefly present a simple gravitational toy model, corresponding to a form of strong gravity444444In gauge, but with a strictly positive kinetic term, i.e. with no negative conformal modes. that is not a symmetry reduced model. Indeed, usually, one must resort to minisuperspace approximations to obtain information about quantum cosmology. Here I propose a different simplification (which might have even less to do with our Universe than standard minisuperspace cosmology [15]). The proposal is to use features of the infinite-dimensional geometry — specifically the relation between the Van-Vleck determinant and the geometric expansion scalar along a geodesic congruence [36] — to directly obtain first order quantum effects for gravitational theories. Here I will give only a very brief demonstration of this tool. The geometric analysis used below, however, is available for a much larger set of geometric actions than the one presented here (see [47]).
5.1 Gravitational toy model
First, a brief word on the use of the geometry of infinite-dimensional spaces in order to study quantum cosmology in the semi-classical limit.
Geometry and dynamics.
Dynamical systems that are quadratic in the momenta and time-independent can have their flows associated to geodesics of a Riemannian metric (52). It turns out that even more general dynamical systems have similar geometric formulations, as shown in [45], albeit not necessarily through the use of Riemannian metrics.
This implies for example that the confluence or divergence of nearby trajectories are governed by the Jacobi equation (the geodesic deviation):
[TABLE]
where is the Riemann tensor associated with the Jacobi metric and . Equation (52) is easily obtained by defining a geodesic congruence ,
[TABLE]
where we used abstract index notation. Definition through the map implies that , and appropriately commuting the derivative on the lhs of (52) yields the rhs.
Many dynamical conclusions can be brought to bear here from geometry. For instance, the Hadamard-Cartan theorem states that if the sectional curvature is strictly negative along , then there are no conjugate points to ; i.e. extremal paths don’t “reconverge” or ‘recohere’. The other side of this is the Bonnet-Myers theorem (similar to the standard Focussing theorem), which states that for a bounded sectional curvature along the curve then for , contains a conjugate point before reaching time . This is important in the context of the application of the semi-classical formula and the occurrence of interference (20). It also suggests that interference between these extremal paths will be suppressed for many types of chaotic behavior.
Moreover, in this geometric case, one can get many interesting relations between the Van-Vleck determinant (17), the Raychaudhuri equation, the expansion scalar, Jacobi fields and the Riemann curvature itself. Reference [36] gives a plethora of such relations, for congruences of different signatures.
The toy model
To recapitulate how my assumptions here fit the axioms of section 2: I will take , the 3-sphere, to be given by Riem again, i.e. Riem and the gauge-group to be the group of 3-dimensional diffeomorphisms, Diff acting through pull-back on the metric. The non-degenerate configurations with the highest isotropy subgroup are on the conformal class of . Although in the case of diffeomorphisms the degenerate boundaries of Riem (inside its affine vector space ) are accessible by the dynamics, I will for now ignore this, and later transition to the full Diff group. I will thus, more or less arbitrarily for now, select and ignore boundaries of Riem. The volume form will be dictated by , with .
For the action, given the space of 3-metrics, Riem (given in section 2), I will choose the simplest possible dynamical system, where the action is given by the length functional, for :
[TABLE]
This action is globally (but not locally) reparametrization invariant. Here the path integral could be defined using the arc-length gauge-fixing of the parametrization, discussed in appendix F.2. Alternatively, I could use the Riemann-Stieltjes integration over parametrizations (see section IV, A in [35]). Note that we can do this here mainly because there is a single reparametrization constraint; there is no mixing of local gauge-symmetries with evolution, and the existence of a supermetric allows us to take limits of infintesimal segments to zero, i.e. there is meaning in taking the size of the “mesh” to zero, and no need to define the gauge-fixing before defining the path integral.454545In the presence of refoliation invariance, one needs to define a physical slicing condition – so that small steps of the skeletonization make sense – even before defining the path integral. The challenge then is to determine how a change of gauge would affect this definition (see appendix A.2).
However, the action (53) is still not invariant under configuration-space-dependent diffeomorphisms, i.e. under it acquires non-covariant terms depending on . To covariantize, I would have to add a prescription for lifting curves from superspace, Diff to . This can be done by exploring the principal fiber bundle structure of , and introducing a connection form, as explained in section 2.1. In this case, one can take advantage of the metric on to introduce a connection which is roughly a projection orthogonal to the orbits of Diff(M). In other words, configuration space vectors tangent to the orbits are of the form for a vector field, and thus orthogonal vectors to the fibers wrt to the metric (53) are such that . The connection itself is a type of Green’s function; the inverse of a second order differential operator, .464646See also [24] for a relation between a choice of connection, locality, and the representation of abstract observers. This implies that and iff . Since the connection-form is being defined by an orthogonality conditions for a -invariant metric in configuration space, it automatically satisfies both requirements, (87)-(88).
The action then becomes:
[TABLE]
Under a diffeomorphism, it is now easy to show that the integrand remains invariant.474747Note that the terms which take into account the metric variation of itself vanish by the equations defining . The same sort of lift would ensue if the symmetry group included Weyl transformations, and the absence of Weyl anomalies in 3d extends, through the use of the horizontal lift, to the absence of Weyl anomaly in the lifted path integral. Note moreover that, unlike for refoliations, in this case it is easy to show that the Fadeev-Popov determinant behaves like a true determinant, and so, taken together with the measure, is invariant under the given transformations.
As I am mostly interested in a semi-classical approximation, I will take a short-cut, sufficient for my purposes in this analysis. Since the orbits of Diff are Killing directions of the metric given in (53), the inner product between the tangent to a geodesic and the orbit remains constant.484848 for Killing. This elementary fact about Pseudo-Riemannian geometry remains true also in the infinite-dimensional case [48]. Thus a geodesic with initial transverse velocity will remain transverse. I thus assume here that all initial velocities are transverse, and thus we can use for all practical purposes just (53), not (54), since (see below for the appropriate Jacobian that would appear in the path integral).
The geodesic equations are:
[TABLE]
and one can, in fact, find explicit analytic solutions to the equations of motion [49]. Given and , let , with and . Then the geodesic starting out at in the direction of is given by:
[TABLE]
where is the matrix given by ,
[TABLE]
Note that the geodesics are ultralocal. Since the action has no spatial derivatives, the solution depends only on and . In arc-length units, the velocity has units of sec. Let me briefly note that the Jacobi fields can also be explicitly solved for, but the resulting expression is too involved to be written here (see Theorem 4.7 in [49]).
For standard homogeneous cosmological models, one would have , greatly simplifying the solution, as and where is then the instantaneous expansion. In this case, however, the volume of the manifold, which is dictated by above, will reach a singular value in finite time for . If , however, geodesics will run forever.
Endowed with such a metric, =Riem has an associated Riemann super-curvature (of the Levi-Civita connection). Along pure trace directions (as would happen in a purely homogeneous expansion), it is zero. Otherwise, in index-free notation it is given by:
[TABLE]
where are matrix vector fields, extending the sort of matrix vector given above for to . I have assumed that , and the standard matrix commutator is used. The sectional super-curvature is then calculated to be:494949All the formulas in this section are generalizable to dimensions. For instance, the above in fact comes from [49]. Moreover, one should note that there are two sorts of connections: one related to the internal gauge space, and the other the Levi-Civita. This is precisely the analogous case of Kaluza-Klein, and the two satisfy relations [50].
[TABLE]
For orthogonal, i.e.
[TABLE]
we immediately get from (57) that . By the Hadamard-Cartan theorem, this means that there will be no conjugate points for these dynamical paths. It also means that interference of alternative paths will quickly subside, and semi-classical quantum effects will become predominantly due to other fields, present on a fixed metric background (see section 4.2).
For quite general cases, the geodesic equation, the exponential maps, Jacobi fields, curvature, etc, have been computed [47]. Quantities of interest become geometrical; for example, the Van-Vleck matrix:
[TABLE]
Where is the Jacobi matrix associated to the geodesic variations.505050Jacobi fields - geodesic variations of - are uniquely defined by its value and first derivative at . Alternatively, it can be characterized by it values at and , through the two-point tensor matrix . The determinant here requires regularization, but in principle we can calculate everything needed for a semi-classical path-integral within the classical geometry of , by using equation (16) and (17) together with (53) and the expression for the derivatives of the Riemann exponential map. Alternatively, we could use Barvinsky’s reduction methods for funcional determinants [29], which reduce the 1-loop functional determinant to a spatial one, integrated over a given time parametrization. Here I will choose the first, simpler route.
There are two possible procedures to obtain closed expressions for the Van-Vleck determinant: i) input equation (55) into the action (53), with the end-point as a function of (for fixed ). Or, ii) use the given closed expression for the Jacobi fields in Theorem 4.7 in [49] to directly write the Jacobi matrix.
Both manners are calculable, but the resulting expression is not compact, or enlightening at face value. We pursue the first option in appendix G. Alternatively, we can use geometric approximations for the Van-Vleck which work, respectively, for weak field and small geodesic distance [36] (for finite-dimensions, with the geodesics):
[TABLE]
The Ricci super-curvature does not properly exist, since the mapping is just the push forward of the section by a certain tensor field, a differential operator of order 0. If this is not zero, it induces a topological linear isomorphism between certain infinite dimensional subspaces of , and is therefore never of trace class. In this point of this approximation we need to smuggle in some sort of regularization.515151For an action with spatial derivatives, which would give Laplacians, and an expansion on eigenvalues of the tensor Laplacian would provide a cut-off for the trace inherent in the determinant. This will also be important to get some inhomogeneity in our probability for gravitational modes.
Here, following [49], I take a shortcut by considering the pointwise trace of the local action of the Riemann tensor:
[TABLE]
which is almost proportional to the supermetric, except that one of the components gains a traceless projection. Due to this traceless projection, we don’t have to worry about the extremal paths under consideration reaching the boundary of Riem (where we would have to set up boundary conditions). The problem with the super-curvature being proportional to the supermetric (and that inherent in (56)) is that it means in a sense we are in a constant curvature regime, and cannot really apply (59). We are better off applying (60). For points close to in arc-length, i.e. for , and initial velocity , we get:
[TABLE]
Clearly the pure trace directions (cosmologically homogeneous) have no contribution from terms, and thus, at least close to , are of greater amplitude. Note however, that here we must strike a fine balance. That is because the semi-classical approximation is valid for , and thus we must have an expansion for small arc-length , but , and there is no other dimensionful constant to compare to.
We must thus consider this Van Vleck as giving a one-loop functional determinant measure (combined with the projected Liouville measure), on the tangent space . I.e. on the space of initial cosmological directions at .525252 We should remind the reader that a more standard computation exists in terms of the integral of the one-loop determinant in time, which gives the Van-Vleck [29]. Because here we are exploring the geometrical analogy, we will not pursue that strategy.
According to (14), for the short distance approximation, we are in fact integrating over directions, , and such that . But because of the latter, ‘gauge-fixing’ condition, we gain a Jacobian, . This is calculated in appendix F, equation (114). The determinant that emerges however, depends only on , and thus is independent of the integration variable.
Finally, since (62) is obtained from an exponential, we obtain, for the relation between two total surface volumes (or screen-flux) for the screen regions parametrized by , at a geodesic distance of , and with the same nominal volume (i.e. same volume under the homogeneous measure :
[TABLE]
This is the relative probability for these two screens defined by the same arc-length distance, in an expansion for arc-length distance from larger than Planck. Explicit numbers can be computed by using an eigenbasis of transverse transverse traceless tensors on the round 3-sphere, . A convenient such basis is the tensor harmonic basis on , fully described in [51]. Since is compact without boundaries, and the standard Laplacian is a self-adjoint operator with respect to the standard inner product (see (118)), by the spectral theorem the eigenfunctions with different eigenvalues are orthogonal. In terms of these specific eigenvalues, the measure becomes , if the basis is orthonormal (see (119)). If it is not, another Jacobian appears for the transformation.535353These are the coefficients for the eigenfunctions described in 17-18c of [51]. The value of (63) then becomes a difference for the integrals for different ranges of the prefactors of such a harmonic basis, which can be now calculated explicitly. But this is just a homogeneous measure,545454Had we chosen a basis which is not orthonormal, the Jacobian would have cancelled out with the emerging prefactors in any case. and thus
[TABLE]
In other words, this measure does not care about the eigenvalues of the transverse-traceless modes. It is homogeneous, and thus would not probabilistically favor “flatter” modes, as is expected of inflation.
However, preliminary calculations show that with coupling to a metric, i.e. having instead of (53),
[TABLE]
changes terms in (63) in the following way:
[TABLE]
where is the degree of . In this case, the higher the eigenvalues of the region in comparison to , the smaller is its relative probability flux. This would mean indeed that more homogeneous modes would be favored. Although it should be noted that the eigenvalues of the basis in (119) are given by , for . And thus, one wouldn’t have complete homogeneity represented in this space.
Comments on the cases where is not small.
For the full expression of (58), we need replace the solution (55) into the action (53), between and . Then the action as
[TABLE]
This is a cumbersome procedure, requiring quite a bit of algebra. As the purpose of this exercise is a proof of principle for the method, we leave these calculations for the appendix G (see equation (130)).
In the general case it also happens that, since there will be at most one extremal path between and , the semi-classical probability density is given simply by
[TABLE]
where lies in a classical path from with a horizontal initial condition, i.e. and is defined by the determinant of the geometrical Jacobi matrix, in (58). It is clear from (55) that we could not have chosen the completely degenerate metric as .
There are two qualitative features for the gravitational quantum mechanics model that can be seen at this primitive stage: firstly, there is no gravitational interference, as the sectional super-curvature is strictly negative (57).555555Although, see [52] for arguments about the naturalness of certain reflecting boundary conditions for geodesics in superspace. Here too, one can impose certain artificial boundary conditions and obtain interference patterns directly from (20). Since generically interacting Hamiltonians in more than dimensions are chaotic, it seems even in the more general geometric case we can use the Rauch comparison theorem in the other direction: generically, semi-classical interference effects will become suppressed after a short evolution time (in arc-length). Different cosmological histories will quickly decohere, with no chance of recohering. Thus, in exactly the same fashion as the Mott bubble chamber (a quantum mechanical decay process), metrics along the same geodesic (separated by more than in arc-length) can serve as records for ones coming later along the geodesic. At this level in , with weak coupling to matter fields, the gravitational degrees of freedom would quickly become a background for the quantum effects of these other matter fields. Thus in principle we can fully determine the gravitational semi-classical theory, complete with records, interference (or lack thereof) and so on.
Secondly, geodesics can be extended indefinitely, but it is not true that any two points can be connected by a minimizing geodesic (thus the Hopf-Rinow theorem for finite-dimensional Riemannian geometry fails in this infinite-dimensional context). This already means that certain configurations have negligible volume from the start, without even taking into account the Van-Vleck determinant or the relevant records. It also counters some finite-dimensional intuition, that would lead one to believe that any configuration could be reached by a judicious choice of initial condition. There are, in the gravitational configuration space, deserts of very little volume.
On top of this structure, we could now proceed along the lines of section 4.2. This would allow us to discuss the quantum evolution of source fields on top of each one of these classical gravitational solutions, where each classical gravitational solutions is weighed by its own probability according to (63).
Summary of this subsection.
This section was a proof of principle. Here I have described a simple gravitational toy model. With it I was able to implement the structures presented in this paper (e.g.: the field-space connection 1-form, the semi-classical approximation, etc) for building the semi-classical volume form. Extending an equivalence between the Van-Vleck determinant and geometric objects (depending on Ricci curvature), I proposed an equation for the semi-classical amplitude in the vicinity of the round 3-sphere. With it, I found a simple result for relative volumes of a geodesic screen close (in configuration space) to the round sphere. These represent relative transitions from a total amplitude which, according to the previous section, would be preserved in the geodesic arc-length time.565656At least while these “geodesic spheres” in field space are integrable. In the present case, by the Gauss lemma, their integrability could have a very large domain, since there are no focusing points. In this simple case, these amplitudes are just proportional to the respective “functional area” encompassed by two sets of TT-modes (of the round ), on the tangent space . Nonetheless, I indicated how, including some coupling between points (through the inclusion of spatial derivatives in the action functional), the relative transition amplitude would naturally favor more homogeneous directions in field space (departing from the round sphere). It is important to note that this was all done without a minisuperspace assumption, and the results are not strictly classical, as they require the computation of the semi-classical effects through the Van-Vleck determinant.
5.2 Regaining space-time
As I mentioned in the introduction, it should be noted that even after having a good semi-classical quantum gravitational model of the sort presented here, one must still reconstruct an effective space-time from a curve of geometries. In other words, although along a classical trajectory we can order records and obtain a global (in space) notion of time, this might not be relevant for what observers within these trajectories in Riem perceive as duration. This touches on the common question in quantum mechanics, surrounding what constitutes a relational clock. This section does not provide a definitive argument, but a sketch on different directions that could be pursued in this respect.
It should be said that all theories that refer fundamentally only to the 3-metric, — all standard interpretations of canonical quantum gravity, such as Hartle-Hawking — would require a reconstruction of space-time at a hypothetical classical limit. As with the necessity of records, these issues are less pressing in the minisuperspace context, and usually glossed over.
For shape dynamics [1], the dynamics of matter fields can reproduce those of GR (at least for some finite duration). As has been shown in different ways [53, 54], one can then reconstruct an effective slab of space-time which is indistinguishable from GR, and recover aspects of the equivalence principle [53]. But these solutions are more or less fine-tuned to match general relativity, and the way in which they acquire those features is not completely understood.
In the more general case pursued here — which gives a method and a paradigm, not a specific theory — we would like to understand the nuts and bolts about how any space-time can be recovered from a curve in (conformal) superspace, and what are its properties. This, we don’t yet know how to do, but here is one attempt. I will first discuss the emergence of some limited, finite refoliation symmetry, based on scalar fields.
Refoliations
It is not really mysterious how regaining a space-time structure from matter fields can break refoliation invariance. Indeed, the useful matter fields for this role should be expected to break refoliation invariance; after all, they can define a rest-frame, or a surface in space-time where their gradient is purely timelike. For example, the CMB can be taken as one such reference.
Moreover, the observability of a “preferred foliation” is not obvious from the form of the action. For instance, the BSW action [55]:
[TABLE]
where is the “dressed” horizontal metric velocity, for a specific functional of the metric and its time metric. Equation (64) is not explicitly space-time covariant — it does not contain the lapse or the shift, components of the space-time metric in the covariant form — and yet it is exactly the Einstein-Hilbert action, written in 3+1 form and with eliminated lapse and shift. The important aspect of GR that can be seen in (64) is the fact that it contains the same amount of spatial and time derivatives.
Given these two considerations, I thus start with an action of the form:
[TABLE]
(let’s ignore the shift, because that is easy to accommodate), and supposing that all scalar fields are such that in some domain, having no spatial gradients in the action. The point is that I can replace by , and there I will have a freedom of choosing ” the lapse” in different ways, one for each field determining time.
That is, I can rewrite the action as:
[TABLE]
This is basically the standard gravitational Einstein action with replaced by the derivative along the scalar field, and with the ”lapse” . Now, one can shift from one i to the other iteratively, i.e. forgetting about their original relation to , obtaining the same results. I.e. by using in the equations above, one gets precisely the same action, with , i.e. just described under evolution for a different scalar field. It is a “discrete” sort of refoliation.
Moreover, the perturbation equations for the metric are hyperbolic, and would construct a universal gravitational light cone. I.e. in this extremely simple case, the equations of motion of the gravitational fields will be hyperbolic (for the unit lapse case):
[TABLE]
where are the Christoffel symbols for the standard DeWitt supermetric. And thus, the propagation of gravitational perturbations around stationary solutions, such as (the flat Euclidean metric), will form null-cones. The requirement that these characteristics are indeed the null-cones of a metric, will determine the conformal class of the space-time metric (and thus a relation between the lapse and the spatial metric).
What is most unnatural about equation (65), however, is the fact that there are no spatial gradients of the scalar fields. This was motivated by the way one would use such a scalar field in space-time to define a foliation. Nonetheless, we can remove this condition by adding gradient terms — e.g. the standard one — and using shift vectors adapted to each scalar field, in the following way. Correcting the time velocities of the scalar fields: , we obtain the following correction to the kinetic terms:
[TABLE]
Now, choosing , we get a quadratic equation on so that
[TABLE]
namely,
[TABLE]
which can be solved for the fields,
[TABLE]
It thus seems that one can choose the shift vectors to be physically given in such a way as to always “select a surface of spatially constant ”. In this way, one could in principle retain the spatial gradients of one in a foliation, but get rid of another.
But there are many problems with this resolution however: when coupling to the metric, or with the other fields, it would imply a non-miminal coupling between and , which would also reinstate the dependence of . This perhaps could be resolved by adding the terms from the beginning and solving a possibly more complicated equation, depending on all the fields. Another problem is that the solution for doesn’t seem to define a bona-fide connection-form. I.e. it doesn’t have the right covariant transformation properties as in (88) or (8). Nonetheless, this is first attempt, a proof of principle that certain things can work, while others not at this point.
Null-cone and equivalence principle.
As a simple example, given extremal paths of (53), we can attempt to reconstruct a space-time just with the use of scalar fields. The easiest manner to do this is to introduce a weakly back-reacting scalar field, with action in arc-length parametrization given by
[TABLE]
for a very small . An objection here is that this action is largely inspired by a relativistic one – in that it contains the same number of spatial and temporal derivatives. Ideally, one would have more than one field, e.g.: , and use the characteristics of one to define the propagation of the other. We leave this for further work. Using the characteristics of the field equations (on this background ), we could have a metric given by:
[TABLE]
where is given in (55). The first question one can ask is when (68) also obeys the Einstein equations for example.
Now, to discuss something such as the equivalence principle, one could proceed as in [53]: couple a point-like particle with trajectory – with the usual covariant action, parametrized by the superspace arc-length, and with an induced lapse given (in this simple case) by (68) – i.e.
[TABLE]
The equations of motion are the usual geodesic ones for the metric (68). According to [53], to first order, one could use time-dependent spatial diffeomorphisms to correct the frame of reference centered on such a particle to be described by geodesics in Minkowski space, apart from transformations of the lapse.575757Note here that the propagation of spatial coordinates are also “physical”, as they are given by a choice of . But different choices will be related by spatial diffeomorphisms, as is the case with different charts for the manifold. In the present simplified case, the physical lapse is already equal to one, so we regain one of the statements named ‘the equivalence principle’ – namely, there is a gauge symmetry that will bring an arbitrary classical trajectory to a geodesic of Minkowski space, at least infinitesimally. This issue, however, needs to be further investigated in more general cases with more general matter fields; it is not clear to me that it must always hold.
Summary of this subsection.
In this subsection I have sketched different ways one would go about reconstructing a space-time metric and regaining the equivalence principle (and other notions of refoliation invariance). This can be accomplished from the relative evolution of observables along an extremal path in configuration space. There, there can still be more than one way of defining duration, and perhaps these can replace different notions of refoliations (which is an inherently spacetime operation). Indeed, what do we physically do when we want to refoliate? We choose a different set of clocks spread out through space. Moreover, we still want a hyperbolic structure. Since we do observe that light has the same speed for all observers, and that can only happen if the metric has a null cone. In specific cases this translation seems to go through without much problems, but no generic statement was proven.
6 Conclusions
6.1 Problems between records and refoliation invariance.
Differences from Hartle-Hawking and the tunneling proposals
As discussed in the main text, the formalism presented here has certain features in common with the Hartle-Hawking no-boundary [16], and with Vilenkin’s tunneling [56] proposals. Vilenkin’s model in mini-superspace coincides with our model, for the reduced ADM action, as I comment below.
There are, however, some crucial differences. The boundary conditions are based on completely different principles. In Hartle-Hawking, the path integral integrates over all 4-metrics interpolating between a given regular Euclidean 4-metric without boundary and a given 3-geometry with matter content. In the tunneling proposal, the boundary conditions should embody the notion that the Universe “tunnels” from nothing. In both cases, the resulting wave-function (in terms of its one boundary 3-geometry and matter content) should satisfy the Wheeler-DeWitt equation.
Like the present work, these theories attempt to define a wave-function of the Universe from a path integral in some field configuration space. But there are a couple of important distinctions between such proposals and the one developed here: i) Both Hartle-Hawking and tunneling proposals are based on path integrals in superspace. But supespace is not the reduced configuration space of ADM, because one still has refoliations. In our case, there must be a well-defined reduced configuration space for the volume form to have physical meaning. Records and screens required crossing extremal curves in superspace to have objective meaning (there can’t be one if refoliations are part of the local symmetries). ii) In the present approach, the boundary conditions are selected uniquely, and are such that the arrow of time should point in the direction of greater inhomogeneity.
The absence of further specification of boundary conditions is also unlike either Hartle-Hawking or the tunneling wave-functions. In Hartle-Hawking, one usually stipulates boundary conditions such that for the degenerate determinant [16], integrating only over metrics for which and which can be ‘capped-off’ by part of a 4-sphere. In Vilenkin’s proposal (see also, Linde [57]), the definition of the wave-function is given by a transition function from the completely degenerate 3-metric, . One also requires, however, further boundary conditions on superspace. For this, Vilenkin attempts to separate curvature singularities which would correspond to singularities of a 4-geometry – the singular boundary of superspace – and those which would correspond to singularities due to bad choices of slicings – the non-singular boundary of superspace. This is a difficult distinction to make in practice, at least away from minisuperspace. Assuming that it could be done, then the proposal requires the wave-function to have only out-going modes at the singular boundary. There is also a problem here, as in general there are no time-like Killing vector fields in superspace to define positive and negative frequencies. However, the proposal can be carried through in the WKB, minisuperspace context, yielding results similar to Hartle-Hawking.
Here, the boundary (or initial) condition is motivated from an informational and timeless perspective. A reduced configuration space of observables does exist, and in it, one looks for the most homogeneous elements – i.e. the ones that encode the least irregularities, or ‘information’, in its configuration. Under certain symmetry groups, and with certain assumptions about the degrees of freedom of the theory, a unique such element can be singled out. In addition, due to the stratified nature of reduced configuration space – with concatenated boundaries – this point consists of a natural boundary. In a geometric sense it is the ‘boundary of all boundaries’ [25], and no further boundary conditions need to be given. Such a radical asymmetry of reduced configuration space is furthermore necessary for our notion of records (Def 1) – the crucial element which makes the recovery of time possible.
At least in the case of Diff with , this construction means no further boundary conditions need to be imposed apart from our single axiom. This can be seen by parametrizing physical space with unimodular metrics, . To change the signature, the determinant must become degenerate, which disconnects the physical space of positive definite signature from those with other signatures. In other words, the ‘cone’ which makes up the boundary of Riem inside the affine space becomes inaccessible from within reduced configuration space.
Nonetheless, if one was to blindly apply the axioms of this work to Riem and Diff, one would recover Vilenkin’s tunneling proposal, but with an extra scalar degree of freedom. That is because the orbit of is connected to the rest of superspace (but would not be to conformal superspace), and has the highest possible isotropy subgroup of Diff(M). As is the case with the standard tunneling proposal, one would have to establish further boundary conditions on superspace, which could be taken to be the singular boundaries of superspace [58]. In minisuperspace, the extra scalar degree of freedom does not appear, and, having provided such extra boundary conditions, the two approaches coincide.
Needless to say, both Hartle-Hawking and Vilenkin’s proposals are more studied and developed theories of initial conditions than the one presented here, even if they face enduring questions about i) the complex countour integration in which the path integral should be executed [59], ii) the meaning of the Wick rotation (see e.g. [60] and references therein), and iii) the meaning of probabilities, and their conservation. Moreover, it should be noted that due to the algebra of constraints containing the metric, a rigorous proof of gauge-invariance requires the use of the FBV formalism [22], which for this case will include interaction terms of fourth order in ghosts, with no geometric meaning as a Fadeev-Popov determinant [23].585858 In fact, due to this complexity, the BFV covariant version of (74) can only be shown to be gauge-fixing independent if the gauge-parameters are field independent, a result expected from [24].
In the technical front, in the present work simple BRST symmetries with a geometric meaning as a FP determinant [61] can be derived because all constraints are linear in the momenta (see (9) and (77)). The wave-function built from the path integral satisfies the given functional constraint equations (even in the case of Weyl symmetry). A reduced (physical) configuration space exists, where we can make sense of a volume-form, including the records it creates, and decoherence in the sense of consistent histories. In the toy model, the action functional is positive-definite, with no requirements for choosing complex actions and complex contours of integration. More generally, the Wick rotation is well defined along each path. More work needs to be done in this direction to ascertain more precise properties of the semi-classical solution.
Incompatibility between records and fundamental refoliation invariance.
It is not possible to apply the present formalism to the Wheeler-DeWitt equation in general – i.e. outside of the minisuperspace “approximation” – including its embodiment in the Hartle-Hawking no-boundary proposal. This comes about because of refoliation invariance, which makes it impossible to use the acquired notion of time through records, as I now comment on.
Firstly, there is no known form of reduced configuration space for the symmetries of ADM. This creates an insurmountable obstacle for the constructions of this paper. Even if one defines an initial configuration, records, screens, and so on, with respect to a given gauge-fixing of the foliations, these structures won’t be preserved under a different choice of gauge-fixing, as I described in equations in section 2.1.
Moreover, to the extent that such foliations can be of physical character (i.e. some surface of homogeneity of a given field), they should be reproduced by the purely relational evolution of observables, which I described in subsection 6.1.
Lastly, note that if the theory possesses a single global reparametrization constraint, my notion of records remains fully functional. This makes it more akin to minisuperpsace. A curve will go through a given point however it is parametrized. A profound asymmetry of reduced configuration space will then spontaneously form records and, consequently, an effective notion of time. This is how records untangle evolution and symmetries.
Non-causally related observables, and its problems.
This paper was based on assigning a fundamental role to acausal relations, prior, and not defined by, the specific field content which one would like to superpose. Whereas in GR causality is inherited from the space-time metric – ‘space-like’ is a statement about causal independence which depends on the metric – I would like to place local causally independent observables at the foundations of quantum gravity (QG). Space-time physics are to be recovered only effectively. This led me to a description of quantum mechanics on timeless configuration space.
This setup is perfectly suited for relational approaches to quantum mechanics [12, 13, 62], the results of which we are able to recycle. Importantly, foregoing the space-time picture off-shell (or before dynamics) does not imply that a preferred simultaneity surface is classically detectable (see [53] for a explicit example). In fact, models of gravity can be formulated where we get all the advantages of a preferred notion of simultaneity – compatibility with QM collapse and superpositions – and none of the drawbacks – no detectable ‘ether’ for classical (relativistic) mechanics. The fact that such theories need not imply a preferred choice of simultaneity goes contrary to common wisdom. The misguided intuition comes from imagining such ‘space-like’ surfaces as being embedded in space-time, in which case indeed, they do define a notion of simultaneity. However, without the prior existence of space-time, one first requires a recipe to recover duration, using the dynamics of clocks and rods and fields throughout evolution in field-space.595959See [9] for a related view.
For instance, for a theory whose field space is that of spatial metrics, Riem, for a given action , a solution to the equations of motion is a curve of 3-metrics , such that . However, to find out what a space-time corresponding to this curve is, one needs to “fill in” the curve with a notion of duration (or a lapse), e.g.:
[TABLE]
where I used the mixed functional notation of DeWitt (square brackets signify functional dependence, and round brackets ultralocal dependence). In (69) both the “shift” and the lapse are “effective”. The important point is that they only depend on through the curve and are not inherited from space-time; quite the opposite: they build space-time.606060This is what Wheeler used to call the struts and rods holding up space-time from its geometrodynamical skeleton [63]. One example where this is explicitly realized, obtaining exactly the same empirical content of GR, is the BSW formulation of the Einstein-Hilbert action, (64), [55].616161Although, being essentially GR, the Hamiltonian version of the theory reverts back to the standard ADM.
6.2 Summary
To summarize, in this paper I have described a theory existing in configuration space, , with no singled out time variable. In the presence of physical subsystems that act like clocks, the formalism is able to recover the transition amplitude of standard quantum mechanics (see section 3). However, this clock subsystem does not need to be defined over the entire configuration space for the theory to be consistent.
Given an action on curves in , and a preferred boundary condition which is part of the theory, I described how to construct a volume-form in from the path integral. The volume-form must be constructed from the path integral. Under a simple further assumption required so that my notion of records (and a local decomposition property [18]) may later emerge, such a volume-form is uniquely given by the Born density.
What I termed ‘records’ is basically a sort of ‘information’ contained in the volume-form. Such records encode what we usually ascribe to the passage of time (in terms of conditional probabilities), and statements about conservation of probability using records were also shown to hold. It would be interesting to explicitly connect this notion of records with the one emergent in [64], and there claimed to resolve the issues of an arrow of time.
The preferred boundary condition here is an integral part of the theory, but only in the case of a topology and with the group being Diff have I shown it to be unique. In this particular case, the boundary is the unique point which constitutes the least dimensional corner to the physical state space Diff, and no other boundary conditions need to be given.
This is the ultimate asymmetry of configuration space, and it is what makes most of our constructions possible. For example: the notion of conservation of probabilities, discussed in section 4.2; a notion of global time, and of a standard Schroedinger equation for source fields propagating with this time; are made possible because there is such an asymmetry in configuration space. The unique acts in a similar fashion to the point of emission of alpha-particles in a Mott bubble chamber experiment. Here, as there, it is such an asymmetry that allows for the formation of ‘directional records of time’s passage’ [43].
Lastly, I proposed a simple gravitational toy model, similar to strong gravity, compatible with the conditions imposed by this setting, and showcasing the viability of its constructions.626262There is in principle no requirement for the models to be geometrical; only that they satisfy the axioms 1-5 of section 2. Making use of its interpretation in terms of the geometry of superspace, there is a shot-cut way to calculate (and regularize) its emergent Van-Vleck determinant. I went on to compute the relative semi-classical quantum gravity probabilities (or volumes) for regions in configuration space, finally obtaining the approximate (63). This equation allows us to ask precise questions such as: what is the probability of finding a given range of perturbations of gravitational modes of the original round 3-sphere, with respect to some other range? It turns out that for this simple model any mode is just as likely as another. Note moreover that this is a semi-classical effect – depending on the Van-Vleck determinant – not a classical one.
Resolutions
Thus, summarizing the summary, we have the following resolutions of the questions posed at the start:
- •
In this framework there is no issue arising with the ‘superposition of causal structures’; causality can emerge semi-classically, and we can observe interference effects between alternative emergent space-times.
- •
Non-locality is a non-issue. Configuration space is the true arena of physics, and each point in it contains the entire physical space. Locality can and does emerge dynamically however [18].
- •
Neither is there a ‘problem of time’ (see [65]). Evolution is abstracted from records – made possible by the asymmetry of configuration space – and is not mixed with a local gauge symmetry.
- •
One can define positive probabilities and related probability currents in the semi-classical approximation. Unlike the corresponding approximation for WdW (which can only be done in minisuperspace), here there are no issues with positivity because time is defined around records, and at most a single solution of the Hamilton-Jacobi functional can exist. Source fields can be shown to propagate following the Schroedinger equation with respect to such an emergent time. The asymmetry of configuration space is what dictates and allows for these constructions and their consequent properties.
- •
There is no conflict of the sort ‘unitary vs reduction’ processes for the quantum wave-function. Relatedly, there is no “definite outcomes” problem, since the set of all configurations exist, with a volume-form on top of it.
There are however, other challenges, concerned with regaining classical space-times.
6.3 Some future projects
There are standard formulations of gravity which would be amenable to this treatment. Deparametrized shape dynamics [54] is one such model. This approach would give a different take on old problems, both in cosmology and elsewhere. The use of a prescribed boundary condition will restrict quantum cosmological scenarios, in a very similar way that the Hartle-Hawking condition does. In the homogeneous, cosmological scenario, our prescription will give a different boundary condition than the Hartle-Hawking one, but which can be uniquely specified for certain types of field spaces and symmetries acting on them.
Bianchi IX shape dynamics. Perhaps the ideal setting to test these ideas in the cosmological scenario would be the Bianchi IX shape space, characterized in [66] (see also [2]). There, we have an action, we have the explicit classical trajectories once again, and we have a uniquely specified initial point – the round sphere. In such a reduced configuration space, we get basically a Wheeler deWitt type equation, that should be solved with our boundary conditions. This would give a first prescription of a shape quantum cosmology model.
Reconstructing space-time. The most straightforward project directly extending this work however, is to flesh out the outline of the last paragraph of section 5 : add a simple matter scalar field to the action (53), perform the analogous calculations, and use the prescription of [54] to find an effective space-time according to (69), below. Then use the constructions of [53] to evaluate the standing of the equivalence principle. In this way we can explicitly start to probe qualitative implications of quantum gravity for space-times, which this approach suggests.
Extending the analysis to other toy models. Lastly, as I mentioned, one of the disappointing features of the present toy model is that it does not contain derivatives of the metric, and thus it does not couple points. Thus, perhaps the most straightforward project, would be to apply the formalism here – which exploits parallels between geometrical quantities and the Van-Vleck determinant – to more complicated geometries in , explicitly described in [47]. A particularly interesting geodesic action to try out is
[TABLE]
where is now unimodular, i.e. , and with transverse-traceless velocities. For arbitrarily small , this is the Jacobi form of the Einstein-Hilbert action with (and transvere-traceless velocities), and thus its extremal trajectories should coincide for positive Ricci scalar (but not its path integral). That is, the extremal paths of (70) coincide with those of the spatially unimodular Einstein-Hilbert action with positive spatial scalar curvature and small cosmological constant. In fact, much of the groundwork for this analysis has already been set. In [47], general geodesic theories, with arbitrary conformal factors depending on the Riemann tensor have been studied, and many similar results to the ones presented here obtained.
ACKNOWLEDGEMENTS
I would like to thank Lee Smolin, Aldo Riello, Wolfgang Wieland, J. Halliwell, Flavio Mercati and Sean Gryb for comments. This research was supported by Perimeter Institute for Theoretical Physics. Research at Perimeter Institute is supported by the Government of Canada through Industry Canada and by the Province of Ontario through the Ministry of Research and Innovation.
APPENDIX
Appendix A The Wheeler DeWitt equation
A.1 Difficulties with The Wheeler DeWitt equation
Classical GR admits a Hamiltonian formulation, but it is a constrained theory. The configuration variable is taken to be the spatial Riemannian metric on the manifold , alluded to in (74). The space of all such metrics is called Riem. The emerging system (whose specific form is not relevant to our study here) is of a fully constrained Hamiltonian,
[TABLE]
The standard canonical quantization rules would suggest that the states of the system be given by wave functions of the configuration variable, where denotes the time variable occurring in the Hamiltonian formulation. Replacing as usual by its action through multiplication, and of its conjugate momentum with functional derivation, ,636363More formally, we should use the Radon-Nikodym derivatives in field space, which takes the measure into account [67]. we could have the evolution of given by the Schroedinger equation,
[TABLE]
Since is pure constraint, , but also . The changes generated in phase space by the flow of correspond simply to the action of spatial diffeomorphisms on and its conjugate momenta. Its quantization would simply mean that the value of should be unchanged upon the action of an infinitesimal diffeomorphism of , , which acts pointwise on (through pull-backs). The space of metrics quotiented by this gauge symmetry, Riem/Diff, is the space of geometries, also called superspace, and the action of as a quantum constraint operator implies the wavefunction lives over this superspace. This part is recovered in the present setting.
The action of on the wave-functions gives rise to the Wheeler-DeWitt equation:646464 It should be noted that to obtain (73) from (74) one needs to a priori fix refoliations at the initial and final configurations [23], in accordance with our expectations.
[TABLE]
The analogous operator interpretation to here would be invariance wrt diffeomorphisms in space-time which move points orthogonally to . Although that interpretation is not as clear in this case, it is still compatible with the intuitive effect of the implementation of the constraint, namely .
In spite of the indefinite signature in its kinetic term (coming from the DeWitt supermetric) giving rise to quadratic terms of negative sign, the WdW cannot be interpreted as a Klein-Gordon type of equation to be “second-quantized”, for i) it is already quantized, and ii) there are no straightforward ways to split solutions into positive and negative frequencies to construct the annihilation and creation operators, since there is no good time function in superspace. For a review, see [14]. This is not to mention finer points about regularization (point-splitting seems to be the more appropriate one here, due to the occurrence of the singular coming from quantization of the term quadratic in momenta [68]) or factor-ordering.
Moreover, the requirement that the Hamiltonian commute with all observables can be extremely restrictive. For ergodic systems, it will lead to the conclusion that the only observables (continuous in phase space, with the Liouville-induced topology) are polynomials of the Hamiltonian itself [9, 10, 11]; a defect which is at least camouflaged in minisuperspace approximations due to their simplicity.
A.2 Issues with the path-integral definition of the wave-function
[TABLE]
where is the gravitational Lagrangian density, and I have omitted boundary terms, the required gauge-fixings, Fadeev-Popov determinant and regulators (since the theory is non-renormalizable, this action should only be taken as an effective theory at a given scale). Here are initial and final spatial geometries, i.e. for the hypersurfaces and the embedding then , where here is the abstract space-time metric tensor field. The issue is that refoliations shift (in a -dependent manner), and change (74).656565 As I comment on below, however, the issue is more subtle than it appears, if one implements the boundary conditions with a delta function on the boundary. Then gauge transformations don’t act directly on the boundary field.
Such difficulties with giving a simultaneous meaning to refoliations as a gauge-symmetry and defining a meaningful gauge-invariant transition amplitude are reflected in the canonical quantization approach, both for gravity and more basic fields. For instance, in passing from the Hamiltonian transition amplitude to the Lagrangian path integral for field theory by integrating out the momenta, indeed one obtains, e.g. for a scalar field :
[TABLE]
where I used a non-standard relativistic notation , and the integration is done so that lies in , and . The only thing in this formula which is not Lorentz invariant is the domain of integration, . To properly calculate the S-matrix (and the generating functional) this issue is remedied by taking the infinite time interval . Of course, this is an abstraction; measurements take a finite time.
In more generality, for finite intervals, local time reparametrizations – refoliations – don’t act as a gauge group in the boundary field space, thus obstructing the definition of a transition amplitude between fully gauge-invariant states. In sum, one can only show that (74) is independent of an arbitrary choice of bulk space-time gauge-fixings (i.e. those that are the identity at and ) together with arbitrary gauge-fixings of the spatial boundary diffeomorphisms at and [23]. The choice of the ‘space-like’ surfaces is not a gauge-fixing in this sense; different choices will change the amplitude. More involved treatments exist (see the review [7]), in which one defines the boundary field values relationally – with respect to some other matter field for example (or with so-called “embedding variables” [69]). Nonetheless, these resolutions also have their issues; especially in what refers to maintaining a good choice of such boundary-defining functions throughout field space. Without such a relational definition working globally in field space, one cannot ensure unitarity [70].
For a transition amplitude, one could define and in (74) as the inverse value of some scalar functional on the spatial metric (such as the value of a matter field, or the scalar curvature) , as , with restrictions on so that becomes invariant with respect to refoliations. Implicitly, this is what is done to obtain the invariance relations of the path integral with respect to refoliations, and it corresponds to the “embedding variables” of Isham and Kuchar [69]. In this case, the embedding variables function largely as a ‘dust field’, which defines points on the space-time manifold.
Here adopting for the 3-metric and for the 4-metric, we fix one of the boundary conditions. From (74) the wavefunction of the universe on a hypersurface with intrinsic 3-metric and matter configuration, , can be defined by
[TABLE]
where the sum is over a class of 4-metrics, , and matter configurations , which when restricted to the boundary take values and , and is the spacetime covariant action functional.
To find the functional constraints related to the Ward-Takahashi identities such a wave-function might satisfy, one must employ the more formal Hamiltonian definitions. In [23], Hartle and Halliwell attempt to formally clarify this. Instead of (75), one defines (dropping ):
[TABLE]
where for the Hamiltonian path integral, or if one can integrate away the momenta, is the measure obtained from the Liouville one (for which BRST transformations are canonical), is the Fadeev-Popov determinant, is the gauge-fixing condition, and are the Lagrange multipliers. Integration runs also over all the final variables, and there they are fixed by the Dirac delta.
In the case of gravity, the very definition of the path integral through time slicing requires one to choose a notion of physical time already at that level, so that infinitesimal slicings are meaningful and one can take extremal approximations in between the slices. The very construction of the path integral in terms of infinitesimal slicings only makes sense in a gauge-fixing for gravity, which sets it apart from theories in which a background causal structure exists.
Clearly, from (9), if either the gauge transformations vanish at the boundaries, or if the Hamiltonian generators of the symmetries are linear in the momenta, then . Moreover, in those cases, then we can resort to the standard BRST methods to show invariance of the wavefunction (76). This is the main difference between the constraints advocated here and the ones which shift the boundary itself.
By redefining the integration variables according to the transformation (and subtracting the untransformed wavefunction), one obtains
[TABLE]
Note that here the refoliations act on the paths (or on the variables), and therefore on , but not on the boundary fields .
Now, we must use their assumption 5 in section II B, which translates functions on the boundary variables in the path integral to functional equations on the resulting wavefunction (ignoring Lagrange multipliers, gauge-fixings and Fadeev-Popov determinants), i.e.:
[TABLE]
And voila, one obtains the constraint equations applied to the wave-function, by taking into account the transformation of the boundary values together with the transformation of the action functional.666666 However, in the case of reparametrization invariant theories with constraints quadratic in the momenta, there are various unsatisfactory features in this derivation. First, as commented earlier, one would need to define the slicing already in a particular physical gauge-fixing. This makes it quite difficult to compare the resulting wavefunction given in two different choices of foliations. Secondly, in standard gauge theories, one can show that the path-integral is independent of the gauge-fixing by the use of the Fadeev-Popov determinant, which has a geometrical interpretation as a Jacobian for the gauge-fixing surface [61], and thus embodies the correct transformation properties wrt arbitrary variable redefinitions. For theories such as GR, in which the constraint algebra is field-dependent, this is not the case, and one must resort to more complicated algorithms, such as the BFV one [22], which doubles the amount of fields. Even then, to apply the Fradkin-Vilkowiski theorem, one must have an invariant action, which is not the case here, due to . Hartle and Halliwell get around this by at first implementing boundary conditions (see eq. 3.16 therein) which don’t allow BRST transformations at the boundary (and which are also BRST invariant conditions) and then using the FV theorem for assuming independence of the gauge-fixing conditions. However, to get the Wheeler DeWitt equation, they must relax such boundary conditions, implementing new conditions which are neither BRST invariant nor allow for the use of the FV theorem (see equation 3.19). Indeed, the only gauge-fixing used in [23] is the one which would not require a functional connection, as in [71, 24], namely, the field-independent boundary conditions on the gauge-parameters (see their appendix B). Disallowing these, it seems it would not be possible to move from one physical slicing condition to another, as these transformations are necessarily field-dependent. My conclusion from these points is that perhaps the inclusion of boundaries in the formalism is more subtle than Hartle and Halliwell allow for, as indicated in many recent papers (see [71, 24] and references therein). This intuition is supported by [72], in which skeletonization issues are scrutinized. To maintain the invariance of the skeletonized Liouville measure, because of the action of refoliations, Barvinsky also restricts gauge transformations at the two end-points (see eqs 4.42 and 4.43 and comments in between). These issues are not, however, of fundamental importance for this work.
One of the issues here is that, unlike ordinary field theories such as Yang-Mills, the negative sign in front of the squared trace of the gravitational velocity (‘the conformal mode’), implies the covariant gravitational action is not positive-definite. Thus, restricting the sum to real Euclidean-signature metrics would not generically allow the the path integral to converge.676767While it is true the path integral for fermions is also not positive-definite, their anticommuting nature ensures that the path integral converges. Indeed, to obligate the path integral to converge in this way, it would be necessary to include complex metrics in the class of metrics to be integrated over [73]. However, as a recent renewed debate surrounding this issue makes clear [60], there is no unique contour to to integrate along in superspace. Moreover, the result obtained for the path integral would generically depend on the types of countours chosen. Such different alternatives are intimately connected with the boundary conditions chosen for the paths to be integrated over, and these choices should implicitly restrict the 4-manifolds included in the sum in (75).
However, the boundary conditions in the path integral should have a covariant meaning, and that makes them extremely averse to being embodied in superspace; where the individual paths of the path integral are explicitly realized.686868 In practice, these problems are not seen directly, since working with the infinite dimensions of the full superspace is unfeasible, and the problems are not self-evident in minisuperspace. One objective consequence of this problem is that, although such contour integrations and boundary conditions for the Wheeler-DeWitt equation must be somehow related, it is nearly impossible to realize this connection in a precise manner.
Physically defining boundary states
This approach of physically defining the boundary states is also related to what is known as the approach of complete observables in canonical general relativity [6, 74], for which a large literature exists (see [7] for a review) and the use of ‘internal time’ (see e.g. [74]). In the case where only one reparametrization constraint exists, complete observables should be a one-parameter family of Dirac observables. Many technical problems arise, however, when an infinite amount of reparametrization constraints exist; as is the case of GR, and when one would like to extend this construction to the entire phase space.696969See [75, 70] for some candidate constructions.
Nonetheless, as shown in [70], even in this case, attempting to use an internal time for each different region in phase space, patching the transitions between them, actually works well only in the purely classical regime. From the point of view of state evolution, defining time for each finite range poses an enormous issue for unitary evolution of states.707070Indeed, this happens even if classical evolution with respect to a local internal time can be made unproblematic in the transition region between one patch and another. The major issue is then that non-unitary quantum evolution risks meaningless results — even away from the boundary of the patch where it should be valid — and it is not clear how to define quantum observables in such a situation.
Appendix B Connections on principal fiber bundles
The advantage of working directly with principal fiber bundles (PFB’s) is that structures are simpler. For example, it is easier to work out particular features of associated vector bundles, or features of gauge-fixed structures directly from the general formalism of PFB’s than vice-versa. The standard example of a PFB is the bundle of linear frames over a spacetime manifold , with structure group .
A principal fiber bundle , is a smooth fiber bundle, for which a Lie group has an action from the right ,717171This means that in the coordinate construction of the transition functions act from the left. which we denote by , for , . The action is assumed to be free, so that iff , the identity of the group. The quotient of by the equivalence relation given by the group, that is iff for some , is usually identified with the spacetime manifold, i.e. . We denote the projection map by . where denotes the right action of on (which should be distinguished from the action of on the group itself). The set , is called the orbit through , or the fiber of . The action of projects to .
The section then gives rise to a trivialization , given by the diffeomorphism , . Generically, there are no such and the bundle is said to be non-trivial. In the functional, i.e. field space, case, the base manifold is a modular space and the obstruction to triviality is known as the Gribov ambiguity.
PFB connections
Before going on to define a connection, we require the concept of a vertical vector in . Let be the group exponential map. Then by dragging the point with the group action we can define a vertical vector at , related to , as
[TABLE]
The vector field is called the fundamental vector field associated to . The vertical space is defined to be the span of the fundamental vectors at .
From (79) one has that fundamental vector fields are ad–equivariant, in the following sense:
[TABLE]
where denotes the pushforward tangential map associated to and is the adjoint action of the group on the algebra. Now, using the fact that the Lie derivative of a vector field along is defined as the infinitesimal pushforward by the inverse of evaluated at , by setting and in (80) and deriving, we obtain
[TABLE]
This shows that the vertical bundle, , is an integrable tangential distribution.
The definition of a connection amounts to the determination of an equivariant algebraic complement to in , i.e. an such that
[TABLE]
One can equally well define to be the kernel of a -valued one form on , with the following properties:
[TABLE]
where . In other words, equation (83b) intertwines the action of the group on (lhs of the equation) and its action on (rhs). Notice that these equations hold only for a global action of on . For any , its vertical projection is given by .
The horizontal derivative is the generalization of a covariant derivative (in the associated vector bundle) to the PFB context. The role of a covariant derivative is to exactly cancel out the non-equivariant terms obtained for derivatives (either in in field space or not). In the presence of a principal fiber bundle structure in field space, what plays the role of the Lie group before is now a diffeomorphism , . It acts on the metric via pullback,
[TABLE]
where I have taken pains to distinguish the action () from the particular group element (). We can transplant the previous structures to the infinite-dimensional case:
[TABLE]
where is the inclusion operator in the field-theory context, is the vertical projection in and is a Lie-algebra-valued one-form, i.e. , which obeys the analogue of (83), that is
[TABLE]
As before, the horizontal projection , is defined via the kernel of . With the substitution (85) in the action, one automatically obtains general gauge-invariance for time-dependent gauge transformations. See [24] for a complete account on the relevance of this connection in the field space context, and how a choice of connection can embody a choice of abstract observer. Both for the functional case and in the finite-dimensional one, we have the infinitesimal form of the equations, generalized for possibly field-dependent gauge transformations:
[TABLE]
where the commutator is the Lie algebra one. From these equations it is easy to see that a connection fulfills the purpose of canceliing out terms that don’t transform homogeneously under the group.
Appendix C The timeless transition amplitude in quantum mechanics
Quantum mechanical timeless transition amplitude in configuration space
We start with a finite-dimensional system, whose configuration space, , is coordinitized by , for . Let us start by making it clear that no coordinate, or function of coordinates, singles itself out as a reference parameter of curves in . The systems we are considering are not necessarily ‘deparametrizable’ – they do not necessarily possess a suitable notion of time variable.
Now let be the cotangent bundle to configuration space, with coordinates and their momenta . For a reparametrization invariant system the classical dynamics is fully determined once one fixes the Hamiltonian constraint surface in , given by . A curve is a classical history connecting and if there exists an unparametrized curve in such that the following action is extremized:
[TABLE]
for curves lying on the constraint surface , and such that the projection of to is , connecting and . By parametrizing the curve with an innocent parameter , we can get to the familiar form:
[TABLE]
where the is a Lagrange multiplier (and we are considering only one constraint).
One can now define the fundamental transition amplitudes between configuration eigenstates.
[TABLE]
where the projector can be defined by:
[TABLE]
with the standard particle Hilbert space inner product, .
In [35], following previous studies of relational quantum mechanics (see [6] for a review), Chiou studied the relational timeless transition amplitude between configurations, in the path integral formalism.727272Chiou calls it a ’curve’ integral, as opposed to a path integral, because the integral only requires unparametrized paths in phase space. I will assume that there exists a metric in configuration space, for which I can then parametrize paths by arc-length. The standard proof of equivalence between non-relativistic Schroedinger quantum mechanics and the path integral formulation relies on refining time slicings – partitioning paths into arbitrarily small segments. Without absolute time, a parametrized curve need not be injective on its image (it may go back and forth). This issue is related to the one of integrating over only positive lapses, or over both positive and negative. One yields a propagator, and not a projection onto the constraint surface.
Chiou chooses the latter, and uses a Riemann-Stieltjes integral to make sense of the limiting procedure to infinite sub-divisions of the parametrization. Furthermore, one must then sum over all parametrizations, at which point one obtains , and the entire transition amplitude (91).
Crucially, Chiou uses a sub-division of the parametrization (available only when there is a single time direction). The mesh of the parameter sequence is then defined by the largest interval encompassed by the parameter sequence. The parameter sequence is fine enough if its mesh is smaller than a given small number .737373 Still, here the integration still needs to be regularized: with a cut-off , satisfying some hierarchical condition with respect to the mesh.
The transition amplitude can then be written as:
[TABLE]
where the path integral sums over those unparametrized curves in phase space, whose projection to configuration space begins at and ends at .747474 In the presence of gauge symmetries, if it is the case that these symmetries form a closed Lie algebra, one can in principle use the group averaging procedure, provided one uses a translation invariant measure of integration.
I will not require this condition, but if configuration space can be decomposed wrt a ‘time’ variable, , such that the total Hamiltonian can be written as , with not explicitly dependent on , then the timeless path integral amplitude kernel is equal to the standard non-relativistic one, up to an overall constant:757575In [76], the conditions under which Briggs and Rost derive the time dependent Schroedinger equation from the time-independent one, corresponds to the system being deparametrizable. I.e. one can isolate degrees of freedom (the environment, in Briggs and Rosen) which are heavy enough to not suffer back reaction. The proposal here is not dependent on this condition.
[TABLE]
Note however, an important difference: in the lhs of (94), time is just part of the configuration, it has no role as a “driver of change” [77]. In the remaining parcels of the equation, it is interpreted as an absolute time.
Here I take (93) for granted as an initial starting point. The difference is that I assume the system is written in the Lagrangian framework, and thus can be completely encapsulated by curves in configuration space. This would be equivalent to the starting point of Chiou if the momenta could be integrated out in the respective Hamiltonian. It would also yield the configuration space measure from the Liouville one, as in (14).
Appendix D Timeless semi-classical composition law.
Here I prove the theorem 1 in more generality. Assuming the action is additive, for a given path
[TABLE]
We need to show that
[TABLE]
which is the required composition law of determinants to obtain (27) (see Kleinert’s textbook [40] for a proof). It is easiest to see this relation by transforming to the on-shell momenta and using the interpretation of focusing of the Van-Vleck determinant. Then,
[TABLE]
becomes the relative amount of focusing that occurs at the intermediary point, which we are integrating out.
Using (95), from the stationarity condition at an intermediary field configuration, we have (dropping the subscripts for clarity):
[TABLE]
this requires the momenta to be continuous at , setting to be along the extremal path . Thus, given a classical path , also depends on and . Thus, deriving (97) by :
[TABLE]
[TABLE]
Writing equation (96) in terms of momenta we have:
[TABLE]
Using the product rule for determinants we have that:
[TABLE]
Finally, substituting this and (98) into (99):
[TABLE]
where we note that an extra cancellation occurs since because they are equal as second derivatives of the action.
Now, the continuity equation (97), turns the double sum of (29) into a single sum over the continuous extremal paths. Each overall extremal path decomposes into a unique composition of extremal paths, with final (resp. initial) endpoints on . Putting it all together we obtain:
[TABLE]
up to orders .
Appendix E Probability currents for weakly coupled systems.
Following section 4.2, here I will briefly go over the derivation of the decomposition of currents and probabilities, under an inner product which is orthogonal in terms of gravity and its source fields.
[TABLE]
I will assume the conservation law for , (44), and a specific form for the matter (or source) Hamiltonian given in (41),
[TABLE]
where is the potential for the sources, the quadratic momentum term in the Hamiltonian turns into
[TABLE]
following (42) and, again, I have taken a simple factor ordering.767676This is the ordering ignoring Christoffel symbols, i.e. ignoring covariance issues, both in configuration space and the spatial manifold. But everything goes through with , which makes everything covariant. The same is true for the gravitational modes. See footnote 38.
For the (block) diagonal metric, we decompose the current in components as:
[TABLE]
for given in (102). For a particle, we have
[TABLE]
and turns into the standard spatial Laplacian. Let’s look at this case for simplicity. In this case, the two currents of (106), reduce, with the approximation (102), to lowest order:
[TABLE]
and, using (44), the full continuity equation reduces to:
[TABLE]
where we have used (47) as the arc-length parametrization (surfaces of equal on-shell action).
Now if we use the definition of the Klein-Gordon inner product, the normal to equal time surfaces is given by (47), i.e.
[TABLE]
and from the block diagonal form of the supermetric for gravity and sources, we get:
[TABLE]
where is the screen, relating the Klein-Gordon inner product and the Schroedinger one.
Appendix F Jacobian
F.1 The Jacobian for the TT decomposition
The spin decomposition wrt the background :
[TABLE]
where
[TABLE]
we pick up a Jacobian
[TABLE]
defining
[TABLE]
we get
[TABLE]
F.2 Arc-length parametrization
If we gauge-fix the time-reparametrizations, we choose to be an arc-length one:
[TABLE]
The infinitesimal version, i.e. for ,
[TABLE]
which gives the variation:
[TABLE]
This gauge choice is responsible for the absence of the global square root. In other words, with this gauge choice one can express the action as an energy functional (in the Riemannian geometry sense) as opposed to a length functional:
[TABLE]
The main feature however of the energy functional is that extremals wrt it automatically implement arc-length parametrization as one of the extremum conditions. In other words, since the length and energy functional coincide for arc-length parametrization, and this is implied by the extrema of the energy functional, in the semi-classical approximation one can directly use the energy functional as opposed to the length. In a way, this gauge fixing ‘trades’ global time reparametrization invariance for locality of the action functional.
F.3 Eigenfunction basis
In terms of eigenfunctions of the Laplacian, using the notation of [51], we write
[TABLE]
For the 3-sphere, one finds that , for . The solution to these equations, in standard spherical coordinates is given by a combination of terms:
[TABLE]
where are the Fourier coefficients, and for each , is a TT tensor harmonic, explicitly computed in [51] (eqs 17-18c). Since is compact without boundary, and is self-adjoint with respect to the standard supermetric
[TABLE]
by the spectral theorem for bounded operators, eigenfunctions with distinct eigenvalues are orthogonal.
Now, for simplification, instead of (117), for a normalized basis (under the standard integral and sphere metric),
[TABLE]
we simply write,
[TABLE]
If the basis is not normalized, a Jacobian appears for the transformation.
Appendix G The full Van-Vleck determinant
I start by rewriting, for the reader’s convenience, (53):
[TABLE]
and (55)
[TABLE]
where is the matrix given by , such that (the traceless condition is taken for simplicity), and
[TABLE]
where . The strategy will be to compute the Van-Vleck matrix with the following chain rule:
[TABLE]
I start by noticing that the usual trick to deal with the square root in (53) is to use instead the energy functional, i.e. the version without the square root. It is easy to show that the equations of motion then give two types of equations: one says that the paths should be arc-length parametrized, and the other that the curve will be geodesic. Moreover, I also note that the conservation arguments section 4.2 go through, slightly modified – the global Hamiltonian is conserved in time, thus giving a non-local conserved energy for each curve.777777Note however that in this case the Hamiltonian is conserved along each trajectory, but it is not a primary constraint, as it is in the case of the geodesic action.
To simplify the notation, I define the matrix . Its derivative is:
[TABLE]
whose dimensions in arc-length units can be checked to be of sec. From the geodesic equation (55), the inverse metric along it is
[TABLE]
and the metric velocity is given by
[TABLE]
thus:
[TABLE]
and, since we are assuming the trace of vanishes:
[TABLE]
where the traces are taken with respect to .
The change in volume form (even for an initial traceless velocity) is
[TABLE]
and
[TABLE]
since in these simple cases all quantities are ultra-local. Thus from (124) and (125) we calculate
[TABLE]
For the other component of (120), we have
[TABLE]
where
[TABLE]
and
[TABLE]
Putting these two together we get:
[TABLE]
Since there were no derivatives, the contraction between (129) and (126) required by (120) is a trivial multiplication. Finally,
[TABLE]
For the last step in evaluating (120), one applies the chain rule for the derivative in (130), using that , and
[TABLE]
and implementing
[TABLE]
One finds that even for this simple case, the result is an immensely complicated function of the the initial velocity . Taking the determinant would once again require us to perform some regularization, as was implicitly done by taking the pointwise trace in the approximation in (61).
Appendix H Locality
There is a type of reciprocity between physical space and configuration space which can be described as follows. Whereas fixing the entire field configuration (non-locally on ) defines a point in the configuration space , fixing only a partial field configuration on a subset of determines an entire submanifold of .787878To be more careful, I should only ascribe to them the title of ‘subsets’, not submanifolds. However, under reasonable assumptions, as shown in the accompanying paper [18] they indeed form submanifolds. Such submanifolds are formed by all of the configurations which have that same fixed field, let’s say defined on , i.e. those fields which coincide with on but are arbitrary elsewhere:
[TABLE]
Extremal curves in the configuration space of the complete Universe will be in general non-local on , although of course local in . In the accompanying paper [18], we provide criteria for the sort of locality we are accustomed to, in the physical space .
Namely, suppose that the action admits a Jacobi-length representation, and let be a region in , , defining an embedded submanifold (with appropriate boundary conditions, in a specific way we spell out in [18]). We show that for semi-classical clustering – i.e. decoupling of from the rest of – to occur on a given region of configuration space, , we must have that is a totally geodesic submanifold of . We show that with this condition the semi-classical path integral kernel (in the arc-length gauge) decouples (here written for regions ), i.e. the kernel over the union of the regions turns into a product over each individual region:
[TABLE]
giving us our version of clustering decomposition.
I should also note that there is an inherent non-locality with the Jacobi form of reparametrization invariant actions (they possess a global square root). However, there is a preferred gauge in which they can become local: the arc-length gauge. In this gauge, the action becomes an energy functional (in Riemannian geometry terminology), and it is possible to show that the Fadeev Popov determinant is field independent. In other words, this particular gauge-fixing can trade time reparametrization invariance for locality.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] H. Gomes, S. Gryb, and T. Koslowski, “Einstein gravity as a 3D conformally invariant theory,” Class. Quant. Grav. , vol. 28, p. 045005, 2011.
- 2[2] F. Mercati, “A Shape Dynamics Tutorial,” 2014.
- 3[3] B. De Witt, The Global Approach to Quantum Field Theory . No. v. 1 in International series of monographs on physics, Clarendon Press, 2003.
- 4[4] B. Dittrich and J. Tambornino, “A Perturbative approach to Dirac observables and their space-time algebra,” Class. Quant. Grav. , vol. 24, pp. 757–784, 2007.
- 5[5] C. Rovelli, “Partial observables,” Phys. Rev. , vol. D 65, p. 124013, 2002.
- 6[6] C. Rovelli, Quantum Gravity . Cambridge University Press, 2007.
- 7[7] J. Tambornino, “Relational Observables in Gravity: a Review,” SIGMA , vol. 8, p. 017, 2012.
- 8[8] C. J. Isham, “Canonical quantum gravity and the problem of time,” in 19th International Colloquium on Group Theoretical Methods in Physics (GROUP 19) Salamanca, Spain, June 29-July 5, 1992 , 1992.
