On the Quantum-to-Classical Transition of Primordial Perturbations
Wouter Ryssens

TL;DR
This paper explores how quantum fluctuations in the early universe transition to classical inhomogeneities observed today, analyzing various formalisms and focusing on the pilot-wave approach to understand the quantum-to-classical transition.
Contribution
It provides a detailed analysis of the quantum-to-classical transition of primordial perturbations, emphasizing the pilot-wave theory as a novel perspective.
Findings
Pilot-wave trajectories illustrate the classical limit of quantum fluctuations.
Squeezing and decoherence formalisms have limitations in explaining the transition.
The pilot-wave approach offers a consistent framework for the quantum-to-classical transition.
Abstract
Detailed measurements of the cosmic microwave background indicate the large-scale homogeneity of the universe. On very small scales, we observe however inhomogeneities such as galaxies, stars, planets and ourselves. In the context of hot Big-Bang cosmology, these inhomogeneities are often explained as the remains of quantum fluctuations at very early times, enlarged to observable scales through the process of inflation. In this dissertation, I examine two important questions surrounding this scenario: a) How do inherently quantal fluctuations transition to the observed inhomogeneities, which behave classically? ; and b) If the initial state of the universe was symmetric, how can the currently observed state? This dissertation is organized in three parts. Part one first introduces the slow-roll inflation model and then discusses the behavior of small (scalar) perturbations to this model.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCosmology and Gravitation Theories · Black Holes and Theoretical Physics · Relativity and Gravitational Theory
On the Quantum-to-Classical Transition of Primordial Perturbations
Wouter Ryssens
© Copyright by KU Leuven
Without written permission of the promotors and the authors it is forbidden to reproduce or adapt in any form or by any means any part of this publication. Requests for obtaining the right to reproduce or utilize parts of this publication should be addressed to KU Leuven, Faculteit Wetenschappen, Geel Huis, Kasteelpark Arenberg 11, 3001 Leuven (Heverlee), Telephone +32 16 32 14 01.
A written permission of the promotor is also required to use the methods, products, schematics and programs described in this work for industrial or commercial use, and for submitting this publication in scientific contests.
Contents
-
8 Classicality for Cosmological Perturbations: Pilot-Wave Theory
-
10 Cosmological Perturbations: a Test for Quantum Mechanics?
Introduction
At the time of its discovery, the cosmic microwave background generated extensive interest, culminating in half a Nobel prize for the discoverers Penzias and Wilson. This ‘relic radiation’ was found to consist of photons that have been travelling for billions of years, effectively starting at the time of decoupling and not stopping until they were captured in our detectors.
In fig. 1 we see a simulated image of the temperature distribution of the sky, as it could have been measured by Penzias and Wilson. Apart from contamination by our own Milky Way (the central white ‘band’), we see that the temperature distribution is extremely homogeneous. This can be regarded as the most important experimental confirmation made by the discovery of the background radiation: it confirms the cosmological principle. This is a very important principle in cosmology, stating that the universe is homogeneous and isotropic. Technically, this means that there are no preferred directions in space and that relevant observables as temperature and energy density are the same everywhere at a given instant of time. Of course, this principle applies only if we focus on sufficiently big scales, meaning scales that are far larger than typical supercluster size, typically above Mpc [57]. The evidence provided by the cosmic microwave background is compelling, illustrating that the universe was extremely homogeneous in the early part of it’s history. It should be noted that the cosmological principle is supported by some other observations, such as the distribution of radio galaxies.
When we accept the cosmological principle, we have to acknowledge the fact that the universe is not homogeneous and isotropic on small scales. The Milky Way is not spherically symmetric and the mass distribution inside our solar system is not homogeneous. In particular, the cosmic microwave background is not perfectly homogeneous. Deviations from the mean temperature are very small, but could eventually be detected. The three-year results of the WMAP mission are displayed on figure 2, displaying the deviations from the mean temperature in function of the direction on the sky. The fluctuations are of order with respect to the mean temperature and can be characterised in terms of their power spectrum.
These tiny fluctuations in the background radiation mean that there were tiny fluctuations in the energy density of the early universe. The anisotropies in the CMB provide us with a snapshot of this period. The general idea is then that these initial perturbations will amplify in time due to the attractive nature of gravity, in turn leading to the small-scale structure of the universe as we observe it today. The evolution of these perturbations after the recombination and decoupling epoch and the formation of structure in the universe is a very active field of research, with famous examples as Jeans theory and the Millennium Simulation. While it is not entirely understood how the structure has formed exactly, it is usually thought that these inhomogeneities are the direct cause of structure formation.
In this thesis, we will not be concerned with the evolution of these inhomogeneities after decoupling, but with their origin. In the 1980’s the theory of inflation was formulated, in response to a set of problems cosmologists faced. The theory of inflation puts forward that the universe went through a period of accelerated expansion, (partially) solving what was know as the flatness problem, the homogeneity problem and the magnetic monopoles problem. It turned out that a quantised version of this theory was able to explain the origin and the power spectrum of the observed inhomogeneities. This property of inflation is in these days regarded as a far more compelling reason to attach credibility to the theory of inflation than the original problems.
The idea of inflation is rather simple. We take the very early universe as a starting point, before the suspected time at which inflation is suspected to have started. The whole universe is assumed to be isotropic at this point in time, meaning that the energy distribution was homogeneous. But on very small scales quantum mechanical effects become important and, because of the Heisenberg uncertainty principle, this homogeneity is uncertain. This phenomenon is commonly called ‘quantum fluctuations’, but it is dangerous to engage in trying to sketch a picture of this, therefore we will not try to give the reader a more ‘visual’ idea of these fluctuations. At that moment the universe enters the epoch of inflation and all length scales expand enormously. The quantum effects get transferred to much larger scales and get ’frozen’ into fluctuations in the energy density. Eventually, at the moment of decoupling, scattered photons will get gravitationally redshifted by the energy density. The difference in mass distribution gives rise to a difference in redshift which gives rise to the difference in temperature we observe.
The picture sketched above already has one fundamental problem. The idea is that a quantum phenomenon gives rise to a classical phenomenon, quantum fluctuations giving rise to classical energy density perturbations. While quantum theory has been around since the early 20th century, we still do not fully understand how quantum theory reduces to the classical world we observe every day. This problem is especially severe in the cosmological context since the events we wish to consider are generally far removed in both space and time from us as observers. Several mechanisms have been proposed to solve the classicality problem in the context of primordial perturbations. So far, no proposition is completely compelling and the scientific community is divided.
We will conclude this introduction of the problem by citing some authorities in the field on the subject of these perturbations. Weinberg writes in [58]
‘* […] the field configurations must become locked into one of an ensemble of classical configurations, with ensemble averages given by the quantum expectation values calculated […]. It is not apparent just how this happens …*’
In [35] it is stated that
‘There remains the usual interpretation problem, arising whenever we talk about measurement in quantum theory. We would like to know why eigenstates of the field operator have been selected by Nature, since after all they provide only one from an infinity of possible ways of expanding the state vector. And we would like to be assured that one of the eigenstates was indeed selected in the early Universe, since otherwise it would seem that the pattern of say the cosmic microwave anisotropy may not have existed before it was observed.’
As a final example we cite Mukhanov, one of the founders of the theory of quantum inflationary perturbations, in [39]
‘If these galaxies originated from initial quantum fluctuations, a natural question arises: how does a galaxy, e.g. Andromeda, find itself at a particular place if the initial vacuum state was translational-invariant with no preferred position in space? Quantum mechanical unitary evolution does not destroy translational invariance and hence the answer to this question must lie in the transition from quantum fluctuations to classical inhomogeneities. Decoherence is a necessary condition for the emergence of classical inhomogeneities and can easily be justified for amplified cosmological perturbations. However, decoherence is not sufficient to explain the breaking of translational invariance. It can be shown that as a result of unitary evolution we obtain a state which is a superposition of many macroscopically different states, each corresponding to a particular realization of galaxy distribution.[…] Such a state is a close cosmic analog of the Schrödinger cat.’
Organisation of this Thesis
In part I we will construct the theoretical basis to be able to formulate the problem as specifically as possible. We will start in chapter 1 by investigating the nature and origin of the cosmic microwave background. The simplest model of inflation, known as slow-roll inflation, will be introduced. In chapter 2 we will then look at perturbations of the metric and the inflaton field to the homogeneous inflationary theory. At the end of this chapter, we will promote the theory to a quantum theory. The framework is then in place to discuss the nature of the inhomogeneities in chapter 3. First the observed inhomogeneities will be characterised, followed by a brief exposition of how the perturbations from chapter 2 are supposed to give rise to the inhomogeneities in the sky. The questions we will abord in this thesis can then be explicitly stated in section 12.
Part II of this thesis will focus on the usual solutions presented in the literature. The discussion will be opened in chapter 4 where we will treat the general question of the classical limit, using two exemplary physical systems. In chapter 5 we will critically review the proposed solution known as ‘squeezing’. Following that, we will explore the possibilities of decoherence in chapter 6. All solutions considered in this part of the thesis are proposed solutions that remain within the confines of quantum theory. We mention that there are several other schemes to explain the observed classicality within and outside standard quantum mechanics, the examples we have studied are far from exhaustive.
The final part of this thesis, part III, will be concerned with pilot-wave theory. We will introduce the framework in chapter 7 and immediately apply it to the examples used in chapter 4, studying what the classical limit entails in de Broglie-Bohm theory. This will be applied to the case of the primordial perturbations in chapter 8, where we will find that pilot-wave theory presents a rather natural solution to the problem. We will continue by presenting some simulations in this framework in chapter 9, thereby providing more insight in the actual dynamics of the perturbations in the context of pilot-wave mechanics. We will argue from these simulations that one can expect the mechanism that is provided reaches classical behavior fast enough to explain observations. In chapter 10, we will look at some other implications of pilot-wave theory in cosmology: it is possible that the difference between pilot-wave theory and standard quantum mechanics becomes observable in that context.
Part I Inhomogeneities in the Sky: Exploring the Problem
Chapter 1 The Cosmic Microwave Background and Inflation
When Penzias and Wilson reported an ‘all-pervading’ noise in their radio experiments at Bell labs in New Jersey, they paved the road for most of the observational cosmology of today. This background radiaton that pervades the earth from all directions is measured to have a temperature of , giving it a mean wavelength in the order of micrometers. Therefore, it is commonly called the Cosmic Microwave Background or the CMB in short. It is among the best sources of information we have on the early universe, and while it is the most accurately measured black-body spectrum in human history, there still are experiments under way attempting to improve our knowledge and understanding of this phenomenon. Most striking is the homogeneity of this background: the temperature is homogeneous to one part in for every direction in the sky!
In this chapter we will briefly look into the origin of the CMB, as it is explained by the hot Big Bang model.111The CMB can in fact be considered as a most striking experimental confirmation of the hot Big Bang model. However, the conclusions we can draw on the state of the early universe present us with some extra questions. We will then introduce the (simplest) model of inflation, slow-roll inflation, a model that provides an answer for several of this questions.A brief discussion of possible extensions of this simple model will be included. To finish this chapter, we will introduce the observed inhomogeneities in the CMB, the deviations of order that will be the primary focus of this thesis.
For the duration of this thesis, we will use natural units, meaning that . The only exception is chapter 4 where the explicit presence of the factors of is enlightening.
1 Origin of the Cosmic Microwave Background
We will begin our account by assuming the cosmological principle, stating that the universe is (in first approximation) isotropic and homogeneous on large scales. This assumption is supported very well by our observations of the homogeneity of the CMB. In addition, it is supported by the measurement of the celestial distribution of radio galaxies and the isotropic nature of Hubble’s law.
Under the assumption of the cosmological principle, the universe must be a space of constant curvature on large scales. There are thus only three very distinct possibilities. Either the universe has positive, negative or zero curvature, usually denoted by respectively. The metric dictated by general relativity is then the Friedmann-Lemaître-Robertson-Walker line element, or FLRW metric. It can be found in most textbooks on general relativity, as for instance [16].
[TABLE]
Here is called the scale factor. Note that it can only depend on time, since any spatial dependence would violate the cosmological principle. The only aspects of the metric that are not fixed by the cosmological principle are and .
The Einsteins equations are, in a universe without cosmological constant,
[TABLE]
where is the contracted curvature tensor, is the Ricci scalar and is the FLRW metric from eq.(1). To solve for and , we also need a suitable energy momentum tensor . One usually assumes that the universe is filled with a perfect fluid and the energy-momentum tensor is given by
[TABLE]
where and denote the density and the pressure of the fluid respectively, while denotes the velocity field four-vector of the liquid. This set of equations is usually supplemented by an equation of state for the fluid .
This set-up is well known and leads to two independent relations, called the Friedmann equations[33]
[TABLE]
where dots indicate derivatives with respect to time. An important quantity is clearly . It is called Hubble’s parameter and is usually denoted by .
We now make some simplifying assumptions. First, the universe we observe today seems to be flat, so we take . In addition, we take the fluid to consist of a mixture of matter and radiation. Perfect matter is a fluid without pressure, while radiation has the following simple equation of state
[TABLE]
We can solve eq. (5) for each component separately.
[TABLE]
For an expanding universe, the volume increases as , so the matter result comes as no surprise. Radiation, in addition to diluting, redshifts in an expanding universe, which results in an extra loss of energy and in the extra factor . Solving equation (4) for a mixture of matter and radiation can be done, but is very messy. To make things easier we imagine one of both components dominating. The solutions for are respectively given by
[TABLE]
where the constant is chosen in such a way that the present value of is .
We can now, very schematically, paint a picture of the different eras the universe evolved through. We start our evolution in a very small universe that we imagine to be dominated by radiation. Since the wavelengths of photons behave as , their energy will decrease as . Following this reasoning, the initial stages of the universe must have been very hot. In fact the temperature was so high that the available protons and electrons didn’t combine into hydrogen. Any hydrogen system that formed would almost instantly be ionised by the omnipresent photons, who carried a lot more energy than the eV required. As a consequence, the mean free path length for the photons was very small. It is at this time that the photons were able to establish thermal equilibrium with their surroundings, forming a black-body spectrum.
In this regime, the universe expanded as and thus the temperature behaved as . Since radiation density decreases a lot faster with time than matter energy density, eqs. (7), our approximation of radiation dominating the expansion gets worse and worse. At a certain time, the situation will be inverted: the matter energy density will be more important. We thus transfer to a regime where matter dominates, meaning .
While the universe expands, the photons redshift to lower and lower frequencies. Eventually the energy of the average photon drops to the point where it can no longer ionize hydrogen. Consequently, the protons and electrons combine into (mostly) hydrogen. This is called the period of recombination. The photons have less free electrons to scatter off and their mean free path increases.
Soon after, as the temperature decreases further, the photons lose the capability to excite hydrogen to any of its energy eigenstates and all the electrons drop to the ground state of hydrogen. The probability for the photons to interact with the hydrogen atoms becomes vanishingly small and the mean free path for a thermal photon becomes very large. This is commonly called the period of decoupling. The approximate temperature of the universe during this period was [33]. The photons we see now in the night sky are originating from a sphere of radius , where is the time elapsed between decoupling and our observations. This sphere is called the surface of last scattering.
We already established that the photons were in thermal equilibrium during the early stages of the universe. The initial energy distribution over the frequency domain of the photons was at that time
[TABLE]
However, we know that . The left hand side of the equation is an energy density: it has to scale as for a given . This is exactly what the numerator of the right hand side scales like. The denominator, we can take as constant in time, provided we take . The evolution of the universe preserves the form of the initial black-body spectrum, but transfers it to increasingly lower equilibrium temperatures.
This explains very roughly where the photons we observe in the night sky come from and why they essentially behave as being emitted by a black body at a temperature of . Models other than the hot Big Bang model have trouble explaining the black body spectrum. Because of this, the presence of the CMB can be thought of as a ‘smoking gun’ for the hot Big Bang model.
2 Problems of Hot Big Bang Cosmology
Hot big bang cosmology is, of course, not complete. In this section we will zoom in on three observations that are usually cited as the reason to implement inflation.
Flatness Problem
Turning back to the more general Friedmann equation (4), one can derive the following solution for the curvature
[TABLE]
Taking the value of the Hubble parameter now to be , the quantity is called the critical density. One immediately sees that the ratio of the present-day density and the critical density determines the type of universe we live in: positive, negative or zero curvature. Rewriting this equation in terms of this ratio
[TABLE]
Supposing that there is enough fluid density present to dominate over any curvature and/or cosmological constant terms, we will take as one of the solutions in eqs. (8). We then have as time evolution
[TABLE]
For both cases, we see that evolves away from , provided that . This means that a slight deviation from the critical density will be greatly amplified in time.
Current observations of type IA supernovae (‘standard candles’) and the CMB place the current ratio at , with an experimental uncertainty of the order of . So the value of can be at most . But this is a quantity that has strictly increased with time. To account for the present value, the inital value of would have to be very small. If one uses the time evolution of the universe, as described above, and evolves back to Planck time the condition on becomes [39]
[TABLE]
This is a very stringent condition on the initial condition of our universe indeed!
Horizon Problem
Looking at the CMB, we see that the temperature distribution across the sky is very homogeneous. Some rough estimates then lead us to what is called the ‘horizon problem’. The present scale of homogeneity (that we can observe) is at least the size of the observable universe, on the order of cm. Evolving our universe back into time, the homogeneity originated at time in a patch of size at least cm. Now at that time, light only had the time to travel a distance . Thus the scale of physical processes, the causal scale, was at most . Denoting the homogeneity scale by and the causal scale by and evaluating at Planck time we get[39]
[TABLE]
Thus, at time we have on our hands a homogeneous region, that is immensely bigger than the causal scale. Every possible equilibration process that could have occurred before could only have established homogeneity on scales up to . Thus, under standard cosmology, the observation of the homogeneity in the CMB implies that the initial state of the universe was already homogeneous on large scales!
Magnetic Monopole Problem
Several Grand Unifying Theories, i.e. superstring theory, predict a plethora of new particles at the Grand Unification Scale, the scale on which the four fundamental forces are thought to be unified. One of the most infamous of these are magnetic monopoles. We only note that these monopoles are predicted to be stable and they are extremely heavy, with masses on the order of GeV.
As we have seen, the temperature of the universe decreased as . We can conclude that the very early universe was an extremely hot place, and several GUT’s predict that a copious amount of monopoles would have been formed. Being very heavy, these monopoles would very fast behave non-relativistically and thus would rapidly behave as pressureless matter. Now the energy density of radiation falls as , while the density of the monopoles would fall as . The energy density of the monopoles would very fast have dominated the energy density of the radiation.
However, present day limits on the existence of monopoles are extremely stringent. The GUT-scenario, however, predicts stable magnetic monopoles, constituting a density far greater than the density of radiation in our present universe. This is called the monopole problem.222Historically, there was only the ‘monopole problem’. Following the developments in high-energy physics and GU theories, there are now several hypothetical particles that cause similar problems.
3 Inflation
Answering the problems
In the 80’s Guth [19] proposed a way to solve the problems touched upon in the previous section. The paradigm known as inflation promises to solve the flatness problem, the horizon problem and the magnetic monopole problem by hypothesising a period of extreme accelerated expansion in the early universe. We will treat the answers that inflation offers to these problems.
The rough picture goes as follows. An initially small patch in our universe undergoes, during a brief period, severe expansion. We assume that this patch has some initial nonzero curvature and some initial nonzero magnetic monopole density. The monopole density falls off as , so if the change in was big enough, this could bring the monopole density to below present experimental limits.
If our patch has some initial nonzero curvature, the ratio of the density and the critical density in that patch is different from . For decelerated expansion, we have seen that in this case evolves away from . However, if we assume an accelerating inflationary phase we get for the scale factor
[TABLE]
Thus, using eq. (11), we find that will evolve towards , instead of away from it. Inflation thus has the benefit of smoothing out the curvature.
Inflation also solves the horizon problem in a conceptually easy way. Patches of sky that seem to be causally disconnected today were in fact much closer together before the period of inflation. Thermal equilibrium would thus have been established before inflation, in a causally connected universe. Then the patches were causally removed from each other, by enormously enlarging the universe, accounting for the seemingly causally disconnected patches.
For inflation to account for observations, we need at least a factor of where is somewhere in the range between . The number is called the number of e-folds the universe has expanded. As typical value, one can safely use [35]. In orders of magnitude: .
Problems?
We close this section with some side remarks on the problems we presented in the last section. One must notice that none of these is a problem in the strict sense of speaking. The magnetic monopole problem is only a problem as soon as we introduce magnetic monopoles, which so far have absolutely not been observed. The flatness problem and horizon problem can both be solved, within a non-inflationary universe, by fine-tuning the initial condition of our universe. If we stay within hot big bang cosmology, these problems merely represent restrictions on the possible initial state of the universe. Such fine tuning is widely regarded as dissatisfactory, but in principle, it is not inconceivable that our universe started out with the special initial conditions required to form our present-day universe. One could question the motivations for the theory of inflation, as Penrose does in [42]. His arguments are the following.
The monopole problem, while certainly a problem for people working on superstring theory or other Grand Unified Theories, does not present itself in the context of standard cosmology, for the simple reason that the Standard Model does not incorporate magnetic monopoles or other exotic particles.
Given a certain amount of inflation, say e-folds, there certainly are manifolds that are not smooth on a scale of . Penrose takes as example the Mandelbrot fractal, a structure that does not smooth out when enlarged. One could argue that this is kind of a pathological example, but it is clear that there are a lot of possible initial spacetime structures that are not smoothed out by e-folds of inflation. In a way, we can say that the inflation paradigm limits the amount of required finetuning for the flatness problem, but does not eliminate it entirely.
The phrasing of the horizon problem also merits some extra attention. The observed homogeneity of the CMB can also be explained in terms of the second law of thermodynamics. Let us think of two causally disconnected regions as two isolating boxes containing some (very large) amount of gas molecules. The number of possibilities for the temperature in both boxes to be equal by far exceeds the number of possibilities for the boxes to have different temperatures. Thus, in the same way both gas containers do not need thermal equilibration, two causally disconnected regions in the universe do not need an equilibration to have the same temperature.
These arguments are not trying to disprove the inflationary paradigm in any way. They are meant to be sideremarks to the discussion in section 2.
Implementing Slow-Roll Inflation
The rest of this section will be devoted to the question how the hypothesis of inflation can be realised within the framework of general relativity. Our treatment will be brief and loosely based on [7], [35] and [9].
Looking back at the Friedmann equations, eq. (4) and eq. (5) for a flat universe
[TABLE]
It is clear that, when , a possible solution for is an increasing exponential.
We will construct a fluid satisfying in the following way. The action for a minimally coupled massless scalar field , that we will call the inflaton, in a potential is
[TABLE]
From the action, we can extract the energy momentum tensor
[TABLE]
We read off the pressure and the energy density
[TABLE]
So, for a homogeneous field that slowly varies in time, the potential will dominate both the pressure and the energy density, realising .
The Friedmann equations become for this fluid
[TABLE]
We will be looking for homogeneous solutions for , meaning that the space derivatives on are well approximated by zero333Even if this is a bad approximation at the onset of inflation, the factor will certainly quickly suppress the influence of the spatial inhomogeneities.. Secondly, we will assume that the potential dominates any kinetic energy that is present. Mathematically these two approximations are written
[TABLE]
Under the slow-roll and homogeneity assumptions, the Friedmann equations become approximately
[TABLE]
So we immediately see that will be approximately constant. Now, if we allow to be big, we cannot maintain the slow-roll approximation for long. We will thus look for solutions to these equations in an overdamped regime, meaning that we neglect . The condition for this to be a good approximation is
[TABLE]
This assumption, together with eqs. (23), are the assumptions of slow-roll inflation. These assumptions sketch the picture of a scalar field in a very flat potential, slowly rolling down the hill to some minimum of that potential. We can recast these assumptions to be assumptions on the potential only[35]
[TABLE]
A scalar field, moving in a potential that satisfies these constraints, will give rise to a constant term in the right hand side of eq. (24). The final result will be a scale factor that increases exponentially . Thus we have found a model that incorporates accelerating expansion of the universe, solving the problems we described in section 2.
To conclude our construction of the simplest inflationary scenario, we remark on the inflaton field. It is a hypothetical field that is thought to source inflation. It moves in the potential , but aside from eqs. (26), there are no restrictions on this potential. There is a wide variety of possibilities for the potential and there is no reason to suspect the inflaton field is one of the fields present in the Standard Model. Candidates for the inflaton field may be sought in supersymmetric theories or string theory.
Ending inflation
If inflation wishes to explain the present universe, there has to be a way for it to end. Let’s look at the second slow-roll condition, eq. (25). This assumption cannot be maintained forever, since the scalar field rolling down the potential will eventually pick-up some speed. Alternatively, one can look at this in terms of eqs. (26). We need in our present universe. would lead to a large cosmological constant in the Einstein equations, something that is not supported by observations today. It is thus clear that the assumption in the second equation of eqs. (26) cannot hold forever.
The breakdown of the second of the equations in eqs. (26) is usually cited as the moment slow-roll inflation ends. Eq. (24) will then describe an oscillator that is no longer in an overdamped regime. The inflaton field will start oscillating, while the term will damp the amplitude of the oscillation.
Usually, one also supposes that the inflaton is coupled to the particles of the Standard Model, allowing the inflaton field to decay into different combinations of elementary particles. Thus, one modifies eq. (24) by an extra term on the right hand side proportional to . Here is the total decay rate of the inflaton field into different kinds of particles. This extra term will result in extra damping of the oscillatory motion. During the inflationary phase, one presumes it to be dominated by the term . When inflation ends, will decrease and the decay term will take over, thereby increasing the damping of the amplitude of the oscillation.
This allows for a natural way to repopulate the universe. Since inflation expanded the universe enormously, it will also have cooled enormously. In addition, any initial density of particles present in the Standard Model will have been diluted. The photons and other elementary particles formed by a decaying inflaton field are then a very natural way to restore the temperature in the universe and repopulate it with the particles we see today. This process is called the process of reheating.444This explanation is a very basic one. Normally one should also include the effect of parametric resonances in the oscillator equation eq. 24, but this is outside the scope of this thesis. For more details, see for example [9].
It is however important to note that the reheating process is not understood very well. Since the nature of the inflaton field is unknown, we also know nothing of its decay rates and decay products. We will for the rest of this thesis neglect any couplings of the inflaton field to other particles, since we will mostly be considering the inflationary period itself.
4 Inhomogeneities
As said before, the CMB is homogeneous to one part in . That is a remarkable fact and good evidence in support of the cosmological principles. However, it also implies that there are inhomogeneities in the CMB of order . In chapter 3, we will go more into detail on characterising the observed anisotropies.
Intuitively it is very clear that these inhomogeneities can be explained by inhomogeneities in the matter distribution at the surface of last scattering. How these matter density perturbations exactly lead to the observed anisotropies can be explained by the nonintegrated Sachs-Wolfe effect. Very roughly, the scattered photons are gravitationally redshifted at the surface of last scattering. The amount of redshift depends on the distribution of matter. For small density perturbations, general relativity relates the shift in frequency of a photon travelling from to to the difference of the component of the perturbed metric between those places. It is in good approximation given by
[TABLE]
The actual situation is obviously more complicated, since this effect does not only play a role at the surface of last scattering. When this effect is also taken into account for the whole journey of the photons from the surface of last scattering to earth, the effect is called the integrated Sachs-Wolfe effect. We will not delve deeper into that, since it is outside the scope of this thesis.
What we will note however, is that hot big bang cosmology can not account for the presence of these density perturbations. Given initial perturbations in the density of the early universe, hot big bang cosmology can evolve them in time and preserve them for us to observe. It cannot tell us how these perturbations came to be.
In contrast, inflation can account for the generation of these inhomogeneities. Roughly speaking one considers the early universe before inflation and looks at vacuum of the inflaton field that is present. If we treat the inflaton vacuum quantum mechanically there are the Heisenberg uncertainties to be considered. In the standard situation, ‘quantum fluctuations’ of the vacuum only occur on very small length scales. However, inflation amplifies all length scales enormously, and transfers these vacuum fluctuations to observable scales. According to inflationary theory, the anisotropies we observe today are direct results from the quantum fluctuations that were present at the onset of inflation. Even more striking, inflationary theory can predict the observed spectrum of the anisotropies in the sky!
The explanation offered here is of course a sketch, but there is something amiss here. How exactly can the quantum mechanical fluctuations give rise to the density perturbations that behave classically? In chapter 2 we will first develop the formalism needed to describe the density perturbations, while we will go more in depth on the origin of the inhomogeneities in chapter 3.
5 Generalisations
The scenario sketched above is only one of many possibilities and is commonly known as chaotic inflation. It constitutes the most simple model for inflation, and we will use it as basis for the rest of this thesis. For completeness sake, we list here some possible generalisations. One can of course start by considering more than one inflaton field. Every inflaton field induces its own inflation era. There is no upper limit to the amount of inflaton fields one can have.
It is also possible to construct potentials of two scalar fields and such that the contribution to the energy density is mostly due to a very massive , for bigger than some critical value . One then usually takes to be a slowly rolling field, and when drops beneath the potential of acquires a minimum at . Then begins to move and inflation ends as the potential vanishes. In this way, one scalar field takes the role of the inflaton while another one takes the role of potential.
The exponential behavior for the scale factor presented above is not the only possible behavior. For example, Lucchin [34] proves that power-law inflation, with for , can also prove a satisfactory explanation for the flatness and horizon problem. To account for the correct spectrum of anisotropies, Lucchin shows that or .
Chapter 2 The Theory of Cosmological Perturbations
The aim of this chapter is to develop the theory of cosmological perturbations. We will look at the machinery that will allow us to talk about the impact of the primordial quantum fluctuations that are supposed to be the source of the inhomogeneities in the CMB.
To do that, we will first look at the classical picture. Supposing some initial inhomogeneities, what do the Einstein equations tell us about their time evolution? We will analyse deviations of the FLRW-metric and inhomogeneities of the inflaton, in order to find the relevant physical variables. This will turn out to be a single scalar field. Second, we will quantise the action of the fluctuations by imposing the canonical commutation relations. Thirdly, we take a look at what happens when we write the theory using creation and annihilation operators. We will mostly follow the treatment and notation from [39] and [10], but we will also use elements from [8], [57] and [6].
6 The Classical Theory
Metric Perturbations
As in the preceding chapter, we will consider a flat FLRW-universe, using the following metric
[TABLE]
In what follows, we will find it easier to work in a different coordinate system. We define conformal time as , so that the FLRW metric takes the form
[TABLE]
When ranges between and , ranges from to [math]. We are now interested in what happens to small deviations of this FLRW-metric. We identify
[TABLE]
where denotes the background FLRW-metric. Since we obviously want to transform as a tensor quantity, we will decompose it into parts, according to their transformation properties. Thus we acquire scalar, vector and tensor contributions to .
The time-time component behaves as a scalar and so we write
[TABLE]
where is some scalar function. The quantities , where ranges over spatial indices, are vectors. Any vector can be decomposed as the gradient of a scalar and a three-vector with zero divergence.
[TABLE]
The quantities should now behave as a tensor, and thus can be written as [39]
[TABLE]
where and are scalar functions, has zero divergence and the three-tensor satisfies the analogous constraints
[TABLE]
Briefly counting the independent degrees of freedom, we see that the scalar functions and bring three degrees of freedom. and both contribute two degrees of freedom, since they each satisfy a constraint. then, as a symmetric three-tensor satisfying four constraints, contributes two degrees of freedom. So we have ten degrees of freedom, exactly the number of independent components of the symmetric tensor .
Collecting all the terms, we can now write the metric of our universe if only scalar, vector or tensor perturbations are present. In that order
[TABLE]
It is now important to look at the behaviour of this metric perturbations under general coordinate transformations, since some of the ten degrees of freedom represent our freedom in the choice of coordinates. One can show that the simplest gauge invariant quantities we can construct are
[TABLE]
where primes denote derivatives with respect to conformal time. Note that is already gauge invariant. Thus, we have seven remaining degrees of freedom and three degrees of freedom can be attributed to our choice of coordinate system.
Einstein Equations
The aim is not only to perturb the metric around a flat FLRW universe, but also to consider perturbations of the energy momentum tensor. The Einstein equations are
[TABLE]
We split the Einsteintensor , where is linear in the metric perturbations. Dividing the energy-momentum tensor in a similar way we find the linearised Einstein equations
[TABLE]
As before, we need to identify the gauge invariant degrees of freedom first. One can show that the simplest gauge invariant quantities are
[TABLE]
and similar for . Thus we find
[TABLE]
As before, one could now decompose into the different contributions: scalar, vector and tensor perturbations. From the linear nature of eq. (46), we can already expect that the metric perturbations only couple to their corresponding energy-momentum perturbations; i.e. scalar metric perturbations couple to scalar energy-momentum perturbations and analogously for the vector and tensor perturbations.
Physically, scalar perturbations correspond to density perturbations, while vector perturbations are related to the rotational motion of our cosmic liquids and tensor modes are associated with gravitational waves. However, vector perturbations do not grow in time and tensor perturbations contribute far less to the energy density and pressure[10]555Vector and tensor perturbations are less important for cosmic structure formation, but they are important for observations of the CMB[6]. The Planck mission[1] will in fact look for B-mode polarisation in the CMB, a signature of vector and tensor perturbations.. A typical ratio of tensor contributions to scalar contributions is in the range of to [39]. In what follows, we will restrict ourselves to scalar perturbations as they are the ones that contribute most to the density variations in our universe, e.g. we will be concerned with the linearised Einstein equations using the metric of eq. (35).
Introducing the Inflaton Field
Now we suppose that the universe is filled with a minimally coupled real scalar field , the inflaton field that is subject to a potential , with action given by eq. (18). The inflaton then has an energy-momentum tensor
[TABLE]
As before, see chapter 1, one can regard this scalar field as a perfect fluid.666In fact one can do everything that follows for different fluids, as is done in [39] and [2]. One retains then an extra parameter in , the speed of sound, that depends on the equation of state of the liquid. For the canonical scalar field that we use, is equal to the speed of light.
Let’s now perturb this inflaton field around it’s mean value: . The equations of motion to linear order in the perturbations become
[TABLE]
We notice that the background variable still obeys the equations of motion from the previous chapter, to first order. From the analogues of eq. (43) for we find that the relevant gauge invariant quantity in these equations is
[TABLE]
The gauge invariant equation of motion for the scalar perturbations becomes
[TABLE]
It is important to note that this last equation was derived using eq. (48) and (49). In [39] one proceeds to show that the gauge invariant quantities and need to be equal. The metric from eq. (35) then reduces to
[TABLE]
Notice that this is very reminiscent of the Newtonian limit of General Relativity: with we can identify with the Newtonian potential. The variable usually goes by the name of Bardeen potential. It is the only physical degree of freedom left for the metric perturbations.
Equations of Motion for a Slow-Roll Theory
We are now in a position to actually solve the linearised Einstein equations, eq. (46), by substituting equation (47) and calculating the gauge invariant perturbation of the Einstein Tensor with metric given by eq. (52). This involves a lot of tedious algebra that we will omit here. The result is [10]
[TABLE]
where is called the Mukhanov-Sasaki variable. While we omitted the algebra involved, it is important to note that the background equations for and , eq. (51) and eq. (29), were of critical importance in obtaining this result.
It may seem strange that we retain only one physical degree of freedom at the end of this story. However, the metric fluctuations contain only one degree of freedom, as do the inflaton perturbations. These degrees of freedom satisfy a constraint: the Einstein equations. It is in this way that only one degree of freedom remains: the Mukhanov-Sasaki variable. We can already note that is not the physical variable: the gauge invariant perturbation is . includes an extra scaling and we conclude that the physical variable is .
In the case of slow-roll inflation we can further simplify the equations of motion. Using the approximations in eqs. (23) and treating as a constant we can show that
[TABLE]
For a perfect slowly rolling field, the second and third term are zero. In more general slow-roll inflation, we can safely ignore the second and third term. Thus we find that the equation of motion in the regime of slow-roll inflation is given by
[TABLE]
Given a background , a solution to some slow-roll model, one could now compute the classical evolution of perturbations. For this however, we need some initial conditions. Solving this equation is what we will do in the next paragraph.
Solving the Equations of Motion
Lets first look at eq. (53) in more detail. It is a generalisation of the Klein-Gordon equation for the scalar field on expanding spacetime. As in standard field theory, we can decompose the Mukhanov-Sasaki variable in Fourier components
[TABLE]
And it is now easy to check that the components satisfy the condition
[TABLE]
How to make sense of this equation? There are two limits in which the solution is clear. As explained in the previous chapter, we imagine the observable part of the universe as originating from a small volume of space of a typical scale on the order of . The physically relevant modes are then those with wavelengths well inside the Hubble radius: or equivalently . Thus, at the onset of inflation, we imagine that gravitational effects are not yet important for these modes, as their wavelength is significantly smaller than the Hubble scale. In this regime, we expect to dominate over . Eq. (59) reduces (as it should) to the Klein-Gordon equation for a free scalar field on Minkowski spacetime. The solutions to eq. (59) are plane waves. Then as inflation progresses, the physical wavelengths grow in time, as discussed in the previous chapter. At a certain moment, the physical wavelength will equal the Hubble radius. On this scale, the expansion of space becomes important for the time evolution of the mode . Thus the term becomes more and more important. In the limit where the solutions behave essentially as . Thus, for , , where is some complex constant. The amplitude of the Mukhanov-Sasaki variable grows rapidly in this regime.
The transition between both regimes is commonly called the Hubble exit of the mode. The condition for this to occur is that . For exponential and power-law inflation this means . The regimes are thus characterised by : the mode is a free field on Minkowski spacetime when and grows in time proportional to the scale factor when .
One can also draw the conclusion that the system described by a single Fourier mode is in fact a time dependent harmonic oscillator. The frequency of the system is then
[TABLE]
Indeed, when is positive, the solutions are plane waves, Re and Im oscillate. But when the mode exits the Hubble radius changes sign. The system then transforms into an upside down harmonic oscillator and will go ‘rolling down the hill’. During the rest of this thesis we will use the upside-down harmonic oscillator as a toy model and a useful analogy.
Owing to the above analysis, natural initial conditions for the Cauchy problem posed by eq. (59) are given by a free field on Minkowski space. Thus, denoting the time when inflation begins with
[TABLE]
For exponential and power-law inflation, where the scale factor is proportional to , one can solve eq. (59) supplemented by these initial conditions exactly. The solution is given in terms of Hankel functions of the first kind of degree [36]
[TABLE]
For de Sitter-expansion (), this reduces to [57]
[TABLE]
This solution is clearly a plane wave for big, while it becomes proportional to when becomes small. Some additional properties of these solutions to the classical equations of motion for general inflationary models are given in appendix D.
7 An Overview
To promote the outlined theory to a quantum theory, we need an action to quantise. Let’s use the opportunity to briefly redo what we did in the previous chapter and section, in a conceptually more transparent way.777This way is however a lot more computationally intensive.
We started from the Einstein-Hilbert action and the action for a minimally coupled real massless scalar field , as in chapter 1
[TABLE]
and by varying with respect to the metric we found the equations of motion, being the Einstein field equations. By applying the cosmological principle we found the FLRW metric and the Friedmann equations. As a result we get homogeneous background solutions and , where the subscript stands for ‘unperturbed’.
Now, we look at perturbations around the solutions to these equations of motion. We write
[TABLE]
where is the FLRW metric, incorporating some scale factor , where the subscript now stands for ‘perturbed’. Using this division between background and perturbation, we identified the gauge invariant parts of the perturbations in equations (38), (39) and (40). As before, we restrict ourselves to scalar perturbations. The following step is writing the Einstein-Hilbert action up to second order in the gauge invariant scalar perturbations, denoted by bars.
[TABLE]
Here is the first order perturbation, the second order. For small perturbations, we can assume that the equations of motion for and are still solved by the same solutions as before. In other words, we assume that and . Since these are solutions to the equations of motion, we know that vanishes888This immediately explains why we wrote the action up to second order.. Thus, by choosing these particular solutions, becomes fixed and we are left with . In terms of the Mukhanov-Sasaki variable equals [39]
[TABLE]
where is the Minkowski metric. Varying this action with respect to yields eq. (53), or eq.(59) in terms of Fourier modes.
8 The Quantum Theory
Semiclassical versus Quantum Theory
It is the action from eq. (67) that we will now quantise and promote our theory from a classical to a semi-classical one. The quantised version of this action will not constitute a full quantum theory of cosmological perturbations since the full action is in fact given by eq. (66). A complete theory would thus quantise the background variables as well as the Mukhanov-Sasaki variable . It is clear that quantising the full action is a difficult matter, not only computationally, but also conceptually. The result of such quantisation would be wavefunctions for and interpreting those is not a trivial matter. By only quantising the action for the perturbations, we avoid such problems for the moment. Later on, in section 26, we will return to the problem of quantising the full action.
Quantisation
To quantise the action in eq. (67), we will first transfer from the Lagrangian to the Hamiltonian formalism. The momentum adjoint to is
[TABLE]
And thus the Hamiltonian reads
[TABLE]
Inspecting this Hamiltonian, we see that quantising the Mukhanov-Sasaki variable is analogous to quantising a scalar field on flat Minkowski space with a time-dependent mass .
To quantise the theory, we want to promote and to operators and and impose the equal-time commutation relations on them.
[TABLE]
Proceeding as in the standard case, we decompose into its Fourier components
[TABLE]
Because our field is real, we demand that . The Hamiltonian in terms of Fourier modes, with , then becomes
[TABLE]
and we notice that the Fourier modes are all decoupled, as in standard field theory. This decoupling happens only because we are dealing with a perturbative theory: expanding the action from eq. (66) to higher order would yield interaction terms in the field theory of , as for example done in [38].
Annihilation and Creation Operators
It will pay off to look at the theory in its second-quantised form, in other words, in terms of annihilation and creation operators, as for example done in [31].
We will start by decomposing the position and momentum operators:
[TABLE]
Such a transformation is called a Bogoliubov transformation. For the case of scalar field on flat Minkowski space, the usual decomposition is given by and . In our case, the functions will depend on time since we have a time dependent Hamiltonian due to the growth of .
Let’s further specialise using the action from eq. (67). Imposing the equal-time commutation relations, eqs. (70), implies the following commutation relations for and
[TABLE]
If we want to recover the usual interpretation of and as annihilation and creation operators, we should impose a condition on the time dependent functions and
[TABLE]
Using this condition, we can easily invert eq. (74)
[TABLE]
In addition, the Heisenberg equations of motion read
[TABLE]
This implies that . It is in this context that we can regard eq.(77) as a Wronskian condition on the system of differential equations defined by eq. (79) and eq. (80). It imposes the need for linear independent solutions to these equations. Moreover, since we want our field to behave as a free field on Minkowski space for very large values of , we will choose as initial conditions for and
[TABLE]
Filling in the Bogoliubov transformation, eq. (74), we find that has to obey the following equation
[TABLE]
We notice that this is exactly the classical equation of motion, eq. (53)!
As a consistency check, we can check that the creation operator is indeed time independent. Using the Wronskian condition and noting that is a solution to the classical equations of motion we get
[TABLE]
We conclude that both and are time independent. This means that the operator will be an invariant of motion. Note that this means that -particle states are robust under time evolution.999This fact will be used in appendix A to derive the wavefunctions of similar systems.
For each one can now continue to define a vacuum in the Fock space of and by demanding
[TABLE]
In general, one defines a state by applying repeatedly:
[TABLE]
To finish, let us briefly look at the possible interpretation of the states . When is big, our field evolves as if in Minkowski space, and the Bogoliubov transformation, eq. (74), reflects that. The annihilation and creation operators are the canonical ones, and we can safely interpret the states as a state with particles (or quanta of energy) with momentum .
However, when is small, when the mode has exited the Hubble radius, and are far from the canonical creation and annihilation operators of a field living on flat Minkowski space. As noted before, this is due to the time dependence of the frequency of our harmonic oscillator, especially because the potential changes sign when . In appendix B we consider a different picture, starting from a different action that is equivalent to the on used here on the classical level. Its interpretation when is small is more straightforward.
Chapter 3 Inhomogeneities in the CMB: An Inflationary Explanation?
In chapter 1, we discussed the cosmic microwave background, as it is measured by us. It appears to be extremely homogeneous, with deviations from uniformity of the order . To explain this homogeneity the inflationary model was developed. However, we now need a mechanism to explain the inhomogeneities and explain their apparent random distribution across the sky.
In this chapter we explain the standard account of how quantum fluctuations of the Mukhanov-Sasaki variable, introduced in the previous chapter, give rise to density perturbations in the early universe. At first sight, we will find that the theory explains the observed pattern in the sky. Careful inspection however, will reveal that there is some missing element. There is no mechanism that explains how the quantum fluctuations give rise to classical density perturbations. At the end of this chapter we will try to formulate this problem as precise as possible, in order to have a solid footing for the discussion on various proposed explanations.
9 Characterising the CMB Inhomogeneities
To find a workable theory, we will first need to understand what we see in the sky. The inhomogeneities in the CMB seem random, but what do we mean by random? We can decompose the observed relative temperature variations in spherical harmonics
[TABLE]
where the angles and characterize a certain direction in the sky and are the spherical harmonics of degree and order . A more precise formulation of random is now evident: the coefficients are independent random variables, each one distributed according to some probability distribution . In addition, we impose that the universe is rotationally invariant, so the fluctuations must be isotropic. For this reason, we restrict random to mean that the distributions are independent of direction, and thus not depending on .
The CMB can be thought of as a particular realization of the . One could imagine ourselves living in a different universe, where the coefficients would be different. If we have a large number of such universes, all the would be distributed according to their distribution . These reflections already touch the problem upon that we only have one universe. A decomposition of the entire sky, would only give us one set of . For a single realization of random parameters, it is impossible to deduce anything about their distribution .
In this kind of matter, it is often convenient not to talk about the values of the . More manageable quantities are the correlators , which we write down as
[TABLE]
where we call the spectrum. Rotation invariance makes it independent of and the Kronecker deltas make sure that the different are independent variables. One can continue to define higher order correlators, and in fact, one can completely specify the function in terms of all its correlators.
Experimentally, one can find out more about by dividing the sky into patches. Suppose we want information for some fixed . We then divide the sky into patches, in such a way that the size of a typical patch is inversely proportional to , meaning that we take the patch size proportional to the wavelength of the perturbations we consider. For each such patch, we then decompose the available temperature fluctuations and average over . This gives us values of .
The common approach is then to calculate the two-point-correlators, eq. (87), for each patch. By virtue of the ergodic theorem [35], a spatial average is a good estimator of an average over an ensemble in this case. What it means, is that we can for all practical (read as ‘averaging’) purposes understand each patch as a ‘distinct universe’. Thus we have values for the two point correlator, and we can start statistical analysis. However, one typically works with the dimensionless quantity which is called the power spectrum of the inhomogeneities. The result of the WMAP experiment is usually displayed as in fig. 3.
In this picture, the multipole moments are plotted on the horizontal axis, while the power spectrum is plotted on the vertical scale. There are many very distinct features on this graph that have been adequately studied during the last decades. The appearance of acoustic peaks for instance and their location and height has been the subject of extensive study. We will not go into the details of this spectrum because the majority of them is explained by acoustic theory in the post-inflationary universe. In short, we are more concerned with the origin of the perturbations than with their evolution through the non-inflationary epochs. A more in depth explanation of the spectrum can be found in [57], [35]. An animated, but more intuitive approach can be found at [22].
In the previous paragraph we have (very briefly) studied the experimental observations of the temperature variation across the sky. Thus, any theory about cosmological perturbations should be able to explain a spectrum like the one we see, e.g. in fig. 3. As it turns out, the acoustic theory due to Harrison and Z’eldovich (see chapter 1) is able to reproduce excellently the observed spectrum, granted that the density perturbations at the end of the inflationary epoch have a Harrison-Z’eldovich spectrum. This power spectrum is easy to describe: is independent of . For this reason it is commonly called a scale invariant power spectrum.
10 Spectrum of
To see how one gets from a quantum mechanical Hamiltonian, eq. (73), to a scale invariant power spectrum, we start by considering the initial state of the universe. Generically one assumes the Mukhanov-Sasaki variable to start evolving from the Bunch-Davies vacuum, defined as the product state of the vacuum for all modes
[TABLE]
The usual argument is that the Bunch-Davies vacuum is the most symmetric state possible, and thus is the most ‘natural’ initial condition for our universe to be in. A second argument, given in [50], is that we want to explain the appearance of inhomogeneities in our universe, not assume them from the beginning by choosing an excited state for the Mukhanov-Sasaki variable.
To see what happens, one needs to know the expansion of Fourier components of the Mukhanov-Sasaki variable, into spherical components . One can show that
[TABLE]
In [35], one continues to show that the correlators for the then are given by
[TABLE]
where, as before, the are the Fourier components of and
[TABLE]
The factor is called the spectrum of the Mukhanov-Sasaki variable and can be found by computing the correlators for the Fourier components. Using the commutation relations for the annihilation and creation operators present in , direct computation gives us the important equation
[TABLE]
However, we should remember from the previous chapter that isn’t the physical variable. It is the rescaling of the fluctuations with a factor , thus the physical spectrum is given by
[TABLE]
For exponential and power-law inflation, the ratio is proportional to for late times [31]. Taking as example exponential inflation, , is given by eq. (63) and the power spectrum corresponding to this spectrum becomes
[TABLE]
And thus, for any mode that didn’t leave the Hubble radius too late (), this power spectrum is (approximately) scale invariant. Similar calculations can be done for other and in fact, a (nearly) scale invariant power spectrum is a very generic prediction for a lot of inflationary models.
It is important to note, that it is a valid approximation to neglect the term in for all the modes that are discernable across the sky. By this we mean, that all the wavelengths observed by WMAP crossed the Hubble radius early enough[57]. Let’s make a (very crude) estimate. We will take the end of inflation at after the big bang, while decoupling happened approximately at after the big bang[33] and we are at present at about . Taking the Hubble radius at present to be (very) roughly and the angular resolution of the WMAP experiment to be of order , we get that the smallest intrinsic wavelength we can discern in the CMB is of order
[TABLE]
In a matter dominated universe so we have
[TABLE]
Continuing past decoupling, we have and we get for the physical wavelength at the end of inflation
[TABLE]
For the Hubble radius during matter and radiation dominance
[TABLE]
So at the end of inflation, the ratio was of order ! We can thus safely assume that the modes we see in the sky were well outside the Hubble radius.
It is also illustrative to note that eq. (94) doesn’t change much when we consider initial states that are not the Bunch-Davies vacuum. Excited states will contribute extra constant terms to the analogue of eq. (92). Thus, the contribution of these terms to eq. (94) decreases in time as . Given that the energy contained in these excitations is not too far from the ground-state energy, we can safely neglect such terms. This can be regarded as another reason to adopt the Bunch-Davies vacuum as initial state.
11 Spectrum of the Density Fluctuations
While we succesfully recovered a power spectrum that is (approximately) scale invariant, it is now time to answer the question how fluctuations of the Mukhanov-Sasaki variable result in density perturbations. These density perturbations will then account for the observed inhomogeneities in the CMB, via the Sachs-Wolfe effect (see chapter 1).
The standard account starts by looking for a solution to the Schrödinger equation, that corresponds to the ground state of the system: the Bunch-Davies vacuum. That wavefunction can be found in [37]. In appendix A we construct the wavefunctions for generic states of , and the initial ground state is given by101010Notice that this is the wavefunction for the ground state of the action (67). For the alternative action (356) one can also construct similar wavefunctions, as for instance found in [30] and [44].
[TABLE]
where is a normalisation constant, that is irrelevant for now. The first thing to notice is that this wavefunction is a Gaussian for each mode . The width of this Gaussian can be read off and using the Wronskian condition 77
[TABLE]
This means that the physical variable has Fourier coefficients with Gaussian wavefunctions of width given by
[TABLE]
Noting that in the slow-roll approximation dominates the kinetic energy, the energy density of the inflaton field is given by
[TABLE]
Thus, the classical first order energy density perturbations are related to the classical(!) Mukhanov-Sasaki variable by
[TABLE]
where refers to the homogeneous background inflaton field. We are more interested in the contrast of density perturbations in momentum space
[TABLE]
We see that classical density perturbations are seeded by the classical . One now takes the classical as ‘mimicking’ [40] the quantum mechanical . More precisely, one takes the to be classical ‘random’ fields, distributed according to a Gaussian distribution of width given by eq. (100). That means that we fill in eq. (101) as an approximation to . We get
[TABLE]
Taking as example de Sitter inflation, this becomes in the late time limit
[TABLE]
We regain a spectrum for varying as . Equivalently we get a (nearly) scale invariant power spectrum . As said before, power law inflation for different also leads to a scale invariant power spectrum. Now we have shown that the density perturbations have a scale invariant spectrum, we can let the Wolf-Sachs effect and the acoustic theory of Harrison and Z’eldovich do their job: the result is the power spectrum we see in the cosmic microwave background!
12 The Problem
At first sight, the theory sketched above seems solid. Given that it succeeds in accounting for the observed inhomogeneities in the CMB, this is a veritable triumph for physics! However, this thesis wouldn’t be written if there wasn’t something to complain about. The replacement of by in eq. (105) is poorly understood. The essence is that we are replacing quantum mechanical quantities () by classical quantities (). Several people in the literature recognize this problem. For instance111111In addition, we remind the reader of the various quotes we presented in the introduction of this thesis., Sudarsky writes in [50],
‘…, we must carefully inquire into the justifications for the identification of these quantum uncertainties and the perturbation spectrum for the seeds of structure that characterize our Universe.’
Or even more suggestively, in [43]:
‘* The observed similarity between the spectral properties of the (classical) density fluctuations in the early universe and the quantum mechanical fluctuations of the inflaton field after inflation indeed suggest much more than a coincidence. However, we want to draw attention to the fact that a detailed understanding of the process that leads from the quantum mechanical to the classical fluctuations is lacking. What is needed to justify the connection between the two spectra is a mechanism that transforms quantum mechanical uncertainties into classical density fluctuations.*’
There are in fact a lot of theories that couple quantum mechanical and classical quantities, let’s say and . In such cases, computations are made in the quantum mechanical regime, using . In the end, the quantity is then computed by identifying it with the expected value of . In this way, one does not get a fundamental theory. In many cases however, this does produce an effective theory that is quite good at explaining the experimental results. For example, in solid state physics, the equation of motion for electrons in a crystal are obtained by equating their classical momentum by the expected value of the quantum mechanical operator . The result is then interpreted as being Newtons second law for a so-called effective mass of the electron.
However, there is an important difference for cosmological perturbations. In going from eq. (104) to eq. (105), we didn’t replace by the expected value of , but by its standard deviation! Notice that the expected value of in the Bunch-Davies vacuum is in fact zero, so that replacing by its expected value would predict no inhomogeneities at all. The claim that there is an analogue between cosmological perturbations and other semi-classical theories, like the effective mass description in solid state physics, is thus unfounded.
We are touching an important aspect of the problem, when we note that the expected value of the operator is zero. This is rooted in the fact that our vacuum state is completely symmetric. On the other hand, it is clear that the result of eq. (105) is an asymmetric state for the density perturbations. Somewhere along the line, some mechanism broke the symmetry present in the vacuum state with as result the particular realisation of density perturbations we can infer from the CMB. It is clear that standard unitary evolution following the Schrödinger equation cannot result in any loss of symmetry. There is only one mechanism present that can account for such a symmetry breaking, namely a collapse of the wavefunction. In standard quantum mechanics this type of symmetry breaking is typically induced by a measurement.
If we call upon a measurement as symmetry breaking mechanism we have to ask the obvious question: who is the observer? It can’t be the WMAP probe, or for that matter, any human or device created by humanity, for the simple reason that we ourselves are the product of the concrete realisation of perturbations that we observe. These perturbations are presumed to source the formation of superclusters, clusters, galaxies, stars and ultimately ourselves. Thus the CMB had to be inhomogeneous long before we observed the inhomogeneities in 1992 with the COBE probe. We are clearly not the relevant observers. Moreover, it is not clear what constituted such a measurement, let alone what the measured observable was.
In the next part of this thesis we will look at some standard ideas that claim to solve these problems. Before we move on, we’ll list the questions that need to be asked.
- I
How does one go from an initial homogeneous and isotropic state to a state with inhomogeneities? 2. II
Why can we consider the density perturbations to behave essentially classical at the end of inflation?
Part II The Quantum-to-Classical Transition: Usual Solutions
Chapter 4 The Classical Limit of Quantum Mechanics
Recall question number two at the end of last chapter:
‘Why can we expect the density perturbations to behave classically at the end of inflation?’
This chapter will be devoted to the more general case of this question:
‘Which quantum mechanical systems can we expect to behave classically?’
This is a very old question, dating back to a letter Lorentz wrote to Schrödinger [48]. Lorentz wrote to his colleague on the subject of the quantum mechanical theory of the hydrogen atom:
‘If we decide to dissolve the electron completely, so to speak, and to replace this by a system of waves this has both an advantage and a disadvantage. […] But then how would I understand the phenomena of photo-electricity and the emission of electrons from heated metals? […] But if we take a wave packet as model for an electron, then by doing so we block the way of restoring matters. For it is indeed asking a lot that a wave packet should condense itself again once it has lost its shape.’
In fact, this is still the core of the problem of the classical limit of quantum mechanics: how can we reconcile the spreading of wave packets, inherent in the Schrödinger equation, with the image of the world we see around us, full of seemingly well located particles with well defined momenta?
Since it is not very clear what classically in our question means, we will be focusing on different characterisations of this concept within the framework of standard quantum mechanics. We will treat briefly the role of Heisenbergs uncertainty principle, the WKB approximation and the Wigner function for the toy models of a free particle and the inverted oscillator. For the latter, we will include a treatment of squeezing, a concept not applicable to the free particle.
We will look at the importance of the various constants in our model and how they relate to the classical limit. For this reason, we will be keeping all factors of intact throughout this chapter. The treatment of decoherence will postponed untill chapter 6.
13 Criteria for the Free Particle
In this section, we will look at the most simple system in quantum mechanics: the free particle in one dimension. Starting from the classical Hamiltonian, the correspondence principle gives us the quantum Hamiltonian
[TABLE]
where is the mass of our particle and is the momentum conjugate to , satisfying the canonical commutation relation
[TABLE]
The Schrödinger equation reads
[TABLE]
Energy eigenfunctions are then given by de Broglie’s plane waves, and any solution to Schrödingers equation can be thought of as a packet of plane waves. We will consider a Gaussian wavepacket as initial state. It is given by
[TABLE]
Uncertainty
The most striking aspect of quantum mechanics is perhaps the concept of uncertainty. Heisenberg’s uncertainty relations indicate that in standard quantum mechanics there are no such things as particles with welldefined trajectories. Or in other words, if we adhere to the Copenhagen Interpretation of quantum mechanics we cannot think of particles as having a definite position and momentum. In contrast, classical mechanics treats pointlike particles (or pointlike centers of mass) with welldefined momenta.
Thus, to regain classical mechanics, one would like the uncertainties in position and momentum to be at least ‘small’. We of course need something to compare these uncertainties to, and a logical choice is the scale of the experiment we have in question.
Working in the Heisenberg picture and using Heisenbergs equation of motion for the operators and , one can easily find and
[TABLE]
If we imagine our system to be initially in the state given by eq. (110), the initial uncertainties are so that the Heisenberg uncertainty principle is satisfied as an equality
[TABLE]
The terms linear in time in eq. (111) are zero for such a Gaussian packet, and the uncertainty in rises as . We see that any initial uncertainty gets amplified as time goes on. This phenomenon is the result of the dispersive aspect of the Schrödinger equation.
Let us be more precise, we can say that our wavepacket is not any more of minimum uncertainty once
[TABLE]
or thus when is comparable to . If we want to retain minimum uncertainty as long as possible, we had better have that be big for a given . This in fact corresponds to our intuition: pretty ‘heavy’ objects remain rather localised in position and momentum space.
One last remark is in place: however heavy our particle is, there will always be a time so that the packet is no longer of minimum uncertainty. Thus, every free particle will eventually be badly localised. However, any realistic physical system that may be described by a free particle on short time scales is far from ‘free’ if we look at longer and longer timescales. As we will see in chapter 6, interactions tend to damp the dispersive character of wavefunctions.
WKB approximation
Another criterion commonly used to characterise the classical limit is the Wentzel-Kramers-Brillouin (WKB) approximation. One can derive it by rewriting the Schrödinger equation. We will first look at the most general case, meaning a particle moving in a potential . This treatment is based on [21].
Writing as , for real and positive and real, we find that the Schrödinger equation for a free particle decouples into
[TABLE]
Since , we recognize in the second equation the continuity equation for , expressing the conservation of probability. Here, we will not delve into it further. The first equation can be recognized as the Hamilton-Jacobi equation for a particle moving in a potential and an extra potential given by
[TABLE]
This quantity is often called the quantum potential for this reason. If, for some reason, this quantum potential would be small in comparison with the typical energy, then we would regain something that looks like the classical Hamilton-Jacobi equation.
One typically continues by assuming a form , and assuming that can be expanded in powers of .
[TABLE]
We then look for solutions to the stationary Schrödinger equation, for some energy eigenvalue . Since we are looking for the classical limit, we only consider the case , since the opposite has no classical analog.
[TABLE]
Filling in eq. (118), and collecting powers of , we get for the first and second order [21]
[TABLE]
The third order term is negligible in comparison with the first and second order if
[TABLE]
Interpreting the quantity in the denominator as the classical momentum to the third power, we find that this is in fact a condition on the de Broglie wavelength
[TABLE]
where is given by
[TABLE]
Now we note that, under the condition of a suitably smooth , the resulting wavefunctions become
[TABLE]
Now we proceed to calculate the quantum potential for these wavefunctions
[TABLE]
The WKB condition, eq. (123), and the conditions on the higher derivatives then gives us that the quantum potential is negligible in comparison with the kinetic energy.
[TABLE]
Thus, for this type of wavefunction, the quantum Hamilton-Jacobi equation, eq. (115) reduces to classical form, and the criterion for WKB classicality is reached.
One has to note however, that this notion of classicality is only restricted to states of the form stated in eq. (125). It does not in general apply to linear combinations of the WKB wavefunctions. The example given in [21] is very clear. Say we take , with some normalisation coefficient. We can write the amplitude of the total wavefunction then as follows
[TABLE]
Since we assumed to be large, this is clearly a rapidly oscillating function. The quantum potential can be calculated to be and hence is very large by assumption. Thus, WKB classicality is only valid for the WKB energy eigenstates and certainly not general superpositions.
For the free particle, only the first and second order in eq. (118) contribute. This is no surprise, since the de Broglie wavelength is obviously conserved. The WKB wavefunctions are the exact eigenfunctions . These are thus classical in the WKB sense.
WKB Classicality?
One could ask what this ‘WKB classicality’ actually means. We certainly have that eq. (115) formally resembles the classical Hamilton-Jacobi equation. What we certainly don’t have in this situation is the concept of definite positions and momenta. Note that a measurement of or will in general be undetermined, meaning that the wavefunction is certainly not peaked. In fact, the WKB condition dictates that does not vary too much, thus making an approximate uniform distribution. We conclude that the classical limit is not truly reached.
Wigner Function
To a given wavefunction in one dimension one associates a Wigner function using the relation
[TABLE]
Given a wavefunction on momentum space , an equivalent definition of the Wigner function is
[TABLE]
If is normalised then is also.
The most important property of this function is the following. For any operator the following equality holds
[TABLE]
It is this property that elicits a simple interpretation of the Wigner function: one can regard it as a probability on classical phase space. However, this function is not necessarily positive everywhere; for this reason it is often called the Wigner quasi-probability distribution. For the free particle wavefunction in eq. (110) one can find the explicit form of the Wigner function by solving some Gaussian integrals. It is given by
[TABLE]
We see that this function is never negative, so that we can safely interpret this as a probability distribution. In addition, we see that the distribution is peaked along the classical trajectories . In this way, one could say that the classical limit is reached. Note that the spreading of this classical distribution can never be zero for both and ! This is due to the Heisenberg uncertainty principle.
Wigner Classicality?
The same argument again applies: the concept of the Wigner function being peaked on classical trajectories does not imply anything for the results of measurements of and . The wavefunction is still unaltered and the uncertainties in and still obey the same time evolution. Even more crudely: there is no reason a priori to interpret the arguments of the Wigner function as classical position and momentum within the framework of quantum theory.
14 The Inverted Oscillator
We will now look at what the above notions have to say about the classical limit for an inverted harmonic oscillator. The Hamiltonian we wish to consider is, for real and constant
[TABLE]
The classical equation of motion is easily solved:
[TABLE]
for and to be determined by the initial conditions.
The quantum system is a lot harder to solve. The normalisable121212There are also wavefunctions for this system that are not normalisable, see eq. (354). The conclusions in this section will to large extent also be applicable to them, but for clarity we will only present the treatment for the normalisable solutions. solutions are indexed by the nonnegative integers and are given by
[TABLE]
where is a solution to the classical equations of motion that satisfies a Wronskian constraint131313This Wronskian constraint makes sure that Im() is never zero and that the solutions are normalisable.. For the derivation of these states, see appendix A.
For shorter notation, we will denote and and similar for other quantities. We will take the initial conditions for to be141414These initial conditions are consistent with the Wronskian condition 346. In addition, these conditions correspond to taking the initial relations between creation and annihilation operators in the standard way.
[TABLE]
These initial conditions determine
[TABLE]
Uncertainty
Using the time independent annihilation and creation operators defined in eq. (347), we can find the following expressions for and
[TABLE]
This allows us to compute the time evolution of the uncertainties
[TABLE]
It is very clear from these expressions that, for a generic state, the uncertainties in both variables will rise exponentially! There is no stopping the spreading of uncertainties, not even for really big. Only the limit of very small will slow down the spreading of the wavefunction, but we are obviously treating a free particle in that limit. The conclusion is that, for a generic state, there is no hope of maintaining small uncertainties. One can expect the classical limit to be less evident in this case.
Squeezing
One can look at this problem using different variables. We define, as in [2],
[TABLE]
Note that , and we can regard these variables as canonical ones. For these variables, the time evolution is given by
[TABLE]
And we can easily compute the time evolution of the uncertainties in and
[TABLE]
The situation is thus drastically different. While the uncertainty in one variable rises exponentially, the uncertainty for the other variable vanishes. If we start from an initial minimum uncertainty state, Heisenbergs uncertainty relations are obeyed as an equality for all time.
This phenomenon is called squeezing of the state. It get its name from the following classical picture. One can imagine an ellipse in phase space, corresponding to some spread in and . The dimensions of the ellipse in both the and direction will typically grow. However, time evolution will preserve the area of the ellipse, through Liouville’s theorem. Thus, if we take and as classical quantities in the direction of the minor and major axis of the ellipse, we get that the product of the spread in and is equal to the area of the ellipse, and thus constant in time. This process is pictured in figure 4.
Quantum mechanically, the squeezing process results in one variable with negligible uncertainty. Thus, for large time , a measurement of the observable will result in the measured value with a probability very close to . Within standard quantum theory, it is thus reasonable to think of as having achieved the classical limit. On has to note that, for very large , the conjugate variable will have a very large uncertainty. In a way, becomes very ‘quantum’: a measurement of is utterly unpredictable.
On a final note, we repeat that the uncertainties in and also rise exponentially. While we can speak about classical behavior for , we certainly cannot for the variables and . Following this reasoning, it is important to clearly specify what the relevant quantity is.
WKB approximation
We will look for WKB solutions for the inverted oscillator. More exactly we are looking for approximate energy eigenstates of energy . We look at the de Broglie wavelength of this problem
[TABLE]
We know that the WKB solutions will satisfy a classical Hamilton Jacobi equation if the variation of is very smooth. For example, the first derivative of is
[TABLE]
This derivative is certainly small for all if . Thus for energies higher than the ground state energy, we can find approximate solutions to the Schrödinger equation.
The WKB approximation leads to the following wavefunctions
[TABLE]
Note that these functions are not normalisable. Thus, to represent normalisable physical states, we will need to make linear superpositions. However, as noted above, linear superpositions of WKB states are no longer WKB states. Classicality in the WKB sense is thus only achievable for the non-normalisable WKB energy eigenstates. We repeat that WKB wavefunctions do not truly achieve the classical limit.
Wigner Function
The Wigner function for a -particle state can be calculated from eq. (129). It is given by
[TABLE]
For this reduces to
[TABLE]
Thus, for , our semi-probability distribution is positive everywhere. We are thus free to interpret as a real probability function. In addition, the Wigner function is fairly located around the classical trajectories . In fact, as and thus , the probability distribution becomes infinitely well located.
[TABLE]
However, for the Wigner function is in general not positive definite anymore due to the oscillatory nature of the Hermite polynomials. For example, for , we get
[TABLE]
which is not positive for every . Because of this reason, we cannot interpret the Wigner function as a probability density. We can also look at the Wigner function in terms of the classical squeezed coordinates and . For it becomes
[TABLE]
Thus, for , we will retain a function for , but the probability distribution will be uniform in . Thus, the Wigner kind of classicality is only achieved for and certainly not for . This reflects the behavior of the quantum uncertainties, eq. 145.
In conclusion, the Wigner function type of classicality will only be truly reached for the ground state, while for states with higher particle number (and in general, superpositions of different particle number states) the Wigner function is not positive definite, even though one could argue that regions where are ‘small’. We repeat that the Wigner classicality is not true classicality, since the properties of the Wigner function do not translate to any statement of measurements of and .
15 The Role of Plancks Constant, Wavelength and Mass
Let us conclude by clarifying the role of various relevant constants. We will first look at the reduced Plancks constant, . It is common habit for basic quantum mechanics textbooks to handwavingly invoke the classical limit as being the limit of . A typical example could include the commutator of two conjugate variables. By letting , one argues that this commutator will vanish and that the operators will behave ‘classically’, meaning commutatively. Another consequence of this limit would be that lower limit for the Heisenberg uncertainty principle would vanish.
Clearly, these arguments do not make sense. First of all, is not a dimensionless quantity. While in SI units it can be thought of as small, this is certainly not the case other units. Secondly, it is not clear what the notion of ‘wavefunction’ becomes in such a limit since the Schrödinger equation itself does not make sense any more. Thirdly, what exactly is the meaning of varying one of natures constants?
A more correct formulation of this argument would involve comparing to the action relevant for the problem under consideration[4]. For a particle moving in a potential , the relevant length scale would be the length of variation of the potential. For such a particle the relevant dimensionless constant would be
[TABLE]
where is the typical time scale on which we consider our system.
For the free particle, is very large, and thus we can regard as being ‘small’ for time scales that are not very large. This is reflected in the time behavior of the uncertainties calculated in section 13 and the fact that the Wigner function is positive definite and peaks on classical trajectories.
For the inverted oscillator, the scale of variation for our potential is not very clear. [3] argues that any type of harmonic oscillator is slowly varying, meaning that we can think of as being very big.
These considerations are of course very much related to the WKB approximation. In the expansion of the wavefunction as , we immediately recognize that has the dimensions of an action. Assuming that the expansion in eq. (118) is accurate to first order amounts to demanding for the second order correction (dimensions of an inverse action) that is large compared to .
The de Broglie wavelength also enters this discussion in a natural way. It can be regarded as the length scale of the quantum mechanical problem, while can be thought of as the classical length scale of our problem. It is then natural to compare and and think of the classical limit as being achieved when vanishes in comparison with . This is, again, very much connected to the WKB approximation. In fact, eq. (123), can be rewritten as
[TABLE]
If we assume that is related to the length scale of the potential, then the WKB condition reduces to comparing with .
The role of mass is suprisingly intuitive in this discussion. The classical actions of both the free particle and the inverted oscillator are proportional to . In the context of this discussion, larger will increase the ratio . Our intuition that really massive behave classical can thus easily be reconciled with this picture.
Chapter 5 Squeezing of Cosmological Perturbations
In this chapter we will extend the theory of squeezing to the Mukhanov-Sasaki variable. The concept was already briefly introduced in chapter 4 for the inverted harmonic oscillator. We will give an exposition of the theory of squeezing based principally on [45], followed by a summary of the principal arguments presented in the literature concerning classicality. This chapter will finish by arguing why the theory of squeezing is not sufficient to explain classicality for the cosmological perturbations.
16 Theory of Squeezing
Let us fix a certain Fourier mode of the Muhanov-Sasaki variable of wavevector . Analogous to our calculation for the inverted oscillator in section 14, we can find the operators and in terms of their initial conditions
[TABLE]
Using these equalities, we can find the uncertainties in the variables and
[TABLE]
where we have omitted terms involving , thus specialising to symmetric or antisymmetric wavefunctions. These uncertainties will spread for generic states of the Mukhanov-Sasaki variable, since and blow up. In addition, there is no limit in which these uncertainties stay bounded. Also analogous to the inverted oscillator, we can do better.
We define functions and by
[TABLE]
Or the other way around
[TABLE]
The Wronskian condition, eq. (77) and the initial conditions on then imply
[TABLE]
This means that there exists some functions and so that we can write
[TABLE]
One usually calls the squeezing parameter, the phase angle and the squeezing angle. We can now define two new variables and by
[TABLE]
Notice that they are canonically conjugate, since . Now one can check that
[TABLE]
As for the inverted harmonic oscillator we recover two conjugate variables that satisfy the uncertainty principle as an equality when the wavefunction is initially a minimum uncertainty wavepacket. As increases, the wavefunction grows more narrow in while it grows broader in . We retrieve the results of the harmonic oscillator of when and .
Classically, this corresponds to a volume in phase-space that gets ‘squeezed’ into an ellipse. The surface of the ellipse stays constant in time, but the ratio of the length of the major to length of the semi-major axis grows in time as . The difference with the inverted oscillator is that the transformation in eq. (164) is time-dependent. The phase-space ellipse thus rotates in time as the squeezing angle varies.
17 Classicality from Squeezing
Usually, one starts the argument by remarking that the phase of becomes constant as , as proven in appendix D. In terms of the functions and this means that,
[TABLE]
A typical value for at the end of inflation for the modes we observe in the sky would be [37], [13]. This would correspond to a very squeezed state, meaning that the uncertainty in would be a factor larger than the spread in . A measurement of would result in with a probability extremely close to while a measurement of would be utterly unpredictable.
One proceeds by noting that we can make real in the late times limit by a time-independent complex rotation [30]. The argument is continued by noting that the Heisenberg picture equations, eqs. (156), reduce to
[TABLE]
since vanishes in the late times limit. Since the right hand sides of these equations do not mention , the right hand sides commute. One concludes then that
[TABLE]
This is suggestively called the ‘pragmatical view’ in [12]. Several authors conclude that the classical limit is reached for late times [2, 30, 31, 45, 27, 26, 24, 25, 13] among others.
As extra arguments, one typically first shows that for late times the Wigner distribution of the Mukhanov-Sasaki variable becomes peaked along classical trajectories by direct computation, see e.g. [12] for the vacuum and [30] for generalisation to some other states. Second, one argues that the modes become indistinguishable from a ‘classical stochastic field’. With this, it is meant that one has at late times
[TABLE]
where the subscript ’cl’ stands for classical and [30]
‘[…] is a time-independent Gaussian stochastic function of k with zero average and unit dispersion […]’
and thus (from the same paper)
‘for each realization of the stochastic field , respectively is the classical amplitude, respectively momentum, for the corresponding initial conditions […]’
Here we have changed the names of the variables in the citations to ensure compatibility with the rest of this thesis.
18 Classicality?
Let us now carefully review the arguments in the previous section. A first remark can be made surrounding eqs. (167). What does it mean when we throw away the terms proportional to ? How do we compare the terms in eqs. (156)? This is far from clear, since we are comparing operators.
This problem might seem academical, but it is crucial: eq. (169) is wrong. One can check by direct computation starting from eqs. (156) that
[TABLE]
as it should be. Even if one could somehow ‘compare’ different operator terms and ‘throw away’ some terms as done in eq. (167), it is clear that any such approximation must become invalid when computing the commutator. In [2], the author tries to explain the approximate commutativity in a more careful way in terms of expectation values instead in terms of operators. Nevertheless,
[TABLE]
for any possible state.
On the subject of the Wigner function, we will repeat what was concluded in chapter 4: namely that properties of the Wigner function do not imply anything about localisation of wave packets, and thus nothing on the classical limit of quantum mechanics.
Concerning the ‘classical stochastic field’ argument; this is very reminiscent of a measurement. The stochastic variable is exactly what would arise if we would perform a measurement of . As was already previously discussed in section 12, the idea of measurement in this cosmological context is untenable. The implicitly assumed measurement is even more apparent in the following quote from [45] (emphasis mine)
‘As a result, after some realization of the stochastic amplitude of the field mode has occured, further evolution of the mode is deterministic and is not affected by quantum noise.’
We assume that the author means by ‘realization’ that a measurement has been performed and that the wavefunction has collapsed to some (approximate) eigenstate of . If not, it is very unclear what we should understand by the concept of ‘realization’.
As a final note, we remark that the ‘squeezing’, represented by eqs. (165), is never really used in the classicality arguments. As Sudarsky puts it in his criticism [50] of a particular paper (namely [12]) on the squeezing formalism:
‘But having a squeezed state has no effect on the quantum field and momentum operators which do, of course, continue to satisfy the Heisenberg uncertainties. The manuscript seems to ascribe a fundamental value to the squeezing of the state […]’
The uncertainties in eqs. (165) do not in any way present any conclusion on the uncertainties of the physical variables and . In fact, it can easily be seen from eqs. (158) that both and strictly grow in time with for generic states.
We conclude this chapter by stating that the squeezing formalism, as presented here, is not sufficient to explain the classicality of the modes in the CMB. In fact care should be taken with several accounts of this formalism as presented in the literature as certainly not all claims are founded, with the explicit example of claims similar to the one in eq. (169).
Chapter 6 Decoherence for Cosmological Perturbations
In this chapter we will explore the concept of decoherence, first for the inverted oscillator and later on for the Mukhanov-Sasaki variable. It is an often invoked explanation for the classical appearance of the inhomogeneities in the CMB [5, 11, 27, 29, 38]. While several authors seem to agree on the general idea, we will see that there is still substantial discussion on the specifics of the decoherence process.
19 Classicality from Decoherence
To start off, we will present biefly the principal aspects of the theory of decoherence. It will largely be based on [59] and the introduction of [38].
System and Environment
Within the framework of decoherence, the problematic aspects of the quantum-to-classical transition are thought of as artefacts from the idealised notion of an ‘isolated system’. For a realistic experiment, one has to take into account (small) interactions of the system with its environment. The term environment typically refers to something the observer is not interested in or has no control over. We will then need some way of reducing the degrees of freedom of the system and environment to just the degrees of the system.
Let us be more precise, and specify what we are talking about. Suppose we have some Hamiltonian for our system , a Hamiltonian for our environment and an interaction Hamiltonian . The total Hamiltonian, acting on the product of the system Hilbert space and the environment Hilbert space becomes
[TABLE]
Let us further now suppose that the system and environment start out in a state that is factorisable
[TABLE]
Without the interaction Hamiltonian the time evolution of this state would guarantee it to stay factorisable for all times. However, the interaction Hamiltonian will in general act to set up entanglement. The time evolution of our state is
[TABLE]
This state is in general not factorisable any more: it is in an entangled state with the environment. We now suppose that the states are an orthogonal basis in the Hilbert space of our system and compute at time the expectation value of some hermitian operator according to the Born rule.
[TABLE]
Let us now assume that the states are approximately orthonormal. We can then safely neglect the second term, and we write
[TABLE]
It is in this equation that we will look for the classical limit. Suppose we look at the system without coupling to the environment and then compute the expectation value of .
[TABLE]
The difference is apparent: without the environmental interaction the expectation value contains terms that couple different basis states and . They represent interference between different parts of the wavefunction. Such interference terms are absent in eq. (179). In fact, eq. (179) is a pure summation of probabilities. This is reminiscent of classical mechanics, where interference is never present and probabilities behave additive.
Density matrix
Let us now shift our focus to the density matrix associated with our system state . On this level, it is easy to implement the fact that an observer is not interested in the environment. The best way to ignore the degrees of freedom associated with the environment is to trace them out and the result is called the reduced density matrix of the system .
[TABLE]
An important thing to note is that we have changed a pure state into a state that is generally impure by tracing out the environment. The Von Neumann entropy for will be greater than the entropy of our total density matrix . Physically this corresponds to the loss of information we suffered by ignoring the environment. This reflects the fact that is impossible (or at least very hard) to undo the entanglement of our system with the environment.
We will suppose now, as before, that the states are approximately orthogonal. The reduced density matrix becomes
[TABLE]
One sees that the reduced density matrix is approximately diagonal in the basis of the . The basis for which this (approximate) diagonalisation occurs is often called the pointer basis. The condition of (approximate) orthonormality on the environment states is usually called the einselection condition. The reduced density matrix now reminds us of a classical ensemble. The classical analogue would be an ensemble of many different systems, of which a fraction would be in the state .
This similarity with a classical ensemble is only formal, since describes a single system, while a classical ensemble describes several systems. In addition, a measurement will still return values according to the Born rule. We will return to this problems in section 23.
20 Decoherence of the Inverted Oscillator
We will first apply the concept of decoherence to our toy model of the inverted harmonic oscillator, as presented in chapter 4, to get some feeling for the concepts that were discussed above.
We suppose our system is constituted of an inverted harmonic oscillator with Hamiltonian for real and constant ,
[TABLE]
As environment we will take a normal harmonic oscillator, governed by the following Hamiltonian
[TABLE]
with real. Let us take a simple form of interaction between the system and the environment, with (small) interaction parameter . The total Hamiltonian we wish to study is then
[TABLE]
One can then check that the following wavefunction satisfies the Schrödinger equation [28]
[TABLE]
The notation will shortly be explained, while and are solutions to the following equations
[TABLE]
We take solutions to these equations, obeying the following Wronskian condition, for both and ,
[TABLE]
It is here that lies the difference with [28]. Their solutions for do not satisfy this condition, making real for . Thus, their starting wavefunction is not normalisable for the noninteracting case.
We now consider the density matrix of this system
[TABLE]
We are now interested in the reduced density matrix of the system. Suppose we don’t know anything about the harmonic oscillator, the environment, and we want to eliminate our knowledge about it. Therefore, we will trace over the environment degrees of freedom.
[TABLE]
Using the Wronskian condition, we can rewrite this as
[TABLE]
In these coordinates, the diagonal elements of the density matrix are given by . If is approximately diagonal, then would be very large. In fact, the ratio of over can be taken as a measure of the coherence of the density matrix. As this quantity becomes smaller, quantum coherence is lost[28]. Straightforward calculation gives us
[TABLE]
Intuitively, what we expect for ‘small’ is this: will be approximately constant, while will grow roughly exponentially. If doesn’t vary much in time, we see that quantum coherence is lost.
In fact, this is precisely what will happen. We choose concrete realisations of and to be
[TABLE]
The modified frequencies are no longer time independent. To keep the problem analytically tractable, we only look for solutions as a power series in and .
[TABLE]
To find to second order in we only need to first order in
[TABLE]
Our measure of quantum coherence becomes, to lowest non-vanishing order in ,
[TABLE]
where we only retained the lowest non vanishing order in for the last equation.
As long as , we see that the density matrix very rapidly becomes approximately diagonal in the position basis. Quantum coherence is thus lost extremely rapidly on a time scale of , while the relevant pointer basis is the position basis.
We repeat that our result is less beautiful than the result obtained in [28]. This is due to the fact that we impose a Wronskian constraint on our solutions for and . Our wavefunctions are normalisable for all , while the wavefunctions in [28] are non-normalisable for .
21 System and Environment for Cosmological Perturbations
The theory of the scalar perturbations of the Einstein equations, as explained in chapter 2, does not incorporate any decoherence in itself. The Fourier modes of the Mukhanov-Sasaki variable constitute a closed system. This is a feature of the second order expansion we made of the full action of our theory in eq. (66). Expanding the action to third order in the perturbations will yield small corrections in the form of interaction terms: Different modes will no longer evolve independently.
There is now an important distinction to be made. Our experiments that measure the sky have a finite angular resolution. For example, WMAP achieves a resolution often quoted as being of order . This thus limits our ability to discern modes with very short physical wavelengths, or equivalently, modes with very high . The modes we observe have all exited the Hubble radius during inflation quite early, see section 10. We do not observe very high energy modes that only exited the Hubble radius very late (or not at all) during inflation. Since there are no observational results for these short-wavelength modes we are free to ignore their degrees of freedom. A natural choice of system and environment would thus be the long-wavelength modes and short-wavelength modes, respectively.
For the rest of this section, we will restrict our attention to perfect de Sitter inflation. The relevant length scale is of course the Hubble radius . A natural cut-off at any given time between the long and short wavelength modes would be the Hubble radius. The idea is thus that modes with decohere the observed modes of . One thus introduces a cut-off wavelength, which separates system and environment.
Of course, there are other possibilities for the choice of decohering environment. For instance, one could imagine the inflaton field as being weakly coupled to the photons present at the onset of inflation. However, the interaction between different Fourier modes of the Mukhanov-Sasaki variable will be enough to achieve decoherence. It is thus not necessary to introduce extra interactions for the sake of decoherence.
It is necessary to think some more about our choice of cut-off. After all, an observer choosing a different cut-off would have a different definition of system and environment, thus seeing less or more decoherence than the observer that chose as the cut-off wavelength. This is natural, since someone choosing a cut-off wavelength of [math] would not see any decoherence at all, because he is keeping track of all available degrees of freedom. The choice of as the cut-off is however the minimal one, as is argued in[38]. The essential idea is that the Hubble radius represents an event horizon in perfect de Sitter spacetime.
22 Points of Dispute
The ideas outlined above are generally agreed upon by the community. There are however two important aspects where the community is in disagreement. Firstly, there are two competing views of the period where the decoherence should occur. Secondly, there is still discussion of which basis is the relevant pointer basis. We will briefly discuss for both problems the dominating views.
When Does Decoherence Occur?
In [38, 26, 5], among others, it is argued that decoherence occurs during the inflationary period. Decoherence for a mode of wavevector is claimed to start quickly after Hubble exit, when . At that moment, changes sign, while the short wavelength modes essentially behave as a harmonic oscillator. For example, in [12] the evolution of the density matrix is treated by assuming a master equation of Lindblad form. The resulting decoherence is exponential on timescales , where is a time scale characteristic for the considered interaction.
The contrasting vision is the one in [11]. It is argued that decoherence is produced rapidly when the modes under investigation re-enter the Hubble radius, during the heating epoch. The short wavelength modes are then already sub-Hubble and are assumed to essentially be a thermic bath. It is calculated that, for interactions of the Mukhanov-Sasaki variable of order 3, the decoherence in this epoch is also rapid, while the decoherence time is again determined by the ratio of to some time scale set of the interaction.
In [11], it is argued why the other view, decoherence during inflation, should be treated carefully. Very short wavelength modes behave essentially as they would in Minkowski space. For other experiments, unrelated to inflation, there would in general be higher-energy modes present that are excluded from observation by experimental limits. It is then concluded that such high energy modes would always decohere the long wavelength modes that are under study. If we assume this decoherence process, why do we observe any quantum coherence at all in diverse experiments?
This argumentation is not really founded however, since the analogy with Minkowski space time is stretched beyond its limit. For experiments in Minkowski spacetime, there is no such thing as modes that exit the Hubble radius. The long wavelength modes in our imaginary experiment also evolve in Minkowski spacetime, while during inflation we have a coupling of sub Hubble (Minkowski) modes and super Hubble (very much non-Minkowski) modes.
Pointer Basis
Another point of discussion is the relevant pointer basis. In [26], [25], [5], [11] and many others, the relevant pointer is argued to be the field amplitude basis151515To be entirely correct, the relevant pointer basis is a basis of very narrow Gaussians around some central field amplitudes. This makes the pointer basis normalisable.. This corresponds to the position basis for the inverted harmonic oscillator. The primary argument seems to be that the observable ‘approximately commutes’ with the Hamiltonian and is thus rather robust under time evolution. This type of argument is very reminiscent of the type of arguments considered in section 17. The system Hamiltonian was given by
[TABLE]
As discussed in chapter 5, there is no limit in which and commute (even approximately), and thus does not commute with the Hamiltonian. While this does not exclude the field amplitude basis as pointer basis, it illustrates that perhaps further motivation is necessary.
Several other pointer bases have been suggested. For example, in [14] it is suggested that the pointer basis is the basis of the coherent states, the eigenstates of the annihilation operator. The argument relies on the harmonic oscillator case. For a standard harmonic oscillator prepared in a coherent state, the expected value of both position and momentum obey the classical equations of motion. In addition, the coherent states are the states of the harmonic oscillator that entangle the least with an environment, therefore providing relatively robust under time evolution. One can doubt this argument, as [26] does, since the behavior of the inverted oscillator is fundamentally different from the standard oscillator.
23 Classicality?
We conclude this chapter by closely looking at what type of classicality decoherence actually achieves. What we argued in the beginning of this chapter can be loosely described as a ‘reduction of quantum interference’ for the selected states of the pointer basis. This represents a step forward in achieving the classical limit, but it is not enough.
The description of the system is still using a wavefunction (or equivalently, a density matrix). Measurements will still give variable outcomes, according to the Born rule. Once measured, decoherence explains why the modes behave classically. This is best explained in terms of the paradox of Schrödinger’s cat. The system here is the two level system comprised of cat dead and cat alive. Decoherence with the remaining degrees of freedom of the cat will create a reduced density matrix that is diagonal.
[TABLE]
The cat is no longer in a superposition of ‘dead’ and ‘alive’ and the density matrix formally resembles a classical ensemble. This is only formal resemblance, since we still need a measurement to make the cat look either dead or alive to us.
The situation for cosmological perturbations is analogous. Decoherence, in for example the field configuration basis, will still need a measurement to select a field configuration for us to observe.161616In addition, this (approximate) field configuration basis is very spread out in momentum space. For the classical limit, we need both a well determined position and a well determined momentum. We conclude that one needs a measurement to explain the observed classicality in the night sky. But, as explained in section 12, it is very unclear what constitutes a measurement in the cosmological constant.
One can consider theories which combine decoherence and an effective collapse theory. For such theories, there is no need for a measurement, but this comes at the price of introducing some kind of collapse mechanism. Examples include, but are certainly not limited to [42] and [43].
Decoherence alone also does not explain why the initial symmetric state became inhomogeneous. All the modes are assumed to be in a symmetric state, and tracing over some of them will not result in a reduced density matrix that is no longer symmetric. For example, note that the interaction considered in section 20 is not symmetric in . Despite this fact the reduced density matrix, eq. (195), will still be symmetric under the change of .
Kiefer, in [26], argues that this situation is analogous to symmetry breaking. The set of minima for a certain potential is symmetric, but an individual minimum is no longer symmetric. It is then argued that for cosmological perturbations the same thing happens: the state is symmetric, but then an individual realisation is chosen that breaks the symmetry. Note first that this is exactly the procedure of a measurement. Second, in [50] it is argued that any classical realisation of spontaneous symmetry breaking, like a ferromagnet being cooled below critical temperature, involves states that are not exactly symmetric. A ferromagnet prepared in exactly zero magnetisation will conserve zero magnetisation, whatever the temperature. This process is however very much subject to perturbations, the slightest change in magnetisation will result in the magnet developing a non zero magnetisation. For the Mukhanov-Sasaki modes, there is no such initial perturbation if all modes start in the Bunch-Davies vacuum. Thirdly, for symmetry breaking in quantum field theory, the theory is equivalent for every choice of symmetry breaking vacuum. For the Mukhanov-Sasaki variable, different symmetry breaking configurations are clearly not equivalent.
As conclusion, we cite from [47] why decoherence is not sufficient to explain the classical limit:
‘…decoherence cannot solve the problem of definite outcomes in quantum measurement: we are still left with a multitude of albeit individually well-localized quasi-classical components of the wave function, and we need to supplement or otherwise to interpret this situation in order to explain why and how single outcomes are perceived.’
Part III The Pilot-Wave Case
Chapter 7 Pilot-wave Theory
We will use this chapter to introduce Bohmian mechanics, or pilot-wave theory. It is an alternative interpretation of quantum mechanics, that postulates well-defined particles with well-defined trajectories. While it is a deterministic theory, the predictions it makes are equivalent to those made by standard quantum mechanics under the condition of quantum equilibrium.
First, we will explain the non-relativistic theory. It then is possible to generalize pilot-wave theory to quantum field theory: we will briefly present this theory for spin 0 fields. The possibilities that pilot-wave theory opens up in the field of quantum cosmology will also briefly be discussed. Then we will present the classical limit within Bohmian mechanics, thereby giving a strict formulation and touching upon a different characterisation and an important conjecture that is present in the literature. In conclusion, we will turn to the concrete examples of chapter 4, the free particle and the inverted oscillator and develop the pilot-wave theory for them.
24 General Theory
In classical mechanics a system of one particle is fully specified by the couple , where is some generalised coordinate and is the momentum canonically conjugate to . In Bohmian mechanics, the role of this couple is fulfilled by the couple , where is the wavefunction of the system.
The wavefunction is the same as the one used in standard quantum mechanics. It follows Schrödinger time evolution
[TABLE]
We can now, as we did with the WKB approximation, split the wavefunction as follows
[TABLE]
where and are both real and is positive. As before, the Schrödinger equation now decouples into a real and imaginary part
[TABLE]
Let us now postulate an actual particle with actual velocity given by the guidance equation
[TABLE]
Eq. (213) can then be seen as a continuity equation for an ensemble of postulated particles with density and velocity field given by the guidance equation eq. (214). Using the postulated particles interpretation, eq. (212) formally looks like the classical Hamilton-Jacobi equation171717This likeness is only formal. Note that the classical Hamilton-Jacobi function is a function of various constants of motion, while the here is a function of and .. We think of it as describing a particle moving under the influence of a potential and an extra quantum potential term
[TABLE]
The obvious interpretation is that the pilot-wave particle moves under the influence of two forces, a classical one and a quantum one
[TABLE]
Quantum Equilibrium
Let us now demonstrate the equivalence between pilot-wave theory and standard quantum theory, under some condition. Let us go back to the continuity equation (213). This is in fact a second continuity equation in quantum mechanics, since the Schrödinger equation already implies the following equation
[TABLE]
which is nothing but the conservation of probability within the framework of the Copenhagen interpretation. We see that both and obey the same laws of motion. We conclude that, if at some time , that and will be equal for all time. This is called the equivariance of and . In general it is assumed that , and this assumption goes by the name of quantum equilibrium. Under the assumption of quantum equilibrium, the predictions of Bohmian mechanics match those of standard quantum mechanics. To show this explicitly, consider a large amount of systems prepared in the same state . The expected value of some operator position dependent operator in de Broglie-Bohm theory will then be
[TABLE]
where the right hand side is the expected value in standard quantum mechanics. This effectively demonstrates the equivalence of Bohmian mechanics in the equilibrium regime with standard quantum mechanics, since quantum mechanics only makes statements about large numbers of systems and most measurements only make predictions in terms of particle positions, like e.g. pointers of measurement apparati. There are however some situations, involving measuring time dependent quantities, where pilot-wave theory makes unambigous predictions while there is discussion on the Copenhagen interpretation of those measurements, as is for example mentioned in [49] and [3].
Measurement and Collapse of the Wavefunction
Collapse of the wavefunction has a natural explanation within the framework of pilot-wave theory. What one typically measures in an experiment is the position of some particle, possibly the position of some pointer of some measurement apparatus. The result of such a measurement is then the exact position of the pilot-wave particle. The collapse of the wavefunction has an effective interpretation. Consider some wavefunction that consists of two wavepackets with virtually no overlap in position space
[TABLE]
Since a pilot-wave particle has a definitive position in space, we can think of it as being located inside the support of . In that region, it is clear that . In computing the the trajectory of the particle, we can safely ignore the presence of in eq. (214). In this way, our description of the particle now makes use of the collapsed wavefunction . Note however, that this collapse only takes the form of a useful approximation in pilot-wave theory and that it is in no way fundamental or necessary, in contrast to the case in standard quantum mechanics.
Non-Locality
It is obvious that pilot-wave theory is a hidden-variables theory. As such it is necessarily non-local, as implied by Bells inequalities. To see this explicitly, consider the two-body generalisation of the previous paragraphs. We have a wavefunction . The guidance equation for particle will depend on the instantaneous position of particle . Since this instantaneous connection is always present, no matter the distance between both particles, the theory is manifestly non-local. This is a property that will carry over to the relativistic field theory. It can be shown, see [56], that this non-locality can not be used to send faster-than-light signals under the assumption of quantum equilibrium. This places the non-locality of pilot-wave theory on the same level as the non-locality implied by entanglement in standard quantum mechanics.
25 Quantum Field Theory
While the pilot-wave theory presented in section 24 can be generalised to particles of any spin, it is found that the interpretation in terms of particles cannot be held in a relativistic version of pilot-wave theory. In analogy with the extension of quantum theory to include special relativity by transferring from particles to fields, we will construct a pilot-wave theory for a field of spin [math]. It is possible to generalise this formalism to all bosonic fields. For spin one cán maintain the particle interpretation: the pilot-wave theory for fermionic fields is fundamentally different. It will however not be treated here. For notes on the subject see e.g. [49] or [20]. Our (very brief) treatment will be based on this last reference.
We will consider some free scalar field . The relevant classical Lagrangian density is
[TABLE]
Moving to the Hamiltonian formalism we get
[TABLE]
We will then quantise this Hamiltonian by imposing equal time commutation relations. In the Schrödinger picture we replace with the functional derivative . The Hamiltonian then becomes an operator that acts on a wavefunction , which is now a functional of the field configuration . We then get the functional Schrödinger equation
[TABLE]
It is now possible to make a pilot-wave theory for the field configuration , meaning that we can formulate a guidance equation and corresponding definite trajectories for the variable . For our purposes it will be simpler to formulate the theory in terms of the Fourier components of the field. We define the Fourier transformation as follows
[TABLE]
The functional Schrödinger equation becomes
[TABLE]
Assuming that is a product state, this equation decouples into a Schrödinger equation for all
[TABLE]
where stands for the factor of the total wavefunction associated with the variable . As in the non-relativistic case, we can write . We can then postulate that the Fourier modes follow actual trajectories, governed by the guidance equations
[TABLE]
We also have an analogue for eq. (212),
[TABLE]
and we recognize the classical Hamilton-Jacobi equation with an extra term that we identify as the quantum potential. We will use this framework (in terms of the real and imaginary parts of the Fourier modes) to construct the pilot-wave theory for the Mukhanov-Sasaki variable in chapter 9.
We conclude this brief section by remarking that this theory is not Lorentz invariant and that there exists a preferred inertial frame. For explicit argumentation see [20]. This non-covariance is however not relevant, since under the condition quantum equilibrium this preferred frame can not be detected[55].
26 Pilot-wave Cosmology
Pilot-wave theory opens up a whole new realm of possibilities in the context of cosmology. While we will apply it in the context of the primordial perturbations in chapter 8, we will briefly mention some attempts at devising a complete quantum theory of the early universe.
In chapter 2 we only quantised the perturbations, in the form of the Mukhanov-Sasaki variable, while we took the classical solutions for the background variables and . In this light, the theory we constructed in section 8 can be considered to be semi-classical at best. In fact, we repeatedly used the classical equations of motion for the background variables in deriving the classical action for the Mukhanov-Sasaki variable, as was indicated explicitly in section 6. However, it can be shown [17] that this action can be obtained without ever invoking the background equations181818At least for an inflaton field that is not subject to any slowroll potential ., through an intricate scheme of canonical transformations before quantisation and unitary transformations after quantisation. In this way, we can be fairly certain that several of the features of the theory discussed in this thesis will carry over to a fully quantised theory.
Bouncing Cosmologies
However, as noted in [46], a full quantisation can entail some very striking features and solve some problems that are not addressed by the theory of inflation. If one quantises the background variables, the result is wavefunctions for and . These can then be easily interpreted by pilot-wave theory. Starting from some initial condition and one gets definite trajectories, meaning a prescription and .
In [46] these trajectories are explicitly calculated for models that exhibit a ‘bounce’. This bounce means that the universe is supposed to be eternal and that the scale factor in turn grows and decays with time. Bounces occur when the scale factor stops shrinking and begins to grow. It is noted that such a scenario eliminates every need for an initial singularity (a Big Bang). In addition, for some values of relevant parameters it can be shown that no physically relevant wavelength ever reaches Planck scales, thereby increasing our trust in the predictions of the model at the moment of bounces. In fact, such bouncing models do in general not need inflation to produce the observed homogeneity and the power spectrum of the perturbations. For the homogeneity case, it is enough to observe that one can evolve the critical density in eq. (11) towards one by a period of accelerated expansion or, equivalently, a period of decelerated contraction. For the inhomogeneities, one can show that the same equations of motion for the inhomogeneities hold during the bounce as the ones during inflation[46]. Bouncing cosmologies thus don’t need to introduce a hypothetical inflaton field to account for observations.
Inflationary Models
However, the theory also allows for models without initial singularity but with an inflaton field. Consider for example the cosmologies studied in [18]. These models include a universe that has been expanding forever, starting from a universe with a finite non-zero size in the infinite past. It is even shown, that there are possible trajectories for that start with an accelerated expansion and can be smoothly joined to a period of radiation dominated expansion. These trajectories can thus be smoothly linked to the classical theory of our universe, starting at the epoch of radiation domination.
27 The Classical Limit
The emergence of classicality from quantum theories is easier to discuss in the pilot-wave context. The reason for this is clear. While standard quantum mechanics has to construct ways to regain particles with well-defined positions and momenta, de Broglie-Bohm theory postulates these features. We have now a theory which unambiguously speaks about definite particles (or field configurations) for which the time evolution can be exactly computed.
Of course, this does not mean that we always trivially have a classical limit. The theory sketched above clearly allows for trajectories that are far from classical solutions to the equations of motion. It is the great advantage of pilot-wave theory that we can exactly see where any non-classical behaviour originates, namely in the quantum potential term in eq. (212). It is easy to see that if this quantum potential is negligible compared to the kinetic energy or the classical potential of the system, that quantum effects can be ignored.
Since this quantum potential is dependent on the specific state the system is in, it is very non-trivial to classify the states for which the quantum potential can be neglected either in comparison with the kinetic energy or in comparison with the classical potential. To tackle this issue, we can return to the treatment of the WKB approximation in section 13, where the quantum potential was first introduced. The pilot-wave guidance equations for the WKB energy eigenfunctions in eq. (125) are
[TABLE]
Deriving this with respect to time we immediately get the classical equations of motion
[TABLE]
The WKB condition in eq. (123) is thus clearly a step in the right direction, since the WKB states give rise to the classical limit. But, as we remarked before, this condition is not sufficient to guarantee the classical limit. Reconsider for instance the superposition of different WKB solutions already given before, . The quantum force that corresponds to this state is given by
[TABLE]
which is exactly equal and opposite the classical force. We recover static particles, which is certainly not classical behavior in general, unless we also consider special initial conditions which give rise to classical static trajectories. We repeat that the problem of the WKB condition is that it is state independent, while the quantum potential is state dependent.
A Conjecture in the Literature
One can reach more by arguments similar to the ones given in section 15. In [4] and [3] it is argued that to obtain classical trajectories one needs to have that the de Broglie wavelength needs to be smaller than a scale associated with the classical potential. Specifically
[TABLE]
where the right hand side is taken to be indicative of the typical variation scale of the classical potential. For quadratic potentials, it is argued that this inequality always holds. This inequality is very reminiscent of the WKB condition, eq. (123), but a different scale of variation for the classical potential is taken to compare the de Broglie wavelength to. We imagine this condition by itself to be insufficient, since this will suffer from the same problem as the WKB condition: it will not generalise to superpositions of states. One can see this very easily in case of a quadratic potential: for an energy eigenstate of normal harmonic oscillator the condition is trivially satisfied, but the guidance equations yield static particles. Again, this is not classical movement, excepting the classical case when .
In [3] and [4] it is argued that this condition nees to be supplemented by a kind of scaling. We mean by this that an experimenter should fix a relevant length scale and a timescale , where is the typical variation scale of the classical potential and is taken to be the time a particle needs to ‘feel’ variation in the potential. These are defined by
[TABLE]
We should then look at the rescaled variables
[TABLE]
It is not unreasonable to look at these new variables as the physical ones, since in most relevant experiments we do not have access to the (typically small) length and time scales on which the quantum particles evolve. The relevant quantity that indicates the importance of a quantum treatment of the problem is given by
[TABLE]
Consider now the force acting on the macroscopic variables, due to the quantum force
[TABLE]
The conjecture in [3] can then be stated as
[TABLE]
The quantity is a random variable, since the initial conditions on are random variables with distribution given by the norm squared of the wavefunction in the case of quantum equilibrium.
What does this conjecture promise? It asserts that when the quantum scale is negligible in comparison with the classical scale , ‘almost all’ Bohmian trajectories will follow approximately classical trajectories. Here ‘almost all’ means that the set of initial conditions , which give rise to non-classical macroscopic trajectories, has a small probability measure, meaning that
[TABLE]
Let us return to our example of the harmonic oscillator. We saw that the motion was far fom classical for energy eigenstates, the Bohmian particles did not move. This is of course only classical motion when . The macroscopic trajectories are then given by
[TABLE]
The de Broglie wavelength is bounded from below for the harmonic oscillator, so we take the limit to mean that . We thus imagine to be very big, and we only see non-classical behaviour for the modes for which
[TABLE]
Since is very big, and the wavefunction is normalised, we expect that
[TABLE]
The scaling in eq. (233) is thus very essential. In the first paragraph we argued that classical limit arises when the trajectories become classical, meaning that the quantum potential is negligible compared to the classical potential (or kinetic energy). This conjecture however guarantees a less strict definition of classical limit, meaning that only the macroscopic trajectories achieve classical trajectories. Classical trajectories for the variable imply classical trajectories for , but the converse is certainly not true, as made clear by our example of the harmonic oscillator above.
In the rest of the thesis, we will not use this different characterisation of the classical limit. We will stick to our first definition for two reasons. The first one is very straightforward: the conjecture is not proven yet, and even ‘not precisely stated yet[4]’. Questions include the exact nature of the limit in eq. (238). We will not go deeper into that instead referring the reader to [3], appendix B, for a summary of possible ways to interpret this limit. The second reason is that we do not want the arguments for the classicality of primordial perturbations to depend on any sort of observer. Since these perturbations couple to classical density perturbations (eq. (105)) long before any observation by a human experimenter took place, we would like to understand this classicality independent from any human observer choosing a relevant length scale .
28 Examples
The Free Particle
Let us apply the formalism of pilot-wave theory to our first example of chapter 4, the quantum free particle. The energy eigenfunctions are the plane waves and the relevant normalisable states are the Gaussian wavepackets from eq. (110). The wavefunctions in position space can be obtained by inverting the Fourier transform. The algebra is quite involved, but one can integrate the guidance equations for these type of wavefunctions exactly[3].
[TABLE]
This reduces to the classical equations of motion on timescales for which holds that
[TABLE]
This is exactly the same criterion we found in section 13.
Inverted Harmonic Oscillator
We will now turn to the inverted oscillator. The relevant -particle wavefunctions are given by eq. (353)191919We will again restrict to the normalisable solutions of the Schrödinger equation.. For an -particle state we can explicitly integrate the Bohmian trajectories.
[TABLE]
For our choice of , see eq. (138), we see that at for late times . The trajectories thus become classical in the late time limit. The same will happen when we discuss the primordial perturbations.
Another way to see is to compute the quantum force. It is given by
[TABLE]
The classical force is given by and the ratio of both forces is then
[TABLE]
For our choice of this ratio will become exponentially smaller with time, so we can safely neglect the quantum force at large times.
Pilot-wave theory thus very clearly argues that the classical limit is reached for late times.
Chapter 8 Classicality for Cosmological Perturbations: Pilot-Wave Theory
In this chapter we will use the techniques developed in the previous chapter to formulate the pilot-wave theory for cosmological perturbations. We will first solve the Bohmian equations of motion for cosmological perturbations for the simplest cases: the -particle states. We will see that pilot-wave theory offers a very natural way of achieving the classical limit.
An attractive feature of the standard story, presented in chapter 3, was that the predictions for the power spectrum were roughly the same for initial states that didn’t differ too much from the Bunch-Davies vacuum. We will continue to show that for a large class of physically relevant initial states the Bohmian theory is also robust, meaning that the classical limit is always reached. This will be done by proving a theorem and then generalising it to more initial states. This will result in theorem 8.1 and 8.2. A final generalisation can be found in appendix E, but does not offer much more insight. In chapter 9 we will investigate exactly how this classicality occurs for de Sitter and power-law inflation by presenting the results of some simulations.
29 Bohmian Trajectories for -particle States
We will extend the calculations done in [44]. There, the Bohmian trajectories were calculated for the de Bunch-Davies vacuum, while we will start by considering -particle states.
Guidance Equations
We repeat the Hamiltonian for a Fourier mode of the Mukhanov-Sasaki variable that we will use
[TABLE]
Since is a real field, . Thus we take the integral over half the momentum space and we dropped the factor present in the Hamiltonian in chapter 2. We will develop the pilot-wave theory in terms of the real variables and . We note
[TABLE]
The variables and are analogously defined. The Hamiltonian for these variables takes the form
[TABLE]
Here the integral is again over half momentum space. We will suppose, as is often done, that the system is initially in a product state. By this we mean
[TABLE]
where each is a state in the Hilbert space of mode . Since the Hamiltonian does not couple the different variables, time evolution will preserve this product form at later times. We will now turn to the functional Schrödinger picture. The wavefunction of the Mukhanov-Sasaki variable will then factorise into wavefunctions for each mode separately. The wavefunction of an -particle state for a Fourier mode (see appendix A) is given by
[TABLE]
where is a solution to the classical equation of motion, satisfying
[TABLE]
All these equations were introduced in chapter 2 and are repeated for the convenience of the reader. We now split the wavefunction of a single variable in its modulus and its phase, . The real and imaginary part of the functional Schrödinger equation for the Hamiltonian of a single variable then decouple in the following equations
[TABLE]
We will now interpret the two equations in the same way we did in chapter 7. The second equation can be seen as a continuity equation for a probability current . We can now postulate an actual field configuration , whose modes obey guidance equations
[TABLE]
These guidance equations then make interpretation of eq. (253) possible. It is formally very reminiscent of the Hamilton-Jacobi equation of the classical theory, with an added quantum potential term.
Trajectories
We proceed now to solve the guidance equations for some initial condition . We can compute the phase from eq. (250).
[TABLE]
This equation of motion is solved by separation of variables. The solution is given by
[TABLE]
We notice first that this result is independent of the occupation number . This is not surprising, since this is also the case for the pilot-wave theories of the harmonic and inverted oscillator.
Secondly, we identify two regimes for these trajectories. When is large, will essentially behave as a plane wave, see for example eq. (63) for perfect de Sitter inflation. The -states are then harmonic oscillator states and the particles will stand still. This is far from classical movement, since the classical equations of motion are the equations of the harmonic oscillator. When is small, will behave as . More precisely
[TABLE]
where the details of are dependent on the exact model of inflation under consideration. Note that is always a solution of the classical equations of motion. Because of eq. (258), we see that will approximately become a solution to the classical equations of motion too. Thus, for very small, our modes will approximately obey the classical equations of motion.
Let us try to understand what happens a little better. The quantum potential for the state is
[TABLE]
where we have used the recursion relation for the Hermite polynomials eq. (352). The force associated with this potential, in the Hamilton-Jacobi sense, is
[TABLE]
We conclude that the ratio of the classical to the quantum force is
[TABLE]
Note that when is big, will be approximately be equal to its initial value, . Thus, at the onset of inflation, when the modes behave as if on Minkowski space, the ratio of both forces is approximately . The quantum force exactly cancels the classical force, and the movement is very far from classical. On the other hand, after the Hubble exit when is small, the quantum force falls as while the classical force rises as . The quantum force thus becomes completely negligible in the late times limit, leaving the classical force to generate classical trajectories for the Bohmian particles!
Classical Limit
Let us formulate what this means for the classical limit. We now have a theory in our hands that unambiguously speaks of definite field configurations and trajectories. In addition, this theory clearly demonstrates that these trajectories become ‘classical’ for late times. ‘Classical’ is a very well defined concept, meaning that the trajectories predicted by pilot-wave theory will become indistinguishable from solutions to the classical equations of motion of our theory. This theory thus answers our second question from section 12.
In addition, this theory provides an easy way to break the initial symmetry of the problem. While the wavefunction is symmetric, the initial condition clearly is not, and as a consequence the trajectory is no longer symmetric. Note that this is exactly analogous to classical symmetry breaking: while the laws of motion can exhibit a symmetry, individual solutions in general do not possess this symmetry.
30 General Classicality of Trajectories
The computation of the power spectrum of the cosmological perturbations was rather robust, as noted in section 11. With this we mean that corrections to the power spectrum caused by deviations from the Bunch-Davies vacuum all carry a factor . These corrections all vanish in the late times-limit. In addition, the mechanism of squeezing, as presented in chapter 5, does not distinguish much between different initial states. This section will be devoted to the proof of the analogon for pilot-wave theory.
We showed in the previous section that -particle states give rise to classical trajectories in the late times limit. Now the question arises what kind of states also exhibit this classical behavior. We will begin by formulating an exact mathematical theorem and proceed to the proof. The consequences of this theorem will de discussed in section 31.
Theorem 8.1**.**
Let the system be in an initial state
[TABLE]
with
[TABLE]
with and . Let be a solution to the guidance equation for a the real or imaginary part of a certain mode of wavevector , on the interval . Then
[TABLE]
*exists and is finite. *
Proof: To prove the theorem, we fix our attention to a mode . The wavefunction for this mode is of the form
[TABLE]
It will be easier to change variables
[TABLE]
Take arbitrary. The theorem is proven if we can find (with a suggestive notation) and so that for all holds that . We will first construct , in such a way that the values form a Cauchy sequence, while we note the limit of that sequence to be . Then we will show that a certain element of can fulfill the role of .
The velocity field of becomes
[TABLE]
where the notation is possible since only depends on the combination . Now has a finite limit for , see the proof in appendix D. We will use lemma 8.1 below to continuously extend this velocity field to all of and take so that for all and holds
[TABLE]
Due to the lemma in appendix D, is strictly increasing and unbounded on . For this reason, we can take for each , so that . Let . We will now prove that is a Cauchy sequence. We retain from these considerations
[TABLE]
Integrating gives us, for ,
[TABLE]
And thus is a Cauchy sequence. For this reason exists and is finite, denoted formally by .
We will now prove that . Take , . Then, for all we can take . Then the following inequalities hold
[TABLE]
Lemma 8.1**.**
Let be a polynomial in of degree , with time dependent complex coefficients . Define the following quantities:
[TABLE]
*Then can be continuously extended to . If exists for all , then exists. *
Proof: We will first prove that can be continuously extended to A. Pick . Then we can write
[TABLE]
where and . We now compute
[TABLE]
And this limit is finite since . We conclude that can be continuously extended to . Let us denote this extension by . Suppose now that every coefficient has a finite limit for . We will first prove that
[TABLE]
exists and is finite for all . To this end, consider . Since all the coefficients have finite limits for , the limit of also exists. Let us formally denote this by . We then extend by (formally) noting:
[TABLE]
The analogon of eq. (275) applies, and can be continuously extended to all of . We will abuse notation and let signify this extension.
We will now prove that is bounded. Then clearly is bounded too. We fix a time . Note that is a rational function, whose denumerator is a polynomial of degree , while the denominator is a polynomial of degree . Thus . Supposing that is not zero everywhere, fix a so that . We can now find , so that for every holds
[TABLE]
Now the following relation follows
[TABLE]
Since is compact and is continuous, this supremum is finite.
It is now possible to find such a supremum for every time , denoted by . Since ‘taking the supremum’ is a continuous function and is continuous, is continuous as a composition of two continuous functions. Because we could extend to include , we can also define in the logical way. We denote the extended version of by .
The proof is completed by observing the following (in)equalities,
[TABLE]
which is finite because is continuous on the compact interval .
31 Interpretation
This section will be devoted to understanding the theorem in the previous section. In physical terms, what was proven is
If the initial state of the system is a product state and each mode is in a superposition of a finite number of -particle states, then all trajectories generated by the guidance equation become classical in the late time limit.
Classicality
Let us start with the words ‘become classical in the late time limit’. The meaning of this is contained in the following expression
[TABLE]
We conclude that at the end of inflation, where is some constant. Since we already know that itself becomes an approximate solution of the classical equation of motion, we see that becomes an approximate solution to the classical equations of motion. Thus, we can think of as behaving classically for late times.
We can understand more of what happens when we look at the quantum potential. In terms of the polynomial , defined in eq. (263), we get
[TABLE]
where we have dropped the dependencies of to shorten notation. The first term is easily interpreted. It was also present for the pure -particle state and we know it gives rise to a quantum force that is negligible compared to the classical force for late times. The second term is a time dependent constant that does not give rise to any force. The term in square brackets however, can be a very complex function of and time. It constitutes a very important part of the total potential for large and it results in motion that is in general not classical. Rewriting in terms of the rescaled variable gives
[TABLE]
Here is now understood to be a function of and . Under the conditions of lemma 8.1, the phase of as a polynomial in doesn’t vary too fast. The terms inside square brackets then forms a polynomial in which doesn’t vary too much in time. As the factor becomes large, the polynomial in square brackets can’t compensate. Any contribution to the quantum force will be accompanied by a factor , making sure that this part of the quantum force will be negligible compared to the classical force, which grows with a factor .
As a side remark, the above argument is only sufficient to sketch the picture. In itself it is not sufficient to explain the late-time classicality of the trajectories. The exact demonstration is done through the proof of theorem 8.1. For example, the quantum potential blows up for zeros of . In contrast to the velocity field, this quantum potential will not necessarily be bounded. However, this is no problem. The zeros of correspond to zeros of the wavefunction and thus particles have zero probability of being at this nodes, under the assumption of quantum equilibrium.
Let us also try a dimensional analysis of what happens, analogous to the reasoning in section 15. We have noted in section 8 that quantising the Mukhanov-Sasaki variable is equivalent to quantising a scalar field of time dependent mass . As the effective mass becomes bigger in absolute value our trajectories become more classical. This corresponds to our intuition of ‘heavy’ particles behaving classically. We can also look at the de Broglie wavelength for an energy eigen state of eigenvalue
[TABLE]
in regions where the argument of the square root is positive. This wavelength is large when is large compared to but rapidly decreases as becomes the dominant term. We see that quantum effects are important when but the quantum scale rapidly decays when .
Finiteness Criterion
What is the meaning of the condition ‘superposition of a finite number of -particle states’ in theorem 8.1? We will call this condition the ‘finiteness criterion’.
A physical way to realise condition this if we instate some upper boundary on the allowed occupation number for each mode. We could imagine a cut-off energy (with respect to the ground state) that, gives rise to such a maximum occupation number . No admixtures of states above this treshold occupation number would be allowed. A logical choice for the energy scale would be the Planck energy . Since we don’t expect quantum theory to hold on that kind of energy scale, it makes sense to restrict ourselves to the lower energy states. In this way, the superpositions of a finite number of -particle states are the only physically relevant states.
Where did we use this criterion? It was of crucial importance in the proof of lemma 8.1. The finiteness condition is needed in for two things. First, it ensures that there are no places where the velocity field blows up in eq. (275). In effect, it states that the nodes of the velocity field (discussed in chapter 7) ‘behave well’. Second, it assures that the velocity field behaves well at infinity, by ensuring that the numerator of , eq. (273), is a polynomial of lower degree than the denumerator.
Product states
The theorem presented above is valid for product states. While the proof presented is only valid for these kind of states, the condition is not necessary for classicality of the trajectories. In appendix E we present a further generalisation of this proof for non-product states. It does not offer more insight and has the burden of needing cumbersome notation, so we have chosen not to present it here. Also it is generally assumed in the literature that the Mukhanov-Sasaki variable started out in a product state.
32 Getting Rid of the Finiteness Criterion
There are states that do not satisfy the finiteness criterion, being a superposition of an infinite number of -particle states. These wavefunctions have the following form
[TABLE]
Since the Hermite polynomials form a basis for the Hilbert space of functions with a suitable inner product (see eq. (367)), we can construct almost any function at by choosing the coefficients appropriately. Still, we want these infinite states to be normalisable. It follows that these infinite states can be approximated to arbitrary accuracy by a superposition of a finite number of -particle states. The theorem 8.1 would hold for every approximation. This means that the trajectories for each approximation would be classical. There is however no guarantee that the trajectories of would become classical eventually, even though the trajectories of every approximation become classical.
On the other hand, there are special states that do not satisfy the finiteness condition, but which do exhibit classical trajectories in the late time limit. An excellent and physically important example are the coherent states,inspired by the calculations in [23]. They are defined as the eigenstates of the annihilation operator. For an eigenvalue we get
[TABLE]
Their wavefunctions are given by
[TABLE]
where we have used eq. (368). The guidance equation is exactly solvable for these states. The trajectories are
[TABLE]
For late times, because the integral is negligible compared to the first term. We thus recover the same classicality as the states that do satisfy the finiteness criterion.
The existence of states that do not satisfy the finiteness criterion but exhibit classical trajectories invites us to look for a generalisation of theorem 8.1. Careful inspection of the proof presented reveals that it is easy to prove the following generalisation.
Theorem 8.2**.**
Let the system be in an initial state
[TABLE]
Define
[TABLE]
and
[TABLE]
Let be a solution to the guidance equation for a certain mode of wavevector , on the interval . Suppose
[TABLE]
is bounded. Then
[TABLE]
*exists and is finite. *
The proof is completely analogous to the proof of theorem 8.1. It is only different in the fact that it does not use lemma 8.1. We will not state the proof here.
This theorem is now strong enough to prove the classicality of the trajectories of the coherent states. The quantity is for those states
[TABLE]
with as in eq. (292). This is obviously bounded and the coherent states satisfy the conditions of theorem 8.2.
It is very difficult to further identify states that satisfy the conditions of theorem 8.2, due to the rather abstract mathematical conditions it poses. It should also be noted that we did not succeed in finding an explicit state that does not satisfy the conditions of theorem 8.2. We hope that theorem 8.2 can be extended to include all possible product states of the Mukhanov-Sasaki variable. A viable possibility would maybe run along the following lines:
- I
An arbitrary initial wavefunction can be written as a (in)finite superposition of -particle states, because the Hermite functions form a basis for the square integrable functions. So 2. II
For the first terms in this sum, the quantity is bounded, say by . 3. III
Normalisability of the wavefunction can be used to prove that is finite. 4. IV
The quantity is then bounded and satisfies the conditions of theorem 8.2.
This sketch of a proof would work if one could find a way of implementing step III. The idea of that step would be that the wavefunction would need to be normalisable, and that the ‘tail’ of the sum over states would not contribute much to due to being small in the ‘tail’. However, we remind the reader that for all states of realistic interest, a finite superposition of states is sufficient. We do not claim that our theory holds once we reach energies on the Planck scale and so the accessible number of initial occupation numbers is limited.
33 Power Spectra
We will now calculate the power spectrum for a slow-roll inflation theory in this pilot-wave formalism. Taking as initial state the Bunch-Davies vacuum and assuming quantum equilibrium, we get for the power spectrum of the rescaled Mukhanov-Sasaki variable
[TABLE]
Thus, for de Sitter or power-law inflation, we know that for late times, and we regain the power spectrum for the rescaled Mukhanov-Sasaki variable as presented in section 11, under the condition of quantum equilibrium. We will discuss this assumption in detail in chapter 10.
Let us now look at the power spectrum of the density perturbations. We repeat the classical equation, eq. (103), that couples the Fourier components of the density perturbations to the classical modes of the Mukhanov-Sasaki variable terms of the slow-roll potential ,
[TABLE]
We have seen that for late times the quantum mechanical variable will evolve in very good approximation according to the classical equation of motion. Thus we trust the above equation to couple the classical density perturbations to the (approximately) classical at late times. Substituting our solution for the equations of motion we find
[TABLE]
At first sight, we get an equation that is very different from the one in section 11: an extra factor of and the initial condition are introduced. We will explicitly show that these two contributions cancel. Under the assumption of quantum equilibrium we have the following equality
[TABLE]
The initial wavefunction is a Gaussian of variance . We get as final result
[TABLE]
which is the analogue of eq. 93. As before, we now note that for de Sitter and power law inflation the ratio is approximately constant for late times and proportional to . We regain thus a scale invariant power spectrum, in agreement with observation.
Chapter 9 Quantifying Classicality
In the previous chapter, we have proven that pilot-wave trajectories for a large class of initial states will always resemble the classical trajectories in the large time limit. In this chapter we will deepen our understanding of this phenomenon by presenting some simulations for de Sitter and power-law inflation. We will primarily determine how much these trajectories will deviate from the classical trajectories for a given amount of inflation and find that for realistic amounts of inflation this process is fast enough to create classicality for the relevant modes.
As a starting remark, note that theorem 8.1 will be sufficient for the rest of this chapter. Computers can only handle a superposition of a finite number of -particle states numerically. For any simulation executed, classicality will eventually set in due to this calculational constraint. In addition, we will only study the case for product states, since the case for entangled modes is computationally harder and expected to be analogous.
34 Two Sources of Deviation from Classicality
We will fix a mode . We showed that the trajectories for finite superpositions will all resemble the classical ones in the late times limit. This was implemented by proving that for every trajectory
[TABLE]
for some constant , when . There are two important sources here that make deviate from the classical trajectories. First, the trajectory of will deviate from . Second, will never solve the classical equation of motion exactly, only approximately. We will start by addressing the first issue. In the proof of theorem 8.1 we stated that the velocity field of the rescaled variable was
[TABLE]
Integration gives us
[TABLE]
The trajectory will not deviate much from time at time when
[TABLE]
This condition is always fulfilled in the case of pure -particle states, since in that case. For other initial states, this happens when becomes very large. This is the case when the mode has exited the Hubble radius, i.e. when and the classical potential switches sign. We will take the lefthand side of this equation as a measure of the deviation from classicality from this source.
When does become an approximate solution to the classical equations of motion? Note that satisfies the following equation
[TABLE]
Since The term with will lose its importance when
[TABLE]
This happens soon after the Hubble exit of the mode as both and increase rapidly. We will take the left hand side of this equation as a measure of the deviations from classicality due to this source .
The conclusion of this section is thus that we expect the trajectories to be indistinguishable soon after the Hubble exit of the mode. To be able to say more, we have to specify a model of inflation, thereby giving us a formula for .
35 De Sitter and Power-Law Inflation
We will further focus on the special cases of de Sitter and power-law inflation. Both are slow-roll models and we can thus safely approximate by (see eq. (56)). The scale factor is then given by
[TABLE]
where corresponds to de Sitter inflation and corresponds to power-law inflation. So, in terms of the cosmic time the scale factor is
[TABLE]
Lucchin shows [34] that this type of inflation is compatible with observations when or . Thus we get as set of possible values or . These are the values we will use in simulations. These types of inflation have the additional bonus that we can solve for the function exactly. We repeat eq. (62)
[TABLE]
Let us compare the two sources of deviations from classicality in the previous section for de Sitter and power-law inflation. Taking we compute the ratio of the deviations some time after Hubble exit, when
[TABLE]
For the dominant source for late times will always be . In what follows, we will neglect as source of non-classicality.
36 Technicalities of the Simulations
Before we move on to the simulations themselves, we will briefly comment on some technical details.
The variable has a velocity field that becomes extremely large for . The differential equations for this variable present an extremely stiff problem for and have proven to be very hard to solve reliably. Instead we simulated the rescaled variable
[TABLE]
The variable , by virtue of lemma 8.1, has a bounded velocity field and represents a computational problem that is far better conditioned. Since the functions are exactly know for de Sitter and power-law inflation this change of variables does not entail any kind of extra uncertainty. In addition, graphs for this rescaled variable are far more clear, since trajectories of will become unbounded in the limit.
The guidance equations for the trajectories were solved using a fourth-order Runge Kutta-method of adaptive step size, implemented by the module ode45 in Octave. Accuracy was checked by calculating the time-inverted trajectories, starting from their end position, and comparing the time-reversed end position with the initial condition. Errors were estimated using this method to be of order to for superpositions of around different quantum numbers up to . Higher accuracies, higher quantum numbers and superpositions of more different quantum numbers are achievable for increased computer time, but this was deemed unnecessary.
Concerning the subject of simulational parameters, it is important to realise that the choice of is not irrelevant. Different choices of will imply different initial conditions for . In general, was chosen in such a way that the mode under consideration starts out in very nearly Minkowski space . Practically we chose the parameters so that at least was guaranteed. In terms of the equations of motion, this guarantees that the term is at least two orders of magnitude smaller than the term. Computationally, this places some restrictions on the parameters used. One doesn’t want to choose to be too big, since that would involve taking many time steps. Furthermore taking big results in problems in the Minkowski regime. In that regime, the system will essentially behave as a harmonic oscillator. The typical frequency of the system is determined by energy differences, proportional to . High means high frequency which needs a lot of time steps. We thus need to strike a balance between high and early times .
To finalise this section, let us introduce the notation for the parameters of the simulation we will adopt. ‘Quantum numbers ’ with ‘weights ’ means that the initial state of the mode was prepared in
[TABLE]
where the numbers are in general complex and were often taken randomly. Initial non-product states were not simulated. Their behaviour is thought not to be very different from the product states, by virtue of the theorem in appendix E. To keep in contact with the physically relevant situation, was taken to be bigger than . Relativistic units were chosen.
37 Typical Trajectories
We will begin by analysing some trajectories of the variable . The case for pure -particle states is not interesting, the velocity field of will be zero. We will take as first example a superposition of weights of equal amplitude but random phases. The trajectory of such a mode is shown in fig. 5. The general process is clear. starts off by moving in a seemingly oscillatory way, which corresponds to the field evolving on Minkowski vacuum. At that time is approximately and the variable is directly proportional to .
As increases, the effective frequency of our oscillator drops and the oscillations become slower and slower. Their period rises and their amplitude gets damped as increases. In the limit of the velocity field of becomes negligible and settles to a constant value. This means that in that limit, as was proven in theorem 8.1. This can be seen more clearly on fig. 6, which describes the same simulation but with a logarithmic horizontal and restricted to later times. The picture is more clear here. One can see that the period of the oscillations increases and that the freezing of the mode clearly takes place at , meaning at Hubble exit .
It turns out that this behaviour is quite generic for different superpositions and different values of . For example, in fig. 7 a trajectory is plotted for power-law inflation. The behaviour of does not differ quantitatively from the previous figure. This type of trajectories was reliably confirmed for and superpositions of up to different with between [math] and . Qualitatively the same behaviour was observed for up to with superpositions of up to different . These results are not presented since the error estimates proved to be too big for systematic quantitative analysis.
38 Freezing Time
We will now come to a more systematic study of the time when becomes classical. can be easily computed since we can solve for exactly. The only unknown quantity in section 34 is the . We will thus inquire about the moment when eq. (302) holds.
The first problem to for doing this systematically is that eq. (302) is a worst-case scenario, since is an upper boundary on the initial velocity field of the variable . Given a certain initial state, one can in principle compute . But it is very much possible that a trajectory will have derivatives that are a lot smaller than . Second, while one could compute for one given initial state, it is very hard to do this more generally. Let’s say we have an initial state
[TABLE]
will then be a very complex function of the and , since the wavefunctions of states involve the Hermite polynomials and time dependent phases. While we have tried to get an analytical estimate of the ‘average velocity’ of along some trajectory, the two problems mentioned are the reasons we failed. It is here that simulations bring solace.
The method we used to extract information from the solutions is simple. We take the value of at the end time of our simulation, and then select the time so that for all we get that . We effectively measured the time at which deviations from the limit value were smaller than . The problem with this approach is that we have to choose a ‘hard’ value of . Taking to be too small will result in to be very susceptible to numerical errors, either because of numerical instabilities or because of our finite simulation time. Taking too large will place in the regime where the trajectories still oscillate.
A good value of would be between and . Looking back at fig. 5 we see that the typical scale on which is oscillates is of order to . This scale of oscillations was generally confirmed, for example by fig. 7 where we see qualitatively the same behaviour. In addition, a value for larger than will make rather robust to numerical effects, since those are expected to be at most of order .
Dependence on
Let us first look at the -dependence of . We can see the dependence clearly in fig. 8 for a simple superposition. A log-log version of this plot is shown in fig. 9, together with a first order fit. On that figure it is even more clear that is a constant. A first order polynomial fit in the least squares sense gives
[TABLE]
We can safely assume that .
39 Is Classicality reached?
While the dependence on other parameters was not very clear, we demonstrated in the previous section that was approximately a constant for a given . Let us use this information to make an estimate of how much realistic modes can deviate from the classical trajectories.
Let us go back to eq. (302). For , . Of course, eq. (302) was a worst case scenario. We can however expect some variant of eq. (302) to hold, meaning that we will replace the maximum of the velocity field by some suitable constant that could be thought of as representing the average velocity of . We thus look for a dependence
[TABLE]
long after the Hubble exit.
Let us take again as in the previous section. Consider fig. 10 for de Sitter inflation. Plotted is the behaviour of as a function of our choice of between and . The values of were obtained by averaging over the simulations for different presented in fig. 8. The polynomial fit is given by
[TABLE]
This is consistent with eq. (314)
[TABLE]
where the constant coefficient is of order .
We observe essentially the same behaviour for power law inflation. In figure 11 we again see a log-log plot of classicality time as a function of the required accuracy . The black line is the first order polynomial fit given by
[TABLE]
which is again fully consistent with the relation
[TABLE]
where is now of order .
For de Sitter inflation, the results of the previous section allow us to estimate the classicality of the modes at the end of inflation by extrapolating our results. The non-classicality is dominated by . For simplicity, we take e-folds of inflation. This means that . Let us take a mode that left the Hubble radius early, so . Then we get
[TABLE]
Thus the process we described is certainly fast enough to achieve classicality de Sitter inflation, if we take reasonable estimates for . For the example in the previous section, was of order .
The same result applies for power-law inflation. Taking again e-folds of inflation, we have
[TABLE]
Again, taking reasonable assumptions for we get extremely small deviation from classicality. As an illustration, we take the example of fig. 11, meaning power-law inflation and of order . We get an upper bound on .
40 Conclusion
We can conclude from this chapter that the classical limit for the primordial modes is expected to be reached. We repeat that there are two sources of deviation, as presented in section 34. For pure -particle states, the only contribution arises from , which we can calculate exactly. For finite superpositions, we can neglect compared to for late times. Since this contribution is very hard to calculate in general, we resorted to numerical simulations. The deviation is seen to follow the estimate , where is some state dependent parameter. This relation suggests that for reasonable the classical limit is reached sufficiently fast.
Chapter 10 Cosmological Perturbations: a Test for Quantum Mechanics?
In chapter 8 and chapter 9 we have shown that pilot-wave theory presents a solution to the problems stated in section 12. Firstly, it explains why the cosmological perturbations look classical to us. Secondly, it presents a very natural way of breaking the initial symmetry present in the de Bunch-Davies vacuum. This picture is not very dependent on the choice of initial state. Thirdly pilot-wave theory leads to the same scale invariant power spectrum as the standard account does under the assumption of quantum equilibrium.
This chapter will be devoted to exploring the possibilities of non-equilibrium in the early universe. We will first discuss the concept of non-equilibrium and relaxation to equilibrium for pilot-wave theory in general. Then we will look at what opportunities inflation presents us with to detect possible non-equilibrium in the early universe, as they are displayed in the literature.
41 Non-Equilibrium and Relaxation
In the Copenhagen interpretation of quantum mechanics the Born rule is simply postulated. In pilot wave theory, the Born rule arises automatically when the density of the postulated particle is equal to the modulus squared of the wavefunction . The continuity equation, eq. (213), guarantees that the equality will be preserved by time evolution.
It does not have to be this way. While it is completely possible to regard quantum equilibrium as an additional postulate in pilot-wave theory, the formalism itself gives no reason why needs to be equal to . Taking makes sure that pilot-wave theory cannot be differentiated from standard quantum theory, in this regime the predictions of both theories are the same. While one could prefer the picture pilot-wave theory paints, there is no reason in that case to prefer one theory over the other. Only when does pilot-wave theory differ from standard quantum theory. The search for systems that are in non-equilibrium is thus an important test of quantum mechanics.
So far, no quantum non-equilibrium has been observed. If it exists, how do we explain its apparent absence? There are indications[56] that systems prepared with initial non-equilibrium distributions tend to relax to equilibrium. Note that the continuity equation forbids any non-equilibrium distribution to evolve to the exact equilibrium distribution, but non-equilibrium distributions can come arbitrarily close. Both distributions are then essentially the same on a coarse grained level. Using numerical simulations it is often found that relaxation is very efficient[51, 15]. Relaxation typically happens on the same time scale as the considered wavefunction varies, so
[TABLE]
where is the spreading in energy of the considered wavefunction. Notice that relaxation occurs on very large time scales when is small. In the case of a pure -particle state of an harmonic oscillator, the particles do not move. Every distribution is a stationary one and no relaxation occurs. This is also the case for modes of the Mukhanov-Sasaki variable, long before Hubble exit, see section 29. For complex superpositions of modes, the relaxation time scale will generally be small.
42 Non-Equilibrium in the Early Universe
Since relaxation is expected to be very efficient in realistic situations this would explain why we only observe quantum equilibrium. If there was initially some quantum equilibrium in the early universe, it would very rapidly have relaxed.202020It has even been suggested that non-equilibrium may have been produced in the early universe due to quantum gravitational effects, see [52].
Without falling into pure speculation we can think about what would have happened to early non-equilibrium. In standard Hot Big Bang theory, it would have relaxed on a very short time scale and would be well outside present experimental limits. In inflationary models of our universe, there is another possibility. We have proven in chapter 8 that trajectories in pilot-wave theory typically get ‘frozen’. We mean by this that the physical variable does not move in the late times limit. It is clear that any distribution present at that time will also be frozen and thus not evolve any more in time. When such a frozen mode then re-enters the Hubble radius, it will again behave as a harmonic oscillator and relaxation can commence.
Freezing Inequality
The scenario explained in the previous paragraph would mean that modes with wavelengths that are larger than the Hubble scale now can still carry evidence of initial quantum non-equilibrium. They would have exited the Hubble radius very early during inflation, effectively ‘freezing in’ any non-equilibrium that would have been present. This non-equilibrium could, in principle, be observed by us as soon as the modes re-enter the Hubble radius. In [53], Valentini derives the freezing inequality. Quantum non-equilibrium will not relax on an time interval for a mode of some scalar field of wavevector when
[TABLE]
Here is the Hamiltonian associated with the real or imaginary part of the mode , given by
[TABLE]
where . These equations were derived for a free massless scalar field without potential that lives on an expanding, but not necessarily inflationary, background. This is of course a harmonic oscillator for every mode with time dependent mass. One can do something in close analogy with appendix A and we can find a number operator with positive integer eigenvalues so that[53]
[TABLE]
The freezing inequality becomes
[TABLE]
This freezing inequality was derived for general expanding spacetimes. The author argues that this inequality predicts freezing of non-equilibrium for super Hubble modes for pre-inflationary eras, say between Planck time and the start of inflation. Valentini suggest then, in another article [54], that non-equilibrium will be transferred intact troughout some inflationary period. One can see this as follows. Starting from the de Bunch-Davies vacuum, the modes of the Mukhanov-Sasaki variable will evolve in time as (see section 29)
[TABLE]
Since this time evolution is so simple, any initial distribution will essentially spread out in time but retain its form. More exactly
[TABLE]
This of course corresponds to the physical variables being frozen in the late times limit.
There are however some problems with this picture. Firstly, pre-inflationary periods are very poorly understood. We are not even entirely sure whether inflation happened and which of the many possible models is closest to reality. We thus should be rather distrustful about the validity of the action from which the Hamiltonian in eq. (323) was derived. Second, the picture painted in [54] is slightly too simplistic. For an exact Bunch-Davies vacuum the trajectories are indeed that simple, but this is not longer true for states that deviate a little bit from the Bunch-Davies vacuum. We have explicitly shown in chapter 9 that the trajectories for superpositions of states are far from simple. Before Hubble exit the system is a harmonic oscillator and we would expect relaxation on the time scale in eq. (321).
One could for instance try to remedy this by applying the freezing inequality to inflationary periods. A best-case scenario will give us some insight. The inequality is best satisfied for . We take a de Sitter inflationary period of e-folds of inflation for simplicity. The modes that satisfy the freezing inequality then satisfy
[TABLE]
This is clearly a very extreme result. This would mean that the wavelengths of modes that retain quantum non-equilibrium would be times bigger than the Hubble radius at the start of inflation. This would correspond to wavelengths of at the end of inflation. It is out of the question that we would observe any such modes.
The problem lies in some of the inequalities used in the derivation of the freezing inequality. While certainly valid, they seem to be too crude to give useful results.
Sharpening the Freezing Inequality
For actual inflation, we will sketch how better estimates could be made. We focus on de Sitter inflation of e-folds for . The criterion used to look for the freezing of quantum non-equilibrium in [53] reads for our rescaled variable
[TABLE]
The left hand side is the ‘distance travelled’ between and . To be precise: and the expected value should be taken with respect to an equilibrium distribution. The right hand side is the uncertainty in of the considered wavefunction. The idea is now the following: if the particles do not move over scales comparable to the spread of the wavefunction, there can be no relaxation on the level of the entire wavefunction212121The inequality is of course more rigorously argumented in [53]..
Analogously to the treatment in [53], we will estimate an upper bound for the left hand side for wavefunctions that satisfy the conditions of theorem 8.2. We thus suppose an initial product state with a bounded velocity field for . Notice that the following inequalities hold
[TABLE]
where the constant is defined as in chapter 8, being an upper bound on the quantity . Let us first focus on modes that start inside the Hubble radius at the onset of inflation. The last integral is then comprised of two different parts: before and after Hubble exit of the mode. In the first regime is essentially constant, and in the second it roughly behaves as . We estimate the integral as
[TABLE]
We have seen in chapter 9 that typical trajectories do not move appreciably after Hubble exit. We will thus neglect the second term in this equation.
We will now turn to in the right hand side of eq. (329). For the vacuum, is given by . We will thus estimate, as a lower bound, .
Collecting everything, we estimate that non-equilibrium will survive during a period of de Sitter inflation is
[TABLE]
For a given initial state, and thus given , this is a condition on the wavelength of the mode under consideration. We see that non-equilibrium will be maintained if the mode exits the Hubble radius soon enough. This is could have been anticipated: before Hubble exit the modes behave as harmonic oscillators and relaxation can occur.
Let us now focus on modes that start far outside the Hubble radius at the onset of inflation. The velocity integral is then
[TABLE]
The freezing inequality becomes
[TABLE]
Since the mode is initially outside of the Hubble radius, we know that . We thus see that this inequality is very easily satisfied for modes that start outside the Hubble radius. For such modes, we do not expect appreciable relaxation to equilibrium.
This is only a sketch of an argument, mainly for two reasons. First, the splitting of the integral in eq. (331) is very rough. Second, we have ignored any role that plays in the discussion. Relaxation is obviously heavily dependent on the initial states of the Mukhanov-Sasaki variable. For example, if it started in the Bunch-Davies vacuum, would be zero and no relaxation at all could occur. On the other side, if some initial super-Hubble mode would be in a very complex superposition of -particle states, then we could imagine being very large and thus permitting relaxation for that mode. A complete theory of relaxation during inflation would thus include some estimates for .
43 Detection
Let us now come to the detection of possible non-equilibrium. Let us assume for simplicity that the Mukhanov-Sasaki variable started out in the de Bunch-Davies vacuum. Every Fourier mode would then have a Gaussian wavefunction. As first proposed test [54] lies in the power spectrum itself. The calculation done in section 33 was obviously dependent on the assumption of quantum equilibrium. We conclude that non-equilibrium would lead to a different power spectrum.
There is however an ongoing search for non-Gaussianity, on which pilot-wave theory also has an interesting take. Let us to that end compute the correlator of three modes
[TABLE]
If we assume quantum equilibrium, then is a Gaussian. It is then easy to see that this integral will always be zero, for symmetry reasons. This means that the three-point-correlator should be zero under the assumption of quantum equilibrium. We conclude that in the case of non-equilibrium being present in some of the modes that are discernible across the sky that the three-point-correlator will in general not be zero. Since the results of the WMAP experiment were not sensitive enough to give any definite information on the three-point-correlators, it is one of the aims of the Planck mission to provide us with more data [1].
As mentioned before, this idea fits in the broader scheme of the search for non-Gaussianity. All the correlators of arbitrary order can be trivially specified in terms of the correlator of second order if the distribution is Gaussian. The odd orders are zero (as shown above for degree three) while the even powers are just the second order to some appropriate power. Deviations from this Gaussian distribution can be easily inferred from the behaviour of the different correlators and are generally termed non-Gaussianity. It is one of the primary goals of the Planck mission to test for non-Gaussianity[1]. If the Planck mission would find non-trivial behaviour of the correlators, we have shown that quantum non-equilibrium offers a possible explanation. It should be noted that non-Gaussianity is a very generic prediction. For example, taking into account non-linear effects in inflation (meaning taking a higher order expansion in section 7) already yields a (small) amount of non-Gaussianity. Multi-field inflationary models can also easily account for any non-Gaussianity, as well as several non-inflationary models[1].
44 Non-Inflationary Tests
To conclude this chapter, we will briefly mention some other tests for non-equilibrium in the early universe, taken from [52]. Based on the freezing inequality presented in section 42, one can expect suppression of relaxation in other expanding regimes. More precisely, particles that decoupled very early from the rest of the energy density in the universe could be expected to retain some degree of non-equilibrium. Such particles have been given the suggestive name of ‘relic particles’. Gravitinos, for which decoupling at Planck time is expected, are suggested as a possible candidate. One however has to keep in mind that inflation, if it occurred, has probably diluted their presence to unobservable densities. The search for relic particles can thus only expected to be fruitful in non-inflationary models.
As a final possibility, Hawking radiation from black holes might be in non-equilibrium. Roughly speaking, because of the no-hair theorem it seems that black holes can effectively destroy information. While completely thermal Hawking radiation does not solve this, it has been suggested that out-of-equilibrium Hawking radiation can carry more information than normal radiation, thus possibly providing an answer to the black hole information paradox [52].
Conclusion and Outlook
The aim of this thesis was to try and understand the problem of classicality in the context of primordial perturbations. The proposed solutions are ample, but we have seen that the scientific community is in general divided on the subject. The important questions are
- I
How does one go from an initial homogeneous and isotropic state to a state with inhomogeneities? 2. II
Why can we consider the density perturbations to behave essentially classical at the end of inflation?
Already in section 12 we argued that the answer quantum mechanics provides, a measurement, is not enough to explain the classicality of the primordial perturbations. In chapter 4 we explicitly reviewed some traditional answers to the question and found many to be insufficient. Two proposed solutions were examined, the theory of squeezing and the theory of decoherence. The first one was found to make invalid approximations, in stating that two conjugate quantum mechanical operators commute. The second is physically extremely feasible and explains the absence of quantum interference. However one has to supplement decoherence with a measurement or some effective collapse mechanism to achieve the classical limit.
We examined the answers that pilot-wave theory offers. The classical limit in this framework is more easily attained since the theory treats definite particles (or field modes) with definite trajectories. The answer to the second question above is also easy: the symmetry is broken by the initial conditions. The answer to the first question is more involved: we found in chapter 8 that the trajectories for a large class of initial states, including the vacuum, become classical in the late times limit. We explicitly showed that pilot-wave theory reproduces the predictions of standard quantum mechanics in this situation, under the assumption of quantum equilibrium. In chapter 9 we provided the reader with explicitly calculated trajectories to help intuition. We argued, using numerical simulations, that this mechanism is capable of reaching classical trajectories in realistic periods of inflation.
Further investigations could possibly be made to extend our picture to arbitrary initial states, and mathematically prove that they all give rise to classical trajectories in the late times limit. The possibility of quantum non-equilibrium in the early universe also needs more work. We gave some sketches in chapter 10, but the details are yet to be understood. The data the Planck mission will provide should bring more insight into the discussion, possibly discerning between different inflationary models and putting upper limits on the amount of quantum non-equilibrium in the early universe.
Nederlandstalige samenvatting
Als we naar de hemel kijken, merken we dat we vanuit alle richtingen een merkwaardig homogeen signaal opvangen, vooral in het micrometer regime. Dit is straling vanuit de verste delen van het observeerbare universum. Bovendien is deze kosmische achtergrondstraling een van onze beste manieren om informatie over het vroege universum te krijgen. De eerste observatie is al meteen de belangrijkste: de achtergrondstraling is uiterst homogeen. De temperatuur van de straling die we opvangen is zo goed als onafhankelijk van de richting waarin we kijken. Dit is een belangrijke experimentele bevestiging van het kosmologische principe: het universum is homogeen op zeer grote schaal.
Maar het is duidelijk dat het universum op kleine schaal verre van homogeen is. Dit is ook te zien in de achtergrondstraling: deze is niet perfect homogeen maar bevat kleine inhomogeniteiten. Als we uitgaan van het kosmologisch principe moeten we natuurlijk een manier vinden om die kleine storingen te verklaren: waar komen ze vandaan en waarom zijn ze zo verdeeld over de hemel? Daarover gaat deel I.
In hoofdstuk 1 wordt de relativistische theorie van inflatie uit de doeken gedaan. Het idee is dat het vroege universum een zeer korte periode van versnelde expansie doormaakte. Wij presenteren het simpelste model, namelijk slow-roll inflatie. Hierin wordt de expansie aangedreven door een reëel scalair veld, het inflatonveld, dat leeft op een achtergrond van een homogeen universum met FLRW-metriek. Voor een universum waarin de energiedichtheid wordt gedomineerd door zo’n veld, zal de schaalfactor exponentieel snel stijgen, met de Hubble parameter
[TABLE]
In hoofdstuk 2 veranderen we het homogene universum uit het vorige hoofdstuk in een inhomogeen universum. Dat wil zeggen dat we de kleine storingen in zowel de metriek als het inflatonveld beschouwen. Het zal voldoende zijn om de variabelen te splitsen in homogene achtergrond en scalaire storingen. Vector- en tensorstoringen zijn van ondergeschikt belang. Deze splitsing is natuurlijk niet invariant onder veralgemeende coördinaattransformaties en we gaan dus op zoek naar ijkinvariante combinaties. Uiteindelijk komt het erop neer dat de scalaire perturbaties maar één relevante vrijheidsgraad hebben, de Mukhanov-Sasaki variable met Hamiltoniaan in momentumruimte
[TABLE]
waar het momentum geconjugeerd aan is en apostrofes wijzen op afgeleides naar een andere tijdscoördinaat, de conforme tijd . Deze Hamiltoniaan beschrijft Fourier modes die zich eerst gedragen als harmonische oscillatoren. Vanaf gedragen ze zich echter heel anders, als omgekeerde oscillatoren. Op het einde van dit hoofdstuk kwantiseren we deze Hamiltoniaan. We vinden annihilatie en creatie operatoren en de bijbehorende -deeltjestoestanden.
In hoofdstuk 3 beginnen we met de inhomogeniteiten aan de hemel beter te beschrijven: we karakteriseren ze in termen van hun vermogensspectrum. Vervolgens leggen we uit hoe de kwantumtheorie uit het vorige hoofdstuk aanleiding geeft tot een schaalinvariant vermogensspectrum dat de observaties verklaart. Hier blijken een aantal onduidelijkheden te bestaan. In sectie 12 stellen we de volgende vragen:
- [1]
Hoe geeft een kwantummechanische aanleiding tot klassieke fluctuaties van de energie? 2. [2]
De initiële golffunctie voor is symmetrisch. De verdeling van de inhomogeneiteiten aan de hemel is dat niet. Hoe is dit mogelijk?
Een uitleg gebaseerd op een meting is niet mogelijk in deze kosmologische context.
In deel II onderzoeken we de mogelijke antwoorden die de literatuur aanbiedt. Hoofdstuk 4 houdt zich bezig met het onderzoeken van allerlei criteria voor de klassieke limiet. We bespreken de rol van onzekerheden, de WKB benadering, de Wigner functie en de rol van de natuurconstanten met als voorbeelden het vrije deeltje en de omgekeerde oscillator. In hoofdstuk 5 bekijken we het formalisme van de zogenaamde squeezed states. Dit formalisme is gebaseerd op een benadering, namelijk
[TABLE]
Deze benadering is duidelijk verkeerd, omdat deze commutator altijd is.
In hoofdstuk 6 bekijken we een andere prominente verklaring. Decoherentie veronderstelt extra interacties tussen verschillende modes van de Mukhanov-Sasaki variabele of interacties met andere velden. Deze resulteren in een (bij goede benadering) diagonale dichtheidsmatrix en verklaart dus het afwezig zijn van interferentie tussen verschillende toestanden. We merken echter op dat verscheidene details in de literatuur nog niet uitgeklaard zijn, zoals bijvoorbeeld het moment waarop decoherentie zijn werk deed. Helaas heeft decoherentie geen antwoord op onze tweede vraag: er is een extra meting (of een effectief collapsemechanisme) nodig om de overgang van een symmetrische naar een assymetrische toestand te verklaren.
In deel III bekijken we welke antwoorden de Broglie-Bohm mechanica kan aanreiken. In hoofdstuk 7 leggen we de theorie uit. Het verschil met standaard kwantummechanica ligt er voornamelijk in dat deze theorie het heeft over goed gedefinieerde deeltjes (of Fourier modes in veldentheorie) met goed gedefinieerde trajecten, die worden bepaald door eerste orde differentiaalvergelijkingen. Meer concreet schrijven we een golffunctie als
[TABLE]
met en reëel. De vergelijking wordt dan
[TABLE]
Dit komt overeen met de beweging van een deeltje onder de invloed van de klassieke potentiaal en een extra potentiaal, de kwantumpotentiaal
[TABLE]
Onder een bepaalde voorwaarde, die van kwantumevenwicht, zijn de voorspellingen van deze theorie niet onderscheidbaar van de voorspellingen van de standaard kwantummechanica. Vervolgens bekijken we hoe precies de klassieke limiet moet gezien worden in dit kader. Het is duidelijk dat de klassieke limimet bereikt wordt wanneer de trajecten oplossingen zijn van de klassieke bewegingsvergelijkingen. Dit gebeurt als de kwantumpotentiaal niet te hard varieert. We bekijken dit ook weer kort aan de hand van de voorbeelden van het vrije deeltje en de omgekeerde oscillator.
In hoofdstuk 8 lossen we de differentiaalvergelijkingen op voor de Mukhanov-Sasaki variabele. Voor de -deeltjestoestanden vinden we de klassieke limiet zeer natuurlijk terug voor late tijden: de modes van volgen in die limiet de klassieke bewegingsvergelijkingen. Vervolgens zoeken dan verder naar de oplossingen voor superposities van een eindig aantal -deeltjestoestanden. Deze zijn niet meer analytisch op te lossen, maar we bewijzen rigoureus dat deze klassiek worden voor late tijden.
In hoofdstuk 9 werken we aan onze intuïtie: we lossen de bewegingsvergergelijkingen numeriek op en bevestigen dat ze zich klassiek gedragen voor late tijden. Daarbij zoeken we uit op welk moment we eigenlijk kunnen spreken van de klassieke limiet. We vinden door numerieke simulaties en theoretische schattingen dat de afwijking van een klassiek traject op het einde van een de Sitter inflationaire periode aan de volgende ongelijkheid voldoet
[TABLE]
waar een constante is van orde .
Tenslotte beëindigen we deze thesis in hoofdstuk 10 met ons af te vragen wat er gebeurt als er niet voldaan is aan de voorwaarde van kwantumevenwicht. In de literatuur wordt geargumenteerd dat we dan eventueel kwantumonevenwicht zouden kunnen detecteren in de achtergrondstraling. We geven aan waar deze ideeën wat te snel door de bocht gaan, maar we schetsen hoe een volledige theorie er zou moeten uitzien en hoe dit onevenwicht precies zou kunnen gedetecteerd worden.
Dankwoord
Veel van ons hebben zich er al sinds de jaren ’20 mee verzoend dat de wereld niet deterministisch is. Ik weiger echter te aanvaarden dat de volgende mensen een of ander golfpakket vormden tot ik ze omeeting op ze uitvoerdde.
Aan mama en papa, die toch wel voor zowat alles gezorgd hebben. Ook al kregen gasten de opmerking ‘Vraag maar niet naar zijn thesis’ en ondanks dat papa de indruk had dat ‘ze weten toch eigenlijk niet wat aanvangen met jullie, die laatste twee jaar’, hebben ze zich altijd achter mij geschaard.
Aan Ward, die me zowel voor mijn thesis als mijn bachelorproject begeleid heeft. Hij is een van de weinige personen op deze wereld die het belang van eenvoud en duidelijkheid volledig naar waarde schatten. De vraag ‘maar wat betekent dat eigenlijk’ is een vaak genegeerde vraag in veel artikels en boeken, en ik bewonder hem enorm dat hij toch van die vraag zijn werk heeft gemaakt.
Aan Wina en aan mijn jaargenoten in het bijzonder, aan wie ik veel indirect te danken heb. Met sommigen van hen heb ik (te) veel gedronken, met heel veel heb ik gelachen en van allemaal heb ik ongelofelijk veel bijgeleerd. Ik ga altijd met veel plezier terugkijken op allerlei discussies: ‘many-worlds interpretaties’, ‘staat er een overaftelbaar aantal sterren aan de hemel’ en ‘proffen die collapsen naar overal, behalve waar ze moeten zijn’ zijn er slechts enkele. Speciale dank aan de kerels van het bureau: Hans, Ruben en Alberto voor nog veel meer discussies over zo mogelijk nog gekkere onderwerpen.
Aan Hanne, want ze verdient toch een speciale vermelding, omdat ze al die jaren voor eten heeft gezorgd.222222En omdat ze het zelf niet zo subtiel heeft gevraagd. Ook wel een beetje als verontschuldiging: we hebben vijf jaar lang plezier gehad aan haar pijn.
Aan Kaat, die maar ongeveer de helft van het maken van deze thesis heeft meegemaakt, maar wel het grootste deel van het gevloek heeft opgevangen. Zij zorgde voor vele ‘topdagen’.
Aan Rowie, mijn aantal leuke herinneringen aan jou overtreft het aantal pinten dat we samen uitdronken veruit. Ik denk dat het zo duidelijk genoeg is.
Aan het USO, ik hoop dat ik een fractie heb teruggegeven van dat wat jullie mij gegeven hebben. Jullie hebben mij leren luisteren: naar Mahler, naar stemming232323Wat is dat moeilijk! en toonafstanden, maar vooral naar anderen.
En omdat ik toch ergens een krachtige one-liner moet hebben staan:
‘…but criticizing […] is not the job. The job is to find the theory that’s right.’
L. Smolin, The Trouble With Physics
\appendixpage\openany
Appendix A Wavefunctions of Time Dependent Harmonic Oscillators
In this appendix we will construct the wavefunctions for time dependent harmonic oscillators. We prefer using the method of Lewis-Riesenfeldt invariants as it was originally written down in [32]. The flaw of this method is that it will only allow us to construct normalisable wavefunctions. For inverted harmonic oscillator systems there clearly are unbound states with non-normalisable wavefunctions: these cannot be constructed in this way and we will merely quote the more general result from [41].
The type of Hamiltonian that we will discuss is given by
[TABLE]
where the frequency is allowed to depend on time. We do not restrict to real frequencies, thus allowing to be negative. The classical equation of motion for this Hamiltonian is
[TABLE]
where apostrophes indicate derivatives with respect to time. In classical physics, one would look for a constant of motion and then do a canonical transformation to find the solutions. Quantum mechanically, we look for an invariant , meaning that . We propose, for an arbitrary solution to the classical equation of motion,
[TABLE]
Using Heisenbergs equations of motion for and one can readily check that is indeed an invariant of the Hamiltonian. This means that the eigenvalues of will be independent of time. We are now free to impose a Wronskian constraint on the functions and their conjugates, the equivalent of eq. (77).
[TABLE]
We now introduce operators and , defined as
[TABLE]
One can check that these operators satisfy the following relations
[TABLE]
Thus and respectively annihilate and create quanta of . The operator is for the time dependent system what the number operator is for the time independent system.
We will now limit ourselves to the normalisable eigenstates of the operator . When all the eigenstates are normalisable. When it is clear that the energy is not bounded from below and the Schrödinger equation allows unbound states with non-normalisable wavefunctions.
Diracs algebraic treatment242424Note that this treatment doesn’t work for non-normalisable states. of the harmonic oscillator can now be used for the normalised eigenstates of , and we find that these eigenstates of the operator are the non-negative integers . There thus exists a state which gets annihilated by ; this gives us the differential equation
[TABLE]
which can be solved by guessing a Gaussian form. The result is, up to some time dependent normalisation factor
[TABLE]
Anticipating the results for the standard harmonic oscillator, we expect that Hermite polynomials will play a role in constructing the eigenfunctions for higher eigenvalues of . Notice that the groundstate already incorporates the Hermite polynomial of degree 0. The recursion relation for Hermite polynomials is
[TABLE]
Using this relation and normalising, one can check that the wavefunctions of the states are given by
[TABLE]
where we have reinstated for future reference. For a standard oscillator, these states are also energy eigenstates: it is the factor that gives rise to the Schrödinger time evolution, . For completeness sake, the most general solution to the Schrödinger equation for Hamiltonian (343)is, indexed by a real [41]
[TABLE]
These are eigenfunctions of with eigenvalue . For each there are two linearly independent . They are the odd and even parabolic cylinder functions. For , there is a linear combination of the even and odd solution that is normalisable. This combination then reduces to eq. (353).
Appendix B An Alternative Action
Eq. (73) is where we started the quantum description of cosmological perturbations. There is however an alternative approach commonly used by several authors. In this approach, the action of the Mukhanov-Sasaki variable is derived by considering a minimally coupled real massless scalar field on a flat FLRW background. The action is, in terms of the conformal time
[TABLE]
And the momentum conjugate to is . We will write the dynamics in terms of a new variable . The conjugate momentum is then . Immediately moving into momentum space, we write the action in terms of the Fourier modes [36]
[TABLE]
where the -integral is restricted, since we are dealing with a real field and thus . We see that . The Hamiltonian for a single Fourier mode then reads
[TABLE]
This is the Hamiltonian252525Notice that this Hamiltonian couples modes with wavevector and . This interaction originates in the reality of the field and is similar to the fact that we restrict the integral in eq. 357. that is used to quantise the theory by some authors; [44], [27] and [2] among others. The Lagrangian density in eq. (356) differs from the one in eq. (67) by a total derivative. They are thus equivalent on the classical level and we assume that they also are equivalent in their quantised forms.
As before we will write the theory in terms of annihilation and creation operators. These relations still hold
[TABLE]
if we enforce the Wronskian condition
[TABLE]
The main difference is that the function is now
[TABLE]
In terms of and , the Hamiltonian becomes
[TABLE]
We get the same interpretation as before, when is big, this is essentially the Hamiltonian of a harmonic oscillator. But it is in the last two terms of this Hamiltonian that we get a more intuitive picture of what happens when is small. When is large compared to , particle number eigenstates 262626Note that the states are not the same anymore, since is dependent on the Hamiltonian used. are no longer energy eigenstates and the time evolution induces pair creation. 272727This is in fact very reminiscent of the Schwinger effect, as shown in [37]. In that case, the energy recquired for pair creation is supplied by an external electric field, while the source in this case is the gravitational field.
This could have been guessed, since for this action
[TABLE]
and the operator is no longer an invariant. This means that particle number is not conserved by time evolution in this case. This is the most important reason why we chose to work with Hamiltonian in eq. (73), since for that action the particle number is a good quantum number. This has the disadvantage that the interpretation of the regime after Hubble exit is severely harder.
Appendix C Properties of the Hermite polynomials
The Hermite polynomials , can be recursively defined by
[TABLE]
They satisfy the following differential equation
[TABLE]
The Hermite functions are determined by
[TABLE]
And they form a complete orthonormal basis of the Hilbert space of square integrable functions. The Hermite polynomials have a generating function, meaning that
[TABLE]
Appendix D Properties of
We will use this appendix to collect the properties of , as discussed in the text. Some properties that may not be found explicitly in the literature, but that prove useful in the text (particularly section 30) are also presented. The basic properties of are
[TABLE]
Theorem D.1**.**
If is a positive increasing function on that is unbounded, then is an unbounded, increasing function on .
Proof: We first prove that is increasing on , by proving that is positive on . Note that
[TABLE]
due to the initial conditions of . In addition the equations 369 imply that
[TABLE]
Filling in for the initial conditions, we notice . Thus is positive on , since is increasing. We thus conclude that is increasing on . Since , it follows that is positive on . In addition, it follows that , for all .
To prove that is unbounded, it is enough to observe that is unbounded on . Since is unbounded, and , it follows that is unbounded.
Theorem D.2**.**
*We write . Then exists and is finite if grows faster than . *
Proof: Notice that the following equalities hold
[TABLE]
Note that this is integrable when grows faster than close to [math]. This is the case for every model of inflation studied in this thesis.
Appendix E Generalisation of Theorem 8.2 to Non-Product States
We will generalise theorem 8.2 in chapter 8 to non-product states.
Theorem E.1**.**
Suppose the system is in an initial state
[TABLE]
Define
[TABLE]
and
[TABLE]
Finally, define
[TABLE]
Let be a solution to the guidance equation for a certain mode of wavevector , on the interval . Suppose
[TABLE]
is bounded for all and . Then
[TABLE]
*exists and is finite. *
Proof: For clearness and ease of notation, we will only prove the case for entanglement between the real parts of two modes, as the case for entanglements of more states is completely analogous. Thus we suppose
[TABLE]
where . Proving the theorem for the non-entangled degrees of freedom is analogous to the proof of theorem 8.2. For the real parts of the modes of wavevector and the guidance equation for becomes
[TABLE]
and analogous for . This becomes in terms of the and the
[TABLE]
and analogous for . Now the imaginary part of the quantity in square brackets is exactly and is bounded by assumption. The proof can continue in the same way as the proof of theorem 8.1 in chapter 8.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] European Space Agency “Planck: the Scientific Programme”, 2009
- 2[2] A. Albrecht, P. Ferreira, M. Joyce and T. Prokopec “Inflation and squeezed quantum states” In Phys.Rev. D 50 , 1994, pp. 4807–4820 DOI: 10.1103/Phys Rev D.50.4807 · doi ↗
- 3[3] V. Allori “Decoherence and the Classical Limit of Quantum Mechanics”, 2001
- 4[4] V. Allori, D. Dürr, S. Goldstein and N. Zanghí “Seven Steps to The Classical World” In ar Xiv: quant-ph/0112005 , 2001
- 5[5] A.O. Barvinsky, A.. Kamenshchik, C. Kiefer and I.V. Mishakov “Decoherence in quantum cosmology at the onset of inflation” In Nucl.Phys. B 551 , 1999, pp. 374–396 DOI: 10.1016/S 0550-3213(99)00208-4 · doi ↗
- 6[6] E. Bertschinger “Cosmological perturbation theory and structure formation”, 2001 ar Xiv: astro-ph/0101009 [astro-ph]
- 7[7] R.H. Brandenberger “Inflation and the theory of cosmological perturbations”, 1997 ar Xiv: astro-ph/9711106 [astro-ph]
- 8[8] R.H. Brandenberger “Lectures on the theory of cosmological perturbations” In Lect.Notes Phys. 646 , 2004, pp. 127–167 ar Xiv: hep-th/0306071 [hep-th]
