Quantum Theory of the Universe Based on Bayesian Probability
Matsuo Sato

TL;DR
This paper develops a quantum universe model using Bayesian probability, allowing for the calculation of universe probabilities from a single occurrence rather than repeated experiments.
Contribution
It introduces a Bayesian probability framework into quantum cosmology, enabling probability definitions for unique universe events.
Findings
Derived explicit probability of the universe in a scalar field model.
Demonstrated the application of Bayesian quantum theory to cosmological models.
Abstract
We formulate a quantum theory of the Universe based on Bayesian probability. In this theory, the probability of the Universe is not a frequency probability, which can be obtained by observing experimental results several times, but is a Bayesian probability, which can define a probability of an event that occurs just once. As an example, by applying the quantum theory of the Universe to an action of a scalar field theory in the four dimensions as a toy model for the theory of the Universe, we explicitly obtain the probability of the Universe and the action of matters in the Universe.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProbability and Statistical Research · Statistical Mechanics and Entropy · Bayesian Modeling and Causal Inference
Quantum Theory of the Universe Based on Bayesian Probability
Matsuo Sato111 e-mail address : [email protected]
*Department of Natural Science, Faculty of Education, Hirosaki University
Bunkyo-cho 1, Hirosaki, Aomori 036-8560, Japan*
We formulate a quantum theory of the Universe based on Bayesian probability. In this theory, the probability of the Universe is not a frequency probability, which can be obtained by observing experimental results several times, but is a Bayesian probability, which can define a probability of an event that occurs just once. As an example, by applying the quantum theory of the Universe to an action of a scalar field theory in the four dimensions as a toy model for the theory of the Universe, we explicitly obtain the probability of the Universe and the action of matters in the Universe.
1 Introduction
The quantum theory of matters is defined by a path integral with an action , which represents the total probability of a canonical distribution . That is, the theory is defined by a standard framework of probability theory, so called frequency probability, which is defined based on populations, because experiments on matters can be made several times in laboratory systems. On the other hand, the time evolution of the Universe cannot repeat or cannot be made experiments on. Thus, we cannot define a quantum theory of the Universe by using frequency probability. Therefore, we propose to define a quantum theory of the Universe based on Bayesian probability,111Bayesian probability is a base of the machine learning recently developed extensively. See [1, 2, 3] as examples of text books for Bayesian probability. which can give a probability of an event that occurs just once.222For example, Bayesian probability gives a probability of a state of genes in a family tree, from an information of diseases in it. A state of genes in a family tree occurs just once.
Bayes’ theorem states that , where . is a probability that an event occurs, and is a conditional probability that an event occurs under the condition that an event occurs. Bayes’ theorem can be proven easily in the framework of the frequency probability as follows. In this framework, all the probabilities are defined by populations and , where is a probability that events and occur. Actually, by using this, we can prove it as . On the other hand, in the framework of Bayesian probability, the definitions are different: so-called posterior distribution , which is an analogue of the conditional probability in the frequency probability, is defined by the right-hand side of , where and are called prior distribution and statistical model,333 is automatically normalized as by definition in the frequency probability. Thus, the statistical model must be normalized as well. respectively, defined without a population. That is, the posterior distribution is defined even if a population is not defined or we cannot observe the conditional probability that the event occurs under the condition that the event occurs.
A summary of the application of the Bayesian probability to the Universe is as follows. is a background that represents an Universe, whereas is a fluctuation that represents matters. For example, in a theory of gravity, represents a space-time itself and a vacuum expectation value of matters, whereas represents a graviton and matters. An action of a theory of the Universe gives a statistical model representing a probability that around occurs. The posterior probability represents a probability of the Universe when fluctuations occur. This is defined although it cannot be observed directly. In section 2, we apply the Bayesian probability to the Universe precisely and formulate a quantum theory of the Universe. In section 3, we explicitly obtain the probability of the Universe and the action of matters in the Universe in the case that we treat an action of a scalar field theory in the four dimensions as a toy model of a theory of the Universe.
2 Formalism
A prior distribution is given in general as , where is a background representing an Universe and is an arbitrary function of that is normalized as . A probability of the Universe is not the prior distribution itself but is obtained from the posterior distribution. Moreover, we will see that the probability of the Universe does not depend on the prior distribution later. After all, we can set the prior distribution 1 ().
By the quantum theory of matters, a probability of a correlation of N points () in an experiment in a background is given by
[TABLE]
where is given by expanding the action of a theory of the Universe with respect to around , and . This means that a probability that a path in a path integral occurs, is given by . Thus, we define a statistical model where a single fluctuation around a fixed background occurs as , where .
Next, we generalize the statistical model to a model that has multiple fluctuations. In another experiment, a probability of a correlation of points () in the same background , is given by . Then, a probability that we have correlation of points in an experiment and a correlation of points in another experiment is given by the product
[TABLE]
This means that a probability that a path in an experiment and another path in another experiment occurs, is given by . Thus, we define a statistical model where two independent fluctuations around a fixed background occur as . In the same way, we define a statistical model where independent fluctuations around a fixed background occur as
A posterior distribution that a Universe occurs under the condition that independent matters occur, is defined as
[TABLE]
where .
We are going to obtain a probability of the Universe by taking large expansion of the posterior distribution. Matters are subject to the same probability, called true probability , where matters occur in the Universe. That is, is the action of matters in the Universe. The normalization is given by . True probability will be determined at last.
We define an effective action as . We name that gives the minimum of , . Then, satisfies
[TABLE]
In order to take large expansion of the posterior distribution, we define a deformed action whose effective action has minimum value 0, as Actually, and the effective action satisfies
[TABLE]
because . In addition, the posterior distribution defined by the deformed action coincides with that defined by the original action: where
[TABLE]
Because the expectation value of in (2.7) is given by , we define its fluctuation as
[TABLE]
converges to a Gaussian process in by the central limit theorem with respect to the random valuables . The expectation value and the correlation functions are given by and
[TABLE]
We also have
[TABLE]
We abbreviate to in the following. The partition function can be written in terms of and by definition of : .
We divide the partition function to the main part
[TABLE]
and the remaining part
[TABLE]
[TABLE]
where satisfies
[TABLE]
because
[TABLE]
where we have used the inequality of additive geometric mean,
[TABLE]
We evaluate the main part. In the following, we treat only the case that is unique and is regular, namely its eigenvalues are all positive. In the general case, we are working on evaluating the main part in progress [4]. From (2.5), (2.6), (2.10), and the mean value theorem, there exist and in and such that
[TABLE]
where . Because the integration region of the main part is
[TABLE]
and (2.14), we have and . Thus, the integrand of the main part is given by
[TABLE]
which is Gaussian with a dispersion , where .
From (2.18) and (2.20) and the regularity of , the integration region is . From (2.15), we have , and then . Thus, the integration region covers the center plus and minus the deviation , where are eigenvalues of . Therefore,
[TABLE]
and then,
[TABLE]
Thus, large expansion of the posterior distribution, namely the probability of the Universe is given by
[TABLE]
where
[TABLE]
We obtain the expectation value of the statistical model of matters in a Universe with respect to the probability of the Universe (2.24),
[TABLE]
where we have diagonalized by changing the basis of .
In order to obtain a finite and non-trivial limit of the large expansion of (2), we take a double scaling limit with fixed. is realized by making parameters () approach to their critical values simultaneously with . is determined by
[TABLE]
By Taylor expanding around , the integrand of the partition function becomes
[TABLE]
where , especially, , and . From this formula, one can see that the partition function and posterior distribution depend on only through the product , perturbatively. Thus, the large corrections from the main part of the partition function to (2) come from expansions with respect to and only. By fixing in the double scaling limit, these corrections vanish, and the corrections from the remaining part of the partition function also vanish because the corrections’ dependence on is at most . We make approach to canonically as
[TABLE]
where and are positive parameters. Because , is fixed if and only if . Therefore, we obtain a finite and non-trivial limit,
[TABLE]
where represents that is substituted.
The random valuables are not singular in , because do not depend on . Thus, the central limit theorem states that converges to a Gaussian process in the double scaling limit by definition of (2.8). In the same way, its differential also converges to a Gaussian process in this limit. Its expectation value is given by whereas its correlation function is given by
[TABLE]
Its dispersion is given by and especially, at ,
[TABLE]
Thus, is subject to the normal distribution , that is, the probability is given by . The expectation value with respect to this probability is given by
[TABLE]
This is identified with the true probability444 is automatically normalized: . : 555In practical Bayesian statistics, the right-hand side is nearer left-hand side in a better statistical model. In a quantum theory of the Universe, the right-hand side should equal the left-hand side and the equation should be treated to determine the true probability, because the statistical model is obtained from the action of a theory of the Universe, that is, the model must be the best.. is determined by solving this equation because (2) is a functional of .
3 Example
In this section, by treating the action of an Euclidean scalar field theory in the four dimensions as a toy model of the action of the theory of the Universe, we explicitly obtain the probability of the Universe and the true probability. In the following, we assume that and are constant and the fluctuations are up to the fourth order.
The action is expanded as . The true probability is generally written as , where . The normalization determines . In the following, we evaluate path-integrals up to one-loop by using scheme666In the string geometry theory [5, 6], a perturbative string theory is obtained by expanding fluctuations up to the second order. In this case, we only consider the fluctuations up to the second order, and then the path-integral is exact.. We have . The effective action is given by
[TABLE]
(2.4) is equivalent to (i) and or
[TABLE]
(i) or (ii) determines . and are given by
[TABLE]
The equation , which determines , is given up to the fourth order by
[TABLE]
where represents that determined by (2.27) with (3.3) is substituted to in , , and its derivatives. These equations determine , , and .
We solve explicitly in case of . We only solve in case of (i) because there is no analytic solution in case of (ii). The solution to (i) is and . Then, we obtain , , and . The solution to (2.27) with (3.3) is . The solution to (3.5) is , , and . When is the critical value , , , , and . is positive because is negative. We have obtained a stable theory of matters with from the theory of the Universe with because approaches to as in (2.29). One can confirm that has the minimum at by looking up globally.
As a result, the probability of the Universe is (2.24) with . The true probability, namely the probability of matters in the Universe, is , where . That is, the action of matters in the Universe is determined to be , where .
4 Conclusion and Discussion
In this paper, we formulated a quantum theory of the Universe based on Bayesian probability. In this theory, the probability of the Universe is not a frequency probability, which can be obtained by observing experimental results several times, but is a Bayesian probability, which can define a probability of an event that occurs just once. As an example, by applying the quantum theory of the Universe to an action of a scalar field theory in the four dimensions as a toy model for the theory of the Universe, we explicitly obtained the probability of the Universe and the action of matters in the Universe.
As an application, we will study to what extent the Universe is reproduced by applying the quantum theory of the Universe to the action of the Einstein gravity coupled with matters in the standard model, although it is not UV complete. Furthermore, it is necessary to apply the quantum theory of the Universe to the actions of candidates for the UV complete theory, such as matrix models, string field theories, loop quantum gravities, and string geometry theories777Especially, string geometry theory around a string geometry background reproduces a perturbative string theory on a string background [5, 6]. Thus, it is important to apply the quantum theory of the Universe to the action of the string geometry theory and examine whether the action of the perturbative string theory in the Universe is uniquely determined and the Universe is reproduced including its origin. Moreover, it is expected that string geometry theory is finite as a theory of the Universe because we only need to perform a path-integral over a very limited region as in (2.20). to investigate the origin of the Universe.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] C.M.Bishop, “Pattern Recognition and Machine Learning,” Springer (2010)
- 2[2] A.Gelman, J.B.Carlin, H.S.Stern, D.B.Dunson,A.Vehtari,D.B.Rubin, “Bayesian Data Analysis,” Chapman and Hall/CRC (2013)
- 3[3] S.Watanabe, “Algebraic Geometry and Statistical Learning Theory,” Cambridge University Press(2009)
- 4[4] M.Sato, work in progress.
- 5[5] M.Sato, “String Geometry and Non-perturbative Formulation of String Theory,” ar Xiv:1709.03506 [hep-th]
- 6[6] M.Sato, Y.Sugimoto, “Topological String Geometry,” ar Xiv:1903.05775 [hep-th]
