Born's Rule from Quantum Frequentism

Lionel Brits

arXiv:1903.12027·quant-ph·August 19, 2025

Born's Rule from Quantum Frequentism

Lionel Brits

PDF

Open Access

TL;DR

This paper derives a generalized form of Born's rule applicable to arbitrary factorizable states, demonstrating that unitary evolution can be consistent with observed quantum probabilities without requiring non-unitary collapse.

Contribution

It introduces a generalized Born's rule for factorizable states and proves its consistency with unitary evolution, extending previous results limited to special cases.

Findings

01

No histories violate Born's rule in the generalized framework

02

Purely unitary evolution can produce observed quantum probabilities

03

Provides a new single-shot fidelity benchmarking method

Abstract

Quantum theory has evolved from a set of provisional rules to an indispensable framework that underlies much of modern technology and infrastructure. Yet, after a century, Born's probability postulate remains at odds with the theory's unitary character. The problem stems from the linearity of the Schr\"odinger equation, as linear systems are insensitive to the magnitudes of their solutions' coefficients. If measurement is unitary, and thus linear, how can the frequency of an outcome depend on the magnitude of its amplitude? And if not, at what scale does unitarity break down? This question remains pressing, as the assumption of unitarity underlies both the design of large-scale fault-tolerant quantum devices, as well as our understanding of fundamental aspects of our universe, for example, the black hole information problem. Proponents of the many-worlds interpretation have argued…

Equations50

∣ Ψ_{N} ⟩

∣ Ψ_{N} ⟩

∣ ⟨ n ∣ Ψ_{N} ⟩ ∣

∣ ⟨ n ∣ Ψ_{N} ⟩ ∣

∣ Ψ_{N} ⟩ = x_{1} x_{2} \dots x_{N} \sum c_{x_{1} x_{2} \dots x_{N}} ∣ x_{1} ⟩ ∣ x_{2} ⟩ \dots ∣ x_{N} ⟩,

∣ Ψ_{N} ⟩ = x_{1} x_{2} \dots x_{N} \sum c_{x_{1} x_{2} \dots x_{N}} ∣ x_{1} ⟩ ∣ x_{2} ⟩ \dots ∣ x_{N} ⟩,

∥ ∣ Ψ_{N} ⟩_{M} ∥^{2} = ∣ \overset{x}{ˉ} - ∣ α ∣^{2} ∣ > ϵ \sum ∣ c_{x_{1} x_{2} \dots x_{N}} ∣^{2} .

∥ ∣ Ψ_{N} ⟩_{M} ∥^{2} = ∣ \overset{x}{ˉ} - ∣ α ∣^{2} ∣ > ϵ \sum ∣ c_{x_{1} x_{2} \dots x_{N}} ∣^{2} .

∥ ∣ Ψ_{N} ⟩_{M} ∥^{2} \leq 2 e^{- 2 ϵ^{2} N} .

∥ ∣ Ψ_{N} ⟩_{M} ∥^{2} \leq 2 e^{- 2 ϵ^{2} N} .

∣ ⟨ a ∣ ∣ b ⟩ ∣ \leq ⟨ a ∣ ∣ a ⟩ ⟨ b ∣ ∣ b ⟩, \forall ∣ a ⟩, ∣ b ⟩ \in H,

∣ ⟨ a ∣ ∣ b ⟩ ∣ \leq ⟨ a ∣ ∣ a ⟩ ⟨ b ∣ ∣ b ⟩, \forall ∣ a ⟩, ∣ b ⟩ \in H,

\ket{\Omega}=...\ket{\varphi_{-2}}\ket{\varphi_{-1}}\,\Big{(}\ket{\psi}\ket{\psi}...\ket{\psi}\Big{)}\,\ket{\varphi_{1}}\ket{\varphi_{2}}...,

\ket{\Omega}=...\ket{\varphi_{-2}}\ket{\varphi_{-1}}\,\Big{(}\ket{\psi}\ket{\psi}...\ket{\psi}\Big{)}\,\ket{\varphi_{1}}\ket{\varphi_{2}}...,

∣ Ω_{N} ⟩ = ∣ ϕ_{1} ⟩ ∣ ϕ_{2} ⟩ \dots ∣ ϕ_{N} ⟩ .

∣ Ω_{N} ⟩ = ∣ ϕ_{1} ⟩ ∣ ϕ_{2} ⟩ \dots ∣ ϕ_{N} ⟩ .

∣ Ω_{N} ⟩ = x_{1} x_{2} \dots x_{N} \sum c_{x_{1} x_{2} \dots x_{N}} ∣ x_{1} ⟩ ∣ x_{2} ⟩ \dots ∣ x_{N} ⟩ .

∣ Ω_{N} ⟩ = x_{1} x_{2} \dots x_{N} \sum c_{x_{1} x_{2} \dots x_{N}} ∣ x_{1} ⟩ ∣ x_{2} ⟩ \dots ∣ x_{N} ⟩ .

- \frac{1}{N} lo g_{2} p (x_{1}, x_{2}, \dots, x_{N}) - H (x) \leq ϵ,

- \frac{1}{N} lo g_{2} p (x_{1}, x_{2}, \dots, x_{N}) - H (x) \leq ϵ,

H (x) = \frac{1}{N} E [- lo g p (x_{1}, x_{2}, \dots, x_{N})],

H (x) = \frac{1}{N} E [- lo g p (x_{1}, x_{2}, \dots, x_{N})],

N \to \infty lim Pr [- \frac{1}{N} lo g_{2} p (x_{1}, x_{2}, \dots, x_{N}) - H (x) > ϵ] = 0.

N \to \infty lim Pr [- \frac{1}{N} lo g_{2} p (x_{1}, x_{2}, \dots, x_{N}) - H (x) > ϵ] = 0.

i \sum lo g_{2} ∣ c_{x_{i}} ∣^{2} - N k \sum ∣ c_{k} ∣^{2} lo g_{2} ∣ c_{k} ∣^{2},

i \sum lo g_{2} ∣ c_{x_{i}} ∣^{2} - N k \sum ∣ c_{k} ∣^{2} lo g_{2} ∣ c_{k} ∣^{2},

\hat{P}_{∣ Y - μ_{Y} ∣ > ϵ} ∣ Ψ ⟩^{2} \leq \frac{Var ( Y )}{ϵ ^{2}},

\hat{P}_{∣ Y - μ_{Y} ∣ > ϵ} ∣ Ψ ⟩^{2} \leq \frac{Var ( Y )}{ϵ ^{2}},

\hat{P}_{- \frac{1}{N} l o g ∣ c_{x_{1} \dots x_{N}} ∣^{2} - H (x) > ϵ} ∣ Ω_{N} ⟩^{2}

\hat{P}_{- \frac{1}{N} l o g ∣ c_{x_{1} \dots x_{N}} ∣^{2} - H (x) > ϵ} ∣ Ω_{N} ⟩^{2}

\hat{P}_{- \frac{1}{N} l o g ∣ c_{x_{1} \dots x_{N}} ∣^{2} - H (x) > ϵ} ∣ Ω_{N} ⟩^{2}

\hat{P}_{- \frac{1}{N} l o g ∣ c_{x_{1} \dots x_{N}} ∣^{2} - H (x) > ϵ} ∣ Ω_{N} ⟩^{2}

\hat{P}_{- \frac{1}{N} l o g ∣ c_{x_{1} \dots x_{N}} ∣^{2} - H (x) > ϵ} ∣ Ω_{N} ⟩^{2}

\hat{P}_{- \frac{1}{N} l o g ∣ c_{x_{1} \dots x_{N}} ∣^{2} - H (x) > ϵ} ∣ Ω_{N} ⟩^{2}

\hat{P}_{f > a} ∣ Ψ ⟩^{2} \leq \frac{1}{a} ⟨ Ψ ∣ \hat{f} ∣ Ψ ⟩ .

\hat{P}_{f > a} ∣ Ψ ⟩^{2} \leq \frac{1}{a} ⟨ Ψ ∣ \hat{f} ∣ Ψ ⟩ .

∣ Ψ ⟩ = x_{1} x_{2} \dots x_{N} \sum Ψ_{x_{1} x_{2} \dots x_{N}} ∣ x_{1} ⟩ ∣ x_{2} ⟩ \dots ∣ x_{N} ⟩,

∣ Ψ ⟩ = x_{1} x_{2} \dots x_{N} \sum Ψ_{x_{1} x_{2} \dots x_{N}} ∣ x_{1} ⟩ ∣ x_{2} ⟩ \dots ∣ x_{N} ⟩,

\hat{f} ∣ x_{1} ⟩ ∣ x_{2} ⟩ \dots ∣ x_{N} ⟩ = f (x_{1}, x_{2}, \dots, x_{N}) ∣ x_{1} ⟩ ∣ x_{2} ⟩ \dots ∣ x_{N} ⟩ .

\hat{f} ∣ x_{1} ⟩ ∣ x_{2} ⟩ \dots ∣ x_{N} ⟩ = f (x_{1}, x_{2}, \dots, x_{N}) ∣ x_{1} ⟩ ∣ x_{2} ⟩ \dots ∣ x_{N} ⟩ .

⟨ Ψ ∣ \hat{f} ∣ Ψ ⟩

⟨ Ψ ∣ \hat{f} ∣ Ψ ⟩

\geq ⟨ Ψ ∣ \hat{P}_{f > a} \hat{f} \hat{P}_{f > a} ∣ Ψ ⟩,

\geq a ⟨ Ψ ∣ \hat{P}_{f > a} \hat{P}_{f > a} ∣ Ψ ⟩ .

\hat{P}_{f > a} ∣ Ψ ⟩^{2} \leq \frac{1}{a} ⟨ Ψ ∣ \hat{f} ∣ Ψ ⟩ .

\hat{P}_{f > a} ∣ Ψ ⟩^{2} \leq \frac{1}{a} ⟨ Ψ ∣ \hat{f} ∣ Ψ ⟩ .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsQuantum Mechanics and Applications · Quantum Information and Cryptography · Statistical Mechanics and Entropy

Full text

Less is More: Born’s Rule from Quantum Frequentism

Lionel Brits

[email protected]

(February 27, 2024)

Abstract

Accounting for Born’s rule from first principles remains an open problem among the foundational issues in quantum mechanics. Proponents of the many-worlds interpretation have argued that Born’s rule is observed simply because those histories that violate the rule have vanishing norms, and so must be unphysical. This argument has only been made explicit for contrived situations involving measurements on infinitely many identically-prepared systems. We prove a more general result, namely that for systems containing infinitely many degrees of freedom in arbitrarily-prepared states the universal wavefunction contains no histories that violate Born’s rule.

I Introduction

Despite the predictive success of the Copenhagen formulation of quantum mechanics it cannot be considered a complete description of nature as it relies on a distinguished observer in order to make sense of the measurement process. This can be seen either as an inconsistency – that physical interactions are unitary or non-unitary depending on whether they are observed Wigner (1961) – or merely a nuisance – that only a single observer is actually necessary to bootstrap the measurement process, one that can be pushed to the very edges of a system and then be forgotten von Neumann and Beyer (1955). This issue has, in part, lead to the development of the many worlds interpretation (MWI) Everett (1956); Dewitt and Graham (1973), which aims to explain the role of the observer within the unitary framework of the theory itself. In this interpretation, the apparent collapse of the wavefunction is understood as a loss of coherence between environmental states corresponding to different measurement outcomes, so that local degrees of freedom seem to evolve irreversibly. However, the MWI suffers from its own minimalism – having thrown out everything that is discontinuous and non-unitary, it has yet to give a wholly satisfactory explanation for the origin of probability, and in particular, Born’s rule. This paper aims to shed light on the problem, while keeping as much as possible of the spirit of the MWI intact.

I.1 The Problem with (Too) Many Worlds

For definiteness, we will summarize the key ideas behind the MWI Everett (1956); Dewitt and Graham (1973). According to this interpretation, the state of any isolated system is at all times described by a state vector $\ket{\Psi}$ evolving unitarily according to the Schrödinger equation. In particular, the MWI deduces that if the universe is an isolated system, then it too must be described in this way. The key strength of the MWI is that it explains quite clearly how the stochastic nature of quantum mechanics comes about. Suppose that we divide our universe into a microscopic system $S$ being measured and an apparatus $A$ that is performing the measurement (the observer may be considered as part of the apparatus subsystem). Then a factorizable state $\ket{S_{0}}\otimes\ket{A_{0}}$ of the composite system will evolve into a superposition $\sum_{i}c_{i}\ket{S_{i}}\otimes\ket{A_{i}}$ . One may say that the universe in which the system and apparatus was in state $\ket{S_{0}}\otimes\ket{A_{0}}$ has branched into a (possibly dense) set of universes each in the definite state $\ket{S_{i}}\otimes\ket{A_{i}}$ . Within each universe, the observer sees a different measurement outcome (represented by $A_{i}$ ), despite having identical initial conditions, and since any particular observer has no way of knowing which branch they will find themselves in, their measurement outcome is completely unpredictable. We clarify that the universes described so far are merely arbitrary orthogonal decompositions of the universal wavefunction, and it is the task of the decoherence program to find among these decompositions emergent classical realities Zurek (1982); Joos and Zeh (1985), which we will not do here.

Despite the appeal of dealing with the observer as an integral part of the system, the MWI has been criticized for failing to account for the appearance of Born’s rule in the measurement process. Since linearity puts every universe on an equal footing, it would suggest that they are all equally likely, including so-called maverick worlds in which Born’s rule is grossly violated. This is of course in contrast to what we actually observe, i.e., that likelihoods are proportional to the absolute squares of the coefficients $c_{i}$ .

To see how this comes about, let us construct a minimal model of measurement in which Born’s rule can be verified in a completely transparent way. We imagine that a number of spin- $\tfrac{1}{2}$ particles are prepared in identical states equal to $\ket{\psi}=\alpha\ket{\downarrow}+\beta\ket{\uparrow}=\alpha\ket{1}+\beta\ket{0}$ after which the number of particles in one of the states, $\ket{\downarrow}$ say, is recorded by a particle counter. This may be represented by a quantum circuit that counts the number of 1s in a set of identically prepared qubits (i.e., its Hamming weight) and records the result in a binary register consisting of some other previously initialized qubits. For the sake of clarity we will construct it from a sequence of unoptimized controlled- $[\mathrm{ADD}\,1]$ gates. This gate increments the target register if the control qubit is set to $1$ , and leaves it unaltered otherwise. The full circuit is shown in Fig. 1. An implementation for $N=3$ is also reproduced in Fig. 2.

From this simple model we see that the act of registering the frequency of certain events can be done in a manifestly unitary way. The circuit therefore exemplifies a sort of robotic “Wigners’s friend” Everett (1956); Wigner (1961), one in which no proposed mechanisms for collapse, such as consciousness, can hide. Consequently, at the end of the registration process the circuit remains in a superposition of states in which every possible value of $n$ has been registered. Except in the case in which $|\alpha|=|\beta|$ , the majority of these states correspond to frequencies that violate Born’s rule, which is at odds with what we actually observe, i.e., that $n/N\approx|\alpha|^{2}$ . At this point the role of the experimenter seems indispensable for obtaining the correct result, but this can only be true if he or she is to be endowed with some non-unitary quality Everett (1956). While this possibility has yet to be ruled out experimentally, recent demonstrations of quantum superposition on both mesoscopic and macroscopic O’Connell et al. (2010); Fein et al. (2019) scales make this argument hard to accept. The alternative conclusion, originally put forth by Everett Everett (1956) and others, is that maverick worlds are nonphysical, since they are assumed to have vanishing norm. Perhaps the most promising result is that of Graham and DeWitt Dewitt and Graham (1973), and independently, Hartle Hartle (1968), who showed that when measurements are performed on an ensemble of $N$ identically prepared systems, histories that deviate from Born’s rule have zero norm in the limit that the $N$ is taken to infinity. In the next section we will review this result before moving on to its generalization.

I.2 Quantum Frequentism

Prior to measurement, our circuit may be considered to be in the state $\left|\Psi_{N}\right\rangle=\left(\alpha\left|1\right\rangle+\beta\left|0\right\rangle\right)^{N}\left|0\right\rangle$ , with the rightmost $\ket{0}$ representing the state of the counter. After $N$ measurements the system will be in the state

[TABLE]

where $\{\left|s_{n,h}\right\rangle\}$ is the set of subsequent $N$ -particle states corresponding to the ${N\choose n}$ initial states which contain $n$ downward spins. Making use of the fact that the states $\left|s_{n,h}\right\rangle$ are orthogonal, one finds,

[TABLE]

We recognize the term $|\alpha|^{2n}|\beta|^{2(N-n)}{N\choose n}$ as the binomial distribution for the number of successes in a sequence of $N$ independent Bernoulli trials, each with success probability $p=|\alpha|^{2}$ . This function has relative width $\frac{\sigma}{N}=\sqrt{\frac{p\,(1-p)}{N}}$ and mean $\mu=|\alpha|^{2}N$ , so that after a large number of measurements, the state of the system will be narrowly peaked around the expected frequency $|\alpha|^{2}$ , decaying rapidly to zero elsewhere. Since the counter has so far only served a bookkeeping purpose, organizing the initial multi-particle state $\ket{\Psi_{N}}=\left(\alpha\left|1\right\rangle+\beta\left|0\right\rangle\right)^{N}\ket{0}$ into a superposition of states with definite numbers of downward spins, we shall suppress the counter and rewrite the final state state in terms of the computational basis, i.e.

[TABLE]

where $x_{i}\in\{0,1\}$ and $c_{x_{1}x_{2}\dots x_{N}}=\alpha^{n}\beta^{N-n}$ with $n=\sum x_{i}$ . Each binary string $(x_{1},x_{2},\dots,x_{N})$ and its associated counter value $n$ then defines a world in which a particular set of outcomes will be measured. Note that as we let $N\to\infty$ , the vector space $\mathcal{H}=\mathbb{C}^{2^{N}}$ spanned by the basis vectors $\ket{x_{1}}\ket{x_{2}}\dots$ becomes non-separable, so that care must be taken to maintain a well-behaved inner product structure von Neumann (1939). Let us define a maverick world to be one in which the empirical frequency $\bar{x}=\frac{1}{N}\sum x_{i}$ differs from the expected frequency $\mathrm{E}[\bar{x}]=|\alpha|^{2}$ by some finite positive error $\epsilon$ , i.e., $\left|\bar{x}-|\alpha|^{2}\right|>\epsilon$ (here and elsewhere expectation values will always mean those computed according to Born’s rule). We can then decompose $\ket{\Psi_{N}}$ into the projection $\ket{\Psi_{N}}_{\mathrm{M(averick)}}$ containing all maverick worlds, as well as the projection $\ket{\Psi_{N}}_{\mathrm{B(orn)}}$ containing all regular, or Born worlds, so that $\ket{\Psi_{N}}=\ket{\Psi_{N}}_{\mathrm{B}}+\ket{\Psi_{N}}_{\mathrm{M}}$ . To find $\left\lVert\ket{\Psi_{N}}_{\mathrm{M}}\right\rVert$ , we would need to evaluate

[TABLE]

We can however place an upper bound on this quantity. Because $n$ is the sum of independent random variables taking the values $\{0,1\}$ we use Hoeffding’s inequality Hoeffding (1963) to find

[TABLE]

It follows that $\lim_{N\to\infty}\left\lVert\ket{\Psi_{N}}_{\mathrm{M}}\right\rVert^{2}=0$ for any finite $\epsilon$ , and therefore that $\ket{\Psi_{\infty}}$ differs from $\ket{\Psi_{\infty}}_{\mathrm{B}}$ by a quantity of zero norm. Using the Cauchy–Schwarz inequality,

[TABLE]

we see that $\ket{\Psi_{\infty}}_{M}$ is orthogonal to (and decoupled from) all other states. Such vectors are not proper elements of the Hilbert space, and must be removed in order to maintain a positive definite inner product. We therefore identify those elements of $\mathcal{H}$ that differ by elements of $\mathcal{H}_{0}$ , the subspace of zero norm states. Our Hilbert space is then the quotient space denoted by $\mathcal{H}/\mathcal{H}_{0}$ , so that we may consider $\ket{\Psi_{\infty}}$ and $\ket{\Psi_{\infty}}_{B}$ to represent the same physical state, i.e., $\ket{\Psi_{\infty}}=\ket{\Psi_{\infty}}_{B}$ . We conclude that in the $N\to\infty$ limit the universal wavefunction contains only those worlds in which Born’s rule is observed.

Since the counter plays no role in this result, it strictly unnecessary that the spin of every particle be measured, only that infinitely many identically prepared particles be available to measure. However, a serious criticism of the frequentist program is that we do not perform measurements on systems containing infinitely many identically prepared particles Caves and Schack (2005). In the finite case, all frequencies of events have non-zero amplitudes, and consequently non-zero probabilities of occurring, so that it seems hardly even possible to define maverick worlds in this case. Everett and others have ultimately argued that the only way to produce the correct Born probabilities in this case is to assign a probability measure on the universal Hilbert space, so that maverick worlds are never observed simply because one is very unlikely to find oneself in such a world. However, by getting rid of one postulate at the cost of gaining another, this apparent resolution stands at odds with the position of the wavefunction as a complete description of the system. If the theory is to be self consistent, then the structure of the wavefunction itself must account for the appearance of Born’s rule.

II Maverick Worlds in the Thermodynamic Limit

A way out of this problem is to realize that any experiment performed on a finite multi-particle state such as $\ket{\psi}\ket{\psi}...\ket{\psi}$ must take place inside a Hilbert space containing also all the particles in the environment. Aguirre and Tegmark Aguirre and Tegmark (2011) have argued that in an infinite, statistically uniform cosmological model the laboratory state $\ket{\psi}$ must be replicated infinitely many times throughout the universe, thereby realizing the “fictitious” infinite ensemble needed to derive Born’s rule. (More specifically, the authors of Aguirre and Tegmark (2011) impose the stronger condition that both system plus experimenter be replicated infinitely many times, although this does not seem necessary.) However, this argument rests on some knowledge of the distribution of states that make up the universal wavefunction, and does not account for states that come arbitrarily close to, but never equal $\ket{\psi}$ . As we will show, the replica condition is actually unnecessary, so that we need only consider states of the form

[TABLE]

where $\{\ket{\varphi_{i}}\}$ now represent arbitrary environmental component states. Since there is no real distinction between the system and the environment, it is convenient to treat all degrees of freedom on an equal footing by absorbing the multi-particle state $\ket{\psi}\ket{\psi}...\ket{\psi}$ into the environmental degrees of freedom, letting

[TABLE]

Let us now find a suitable definition of a maverick world in this case. In terms of the computational basis, this state may again be written as

[TABLE]

As we have argued, there can be no condition placed on the outcome of any finite subset of measurement outcomes $\{x_{i}\}$ (provided that $p_{i}$ is neither [math] or $1$ ). Instead, we will take an information theoretic approach. First, given a joint probability distribution $p(x_{1},x_{2},\dots,x_{N})$ obtained from $\ket{\Omega_{N}}$ , we define a typical sequence to be a sequence $(x_{i})$ such that

[TABLE]

for some value $\epsilon>0$ , where

[TABLE]

is the Shannon entropy rate of the distribution $p(x_{1},x_{2},\dots,x_{N})$ Shannon (1948). This definition is motivated by the asymptotic equipartition property (AEP) Shannon (1948); Cover and Thomas (2006), which states that

[TABLE]

That is, as $N$ becomes large, the empirical entropy rate of a sequence chosen at random from the distribution $p(x_{1},x_{2},\dots,x_{N})$ tends towards $H(x)$ , which follows directly from the weak law of large numbers applied to the quantity $-\log_{2}p(x_{1},x_{2},\dots,x_{N})$ . We can therefore partition the set of all possible sequences $(x_{i})$ into two sets: a typical set, in which every element has probability $p(x_{1},x_{2},\dots,x_{N})\approx 2^{-NH(x)}$ , and a non-typical set, containing all other sequences. The AEP tells us that, in the $N\to\infty$ limit, the probability of randomly selecting an element that belongs to the non-typical set is zero.

Having established the expected behaviour of the classical sequences $\{(x_{i})\}$ , we identify typical sequences with Born worlds and the remaining (non-typical) sequences with maverick worlds. To see that this identification agrees with our intuition, note that if we have $N$ repetitions of the state $\sum_{k}c_{k}\ket{k}$ then the inequality in equation 10 will contain the term

[TABLE]

which attains a minimum when $\log_{2}\left|c_{k}\right|^{2}$ occurs roughly $N\left|c_{k}\right|^{2}$ times in the first sum, or equivalently, when there are roughly $N\left|c_{k}\right|^{2}$ particles in the state $\ket{k}$ . It remains to be shown that the state $\ket{\Omega_{N}}=\ket{\Omega_{N}}_{B}+\ket{\Omega_{N}}_{M}$ contains no non-typical sequences in the $N\to\infty$ limit, i.e., that $\ket{\Omega_{\infty}}_{M}=0$ . But since $\left\lVert\ket{\Omega_{N}}_{\mathrm{M}}\right\rVert^{2}$ is precisely the quantity $\mathrm{Pr}\left[\left|-\tfrac{1}{N}\log_{2}p(x_{1},x_{2},\dots,x_{N})-H(x)\right|>\epsilon\right]$ , by the AEP, we may conclude that $\lim_{N\to\infty}\left\lVert\ket{\Omega_{N}}_{\mathrm{M}}\right\rVert^{2}=0$ and that $\ket{\Omega_{\infty}}=\ket{\Omega_{\infty}}_{B}$ in general. The fact that we observe Born’s rule therefore stems from a rather surprising place: From a practical point of view, choosing a sequence $(x_{1},x_{2},\dots)$ at random from the distribution $p(x_{1},x_{2},\dots)$ is indistinguishable from choosing among the set of typical sequences with a uniform probability distribution. Since $\ket{\Omega_{\infty}}$ contains only such sequences, our observations must appear to be governed by the distribution $p(x_{1},x_{2},\dots)$ as obtained from Born’s rule.

Although it seems that we needed to make a detour into the classical world to derive this result, we stress that the AEP and the weak law of large numbers from which it is derived are purely algebraic inequalities which may be applied to the quantities $\left|c_{x_{1}x_{2}\dots x_{N}}\right|^{2}$ without any mention of probabilities. Since our argument hinges on this point, it is worth doing so explicitly. We start by recalling an important inequality (see appendix):

Theorem II.1 (Chebyshev’s Inequality)

Given an arbitrary state $\ket{\Psi}$ and a Hermitian operator $\hat{Y}$ , let $\hat{P}_{|Y-\mu_{Y}|>\epsilon}$ be the projection operator that preserves states for which $\left|Y-\bra{\Psi}\hat{Y}\ket{\Psi}\right|>\epsilon$ for some $\epsilon>0$ . Then

[TABLE]

where $\mathrm{Var}(Y)=\bra{\Psi}(\hat{Y}-\bra{\Psi}\hat{Y}\ket{\Psi})^{2}\ket{\Psi}$ .

Phrased in this form, Chebyshev’s inequality is independent of any probabilistic interpretation, and may be used to derive the AEP by applying it to $\ket{\Omega_{N}}$ , letting $Y(x_{1},x_{2},\dots,x_{N})=-\frac{1}{N}\log\left|c_{x_{1}\dots x_{N}}\right|^{2}$ . Then

[TABLE]

Since $\ket{\Omega_{N}}=\ket{\phi_{1}}\ket{\phi_{2}}\dots\ket{\phi_{N}}$ the quantity $\left|c_{x_{1}\dots x_{N}}\right|^{2}$ can be factorized into the form $|c_{1}(x_{1})|^{2}\dots|c_{N}(x_{N})|^{2}$ so that

[TABLE]

Then, provided that $\mathrm{Var}(\log|c_{i}(x_{i})|^{2})<M$ for all $i$ , we may write

[TABLE]

It follows then that $\lim_{N\to\infty}\left\lVert\ket{\Omega_{N}}_{\mathrm{M}}\right\rVert^{2}=0$ and therefore that $\ket{\Omega_{\infty}}=\ket{\Omega_{\infty}}_{B}$ as claimed.

III Concluding Remarks

The importance of environmental degrees of freedom in obtaining Born’s rule was already recognized by Zurek Zurek (1982) in the context of environmentally induced decoherence. However, in our result the environment plays a non-dynamical role, serving to define, and then get rid of maverick worlds, a sort of Mach’s principle for quantum states. Therefore, while system and environmental degrees of freedom are a priori independent, they must be considered together as parts of a single quantum state. Our result shows that if one allows for systems with infinitely many degrees of freedom to exist, then Born’s rule arises from the theory quite automatically. However, we do not impose any ad hoc measure on the Hilbert space in order to achieve this result. Instead, we note that in order for the inner product to be positive definite, all vectors of zero norm must be identified with the zero vector. Thus the physical Hilbert space is not $\mathcal{H}$ but $\mathcal{H}/\mathcal{H}_{0}$ , in which $\ket{\Omega_{\infty}}-\ket{\Omega_{\infty}}_{B}$ is identically zero.

It is also worth noting that the argument presented in this paper works strictly in the limit $N\to\infty$ , and fails for any finite value of $N$ , where maverick states are in the majority. Thus the statistical behaviour of the finite system does not approach that of the infinite system continuously. To fully account for Born’s rule, we must assume that our universe contains infinitely many degrees of freedom (rather than some arbitrarily large number, as is normally done). While this does not seem to be an overly objectionable assumption to us, we can turn this reasoning around and view Born’s rule instead as an experimental validation of this possibility. That is, the absence of maverick states supports the idea that the universe is an infinite system.

IV Acknowledgments

The author wishes to thank Conor Stokes for helpful discussion.

V Appendix

Lemma V.1 (Markov’s Inequality)

Given an arbitrary state $\ket{\Psi}$ and a non-negative Hermitian operator $\hat{f}$ , let $\hat{P}_{f>a}$ and $\hat{P}_{f\leq a}$ be projection operators that separate those states for which $f>a$ from those for which $f\leq a$ for some $a>0$ . Then

[TABLE]

Proof

Without loss of generality, we may consider a basis that diagonalizes $\hat{f}$ , i.e., let

[TABLE]

such that

[TABLE]

Then

[TABLE]

Since $a>0$ ,

[TABLE]

Proof of theorem II.1 (Chebyshev’s Inequality)

The result follows directly by taking $\hat{f}=(\hat{Y}-\mu_{Y})^{2}$ where $\mu_{Y}=\bra{\Psi}\hat{Y}\ket{\Psi}$ and $a=\epsilon^{2}$ .

Bibliography15

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Wigner (1961) E. P. Wigner, in The Scientist Speculates , edited by I. J. Good (Heineman, 1961).
2von Neumann and Beyer (1955) J. von Neumann and R. Beyer, Mathematical Foundations of Quantum Mechanics , Goldstine Printed Materials (Princeton University Press, 1955).
3Everett (1956) H. Everett, Wave Mechanics Without Probability , Ph.D. thesis, Princeton University (1956).
4Dewitt and Graham (1973) B. S. Dewitt and N. Graham, eds., The Many-Worlds Interpretation of Quantum Mechanics (Princeton University Press, 1973).
5Zurek (1982) W. H. Zurek, Phys. Rev. D 26 , 1862 (1982) . · doi ↗
6Joos and Zeh (1985) E. Joos and H. D. Zeh, Zeitschrift für Physik B Condensed Matter 59 , 223 (1985) . · doi ↗
7O’Connell et al. (2010) A. D. O’Connell, M. Hofheinz, M. Ansmann, R. C. Bialczak, M. Lenander, E. Lucero, M. Neeley, D. Sank, H. Wang, M. Weides, J. Wenner, J. M. Martinis, and A. N. Cleland, Nature (London) 464 , 697 (2010) . · doi ↗
8Fein et al. (2019) Y. Fein, P. Geyer, P. Zwick, F. Kiałka, S. Pedalino, M. Mayor, S. Gerlich, and M. Arndt, Nature Physics , 1 (2019) . · doi ↗

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Less is More: Born’s Rule from Quantum Frequentism

Abstract

I Introduction

I.1 The Problem with (Too) Many Worlds

I.2 Quantum Frequentism

II Maverick Worlds in the Thermodynamic Limit

Theorem II.1** (Chebyshev’s Inequality)**

III Concluding Remarks

IV Acknowledgments

V Appendix

Lemma V.1** (Markov’s Inequality)**

Theorem II.1 (Chebyshev’s Inequality)

Lemma V.1 (Markov’s Inequality)