An Algebraic Approach to Koopman Classical Mechanics

Peter Morgan

arXiv:1901.00526·quant-ph·February 18, 2020

TL;DR

This paper reformulates classical mechanics using algebraic operators and constructs a Hilbert space framework, bridging classical and quantum mechanics and offering new insights into the measurement problem.

Contribution

It introduces a unary operator algebraic formulation of classical mechanics that parallels quantum mechanics, enabling a unified measurement theory and addressing the measurement problem.

Findings

01 Classical mechanics can be expressed with noncommutative operators.

02 A Hilbert space representation of classical mechanics is constructed.

03 The approach provides a formal way to reconcile collapse and no-collapse interpretations.

Abstract

Classical mechanics is presented here in a unary operator form, constructed using the binary multiplication and Poisson bracket operations that are given in a phase space formalism, then a Gibbs equilibrium state over this unary operator algebra is introduced, which allows the construction of a Hilbert space as a representation space of a Heisenberg algebra, giving a noncommutative operator algebraic variant of the Koopman-von Neumann approach. In this form, the measurement theory for unary classical mechanics can be the same as and inform that for quantum mechanics, expanding classical mechanics to include noncommutative operators so that it is close to quantum mechanics, instead of attempting to squeeze quantum mechanics into a classical mechanics mold. The measurement problem as it appears in unary classical mechanics suggests a classical signal analysis approach that can also be…

Equations

\cdot : u, v

\{,\} : u, v

\hat{q} : u(q,p) \mapsto q \cdot u(q,p), \qquad \hat{p} : u(q,p) \mapsto p \cdot u(q,p),

\hat{Q} : u(q,p) \mapsto \{p,u\}(q,p) = \frac{\partial}{\partial q} u(q,p),

\hat{P} : u(q,p) \mapsto \{u,q\}(q,p) = \frac{\partial}{\partial p} u(q,p),

\rho(q^{2m} p^{2n}) = \int q^{2m} p^{2n} \frac{1}{2\pi\mathsf{k}_{B}\mathsf{T}} \mathrm{e}^{-(q^{2}+p^{2})/2\mathsf{k}_{B}\mathsf{T}} \mathrm{d}q\,\mathrm{d}p = (\mathsf{k}_{B}\mathsf{T})^{m+n} \frac{(2m)!}{2^{m} m!} \frac{(2n)!}{2^{n} n!},

\rho(\mathrm{e}^{\mathsf{j}\lambda q + \mathsf{j}\mu p}) = \mathrm{e}^{-\mathsf{k}_{B}\mathsf{T}(\lambda^{2}+\mu^{2})/2},

P(\mathring{q},\mathring{p}) = \rho\bigl(\delta(q-\mathring{q})\delta(p-\mathring{p})\bigr) = \frac{1}{2\pi\mathsf{k}_{B}\mathsf{T}} \mathrm{e}^{-(\mathring{q}^{2}+\mathring{p}^{2})/2\mathsf{k}_{B}\mathsf{T}}.

\hat{q} = (a + a^{\dagger}) \sqrt{\mathsf{k}_{B}\mathsf{T}},

\hat{Q} = (a - a^{\dagger}) / 2\sqrt{\mathsf{k}_{B}\mathsf{T}},

\hat{F}_{f} = f_{1}\hat{q} + f_{2}\hat{p} + 2\mathsf{k}_{B}\mathsf{T}(f_{3}\mathsf{j}\hat{Q} + f_{4}\mathsf{j}\hat{P}), \quad f \doteq (f_{1},f_{2},f_{3},f_{4}), \quad \hat{F}_{f}^{\dagger} = \hat{F}_{f^{*}},

\rho(\mathrm{e}^{\mathsf{j}\lambda\hat{F}_{f}})

(f,g) \doteq \rho(\hat{F}_{f}^{\dagger}\hat{F}_{g}) = \mathsf{k}_{B}\mathsf{T}\bigl[(f_{1}^{*}+\mathsf{j}f_{3}^{*})(g_{1}-\mathsf{j}g_{3}) + (f_{2}^{*}+\mathsf{j}f_{4}^{*})(g_{2}-\mathsf{j}g_{4})\bigr],

[\hat{F}_{f},\hat{F}_{g}] = (f^{*},g) - (g^{*},f) = 2\mathsf{j}\mathsf{k}_{B}\mathsf{T}\bigl[f_{3}g_{1} - f_{1}g_{3} + f_{4}g_{2} - f_{2}g_{4}\bigr],

\rho(\mathrm{e}^{\mathsf{j}\lambda_{1}\hat{F}_{f_{1}}} \cdots \mathrm{e}^{\mathsf{j}\lambda_{n}\hat{F}_{f_{n}}}) =

\rho(\hat{A}\,\mathrm{e}^{\mathsf{j}\lambda\hat{F}_{f}}\mathrm{e}^{\mathsf{j}\mu\hat{F}_{g}}\hat{B}) = \mathrm{e}^{-\lambda\mu[(f^{*},g)-(g^{*},f)]} \rho(\hat{A}\,\mathrm{e}^{\mathsf{j}\mu\hat{F}_{g}}\mathrm{e}^{\mathsf{j}\lambda\hat{F}_{f}}\hat{B}),

\rho(\hat{A}\,\mathrm{e}^{\mathsf{j}\lambda\hat{F}_{f}}\mathrm{e}^{\mathsf{j}\mu\hat{F}_{g}}\hat{B}) = \mathrm{e}^{-\lambda\mu[(f^{*},g)-(g^{*},f)]/2} \rho(\hat{A}\,\mathrm{e}^{\mathsf{j}\hat{F}_{\lambda f+\mu g}}\hat{B}),

(f,g) = \int (\tilde{f}(k),\tilde{g}(k))\,2\pi\delta(k^{2}-1)\,\frac{\mathrm{d}k}{2\pi}.

(f,g) = \int (f_{0},g_{0})\,\mathrm{e}^{-\tau^{2}k^{2}}\,2\pi\delta(k^{2}-1)\,\frac{\mathrm{d}k}{2\pi} = (f_{0},g_{0})\,\mathrm{e}^{-\tau^{2}}.

(f,g) = \int (\tilde{f}(k),\tilde{g}(k))\,\tilde{G}(k)\,\frac{\mathrm{d}k}{2\pi},

(f,g) = \hbar \int \tilde{f}^{*}(k)\tilde{g}(k)\,2\pi\delta(k\cdot k - m^{2})\,\frac{\mathrm{d}^{4}k}{(2\pi)^{4}},

\rho(\lambda\hat{A}+\mu\hat{B}) = \lambda\rho(\hat{A}) + \mu\rho(\hat{B}), \quad \rho(\hat{A}^{\dagger}\hat{A}) \geq 0, \quad \rho(\hat{1}) = 1,

\rho_{\hat{A}}(\hat{B}) = \frac{\rho(\hat{A}^{\dagger}\hat{B}\hat{A})}{\rho(\hat{A}^{\dagger}\hat{A})} = \frac{\langle\hat{A}|\hat{B}|\hat{A}\rangle}{\langle\hat{A}|\hat{A}\rangle} = \frac{\langle\hat{1}|\hat{A}^{\dagger}\hat{B}\hat{A}|\hat{1}\rangle}{\langle\hat{1}|\hat{A}^{\dagger}\hat{A}|\hat{1}\rangle},

\rho_{H}\bigl(\delta(q-\mathring{q})\delta(p-\mathring{p})\bigr) = \frac{1}{\mathcal{N}} \mathrm{e}^{-H(\mathring{q},\mathring{p})/\mathsf{k}_{B}\mathsf{T}} = P(\mathring{q},\mathring{p}),

\rho_{H}(\mathrm{e}^{\mathsf{j}\lambda q + \mathsf{j}\mu p}) = \tilde{P}(\lambda,\mu).

\rho_{H}(\mathrm{e}^{\mathsf{j}\lambda_{1}\hat{F}_{f_{1}}} \cdots \mathrm{e}^{\mathsf{j}\lambda_{n}\hat{F}_{f_{n}}}) = \rho(\hat{U}_{H}^{\dagger}\,\mathrm{e}^{\mathsf{j}\lambda_{1}\hat{F}_{f_{1}}} \cdots \mathrm{e}^{\mathsf{j}\lambda_{n}\hat{F}_{f_{n}}}\,\hat{U}_{H}).

\rho_{H}\bigl(\delta(\hat{q}-\mathring{q})\delta(\hat{p}-\mathring{p})\bigr)

\tilde{P}_{H}(\lambda,\mu) = \tilde{\Xi}_{H}(\lambda,\mu)\,\mathrm{e}^{-\mathsf{k}_{B}\mathsf{T}'(\lambda^{2}+\mu^{2})/2} = \tilde{\Xi}_{H}(\lambda,\mu)\,\rho_{[\mathsf{k}_{B}\mathsf{T}']}(\mathrm{e}^{\mathsf{j}\lambda\hat{q} + \mathsf{j}\mu\hat{p}}), \quad \tilde{\Xi}_{H}(0,0) = 1,

\tilde{\Xi}_{H}(\lambda,\mu) = \tilde{P}_{H}(\lambda,\mu)\,\mathrm{e}^{+\mathsf{k}_{B}\mathsf{T}'(\lambda^{2}+\mu^{2})/2}

\rho_{H}(\mathrm{e}^{\mathsf{j}\lambda_{1}\hat{F}_{f_{1}}} \cdots \mathrm{e}^{\mathsf{j}\lambda_{n}\hat{F}_{f_{n}}}) = \int \Xi_{H}(\alpha,\beta)\,\rho_{[\mathsf{k}_{B}\mathsf{T}']}(\mathrm{e}^{\alpha\hat{Q}+\beta\hat{P}}\,\mathrm{e}^{\mathsf{j}\lambda_{1}\hat{F}_{f_{1}}} \cdots \mathrm{e}^{\mathsf{j}\lambda_{n}\hat{F}_{f_{n}}}\,\mathrm{e}^{-\alpha\hat{Q}-\beta\hat{P}})\,\mathrm{d}\alpha\,\mathrm{d}\beta.

\rho_{H}(\mathrm{e}^{\mathsf{j}\lambda_{1}\hat{F}_{f_{1}}} \cdots \mathrm{e}^{\mathsf{j}\lambda_{n}\hat{F}_{f_{n}}}) = \int \Xi_{H}(\alpha,\beta)\,\rho_{[\mathsf{k}_{B}\mathsf{T}']}(\mathrm{e}^{\alpha\hat{Q}+\beta\hat{P}+\mathsf{j}F(\alpha,\beta,\hat{q},\hat{p})}\,\mathrm{e}^{\mathsf{j}\lambda_{1}\hat{F}_{f_{1}}} \cdots \mathrm{e}^{\mathsf{j}\lambda_{n}\hat{F}_{f_{n}}}\,\mathrm{e}^{-\alpha\hat{Q}-\beta\hat{P}-\mathsf{j}F(\alpha,\beta,\hat{q},\hat{p})})\,\mathrm{d}\alpha\,\mathrm{d}\beta

\hat{Y}_{u} : \mathcal{A} \rightarrow \mathcal{A}; \; v \mapsto \hat{Y}_{u}(v) = u \cdot v,

Full text

An Algebraic Approach to Koopman Classical Mechanics

Peter Morgan

Physics Department, Yale University, New Haven, CT 06520, USA.

Abstract

Classical mechanics is presented here in a unary operator form, constructed using the binary multiplication and Poisson bracket operations that are given in a phase space formalism, then a Gibbs equilibrium state over this unary operator algebra is introduced, which allows the construction of a Hilbert space as a representation space of a Heisenberg algebra, giving a noncommutative operator algebraic variant of the Koopman–von Neumann approach. In this form, the measurement theory for unary classical mechanics can be the same as and inform that for quantum mechanics, expanding classical mechanics to include noncommutative operators so that it is close to quantum mechanics, instead of attempting to squeeze quantum mechanics into a classical mechanics mold. The measurement problem as it appears in unary classical mechanics suggests a classical signal analysis approach that can also be successfully applied to the measurement problem of quantum mechanics. The development offers elementary mathematics that allows a formal reconciliation of “collapse” and “no–collapse” interpretations of quantum mechanics.

keywords: Classical Mechanics, Koopman-von Neumann formalism, Quantum Mechanics

journal: Annals of Physics

$\blacktriangleright$ The algebra of classical mechanics observables in unary form is noncommutative.

$\blacktriangleright$ Measurement for this noncommutative algebra is the same as for quantum mechanics.

$\blacktriangleright$ A signal analysis approach unifies the classical and quantum pictures.

$\blacktriangleright$ The Liouvillian of CM is not bounded below, in contrast to the Hamiltonian of QM.

$\blacktriangleright$ “Collapse” of the state is shown equivalent to a constraint on joint measurements.

1 Introduction

An algebraic approach to Koopman’s Hilbert space formalism for classical mechanics gives us mathematical tools that are powerful enough that we can obey Bohr’s stricture that we must describe an experimental apparatus classically very closely indeed. The noncommutativity of the transformation algebra that is introduced by the Poisson bracket is so natural for a classical physicist who fully exploits the Hilbert space tools available that not only can we classically describe the experimental apparatus, we can also describe the analysis of the experimental raw data as effectively as we can in quantum mechanics.

Many discussions now appear in the physics literature that probe the relationship between classical and quantum physics, within which Koopman’s 1931 introduction of a Hilbert space formalism for classical mechanics increasingly appears. The approach here contrasts with traditional Koopman–type approaches by being more algebra–centric, inspired by the algebraic approach to quantum mechanics[1] and to quantum field theory[2], so that Koopman’s Hilbert space formalism for classical mechanics becomes more a classical signal analysis formalism (which introduces noncommutativity very naturally just in its free use of fourier and other integral transforms over time, a freedom that is not usually permitted in classical mechanics). Experimental raw data is taken here to be a finite, lossily compressed record of an arbitrarily detailed collection of possible measurements of a noisy signal voltage that could have been recorded: even though we only actually perform a finite number of measurements, the set of measurements we could have performed in the past and the set of measurements we might perform in the future are both arbitrarily large. At the level of actual signals, responses to the surrounding environment do not occur as instantaneous “collapses”: they instead transition over very short but finite times that depend as much on the material and electronic circuitry of the apparatus as on the surroundings. Positioning the finite number of measurements we actually perform within a continuously indexed set of possible measurements gives us a theory that discusses a field of measurements, represented by an algebra of operators, together with a relatively unstructured “state”, which gives us information about what the results would be: it is helpful to think of the mathematics of quantum field theory as a continuously indexed field of measurements, not as a field that is measured.

The construction given here takes inspiration from many places: from quantum non–demolition measurement[3], from the Koopman–von Neumann[4, 5, 6, 7] and similar approaches[8, 9, 10, 11] to classical mechanics, and from generalized probability theory[12], and there is also a control systems literature that considers mixed classical and quantum models[13, 14, 15]. Another inspiration —which is, however, rather flawed because it is not statistical— is the long use of the Wigner function and other time–frequency distributions in signal analysis[16]. This practical use of the Heisenberg algebra[17], generated in the time–frequency case by $[\partial/\partial t,t]=1$ , with a complex structure provided by fourier analysis, and the consequent approach to the Heisenberg uncertainty principle, has become well–known in popular science, and it may be a useful resource for some audiences, because there are several videos that present a similar idea on much–followed YouTube channels[18, 19, 20, 21]. More closely, Wigner function approaches to quantum mechanics are well–known and much developed[22][23, Ch. 15], putting classical and quantum mechanics both into phase space formalisms, whereas Koopman–type approaches put classical and quantum mechanics both into Hilbert space formalisms: the two approaches have different merits, but Koopman–type approaches have thus far been much less developed.

There is also much inspiration from “Geometric Quantization”[24], but there is an essential contrast with that approach: quantization valiantly attempts to construct a map from the commutative algebra generated by classical phase space position and momentum observables $q$, $p$, to the noncommutative Heisenberg algebra generated by quantum observables $\hat{q}$, $\hat{p}$, but, to say it bluntly, fails. Classical mechanics in a phase space formalism takes the observables of the theory to be functions on phase space, which does not include the unary algebraic operators that can naturally be constructed using the Poisson bracket. In phase space classical mechanics the action of the Poisson bracket as a binary operation is closed on the commutative algebra of functions on phase space (given two functions $u$ and $v$ on phase space, $\{u,v\}$ is also a function on phase space), but it is classically natural in a Koopman–type approach to use the unary operators that can be constructed using the Poisson bracket as observables, not only as generators of transformations. A less constrained unary classical mechanics formalism leads to a larger, noncommutative algebra of observables, which in elementary cases amounts to allowing classical physics to use functions $u(q,p,\partial/\partial q,\partial/\partial p)$ as observables, not just functions $u(q,p)$. A noncommutative algebra generated by $q$, $\partial/\partial q$ and by $p$, $\partial/\partial p$ is a classical mechanics that we will call CM+, which, as an instance of the Heisenberg algebra, can be mapped to two copies of quantum $\hat{q}$, $\hat{p}$. The paper shares with Wetterich[10] a concern to motivate and to justify the use of noncommutativity as a natural classical tool, but focuses on the use of the Poisson bracket and on applying the methods of algebraic quantum mechanics to classical mechanics.
Given a Gibbs equilibrium state over the classical algebra we can construct a complex Hilbert space (with a complex structure provided below by the use of characteristic functions), and use that Hilbert space to model physics.

Cohn, in 1980, presented a comparable “Operator formulation of classical mechanics”[25], from which the present approach differs in notation, by using only Gibbs equilibrium states, and by taking it to be the measurement theory that can be considered common between unary classical and quantum mechanics (despite the difference, discussed in §8, between thermal and quantum fluctuations).

We will begin with the simple harmonic oscillator in §2, using the binary multiplication and Poisson bracket operations to construct unary operators, and introducing a Gibbs equilibrium state, then §2.1 gives a signal analysis approach to the simple harmonic oscillator by introducing an explicit time parameter. §4 introduces more general Hamiltonians, but with the Poisson bracket structure still the same as for the simple harmonic oscillator, then §5 discusses a more general phase space, for which both the Hamiltonian and the Poisson bracket are nontrivial. §3 gives a short discussion of the (Gelfand–Naimark–Segal) GNS–construction of a Hilbert space in a relatively elementary way, focusing on states over algebras of operators, which usefully frees us from thinking of the Hilbert space as necessarily pre–eminent, though the familiar Hilbert space formalism will remain the first choice for practical use.

§6 shows how to construct states as modulations of the Gibbs equilibrium state and how to construct a noncommutative algebra of operators in a way that is classically natural, particularly from a signal analysis perspective, which can be used to model transformations that are performed either by experimental apparatus or by signal analysis algorithms whenever such sophistication is needed. §7 develops and illustrates the consequences of Koopman–type mathematics for the measurement problem in §7.1, suggesting a Joint Measurement Principle as a way to remove the necessity for a “collapse” dynamics; for Bell–type inequalities in §7.2, applying the Joint Measurement Principle and suggesting in §7.2.1 an investigation of the variation over time of the violation of Bell–type inequalities before steady state statistics have been established; and §7.3 applies the thinking of the preceding sections in a contrasting, figurative way to perhaps the most elementary and whimsical of examples, Schrödinger’s cat.

§7.1 discusses the measurement problem in a way that is natural relative to —but largely independent of— the development of the mathematics of a Koopman–type approach to classical mechanics in §§2–6, to some extent following the idea of Belavkin’s approach[26]. After a measurement represented by an operator $\hat{A}$ , the “collapse” of the state can be modeled as two steps: the first step, within the linear Hilbert space formalism, can be modeled by a projective superoperator action (known as a Lüders transformer) on the density operator that represents a state, $\hat{\rho}\mapsto\hat{\rho}_{A}$ ; the second step can be modeled as a stochastic jump, as in [26]. Excluding the second step greatly simplifies the relevant mathematics, and, in addition, stochastic jumps are both not part of classical statistical physics at the level of the Gibbs equilibrium probability density and cannot be modeled as a linear operation. The superoperator action of the Lüders transformer on the state is equivalent to replacing the collapse of the state by a superoperator action of the same Lüders transformer on a subsequent measurement $\hat{X}$ , a projection to the commutator subalgebra of $\hat{A}$ , $\hat{X}\mapsto\hat{X}_{A}$ , because of the identity, Eq. (40), $\mathsf{Tr}\bigl{[}\hat{A}\hat{X}\hat{\rho}_{\!A}\bigr{]}=\mathsf{Tr}\bigl{[}\hat{A}\hat{X}_{\!A}\hat{\rho}\bigr{]}$ , which is exactly enough to make the subsequent measurement $\hat{X}_{A}$ jointly measurable with $\hat{A}$ . When joint measurements are in fact performed, they must be modeled by mutually commutative operators to ensure that in all states a joint probability density is generated by the quantum or classical Hilbert space formalism: “collapse” very effectively ensures, implicitly, that this is the case. 
As a partial resolution of the measurement problem, excluding the final stochastic jump that has usually not been a great concern for classical physics, we can think of “collapse” of the state as ensuring a necessary property for joint measurements instead of as a change of the state. The answer to the question ‘does this approach include “collapse”?’, which is a fundamental bellwether for interpretations, is ‘this is how we can answer both yes and no,’ with the mathematics concerned making it possible to reconcile interpretations that differ over this hitherto rather metaphysical choice.
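The trace identity behind this argument can be checked on a small example. The sketch below is illustrative only: it assumes the standard Lüders map $\hat{M}\mapsto\sum_{i}\hat{P}_{i}\hat{M}\hat{P}_{i}$ built from the spectral projectors $\hat{P}_{i}$ of $\hat{A}$, and the 3×3 matrices `A`, `X`, and `rho` are made-up values, not taken from the paper.

```python
from fractions import Fraction


def matmul(a, b):
    """Product of two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def trace(a):
    """Sum of the diagonal entries."""
    return sum(a[i][i] for i in range(len(a)))


def luders(m, projectors):
    """Assumed Lüders map: m -> sum_i P_i m P_i."""
    n = len(m)
    out = [[Fraction(0)] * n for _ in range(n)]
    for p in projectors:
        pmp = matmul(matmul(p, m), p)
        out = [[out[i][j] + pmp[i][j] for j in range(n)] for i in range(n)]
    return out


# Hypothetical example: A = 1*P1 + 2*P2, with a degenerate eigenvalue 1.
P1 = [[1, 0, 0], [0, 1, 0], [0, 0, 0]]
P2 = [[0, 0, 0], [0, 0, 0], [0, 0, 1]]
A = [[1, 0, 0], [0, 1, 0], [0, 0, 2]]
X = [[0, 1, 2], [1, 3, 4], [2, 4, 5]]          # a subsequent measurement
rho = [[Fraction(v, 6) for v in row]           # density matrix with trace 1
       for row in ([2, 1, 1], [1, 2, 0], [1, 0, 2])]

rho_A = luders(rho, [P1, P2])   # "collapse" applied to the state
X_A = luders(X, [P1, P2])       # the same map applied to the measurement

lhs = trace(matmul(matmul(A, X), rho_A))   # Tr[A X rho_A]
rhs = trace(matmul(matmul(A, X_A), rho))   # Tr[A X_A rho]
commutes = matmul(A, X_A) == matmul(X_A, A)
print(lhs, rhs, commutes)
```

In this example the two traces agree and `X_A` commutes with `A`, whereas the uncollapsed `Tr[A X rho]` takes a different value, which is the sense in which the projection can be moved from the state onto the later measurement.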

Koopman–type constructions can perhaps best be appreciated as offering an alternative relationship between classical and quantum mechanics that is a significantly more unifying approach than the quantization we are used to, not as a replacement for the very effective mathematics of quantum physics. With some subtleties, we can add a Koopman picture to the physical Schrödinger and Heisenberg pictures of Hilbert space mathematics. By understanding Koopman Classical Mechanics and its relationship with Quantum Mechanics, we can hope, a little, to gain an edge in our understanding of Quantum Mechanics.

2 The Simple Harmonic Oscillator

We will first work with a simple harmonic oscillator, for which an abstract “position” might perhaps be the displacement of a pendulum, but might also be a voltage or any abstract degree of freedom that is subject to a linear restorative “force”: we will take Hamiltonian mechanics to be an abstract formalism for generating a conservative evolution equation, not a “particle” theory. The position may be a point in a many–dimensional vector space, but we will work here as if it is one–dimensional, without indices (which can easily be added, however). A subsection, §2.1, introduces a signal analysis approach that brings the simple harmonic oscillator much closer to being a field theory.

For the simple harmonic oscillator there are no constraints, so that the elementary observables of the system are straightforward functions of position and momentum, $u(q,p)$ , for which we have, as well as the linear vector space structure $\lambda u(q,p)+\mu v(q,p)=[\lambda u+\mu v](q,p)$ , the binary multiplication operation and the trivial binary Poisson bracket operation,

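$$\cdot:u,v\mapsto u\cdot v,\quad[u\cdot v](q,p)=u(q,p)\,v(q,p),\qquad\{,\}:u,v\mapsto\{u,v\},\quad\{u,v\}(q,p)=\frac{\partial u}{\partial p}\frac{\partial v}{\partial q}-\frac{\partial u}{\partial q}\frac{\partial v}{\partial p},$$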

with both operations being bilinear and with the latter being also a biderivation. This structure is not a straightforward algebra, for which there is only the linear vector space structure and one other binary operation. We use these binary operations to construct four linear unary operators that act on functions such as $u(q,p)$ ,

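$$\hat{q}:u(q,p)\mapsto q\cdot u(q,p),\qquad\hat{p}:u(q,p)\mapsto p\cdot u(q,p),$$

$$\hat{Q}:u(q,p)\mapsto\{p,u\}(q,p)=\frac{\partial}{\partial q}u(q,p),\qquad\hat{P}:u(q,p)\mapsto\{u,q\}(q,p)=\frac{\partial}{\partial p}u(q,p),$$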

which can be used to construct a general unary operator as a function of $\hat{q}$, $\hat{p}$, $\hat{Q}$, $\hat{P}$, for which a product can be defined by composition, $[\hat{q}\hat{p}](w)=\hat{q}(\hat{p}(w))$, et cetera. For perhaps the most important example, the Hamiltonian function both becomes a multiplicative unary operator, $\hat{H}=\tfrac{1}{2}(\hat{q}^{2}+\hat{p}^{2})$, and becomes the Liouvillian unary operator, $\hat{L}:u(q,p)\mapsto\{H,u\}(q,p)$, $\hat{L}=\hat{p}\hat{Q}-\hat{q}\hat{P}$. Note carefully that this is not quantum theory, because $\hat{q}$ and $\hat{p}$ commute and because the Liouvillian unary operator, which generates evolution over time, is not a positive operator. The algebraic structure is nonetheless closely comparable to that of quantum theory, because $\hat{Q}$ and $\hat{P}$ are both derivations, so that $[\hat{Q},\hat{q}]=1$ and $[\hat{P},\hat{p}]=1$, which, except for the absence of a complex structure that we will introduce below, gives two copies of the Heisenberg algebra. Note also that we cannot present the Liouvillian operator, nor any other generators of transformations, if we do not introduce $\hat{Q}$ and $\hat{P}$, which are essential elements of the unary algebraic structure because of the Poisson bracket. We cannot omit $\hat{Q}$ and $\hat{P}$ in a fully construed presentation of classical mechanics in a unary operator form: the functions $u(q,p)$ do not exhaust the questions that can be asked of a classical mechanical system, because, for example, it is reasonable for a classical physicist to ask whether a state is an eigenstate of the Liouvillian operator.
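As a rough illustration of this unary operator algebra, the following sketch represents polynomials in $q$ and $p$ as dictionaries and checks the commutation relations on an example. It assumes that $\hat{Q}$ and $\hat{P}$ act as $\partial/\partial q$ and $\partial/\partial p$; the dictionary representation and all function names are choices made for this example, not the paper's.

```python
from fractions import Fraction

# A polynomial u(q, p) is a dict {(i, j): c} standing for the sum of c * q**i * p**j.


def _clean(u):
    """Drop zero coefficients so that equal polynomials compare equal."""
    return {k: c for k, c in u.items() if c != 0}


def sub(u, v):
    """Polynomial difference u - v."""
    out = dict(u)
    for k, c in v.items():
        out[k] = out.get(k, 0) - c
    return _clean(out)


def q_hat(u):
    """Multiplication by q."""
    return _clean({(i + 1, j): c for (i, j), c in u.items()})


def p_hat(u):
    """Multiplication by p."""
    return _clean({(i, j + 1): c for (i, j), c in u.items()})


def Q_hat(u):
    """u -> {p, u}, assumed here to act as d/dq."""
    return _clean({(i - 1, j): i * c for (i, j), c in u.items() if i > 0})


def P_hat(u):
    """u -> {u, q}, assumed here to act as d/dp."""
    return _clean({(i, j - 1): j * c for (i, j), c in u.items() if j > 0})


def commutator(A, B, u):
    """[A, B] applied to u, with products defined by composition."""
    return sub(A(B(u)), B(A(u)))


def liouvillian(u):
    """L = p_hat Q_hat - q_hat P_hat applied to u."""
    return sub(p_hat(Q_hat(u)), q_hat(P_hat(u)))


u = {(2, 1): 3, (0, 3): -1}                        # 3 q^2 p - p^3
H = {(2, 0): Fraction(1, 2), (0, 2): Fraction(1, 2)}  # (q^2 + p^2) / 2

print(commutator(Q_hat, q_hat, u) == u)   # [Q, q] acts as the identity
print(commutator(q_hat, p_hat, u))        # q and p commute
print(liouvillian(H))                     # H is invariant under L
```

Applying `liouvillian` to the monomials $q$ and $p$ gives $p$ and $-q$ respectively, the familiar rotation in phase space generated by the simple harmonic oscillator.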

For a measurement theory, we look for a linear, positive, normalized state over an algebra that is generated by $\hat{q}$ , $\hat{p}$ , $\hat{Q}$ , and $\hat{P}$ , for which we also provide the adjoint operation $\hat{q}^{\dagger}=\hat{q}$ , $\hat{p}^{\dagger}=\hat{p}$ , $\hat{Q}^{\dagger}=-\hat{Q}$ , $\hat{P}^{\dagger}=-\hat{P}$ , and, for any two unary operators, $(\hat{A}\hat{B})^{\dagger}=\hat{B}^{\dagger}\hat{A}^{\dagger}$ . We interpret a state as giving the average value associated with any self–adjoint unary operator in the given state, following an algebraic quantum mechanics framework[1], which is enough to make some kind of contact with the statistics of a collection of experimental raw data. Other consequences can be derived, such as the association of the spectrum of an operator with the sample space of a probability density, of projection operators with a logic and with probabilities, and of average values of powers of a self–adjoint operator with higher statistical moments, et cetera. If we wish to emphasize the experimental interpretation of a state —that the number it generates for a given operator is in some practical way connected to an average value of ensembles of experimental raw data— we can call it a statistical state.

To construct such a state, we first note that the Gibbs equilibrium state over the phase space of the simple harmonic oscillator at finite temperature $\mathsf{T}$ and Boltzmann constant $\mathsf{k}_{\!B}$ results in average values

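$$\rho(q^{2m}p^{2n})=\int q^{2m}p^{2n}\frac{1}{2\pi\mathsf{k}_{B}\mathsf{T}}\mathrm{e}^{-(q^{2}+p^{2})/2\mathsf{k}_{B}\mathsf{T}}\,\mathrm{d}q\,\mathrm{d}p=(\mathsf{k}_{B}\mathsf{T})^{m+n}\frac{(2m)!}{2^{m}m!}\frac{(2n)!}{2^{n}n!},$$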

or $\rho(q^{m}p^{n})=0$ if either $m$ or $n$ is odd. This can be presented in a characteristic function form as a Gaussian

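$$\rho(\mathrm{e}^{\mathsf{j}\lambda q+\mathsf{j}\mu p})=\mathrm{e}^{-\mathsf{k}_{B}\mathsf{T}(\lambda^{2}+\mu^{2})/2},\tag{5}$$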

which can be thought of in more elementary terms as a generating function for moments. We can also use an inverse fourier transform to return to a probability density, which we could write informally as

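$$P(\mathring{q},\mathring{p})=\rho\bigl(\delta(q-\mathring{q})\delta(p-\mathring{p})\bigr)=\frac{1}{2\pi\mathsf{k}_{B}\mathsf{T}}\mathrm{e}^{-(\mathring{q}^{2}+\mathring{p}^{2})/2\mathsf{k}_{B}\mathsf{T}}.$$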
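The closed forms for the Gibbs equilibrium averages can be compared against direct numerical integration. The sketch below assumes the Gaussian density $\mathrm{e}^{-(q^{2}+p^{2})/2\mathsf{k}_{B}\mathsf{T}}/2\pi\mathsf{k}_{B}\mathsf{T}$, which factorizes into independent $q$ and $p$ parts, and checks both the moment formula $(\mathsf{k}_{B}\mathsf{T})^{m+n}\frac{(2m)!}{2^{m}m!}\frac{(2n)!}{2^{n}n!}$ and the characteristic function $\mathrm{e}^{-\mathsf{k}_{B}\mathsf{T}(\lambda^{2}+\mu^{2})/2}$. The midpoint quadrature, its parameters, and the function names are choices made for this example.

```python
import cmath
import math


def gaussian_average(func, kT, half_width=12.0, steps=20000):
    """Midpoint-rule average of func(x) over a 1D Gaussian of variance kT."""
    sigma = math.sqrt(kT)
    start = -half_width * sigma
    h = 2 * half_width * sigma / steps
    norm = 1.0 / math.sqrt(2 * math.pi * kT)
    total = 0.0
    for i in range(steps):
        x = start + (i + 0.5) * h
        total += func(x) * norm * math.exp(-x * x / (2 * kT))
    return total * h


def gibbs_moment_numeric(a, b, kT):
    """rho(q**a * p**b); the density factorizes, so this is a product."""
    return (gaussian_average(lambda x: x ** a, kT)
            * gaussian_average(lambda x: x ** b, kT))


def gibbs_moment_formula(m, n, kT):
    """Closed form for rho(q**(2m) * p**(2n))."""
    def odd_double_factorial(k):
        # (2k)! / (2**k * k!) = (2k - 1)!!
        return math.factorial(2 * k) // (2 ** k * math.factorial(k))
    return kT ** (m + n) * odd_double_factorial(m) * odd_double_factorial(n)


def characteristic_numeric(lam, mu, kT):
    """rho(exp(j*lam*q + j*mu*p)) by numerical integration."""
    return (gaussian_average(lambda x: cmath.exp(1j * lam * x), kT)
            * gaussian_average(lambda x: cmath.exp(1j * mu * x), kT))


kT = 0.7  # arbitrary illustrative value of k_B T
print(gibbs_moment_numeric(4, 2, kT), gibbs_moment_formula(2, 1, kT))
print(characteristic_numeric(0.8, -1.3, kT),
      math.exp(-kT * (0.8 ** 2 + 1.3 ** 2) / 2))
```

Moments with an odd power of $q$ or $p$ come out numerically as zero, as the text states.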

The imaginary ${\mathsf{j}}$ has been introduced in Eq. (5) as an engineering convenience to allow a characteristic function to be constructed, but we will also use it as a central generator of a $*$ –algebra $\mathcal{C}$ that is generated by $\hat{q}$ , $\hat{p}$ , $\hat{Q}$ , $\hat{P}$ , and ${\mathsf{j}}$ , with adjoint ${\mathsf{j}}^{\dagger}=-{\mathsf{j}}$ . This introduction —which an engineer can make carelessly for its usefulness in presenting the sine and cosine components of the fourier transform systematically even if it might give a mathematician or a philosopher pause— also allows us to use ${\mathsf{j}}\hat{Q}$ and ${\mathsf{j}}\hat{P}$ as self-adjoint unary operators, for which measurement is relative to the fourier transform basis of improper eigenfunctions of ${\mathsf{j}}\hat{Q}$ and ${\mathsf{j}}\hat{P}$ . Other motivations for introducing a complex structure are possible, but an elementary argument can be made for considering the use of characteristic functions to be closely related to the use of complex Hilbert space methods[27].

A nonunique extension of the Gibbs equilibrium state to the algebra $\mathcal{C}$ can be constructed by using a raising and lowering operator algebra, $[a,a^{\dagger}]=[b,b^{\dagger}]=1$ ,

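$$\hat{q}=(a+a^{\dagger})\sqrt{\mathsf{k}_{B}\mathsf{T}},\qquad\hat{p}=(b+b^{\dagger})\sqrt{\mathsf{k}_{B}\mathsf{T}},$$

$$\hat{Q}=(a-a^{\dagger})/2\sqrt{\mathsf{k}_{B}\mathsf{T}},\qquad\hat{P}=(b-b^{\dagger})/2\sqrt{\mathsf{k}_{B}\mathsf{T}},$$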

which ensures that $[\hat{Q},\hat{q}]=[\hat{P},\hat{p}]=1$ . If we introduce an appropriately scaled object

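$$\hat{F}_{\bf f}=f_{1}\hat{q}+f_{2}\hat{p}+2\mathsf{k}_{B}\mathsf{T}\,(f_{3}\,\mathsf{j}\hat{Q}+f_{4}\,\mathsf{j}\hat{P}),\qquad{\bf f}\doteq(f_{1},f_{2},f_{3},f_{4}),\qquad\hat{F}_{\bf f}^{\dagger}=\hat{F}_{{\bf f}^{*}},$$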

we can construct a state that satisfies, for any unary operator $\hat{A}$ , $\rho(a^{\dagger}\hat{A})=\rho(b^{\dagger}\hat{A})=\rho(\hat{A}a)=\rho(\hat{A}b)=0$ and $\rho(\hat{A}^{\dagger})=\rho(\hat{A})^{*}$ , that is linear, $\rho(\lambda\hat{A}+\mu\hat{B})=\lambda\rho(\hat{A})+\mu\rho(\hat{B})$ , and that is normalized for a unit element ${\hat{\mathbf{1}}}$ , $\rho({\hat{\mathbf{1}}})=1$ . We obtain, using a Baker–Campbell–Hausdorff identity, the generating function

[TABLE]

which is a Gaussian characteristic function if the components of ${\bf f}=(f_{1},f_{2},f_{3},f_{4})$ are real–valued, and which is, as required, the same as in Eq. (5) if we set $\lambda=1$ and then $f_{1}=\lambda$ , $f_{2}=\mu$ , and $f_{3}=f_{4}=0$ . We define a pre–inner product

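$$({\bf f},{\bf g})\doteq\rho(\hat{F}_{\bf f}^{\dagger}\hat{F}_{\bf g})=\mathsf{k}_{B}\mathsf{T}\bigl[(f_{1}^{*}+\mathsf{j}f_{3}^{*})(g_{1}-\mathsf{j}g_{3})+(f_{2}^{*}+\mathsf{j}f_{4}^{*})(g_{2}-\mathsf{j}g_{4})\bigr],\tag{11}$$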

in terms of which the commutator is

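$$[\hat{F}_{\bf f},\hat{F}_{\bf g}]=({\bf f}^{*},{\bf g})-({\bf g}^{*},{\bf f})=2\mathsf{j}\mathsf{k}_{B}\mathsf{T}\bigl[f_{3}g_{1}-f_{1}g_{3}+f_{4}g_{2}-f_{2}g_{4}\bigr],$$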

so that for an arbitrary number of factors, again using the same Baker–Campbell–Hausdorff identity, we obtain for arbitrarily many unary operators $\hat{F}_{{\bf f}_{i}}$ the generating function

[TABLE]

The first expression preserves the conceptual separation of the thermal noise term and the incompatibility term, as labeled above, whereas the second expression, which applies a straightforward algebraic simplification, is often easier to use. From this generating function, we can use differentiation at $\lambda_{i}=0$ or inverse fourier transforms to construct the average value associated with any function of the $\hat{F}_{{\bf f}_{i}}$ . From Eq. (2), we can derive, for any operators $\hat{A}$ and $\hat{B}$ ,

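$$\rho(\hat{A}\,\mathrm{e}^{\mathsf{j}\lambda\hat{F}_{\bf f}}\mathrm{e}^{\mathsf{j}\mu\hat{F}_{\bf g}}\hat{B})=\mathrm{e}^{-\lambda\mu[({\bf f}^{*},{\bf g})-({\bf g}^{*},{\bf f})]}\rho(\hat{A}\,\mathrm{e}^{\mathsf{j}\mu\hat{F}_{\bf g}}\mathrm{e}^{\mathsf{j}\lambda\hat{F}_{\bf f}}\hat{B}),$$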

so that Eq. (2) fixes the algebraic structure, for the purposes of the state, as effectively that of the Weyl–Heisenberg group. Again using the same Baker–Campbell–Hausdorff identity, we can reduce any product of exponential factors to a single exponential of a sum,

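$$\rho(\hat{A}\,\mathrm{e}^{\mathsf{j}\lambda\hat{F}_{\bf f}}\mathrm{e}^{\mathsf{j}\mu\hat{F}_{\bf g}}\hat{B})=\mathrm{e}^{-\lambda\mu[({\bf f}^{*},{\bf g})-({\bf g}^{*},{\bf f})]/2}\rho(\hat{A}\,\mathrm{e}^{\mathsf{j}\hat{F}_{\lambda{\bf f}+\mu{\bf g}}}\hat{B}),$$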

so we can in general work with terms only of the form ${\mathrm{e}}^{\,{\mathsf{j}}\lambda\hat{F}_{\bf f}}$. Appendix A shows that Eq. (2) is a state in the abstract sense given in §3.
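The algebra of this section can be spot-checked numerically. The sketch below codes the pre–inner product $({\bf f},{\bf g})=\mathsf{k}_{B}\mathsf{T}[(f_{1}^{*}+\mathsf{j}f_{3}^{*})(g_{1}-\mathsf{j}g_{3})+(f_{2}^{*}+\mathsf{j}f_{4}^{*})(g_{2}-\mathsf{j}g_{4})]$ and confirms that $({\bf f}^{*},{\bf g})-({\bf g}^{*},{\bf f})$ reproduces the closed-form commutator $2\mathsf{j}\mathsf{k}_{B}\mathsf{T}[f_{3}g_{1}-f_{1}g_{3}+f_{4}g_{2}-f_{2}g_{4}]$. The single-factor generating function $\mathrm{e}^{-\lambda^{2}({\bf f}^{*},{\bf f})/2}$ used in `gaussian_generating` is an assumption, worked out here from the stated conditions on the state rather than quoted from the paper, and the test vectors are arbitrary.

```python
import cmath


def conj(f):
    """Componentwise complex conjugate of a 4-tuple."""
    return tuple(complex(x).conjugate() for x in f)


def pre_inner(f, g, kT):
    """(f, g): conjugate-linear in f, linear in g."""
    f1, f2, f3, f4 = conj(f)
    g1, g2, g3, g4 = (complex(x) for x in g)
    return kT * ((f1 + 1j * f3) * (g1 - 1j * g3)
                 + (f2 + 1j * f4) * (g2 - 1j * g4))


def commutator_from_inner(f, g, kT):
    """[F_f, F_g] written as (f*, g) - (g*, f)."""
    return pre_inner(conj(f), g, kT) - pre_inner(conj(g), f, kT)


def commutator_closed_form(f, g, kT):
    """2 j kT [f3 g1 - f1 g3 + f4 g2 - f2 g4]."""
    f1, f2, f3, f4 = f
    g1, g2, g3, g4 = g
    return 2j * kT * (f3 * g1 - f1 * g3 + f4 * g2 - f2 * g4)


def gaussian_generating(f, kT, lam=1.0):
    """Assumed form exp(-lam**2 (f*, f) / 2) of rho(exp(j lam F_f))."""
    return cmath.exp(-lam ** 2 * pre_inner(conj(f), f, kT) / 2)


kT = 0.7  # arbitrary illustrative value of k_B T
f = (0.3 + 0.1j, -1.2, 0.5j, 2.0)
g = (1.0, 0.4 - 0.7j, -0.3, 0.9j)

print(commutator_from_inner(f, g, kT), commutator_closed_form(f, g, kT))
print(pre_inner(f, f, kT))                       # real and non-negative
print(gaussian_generating((0.8, -1.3, 0, 0), kT))  # Gibbs characteristic function
```

Note that $({\bf f},{\bf f})$ vanishes whenever $f_{1}=\mathsf{j}f_{3}$ and $f_{2}=\mathsf{j}f_{4}$, so the form is degenerate, which is why it is only a pre–inner product.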

2.1 A signal analysis approach to the simple harmonic oscillator

Taking a signal analysis approach, we can consider the simple harmonic oscillator over time by taking ${\bf f}$ to be a function of time, ${\bf f}(t)=(f_{1}(t),f_{2}(t),f_{3}(t),f_{4}(t))$ , with the algebraic structure still given by Eq. (2), but with the time–translation invariant pre-inner product given as

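$$({\bf f},{\bf g})=\int\bigl(\tilde{\bf f}(k),\tilde{\bf g}(k)\bigr)\,2\pi\delta(k^{2}-1)\,\frac{\mathrm{d}k}{2\pi}.$$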

The time–like evolution is restricted to fourier modes for which $k=\pm 1$ , and under the fourier–mode integral we have used the pre–inner product defined above as part of the phase space construction, Eq. (11). This gives a 1+0–dimensional random field theory, which is better thought of as a field of measurements, indexed by functions such as ${\bf f}(t)$ , instead of as a field that is measured. The function ${\bf f}(t)$ , which tells us where in time our measurement is most focused, is known as a “window function” in signal analysis. As a simplest example, suppose that ${\bf f}(t)$ and ${\bf g}(t)$ have a Gaussian focus at time $t_{0}$ with parameter $\tau$ , ${\bf f}(t)={\bf f}_{0}{\mathrm{e}}^{-(t-t_{0})^{2}/2\tau^{2}}/\sqrt{2\pi\tau^{2}}$ , ${\bf g}(t)={\bf g}_{0}{\mathrm{e}}^{-(t-t_{0})^{2}/2\tau^{2}}/\sqrt{2\pi\tau^{2}}$ , so that

[TABLE]

Unsurprisingly, the thermal noise is less apparent if we average over long intervals, with large $\tau$ , but we obtain the same result as for the phase space formalism if we average over short intervals, with small $\tau$ . If the oscillator interacts time–invariantly with an external time–like invariant noise, then we would expect all frequencies to be driven at different amplitudes, so that the time–translation invariant pre-inner product would be

[TABLE]

where $\tilde{G}(k)$ is the “noise kernel”: unless a model for a real–world system includes absolutely every degree of freedom, we would have to expect that $\tilde{G}(k)$ will not be perfectly focused on a single fourier mode. An analysis that freely uses mutually noncommutative integral transformations as needed —fourier analysis!— is a commonplace in classical signal analysis, whereas noncommutative transformations have effectively been banned from classical mechanics for a hundred years by fiat, leaving classical mechanics as a straw man.

Anticipating §8, we can extend this structure to a Poincaré invariant state over a classical Klein–Gordon field just by changing the pre–inner product to

[TABLE]

which can be shown to be equivalent to the quantized complex Klein–Gordon field[7, Appendix B].

3 The GNS–construction

We have so far obtained only a single state over the algebra of unary operators associated with the simple harmonic oscillator. The Gelfand–Naimark–Segal construction allows us to construct a Hilbert space if we are given a single state over a $*$ –algebra, which is somewhat different from a commonplace presentation of quantum theory, in which a Hilbert space is prior to the states we can construct using vectors in the Hilbert space. With this approach, the definition of a state replaces the introduction of the Born rule. We here abridge the elementary account given by Haag[2, §III.2.2] (see also [28, §14.1.3]).

A state over a $*$ –algebra $\mathcal{A}$ is a linear, positive, and normalized map, $\rho:\mathcal{A}\rightarrow\mathbb{C}$ , satisfying

[TABLE]

and also commutes with the adjoint operation, $\rho(\hat{A}^{\dagger})=\rho(\hat{A})^{*}$ . Such a linear form defines a Hermitian scalar product for a vector space of operators $|\hat{A}\rangle$ , $\langle\hat{A}|\hat{B}\rangle{\doteq}\rho(\hat{A}^{\dagger}\hat{B}){=}\bigl{[}\rho(\hat{B}^{\dagger}\hat{A})\bigr{]}^{*}\!$ , which is positive semi–definite, $\langle\hat{A}|\hat{A}\rangle\geq 0$ , and which can be refined to a Hermitian inner product over equivalence classes, because for unary operators $\hat{I}$ and $\hat{J}$ for which $\langle\hat{I}|\hat{I}\rangle=\langle\hat{J}|\hat{J}\rangle=0$ , the Schwarz inequality, $|\langle\hat{A}|\hat{B}\rangle|^{2}\leq\langle\hat{A}|\hat{A}\rangle\langle\hat{B}|\hat{B}\rangle$ , ensures that $\langle\hat{A}+\hat{I}|\hat{B}+\hat{J}\rangle=\langle\hat{A}|\hat{B}\rangle$ .
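As a finite–dimensional illustration of the properties just listed, the following sketch checks them numerically for a toy state. The choice $\rho(\hat{A})=\mathsf{Tr}[\hat{D}\hat{A}]$ with a rank–deficient diagonal matrix $\hat{D}$, and the names `random_matrix`, `rho`, `inner`, `I_null`, and `J_null`, are assumptions made only for this example; they are not the Gibbs state of Eq. (2).

```python
import numpy as np

rng = np.random.default_rng(0)

def random_matrix(n):
    """Hypothetical helper: a random complex n x n matrix."""
    return rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))

# Assumed toy state: rho(A) = Tr[D A] with a rank-deficient density matrix D,
# so that some nonzero operators have zero norm.
D = np.diag([0.7, 0.3, 0.0])

def rho(A):
    return np.trace(D @ A)

def inner(A, B):
    """Pre-inner product <A|B> = rho(A^dagger B)."""
    return rho(A.conj().T @ B)

A, B = random_matrix(3), random_matrix(3)

# Null operators: anything that annihilates the support of D.
I_null = random_matrix(3) @ np.diag([0.0, 0.0, 1.0])
J_null = random_matrix(3) @ np.diag([0.0, 0.0, 1.0])

hermitian_ok = np.isclose(inner(A, B), np.conj(inner(B, A)))
positive_ok = inner(A, A).real >= 0 and np.isclose(inner(A, A).imag, 0)
schwarz_ok = abs(inner(A, B)) ** 2 <= (inner(A, A) * inner(B, B)).real + 1e-12
null_ok = np.isclose(inner(I_null, I_null), 0)
# Adding zero-norm operators does not change the inner product,
# which is what allows the passage to equivalence classes.
class_ok = np.isclose(inner(A + I_null, B + J_null), inner(A, B))

print(hermitian_ok, positive_ok, schwarz_ok, null_ok, class_ok)
```

In this toy model the equivalence classes are simply the operators modulo those that vanish on the support of $\hat{D}$.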

We can take the vector corresponding to the unit element, $|{\hat{\mathbf{1}}}\rangle$ , to be the Gibbs equilibrium vector of the Hilbert space, $\rho(\hat{A})=\langle{\hat{\mathbf{1}}}|\hat{A}|{\hat{\mathbf{1}}}\rangle$ , then we can construct new vectors in the Hilbert space as $\hat{A}|{\hat{\mathbf{1}}}\rangle=|\hat{A}\rangle$ , using an arbitrary function of $\hat{q}$ , $\hat{p}$ , $\hat{Q}$ , and $\hat{P}$ , with completion in the Hilbert space norm that is given by the Hermitian inner product. The Gibbs equilibrium vector is thus an object that we modulate by multiplication, which makes it appropriate to consider adopting $|{\hat{\mathbf{1}}}\rangle$ as an alternative notation for the Gibbs equilibrium vector (in contrast to the usual $|0\rangle$ for the ground or vacuum state, as a zero eigenstate of all lowering operators, which greatly underplays the potency of an equilibrium or vacuum state). New states, which should be contrasted with new vectors in the Hilbert space, can be constructed as

[TABLE]

or as convex sums or integrals of this construction. For this construction, we do not necessarily need to know the structure of the $*$–algebra $\mathcal{A}$ to construct the Hilbert space, if we are given or can in some way show that $\rho$ is a state, as Appendix A shows for Eq. (2): the state fixes a representation of a class of $*$–algebras. A state is necessary for there to be a concrete connection between the algebraic structure and the statistics of a finite set of experimental raw data, so the structure of the particular representation given by a state is arguably all that is accessible.

A full account would be much more elaborate; however, the bare bones of the GNS–construction are relatively elementary, with the construction of new states and new vectors being largely familiar. The need to introduce equivalence classes and to invoke completion in the norm introduces some difficulty, but these do not have to detain us at an elementary level.

4 A modified Hamiltonian

A modified Hamiltonian function determines a different dynamics than for the simple harmonic oscillator, but in this section we continue using a Poisson bracket that is globally defined as $\{u,v\}(q,p)=\frac{\partial u}{\partial p}\frac{\partial v}{\partial q}-\frac{\partial u}{\partial q}\frac{\partial v}{\partial p}$ , as for Eq. (2), so that we still work with representations of the Heisenberg algebra and of the Weyl–Heisenberg group that are generated by $q$ , $\partial/\partial q$ , and $p$ , $\partial/\partial p$ . For a Hamiltonian $H(q,p)$ , we replace Eq. (6) by the Gibbs equilibrium probability density

[TABLE]

which has as generating function the fourier transform

[TABLE]

If we extend this state to be also over the differential operators $\partial/\partial q$ and $\partial/\partial p$ , then it generates a representation of the Weyl–Heisenberg group, which therefore must be unitarily equivalent to the representation generated for the simple harmonic oscillator[28, §11.5.7], if the representation is irreducible. Consequently, for some unitary operator $\hat{U}_{\!H}$ , using the state defined for the simple harmonic oscillator by Eq. (2) (possibly for a different value of ${\mathsf{k}_{\!B}\!\mathsf{T}}$ , depending on the scale of the Hamiltonian function),

[TABLE]

If the representation is reducible, which in general will be the case if the state represents a system that is in mechanical or thermodynamic contact with other systems, $\rho_{H}$ will be equal to a convex sum or integral of such expressions for different values of ${\mathsf{k}_{\!B}\!\mathsf{T}}$ and for different unitaries $\hat{U}_{\!H}$ . For our purposes here, therefore, we will consider in Section 6 only properties of the state and Hilbert space defined by Eq. (2).

We can approach this mathematics more concretely: we can model any finite set of data by a probability density $P_{H}(\mathring{q},\mathring{p})$ that can be written as a convex sum of displaced Gaussian probability densities, at some temperature ${\mathsf{k}_{\!B}\!\mathsf{T}}^{\prime}$ ,

[TABLE]

so that in characteristic function terms the convolution becomes a product,

[TABLE]

because we can always approximate that finite set of data in such a way that for a small enough choice of ${\mathsf{k}_{\!B}\!\mathsf{T}}^{\prime}$ ,

[TABLE]

has an inverse fourier transform. This is a strong constraint on the fourier transform of $P_{H}(\mathring{q},\mathring{p})$ , but any finite set of data is compatible with it. Because $\hat{Q}$ and $\hat{P}$ generate translations, we have for any function, ${\mathrm{e}}^{\alpha\hat{Q}}F(\hat{q}){\mathrm{e}}^{-\alpha\hat{Q}}=F(\hat{q}+\alpha)$ and ${\mathrm{e}}^{\beta\hat{P}}F(\hat{p}){\mathrm{e}}^{-\beta\hat{P}}=F(\hat{p}+\beta)$ , so we can construct a state $\rho_{H}$ as a convex sum of displaced simple harmonic oscillators,

[TABLE]

which is a convex sum of elementary examples of Eq. (25). As for state tomography in quantum mechanics, however, just knowing the probability density $P_{H}(\mathring{q},\mathring{p})$ does not fix the state $\rho_{H}$ uniquely: we can, for example, generate exactly the same probability density $P_{H}(\mathring{q},\mathring{p})$ using

[TABLE]

(which is a convex sum of slightly less elementary examples of Eq. (25)), where $F(\alpha,\beta,\hat{q},\hat{p})^{\dagger}=F(\alpha,\beta,\hat{q},\hat{p})$ , any self–adjoint function, commutes with $\hat{q}$ and $\hat{p}$ , but transforms the expected measurement results for ${\mathsf{j}}\hat{Q}$ and ${\mathsf{j}}\hat{P}$ . It seems unlikely that a decomposition as a convex sum of displaced simple harmonic oscillators will be the optimal construction for use in physics, but it is perhaps the simplest example.

5 A modified Hamiltonian and a modified Poisson bracket structure

If there are constraints, the elementary observables form a commutative, associative algebra $\mathcal{A}$ of functions on a phase space manifold $\mathcal{P}$ , $u:\mathcal{P}\rightarrow\mathbb{R}$ , with a binary multiplication operation, together with a nontrivial binary Poisson bracket operation, $\cdot:u,v\mapsto u\cdot v$ , $\{,\}:u,v\mapsto\{u,v\}$ , which cannot be written globally as in Eq. (2). We can use the multiplication and Poisson bracket operations to construct two sets of unary operators,

[TABLE]

which satisfy the commutation relations

[TABLE]

$\hat{Z}_{u}$ is called the Hamiltonian vector field of $u$ [29, §3.2], however this is a slightly unfortunate name insofar as only the Poisson bracket is used in its construction. The whole algebra $\mathcal{C}$ is a semi–direct product of the $\hat{Z}_{u}$ unary operators acting adjointly on the commutative subalgebra $\mathcal{C}_{Y}$ that is generated by the $\hat{Y}_{u}$ unary operators, with the latter being naturally isomorphic to the multiplicative, non–Poisson bracket part of $\mathcal{A}$ , $u\mapsto\hat{Y}_{u}$ , $\hat{Y}_{u}(\hat{Y}_{v}(w))=\hat{Y}_{u\cdot v}(w)$ , and $\lambda\hat{Y}_{u}(w)+\mu\hat{Y}_{v}(w)=\hat{Y}_{\lambda u+\mu v}(w)$ , with $\hat{Y}_{1}$ as the unit element, $\hat{Y}_{1}(w)=w$ .

$\mathcal{C}$ is not in general isomorphic to a Heisenberg algebra unless the classical dynamics can be written globally as in Eq. (2), so we cannot proceed as we did in §4, leading to Eq. (25). Instead, we begin by using a nontrivial Gibbs equilibrium state over $\mathcal{A}$ to construct a state over the commutative subalgebra $\mathcal{C}_{Y}$ , normalized so that $\rho(\hat{Y}_{1})=1$ ,

[TABLE]

with an adjoint defined as $\hat{Y}_{u}^{\dagger}=\hat{Y}_{u}$, $(\hat{A}\hat{B})^{\dagger}=\hat{B}^{\dagger}\hat{A}^{\dagger}$, and with an imaginary ${\mathsf{j}}^{\dagger}=-{\mathsf{j}}$ introduced, as for the simple harmonic oscillator, as an efficient way to discuss the fourier sine and cosine transforms of probability distributions. This is enough to naturally construct a state over the whole algebra $\mathcal{C}$ if we introduce the Gibbs equilibrium projection operator $\hat{V}$, which we can define abstractly by $\rho(\hat{A}_{1}\hat{V}\cdots\hat{V}\hat{A}_{n})\,{=}\,\prod_{i}\rho(\hat{A}_{i})$ (in a more elementary approach, we can define $\hat{V}=|{\hat{\mathbf{1}}}\rangle\langle{\hat{\mathbf{1}}}|$). A general element of the resulting algebra is $\hat{A}+\sum_{i}\hat{X}_{i}\hat{V}\hat{A}_{i}$, where $\hat{A},\hat{A}_{i}\in\mathcal{C}_{Y}$ and each $\hat{X}_{i}$ may include $\hat{V}$ factors. For this general element

[TABLE]

so that by induction a state over $\mathcal{C}_{Y}$ can be extended naturally to be a state over an algebra $\mathcal{C}_{+}$ of all transformations that can be constructed as a limit (in a suitable topology) of forms such as $\sum_{i,j}\alpha_{i,j}|\hat{A}_{i}\rangle\langle\hat{A}_{j}|$, with $|\hat{A}_{i}\rangle$ a countable basis of the Hilbert space $\mathcal{H}$ that is generated by the GNS–construction’s action of $\mathcal{C}_{Y}$ on $|{\hat{\mathbf{1}}}\rangle$. $\mathcal{C}_{+}$ can include, for example, the algebra of bounded operators acting on $\mathcal{H}$, $\mathcal{B}(\mathcal{H})$. The superoperators that can be constructed using $\mathcal{C}_{+}$ include a representation of $\mathcal{C}$ as a subalgebra because $\mathcal{C}$ contains only $\mathcal{C}_{Y}$ and a subalgebra of adjoint actions on $\mathcal{C}_{Y}$. As for unconstrained classical mechanics, we can allow all of $\mathcal{C}_{+}$ as observables if it is empirically necessary or useful to do so, and construct both pure and mixed states over the algebra of observables. It can be argued from a Dutch book perspective that it is necessary to construct all such states to describe some circumstances adequately[30].

Note that we have here weaponized the Gibbs equilibrium projection operator in a very global way, because $\hat{V}$ does not commute with any operator in $\mathcal{C}_{Y}$ , which is anathema to, for example, a local quantum physics perspective[2] but is a commonplace in practical physics: whenever we use the Born rule to generate a transition probability such as $|\langle\hat{B}|\hat{A}\rangle|^{2}$ , we implicitly use a measurement operator $|\hat{B}\rangle\langle\hat{B}|=\hat{B}\hat{V}\hat{B}^{\dagger}$ in a state with density operator $|\hat{A}\rangle\langle\hat{A}|=\hat{A}\hat{V}\hat{A}^{\dagger}$ .
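The role of $\hat{V}$ can be checked in a small matrix model. The sketch below assumes a four–dimensional Hilbert space, a random unit vector `omega` standing in for $|{\hat{\mathbf{1}}}\rangle$, and random matrices standing in for $\hat{A}$ and $\hat{B}$; all of these names and choices are illustrative assumptions rather than the construction over $\mathcal{C}_{Y}$ itself.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 4  # assumed toy dimension

def random_matrix(n):
    """Hypothetical helper: a random complex n x n matrix."""
    return rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))

omega = rng.normal(size=n) + 1j * rng.normal(size=n)
omega /= np.linalg.norm(omega)        # stands in for |1>
V = np.outer(omega, omega.conj())     # V = |1><1|

def rho(X):
    """rho(X) = <1|X|1>."""
    return omega.conj() @ X @ omega

A, B = random_matrix(n), random_matrix(n)

# Abstract definition of V: rho(A V B) = rho(A) rho(B).
factorizes = np.isclose(rho(A @ V @ B), rho(A) * rho(B))

# Born rule: |<B|A>|^2 = Tr[(B V B^dagger)(A V A^dagger)].
ket_A, ket_B = A @ omega, B @ omega
overlap = np.vdot(ket_B, ket_A)       # <B|A>; vdot conjugates its first argument
measurement = B @ V @ B.conj().T      # |B><B|
density = A @ V @ A.conj().T          # |A><A| (unnormalized)
born_ok = np.isclose(np.trace(measurement @ density), abs(overlap) ** 2)

# V does not commute with a generic operator.
commutes = np.allclose(V @ A, A @ V)

print(factorizes, born_ok, commutes)
```

The vectors $|\hat{A}\rangle$ and $|\hat{B}\rangle$ are left unnormalized here because the identity for the transition probability holds term by term without normalization.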

Constrained classical mechanics can in any case be modeled in the first instance as a limit of unconstrained classical mechanics, as in §4, for which the Hamiltonian function of systems that are not on the constraint surface (which is taken to be embedded in an unconstrained phase space) becomes arbitrarily large, so that in the limit the Gibbs equilibrium distribution assigns zero probability except on the constraint surface. At the level of experimental raw data, we cannot confirm that a real experimental apparatus always lies absolutely precisely on a singular constrained surface. We also note that quantum field theory has historically only constrained the algebra of observables to be the commutant of the symmetries of the dynamics, which mathematicians typically also restrict to $\mathcal{B}(\mathcal{H})$ , which for classical mechanics we can construct as the double commutant of the Liouvillian, $\mathcal{B}(\mathcal{H})\supset\mathcal{C}=\{\hat{L}\}^{\prime\prime}$ . For our purposes here, therefore, we will consider below, as for unconstrained classical mechanics, only properties of the state and the Hilbert space defined by Eq. (2).

6 Measurements and states

We first focus on $\hat{q}$ and ${\mathsf{j}}\hat{Q}$ measurements for the simple harmonic oscillator, with $\hat{p}$ and ${\mathsf{j}}\hat{P}$ measurements being closely comparable. As above, for the Gibbs equilibrium state, Eq. (6),

[TABLE]

gives a Gaussian probability density that the position observable will be near $\mathring{q}$ . In classical mechanics, we can use the Poisson bracket to generate unitary transformations such as $\hat{U}={\mathrm{e}}^{\kappa\hat{Q}}$ , which acts to translate the probability density, giving a different state, a modulated form of the Gibbs equilibrium state, for which

[TABLE]

or we can say that this is a translated measurement of the Gibbs equilibrium state. We can further introduce convex mixtures of such states for different values of $\kappa$ , such as $\rho_{B}\bigl{(}\delta(\hat{q}-\mathring{q})\bigr{)}=\sum_{i}\beta_{i}\langle{\hat{\mathbf{1}}}|\hat{U}^{\dagger}_{i}\delta(\hat{q}-\mathring{q})\hat{U}_{i}|{\hat{\mathbf{1}}}\rangle$ , with $\sum_{i}\beta_{i}=1$ , or, again, we can say that this is a more general measurement of the Gibbs equilibrium state. We can modulate the Gibbs equilibrium state using arbitrary polynomials of $\hat{q}$ , taking the fourier transform with respect to $\mathring{q}$ , differentiating a three factor version of the generating function Eq. (2), and taking the inverse fourier transform back to the $\mathring{q}$ ,

[TABLE]

so from this classical mechanics perspective we have constructed modulations of the Gibbs equilibrium state probability density, and indeed we can by superposition and mixture construct arbitrary positive multinomial modulations of the Gibbs equilibrium state probability density, using

[TABLE]

provided the normalization constant $\mathcal{N}$ exists. Recalling §4, such states can also be thought of as the ground state probability density associated with some different Hamiltonian function.

For a random field theory in operator form, such modulation of probability densities extends to there being different modulations of the Poincaré invariant vacuum state in different regions of space–time, a higher order analog of modulating a single–frequency carrier signal[7]. Eq. (35), for example, can be written using the pre–inner product, rather more abstractly, as

[TABLE]

so that more or less displacement occurs depending on the imaginary part of the pre–inner product $({\bf f},{\bf g})$ . We can understand the vacuum state as a noisy, higher order carrier, for which the noise can be considered a valuable resource for quantum computation purposes. Note that this way of working with probabilities gives us a higher order mathematical structure than a classical field.

We can also measure components of a transformed state such as $\rho_{B}(\hat{A})$ . For the Gibbs equilibrium state component, we can use the Gibbs equilibrium state projection operator $\hat{V}=|{\hat{\mathbf{1}}}\rangle\langle{\hat{\mathbf{1}}}|$ ,

[TABLE]

or, using $|v(\hat{q},\hat{p})\rangle\langle v(\hat{q},\hat{p})|$ , we can measure any other component,

[TABLE]

This kind of construction is routine in quantum mechanics, but we can think of the Gibbs equilibrium state projection operator as also a classically natural measure of how much a given state is like or unlike the Gibbs equilibrium state, which we can use to construct comparisons with any modulated form of the Gibbs equilibrium state. We can construct self–adjoint operators such as $|v_{1}\rangle\langle v_{2}|+|v_{2}\rangle\langle v_{1}|$ , for two functions $v_{1}(\hat{q},\hat{p})$ and $v_{2}(\hat{q},\hat{p})$ , as

[TABLE]

More generally, we can use arbitrary numbers of functions $v_{i}(\hat{q},\hat{p})$ , or even more generally we can use limits in an appropriate topology of sequences of such constructions, so that using the Gibbs equilibrium state projection operator allows us to construct very general self–adjoint operators.

We can also use the well–known construction of a lowering operator $a_{q}$ as an unbounded operator for which $[a_{q},a^{\dagger}_{q}]=1$ and $a_{q}|{\hat{\mathbf{1}}}\rangle=0$ ,

[TABLE]

where $H_{m}(\hat{q})$ are orthonormal polynomials in $\hat{q}$ for which $\langle H_{m}(\hat{q})|H_{n}(\hat{q})\rangle=\delta_{m,n}$ , constructed using $|H_{0}(\hat{q})\rangle=|{\hat{\mathbf{1}}}\rangle$ , $\hat{q}|H_{m}(\hat{q})\rangle$ , and the Gram–Schmidt algorithm. Given this kind of construction, we can think of $\hat{q}$ and $\hat{Q}$ as complementary assessments of the physical state, as systematically weighted sums and differences of how much a given state is like each of many possible states.
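The Gram–Schmidt step can be sketched numerically. The example below assumes that the $\hat{q}$ marginal of the Gibbs equilibrium state is a zero–mean Gaussian with variance `SIGMA2` (set to 1 purely for illustration), so that inner products of polynomials reduce to Gaussian moments; the function names are hypothetical.

```python
import numpy as np
from numpy.polynomial import Polynomial

SIGMA2 = 1.0  # assumed variance of the Gaussian q-marginal (illustrative value)

def gaussian_moment(n, sigma2=SIGMA2):
    """E[q^n] for a zero-mean Gaussian: 0 for odd n, sigma^n (n-1)!! for even n."""
    if n % 2:
        return 0.0
    double_factorial = 1.0
    for k in range(n - 1, 0, -2):
        double_factorial *= k
    return double_factorial * sigma2 ** (n // 2)

def inner(u, v):
    """<u|v> = E[u(q) v(q)] for real polynomials u and v."""
    product = u * v
    return float(sum(c * gaussian_moment(k) for k, c in enumerate(product.coef)))

def orthonormal_polynomials(count):
    """Gram-Schmidt on 1, q*H_0, q*H_1, ... starting from H_0 = 1."""
    q = Polynomial([0.0, 1.0])
    basis = []
    candidate = Polynomial([1.0])
    for _ in range(count):
        for h in basis:
            candidate = candidate - h * inner(h, candidate)
        candidate = candidate / np.sqrt(inner(candidate, candidate))
        basis.append(candidate)
        candidate = q * candidate   # next vector to orthogonalize: q |H_m>
    return basis

basis = orthonormal_polynomials(5)
gram = np.array([[inner(u, v) for v in basis] for u in basis])
print(np.round(gram, 10))
print(basis[2].coef)   # coefficients of H_2 in increasing powers of q
```

With unit variance the result coincides, up to normalization, with the probabilists' Hermite polynomials, for example $H_{2}(q)=(q^{2}-1)/\sqrt{2}$.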

We can construct transformations of measurements and of states in many different ways, where different mathematical tools will correspond to different physically available objects: apparatus such as diffraction gratings, half–wave plates, et cetera. Transformations, which may or may not mutually commute, may be implemented in hardware, where the experimental apparatus is engineered so that a single measurement result is a function of many measurement results that could have been but are not performed, or in software, where a single measurement result is computed as a function of the records of many measurement results that have been performed, as will be the case in the Bell inequality–violating example in §7.2. The cycle of consecutive calibrations of mathematical models of new measurement apparatus relative to well–understood state preparations and then of mathematical models of new state preparations relative to well–understood measurement apparatus has a centuries–long history[31].

6.1 Minimal and extended classical mechanics

We can present the measurements that can be used in classical mechanics, for our purposes here, in three ways:

1. Functions on phase space $u(q,p)$, with multiplication (naïve CM): this is a commutative algebra, with addition and multiplication at a point.

2. Functions on phase space $u(q,p)$, with multiplication and the Poisson bracket: having three operations, this is not a straightforward algebra at all. However, we can convert it into a straightforward associative, non–commutative algebra of unary operators, generated by $\hat{q}$, $\hat{p}$, $\hat{Q}$, and $\hat{P}$, with $[\hat{Q},\hat{q}]=1$ and $[\hat{P},\hat{p}]=1$.

Now there are two choices:

(a) The Poisson bracket generated unary operators act only as transformations that leave the algebra of functions on phase space invariant. We allow the use of $\exp(-\kappa\hat{Q}){\cdot}\hat{q}{\cdot}\exp(\kappa\hat{Q}){=}\hat{q}\,{-}\,\kappa$, but we do not allow the use of, for example, $\exp(-\kappa\hat{Q}^{3}){\cdot}\hat{q}{\cdot}\exp(\kappa\hat{Q}^{3}){=}\hat{q}\,{-}\,3\kappa\hat{Q}^{2}$ or any other construction that gives an operator that is not a function of only $\hat{q}$ and $\hat{p}$. The Poisson bracket binary operation phase space formalism for classical mechanics only allows this case, because $\{u,v\}(q,p)$ is indeed just another function on phase space. Call this CM0.

(b) The Poisson bracket generated unary operators $\hat{Q}$ and $\hat{P}$ have the same standing as the $\hat{q}$ and $\hat{p}$ operators. The general unary operator is a function $u(\hat{q},\hat{p},\hat{Q},\hat{P})$, which is natural for a Koopman–type Hilbert space formalism for classical mechanics. With this construction, Bell inequalities can be violated, for example, because the algebra of operators is noncommutative[32, 33, 34]. Call this CM+.

Adopting CM+ steps outside of the CM0 that is natural for a phase space formalism for classical mechanics into what is natural for a Hilbert space formalism for classical mechanics. A classical physicist can reasonably use CM+ as a convenient, classical tool, and cannot reasonably be stopped from using it, but does not have to use it.
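The two conjugation identities quoted for CM0 and CM+ can be verified exactly on polynomials. The sketch below assumes the representation $\hat{Q}=\partial/\partial q$ acting on polynomials in $q$, which satisfies $[\hat{Q},\hat{q}]=1$; the exponential series then terminates, so no truncation error is introduced. The function names and the test polynomial are hypothetical.

```python
from math import factorial

import numpy as np
from numpy.polynomial import Polynomial

q = Polynomial([0.0, 1.0])

def exp_kQm(w, kappa, m):
    """Apply exp(kappa * Q**m), with Q = d/dq (assumed), to a polynomial w.

    The series sum_n kappa^n / n! * d^(m n) w / dq^(m n) terminates because
    high enough derivatives of a polynomial vanish.
    """
    result = Polynomial([0.0])
    n = 0
    while m * n <= w.degree():
        term = w.deriv(m * n) if n > 0 else w
        result = result + term * (kappa ** n / factorial(n))
        n += 1
    return result

def conjugate_q(w, kappa, m):
    """Apply exp(-kappa Q^m) . q . exp(kappa Q^m) to w, rightmost factor first."""
    return exp_kQm(q * exp_kQm(w, kappa, m), -kappa, m)

w = Polynomial([1.0, -2.0, 0.5, 3.0, 0.0, 1.0])   # arbitrary test polynomial
kappa = 0.7
x = np.linspace(-2.0, 2.0, 9)

# CM0 case: exp(-kappa Q) q exp(kappa Q) = q - kappa, still a function of q only.
cm0_ok = np.allclose(conjugate_q(w, kappa, 1)(x), ((q - kappa) * w)(x))

# CM+ case: exp(-kappa Q^3) q exp(kappa Q^3) = q - 3 kappa Q^2, which involves Q.
cm_plus_ok = np.allclose(conjugate_q(w, kappa, 3)(x),
                         (q * w - 3 * kappa * w.deriv(2))(x))

print(cm0_ok, cm_plus_ok)
```

The first identity returns a multiplication operator, whereas the second returns a second–order differential operator, which is the distinction between the two choices.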

6.2 Modeling finite experimental data using matrix algebras for measurements and states

We can always present any finite amount of information about state preparations and measurements using a commutative algebra of matrices, because if we use diagonal matrices of high enough dimension $N$ we can always solve the $mn$ equations $A_{ij}=\mathsf{Tr}[\hat{M}_{i}\hat{\rho}_{j}]$ given, say, by a set of $mn$ average values of experimental raw data $\{A_{ij},i=1..m,j=1..n\}$ for the components of $m$ diagonal measurement matrices $\hat{M}_{i}$ and the components of $n$ diagonal density matrices $\hat{\rho}_{j}$ . It is often much more convenient, however, indeed significantly advantageous, as we know well from quantum mechanics, to solve for $\hat{M}_{i}$ and $\hat{\rho}_{j}$ as self-adjoint matrices of dimension ${\ll}N$ (this process can be made to work even if we do not have averages for all $mn$ cases). We can look for a dimension ${\ll}N$ for which the information looks “nicest”, in some sense, and we have a lot of engineering information about what numbers of dimensions work well as a first approximation for a given experiment: this process is known as quantum tomography, where “the aim is to estimate an unknown state from outcomes of measurements performed on an ensemble of identical prepared systems”[35]. Classical mechanics can equally well adopt this kind of construction and call it a system of contextual models.
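One very simple way to see that a commutative presentation always exists is sketched below. It assumes a hypothetical $m\times n$ table `data` of averages $A_{ij}$ and uses diagonal matrices of dimension $N=n$: each state is a single diagonal unit entry and each measurement carries one row of the table on its diagonal. This is only the most direct construction, chosen for brevity, and is not claimed to be the one intended in the text.

```python
import numpy as np

rng = np.random.default_rng(2)
m, n = 3, 4
data = rng.normal(size=(m, n))   # hypothetical averages A_ij of raw data

N = n  # dimension used by this particular construction
states = [np.diag(np.eye(N)[j]) for j in range(n)]      # rho_j = diag(e_j)
measurements = [np.diag(data[i]) for i in range(m)]      # M_i = diag(A_i1..A_in)

reconstructed = np.array([[np.trace(M @ r) for r in states]
                          for M in measurements])

# Every pair of measurement matrices commutes because they are all diagonal.
all_commute = all(np.allclose(M1 @ M2, M2 @ M1)
                  for M1 in measurements for M2 in measurements)

print(np.allclose(reconstructed, data), all_commute)
```

Each $\hat{\rho}_{j}$ here is a valid density matrix (positive, unit trace), but the model has one dimension per state preparation, which is what makes lower–dimensional noncommutative models attractive by comparison.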

In quantum mechanics in practice, we prepare many states and measure them in many ways; in quantum mechanics as a global metaphysics, however, we more often think of there being a single state measured in many ways, in which case in solving $A_{i}=\mathsf{Tr}[\hat{M}_{i}\hat{\rho}]$ there is at least one basis in which $\hat{\rho}$ is a diagonal matrix $\rho_{j}\delta_{jk}$, so that in that basis $\mathsf{Tr}[\hat{M}_{i}\hat{\rho}]=\sum_{j}M_{i,jj}\rho_{j}$. If there is only one state, all measurements can be taken to commute, because off–diagonal entries in such a basis can be taken to be zero, so the one–state metaphysics of quantum mechanics can be taken to be the same as the one–state metaphysics of the CM0 of classical physics. If we ever consider subensembles, however, the possibility of it being advantageous to use CM+ re–emerges.
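The single–state claim can be illustrated numerically. The sketch assumes a random four–dimensional density matrix and a random self–adjoint measurement matrix (hypothetical stand–ins for $\hat{\rho}$ and $\hat{M}_{i}$), diagonalizes the state, and confirms that only the diagonal entries of the measurement in that basis contribute to the expected value.

```python
import numpy as np

rng = np.random.default_rng(3)
d = 4  # assumed toy dimension

def random_hermitian(d):
    """Hypothetical helper: a random self-adjoint d x d matrix."""
    X = rng.normal(size=(d, d)) + 1j * rng.normal(size=(d, d))
    return (X + X.conj().T) / 2

# A random density matrix: positive semi-definite with unit trace.
G = rng.normal(size=(d, d)) + 1j * rng.normal(size=(d, d))
rho = G @ G.conj().T
rho /= np.trace(rho).real

M = random_hermitian(d)

# Basis in which rho is diagonal: rho = U diag(weights) U^dagger.
weights, U = np.linalg.eigh(rho)
M_in_basis = U.conj().T @ M @ U

full = np.trace(M @ rho).real
diag_only = float(np.sum(np.diag(M_in_basis).real * weights))

# Replacing M by its diagonal part in this basis changes nothing for this state.
M_diag = U @ np.diag(np.diag(M_in_basis)) @ U.conj().T
commutes_with_rho = np.allclose(M_diag @ rho, rho @ M_diag)

print(np.isclose(full, diag_only), commutes_with_rho)
```

The diagonal replacement depends on the state, so it stops being available as soon as a second state that does not commute with the first is brought into the same model.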

At a higher level, we might also find it convenient to present information using Positive Operator Valued Measures (POVMs), using a smaller Hilbert space, even though we know by Neumark’s theorem that we can always present the same information using Projection Valued Measures (PVMs) by introducing an ancilla system to construct a larger Hilbert space[36, §II.2.4]. Again, however, we do not as classical physicists have to use this construction: it is just there for us to use if it is convenient to do so.

7 Classical thinking about states and measurement

We will develop here the idea that thinking in terms of CM+ can illuminate quantum mechanics, even if we never use CM+. Many more aspects could be considered, but we will focus here on a partial solution of the measurement problem in §7.1, the violation of Bell–type inequalities in §7.2, and Schrödinger’s cat in §7.3.

7.1 The measurement problem

We will take the measurement problem of quantum theory to concern the dynamics of “collapse” or “reduction” of the state when a measurement happens, which can be thought an unwanted contrast to the unitary dynamics that applies at all other times. For our purposes in this subsection, we will exclude aspects of the measurement problem that concern the interpretation of probability, such as the relationship between probability and statistics and other uses of experimental raw data such as Bayesian updating, because this is also an issue for classical physics. We will also filter the extensive literature by insisting that whatever we say about measurement must seem natural to a classical physicist who decides to adopt CM+. To that end, the principle we will apply is

Principle (JM): if we perform joint measurements that result in joint relative frequencies, then we must use mutually commutative self–adjoint operators to model those measurements.

Principle (JM) is quite natural for a classical physicist who has previously always used CM0, and, as argued in §6.2, we can always model finite experimental data using only commutative operators, but it is subtly not as natural in quantum mechanics, partly because although at space–like separation measurement operators are required to commute, at time–like separation measurement operators may or may not commute and we have instead become accustomed to invoking “collapse” of the state. Principle (JM) is unnecessary, however, except as emphasis, if we insist that all measurements must be modeled by self-adjoint operators, insofar as if $\hat{A}^{\dagger}{=}\,\hat{A}$ and $\hat{X}^{\dagger}{=}\,\hat{X}$ are both self-adjoint then $\hat{A}\hat{X}$ can only be self-adjoint if $[\hat{A},\hat{X}]\,{=}\,0$ . More explicitly, if $\hat{A}^{\dagger}{=}\,\hat{A}$ , $\hat{X}^{\dagger}{=}\,\hat{X}$ , and a density operator $\hat{\rho}^{\dagger}{=}\,\hat{\rho}$ are all mutually noncommutative, $[\hat{A},\hat{X}]\not=0$ , $[\hat{A},\hat{\rho}]\not=0$ , $[\hat{X},\hat{\rho}]\not=0$ , then

[TABLE]

has an imaginary part. Unless we ensure that either $\hat{A}$ or $\hat{X}$ always commutes with the density operator, we cannot model actual joint experimental results, which do not have an imaginary part, using operators that do not commute.
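A minimal numerical instance of this point uses Pauli matrices. The choices $\hat{A}=\sigma_{x}$, $\hat{X}=\sigma_{y}$, and $\hat{\rho}=(1+\sigma_{z})/2$ are assumptions made only for illustration; they are mutually noncommutative, and the product expectation value comes out purely imaginary.

```python
import numpy as np

sx = np.array([[0, 1], [1, 0]], dtype=complex)
sy = np.array([[0, -1j], [1j, 0]], dtype=complex)
sz = np.array([[1, 0], [0, -1]], dtype=complex)

A, X = sx, sy                     # assumed example measurements
rho = (np.eye(2) + sz) / 2        # assumed example density operator

def commutator(P, Q):
    return P @ Q - Q @ P

# All three operators are mutually noncommutative.
noncommuting = all(not np.allclose(commutator(P, Q), 0)
                   for P, Q in [(A, X), (A, rho), (X, rho)])

product = A @ X
product_is_self_adjoint = np.allclose(product, product.conj().T)

value = np.trace(product @ rho)   # sx sy = i sz, so this equals i

print(noncommuting, product_is_self_adjoint, value)
```

Because $\sigma_{x}\sigma_{y}=\mathsf{i}\sigma_{z}$ is not self–adjoint, no choice of state can be relied on to make its expected value real, which is why it cannot model a joint relative frequency.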

The concept of quantum non–demolition measurements, defined as mutually compatible measurements at time–like separation[3], is well–known to quantum physics, but it is not usually insisted on when modeling joint measurements: a notable exception, however, is the Nondemolition Principle that is introduced by Belavkin[26], which is very close in concept to Principle (JM). We will show below how Principle (JM) can be reconciled with the usual formalism of “collapse”.

In the “collapse” literature in quantum mechanics, the elementary linear algebraic construction asserts that after a measurement $\hat{A}$ that admits a discrete spectral projection $\hat{A}=\sum_{i}\alpha_{i}\hat{P}_{i}$ , where $\hat{P}_{i}\hat{P}_{j}=\delta_{ij}\hat{P}_{i}$ , $\sum_{i}\hat{P}_{i}=1$ , for which $[\hat{A},\hat{P}_{i}]=0$ , a density operator $\hat{\rho}$ evolves instantaneously to a density operator $\hat{\rho}_{\!A}=\sum_{i}\hat{P}_{i}\hat{\rho}\hat{P}_{i}$ , which is known as a Lüders transformer in the von Neumann–Lüders measurement model[36, §§II.3.2-3].

This differs a little in detail from an elementary textbook description such as
“If the particle is in a state $|\psi\rangle$ , measurement of the variable (corresponding to) $\Omega$ will yield one of the eigenvalues $\omega$ with probability $P(\omega)\propto|\langle\omega|\psi\rangle|^{2}$ . The state of the system will change from $|\psi\rangle$ to $|\omega\rangle$ as a result of the measurement.”[37, §4.1],

which we can model as a measurement $\hat{P}_{i}\hat{A}$ that not only measures $\hat{A}$ but also projects to (discards all but) a single eigenspace of $\hat{A}$ (although this does not model a stochastic transformation, as discussed below). [[ See also Appendix B for a discussion that is less abstract than what follows below. ]]

After a measurement $\hat{A}$ with this prescription, the expected value for a subsequent incommensurate measurement $\hat{X}$ , $[\hat{A},\hat{X}]\not=0$ , will be given by

[TABLE]

because of the cyclic property of the trace and because $[\hat{A},\hat{P}_{i}]=0$ , so that the Lüders transformer applied to the state $\hat{\rho}\mapsto\hat{\rho}_{\!A}$ is equivalent to that Lüders transformer applied to the measurement $\hat{X}\mapsto\hat{X}_{\!A}$ . This forces mutual commutativity and allows joint measurement in a minimal way, $[\hat{A},\hat{X}_{\!A}]=0$ , $\mathsf{Tr}[\hat{A}\hat{X}_{\!A}\hat{\rho}]=\mathsf{Tr}[\hat{X}_{\!A}\hat{A}\hat{\rho}]$ , with no change of the state. In the quantum field theory context, where microcausality is satisfied, this identity makes it very clear that “collapse” only affects components of measurements that are at time–like separation from the measurement that causes the collapse: “collapse” is not instantaneous when considered in terms of measurements. After joint measurement of $\hat{A}$ and $\hat{X}_{\!A}$ , we can apply the same construction using the discrete spectral projection of $\hat{X}_{\!A}$ , $\mathsf{Tr}\bigl{[}\hat{A}\hat{X}_{\!A}\hat{Y}\hat{\rho}_{\!X_{\!A}}\bigr{]}=\mathsf{Tr}\bigl{[}\hat{A}\hat{X}_{\!A}\hat{Y}_{\!X_{\!A}}\hat{\rho}\bigr{]}$ , et cetera. The discussion here focuses on joint measurement, in which context it is natural to consider the identity $\mathsf{Tr}\bigl{[}\hat{A}\hat{X}\hat{\rho}_{\!A}\bigr{]}=\mathsf{Tr}\bigl{[}\hat{A}\hat{X}_{\!A}\hat{\rho}\bigr{]}$ , however we can also consider the identity $\mathsf{Tr}\bigl{[}\hat{X}\hat{\rho}_{\!A}\bigr{]}=\mathsf{Tr}\bigl{[}\hat{X}_{\!A}\hat{\rho}\bigr{]}$ , which shows more concisely that a measurement $\hat{X}$ after the collapse of the state after a measurement $\hat{A}$ , $\hat{\rho}\mapsto\hat{\rho}_{\!A}$ , is equivalent to a measurement $\hat{X}_{\!A}$ without any collapse.
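The equivalence between applying the Lüders transformer to the state and applying it to the measurement can be checked in a small matrix model. The sketch assumes a four–dimensional Hilbert space and random self–adjoint matrices for $\hat{A}$ and $\hat{X}$, with $\hat{A}$ nondegenerate so that its spectral projections are rank one; the helper names are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(4)
d = 4  # assumed toy dimension

def random_hermitian(d):
    """Hypothetical helper: a random self-adjoint d x d matrix."""
    Z = rng.normal(size=(d, d)) + 1j * rng.normal(size=(d, d))
    return (Z + Z.conj().T) / 2

A = random_hermitian(d)
X = random_hermitian(d)
G = rng.normal(size=(d, d)) + 1j * rng.normal(size=(d, d))
rho = G @ G.conj().T
rho /= np.trace(rho).real

# Spectral projections of A (rank one, since a random A is nondegenerate).
_, vecs = np.linalg.eigh(A)
projectors = [np.outer(vecs[:, i], vecs[:, i].conj()) for i in range(d)]

def luders(Z):
    """Luders transformer: Z -> sum_i P_i Z P_i."""
    return sum(P @ Z @ P for P in projectors)

rho_A = luders(rho)   # "collapsed" state
X_A = luders(X)       # transformed measurement

joint_ok = np.isclose(np.trace(A @ X @ rho_A), np.trace(A @ X_A @ rho))
single_ok = np.isclose(np.trace(X @ rho_A), np.trace(X_A @ rho))
commute_ok = np.allclose(A @ X_A, X_A @ A)
real_ok = np.isclose(np.trace(A @ X_A @ rho).imag, 0)

print(joint_ok, single_ok, commute_ok, real_ok)
```

The last check shows that the joint expected value is real once $\hat{X}$ has been replaced by $\hat{X}_{\!A}$, as required for it to model joint experimental results.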

The Lüders transformer applied to subsequent measurements instead of as a collapse of the state simply enforces Principle (JM). For measurements that cannot be modeled by a discrete spectral projection, or when a state does not admit presentation as a density operator, we will have to ensure that whatever measurement operators we use for sequential joint measurement satisfy Principle (JM) without being able to use the Lüders transformer as a tool. It may in any case be best to be careful when using the Lüders transformer, insofar as it may not be the best way to use experience to choose an operator as a first approximate model for a given measurement: the Lüders transformer is certainly not the only way to enforce Principle (JM). Note, however, that although idealized measurements are often modeled as having continuous sample spaces, for real experiments there is always discretization by an analog-to-digital conversion, so that real measurements can always be modeled by a discrete spectral projection. Any observable $\hat{A}$ can be discretized as a binary value, using the Heaviside function, as, for a simplest example, $\hat{A}_{x}=\theta(x-\hat{A})\,{\cdot}\,0+\theta(\hat{A}-x)\,{\cdot}\,1$ .
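The binary discretization $\hat{A}_{x}$ can be built from the spectral decomposition of $\hat{A}$. The sketch below assumes a random self–adjoint $4\times 4$ matrix as a stand–in for $\hat{A}$ and a threshold $x$ at the median eigenvalue; both choices, and the function name `discretize`, are illustrative only.

```python
import numpy as np

rng = np.random.default_rng(5)
d = 4  # assumed toy dimension

Z = rng.normal(size=(d, d)) + 1j * rng.normal(size=(d, d))
A = (Z + Z.conj().T) / 2              # assumed example observable

alphas, vecs = np.linalg.eigh(A)      # A = sum_i alpha_i |i><i|

def discretize(x):
    """A_x = theta(A - x): eigenvalue 1 where alpha_i > x, 0 otherwise."""
    bits = (alphas > x).astype(float)
    return (vecs * bits) @ vecs.conj().T   # scales column i of vecs by bits[i]

x = float(np.median(alphas))          # illustrative threshold
A_x = discretize(x)

is_projection = np.allclose(A_x @ A_x, A_x)
commutes_with_A = np.allclose(A_x @ A, A @ A_x)
eigenvalues = np.linalg.eigvalsh(A_x)

print(is_projection, commutes_with_A, np.round(eigenvalues, 10))
```

The result is a projection with a two–point spectrum that commutes with $\hat{A}$, so it admits a discrete spectral projection however the spectrum of $\hat{A}$ is distributed.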

The Lüders transformer is a projection, $(\hat{X}_{\!A\!})_{\!A}=\hat{X}_{\!A}$ , that maps operators to the commutant of $\hat{A}$ , $[\hat{A},\hat{X}_{\!A}]=0$ . If $|1\rangle$ and $|2\rangle$ are eigenvectors of $\hat{A}$ , $\hat{A}|j\rangle=\alpha_{\!j}|j\rangle$ , $\hat{P}_{i}|j\rangle=\delta_{ij}|j\rangle$ , we have

[TABLE]

The Lüders transformer, as a linear operator, does not model a stochastic transformation of a density operator that might be loosely written as

[TABLE]

which steps outside the linear algebraic representation of expected values and probability densities and, indirectly, of statistics that lossily compress the full details of the experimental raw data. Although we can model the discarding of states for the purposes of joint measurements by using a measurement operator such as $\hat{P}_{i}\hat{A}$ , the Lüders transformer for this operator is $\hat{X}_{\!P_{\!i\!}A}=\hat{P}_{i}\hat{X}\hat{P}_{i}+(1{-}\hat{P}_{i})\hat{X}(1{-}\hat{P}_{i})$ , which is not the content of Eq. (41). Such a stochastic transformation is also not modeled in the classical statistical mechanics of §2, §4, and §5, so we will not further address it here, as was declared at the beginning of this subsection, despite its obvious interest. See Belavkin[26] for one way to introduce stochastic transformations.

It might be thought that the perfect repeatability of experimental results requires collapse of the state, however the mathematics already ensures perfect correlations of repeated measurements. If we perform a measurement modeled by $\hat{A}$ , followed by the same measurement at a later time, modeled by $\hat{B}=\hat{U}^{\dagger}(t)\hat{A}\hat{U}(t)$ , the joint probability density is

[TABLE]

so that the results are perfectly correlated without any explicit collapse mechanism being required. This can be thought of as no more than an elementary, very idealized, and somewhat unrigorous version of Mott’s result[38], that a track in a Wilson cloud chamber can be modeled by correlations already contained in a state: there is no necessity to collapse a state to model such correlations. Conversely, if perfect correlation is not observed, so that the results do not correspond to the mathematics above, then we have not performed the same measurement twice and we should not model the two measurements using the same operator.

The linear algebra above, in the three equations (39), (40), and (42), together with Principle (JM), can be understood to be as natural a way to eliminate “collapse” of the state as a separate dynamics in quantum mechanics as it is in classical statistical mechanics. If we actually record a sequence of instrument readings over time and construct joint statistics using those records, those joint statistics will be consistent with a joint probability density, but they will not be consistent with a quasiprobability distribution that is for some joint values negative or complex. Since for some states noncommutative operators will generate joint distributions for real measurements that are negative or complex (which might be thought embarrassing), the operators we use to model actually recorded joint measurements had better be mutually commutative.

On this appeal to CM+, there is, for example, no need for decoherence, just as there is no need for decoherence in classical probability theory when using statistics to model the throwing of a die. A die, unlike a sphere, is typically engineered to almost always give one of six results, with care taken to use a material for which internal degrees of freedom can be ignored for most purposes: an easily malleable die could be quite different.

The idea that measurement of $\hat{A}$ makes it in general not possible to jointly measure $\hat{A}$ and $\hat{X}$ if $[\hat{A},\hat{X}]\not=0$ , but always possible to jointly measure $\hat{A}$ and $\hat{X}_{\!A}$ (which we might call “ $\hat{X}$ after $\hat{A}$ ”), is a form of contextuality, insofar as what can be measured after $\hat{A}$ has been measured is determined by the Lüders transformer. We could instead, however, jointly measure $\hat{A}_{\!X}$ (which we might call “ $\hat{A}$ modified so it does not affect $\hat{X}$ ”) with $\hat{X}$ , because $\hat{X}_{\!A_{\!X}}=\hat{X}$ . For an elaborate discussion of joint measurement and the travails of noncontextuality, see [39], however Eq. (40) puts the enforcement of Principle (JM) as equivalent to collapse of the state in a very compact form.

We can equally apply Lüders transformers to subsequent measurements or to the state, and which we choose to do when modeling an experiment can be decided by what seems useful at the time: it is, in particular, often useful to think of a measurement as a preparation of a state. If microcausality is satisfied by measurements, however, it is as well to remember that applying the Lüders transformer to a state is then to that extent not a nonlocal operation.

A comparable approach, using “multitime correlation functions”, is suggested by Öttinger[40, §1.2.4.1, §1.2.9.3], in a more elaborate formalism of quantum master equations that seeks also to model stochastic transformations of a density operator, as also does Belavkin’s approach[26], which at the linear algebraic level explicitly uses quantum non-demolition measurements.

For a lucid account of the history of the measurement problem and for other recent literature, see Landsman[29, Ch. 11]. In such terms, as a counterpoint to the account above, Principle (JM) effectively formalizes the Copenhagen interpretation’s requirement that an experimental apparatus in the raw and its results must be described classically, and Eq. (40) gives a concrete mathematical form to Bohr’s doctrine of Complementarity, an idea that measurements affect the possibility of other measurements as an alternative to an idea that measurements change the state, which makes decoherence as (an example of) a mechanism to change the state unnecessary (see [29, pp. 4–5]). Note that we have to distinguish between variants of the Copenhagen interpretation[41, 42], because Bohr rejected collapse of the wave function, whereas Heisenberg, in particular, emphasized its necessity: Eq. (40) offers a mathematically reasoned resolution of this difference. Landsman introduces what he calls “Bohrification, i.e., the mathematical interpretation of Bohr’s classical concepts by commutative C$^{*}$–algebras”[29, p. viii], which is rather close to Principle (JM) in spirit, but, without Eq. (40) to attribute “collapse” to the enforcement of “Bohrification”, he

“describes measurement as a physical process, including the collapse that settles the outcome (as opposed to reinterpretations of the uncollapsed state, as in modal or Everettian interpretations). However, in our approach collapse takes place within unitary quantum theory.”[29, p. 14]

In contrast, we made no attempt here to discuss the final stochastic process, insofar as this problem is shared with classical statistical physics.

7.2 The violation of Bell–type inequalities

All raw data in modern experiments, whether described classically or quantum mechanically, comes into a computer along shielded signal lines attached to exotic materials that are coupled to their local surroundings and that are, furthermore, driven by support circuitry in carefully engineered ways. What may be an elaborate hardware and software process cannot be perfectly described in a single sentence, but in outline the analog signal level is sampled and converted into binary form and the data is saved in computer storage. There can be significant variations in this process: signal levels on many signal lines might each be stored as a 10–bit value every picosecond, say, but more typically, applying a very substantial level of compression, one or many signal levels may be analyzed for “trigger” conditions and information about a trigger event is stored only if a trigger condition is satisfied.

At the level of the signal picosecond by picosecond, there is in a sense no collapse, but if material properties and electronic circuitry are engineered to give a signal that satisfies a trigger condition only occasionally, then we can think of that moment as a collapse, but we can also think of it as very like the classical moment when we decide that a die has finally settled after it has landed and rolled a few times, before we pick up the die to throw it again.

Crucially in what follows, an empiricist approach should not too quickly assume that the satisfaction of signal level trigger conditions is caused by a “particle” or any other isolated system: from a classical signal analysis perspective, the signal levels are more appropriately associated with whatever locally surrounds the exotic materials and circuitry that directly drive the signal levels, with the whole experimental apparatus being driven in turn by other exotic materials and circuitry that are relatively remote, tens or thousands of meters away. Certainly an operator algebra may include idealized operators that have a discrete spectrum (or, classically, a discrete sample space), as well as operators that more realistically model the finite widths of real spectra, but we should not —or, again, not too quickly— assume that a discrete spectrum implies that there are point particles that cause that discreteness.

For a review of the violation of Bell–type inequalities, see [43]. For a specific experiment that violates a Bell–type inequality, we consider the measurements that were performed by Weihs[44, 45], for which see Fig. 1, which we take to be a compressed description of six measured voltages, $q_{1}(t),q_{2}(t),q_{3}(t),q_{4}(t),q_{5}(t),q_{6}(t)$ , on signal lines attached to four Avalanche PhotoDiodes (APDs), two for Alice, $a_{1}{\,=\,}q_{1}$ , $a_{2}{\,=\,}q_{2}$ , and two for Bob, $b_{1}{\,=\,}q_{4}$ , $b_{2}{\,=\,}q_{5}$ , and to two ElectroOptic Modulators (EOMs), $A{\,=\,}q_{3}$ , $B{\,=\,}q_{6}$ . See [45, page 60] for a beautifully clear schematic of the experiment.

All six of these voltages are the output of electronic systems that both have complex dynamics of their own and are externally driven in complex ways, enough that we cannot measure the associated momenta. The trigger conditions, however, implicitly use crude assessments of the time derivatives of the signals over device-appropriate time scales. The APD signals are mostly near zero voltage, but, at random intervals, on average once every $\sim 100\,\mu\mathrm{s}$ during the Weihs experiment, each APD signal becomes and stays near a larger voltage, which we will call “1”, for $\sim 1\,\mu\mathrm{s}$ (the “dead time”), so that each APD signal level averaged over long periods is $\sim 0.01$. Considered over long periods each APD is in a time–translation invariant thermodynamic equilibrium, whereas over short periods each APD is in a thermodynamically metastable state that is engineered to be easily disturbed. The APDs, and the state of the whole experimental apparatus, are driven by a single light source, through a relatively exotic crystal, in a way that results in elaborate correlations between the signals $a_{1}$, $a_{2}$, $b_{1}$, and $b_{2}$. The two EOMs are driven by external voltages that are as statistically independent as possible, so that the signal level at each EOM might or might not change between “0” and “1” every $\sim 0.1\,\mu\mathrm{s}$.

The raw signal level data, if it were recorded as six voltages averaged over a picosecond timescale and digitized to 10–bit accuracy, would be about 8 Terabytes per second, so it was lossily compressed in hardware by recording the time at which each APD signal changes from 0 to 1, as a 64–bit number, with accuracy $\sim 0.5$ ns, together with the local EOM signal ($A$ for $a_{1}$ and $a_{2}$, $B$ for $b_{1}$ and $b_{2}$), provided that the local EOM signal is not at the same time changing between the values 0 and 1. The compressed data rate was therefore $\sim 100$ kilobytes per second per APD.
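The orders of magnitude quoted here follow from simple arithmetic, sketched below. The variable names are illustrative, and the compressed estimate assumes one 64–bit timestamp per event at the average rate of one event per 100 microseconds given earlier, neglecting the extra EOM bit.

```python
# Raw rate: six voltages, 10 bits each, one sample per picosecond.
channels = 6
bits_per_sample = 10
samples_per_second = 1e12
raw_bytes_per_second = channels * bits_per_sample * samples_per_second / 8

# Compressed rate per APD: one 64-bit timestamp per event, with events
# arriving on average once every ~100 microseconds (EOM bit neglected).
events_per_second = 1 / 100e-6
bytes_per_event = 64 / 8
compressed_bytes_per_second = events_per_second * bytes_per_event

print(f"raw: {raw_bytes_per_second / 1e12:.1f} TB/s")
print(f"compressed: {compressed_bytes_per_second / 1e3:.0f} kB/s per APD")
```

This gives roughly 7.5 Terabytes per second for the raw data and roughly 80 kilobytes per second per APD for the compressed data, consistent with the rounded figures in the text and corresponding to a compression factor of order $10^{7}$ across the four APD lines.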

This is all that was done during each experimental run, which was followed much later by signal analysis in which signal transition times were compared and pairs of transition times that were close to coincident (within a few nanoseconds) were collated into 16 categories, for events on the $a_{1}$ or $a_{2}$ signal lines and for the EOM signal line $A$ either 0 or 1, and similarly for Bob’s signal lines. For example, for data from the experimental run longdist35, which can be obtained on reasonable request from Gregor Weihs, we can construct a table such as Table 1 (with general properties as shown, but with the precise numbers depending on details of the algorithm and its parameters). The 16 observables in Table 1 are in four groups, corresponding to events post–selected according to the EOM settings, for which we compute relative frequencies,

[TABLE]

The last expression exhibits the violation of a Bell–type inequality, which shows that the observables concerned cannot be elements of a commutative algebra[32, 33, 34]. Ordinarily, this would be a death knell for classical mechanics, but it is not for CM+.

We can apply Principle (JM) to this description of the Weihs experiment: measurements of the six $q_{i}(t)$ over time are joint measurements, as is the lossily compressed data that is actually stored, so they should be modeled by mutually commuting operators (even though the experiment as performed only records one instance at each time and place). The signal analysis algorithm, however, constructs statistics for events on the $a_{1}$ or $a_{2}$ signal lines and for the EOM signal line $A$ being either 0 or 1: we cannot make joint measurements of the occurrence of events on the $a_{1}$ signal line when the EOM signal line $A$ is both 0 and 1, so such statistics may well have to be modeled by noncommuting operators, which, again, can be done within CM+.

A typical quantum theoretical model for this experiment introduces two 2–dimensional Hilbert spaces, $\mathcal{H}_{\mathrm{Alice}}$ , spanned by $|H_{\mathrm{Alice}}\rangle$ and $|V_{\mathrm{Alice}}\rangle$ , and $\mathcal{H}_{\mathrm{Bob}}$ , spanned by $|H_{\mathrm{Bob}}\rangle$ and $|V_{\mathrm{Bob}}\rangle$ , and the 4–dimensional tensor product $\mathcal{H}_{\mathrm{Alice}}\otimes\mathcal{H}_{\mathrm{Bob}}$ . Relative frequencies of $a_{1}$ and $a_{2}$ events can be represented by one pair of orthogonal projection operators $\hat{A}_{1}$ and $\hat{A}_{2}$ acting on $\mathcal{H}_{\mathrm{Alice}}$ for $A\,{=}\,0$ and by a different pair of orthogonal projection operators $\hat{A}_{1}^{\prime}$ and $\hat{A}_{2}^{\prime}$ acting on $\mathcal{H}_{\mathrm{Alice}}$ for $A\,{=}\,1$ , and similarly for relative frequencies of $b_{1}$ and $b_{2}$ events for $B\,{=}\,0$ and $B\,{=}\,1$ . $\hat{A}_{1}$ , $\hat{A}_{2}$ , $\hat{A}_{1}^{\prime}$ , and $\hat{A}_{2}^{\prime}$ generate a noncommutative algebra of operators $\mathcal{A}$ ; $\hat{B}_{1}$ , $\hat{B}_{2}$ , $\hat{B}_{1}^{\prime}$ , and $\hat{B}_{2}^{\prime}$ generate a noncommutative algebra of operators $\mathcal{B}$ , all of which commute with all of $\mathcal{A}$ ; and together they generate an algebra of operators $\mathcal{A}\vee\mathcal{B}$ . Following Landau’s derivation[32] (but see also [43, §I.A]), define $\hat{\mathsf{a}}\,{=}\,\hat{A}_{1}{-}\hat{A}_{2}$ , $\hat{\mathsf{b}}\,{=}\,\hat{B}_{1}{-}\hat{B}_{2}$ , $\hat{\mathsf{a}}^{\prime}\,{=}\,\hat{A}_{1}^{\prime}{-}\hat{A}_{2}^{\prime}$ , and $\hat{\mathsf{b}}^{\prime}\,{=}\,\hat{B}_{1}^{\prime}{-}\hat{B}_{2}^{\prime}$ , for which $\hat{\mathsf{a}}^{2}\,{=}\,\hat{\mathsf{b}}^{2}\,{=}\,\hat{\mathsf{a}}^{\prime}{}^{2}\,{=}\,\hat{\mathsf{b}}^{\prime}{}^{2}\,{=}\,1$ , and define

[TABLE]

for which we find eight terms in $\hat{\mathsf{C}}^{2}$ cancel, leaving $\hat{\mathsf{C}}^{2}\,{=}\,4+[\hat{\mathsf{a}},\hat{\mathsf{a}}^{\prime}][\hat{\mathsf{b}},\hat{\mathsf{b}}^{\prime}]$. For CM0, we would obtain, in a state $\rho$, because both commutators must be zero, $|\rho(\hat{\mathsf{C}})|^{2}\leq\rho(\hat{\mathsf{C}}^{2})=4$. For CM+ and QM, in contrast, the spectral norms satisfy $\|\hat{\mathsf{a}}\|\,{=}\,\|\hat{\mathsf{b}}\|\,{=}\,\|\hat{\mathsf{a}}^{\prime}\|\,{=}\,\|\hat{\mathsf{b}}^{\prime}\|\,{=}\,1$, hence $\|[\hat{\mathsf{a}},\hat{\mathsf{a}}^{\prime}][\hat{\mathsf{b}},\hat{\mathsf{b}}^{\prime}]\|\leq 4$ and $\|\hat{\mathsf{C}}^{2}\|\leq 8$, so that $|\rho(\hat{\mathsf{C}})|^{2}\leq\rho(\hat{\mathsf{C}}^{2})\leq 8$. An extremal $4{\times}4$ matrix model of the above algebraic structure and a state for which $|\rho(\hat{\mathsf{C}})|\,{=}\,2\sqrt{2}\,{>}\,2$ is given in Appendix C. In Eq. (43) above, $E_{00}\,{\sim}\,\rho(\hat{\mathsf{a}}\hat{\mathsf{b}})$, $E_{10}\,{\sim}\,\rho(\hat{\mathsf{a}}^{\prime}\hat{\mathsf{b}})$, $E_{01}\,{\sim}\,\rho(\hat{\mathsf{a}}\hat{\mathsf{b}}^{\prime})$, and $E_{11}\,{\sim}\,\rho(\hat{\mathsf{a}}^{\prime}\hat{\mathsf{b}}^{\prime})$. Other Bell–type inequalities that depend on whether the algebra of operators that are allowed in models is commutative or noncommutative can be derived.
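The algebra of this paragraph can be checked with an explicit $4{\times}4$ example. The sketch below is not necessarily the matrix model the paper gives elsewhere: the Pauli-matrix choices for the four operators, the sign convention $\hat{\mathsf{C}}=\hat{\mathsf{a}}(\hat{\mathsf{b}}-\hat{\mathsf{b}}^{\prime})+\hat{\mathsf{a}}^{\prime}(\hat{\mathsf{b}}+\hat{\mathsf{b}}^{\prime})$ (one choice that reproduces the stated form of $\hat{\mathsf{C}}^{2}$), and the vector state are all assumptions made for illustration.

```python
import numpy as np

I2 = np.eye(2)
X = np.array([[0, 1], [1, 0]], dtype=complex)
Z = np.array([[1, 0], [0, -1]], dtype=complex)

def comm(p, q):
    """Commutator [p, q]."""
    return p @ q - q @ p

# Alice's operators act on the first tensor factor, Bob's on the second,
# so every Alice operator commutes with every Bob operator.
a = np.kron(Z, I2)
a_p = np.kron(X, I2)
b = np.kron(I2, (X + Z) / np.sqrt(2))
b_p = np.kron(I2, (X - Z) / np.sqrt(2))

# Assumed sign convention, chosen so that C^2 = 4 + [a, a'][b, b'].
C = a @ (b - b_p) + a_p @ (b + b_p)

# Assumed state: the vector state of (|00> + |11>)/sqrt(2).
psi = np.array([1, 0, 0, 1], dtype=complex) / np.sqrt(2)

def rho(op):
    """Expected value <psi|op|psi> of a Hermitian operator."""
    return np.vdot(psi, op @ psi).real

identity_holds = np.allclose(C @ C, 4 * np.eye(4) + comm(a, a_p) @ comm(b, b_p))
value = rho(C)         # saturates the noncommutative bound, 2*sqrt(2)
value_sq = rho(C @ C)  # equals 8 for this state
```

With these choices $\hat{\mathsf{C}}=\sqrt{2}\,(Z{\otimes}Z+X{\otimes}X)$, so the bound $|\rho(\hat{\mathsf{C}})|\leq 2\sqrt{2}$ is attained, whereas replacing the operators by mutually commuting ones would force both commutators to zero and restore the bound 2.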

The above derivation is independent of locality, however a state over $\mathcal{A}\vee\mathcal{B}$ that generates relative frequencies close to those in Table 1 is nonlocal in the sense that it is not a product of a state over $\mathcal{A}$ and another state over $\mathcal{B}$ . As superficially perplexing as this is usually thought to be, a nonlocality that is determined by the boundary conditions of the experimental apparatus is expected for a classical time–translation invariant equilibrium state, with equilibrium being established more or less quickly depending on the dynamics at smaller scale. Such a 4–dimensional Hilbert space model derives from an infinite–dimensional Hilbert space that models the electromagnetic field state of the whole of the experimental apparatus, but we can as much take that Hilbert space and measurement operators that act on it to be generated by a random field[7] as we usually take it to be generated by a quantum field; Bell inequalities for random fields are also discussed in [46]. Note that taking APD events to be a result of coupling of the APDs to the electromagnetic field is an alternative to taking APD events to be measurements of particle properties.

7.2.1 An experimental proposal

For the experiment above, the statistics of events on the signal lines are time–translation invariant, in that we turn on the power to a laser that drives the state, wait some reasonable length of time, then collect data, but if we waited for a slightly longer or shorter length of time we would obtain a very similar violation of a Bell–type inequality. A signal analysis approach to such data suggests that it would be both theoretically and technologically useful to characterize how the violation evolves over time immediately after the power to the laser is turned on, because it is surely not guaranteed a priori that the numbers of coincident events and the violation of the Bell–type inequality will increase precisely as fast as the number of events on each signal line separately. Indeed, insofar as the violation of Bell–type inequalities in such experiments is a time–translation invariant equilibrium condition, we would more expect the approach to an equilibrium nonlocal state that results in the violation of Bell–type inequalities might be slower than the approach to a local equilibrium of events on each signal line: it would be interesting to know by precisely how much, or whether it is in fact just as fast. It may well be that for some applications we cannot leave the power to the laser permanently on, in which case how the numbers of coincident events and the violation of Bell–type inequalities change over time after power is supplied may also be technologically important. It may also be that the numbers of coincident events and the violation of Bell–type inequalities will increase at different rates when the separation between Alice and Bob is increased or when different optical components are used, or subtle universalities may emerge even for apparently very different experiments. 
It would seem significant, for example, if it takes time equivalent to some large multiple of the length of the whole apparatus before violation of Bell–type inequalities is established.

To investigate such variation over time, we can turn on the power to the laser, wait a millisecond (or more or less, if we discover this is shorter or longer than necessary), turn off the power and wait until the event rate decreases to the dark rate, and repeat this process until we have enough statistics. From the resulting data, we can compute counts as above for Table 1 for events in every $0.1\,\mu\mathrm{s}$ time slice, say, after the power is turned on, and plot the increase of single events on each APD signal line, the numbers of coincident events, and the violation of the Bell–type inequality over time after the power is turned on. If we wish to understand the dynamics of such experiments, instead of essentially static statistics, we should investigate the variation of coincidences and of the violation of Bell–type inequalities over time. An alternative computation, which can be performed using the same data, would consider how the violation of Bell–type inequalities is different for an ensemble of the very first coincident pairs after each time the power is turned on, for an ensemble of the second coincident pairs, and so on, independent of the timings of the coincident pairs.
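The time-sliced counting described here might be sketched as follows. This is a hypothetical illustration, not an existing analysis code: the function name `counts_per_slice`, its arguments, and the toy timestamps are all assumptions, and the same routine would be applied separately to single events, to coincident pairs, and to each EOM setting category before forming the Bell–type combination for each slice.

```python
import numpy as np

def counts_per_slice(event_times, power_on_times, slice_width, n_slices):
    """Count events by delay since the most recent power-on, summed over cycles."""
    event_times = np.asarray(event_times, dtype=float)
    power_on_times = np.sort(np.asarray(power_on_times, dtype=float))
    # Index of the latest power-on at or before each event (-1 if there is none).
    idx = np.searchsorted(power_on_times, event_times, side="right") - 1
    valid = idx >= 0
    delays = event_times[valid] - power_on_times[idx[valid]]
    # Events later than n_slices * slice_width after power-on fall outside the bins.
    edges = np.arange(n_slices + 1) * slice_width
    counts, _ = np.histogram(delays, bins=edges)
    return counts

# Toy data in arbitrary time units: two power-on cycles, at t = 0 and t = 10.
power_on = [0.0, 10.0]
events = [-1.0, 0.5, 1.5, 10.5, 10.6, 25.0]
slice_counts = counts_per_slice(events, power_on, slice_width=1.0, n_slices=2)
```

For the toy data, three events fall in the first slice after a power-on and one in the second; the event before the first power-on and the event long after the last one are discarded.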

Consideration of electromagnetic field observables as well as of avalanche event timings in APDs also suggests that correlations between the detailed APD signal voltages averaged over, say, nanosecond or picosecond periods (conditioned both on the EOM signals $A$ and $B$ and on which of the four APD signals $a_{1}$ , $a_{2}$ , $b_{1}$ , and $b_{2}$ are currently in their avalanche state) should be examined for whether correlations of avalanche timings and the associated violation of Bell–type inequalities can also be detected in other, less compressed experimental raw data[43, §VII.C.2]. The APDs are coupled to the electromagnetic field that is driven by the electrically active components of the experiment and modified and contained by exotic materials, wave guides, and other electrically inactive components of the experiment, so that there should be some correlations between the detailed APD signal voltages and the electromagnetic field. It will be interesting to know, however, whether APD events are effectively “rogue wave”–type events, for which precursor conditions that allow prediction are typically very difficult to identify.

7.3 Schrödinger’s cat: what’s the state?

Suppose a quantum physicist prepares a box and tells a classical physicist that in the box there is a cat that is in a superposition of being alive and being dead. It’s a little whimsical to ask, as whimsical perhaps as thinking about cats in the context of quantum mechanics is always bound to be, but how can the classical physicist be sure whether the quantum physicist is telling the truth? Most often there is a song and a dance about decay of a nucleus that is initially assumed to be in a pure state and about unitary transformation of the nucleus and cat[47, §I A], but idealized claims about a state preparation have to be backed up with experimental verification that all possible effects of other degrees of freedom have been completely enough eliminated[48]. Is the state a superposition or a mixture? We focus here only on the extent to which the answer depends on whether we allow a noncommutative algebra of operators as models for measurements.

Both classically and quantum mechanically, suppose that when we open the box we measure whether the cat is alive, using a projection operator that we can present as $\hat{A}=\left(\begin{array}{cc}1&0\\ 0&0\end{array}\right)$, acting on a Hilbert space that is, as in §7.2, spanned by two vectors, $|\mathrm{Alive}\rangle$ and $|\mathrm{Dead}\rangle$, but that derives from a much higher–dimensional Hilbert space, in either CM0, CM+, or QM. Then through some experimental sleight of hand, an ensemble of cats in boxes, we obtain a probability $\alpha$ that the cat is alive. We can represent that result using a density matrix $\left(\begin{array}{cc}\alpha&\beta\\ \beta^{*}&1{-}\alpha\end{array}\right)$, which is consistent with either a mixed state such as $\hat{M}_{\alpha}\,{=}\,\left(\begin{array}{cc}\alpha&0\\ 0&1{-}\alpha\end{array}\right)$ or a pure state such as $\hat{S}_{\alpha}\,{=}\,\left(\begin{array}{cc}\alpha&\sqrt{\alpha(1{-}\alpha)}\\ \sqrt{\alpha(1{-}\alpha)}&1{-}\alpha\end{array}\right)$, for all of which we obtain precisely the same probability $\alpha$ that the cat is alive. To tell whether the quantum physicist is telling the truth, the classical physicist must use other observables, such as what could be called the Lewis Carroll operators, $\hat{C}_{1}=\left(\begin{array}{cc}0&1\\ 1&0\end{array}\right)$ and $\hat{C}_{2}=\left(\begin{array}{cc}0&{\mathsf{j}}\\ -{\mathsf{j}}&0\end{array}\right)$, each of which, in slightly different ways, takes a live cat and kills it and takes a dead cat and resuscitates it: a little strange and very difficult to implement, but comprehensible to a Victorian mathematician and in CM+.
With these operators as well as $\hat{A}$ , the classical physicist can determine $\beta$ , which likely eliminates both $\beta=0$ and $\beta=\sqrt{\alpha(1{-}\alpha)}$ as possibilities, which cannot be done if the classical physicist only uses operators that are compatible with $\hat{A}$ .
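How the Lewis Carroll operators determine $\beta$ can be made concrete with a short numerical sketch. The value $\alpha=0.3$, the function names, and the test density matrix are illustrative assumptions; the relations used are $\mathsf{Tr}[\hat{C}_{1}\hat{\rho}]=2\,\mathrm{Re}\,\beta$ and $\mathsf{Tr}[\hat{C}_{2}\hat{\rho}]=2\,\mathrm{Im}\,\beta$, which follow directly from the matrices given above.

```python
import numpy as np

alpha = 0.3  # assumed probability that the cat is alive
A = np.array([[1, 0], [0, 0]], dtype=complex)
C1 = np.array([[0, 1], [1, 0]], dtype=complex)
C2 = np.array([[0, 1j], [-1j, 0]], dtype=complex)

off = np.sqrt(alpha * (1 - alpha))
M_alpha = np.array([[alpha, 0], [0, 1 - alpha]], dtype=complex)    # mixed state
S_alpha = np.array([[alpha, off], [off, 1 - alpha]], dtype=complex)  # pure state

def expectation(op, rho):
    """Expected value Tr[op rho] of a Hermitian operator."""
    return np.trace(op @ rho).real

def beta_from_measurements(rho):
    """Recover the off-diagonal entry: beta = (Tr[C1 rho] + j Tr[C2 rho]) / 2."""
    return (expectation(C1, rho) + 1j * expectation(C2, rho)) / 2

p_alive_mixed = expectation(A, M_alpha)        # same as for the pure state
p_alive_pure = expectation(A, S_alpha)
beta_mixed = beta_from_measurements(M_alpha)   # 0 for the mixture
beta_pure = beta_from_measurements(S_alpha)    # sqrt(alpha (1 - alpha))
```

Both states give the same expected value for $\hat{A}$, so only the expected values of $\hat{C}_{1}$ and $\hat{C}_{2}$, which do not commute with $\hat{A}$, distinguish the mixture from the superposition.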

According to the usual account, a classical physicist’s measurements are always mutually commutative, which can even be held up as the fundamental difference between classical and quantum. In that case, the classical physicist cannot tell whether the alleged preparation is what the quantum physicist says it is or not. If the classical physicist accepts that all their measurements are and must be mutually commutative, they can reasonably say, “Huh, it’s just a mixture, which I understand well enough, you’re just muddying perfectly clear waters by saying that it’s a superposition”. In fact, however, the Lewis Carroll operators are classically well–enough–defined. If the classical physicist allows themselves to use the Lewis Carroll and similar operators, then they can tell whether the state is a pure state, and they can confirm all the quantum physicist’s claims, but with that expansion of what a classical physicist can do, to CM+ instead of CM0, a quantum physicist is hardly different from a “unary” classical physicist.

The question “Is the cat dead?” is embedded in a field of questions: we can ask “Is the cat’s heart working?”, “Is the cat’s liver working?”, “Has a particular hair on the cat’s left rear leg fallen out?”, and so on and on, but CM+ allows us also to ask differently integrative questions, such as “Is the cat between alive and dead, with its heart working but with its liver not working?” As we ask more and more detailed questions, and discover answers, we construct larger and larger Hilbert spaces that contain the Hilbert spaces we construct if we ask fewer questions (for quantum field theory, and for the equivalent mathematics in CM+, every detailed question has the possibility of more detailed questions, …). Perhaps most importantly, experimental apparatus we can actually construct allows us to ask some of the possible integrative questions “in hardware”, not just by analysis of more detailed questions: we can construct diffraction gratings to answer questions about the weighted average of the answers to many detailed questions without knowing the answers to any of the individual questions.

Insofar as resuscitation of a long–dead cat is in practice impossible for either a classical or a quantum physicist, of course no–one can prepare an eigenstate of the Lewis Carroll operators. If we can physically implement such reversals for a given real system, however, which in practice for some we can[47], it can equally be modeled by a classical or a quantum physicist, and such operations can be used as a computational resource. As a last word —though in CM+ there is arguably no last word— we have been wont to discuss “quantum systems” as distinct from “classical systems”, according to whether we can implement multiple clearly incompatible measurements, but it seems better to discuss “nontrivial systems of measurements”, which is more neutral both as to the distinction between CM+ and QM, and also as to whether there is such a thing as a distinguishable “system”.

8 Discussion

We have here constructed a presentation of unary classical mechanics for which only the measurement theory is the same as a reasonable, if rather minimal, measurement theory for quantum mechanics, a relationship between states–and–operators and experimental raw data, which provides a conceptual bridge between unary classical mechanics and quantum mechanics. We have deliberately made no attempt, however, to construct a more definite mathematical link between them, of a formal quantization procedure. The informal connecting link is that for both there are statistical states over $*$ –algebras, or, more simply, that for both there are “Hilbert spaces” as a way to describe different experimental contexts and different signal analysis algorithms, but with different states over different $*$ –algebras. The change of perspective suggested here is almost no change at all: perhaps it might be as well to rename quantum computing, say, as “Hilbert computing”, to demystify it, but the Hilbert space heart of the work is no different, except for, perhaps, a clearer understanding of measurement by analogy with classical measurement and signal analysis.

Although we can interpret unary classical mechanics in whatever way we interpret quantum mechanics, there is a significant difference in that for unary classical mechanics Planck’s constant plays no part. In a more elaborate framework of random fields and quantum fields, however, isomorphisms can be constructed for some cases, including the physically important case of the electromagnetic field[7, 49], for which a clear symmetry group distinction can be seen between Poincaré invariant quantum fluctuations, with an action scale determined by Planck’s constant, and thermal fluctuations that are invariant under only the little group of the Poincaré group that is defined by a Hamiltonian operator, with an energy scale determined by temperature and the Boltzmann constant[50]. Such a distinction is of course not available in the absence of the $1{+}n$–signature metric of Minkowski space. The algebraic connection between the constructions given here for unary classical mechanics and quantum mechanics is nonetheless very close, in that Eq. (2) for the Gibbs equilibrium state of a classical simple harmonic oscillator is equally satisfied for the ground state of a quantized simple harmonic oscillator or the vacuum state of a free quantum field if the pre–inner product $({\bf f},{\bf g})$ is suitably replaced. For the quantized electromagnetic field, in a manifestly Poincaré invariant construction[7], we have, as for Eq. (2),

[TABLE]

with the metric tensor $\mathsf{g}^{\mu\nu}$ being constant, of signature $(1,-1,-1,-1)$, and with $k^{\alpha}\tilde{f}_{\alpha\mu}^{*}(k)$ and $k^{\beta}\tilde{g}_{\beta\nu}(k)$ both being space–like 4–vectors orthogonal to the light–like 4–vector $k$. The algebraic structure is identical, the Weyl–Heisenberg group, but with different geometric structures.

There is a second difference, noted earlier, that although the Hamiltonian function of classical mechanics is positive the Liouvillian operator that generates evolution over time is not a positive operator, in contrast to the Hamiltonian operator that generates evolution over time in quantum theory, which is positive. This closely parallels the observation that the systematic use of quantum non-demolition measurement operators within the quantum mechanics formalism results in a generator of evolution over time that is non–positive[3, Eq. (12)]. This difference changes analytic properties of the dynamics significantly, however it does not change the abstract relationships between measurements and operator algebras and between statistics of measurement results and states.

The traditional connection between classical probabilities generated by theoretical physics and statistics of experimental raw data does not require a “collapse” of a state: the state reports probabilities, which are somehow related to statistics of ensembles. We here suggest a pragmatic case-by-case agnosticism about that relationship, without any stipulation that it must be Bayesian, frequentist, parameter estimation, or otherwise, although for those who have a strongly held adherence to a particular metaphysical interpretation of quantum mechanics, the Hilbert space mathematics for unary classical mechanics given here can be interpreted in that same way. As was shown in §7.1, “collapse” of the state is also not required insofar as we can insist on the classically natural Principle (JM), that joint measurements must be modeled by mutually commuting operators. Thirdly, as noted in §7.2, the electronic signals within an apparatus are not essentially discontinuous when they are considered at sufficient resolution: it is only when signals satisfy elaborate trigger conditions that we identify events. It should also be noted that Gibbsian states are subject to question from a classical perspective[51]. Whatever interpretation one adopts, however, one cannot quite as easily say, for example, that at small scales the world is quantum, at large scales the world is classical, insofar as the classical is quantum too. If we understand the measurement problem for classical mechanics, then we as much understand it for quantum mechanics.

Acknowledgements

I am grateful to Gregor Weihs for access to the datasets used in §7.2, to David Alan Edwards for a long sequence of incisive responses, to Jeremy Steeger for correspondence focused on the relationship between probability and statistics and experimental raw data, and to three anonymous reviewers. Amongst many online comments and correspondence, I am especially grateful for comments that directly resulted in material changes, from Leslie Ballentine, Federico Comparsi, Richard Gill, Jean–Pierre Magnot, Ulla Mattfolk, and Arnold Neumaier.

Appendix A Eq. (2) is a state

We can show directly for any operator $\hat{A}$ that $\rho(\hat{A}^{\dagger}\hat{A})\geq 0$ (separately from the explicit construction of Eq. (2) as the usual ground state for an algebra of raising and lowering operators), the other properties required being straightforward. $\hat{A}$ can be written as a sum of exponential terms, $\hat{A}=\sum_{i}\alpha_{i}{\mathrm{e}}^{\,{\mathsf{j}}\hat{F}_{{\bf f}_{i}}}$ , so that, applying Eq. (2),

[TABLE]

which is positive semi-definite because ${\mathrm{e}}^{({\bf f}_{i},{\bf f}_{j})}$ is a Hadamard exponential of $({\bf f}_{i},{\bf f}_{j})$ , which is a positive semi-definite matrix because it is a Gram matrix.
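The last step of this argument can be checked numerically. The following is a minimal sketch, not part of the original construction: it assumes that the standard Hermitian inner product on $\mathbb{C}^{3}$ stands in for the pre–inner product $({\bf f},{\bf g})$, and the helper names (`inner`, `gram_matrix`, `hadamard_exp`, `quadratic_form`) are hypothetical. It builds a Gram matrix for a few random vectors, takes its entrywise (Hadamard) exponential, and evaluates the quadratic form $\sum_{i,j}z_{i}^{*}\,{\mathrm{e}}^{({\bf f}_{i},{\bf f}_{j})}z_{j}$ for random coefficients, which should be real and non-negative up to rounding.

```python
import cmath
import random


def inner(f, g):
    """Hermitian inner product on C^n, antilinear in the first argument.

    Assumption: this stands in for the paper's pre-inner product (f, g).
    """
    return sum(x.conjugate() * y for x, y in zip(f, g))


def gram_matrix(vectors):
    """Gram matrix G[i][j] = (f_i, f_j)."""
    return [[inner(f, g) for g in vectors] for f in vectors]


def hadamard_exp(matrix):
    """Entrywise (Hadamard) exponential, E[i][j] = exp(G[i][j])."""
    return [[cmath.exp(entry) for entry in row] for row in matrix]


def quadratic_form(matrix, z):
    """Evaluate sum_ij conj(z_i) * M[i][j] * z_j."""
    n = len(z)
    return sum(z[i].conjugate() * matrix[i][j] * z[j]
               for i in range(n) for j in range(n))


rng = random.Random(0)  # fixed seed so that the check is reproducible


def random_vector(n, scale):
    """Random complex vector with independent Gaussian components."""
    return [complex(rng.gauss(0, scale), rng.gauss(0, scale)) for _ in range(n)]


fs = [random_vector(3, 0.5) for _ in range(5)]   # five test vectors f_i
E = hadamard_exp(gram_matrix(fs))                # E[i][j] = exp((f_i, f_j))

samples = [quadratic_form(E, random_vector(5, 1.0)) for _ in range(200)]
min_real = min(v.real for v in samples)          # should not be negative
max_imag = max(abs(v.imag) for v in samples)     # should vanish up to rounding

print("smallest real part:", min_real)
print("largest |imaginary part|:", max_imag)
```

Random sampling of this kind can only fail to find a counterexample; the proof itself rests on the Schur product theorem, applied term by term to the power series of the exponential.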

Appendix B Joint measurement instruments

We give here a joint measurement instrument account that parallels the more abstract discussion in §7.1. Following the account and notation given by Ballentine[52, §3.3], we consider measurements $\hat{A}$ and $\hat{B}$ that have discrete degenerate eigenvalues $a_{i}$ and $b_{j}$ ,

[TABLE]

(we will omit the degenerate eigenvector indices $\lambda$ and $\mu$ except where necessary.) To implement these measurements, we introduce measurement instruments $A$ and $B$ that are initially in vector states $|A_{0}\rangle$ and $|B_{0}\rangle$ and unitary evolutions

[TABLE]

By linearity, for a general vector $|\psi\rangle$ ,

[TABLE]

and similarly for $\hat{U}_{\!{}_{B}}$ . We apply first $\hat{U}_{\!{}_{A}}$ and then $\hat{U}_{\!{}_{B}}$ ,

[TABLE]

from which, using the Born rule, we extract probabilities

[TABLE]

The probability of a measurement result $B=b_{j}$ given that a measurement $A$ has been made, but averaging over its measurement results, is

[TABLE]

which differs from the probability of a measurement result $B=b_{j}$ given that a measurement $A$ was never made,

[TABLE]

by the omission of “interference” terms, unless $\hat{A}$ and $\hat{B}$ commute. We can rewrite Eq. (45), using a projection operator associated with each eigenvalue $a_{i}$ , $\hat{P}_{i}=\sum_{\lambda}|a_{i\lambda}\rangle\langle a_{i\lambda}|$ , as

[TABLE]

which corresponds, comparably to Eq. (40), to either

1. a Lüders transformed measurement $\sum_{i}\hat{P}_{i}|b_{j}\rangle\langle b_{j}|\hat{P}_{i}$ in the state $|\psi\rangle\langle\psi|$ , or

2. a measurement $|b_{j}\rangle\langle b_{j}|$ in the Lüders transformed state $\sum_{i}\hat{P}_{i}|\psi\rangle\langle\psi|\hat{P}_{i}$ ,

so, following the algebra, either we can say that a measurement of $B$ after a measurement of $A$ is not in general the same as a measurement of $B$ alone, or we can say that the measurement of $A$ changed the state. We can say either that both descriptions are equally acceptable, or we can insist that one or the other description is preferred for specific contexts. The third way, suggested by Principle (JM), is to require all operators that are used to model joint measurements to commute, so that the Lüders transformer has no effect on subsequent measurements.
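A small numerical illustration of the interference terms may help. It is only a sketch under stated assumptions: a single qubit, nondegenerate projectors for $\hat{A}$ onto the computational basis, and a state $|\psi\rangle=\cos\theta|0\rangle+\sin\theta|1\rangle$ with $\theta=\pi/8$; all function and variable names are hypothetical. It compares the probability of a result for $B$ alone, $|\langle b_{j}|\psi\rangle|^{2}$, with the probability after an unread measurement of $A$, $\sum_{i}|\langle b_{j}|\hat{P}_{i}|\psi\rangle|^{2}$, for one choice of $\hat{B}$ that does not commute with $\hat{A}$ and one that does.

```python
import math


def braket(u, v):
    """Inner product <u|v>, antilinear in u."""
    return sum(x.conjugate() * y for x, y in zip(u, v))


def projector(v):
    """Rank-one projector |v><v| for a unit vector v."""
    return [[x * y.conjugate() for y in v] for x in v]


def apply(op, v):
    """Apply a matrix to a vector."""
    return [sum(op[i][k] * v[k] for k in range(len(v))) for i in range(len(op))]


def prob_b_alone(b, psi):
    """Born probability |<b|psi>|^2 when A is never measured."""
    return abs(braket(b, psi)) ** 2


def prob_b_after_a(b, a_projectors, psi):
    """Probability of b after A is measured, averaged over A's results."""
    return sum(abs(braket(b, apply(P, psi))) ** 2 for P in a_projectors)


theta = math.pi / 8                                   # assumed state parameter
psi = [complex(math.cos(theta)), complex(math.sin(theta))]

zero, one = [1 + 0j, 0j], [0j, 1 + 0j]                # eigenvectors of A
plus = [complex(1 / math.sqrt(2))] * 2                # eigenvector of a B with [A, B] != 0
P_A = [projector(zero), projector(one)]

# Non-commuting case: the two probabilities differ by an interference term.
noncommuting_alone = prob_b_alone(plus, psi)
noncommuting_after = prob_b_after_a(plus, P_A, psi)
interference = noncommuting_alone - noncommuting_after

# Commuting case (B shares eigenvectors with A): no difference.
commuting_alone = prob_b_alone(zero, psi)
commuting_after = prob_b_after_a(zero, P_A, psi)

print(noncommuting_alone, noncommuting_after, interference)
print(commuting_alone, commuting_after)
```

In the commuting case the two probabilities agree, which is the situation that Principle (JM) requires of all operators used to model joint measurements.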

Appendix C A matrix model for §7.2

We give an extremal $4{\times}4$ matrix model that has the algebraic structure given for the operators in §7.2:

[TABLE]

using which we obtain for $\hat{\mathsf{C}}=\hat{\mathsf{a}}\hat{\mathsf{b}}+\hat{\mathsf{a}}\hat{\mathsf{b}}^{\prime}+\hat{\mathsf{a}}^{\prime}\hat{\mathsf{b}}^{\prime}-\hat{\mathsf{a}}^{\prime}\hat{\mathsf{b}}$ ,

[TABLE]

For this extremal model for $\hat{\mathsf{C}}$ , we have $\|\hat{\mathsf{C}}\|^{2}\,{=}\,\|\hat{\mathsf{C}}^{2}\|\,{=}\,8$ , $\hat{\mathsf{C}}^{3}\,{=}\,8\hat{\mathsf{C}}$ , $\mathsf{Tr}[\hat{\mathsf{C}}]\,{=}\,0$ , and $\mathsf{Tr}[\hat{\mathsf{C}}^{2}]\,{=}\,16$ .

We can use a density matrix $\hat{\rho}\,{=}\,\psi\psi^{\dagger}\,{=}\,\frac{1}{16}(\hat{\mathsf{C}}^{2}-2\!\sqrt{2}\hat{\mathsf{C}})$ , where $\psi$ is a unit length eigenvector of $\hat{\mathsf{C}}$ ,

[TABLE]

to construct a state for which $\rho(\hat{\mathsf{C}})\,{=}\,\mathsf{Tr}[\hat{\mathsf{C}}\hat{\rho}]\,{=}\,-2\!\sqrt{2}$ ,

$\rho(\hat{\mathsf{a}})\,{=}\,\rho(\hat{\mathsf{a}}^{\prime})\,{=}\,\rho(\hat{\mathsf{b}})\,{=}\,\rho(\hat{\mathsf{b}}^{\prime})\,{=}\,0$ , and

$\rho(\hat{\mathsf{a}}\hat{\mathsf{b}})\,{=}\,\rho(\hat{\mathsf{a}}\hat{\mathsf{b}}^{\prime})\,{=}\,\rho(\hat{\mathsf{a}}^{\prime}\hat{\mathsf{b}}^{\prime})\,{=}\,-\tfrac{1}{2}\sqrt{2}$ , $\rho(\hat{\mathsf{a}}^{\prime}\hat{\mathsf{b}})\,{=}\,\tfrac{1}{2}\sqrt{2}$ .

[[ For a density matrix $\hat{\rho}\,{=}\,\frac{1}{16}(\hat{\mathsf{C}}^{2}+2\sqrt{2}\hat{\mathsf{C}})$ , we obtain $\mathsf{Tr}[\hat{\mathsf{C}}\hat{\rho}]\,{=}\,2\sqrt{2}$ , $\rho(\hat{\mathsf{a}}\hat{\mathsf{b}})\,{=}\,\rho(\hat{\mathsf{a}}\hat{\mathsf{b}}^{\prime})\,{=}\,\rho(\hat{\mathsf{a}}^{\prime}\hat{\mathsf{b}}^{\prime})\,{=}\,\tfrac{1}{2}\sqrt{2}$ , $\rho(\hat{\mathsf{a}}^{\prime}\hat{\mathsf{b}})\,{=}\,-\tfrac{1}{2}\sqrt{2}$ . ]]
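The identities quoted in this appendix can be reproduced with a short numerical check. The sketch below does not use the matrices of the model above; as an assumption, it substitutes the familiar two–qubit realization $\hat{\mathsf{a}}=Z\otimes 1$, $\hat{\mathsf{a}}^{\prime}=X\otimes 1$, $\hat{\mathsf{b}}=1\otimes(Z+X)/\sqrt{2}$, $\hat{\mathsf{b}}^{\prime}=1\otimes(Z-X)/\sqrt{2}$, which may differ from the model given here by a change of basis or of signs. The helper names are hypothetical.

```python
import math


def matmul(A, B):
    """Matrix product of two nested-list matrices."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]


def kron(A, B):
    """Kronecker (tensor) product."""
    return [[x * y for x in row_a for y in row_b] for row_a in A for row_b in B]


def lin(*terms):
    """Linear combination: lin((c1, M1), (c2, M2), ...) = c1*M1 + c2*M2 + ..."""
    rows, cols = len(terms[0][1]), len(terms[0][1][0])
    return [[sum(c * M[i][j] for c, M in terms) for j in range(cols)]
            for i in range(rows)]


def trace(A):
    return sum(A[i][i] for i in range(len(A)))


def max_abs_diff(A, B):
    """Largest entrywise difference between two matrices."""
    return max(abs(x - y) for ra, rb in zip(A, B) for x, y in zip(ra, rb))


I2 = [[1.0, 0.0], [0.0, 1.0]]
Z = [[1.0, 0.0], [0.0, -1.0]]
X = [[0.0, 1.0], [1.0, 0.0]]
r = 1 / math.sqrt(2)

# Assumed realization (not necessarily the matrices of the model above).
a, a_p = kron(Z, I2), kron(X, I2)
b = kron(I2, lin((r, Z), (r, X)))
b_p = kron(I2, lin((r, Z), (-r, X)))

# C = a b + a b' + a' b' - a' b, as defined in the text.
C = lin((1, matmul(a, b)), (1, matmul(a, b_p)),
        (1, matmul(a_p, b_p)), (-1, matmul(a_p, b)))
C2 = matmul(C, C)
C3 = matmul(C2, C)

# Density matrix rho = (C^2 - 2 sqrt(2) C) / 16 from the text.
rho = lin((1 / 16, C2), (-2 * math.sqrt(2) / 16, C))


def expect(M):
    """Expectation value Tr[M rho]."""
    return trace(matmul(M, rho))


print("Tr C =", trace(C), " Tr C^2 =", trace(C2))
print("max |C^3 - 8 C| =", max_abs_diff(C3, lin((8, C))))
print("rho(C) =", expect(C))
print("rho(ab), rho(ab'), rho(a'b'), rho(a'b) =",
      expect(matmul(a, b)), expect(matmul(a, b_p)),
      expect(matmul(a_p, b_p)), expect(matmul(a_p, b)))
```

Because $\hat{\mathsf{C}}^{3}=8\hat{\mathsf{C}}$ restricts the eigenvalues of $\hat{\mathsf{C}}$ to $0$ and $\pm 2\sqrt{2}$, the combination $\frac{1}{16}(\hat{\mathsf{C}}^{2}-2\sqrt{2}\hat{\mathsf{C}})$ acts as a projection onto the eigenspace with eigenvalue $-2\sqrt{2}$, which is one-dimensional in this realization.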

Bibliography

[1] N. P. Landsman, “Algebraic Quantum Mechanics”, in D. Greenberger, K. Hentschel, and F. Weinert (Eds.), Compendium of Quantum Physics, Springer, Berlin, 2009, pp. 6-10. https://doi.org/10.1007/978-3-540-70626-7_3
[2] R. Haag, Local Quantum Physics: Fields, Particles, Algebras, 2nd Edn., Springer, Berlin, 1996.
[3] M. Tsang, C. Caves, “Evading Quantum Mechanics: Engineering a Classical Subsystem within a Quantum Environment”, Phys. Rev. X 2 (2012) 031016. https://doi.org/10.1103/PhysRevX.2.031016
[4] B. O. Koopman, “Hamiltonian Systems and Transformations in Hilbert Space”, Proc. Natl. Acad. Sci. 17 (1931) 315. https://doi.org/10.1073/pnas.17.5.315
[5] D. Mauro, “A new quantization map”, Phys. Lett. A 315 (2003) 28. https://doi.org/10.1016/S0375-9601(03)00996-4
[6] P. Ghose, “The Unfinished Search for Wave–Particle and Classical–Quantum Harmony”, J. Adv. Phys. 4 (2015) 236. https://doi.org/10.1166/jap.2015.1197
[7] P. Morgan, “Classical states, quantum field measurement”, Physica Scripta 94 (2019) 075003. https://doi.org/10.1088/1402-4896/ab0c53
[8] F. Zalamea, “The Twofold Role of Observables in Classical and Quantum Kinematics”, Found. Phys. 48 (2018) 1061. https://doi.org/10.1007/s10701-018-0194-8