On a deterministic disguise of orthodox quantum mechanics
Ulrich Mutze

TL;DR
This paper proposes a deterministic interpretation of quantum mechanics where the global phase of the state vector acts as a hidden variable, allowing the prediction of measurement outcomes rather than just probabilities.
Contribution
It introduces a framework where the quantum state vector's global phase determines measurement results, providing a deterministic perspective parallel to classical mechanics.
Findings
Global phase acts as a hidden variable in quantum states.
Measurement outcomes can be predicted deterministically with known initial states.
The approach parallels classical statistical mechanics in its determinism.
Abstract
According to quantum theory, pure physical states correspond to equivalence classes of state vectors, where any two members of one class differ by a complex factor. The point is that such a factor does not change the probability for the occurrence of any measurement result as computed within the formalism of quantum mechanics. In the formalism to be presented here, the state vector does not only determine the probabilities for the occurrence of measurement results, rather it determines the result itself. The information which guides the selection process is obtained from the state vector in a way that is not invariant against multiplication by a complex factor. It can be seen as a kind of global phase of the state vectors. This global phase then is a perfectly hidden variable. For this presentation of quantum mechanics we have a remarkable parallelism to classical statistical mechanics:…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsQuantum Mechanics and Applications · Biofield Effects and Biophysics · Quantum Information and Cryptography
On a deterministic disguise of orthodox quantum
mechanics
Ulrich Mutze 111www.ulrichmutze.de
Abstract
According to quantum theory, pure physical states correspond to equivalence classes of state vectors, where any two members of one class differ by a complex factor. The point is that such a factor does not change the probability for the occurrence of any measurement result as computed within the formalism of quantum mechanics. In the formalism to be presented here, the state vector does not only determine the probabilities for the occurrence of measurement results, rather it determines the result itself. The information which guides the selection process is obtained from the state vector in a way that is not invariant against multiplication by a complex factor. It can be seen as a kind of global phase of the state vectors. This global phase then is a perfectly hidden variable. For this presentation of quantum mechanics we have a remarkable parallelism to classical statistical mechanics: If we would know exactly the initial state of a system, we could compute the outcomes for all experiments to be done later on this system. In the quantum case the initial state is to be represented by its state vector with specified phase, e.g. as a concrete complex-valued wave function.
1 Introduction
As is well known, quantum mechanics can be interpreted in a number of ways. The view on quantum mechanics which underlies the present work is similar to the ‘Copenhagen spirit’ as outlined in chapter VII.1 of [1]. See also [2], Ch. IV, §§ 15-19. We thus deal with systems that are amenable to reproducible experiments consisting of manipulations and observations that can be described in an unambiguous language. As N. Bohr put it (cited from [1], p. 295): ‘We must be able to tell our friends what we have done and what we have learned’.
It has tradition to characterize this unambiguous language as the language of classical physics. In my eyes it is clearer to characterize it as the language of engineering. For instance, modern photo-detectors such as CCDs are based on non-classical concepts of semiconductor physics. However, as commercially available products, these are the outcome of documented manufacturing processes, based on documented designs, guided by a documented device theory. This device theory is based on authoritative textbooks such as [3] which don’t speak the ‘quantum language’ with states, Hamilton operators, and observables. The existence of an unambiguous language is paralleled by the existence of unambiguous actors: electrons, photons, atoms, molecules and alike. Despite all quantum uncertainty associated with motion and internal dynamics of these quantum actors, their constant physical properties like rest-mass, charge, and internal angular momentum, together with their persistence (each of the atomic nuclei around us has an individual history going back some billion years to its formation in the evolution of a no longer existing star) allow us to perform experiments which deliver reproducible statistical data derived from myriads of individual factual measuring events. A typical case of such measuring events are detector ‘clicks’ which say where a particle was found. Assume that the detectors belong to a conventional scattering experiment, where a beam of particles penetrates a target foil and the scattered particles get registered. Then it is clear that the detected impacts allow an unambiguous digital documentation. It is also clear that quantum theory gives no recipes for predicting the individual impacts. It is only the probability for finding an impact at an arbitrarily chosen position for which quantum mechanics allows to derive formulas.
The idea that the physical content of a motion theory of atomic and subatomic particles could be intrinsically statistical is not natural for most physicists. When the interpretation of Schrödinger’s as a probability emerged from papers by M. Born and others (see. [4]) it created disbelief and opposition such as Einstein’s (abridged) remark ‘God does not play dice’. Against Leibniz’s ‘principle of sufficient reason’ the impacts should happen once here then there and some miraculous principle should tame their arbitrariness as to observe the probabilistic rules which are the content of the theory. Never before was a theory proposed of that kind. Notice that in classical statistical mechanics of gases each molecular collision is determined by the locations and velocities of nearby molecules or walls. The statistical character comes only from the fact that the number of dynamical variables is too large for a treatment on the fundamental level. With the employment of computers, working on the fundamental level gained large new territories [5], though. Determinism and causality as claimed above for collisions allow time-evolution to run autonomously. No authority is required to enforce obedience to the probabilistic rules while still respecting conservation laws exactly. Rather, the process evolves in the manner of an algorithm. This does not imply that future states of the process can be predicted; strong dependence on initial conditions may make predictions fail.
These considerations suggest the question whether a logically equivalent situation could hold for quantum mechanics as well. One answer to this question is known: D. Bohm’s deterministic version of quantum mechanics [6] augments the Schrödinger wave function with particles having definite positions and having momenta that are defined in terms of the wave function. The deterministic time evolution of the wave function forces the particles to follow according to a deterministic rule. The wave function is said to act as a guiding wave for the particle. A particle detector ‘click’ is assumed to indicate just the position of such a guided particle. Whether a detector will ‘click’ or not thus is determined by the ‘Bohmian state’ since it specifies the positions of the particles. Although it is thus a theory with ‘hidden variables’ (the particle positions) in a general meaning of the words, it is well understood today (cf. [7]) that it is not covered by von Neumann’s no-go theorem on hidden variable theories. The presently proposed method is literally a hidden variable theory as well — the global phase of the wave function is the single hidden variable. Since it makes exactly the same prediction as orthodox quantum mechanics, it is not restricted by Bell’s inequalities and, since it does not assume pre-existing values of measured quantities, it does not suffer from quantum contextuality.
2 Measurements
With respect to mathematical representations of measurements and states we follow conventional lines: A complex Hilbert space , the normalized elements of which will always be denoted by and referred to as state vectors or wave functions, allows representing (pure) states as equivalence classes of state vectors with respect to the relation
[TABLE]
Wave functions are considered time-dependent according to the Schrödinger picture, where time refers to an inertial system of reference which is assumed to be selected once and for all. This implies that whenever a specific wave function is under consideration, there is some , the point in time for which is valid.
Since all Hilbert spaces of the same dimension are isomorphic, no specific features of an experiment are expressed by selecting . A connection to the physical world is made by giving certain linear operators in a meaning as descriptors of entities ranging from observable quantities to measurement devices, dynamical variables, and symmetries.
Measurement devices are represented by self-adjoint operators in . Self-adjoint operators viewed as representing measurement devices will be called observables.
Since measurements always have a limited resolution and a finite operational range it is justified to consider as observables only those self-adjoint operators which have a spectrum 222this, of course, is the set of possible outcomes of measurements consisting of finitely many points 333Self-adjoint operators that primarily act as generators of unitary one-parameter groups, such as the Hamiltonian, are excluded from this restriction.. The general form of an observable thus is
[TABLE]
where is a finite subset of and is a family of pairwise orthogonal projectors which add up to the identity operator in . The expression on the right-hand side of equation (2) is said to be the spectral representation of . Observables in the sense of (2) fall short of modeling most realistic measurements as they are restricted to giving a single number as a result. This precludes them e.g. from recording the direction of of a particle track in space, or, even more so, the tracks of a group of particles. A sufficient degree of generality is reached if we consider lists , of mutually commuting observables. When interacting with a state vector , such a composite observable outputs a list as a result, where . This list can be considered as the result of a single measurement obtained with a complex experimental setup, which may, for instance, comprise some spatially separated particle detectors and analyzers. The may show dependencies, such as due to relations between the operators, such as , or, less trivially, due to algebraic properties of state vector as in the case of the singlet state in Section 3. A composite observable which cannot be extended in a non-trivial manner 444Trivial extensions add self-adjoint operators which are functions of the operators which belong the composite observable already. is said to be a complete set of commuting observables. This is a standard concept of quantum mechanics, whereas the name composite observable is not in common use.
We now turn to constructing a function which takes as arguments an arbitrary composite observable and an arbitrary and gives as a result a list . In order to simplify indexing we actually consider only the case as a pattern for . Instead of we write , where
[TABLE]
Our assumption can be shown to be equivalent to for all and all . In such a setting the two spatially separated spin measurements which are considered in D. Bohm’s discussion of the EPR-experiment (see e.g. [6], Chapter 7, [8], Chapter 3.9 and Chapter 6.3), and that are the topic of Section 3, can be treated as a single measurement with a single result (consisting of a pair of values). That observables corresponding to spatially separated measuring devices commute is a foundational principle of local quantum theory [1] and of quantum field theory [9].
Our composite observable determines the set which contains all potential 555If , not all are possible measurement results, only are. measurement results. Which result is to be expected in an actual measurement depends on the state vector of the system at the instant of measurement. Quantum mechanics says that the probability for getting the result is given by
[TABLE]
This probability can be written as a transition probability
[TABLE]
by means of the ‘collapsed wave function’
[TABLE]
If the system is still existing after a measurement that gave the result and an immediately subsequent measurement is feasible, then the state vector for this measurement has no longer to be taken as , but as . This statement is normally referred to as the projection postulate.
It is instructive to see that assuming the corresponding relation for single observables instead of pairs, allows us to deduce the result for pairs and, actually, arbitrarily long lists by successive application:
[TABLE]
This approach is, however, less natural than the one we started with. This is especially so if our measurement device contains spatially separated subsystems and , which are responsible for creating the partial results and respectively. Then, we are forced to give the appearance of the partial results a causal order. Here, in accordance with (7), we let appear first. Then, we need to appeal to the projection postulate and employ the state for computing the the probability for which is consistent with (4) for the pair of results. The state relevant for the computation thus seems to switch from at the place of to at the place of . This, however, is a misconception: the relation between and has nothing to do with causality as can be seen from the fact that one could reverse the roles of and which then would suggest a causal connection in the opposite direction.
In conclusion, the physical situation as represented by the measurement device and the state vector assigns to each element a probability as given by (4). The act of measurement selects one of the elements of in a way which is compatible with these probabilities. Of course, not always an element with a major assigned probability will be selected; only a probability zero is prohibitive for becoming selected. The predictive power of the theory for a single measurement is, therefore, rather poor. If, however, the physical situation allows the measurement to be repeated with the relevant conditions unchanged, statistics over the selected values can be done and the consistency of the obtained measurement data and the the probability measure can be quantified.
Taking quantum mechanics with M. Born as an inherently statistical theory the selection of the actual measurement value out of the manifold of possible values is a ‘true’ random event. Replacing this random event by an algorithmically generated, thus deterministic, pseudo-random event brings about the deterministic disguise announced in the title.
Whereas in biological systems there are many well-understood mechanisms that have a (pseudo-)random output (e.g. the distribution of the parent genes onto the descendants), it is hard to hypothesize how pseudo-random generators in non-living quantum systems could be organized. Here I assume as a working hypothesis that quantum mechanics (actually all Nature) builds on some unknown algorithmic infrastructure 666A plausible speculation sees the update rules of a universal discrete causal network in this role. Since nothing specific from this body of ideas is needed in the present work, a single reference: [10] Chapter 9 and Notes for Chapter 9 may suffice. . In the absence of any pertinent knowledge it is a natural assumption that the computational strength of this structure is just so that it allows to compute all functions which are computable in the technical sense and that, therefore, a valid pseudo-random number generator (RNG) can be realized within this infrastructure (see e.g. [10] Chapter 7, The Intrinsic Generation of Randomness). We thus take for granted that a RNG is a legitimate building block of a physical theory, much like integrals and differential equations.
More specifically, we assume a RNG which outputs uniformly distributed real numbers in the interval . The meaning of this is a pragmatic one: the generator passes a reasonable set of statistical tests of the hypothesis that the output of the generator comes from independent repetition of a stochastic experiment which outputs a value such that the event has probability .
A simple example is the RNG [11]
[TABLE]
which can be shown to pass all 15 tests of the test battery SmallCrush of TestU01 version 1.2.3 (see [12]) and fails in 20 of 144 tests of the more demanding battery Crush, and in 44 of 160 in the even more demanding BigCrush. The latter consumed 16.5 hours of CPU time on a dual core 2.4 GHz Laptop Computer.
Forming weighted means out of several RNGs gives again a RNG with typically better statistical properties. The situation seems to be that for each test battery there can be constructed a RNG which passes it completely and for each RNG there can be constructed a test battery which lets it fail. The real challenge is to identify in this infinite competition a David versus Goliath situation, where a relatively simple RNG passes a huge, complex, and powerful test battery.
The value-range for RNGs is standard in programming languages and comes in naturally in algorithms such as (8). In some of the following considerations the toroidal value range would be more natural in the first place. The bijection
[TABLE]
will provide the connection where needed.
In programming languages a random generator is normally a named function (such as rand() in the language C) which needs no argument to yield a value. This is, in a sense, a fallacy since the function has a counter as an internal state which gets incremented by each function call and which works as a hidden argument. When using pseudo-random generators as legal citizens of theoretical physics one should give them a more natural interface, which does not rely on a hidden state variable. Instead we take for granted ‘pseudo-random clocks’
[TABLE]
which associates a pseudo-random number and a pseudo-random phase with any point in time. Just as numerical values of time are not provided by Nature but by a man-made time-keeping system (which has, however, to satisfy conditions set by Nature), we consider also the numerical output of the random clocks as a part the man-made time-keeping system. Actually, on the Internet there are various services which provide both pseudo-random numbers and ’true’ random numbers (derived from natural processes such as radioactivity) on demand.
Pseudo-random numbers can be used to determine rather general pseudo-random events by constructing images of the probability space to which the pseudo-random numbers belong. In our case we construct the mapping
[TABLE]
such that the image of the Lebesgue measure on under is just from (4):
[TABLE]
The construction of suggests itself. Define
[TABLE]
and write it in lexicographic order as
[TABLE]
Define iteratively by ; obviously . Defining the intervals we have the partition
[TABLE]
and define as constant on each of the intervals and taking the value on .
One way to complete our construction of function would be to require each measurement in state at time to request the random number from the random clock and to stipulate that the outcome of the experiment be given as .
Since for any -valued -distributed random variable the -valued random variable is -distributed this rule let the measurements, when analyzed statistically, appear indistinguishable from measurements that result from the intrinsically statistical version of quantum mechanics.
In this method one may criticize the appearance of on grounds of the possibility that for many measurements the trigger for measurement is not in the device but in the system to be measured (e.g. in the collision of two particles). Since the formation of the actual measurement values is determined by we need for an unknown time which would destroy the determinism of our scheme. It is desirable to refer to only at the end of a measurement, where the measurement result and the point in time exist in documented form and belong to the world that can be represented in the ‘engineering language’. The method to be proposed in the following achieves this.
It derives from the state vector . The simplest choice is , where is a function such that for all state vectors and all . There are many such functions and without having input from a special situation we have to rely on arbitrary selections. Here we choose an orthonormal base in and set for each state vector
[TABLE]
Among the finitely many complex numbers we select the one of greatest modulus and name it . Then
[TABLE]
The final formula for the measurement result and the desired function (see p. 4) is
[TABLE]
In order for this to work we have to manage the circumstances that determine the phase of a state vector. A state comes into existence either by preparation or assumption (typically based on a priori knowledge). Let us consider an experiment dealing with a single isolated hydrogen atom. The state of it is ‘self-preparing’; after some time it can be assumed to be the ground state. Physicists know the formula for the hydrogen atom ground state wave function. No question that in Nature there occurs something that corresponds closely to the behavior of the mathematical object defined by this formula. For mathematical deductions concerning future behaviors of the object it suggests itself to take the wave function as given by the books as initial state vector. Actually, any multiple by a would represent the same state. Since in the formalism to be presented here, the overall phase of the wave function does matter, it would be unnatural to rely here on a choice that text book authors made, based on their understanding of simplicity of formulas. Even in the unmodified quantum mechanics formalism we have to acknowledge that the time-dependent Schrödinger equation deals with state vectors and not with states. If we understand time-evolution as an algorithm, as indicated earlier in this section, it is clear that this algorithm works on a state vector and not on a state. The logic is similar to an algorithmic implementation of the arithmetic of integers: one has to select a base (e.g. 2 or 10) although the results depend on this choice in an obvious way which allows to switch from one representation to another. Let this be motivation enough for the following rule: Whenever a state vector is to be introduced into the description of an experiment, there is a ‘time of birth’ for which the preparation of the state was finished. Then the ‘newly born’ state vector has to be given a phase such that
[TABLE]
with from (10).
In a computational model of a quantum system formulated in C(and in many other object oriented programming languages) an object comes into existence through the execution of a constructor function. In such a framework one simply has to make this random phase operation (16) the last statement in all state constructors. Also the state vector (6) has to be considered ‘newly born’ and has to be ‘re-phased’ as to satisfy
[TABLE]
where is the time of completed measurement.
Let us summarize the properties of the proposed modification of orthodox quantum mechanics:
The result of every measurement is strictly determined. 2. 2.
All measurement results occur with the probability predicted by quantum theory. 3. 3.
Since each measurement introduces new randomization of the phase, there is no way to experimentally find out the mechanism behind the built-in determinism. 4. 4.
Determinism is enforced by the same means that a programmer would employ in creating a simulation of chained quantum measurements.
3 A computational low-dimensional model
As a proof of concept I implemented the ideas presented so far as an interactive free computer program on the web [13]. The source code of the program is freely accessible. It is written in Wolfram Language which is a very expressive universal programming language which, due to its freely accessible documentation, can well serve as a replacement for expressing algorithmic content in pseudo-code.
The system under consideration is a simplified EPR experiment with two electron spins. By running this program one observes how the simulated measurement results approach the quantum mechanical exact result for the spin-spin correlation coefficient. The present form of an article does not allow to present such direct evidence. Nevertheless, a description of the system and the simulated results from a single arbitrarily selected run may be a useful illustration of the method described in Section 2.
The model is defined by Hilbert space and operators as follows:
[TABLE]
where, of course, denote the Pauli matrices.
All these operators are actually square matrices and their spectral representation can be obtained by numerical methods such as the function Eigensystem of the computer algebra system Mathematica. Of course, for the operator we get the eigenvalues [math] and together with two projectors onto an 1-dimensional and a 3-dimensional state space, corresponding to what is known as singlet and triplet states respectively. We select a singlet state which is defined uniquely up to a phase factor. With the system assumed to be in this state we simulate the measurement of the composite observable with
[TABLE]
When running, the program executes the command 777See (15) for . For the pseudo-random generator the name from Wolfram language [14] is used.
[TABLE]
repeatedly, creating a list
[TABLE]
where the repetition stops if some parameter, such as the state or the angles get changed via the graphical user interface. The most recent result is shown on the left-hand side of Figure 1 as a red and a green arrow which connect the origin with points
[TABLE]
The complicating addition of is needed since the vertical (and not the horizontal) direction was chosen to correspond to . In the example shown, the two angle settings differ only by a small angle of so that there is a probability close to that and differ in sign and, thus, the arrows point in nearly opposite directions.
The graphics on the right-hand side deals with the spin-spin correlation coefficient 888We use a simplified definition of a correlation coefficient which assumes that the expectation value of the quantities to be correlated is zero.. The jagged brownish curve shows the most recently obtained 200 values of the correlation coefficient of the obtained data:
[TABLE]
where the are given by
[TABLE]
Each of the jags indicates an event with (recall that is much more probable). In all the countless runs I listened, the brownish curve eventually reached the theoretical value (25), indicated by the blue horizontal line and continued wiggling around this curve. Sometimes one observes that rather pronounced deviations from the blue line build up. But, as a rule, they have a smaller amplitude as in earlier deviation episodes.
The exact quantum mechanical value for the spin-spin correlation coefficient is
[TABLE]
For being the singlet state this quantity can easily be seen to equal and thus to yield perfect anti-correlation for .
4 Conclusion
It is rather evident that the method applied here can be generalized as to transform any probabilistic theory into a deterministic one. The method of converting probabilities into events is, in a sense, a reversion of the program of probability theory, which tries to capture the characteristics of streams of events by ascribing probabilities to them.
Although we have shown that it does not conflict with logic to assume a deterministic mechanism behind the emergence of measurement results, one cannot deny that the algorithmic implementation given above does not look like a strategy that Nature would follow. The algorithmic understanding of physical processes is in its infancy at best, as is the understanding of the required resources.
5 Acknowledgements
I would like to thank D.P.L. Castrigiano for his critical reading of the first version of this article, which led to lots of reformulations and additions, and even to the correction of a mathematical error. The definition of contained an infinite sum which was not guaranteed to exist. A new definition of avoids the problem. I thank Michael J.W. Hall for having made me aware that a construction very similar to my function is described in [15], end of Chapter IV.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] R. Haag: Local Quantum Physics, Fields, Particles, Algebras, Second Edition, Springer 1996
- 2[2] A. Messiah: Quantum Mechanics, Vol. I, North-Holland Publishing Company 1961
- 3[3] S.M. Sze: Physics of Semiconductor Devices, Second Edition, Wiley 1981
- 4[4] A. Pais: Max Born and the Statistical Interpretation of Quantum Mechanics http://cds.cern.ch/record/141137/files/cer-000052203.pdf
- 5[5] Loup Verlet: Computer “Experiments” on Classical Fluids I. Thermodynamic Properties of Lennard-Jones Molecules. Phys. Rev. Vol. 159, Number 1, 5 July 1967, pp. 98-103
- 6[6] D. Bohm, B.J. Hiley: The Undivided Universe, Routledge, 1993
- 7[7] Jeffrey Bub: Von Neumann’s ’No Hidden Variables’ Proof: A Re-Appraisal. ar Xiv: 1006.0499 [quant-ph]
- 8[8] J. J. Sakurai: Modern Quantum Mechanics, Revised Edition, Addison-Wesley 1994
