Complementarity has empirically relevant consequences for the definition of quantum states
Bradley A. Foreman

TL;DR
This paper argues that applying Bohr's complementarity to subsystems necessitates redefining quantum states as equivalence classes, aligning theory with experimental entropy and resolving conflicts in measurement theory.
Contribution
It introduces a new definition of quantum states as equivalence classes, based on complementarity, which impacts the understanding of measurement in quantum mechanics.
Findings
Redefining quantum states resolves measurement conflicts.
Equivalence classes maximize information entropy.
Current measurement theories conflict with experimental results.
Abstract
The Copenhagen interpretation of quantum mechanics, which first took shape in Bohr's landmark 1928 paper on complementarity, remains an enigma. Although many physicists are skeptical about the necessity of Bohr's philosophical conclusions, his pragmatic message about the importance of the whole experimental arrangement is widely accepted. It is, however, generally also agreed that the Copenhagen interpretation has no direct consequences for the mathematical structure of quantum mechanics. Here I show that the application of Bohr's main concepts of complementarity to the subsystems of a closed system requires a change in the definition of the quantum state. The appropriate definition is as an equivalence class similar to that used by von Neumann to describe macroscopic subsystems. He showed that such equivalence classes are necessary in order to maximize information entropy and achieve…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsQuantum Mechanics and Applications · Quantum Information and Cryptography · Advanced Thermodynamics and Statistical Mechanics
Complementarity has empirically relevant consequences for the definition of quantum states
Bradley A. Foreman
Department of Physics, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong, China
Abstract
The Copenhagen interpretation of quantum mechanics, which first took shape in Bohr’s landmark 1928 paper on complementarity, remains an enigma. Although many physicists are skeptical about the necessity of Bohr’s philosophical conclusions, his pragmatic message about the importance of the whole experimental arrangement is widely accepted. It is, however, generally also agreed that the Copenhagen interpretation has no direct consequences for the mathematical structure of quantum mechanics. Here I show that the application of Bohr’s main concepts of complementarity to the subsystems of a closed system requires a change in the definition of the quantum state. The appropriate definition is as an equivalence class similar to that used by von Neumann to describe macroscopic subsystems. He showed that such equivalence classes are necessary in order to maximize information entropy and achieve agreement with experimental entropy. However, the significance of these results for the quantum theory of measurement has been overlooked. Current formulations of measurement theory are therefore manifestly in conflict with experiment. This conflict is resolved by the definition of the quantum state proposed here.
Introduction.
Despite nearly a century of effort, research into the foundations of quantum mechanics remains a Tower of Babel. Weinberg describes the situation well [][;reprintedin]Weinberg2017a; *Weinberg2018: “It is a bad sign that those physicists today who are most comfortable with quantum mechanics do not agree with one another about what it all means.” Many physicists believe that the most contentious issue, the measurement problem, was solved long ago by some variant of orthodoxy Englert (2013), and that the only remaining problem is with “a set of people” [AttributedtoH.Mabuchiby][; p.166.]Fuchs2011. However, the orthodox solution has been criticized by many others, including Gell-Mann, who lamented that “Niels Bohr brainwashed a whole generation of theorists into thinking that the job was done [in the 1920s]” Gell-Mann (1979).
Some of the more prevalent tongues heard in the Tower are those of the Copenhagen Bohr (1928, 1935, 1970) and Princeton von Neumann (1955a); Wigner (1979) schools (each of which has some claim to orthodoxy), the Everett relative-states or many-worlds theory Everett (1957); Wheeler (1957); Saunders et al. (2010), consistent histories Laloë (2019), pilot-wave or hidden-variables models Laloë (2019), dynamical-reduction models Bassi et al. (2013); Arndt and Hornberger (2014), and a range of approaches influenced by the concepts of quantum information, including reconstruction efforts Ball (2013) and QBism Fuchs et al. (2014). A cross-section of the diversity of current opinion can be found in conference surveys Schlosshauer et al. (2013) and interviews Schlosshauer (2011). Decoherence theory Joos et al. (2003); Zurek (1991, 2003); Schlosshauer (2007) has become increasingly popular, in no small part because it is said to have a foot in both the Copenhagen and Everett camps Zurek (1991, 2003); Camilleri (2009); with the advent of quantum Darwinism Zurek (2009), decoherence theory is now also viewed as a branch of quantum information theory.
The reconciliation of Bohr and Everett achieved thus far by decoherence theory is, however, rather limited, with Copenhagen ideas entering mainly via the language by which the results of decoherence theory can be described. The difficulty that many physicists have experienced in putting the amorphous Copenhagen philosophy to practical use is crystallized in the words of Mehra, as reported by Bell Bassi and Ghirardi (2007): “Though Dirac appreciated and admired Bohr greatly, he told me that he did not find any great significance in Bohr’s principles of correspondence and complementarity because they did not lead to any mathematical equations.” Bell responded Bassi and Ghirardi (2007) by saying that “I absolutely endorse this opinion” and added that he could not explain complementarity because he “never got the hang of it.” This opinion seems to be shared by most authors of quantum-mechanics textbooks, whose work is dominated by the mathematics of the Princeton school, with at most a few paragraphs of verbal garnish on the Copenhagen philosophy.
Here I show that a unification of the core concepts of Bohr and Everett does indeed lead to new equations. The key concepts in this regard are Bohr’s emphasis on the whole experimental arrangement—which implies that the properties of a measured microscopic subsystem acquire meaning only through its interaction with a macroscopic measuring apparatus and that “unperformed experiments have no results” Peres (1978)—and Everett’s insistence that all relevant subsystems, including those associated with a measuring apparatus or observer, be included in the quantum-mechanical description. A merger of this type can be viewed as a long-delayed but inevitable consequence of Bohr’s capitulation to Einstein by including the apparatus in the domain of applicability of Heisenberg’s uncertainty relations Bohr (1970). Such a merger was much desired by Wheeler during his unsuccessful attempt to mediate between Bohr and Everett Wheeler (1957); Osnaghi et al. (2009).
As shown here, a consistent combination of these ideas requires a change in the definition of a quantum state. The appropriate definition is not as a ray or density operator in Hilbert space, but as an equivalence class of stable information. Vital components of this definition have appeared previously in the work of von Neumann von Neumann (1955b), Landau and Lifshitz Landau and Lifshitz (1977), and Zurek Zurek (2000), but always in isolation, never as a coherent whole. This formulation of the quantum state has important experimental implications for the entropy of macroscopic subsystems von Neumann (1955b). A separate paper Foreman (2019) provides further details of this theory and shows that this definition of a quantum state solves—in the limited sense of emergence—the problem of outcomes, which is the last outstanding part of the quantum measurement problem Schlosshauer (2007). Perhaps this empirically meaningful unification of two of the major strands of thought in the foundations of quantum mechanics may help to resolve some differences of opinion in this field.
Direct and indirect reductions.
Let us start by considering a set of subsystems, each of which interacts strongly with its environment and therefore decoheres rapidly. Such a subsystem is said to be integrated with its environment. The set of integrated subsystems contains those exhibiting quasiclassical behavior, including macroscopic measuring apparatuses. Subsystems in set may be isolated in the sense that they interact weakly with their environments, apart perhaps from occasional strong interactions with measuring apparatuses in set . Integrated subsystems are typically macroscopic, whereas isolated subsystems are typically microscopic. The composite system is assumed to be closed—i.e., it does not interact with anything else.
In this closed system, a state vector generally has the entangled form
[TABLE]
in which and are complete orthonormal basis sets in and , respectively. This can always be written as a relative-state expansion Landau and Lifshitz (1977); Everett (1957)
[TABLE]
in which
[TABLE]
is the state in relative to in .
Consider now a typical quantum measurement situation, whereby a macroscopic apparatus in interacts with a microscopic subsystem in . The outcome of such a measurement is conventionally described as a reduction process generated by an exhaustive orthonormal set of projection operators :
[TABLE]
Note that in decoherence theory, the set and the subsystems used to define and are not given a priori. Rather, they are derived from a variational principle known as the “predictability sieve” Zurek (2003), which selects projectors and subsystems Foreman (2019) by the criteria of stability and predictability.
According to von Neumann von Neumann (1955a) and most quantum-mechanics textbooks, the projectors act either on the “measured” subsystem in or on all subsystems together. However, as stressed by Landau and Lifshitz Landau and Lifshitz (1977), the measurement process is better described by using projectors that act nontrivially only in :
[TABLE]
The reason for this is simply that we obtain our information about the outcome of the measurement from the measuring apparatus, not from the microscopic subsystem. The vector subspace defined by the projector is usually associated with a set of collective variables, such as those in which the center of mass of the apparatus pointer lies in some given volume in coordinate space. But in a typical quantum measurement situation, all of the states in this subspace may have the same relative state . The direct reduction of generated by thus leads to an indirect reduction of , in which the state of is (in this example) reduced to the pure state . This indirect reduction mechanism Landau and Lifshitz (1977) is an explicit implementation of Bohr’s principle Bohr (1928) that the properties of a microscopic subsystem acquire meaning only through their correlations with those of a quasiclassical apparatus.
The reduction process is most convenient to write in terms of density operators , where in the special case of a pure state described above. It can be separated conceptually into two stages, the first of which is the elimination of interference between the alternatives :
[TABLE]
The second stage is the selection of one individual outcome from among these alternatives:
[TABLE]
in which is the probability of obtaining outcome . This is the Lüders reduction process Lüders (1950), which is used in most textbooks for the description of ideal measurements. Regardless of whether one chooses to portray reduction as real, illusory, emergent, or merely the acquisition of information, it must be taken into account if one is to make contact with experimental physics Everett (1957); Joos et al. (2003).
Superselection rules and equivalence classes.
The elimination of interference in Eq. (6) is commonly described in decoherence theory as the emergence of an “environment-induced superselection rule” Zurek (1991, 2003). A brief detour into the theory of superselection rules in algebraic quantum mechanics Beltrametti and Cassinelli (1981) is helpful in explaining the reason for this. Superselection rules are used to implement the hypothesis that, contrary to the tacit assumption used in most elementary textbooks, not all operators in Hilbert space correspond to observable quantities.
Let be some set of operators in Hilbert space. Its commutant is defined as the set of bounded operators that commute with all operators in ; the bicommutant is defined likewise as . The algebra of observables generated by the set is then defined to be ; it satisfies . The basic hypothesis under consideration is that physical quantities in quantum mechanics are limited to operators in , or operators whose spectral decomposition contains only projectors in .
The nontrivial elements of (i.e., those not proportional to the identity operator) are called superselection operators. The center of is defined as ; operators in are called classical observables. In quantum mechanics, it happens to be true that Beltrametti and Cassinelli (1981), which implies that . In a system with discrete superselection rules (the only type considered here), every superselection operator is of the form Beltrametti and Cassinelli (1981)
[TABLE]
Here the set is as shown in Eq. (4), although in general need not have the form given in Eq. (5). In such a system, the choice of completely defines the algebra of observables, because is generated by . Conversely, is the set of nontrivial projectors contained in .
A state in quantum mechanics is defined as a probability measure over the set of all projectors Beltrametti and Cassinelli (1981). For a system with no superselection rules, it is well known that this probability measure can be written as , where is a density operator that belongs to Gleason (1957). Quantum states are then in one-to-one correspondence with density operators Gleason (1957).
However, this one-to-one correspondence is broken in a system with superselection rules Beltrametti and Cassinelli (1981). A quantum state can then only be identified with an equivalence class of density operators
[TABLE]
in which the equivalence relation is defined by
[TABLE]
In such a system, a density operator is generally not a member of . However, the reduced density operator in Eq. (6) belongs to both and . In fact, is the unique member of the set Beltrametti and Cassinelli (1981). The one-to-one correspondence between and then implies that the equivalence class can be represented mathematically by the reduced density operator . The equality holds if and only if commutes with all Beltrametti and Cassinelli (1981).
The fact that is the canonical representative of the quantum state is the foundation for Jauch’s theory of measurement Jauch (1964). From this perspective, the first stage (6) of the Lüders reduction process is no change at all, because and correspond to the same quantum state . The second stage (7) can then be viewed as merely the selection of one alternative from a classical statistical mixture . The measurement problem in quantum mechanics thus “dissolves into a pseudoproblem” Jauch (1964).
Equivalence classes from complementarity.
A major conceptual flaw in this approach is that, for a closed system, the “observables” in have little to do with what is observed in an experiment. According to this theory, both and are considered to be observables, because they belong to . However, in an experiment, one certainly never has access to the full details of the density operator at the time of reduction. All that one can really deduce from the location of the apparatus pointer is that lies somewhere in the subspace defined by the projector in Eq. (5):
[TABLE]
A more appropriate definition of the quantum state would take into account this limit on the information that is actually accessible in an experiment.
Von Neumann has proposed an alternative definition of equivalence classes, for the special case of systems consisting entirely of integrated subsystems, that is based directly on the meaningful information content of the reduced state von Neumann (1955b). Here von Neumann’s definition is modified to include the indirect reduction of isolated subsystems as well. In this modified definition, the equivalence relation (10) is replaced with
[TABLE]
in which is defined in Eqs. (4) and (5) and is the set of all projectors that act nontrivially only in . The fact that is arbitrary means that the isolated subsystems in are treated in the same way as a system without superselection rules in the standard theory of Eq. (10). However, is limited to the set used to perform the reduction that actually takes place at the given time von Neumann (1955b). This is a concrete implementation of Bohr’s principle of complementarity Bohr (1928, 1935, 1970), according to which (in the memorable phrase coined by Peres Peres (1978)) “unperformed experiments have no results.”
The equivalence relation (12) can be rewritten in the simpler form
[TABLE]
in which denotes a partial trace over the subsystems in , the result of which is an operator in . If the equivalence class (9) is now redefined in terms of this equivalence relation, the reduced density operator in Eq. (6) can no longer serve as the canonical representative of . It must likewise be redefined as
[TABLE]
in which is the same as before and . Here and are normalized density operators; is called the conditional state of given the state of Schumacher and Westmoreland (2010). Such conditional states are used in the definition of quantum discord Zurek (2000); Ollivier and Zurek (2001), where they play a role similar to that of the relative state in Eqs. (2) and (3). Once again, represents an indirect reduction of that accompanies the direct reduction of generated by .
Equation (14) is the main result of this paper. This modification of the reduced density operator brings it back into one-to-one correspondence with the modified equivalence class (i.e., if and only if ). This is a straightforward consequence of the orthogonality of the projectors in the set (4). Zurek has argued that such orthogonality is necessary in order for the states of macroscopic subsystems to be repeatedly accessible (a criterion closely related to the stability and predictability criteria discussed below), using concepts very similar to the equivalence classes of von Neumann Zurek (2013).
The entropy of can also be defined as that of von Neumann (1955b), because has the greatest von Neumann entropy of any member of . This can be seen directly from the definition (14), because clearly has the maximum entropy of any state in the subspace defined by , whereas the value of is fixed by the equivalence relation (13). The change of reduced density operator in going from Eq. (6) to Eq. (14) can thus be thought of as a further stage of coarse-graining in which all information about the details of any particular state in is discarded.
Experimental significance.
This additional loss of information has crucial experimental consequences for the entropy of macroscopic subsystems. As stressed by Jaynes Jaynes (1963, 1965, 1985), for any density operator that satisfies a given set of macroscopic constraints (such as those that define the collective variables of integrated subsystems) is related to the experimental entropy measured under the same constraints by , where the equality holds if and only if has been maximized with respect to all unconstrained variables in . But this means that for the reduced state (14), whereas for the Lüders reduced state (6), because the latter does not satisfy the maximum-entropy condition stated in the previous paragraph. Therefore, even though the Lüders reduction process generates an increase of entropy in qualitative agreement with the second law of thermodynamics, it cannot reproduce the quantitative time dependence of . Previous analyses of entropy changes during the measurement process Zeh (2007) have not taken this into account. Attaining the equality was in fact the reason why von Neumann introduced his equivalence-class definition of quantum states (for the special case of entirely integrated subsystems) in the first place von Neumann (1955b).
The standard formulation of the quantum theory of measurement is therefore fundamentally flawed. To avoid this discrepancy with experiment it would be necessary to restrict the domain of applicability of quantum mechanics to microscopic subsystems (excluding measuring apparatuses), a step that Bohr decisively rejected Bohr (1928, 1935, 1970). As Peres has wryly observed Peres (1995), such a restriction would be tantamount to treating measurement as a “supernatural event.”
The two reduction processes also yield somewhat different predictions for the results of measurements on isolated subsystems, although here the difference is more subtle. The difference occurs because the Lüders reduction process selects one particular state in the manifold —generally not a maximum-entropy state. This difference should, however, be statistically insignificant, at least in the short term, because nearly all of the states in would lead to the same short-term dynamics of the collective variables.
Daneri, Loinger, and Prosperi Daneri et al. (1962) used ergodic theory to justify a reduced density operator that is similar to Eq. (14), except that their is a pure state obtained by direct reduction. Such a definition generally disagrees with the result (14b) of indirect reduction and thus disagrees with experiment. These authors also did not point out the experimental implications of their theory for the entropy of macroscopic subsystems. Their formulation of the reduction process thus never managed to supersede the standard formulation, and it has since fallen into half a century of disuse.
Consistency of the theoretical description.
Although and correspond to the same quantum state , one may question whether the replacement of with could lead to any inconsistency in the theoretical description. The key to avoiding inconsistency is that the time interval between reductions should be chosen to satisfy
[TABLE]
in which is the maximum decoherence time for the integrated subsystems in and is the timescale for changes in the projector set . The condition ensures that different reductions do not interfere with one another, whereas expresses the requirement that change slowly in time. The latter is a stability constraint that ensures the reduction process can be regarded as emergent Foreman (2019) prior to the introduction of the equivalence class . The satisfaction of the conditions (15) is contingent upon the initial quantum state and the definition of . The latter is chosen here in accordance with a modified version Foreman (2019) of Zurek’s predictability sieve Zurek (2003), in which is defined so as to generate the least entropy during the interval .
Another conceivable source of inconsistency has been discussed by d’Espagnat d’Espagnat (1976). This arises because a measuring apparatus in and its corresponding measured subsystem in are entangled in but not in . One could therefore conceivably perform a measurement on the composite system (of apparatus and isolated subsystem) that would reveal this entanglement, thus demonstrating an inconsistency in the reduction process .
The reason why this is not an actual source of inconsistency has nothing to do with the fact that such a secondary measurement would be technologically demanding. It is instead due to the fact that a measuring apparatus can only function as such if it is truly integrated Heisenberg (1958); Peres and Zurek (1982); Peres (1986) (i.e., if it interacts strongly with its environment, thereby generating significant entanglement on the timescale ). In order for a secondary measurement to reveal the entanglement between the primary apparatus and the isolated subsystem, it would therefore be necessary to isolate the primary apparatus from its environment (e.g., by cooling it down to a very low temperature, among other things). But this changes the experimental arrangement in such a way that the primary apparatus no longer functions as a measuring device. The secondary measurement thus fails to demonstrate an inconsistency in the theoretical description of the primary measurement; it merely replaces one experiment with an entirely different experiment. This illustrates once again the importance of the concept that “unperformed experiments have no results” Bohr (1935); Peres (1978).
Conceptual difficulties of the type described above can often be avoided by thinking of a measurement as something that we infer has happened from the correlations between subsystems in the reduced state , rather than an active intervention performed by an agent upon another subsystem. It should be stressed here that the presumed freedom of an experimenter to manipulate the conditions of an experiment is actually a necessary prerequisite for the pragmatic description of experimental physics Bohr (1935); Peres and Zurek (1982). The use of descriptive language involving active agents is therefore not incorrect—however, the validity of such a description emerges only on a timescale , when the behavior of the agent becomes so unpredictable that the concept of “free will” can be invoked without fear of contradiction. Maintaining a conceptual distinction between the emergence of outcomes Foreman (2019) on the timescale and the emergence of free will on the timescale is therefore crucial for logical consistency.
Origins in decoherence theory.
The mathematical form of the reduced state in Eq. (14) was derived from two of the core principles of complementarity: (1) that we obtain information about isolated subsystems exclusively through their interactions with integrated subsystems and (2) that unperformed experiments have no results. These principles have hitherto been treated more or less as axioms of the Copenhagen interpretation.
However, both are corollaries of the most basic principle of decoherence theory—namely, that all information in quantum mechanics must be extracted from structures in the quantum state that are stable in time. This principle of dynamical stability is the reason why pointer variables are required to be robust under environmental monitoring. As discussed previously, it is implemented mathematically by using the predictability sieve to identify stable subsystems and projectors.
Principle (1) is just a special case of the dynamical stability principle, because the integrated subsystems in principle (1) are derived from the predictability sieve. Examples of such subsystems include measuring instruments whose pointers are described by quasiclassical collective variables. Isolated subsystems, by contrast, are fragile and have no stable properties that can be defined independently of the integrated subsystems with which they interact.
Principle (2) then follows immediately, because the dynamically stable pointer variables are not arbitrary. Subsystems and basis states can indeed be chosen arbitrarily, but essentially only one such configuration yields dynamically stable experimental information. A given quantum state cannot be described in terms of different experimental arrangements leading to different (i.e., complementary) types of experimental information. All experimental arrangements other than the dynamically stable one must be regarded as unperformed.
Common ground.
As shown above, the principle of dynamical stability in decoherence theory leads to several key elements of Bohr’s principle of complementarity, which in turn have definite implications for the mathematical structure of the quantum state. The expression of these results in mathematical form clarifies the meaning of complementarity and makes it easier for everyone to “get the hang of it.” However, this mode of derivation also shows that the significance of these results extends beyond the philosophy of the Copenhagen interpretation. The extraction of dynamically stable experimental information is an operational issue that must be dealt with at some point by every interpretation of quantum mechanics. This is highlighted by the fact that the modified definition of the quantum state proposed here has unambiguous experimental implications for the entropy of macroscopic subsystems. This establishes an area of common ground that may help to bring different workers in the field of quantum foundations closer together.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Weinberg (2017) S. Weinberg, The trouble with quantum mechanics, N.Y. Rev. Books 64 (January 19, 2017) .
- 2Weinberg (2018) S. Weinberg, Third Thoughts (Harvard University Press, 2018).
- 3Englert (2013) B.-G. Englert, On quantum theory, Eur. Phys. J. D 67 , 238 (2013) . · doi ↗
- 4Fuchs (2011) C. A. Fuchs, Coming of Age with Quantum Information: Notes on a Paulian Idea (Cambridge University Press, 2011). · doi ↗
- 5Gell-Mann (1979) M. Gell-Mann, What are the building blocks of matter?, in The Nature of the Physical Universe: 1976 Nobel Conference , edited by D. Huff and O. Prewett (Wiley, New York, 1979) pp. 29–45.
- 6Bohr (1928) N. Bohr, The quantum postulate and the recent development of atomic theory, Nature 121 , 580 (1928) . · doi ↗
- 7Bohr (1935) N. Bohr, Can quantum-mechanical description of physical reality be considered complete?, Phys. Rev. 48 , 696 (1935) . · doi ↗
- 8Bohr (1970) N. Bohr, Discussion with Einstein on epistemological problems in atomic physics, in Albert Einstein: Philosopher–Scientist , The Library of Living Philosophers, Vol. 7, edited by P. A. Schilpp (Open Court, La Salle, Illinois, 1970) 3rd ed., pp. 199–241.
