Against the Tyranny of Pure States in Quantum Theory
Christian de Ronde, C\'esar Massri

TL;DR
This paper critiques the inconsistent definitions of pure states in quantum mechanics, revealing fundamental issues with current conceptual frameworks and emphasizing the need for clearer operational interpretations.
Contribution
It identifies and formalizes the inconsistencies between two definitions of purity in quantum theory, providing a theorem that exposes their fundamental incompatibility.
Findings
Operational and trace-invariant purity definitions are incompatible.
Theorem demonstrating the fundamental inconsistency between the two purity notions.
Highlights conceptual issues in the standard interpretation of pure states.
Abstract
We argue that the notion of pure sate within Standard Quantum Mechanics is presently applied within the specialized literature in relation to two mutually inconsistent definitions. While the first (operational purity) provides a basis-dependent definition which makes reference to the certain prediction of measurement outcomes, the latter (trace-invariant purity) provides a purely abstract invariant definition which lacks operational content. In this work we derive a theorem which exposes the serious inconsistencies existent within these two incompatible definitions of purity.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Against the Tyranny of
‘Pure States’ in Quantum Theory
C. de Ronde1,2,3,4 and C. Massri5,6
Abstract
We argue that the notion of pure sate within Standard Quantum Mechanics is presently applied within the specialized literature in relation to two mutually inconsistent definitions. While the first (operational purity) provides a basis-dependent definition which makes reference to the certain prediction of measurement outcomes, the latter (trace-invariant purity) provides a purely abstract invariant definition which lacks operational content. In this work we derive a theorem which exposes the serious inconsistencies existent within these two incompatible definitions of purity.
-
Philosophy Institute Dr. A. Korn, University of Buenos Aires - CONICET
-
Center Leo Apostel for Interdisciplinary Studies
Foundations of the Exact Sciences - Vrije Universiteit Brussel
-
Institute of Engineering - National University Arturo Jauretche
-
Federal University of Santa Catarina, Brasil
-
Institute of Mathematical Investigations Luis A. Santaló, UBA - CONICET
-
University CAECE
**Keywords: pure state, mixture, invariance, quantum mechanics. **
1 Pure States in Quantum Mechanics
Today, the notion of pure state plays an essential role within the ongoing research that takes place in the context of what is considered to be the “Standard” version of Quantum Mechanics (QM).111Even though there are many interpretations of QM, some of which change the mathematical formalism of the theory (e.g., Bohm and GRW), there is a “Standard” version of QM which is taught in Universities all around the world. The kernel role of pure state has also affected the fields Quantum Foundations and Quantum Information through its centrality regarding the definition of quantum entanglement. This notion was introduced during the axiomatic formulation of the theory in the 1930s, and has become increasingly dominant establishing also an ontological primacy over the so called mixed states. As explained by David Mermin [19, p. 758]: “[P]eople distinguish between pure and mixed states. It is often said that a system is in a pure state if we have maximum knowledge of the system, while it is in a mixed state if our knowledge of the system is incomplete.” The explicit reference to “our knowledge” is strictly related to the widespread —specially, in the field of quantum information— operational definition of a pure state in terms of certain predictions. This definition rests in what is known as a maximal test: ‘If a quantum system is prepared in such way that one can obtain a maximal test yielding with certainty (probability = 1) a particular outcome, then it is said that the quantum system is in a pure state.’ In turn, the notion of maximal test allows to interpret a quantum observable as being an actual property —i.e., a property that will yield the answer yes when being measured (see for discussion [3, 15, 17, 20, 26, 23]). It is then stated that the pure state of a quantum system is described by a unit vector in a Hilbert space, , which in Dirac’s notation is written as . Depending on the basis, a pure state in is also represented by a superposition of states:
[TABLE]
As it is well known, such superposed states yield uncertain results. Indeterminate or potential properties pertaining to superposed states might or might not become actualized in a future instant of time. These properties are regarded in the orthodox literature as being uncertain and thus, cannot be considered as elements of physical reality (in the EPR sense [14]) nor interpreted in terms of ignorance. Asher Peres explains this important point in the following manner:
“According to quantum theory, we have a choice between different, mutually incompatible tests. For example, we may orient the Stern-Gerlach magnet in any direction we please. Why then is such a Stern-Gerlach test called complete? The reason can be stated as the following postulate:
A. Statistical determinism. If a quantum system is prepared in such a way that it certainly yields a predictable outcome in a specified maximal test, the various outcomes of any other test have definite probabilities. In particular, these probabilities do not depend on the details of the procedure used for preparing the quantum system, so that it yields a specific outcome in the given maximal test. A system prepared in such a way is said to be in a pure state.” [22, p. 66]
It is also accepted in Standard QM that the notion of pure state can be extended to density operators. This is an essential step that has been worked out by mathematical physicists who regard the previous definition as “too limited” (see [16, p. 419]), specially when considering systems and subsystems. Let be a Hilbert space, a density operator (i.e., a positive trace class operator with trace 1) is called a state. Being positive and self-adjoint, the eigenvalues of are non-negative and real and it is possible to diagonalize it. Pure states are represented by rank 1 matrices which in their diagonal form will be given by —where the place occupied by the ‘1’ is unimportant. In this case, is equal to for some normalized vector . If the rank of is grater than 1 (or equivalently if ), the state is called a mixed state; or in short, a mixture. For example, the vector , , gives the following density matrix:
[TABLE]
Note that, if is a pure state (i.e., ), there always exists a basis in which the matrix can be diagonalized as:
[TABLE]
Unlike the case of pure states, mixtures cannot be represented as a unit vector, . Instead, mixed states are conceived as mixtures of pure states and represented as their convex sums:
[TABLE]
Thus, while pure states seem to refer to the existence of an observable which, if measured, will be obtained with certainty (probability equal to 1), mixed states do not. There will exist no single context of measurement (no basis) for which a mixed state will predict with certainty a yes-no answer for a specific observable. Indeed, if , then whatever basis we choose to write , the value will remain unaltered. Therefore, the matrix of in that basis cannot represent a pure state since for pure states we have . Mixtures are interpreted in terms of ignorance regarding the pure state in which the quantum system really is.222The reference to ‘mixtures’ —as contra-posed to ‘pure states’— is extremely problematic for it erases the fundamental distinction between quantum mixture and classical mixture; a well known distinction in the specialized literature which Bernard d’Espagnat termed proper and improper (see [10, chap. 6]). As remarked by Nancy Cartwright in [6]: “The ignorance interpretation asserts that each member of the collection is in one of the pure states in the sum —it is only our ignorance which prevents us from telling the right pure state for any specific member.” Thus, contrary to the case of pure states, when considering mixed states, all observables are uncertain; they all possess a probability which pertains to the open interval . As stressed by Cartwright: “The ignorance interpretation is the orthodox interpretation for mixtures, and should not be confused with the ignorance interpretation for superpositions, which has been largely abandoned.” As an example of a mixed state (i.e., ) we can consider the following diagonal matrix,
[TABLE]
According to orthodoxy this mixed state provides minimal knowledge about the pure state the quantum system is presupposed to be in. Thus, it is impossible to know —in any bases— which pure state will be obtained after a measurement is performed. Let us remark that this definition of pure state is also essential in the field of quantum information due to its connection to the measure of entanglement through the von Neumann entropy (see [4]). As remarked by Brian Hall: “we may define the von Neumann entropy of a density matrix by where is defined by the functional calculus. (Since , we interpret as being [math]). Since the eigenvalues of are all between [math] and , we see that is a non-negative self-adjoint operator, which has a well-defined trace, which may have the value . According to Exercise 8, a density matrix is a pure state if and only if .”
Up to now, we have just given a summary of the orthodox way in which pure states and mixtures are commonly presented in Standard QM. In the present work we attempt to clearly distinguish between these two different definitions of the notion of ‘pure state’ which are indistinctively applied in the specialized literature. While both definitions are assumed to be equivalent, we will show that this is clearly not the case. More specifically, we will expose a series of inconsistencies between these two different definitions, one contextual and another non-contextual, one possessing a clear operational content and the other purely abstract. In order to understand how these inconsistencies —present within the definition(s) of pure state— were introduced within Standard QM, we will go back, in section 2, to the original meaning of state in classical physics. We will then continue to address, in section 3, the re-definition of the notion of state within Paul Dirac’s 1930 formulation of the theory of quanta. In section 4 we will be then ready to expose the inconsistencies we are referring to. In particular we will derive a theorem which exposes the problem.
2 The Notion of ‘State’ in Classical Physics
As remarked by Arthur Fine [24, p. 94]: “Wolfgang Pauli thought that using the word ‘state’ (Zustand) in QM was not a good idea, since it conveyed misleading expectations from classical dynamics.” The notions of system and state have always played an essential role in all classical physical theories going from classical mechanics and wave mechanics to electromagnetism and classical statistical mechanics. The way in which these notions were developed is intrinsically related to the role played by two kernel notions that appeared with modern physics, namely, invariance and objectivity. In this section, before entering the more specific debate about quantum states and purity, we would like to provide an analysis of the previous understanding of physical state —before QM. The notion of system is essential to classical physics, it comprises the formal-conceptual moment of unity of which the representation of the physical theory talks about. Each physical theory has constructed its own specific notion of what a system is. While in classical mechanics the system is given by the notion of ‘particle’ or ‘rigid body’ (i.e., a compound of particles), in wave mechanics its role is played by the notion of ‘wave’ and in electromagnetism the main notion is that of ‘field’. Every theory has its own mathematical formalism and its own specific notions at play. In all theories, the systems (i.e., particles, fields or waves) provide the basic reference to a specific representation of a real state of affairs which evolves in time. This evolution described through the interaction between systems is computed in mathematical terms via an equation of motion (or evolution). A kernel presupposition in modern physics which helps to build up this type of representation is that the state of affairs described by the theory must be completely detached from any reference frame or subject. The physical notion of state has the main purpose to provide the specific characteristics of a system, in a specific given situation or actual state of affairs. For example, the particle 1 has position and velocity while particle 2 has position and velocity , etc. The specification of the state of each system the theory talks about is always relative to another essential notion in physics —one which every student of physics learns in the first class she attends—, namely, that of reference frame. Students are taught that it is only when considering a frame of reference that the notion of state becomes applicable. In physics, the specific values of the properties of a system (i.e., the state) have meaning only when considered as relative to a specific viewpoint. Thus, without a reference frame there can be no consistent definition of a state. At this point, the notion of physical invariance becomes essential in order to secure the consistency of the different reference-frame dependent representations of the state of affairs being addressed. It is important to stress that this notion is not purely mathematical (see for discussion [25]). Mathematical invariance is a very general abstract concept which can be defined in the following manner. Let be a group acting on spaces and . A function is called invariant if maps orbits to orbits, that is, if for all and . Let us give an example of an invariant quantity (). Consider the space of matrices and let be the group of invertible matrices acting by conjugation. Then, the trace (or the determinant) is invariant, (where is a unitary matrix). Something which distinguishes the physical notion of invariance is that it is essentially operational, it requires a link with experience. In fact, this was the essential point raised by Albert Einstein in his criticism about the notion of simultaneity.
“The concept does not exist for the physicist until he has the possibility of discovering whether or not it is fulfilled in an actual case. We thus require a definition of simultaneity such that this definition supplies us with the method by means of which, in the present case, he can decide by experiment whether or not both the lightning strokes occurred simultaneously. As long as this requirement is not satisfied, I allow myself to be deceived as a physicist (and of course the same applies if I am not a physicist), when I imagine that I am able to attach a meaning to the statement of simultaneity. (I would ask the reader not to proceed farther until he is fully convinced on this point.)” [13, p. 26]
The essential role played by operational invariance is that it allows to consider experience from different reference frames (and observers) in a consistent manner as making reference to the same state of affairs. In turn, as Einstein constantly remarked, this makes possible to address the (real) existence of a state of affairs as detached from particular observations and reference frames.333In more general terms, as discussed in [8], it is exactly this formal aspect which allows us to talk in terms of an Actual State of Affairs (ASA) that evolves in time; i.e., a dynamical description in terms of the variation of (objective) definite valued observables (or ‘dynamical properties’) independent of the (subjective choice of the) perspective (or reference frame) from which they are being observed. Even in relativity theory, due to the Lorentz transformations, one can still consider ‘events’ as the building blocks of physical reality.
In physics, the meaning of invariance deals with physical quantities which not only have the same value for any reference frame, but —as important— have also a consistent translation of dynamical values. In general, the transformations that allow us to consider the physical magnitudes from different frames of reference have the property of forming a group (see [29, 30]). In the case of classical mechanics we have the Galilei transformations which keep space and time apart, while in relativity theory we have the Lorentz transformations which introduce an intimate connection between space and time coordinates. Of course, restricting ourselves to physical magnitudes that remain always the same, independently of the reference frame, does not provide a dynamical picture. Such a description only provides a static table of data. This is of course completely uninteresting for physics, which always attempts to represent, not only a given state of affairs in a given instant of time but —far more importantly— how that state of affairs changes. Thus, that which matters the most for a physical description is the invariant variations of physical magnitudes, that is, the dynamical magnitudes which vary but can be still considered as being the same (e.g., position, velocity, momentum, etc.). The difference within the identity. Even though the values of physical magnitudes might vary from one reference frame to the other, in all classical theories —including relativity theory— there exist a consistent translation between the values of magnitudes of the different reference frames secured by the transformation laws. The position of a rabbit running through a field and observed by a distant passenger of a high speed train (the reference frame ) can be translated to the position of that same rabbit now considered from the perspective of another passenger waiting on the platform of the station (the reference frame ). This relation (of the values between and ) is provided via the transformation laws. Such transformations include not only the dynamics of the observables but also the dynamics between the different observers (see for a detailed analysis [7]). The fact that the values of position, momentum, etc., can be consistently translated from one reference frame to the other allows us to assume that these physical observables bear an objective real existence completely independent of the specific choice of the reference frame pertaining to each observer. It is this consistency within translation which allows the physicist to claim that: the rabbit has a set of dynamical properties (position, a momentum, etc.) independently of any of the subjects observing it —in the train, on the platform or anywhere else. Thus, it is operational invariance, namely, the invariance with respect to the values of measurable properties, which interests the physicist and allows her to constitute what is to be considered the same irrespectively of the reference frame or basis (i.e., the perspective) from which it is being represented or —even— experienced. Invariants capture the objective (non-contextual) content of a theory by providing a consistent translation between different reference frames. And in this way, reference frames as well as (empirical) subjects become superfluous to the physical representation.
In QM the reference frame is given by the mathematical notion of basis and the group that allow us to go from one basis to another is the unitary group. It might be noted that from a conceptual perspective, a mathematical frame of reference (or basis) can be linked to a particular perspective of analysis or an experimental situation (see [3]). Something which in QM is also known as context. As we shall discuss in the following section, during the 20th Century the meaning and content of notion of physical state was radically transformed within quantum theory. Unlike in the classical case we have just discussed the notion of quantum state would, at least partly, become basis dependent or contextual.
3 The Contextual Definition of ‘Quantum State’
Since Paul Dirac’s textbook formulation there has been a deep problem in order to address the meaning of states and systems in QM. Even today, there still exists within the specialized literature a deep and widespread disorientation related to the meaning of these notions which are commonly applied in inconsistent manners, sometimes referring to abstract mathematical properties while others to “small particles” (see [24, Chap. 4]). We believe that the reason behind this situation is intrinsically related to the inadequate interpretation, introduced by Dirac, of the the orthodox mathematical vectorial formulation of QM in terms of these notions.
As we have just seen in the previous section, in modern physics a state made reference to the specific features of an object in a given physical situation. The essential point is that the different reference frame dependent representations of the same state of the system could be related through a group of transformations. Galilean transformation in classical physics and Lorentz transformations in Relativity theory. It is the consistency secured by the transformations which allowed to talk of the same state of affairs regardless of the chosen reference frame. However, in his 1930 book The Principles of Quantum Theory, Dirac related each state with a single observation, mathematically represented by —what he himself called— a ket vector. As a consequence, it made perfect sense to argue that the same system could be represented in terms of different states, depending on the specific basis:
“[E]ach state of a dynamical system at a particular time corresponds to a ket vector, the correspondence being such that if a state results from the superposition of certain other states, its corresponding ket vector is expressible linearity in terms of the corresponding ket vectors of the other states, and conversely. Thus the state results from a superposition of the states and when the corresponding ket vectors are connected by |R\rangle\ =c_{1}|A\rangle\ +c_{2}|B\rangle\.” [12, p. 16]
As it becomes clear, depending on the chosen basis (or reference frame), the representation of the system was given in terms of different states. Bases did not make anymore reference to the same state of the object under study, but instead to different states. Each reference frame (or basis) made thus reference to a sub-set of possible states of the quantum system.
This radical transformation in the meaning of (quantum) state is related in the literature to the contextual nature of the theory of quanta, something that Niels Bohr would make even more explicit in his famous reply to the EPR paper [5]. Thus, the classical presupposition of physics according to which a system can only have a single state at a time was abandoned as a prejudice of the past. The quantum state, or superposition of states, was determined by the choice of the basis regardless of any invariant theoretical condition between the different basis. Dirac simply blamed his controversial definition of quantum states on QM itself and the limits it imposed on the very possibilities of representation. Of course, this was not considered by Dirac [12] as an essential problem for he had already “remarked that the main object of physical science is not the provision of pictures, but the formulation of laws governing phenomena and the application of these laws to the discovery of phenomena. If a picture exists, so much the better; but whether a picture exists of not is a matter of only secondary importance.” As he made the point, it is always “important to remember that science is concerned only with observable things”.
What Dirac did not seem to recognize was that his interpretation implied a much deeper commitment, one which precluded the possibility for QM of an invariant operational discourse (section 2). The abstract invariance of unit vectors with respect to the trace seemed enough to Dirac to restore the reference to quantum systems. During the 1940s and 1950s Dirac’s state-vectorial formulation was established in textbook QM through the essential introduction of the notion of pure state. A notion which made even more explicit the double reference, on the one hand, to the abstract trace-invariance of a unit vector, and on the other, to the non-invariant certain prediction of a single observable.444Today, the reference to states has become explicitly instrumental. As pointed out by Chris Timpson [27], for many contemporary researchers, the quantum state does not represent “how things are in an external, objective world, it merely represents what information one has. Mermin (2001), Peierls (1991), Wheeler (1990) and Zeilinger (1999) have all endorsed this kind of view. Hartle (1968, p. 709) provides an excellent summary: ‘The state is not an objective property of an individual system but is that information, obtained from a knowledge of how a system was prepared, which can be used for making predictions about future measurements’.” Dirac’s redefinition of the notion of state, and the subsequent introduction of the notion of pure state within the orthodox literature can be also understood as intrinsically related to the double role played by the notion of actuality within QM. Indeed, actuality has two different —not necessarily compatible— meanings and uses which have been systematically confused and scrambled. Firstly, there is an empiricist (or subjectivist) understanding of actuality relative to the hic et nunc experience of an individual agent. According to Bas van Fraassen [28, p. 197], one of today’s most prominent empiricists: “the only believe involved in accepting a scientific theory is belief that it is empirically adequate: all that is both actual and observable finds a place in some model of the theory. So far as empirical adequacy is concerned, the theory would be just as good if there existed nothing at all that was either unobservable or not actual. Acceptance of the theory does not commit us to belief in the reality of either sort of thing.” This first meaning of actuality can be resumed in the following manner:
Definition 3.1** (Empiricist Actuality)**
Actuality as making reference to the here and now empirical observations made by individual subjects (or agents).
Secondly, actuality is also —implicitly— understood in metaphysical terms as characterizing a mode of existence —completely independent of observations. In the XVII Century, within the Newtonian mechanical description of the world, any indetermination —related in the Aristotelian scheme to the potential realm of being— was erased from the physical representation of reality. Within classical mechanics, every physical system could be described exclusively by means of its actual properties. As remarked by Dennis Dieks:
“In classical physics the most fundamental description of a physical system (a point in phase space) reflects only the actual, and nothing that is merely possible. It is true that sometimes states involving probabilities occur in classical physics: think of the probability distributions in statistical mechanics. But the occurrence of possibilities in such cases merely reflects our ignorance about what is actual. The statistical states do not correspond to features of the actual system (unlike the case of the quantum mechanical superpositions), but quantify our lack of knowledge of those actual features.” [11, p. 124]
This second understanding of actuality which can be defined without any reference whatsoever to observability is of course purely formal and metaphysical. As discussed in detail in [8], an Actual State of Affairs (ASA) can be defined as a closed system considered in terms of a set of actual (definite valued) properties which can be thought as a map from the set of properties to the . Specifically, an ASA is a function from the set of properties to satisfying certain compatibility conditions. We say that the property is true if and is false if . The evolution of an ASA is formalized by the fact that the morphism satisfies . Diagrammatically,
[TABLE]
Then, given that , the truth of is equivalent to the truth of . This formalization comprises the idea that the properties of a system remain existent through the evolution of the system. The model allows then to claim that the truth or falsity of a property is independent of particular observations. Or in other words, binary-valuations are a formal way to capture the classical actualist (metaphysical) representation of physics according to which the properties of objects preexist to their measurement.
Definition 3.2** (Metaphysical Actuality)**
Actuality as making reference to a mode of existence defined in terms of definite binary valuedness of properties which evolve according to the metaphysical representation —completely independently of subjects and their observations or measurements.
Maybe the best exposure of this scrambling can be found within the famous definition of element of physical reality presented in the famous 1935 paper by Einstein, Podolsky and Rosen [14].
Element of Physical Reality: If, without in any way disturbing a system, we can predict with certainty (i.e., with probability equal to unity) the value of a physical quantity, then there exists an element of reality corresponding to that quantity.
As remarked by Diederik Aerts and Massimiliano Sassoli de Bianchi [2, p. 20]: “An element of reality is a state of prediction: a property of an entity that we know is actual, in the sense that, should we decide to observe it (i.e., to test its actuality), the outcome of the observation would be certainly successful.” It is in this way that the relation between observation and reality is pasted together. Reality is not anymore defined in terms of a theoretical representation, but instead in terms of certain observability. The pasting is also exposed by the dualistic reference to both properties and observables in the context of QM. This shift marks the subtle transmutation form a metaphysical discourse to a subjectivist one. It is interesting to note that the notion of pure state in QM has an analogous role to the one played by actuality within the present widespread empirical-positivist understanding of physical theories. Just like the notion of actuality has a double reference, on the one hand to a metaphysical mode of existence (actual property) and on the other to empirical observation (actual observable), the notion of pure state scrambles a specific type of contextual measurement in which certainty is restored with a non-contextual purely abstract mathematical definition which lacks any clear operational content. As we shall see in the following section, this tension —threatening inconsistency— found within the definition(s) of ‘pure state’ is not only present at the conceptual level of analysis, it is also —maybe more importantly— also present within its formal definition itself.
4 The Inconsistent Definition(s) of ‘Pure State’
An exposure of the tension found within the already mentioned incompatible reference to the notion of ‘actuality’ is also present in QM within the definition(s) of pure state. Let us take as an example the famous book by Michael Nielsen and Isaac Chuang, Quantum Computation and Quantum Information, where they introduce the notion of pure state in the following orthodox manner:
“A quantum system whose state is known exactly is said to be in a pure state. In this case the density operator is simply . Otherwise, is in a mixed state; it is said to be a mixture of the different pure states in the ensemble for . In the exercises you will be asked to demonstrate a simple criterion for determining whether a state is pure or mixed: a pure state satisfies , while a mixed state satisfies .” [21, p. 100]
The attentive reader might have already observed, there exists in this passage an inconsistent reference to two different definitions of purity. While in the first phrase there is an operational reference in terms of the “exact knowledge” of the state; in the last phrase, the reference is made to the trace-invariance of abstract vectors. The idea that these two definitions are somehow equivalent might be regarded as one of the fundamental widespread confusions present within the contemporary Standard understanding of QM. Let us analyze these two different definitions in some detail.
The operational (contextual) definition of pure state to which Nielsen and Chaung make reference in the first phrase rests on a specific type of measurement called maximal test —which we already mentioned in the introduction. A test is considered maximal in the case we obtain with certainty (probability = 1) the observable in question: if we measure the state (in its correspondent basis) we are certain that we will obtain the measurement result related to this state. Consequently, we obtain “complete” or “exact knowledge” with respect to the future prediction of an outcome. This definition rests on the explicit reference to the preferred basis (or context) in which the vector in Dirac’s notation can be written as a single term, namely, as . We say that the definition is ‘contextual’ because it is explicitly dependent on one particular basis —between the many possible ones. It is only when we arrange the experimental set up in this preferred basis that we will be able to predict a certain (probability = 1) result (see [3, 15, 17, 26]). However, if we change the basis, the state will be written as a superposition of other different states, , and in such case, there will be no certain or complete knowledge of any observable (related to ). We will not be certain of which outcome will be obtained. In other words, the operational purity of a quantum system (i.e., the certain knowledge of a future measurement) can be found in only one particular reference frame (or basis). Exactly the same actualist intuition appears in the case of density operators where the state is pure only if there exists a basis in which the matrix can be diagonalized as:
[TABLE]
This contextual definition has the purpose to secure the existence of one observable which will be certain, and consequently actual, if measured.
Definition 4.1** (Operational Purity)**
Given a quantum system in the state , there exists an experimental situation (or context) in which the test of it will yield with certainty (probability = 1) its related outcome.
In short, as remarked in [4, p. 25]: “The pure states are those for which the outcome is certain, so that one of the [probabilities] is equal to one.” As we discussed in the previous sections, this definition is highly problematic for an understanding which respects the basic democratic standpoint of physics according to which a physical concept must be independent of the choice of a particular reference frame. But since the above definition of operational purity makes explicit reference to a preferred basis, the concept appears as related to only one single reference frame. Or in other words, the operational content of purity (i.e., the certain prediction of a measurement) is evidently non-invariant in the most radical way. Not only its translation to a different reference frame is impossible, its very existence is restricted to only one preferred basis. If we were to apply these ideas in the classical case, a rabbit running through a field would have the property of having velocity (or position) only for an observer in the platform, but not for an observer in the train —or anywhere else. Each reference frame would describe a series of different (inconsistent) states of the rabbit. Thus, the very existence of the rabbit would become untenable. The fact there is no consistent translation of the notion of operational purity, not even in principle, becomes explicit through the derivation of a Corollary to the Kochen-Specker theorem in [7] which shows that there does not exist a binary valuation for the dynamical properties that constitute a physical system. Indeed, we define as Value Invariance of Dynamical Properties that the set of dynamical properties that constitute a physical system must be invariant under transformations of frames or coordinates. We say that the context commutes with the context if for all and . In particular, if is maximal and commutes with , then and any Local Valuation defined over is defined over . We have the following result,
Theorem 4.2
Let be a Local Binary Valuation defined over a maximal context and let be any vector. There exists a rotation of where is defined and there exists a rotation where is not defined. In particular, binary valuations are not preserved under rotations.
Proof: see [7].
The previous theorem implies that under a rotation in the binary valuation is lost. Even though the vector is fixed, the coordinate system is fundamental in order to valuate . We must choose another valuation or else the value of may not be defined. Consequently, we find the following Corollary to Kochen-Specker Theorem.
Corollary 4.3
If the dimension of the Hilbert space is greater that , then the Value Invariance of Dynamical Properties of a vector in is precluded.
This Corollary makes explicit the fact that the operational definition of pure state can only make sense in a single basis, and consequently, cannot be regarded as an invariant concept.
On the contrary, mathematical physicists who care not so much about experimental testing of theories and are much more comfortable with abstract definitions apply to their reasonings a different definition of pure state —one which is reflected in the last part of the just mentioned passage by Nielsen and Chuang. This definition makes reference to a purely abstract mathematical feature of vectors, namely, that when considered as density operators their norm is 1, that is, is a pure state if , or equivalently555A density matrix can be diagonalized, thus giving a set of eigenvalues with . If , then and . Hence, and then and . Conversely, if it has eigenvalues [math] or , but from it follows and . Hence, . when . As remarked by Brian Hall in [16]: “There are several different ways of characterizing the pure states as a subset of the density matrices. First, it is not hard to see that a density matrix is a pure state if and only if . Second, the set of density matrices is a convex set, since if and are non-negative and have trace 1, then so is , for . [T]he pure states are precisely the extreme points of this set. That is, a density matrix is a pure state if and only if it cannot be expressed as where and are distinct density matrices and belongs to .” (see [16, Def. 19.11]). The latter definition of pure state in terms of convex geometry leads to the development of several mathematical theories and applications to information theory, (see [4] for a modern approach) as well as to philosophical debates about QM (e.g., [1]).
The important point is that this abstract mathematical definition makes no reference whatsoever to any specific basis. It is completely independent of reference frames.
Definition 4.4** (Trace-Invariant Purity)**
An abstract unit vector in Hilbert space with no reference to a specific basis666We distinguish here between the purely abstract vector and its specific representation in a basis . As shown in detail in [9, Sect. 4] this distinction becomes explicitly visualizable through the use of graph theory (see figures 1 and 6 of the mentioned reference). which understood as a density operator is a projector, namely, where Tr or .
In this case, unlike operational purity, the notion of pure state is defined in terms of an invariant abstract property characterizing vectors —as well as families of matrices—, namely, the trace. As discussed in [8], in purely abstract mathematical terms we can always talk about a vector without making reference to a basis. Thus, the abstract vector makes reference to the state but also to any rotation, as for example, . Now, even though both can be considered as being the same pure state because they have the same trace, while the state will be certain if measured, the state will be not. If certainty is regarded as a property characterizing the purity of a state, cannot be considered as (operationally) pure. While both states will be trace-invariant pure they will not be operationally pure. Even more importantly, this second definition of pure state in terms of the trace (or rank) has no operational content whatsoever. This latter definition lacks what Einstein saw as a necessary constituent of a physical concept, namely, the possibility of discovering whether or not it is fulfilled in an actual case.
The difference between these two notions of purity become even more explicit once we recognize that there is no equivalence between them. Trace-invariant purity does not imply operational purity. This can be also seen from a mathematical point of view. The following theorem exposes the non-equivalence of these notions.
Theorem 4.5
Let be a density matrix. If is operational pure, then is trace-invariant pure. The converse does not hold.
Proof: While operational purity translates into “, where is a matrix having a at position and [math] elsewhere”, trace-invariant purity is simply . Clearly if , we have , but the converse does not hold. Indeed, if then for any change of basis, the new matrix also satisfies (here, we see that the definition of trace invariant-purity is context-independent). However, among all bases, there exists only one basis formed by eigenvectors such that the new matrix is diagonal and equal to . Hence, the definition of operational purity depends on a preferred basis. In particular, trace invariant purity does not imply operational purity.
In general terms we have that while operational purity implies trace-invariant purity the converse is not true:
[TABLE]
[TABLE]
This operational-metaphysical conundrum might have its root in a confusion present within the specialized literature between mathematical equivalence and physical equivalence. Indeed, as we discussed in section 2, that which is equivalent from a mathematical perspective of analysis is not a necessarily equivalent from a physical standpoint. As explicitly remarked by Einstein, a physical concept does not exist —for the physicist— until she has the possibility of discovering whether or not the concept is fulfilled in an actual case. But this is not necessarily provided by mathematical definitions which —due to their level of abstraction— may or may not make reference to specific experimental situations. In particular, because the trace does not relate to any basis the link with experience is clearly broken. In order to fix ideas, given a pure state written in the -basis as a unit vector , we know that if we measure this state in the same -basis we will obtain with certainty a result related with this state. The probability of observing a measurement related to the state , given we prepare the experimental set up in the -basis, will be equal to unity. Of course, in this last statement it is the conditional which is of outmost importance ‘given we prepare the experimental set up in the -basis’. However, if we consider the same unit vector now measured in the -basis, for which we represent our state as , there will be no certainty with respect to the measurements that could be performed. While the state written in the -basis is not operationally pure, both and have the the same trace and consequently will be trace-invariantly pure. While from the trace perspective both states can be regarded as being the same, from the operational perspective of purity (or certainty) they cannot.
To conclude, the definitions of purity are not equivalent. The operational definition of purity has only meaning in terms of a preferred basis. Regardless of the difficulties to argue in favor of such basis dependent definition, the mathematical trace definition of pure state is independent of any basis but has no operational counterpart. Or in other words, it is mathematically invariant but not operationally invariant. Thus, the inconsistency becomes evident. Depending on the choice of the definition, the concept of purity is either invariant or non-invariant, operational or purely abstract. This situation is obviously problematic for any physical theory that would attempt to provide a consistent meaning to the concepts it talks about. Against the primacy of preferred basis and pure states over matrices the authors of this paper believe there is a way out of this conundrum by returning to the democratic principle of modern physics according to which all reference frames and states must be considered on equal footing. A first step in this direction has been already given in [8, 9] where the definition of state presents an equivalence between all matrices independently of the choice of any basis or their rank. We leave this specific discussion for a future work.
Acknowledgements
We want to thank the insightful suggestions of three anonymous referees which allowed us to make the main point of the text substantially more clear. This work was partially supported by the following grants: FWO project G.0405.08 and FWO-research community W0.030.06. CONICET RES. 4541-12, the Project UNAJ 80020170100058UJ and PIO-CONICET-UNAJ 15520150100008CO “Quantum Superpositions in Quantum Information Processing”. The authors state that there is no conflict of interest.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Aerts, D. & \& Sassoli di Bianchi, M., 2014, “The extended Bloch representation of quantum mechanics and the hidden-measurement solution to the measurement problem”, Annals of Physics , 351 , 975-1025.
- 2[2] Aerts, D. & \& Sassoli di Bianchi, M., 2017, “Do Spins Have Directions?”, Soft Computing , 21 , 1483-1504.
- 3[3] Beltrametti, E.G. & \& Cassinelli, G., 1981, The Logic of Quantum Mechanics , Addison-Wesley, Massachusetts.
- 4[4] Bengtsson I. & \& Zyczkowski, K., 2006, Geometry of Quantum States: An Introduction to Quantum Entanglement , Cambridge University Press, Cambridge.
- 5[5] Bohr, N., 1935, “Can Quantum Mechanical Description of Physical Reality be Considered Complete?”, Physical Review , 48 , 696-702.
- 6[6] Cartwright, N., 1972, “A Dilemma for the Traditional Interpretation of Quantum Mixtures”, Proceedings of the Biennial Meeting of the Philosophy of Science Association , Vol. 1972, pp. 251-258.
- 7[7] de Ronde, C. & \& Massri, C., 2017, “Kochen-Specker Theorem, Physical Invariance and Quantum Individuality”, Cadernos de História e Filosofia da Ciência , 2 , 107-130.
- 8[8] de Ronde, C. & \& Massri, C., 2018, “The Logos Categorical Approach to Quantum Mechanics: I. Kochen-Specker Contextuality and Global Intensive Valuations.”, International Journal of Theoretical Physics , DOI: 10.1007/s 10773-018-3914-0.
