Quantum dynamics is linear because quantum states are epistemic
Jacques L. Pienaar

TL;DR
This paper argues that quantum dynamics are linear because quantum states are epistemic, and non-linear theories violate a principle called Local Definiteness, which is naturally satisfied by epistemic interpretations.
Contribution
It establishes a fundamental link between the epistemic nature of quantum states and the linearity of quantum dynamics, providing a new principle to distinguish interpretations.
Findings
Non-linear modifications of quantum theory violate Local Definiteness.
Epistemic interpretations of quantum states satisfy Local Definiteness.
Experiments' failure to detect non-linearity may imply quantum states are epistemic.
Abstract
According to quantum theory, a scientist in a sealed laboratory cannot tell whether they are inside a superposition or not. Consequently, so long as they remain isolated, they can assume without inconsistency that their measurements result in definite outcomes. We elevate this to the status of a general principle, which we call Local Definiteness. We apply this principle in the context of modifications of quantum theory that allow the dynamics to be non-linear. We prove that any such theory satisfies Local Definiteness if and only if its dynamics is linear. We further note that any interpretation that takes quantum states to be epistemic necessarily satisfies the principle, whereas interpretations that take quantum states to be ontic do not satisfy it, unless they make additional assumptions that amount to presupposing linearity of the dynamics. Therefore the reason why experiments to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsQuantum Mechanics and Applications · Philosophy and History of Science
Quantum dynamics is linear because quantum states are epistemic
Jacques L. Pienaar
QBism Group, University of Massachusetts Boston, 100 Morrissey Boulevard, Boston MA 02125, USA
Abstract
According to quantum theory, a scientist in a sealed laboratory cannot tell whether they are inside a superposition or not. Consequently, so long as they remain isolated, they can assume without inconsistency that their measurements result in definite outcomes. We elevate this to the status of a general principle, which we call Local Definiteness. We apply this principle in the context of modifications of quantum theory that allow the dynamics to be non-linear. We prove that any such theory satisfies Local Definiteness if and only if its dynamics is linear. We further note that any interpretation that takes quantum states to be epistemic necessarily satisfies the principle, whereas interpretations that take quantum states to be ontic do not satisfy it, unless they make additional assumptions that amount to presupposing linearity of the dynamics. Therefore the reason why experiments to date have not found evidence of non-linear dynamics might simply be that quantum states are epistemic.
I Introduction
Quantum theory involves two notions of linearity: a kinematical one and a dynamical one. Kinematical linearity refers to the fact that the pure states of the theory can be represented (up to an arbitrary choice of phase) by a vector in a Hilbert space, such that for any set of valid physical states one can construct another valid state by making a coherent superposition:
[TABLE]
where the complex coefficients may be chosen arbitrarily up to the constraint that the resulting state be normalized111In the special case where the are orthonormal, this reduces to the requirement that ., . By comparison, what we shall call dynamical linearity consists in the fact that the evolution of an isolated pure state over some time interval is represented by a unitary map which acts linearly on the states:
[TABLE]
Consequently, if we are given a set of solutions to the dynamical equations as represented by the map , we can always construct another solution to the same dynamics by superposing the known solutions as per the final line of Eq. (2) (this is what textbooks usually call “the superposition principle”).
These two notions are conceptually distinct. Kinematical linearity is essentially a constraint on the set of valid physical states: it says that if certain vectors represent valid physical states, then there must be a valid physical state represented by the superposition of these vectors. By contrast, dynamical linearity represents a constraint on the dynamics of the theory.
Indeed, the two notions are independent: either one can be false while the other remains true. For so long as we accept the conceptual distinction between kinematics and dynamics, it is at least logically possible that future experiments might reveal deviations from quantum dynamics – such as small non-linear corrections to the dynamical equations – without however affecting the kinematics of the theory. More specifically, we can consistently suppose that all coherent superpositions of physical states yield physical states, while at the same time allowing the dynamics to violate Eq. (2), so that taking a superposition over the inputs to a dynamical process is not equivalent to taking the same superposition over the outputs of the process.
This observation has inspired numerous theoretical models describing possible deviations from dynamical linearity222See for example Refs [1, 2, 3, 4, 5] and citations therein.. It is noteworthy that experiments to test these proposals have so far not found any evidence of such effects. This is striking, considering that all previous paradigmatic examples of physical theories – such as fluid mechanics, electrodynamics and gravity – are governed by non-linear dynamical equations. Why should quantum theory be different?
The linearity of quantum dynamics has an interesting consequence: it ensures that a scientist enclosed within an isolated laboratory cannot tell by any local experiment (i.e. performed within the laboratory) whether or not the whole laboratory and its contents including the scientist themselves are ‘inside a superposition’. Though rarely remarked upon, this curious fact dates back as far as Hugh Everett III’s 1957 article on the ‘relative state interpretation’, in which he argued that, due to the linearity of the quantum wave equation, an observer who enters a superposition state should not “feel themselves split” [6, 7].
One can also establish the same result by proving the contra-positive: if it were possible for the encapsulated observer to determine by some local process whether or not the entire laboratory including themselves had entered a superposition state, this implies they would have the ability to perfectly distinguish between the total superposition state and a state representing just one of its components. It then suffices to note that since these states are non-orthogonal, they cannot be perfectly distinguished by any linear process.
In this paper we aim to establish the implication in the opposite direction: we aim to prove that if the laws of physics forbid us from determining by local experiments that we are inside a superposition, then the dynamical laws must be linear. We will then argue that this result has consequences for the interpretation of quantum states.
The logical structure of our argument runs as follows: we consider the following empirical principle that we call Local Definiteness (LD): a scientist confined to an enclosed laboratory cannot determine by any local experiment whether the laboratory and all its contents are in a definite effectively classical state, or in a superposition of such states. We then ask why it should hold true. The standard justification, as we have seen, assumes that quantum dynamics is linear and derives LD as a consequence. The problem with this strategy is that it leaves no room for contemplating more general theories in which the dynamics might not be linear.
We shall therefore introduce a more general framework of probabilistic theories in which the dynamics need not be linear, for which LD may not necessarily hold. Our main result is to prove that, within this more general framework, a theory satisfies LD only if its dynamics is linear.
This result has implications for the foundational question of whether quantum states are epistemic (representing knowledge or beliefs) or ontic (representing states of affairs in reality). As we will see, epistemic interpretations of quantum states imply that LD holds; conversely ontic interpretations are not so constrained and can allow non-linear dynamics. Our theorem therefore implies that dynamics is necessarily linear if one regards quantum states as epistemic. We argue that this makes epistemic interpretations superior to ontic interpretations at explaining why quantum dynamics is linear.
The paper is structured as follows. In Sec II we prove that if we assume only the kinematical structure of quantum theory, then the principle of Local Definiteness is enough to imply the linearity of quantum dynamics. In Sec III we show that a stronger result holds: given a probabilistic theory whose kinematics is not necessarily equivalent to that of quantum theory, Local Definiteness implies that the dynamics must act linearly on convex mixtures of probabilistic states. In Sec IV we argue that, by application of Leibniz’ principle, the absence of experimental evidence for non-linear quantum dynamics is best explained by adopting an epistemic interpretation of quantum states.
II A primer: quantum theory with non-linear dynamics
As a warm-up exercise, let us assume that the kinematics of our theory is equivalent to that of quantum theory: we associate a unique density operator to every physical state, and a POVM to every measurement, where runs over the outcomes and . Pure states are rank-1 projectors and the kinematical superposition principle holds, ie. if represent valid states (up to an overall phase), then any superposition of them is also a valid state, as per (1). Probabilities are calculated using the Born rule,
[TABLE]
The only difference to standard quantum theory is that we shall permit dynamical transformations of states to be non-linear in general:
**Definition 1. ** Consider a transformation that maps density matrices to density matrices. We say that is convex-linear on density matrices iff
[TABLE]
for arbitrary convex coefficients , . Otherwise we say is non-convex-linear.
Consider now a system sealed inside a laboratory with a scientist, modeled as another system . For simplicity we assume that all other parts and contents of the lab can be either ignored or subsumed into one of these two systems. Suppose the scientist performs operations that – if we were to perform them ourselves – we would regard as constituting a measurement of a POVM on system . A question arises: given that it is not us but performing these operations on , what state should we assign the total system from our standpoint outside the lab?
This is precisely the scenario considered by Wigner in his famous thought experiment, and the reason it has caused such controversy is that the answer hinges on the contentious issue of precisely when a measurement should be judged to have occurred. Does ’s activity inside the isolated lab also collapse the state for us on the outside? Or should we model it as a coherent entangled superposition? We will not be concerned here with trying to settle this debate, but shall focus on a much simpler question: can the scientist inside the lab tell the difference between the two options, without help from the outside?
First note that if their measurement has indeed ‘collapsed’ the state, then we on the outside may reasonably assign the total system (representing the lab and its contents) a state of the form , where may be interpreted as the state assigned to just in case we are sure that they would report having observed outcome , if interrogated333All that matters for our purposes is that we have some way of checking what observed. This might entail just asking what they observed and interpreting their response; or it might require more complicated actions involving truth serum or neurosurgery; these details are unimportant to our argument.. But since we on the outside do not know which outcome was actually obtained, we must assign some probabilities representing our uncertainty about the value of the outcome. Accordingly, we assign not a single state to the system but an epistemic ensemble of states, represented as444A more thorough treatment of ensemble notation will be given in Sec III.
[TABLE]
By definition the statistics of such an epistemic ensemble for an arbitrary POVM are calculated as:
[TABLE]
Note that in standard quantum theory, an arbitrary ensemble can be represented by a unique density operator via the mapping
[TABLE]
since the latter evidently produces the same statistics for any measurement (due to the linearity of the trace). For this reason, mixed density operators are often referred to as ‘ensembles’.
However, since we shall be going beyond standard quantum theory by contemplating non-convex-linear dynamics , we must be careful to make a formal distinction between purely epistemic combinations of states (also known as ‘proper mixtures’), versus ‘improper mixed states’, which represent uncertainty that is (possibly) attributable to the system itself independently of what anybody knows about it555The conceptual distinction between proper and improper mixtures was emphasized by d’Espagnat [8]; see also Timpson & Brown [9]..
The reason why we need to make this formal distinction becomes clear when we consider how a non-linear transformation acts upon a given state. On one hand, intuitively should act on an ensemble by acting individually on each of its components666See the discussion of ‘quasi-linear dynamics’ in Appendix B for more explanation of why we regard this rule as ‘intuitive’.:
[TABLE]
If on the other hand we were to associate the ensemble with a density matrix via the usual map (7), we would find that in general. In other words, if we try to compute the time evolution of by computing , we do not get the correct answer, namely, we do not obtain a density matrix which is statistically equivalent to the time-evolved ensemble . To be precise:
Proposition 1: The map satisfies for all ensembles iff is convex-linear on density operators per Definition II. (Proof: just let and observe that the LHS of Eq (4) is and the RHS is ).
Therefore, to avoid ambiguity, throughout this article we use the term ‘ensemble’ exclusively for epistemic ensembles as in Eq (5), and we use density operators exclusively for improper mixed states, whose interpretation is left unspecified.
Turning now to the second option, if we think that ’s measurement has not really collapsed the state but has only caused to become entangled to system , it would be reasonable to assign the total system a pure state of the form where:
[TABLE]
and where the complex amplitudes yield the expected probabilities, .
The ensemble and superposition imply different statistics in general, so we can in principle differentiate them by doing appropriate measurements on the total system. But can the scientist inside the lab distinguish them? To answer this, we shall make the following assumption:
Assumption: Any and all operations carried out by inside the laboratory can be modeled by us on the outside as some combination of a transformation followed by a measurement on system .
We justify this assumption as follows. Intuitively, the fact that is ‘sealed inside a laboratory’ means that the set of operations they can carry out is limited, compared to what we could do (in principle) from our standpoint outside the lab777As a trivial example, any procedure that would involve immobilizing or otherwise totally incapacitating for the duration is obviously beyond their own ability to perform. The best they could do is delegate the task to an external person or mechanism, which is precisely the role played by ‘us’ outside the laboratory.. In order to represent this limitation formally, we propose that the operations achievable by inside the lab must be modeled as a strict subset of the operations that we can perform from the outside. We can then conventionally define the subsystem as comprising those degrees of freedom which are targeted by this subset of operations, and as comprising the (assumed non-trivial) remainder.
We emphasize that this is an extremely weak assumption, not to be confused with a significantly stronger assumption sometimes made in the literature, which says that a scientist can only apply a physical theory to systems external to themselves. Indeed, at no point in our argument do we attempt to adopt the point of view of , nor do we make any assumptions about how they should model what happens to them. In particular our analysis is consistent with proposals where assigns a superposition state to themselves in order to make predictions, such as Ref [10].
Continuing with the thought experiment, let us therefore suppose there exist some operations achievable by that would reveal to us whether the total state is the ensemble or the superposition . By the above assumption, from the outside we should be able to model these operations as a special transformation on followed by a special POVM , such that the ensemble and superposition states imply different probabilities for the outcomes of (for if the statistics did not differ between the two cases, then would be powerless to reveal any difference between them).
First consider the special case where is trivial, and ask whether one could succeed at the task just by performing some measurement on . It is easy to see that this will not work, because the reduced density matrix associated to is the same in either case. For if the total state is then the statistics are determined from the reduced density matrix:
[TABLE]
On the other hand if the total system is the ensemble then system alone is assigned the ensemble , and since , this yields the same statistics for all measurements.
The above argument implies that any protocol which would allow to succeed at their task must involve a non-trivial transformation . Moreover, success requires that , since otherwise the reduced states of would still remain indistinguishable. In fact, since , the requirement can be written as ; it then follows from Proposition II that must be non-linear.
We conclude that it is impossible for the scientist in the closed laboratory to determine whether or not they are inside a superposition (that is, Local Definiteness holds) iff the allowed transformations are convex-linear on density matrices888On the face of it, this might appear to contradict another result in the literature: the demonstration of ‘quasi-linear’ transformations, which preserve the convex structure of the space of density matrices but are not linear [11, 12]. We explain in Appendix B how such transformations are compatible with our present results..
For completeness, we would like to show that our general definition of convex-linearity encompasses unitary dynamics for pure states. Fortunately this work has already been done: in 2009 T.F. Jordan pointed out that if we assume999Jordan actually did not think any extra assumption was needed, for he assumed that invertibility of the map followed from its being linear and injective. The error was later pointed out by Hejlesen [3]. the convex-linear map is invertible, then it must transform pure states into pure states and preserve their Hilbert-Schmidt inner products. One can then use Bargmann’s refinement of Wigner’s theorem [13, 14] to prove that must be either unitary or anti-unitary, and in particular it must have the form
[TABLE]
where acts linearly on the pure states as per (2).
Straightforward as it is, the argument of this section is still too weak: a critic could argue that by assuming the kinematical structure of quantum theory we are ‘sneaking linearity in by the back door’. After all, maybe the linearity of quantum state space alone is doing all the heavy lifting in the foregoing argument. To close this loophole, in the next section we extend the argument to a general class of probabilistic theories, making minimal assumptions about their kinematics and dynamics. This will allow us to subsequently establish a firm link between the principle of Local Definiteness and dynamical linearity.
III General version of the argument
III.1 Preliminary definitions
III.1.1 Operational theories
To go beyond quantum theory while making the fewest assumptions possible, we appeal to the concept of an operational theory, first introduced in Refs. [15, 16]. Briefly, the central object of an operational theory is an abstract experiment consisting in a sequence of operations, specifically a preparation , a transformation , and a measurement , culminating in the realization of some outcome from a discrete finite set of possible values . The operations are to be thought of as descriptions of certain procedures to be carried out by the scientist in a suitably equipped laboratory. For every experiment on a given system, defined by some triple , the operational theory assigns probabilities . In general, a list of laboratory procedures contains much more information than is relevant to the statistical predictions of the operational theory. For example, given some quantum state , there will in general be many ways to prepare this same state, depending on the particular laboratory implementation. We are therefore motivated to define a state in the operational theory as an equivalence class of preparation procedures, all of which give identical predictions in the operational theory. Formally, if two preparation procedures produce the same probabilities for all combinations of transformations and measurements, ie. if
[TABLE]
then we shall use a single symbol to represent both procedures, referring to as the state assigned to the system. By this language we do not mean to imply that it represents some state of affairs in reality, but nor do we exclude such an interpretation.
Indeed, throughout this article we shall remain noncommittal about the precise meaning of the symbols . One could interpret them as ontic states that describe or represent states of affairs in reality; or one could treat them instrumentally as mere symbols that allow us to make predictions via the operational theory, and nothing more. None of the results we shall prove in this article will depend on which interpretation one adopts.
It is common in the literature to augment operational theories with an additional compositional structure that tells us how to compose smaller experiments into larger ones and vice-versa. We shall not need to appeal to any particular compositional framework; however we shall require at least a concept of parallel composition of processes, which we now describe. Given any experiment resulting in outcome , and resulting in outcome , defined on subsystems 1 and 2 respectively, the procedure in which both procedures are carried out in parallel is represented by , and the joint probabilities for outcomes , are given by the product:
[TABLE]
Finally, it is useful to separate kinematics from dynamics, using the convention that one can simply drop the ‘’ whenever the transformation is trivial; thus the pair indicates that a system assigned state is immediately measured by without being transformed (say, by evolving over time) in between. Statements referring to such state-measurement pairs can then be thought of as capturing the kinematical content of the theory, whereas statements concerning the possible non-trivial transformations constitute the dynamical content of the theory.
III.1.2 Epistemic layer
An important innovation we shall make, which goes beyond previous literature, is to introduce what we call an epistemic layer on top of the operational theory. Intuitively, as we have said, the operational theory is ambiguous as to whether the states have an ontological significance. Nevertheless, there are always situations in which our uncertainty is unambiguously epistemic. To give an example, suppose we are certain that an experimenter has implemented one of two preparation procedures which are operationally distinct, hence represented by different states and ; but suppose that we simply do not know which one was implemented (due perhaps to lack of communication from them, loss of records, amnesia, or anything else of that kind). Such situations inevitably arise in practice, and it can be useful to deal with them by including this additional strictly epistemic uncertainty in our formal reasoning.
The method usually adopted in the literature for incorporating epistemic uncertainty is simply to ‘absorb’ it into the definition of preparations, by treating a preparation procedure about which we have epistemic doubt as being itself a new preparation procedure. For instance, if I assign epistemic probability to the proposition that state was prepared, and that was prepared, then I could represent this formally by defining a new preparation procedure associated to a state , such that
[TABLE]
Many authors find this technique convenient because it forces the state space to be convex, which can be very useful for proving theorems. However, this strategy can be misleading if one does not use it carefully101010A recent exchange in the literature provides an excellent case in point [17, 18, 19]. Responding to a criticism, the authors of Ref. [17] clarify that the states in the operational theory they consider are ‘agnostic’ about whether the associated preparation procedure includes uncertainty of a purely epistemic character. While there may be valid reasons for doing this depending on one’s aims, we agree with Ref. [18] that this is a non-trivial restriction on the operational theories, and should be stated explicitly as an assumption., because it formally conflates uncertainty that is obviously epistemic with uncertainty that might have a non-epistemic physical origin. This can be a problem if one’s goal is to contemplate possible non-linear dynamics as is the case here (recall the discussion in Sec II); it also makes it difficult to compare different interpretations of the state, as we aim to do in Sec IV.
To see why, note that since epistemic uncertainty concerns only the knowledge or beliefs of the experimenters, it is always presumed that we can eliminate it (at least in principle) by adopting suitably rigorous experimental protocols, such as keeping meticulous records, establishing trust and communication among the scientists, etc. Therefore if we wish to investigate whether some uncertainty implied by might have an ontic rather than epistemic origin, we should define the states only with respect to those preparation procedures in which every effort has been made to eliminate epistemic uncertainty. If some residual uncertainty remains after this process, then – and only then – can we reasonably entertain the hypothesis that it might have an ontic explanation. Note that if we adopt this rule, then the mathematical space of states can indeed be non-convex.
III.1.3 Ensemble formalism
In order to maintain a strict separation between uncertainty that is clearly epistemic (ie. which we know how to eliminate) and that which is not, we cannot simply absorb the former kind of uncertainty into the preparation procedures. Instead, we shall formally represent such uncertainty using an ensemble, which is a set where the index runs over a finite set of integers, . An ensemble refers to a situation of strictly epistemic uncertainty, in which we are sure that the correct state assignment is in the set but we do not know which, and the represents our probabilities for each to be correct. We use the convention that a trivial ensemble consisting of only one possible state is simply referred to as a ‘state’, and the term ‘ensemble’ is reserved exclusively for cases where there is more than one possible state assignment.
An ensemble of ensembles is again an ensemble. Formally, given ensembles and , where the domains of indices and can in general overlap (ie. a state is common to both ensembles iff ), we might be ‘uncertain about our uncertainty’ and assign probability that the ensemble is and that the ensemble is . Intuitively, this should be equivalent to an ensemble that merely represents uncertainty over the union of the states in the two sub-ensembles, , represented as , where the probability of is the sum of the probabilities for in each ensemble, weighted by the probabilities of the relevant ensemble, that is,
[TABLE]
Note that if we treat a state as equivalent to an ensemble of one element, , this means that repeated elements in an ensemble can be formally combined into a single element, and in particular, , which agrees with intuition.
Finally, to compute the probabilities of outcomes for some measurement performed on an ensemble , we must compute the probabilities for the individual cases and then take the sum of these weighted by their respective epistemic likelihoods:
[TABLE]
III.1.4 Kinematical equivalence classes
Consider two states and that yield the same probabilities for all measurements:
[TABLE]
In this case, we shall say that and belong to the same statistical equivalence class, denoted . We emphasize that this equivalence class is defined only kinematically, for as we will see later, some theories might permit transformations that map and to statistically non-equivalent final states.
More generally, two ensembles that produce the same statistics for all measurements are also said to belong to the same equivalence class . That is, for any pair of ensembles in the same equivalence class we have:
[TABLE]
Since the probabilities are the same for any , we shall sometimes refer to them using the notation to emphasize that the probabilities only depend on the equivalence class and not on the particular ensemble that generates them.
It is important to note that individual states (i.e. which are not ensembles) can in general be statistically equivalent to ensembles. For instance, we can define a state such that it satisfies
[TABLE]
for some equivalence class . Note that although the state produces the same statistics as any ensemble , they are conceptually distinct in two essential aspects, as we now explain.
Firstly, the state – like any state – has a flexible interpretation: it may be taken as an ontic state (describing a real state of affairs) or as an epistemic state (representing one’s knowledge or beliefs). By contrast, as we explained earlier, the probabilities in an ensemble are always epistemic by definition: they represent our uncertainty about which of the should be applied to the given situation.
Secondly, we are assuming that the theory imposes no restrictions on the possible ensembles, thus, any logically possible ensemble constructed by taking an arbitrary probabilistic mixture of states is assumed to be a valid ensemble in the theory. By contrast, not every logically possible state is considered to be allowed by the theory. In particular, there might exist ensembles whose statistics cannot be reproduced by any single state, i.e. while it is always logically possible to define the state in the equivalence class , a given theory may exclude it from the set of valid states.
Finally, it will be useful to define an informationally-complete measurement, denoted by the special symbol with associated outcomes , as a measurement whose probabilities when measured on any state are sufficient to determine the probabilities that one would obtain by performing any other measurement on instead. Formally, two states and yield the same probabilities for the measurement (that is, ) if and only if they belong to the same equivalence class, that is:
[TABLE]
Such measurements therefore provide a useful way of indicating that two states belong to the same equivalence class, without needing to explicitly contemplate all possible measurements. In this work we restrict attention to operational theories which have at least one informationally complete measurement that can be performed on every system within the theory.
III.1.5 Mixed states and superpositions
Given the aforementioned differences between ensembles and states that produce equivalent statistics, we shall adapt some familiar terminology to distinguish them: the former will be called proper mixtures and the latter will be called improper mixed states. Conversely, any state which does not produce statistics equivalent to some ensemble will be called a pure state111111Equivalently, a state is called pure iff there is some measurement that ‘tests’ for it, that is, a measurement that predicts a certain outcome with certainty only if the system is assigned that particular state.. Our usage of these terms follows closely their conventional usage in the special case of quantum theory121212There is a subtle difference: in some presentations one assumes that isolated systems cannot evolve dynamically from pure states into improper mixed states, in which case an ‘improper mixed state’ always refers to a ‘reduced’ state of a system that is correlated with some larger environment. Our more general framework does not impose this restriction; it accommodates improper mixed states not correlated to any environment..
Besides mixed states, we would also like to define a ‘superposition state’ in our general operational framework. Note that the state of ‘being in superposition’ is typically understood in contradistinction to some concept of ‘classical’ states, these representing a special subset of states which are by definition not superpositions. It is characteristic of such states that they represent situations in which a special subset of observable physical quantities can be assigned definite values – these quantities being the ones that describe the ‘classical limit’ of the theory. We shall presume that the same holds true in any of the generalized theories that we consider: given any system, we assume there is a special set of states that play the role of the ‘classical states’ of the theory. Since these are supposed to represent states in which a certain set of quantities (the classical observables) take definite mutually distinct values , we shall assume there exists some measurement that is capable of mutually perfectly distinguishing among any subset from this set of states. This inspires the following definition:
**Definition 2. ** Let be a state of a system – not an ensemble – which has the following property: if subjected to the measurement (i.e. the measurement that distinguishes among the classical states of the system) the probabilities to obtain outcomes are non-trivial, that is, two or more values of are assigned non-negligible probabilities. Then we call such a state a quasi-superposition.
The intuition behind this definition is the idea that the essence of a ‘superposition’ lies in the combination of two factors (i) it does not yield definite values for ‘classical’ variables and (ii) it is not an ensemble. The latter point means that, if we adopt an ontic interpretation of states, then the indefiniteness of classical values represented by a quasi-superposition state must be due to a metaphysical indeterminacy at the ontological level: it cannot be explained away as representing merely our lack of knowledge of the true determinate values.
Note that by this definition, a quasi-superposition state need not be a pure state – hence the prefix ‘quasi’. Thus, even a theory that is operationally equivalent to classical theory can admit a notion of ‘quasi-superposition’, in the form of some notion of ‘irreducible randomness’, that is, in which the ontological states of the theory include ones that have indeterminate classical values.
III.2 Main argument
We contemplate the following scenario: a system labelled is sealed inside a laboratory with a scientist, whom we model as another system . The whole laboratory and its contents (including the scientist), hereafter called the total system, will be labelled . The task of the scientist will be to determine – purely by operations performed inside the lab – whether or not the state of the total system is in a ‘superposition’ state or not.
We begin by noting that if were able to succeed in their task by performing some ‘mystery operations’ inside the lab, then it ought to be possible for a scientist situated outside the lab to model the whole process in terms of states, transformations and measurements performed on the total system ; for if that were not possible, we could not say that the operations carried out by were compatible with our physical theory. We shall therefore adopt the perspective of a scientist outside the lab for the duration of the analysis.
From this standpoint, if we judge that a measurement has been done on system and some outcome obtained, then we assign this system a state . Similarly, there is a special measurement we can do upon the scientist , resulting in some outcome , which we interpret as ‘interrogating about what they observed’; thus has the same range as . If we perform and obtain , we shall accordingly assign the scientist the state , which we interpret as implying that they would report having observed outcome if interrogated again. Thus if the state of is and we measure , we must be certain to obtain :
[TABLE]
Furthermore, this interpretation suggests that if we were to measure both on and on then we should expect to obtain the same outcomes for both (i.e. ). We shall therefore assume that this is so.
We can now ask what state we should assign to the total system, consistent with the preceding conditions. The answer depends on whether our uncertainty about prior to measuring should be modeled as epistemic ignorance, by assigning an ensemble, or whether it should be modeled as (possibly) having an ontological origin, as represented by a quasi-superposition. We consider each option in turn.
According to option ‘Ensemble’, operations which constitute a measurement for also constitute a measurement for everyone outside the lab; therefore the fact that the lab is sealed and isolated is irrelevant except insofar as it prevents an outsider from knowing which value of was actually obtained. Consequently in this case we should model the total system as being in an ensemble:
[TABLE]
Alternatively, according to option ‘Superposition’, the operations performed by do not constitute a measurement for us on the outside, but rather a transformation of the total system which leads us to assign it some final joint state .
Note that perfectly discriminates between a set of states , which we interpret as the scientist having observed different values of the variable – it is therefore natural to consider these states a subset of the ‘classical’ states of our theory. Moreover, applying to does not yield a definite value of – hence this state is a quasi-superposition by Definition III.1.5.
We can now pose the key question: is there any experiment that could do on which would allow them to determine whether the total system is assigned the quasi-superposition state or the ensemble ?
If indeed has the power to perform such an experiment on , we outside the lab should be able to model this process explicitly as a transformation applied to system , followed by a measurement , whose outcome statistics are different depending on whether the system is in a superposition or not. Conversely, if no such , exist, then cannot succeed at their task.
We now prove that if is trivial, no exists that would enable to succeed at their task.
**Theorem 1. ** In the scenario described, there is no measurement on whose statistics would allow us to differentiate between and .
Proof:
First note that if the total system is assigned the ensemble as defined in Eq. (22), then taken alone is assigned the state with probability , that is, we assign it the ‘reduced ensemble’ defined as:
[TABLE]
Alternatively, if the total system is assigned the state then in order to know what to assign to alone, we require some rule that tells us how to obtain its state from the total state . In quantum theory this would be the partial trace, but in general it could be something else. Fortunately, the details of this rule turn out not to matter: it is sufficient for our purposes to note that since the total state is not an ensemble, it fully determines the reduced state of system , which therefore also is not an ensemble; hence we can denote it .
Our result can now be proven if we can show that no measurement on can distinguish the reduced state from the ensemble . In order to prove this, we contemplate doing a pair of measurements and on systems and respectively, where is the informationally-complete measurement (as defined in Sec III.1) with outcomes , and corresponds to “checking which outcome was observed by ” as defined earlier, with outcomes . We now compute the marginal probabilities to obtain for each of the two cases Ensemble and Superposition and prove that they are the same.
For Ensemble, we can obtain the probabilities of immediately, since:
[TABLE]
For Superposition, the systems are assigned the joint state . We first compute the joint probabilities for both measurements , , and subsequently sum over the to find the marginal probabilities for . The joint probabilities are:
[TABLE]
where the first line follows from the identity , the second line follows from the definition of , and we used (III.1.1) to get the third line. Using this result, we sum over the to obtain the marginal probabilities for :
[TABLE]
Comparing the results (24) and (26) and noting that by definition has the same domain as , we see that the reduced state produces the same probabilities for the measurement as does the reduced ensemble ; but since is informationally-complete, Eq. (20) implies that:
[TABLE]
that is, they belong to the same equivalence class and no measurement can distinguish them.
The preceding Theorem has the following implication: from a purely kinematical point of view, if is restricted to just performing operations that we model as measurements on , they cannot tell whether the total system is in the state or the ensemble . As we saw, this stems from the fact that the improper mixture belongs to the same equivalence class as the proper mixture .
We next consider the key question: is there some ‘mystery transformation’ that would enable us to statistically distinguish between and , thereby allowing to succeed at their task? We will prove that the answer is ‘no’ iff acts linearly on convex mixtures, as formalized by the following definition:
**Definition 3. ** Consider a transformation that maps the set of physical states to itself131313One might ask why we don’t consider more general transformations that map physical states to ensembles of physical states; we address this in Appendix A.. Given any state , let be the vector of probabilities . Then the map induces a corresponding map , where
[TABLE]
The map is called convex-linear if it acts linearly on arbitrary convex mixtures of , that is if
[TABLE]
with and .
Armed with this definition, we now present our main theorem:
**Theorem 2. ** A transformation preserves the statistical equivalence of any state and ensemble that belong to the same equivalence class iff is convex-linear as defined above.
Proof: As usual, since an ensemble represents epistemic ignorance, we assume acts on it by acting individually on each of its components:
[TABLE]
Let be a state that belongs to the same equivalence class of , and let (resp. ) be the vectors of probabilities generated by (resp. ) for the outcomes of the informationally-complete measurement . Since they belong to the same equivalence class, we must have . This implies that can (like ) be decomposed as:
[TABLE]
Using (30), if one performs on the final ensemble , one obtains probabilities
[TABLE]
Note that maps and to the same equivalence class iff . Comparing (32) to (31) we see that this occurs iff
[TABLE]
or in other words, preserves the equivalence class structure (hence the statistical equivalence of and ) iff is convex-linear141414At face value, this theorem might seem to contradict recent work on ‘quasi-linear dynamics’ [11, 12], which describes a class of non-linear dynamical maps that are insensitive to different decompositions of the density matrix. In Appendix B we explain how quasi-linear dynamics are consistent with our result..
Putting together Theorems III.2 & III.2 we conclude that, in the general scenario described, it is impossible for the scientist in the closed laboratory to determine whether they are inside a superposition (that is, Local Definiteness holds) iff the transformations of the theory are convex-linear in the sense of Definition III.2.
There is one minor step needed to connect this general result to the quantum superposition principle, namely, we need to prove that when we restrict the kinematics of the theory to be that of quantum theory, then convex-linearity as defined in Definition III.2 reduces to convex-linearity on density matrices as per Eq. (4).
Proposition 2: If we take the state space of our theory to be quantum, ie. we identify the valid states with the set of density operators acting on a Hilbert space, and we identify valid measurements with POVMs defined on this space, then the transformations are convex-linear only if they act linearly on density matrices.
Proof: Let the informationally-complete measurement be associated to a POVM ; then every density operator is mapped to a unique set of probabilities via:
[TABLE]
Given a transformation defined on the density operators, the correspondence (34) induces a corresponding map on their probability representations via
[TABLE]
From this definition it follows that, due to the linearity of the trace, if does not act convex-linearly on the density operators it also cannot act convex-linearly on their probability representations. Equivalently, if is convex-linear on the probabilities as per Definition III.2, then it must be convex-linear on the density operators as per Definition II.
IV Discussion: why epistemic?
We began this investigation by asking why no experiments to date have succeeded in detecting any deviation from the linear equations of quantum dynamics. Our foregoing results have proven that the question can be re-framed as follows: why do the laws of physics conform to the principle of Local Definiteness? If we can find a satisfying theoretical reason for why this principle is upheld by our most accurate experiments to date, then this would also explain why we have not observed any evidence of non-linear quantum dynamical effects.
Remarkably, if one adopts an epistemic interpretation of quantum states, then one is automatically committed to the principle of Local Definiteness. This was pointed out by Chris Fuchs during discussions at a recent workshop151515The Wigner’s Friends Theory Workshop, San Francisco, December 2022; supported by Unitary Fund, FQXi, and Convergent Research. devoted to Eugene Wigner’s eponymous thought experiment [20]. Fuchs emphasized that if the quantum state merely represents the knowledge or beliefs of someone outside the laboratory (that is, if the quantum state is interpreted as epistemic) then it is obvious that the friend inside the laboratory should be unable to tell the difference between being ‘in an entangled superposition’ or not. Indeed, experiments within the lab should be powerless to reveal that the lab is in a superposition, just as experiments on a spinning roulette wheel are powerless to reveal whether someone has placed a bet on this particular spin of the wheel161616Actually, the same argument leads to something much stronger than LD: local measurements on a system are unable to distinguish between any pair of alternative states assigned to by another scientist, precisely because these ‘states’ are not properties of the system but rather propositions about that other scientist’s knowledge or beliefs. Local Definiteness is just a special case of this more general feature of epistemic interpretations..
This fact takes on a special significance in light of the foregoing results: for since LD implies the linearity of quantum dynamics, it follows that an epistemic interpretation of quantum states only makes sense if the dynamics is linear. We now argue that this explanation for why quantum dynamics is linear – viz that quantum states are epistemic – is superior to alternative explanations that must be used if quantum states are regarded as ontic. To make our case, we identify two varieties of ontic interpretation, for which we shall make separate arguments in subsections IV.1 and IV.2. In Sec IV.3 we discuss what our results imply for epistemic interpretations.
IV.1 Absolute ontic models
**Definition 4. ** An interpretation is called absolute ontic if every pair of distinct states and of a given system imply two distinct and mutually exclusive ontological states of that system.
This definition encompasses what Harrigan & Spekkens call -complete ontological models, in which there is a one-to-one correspondence between quantum states and ontological states, as well as what they call -supplemented ontological models, where the quantum state is an incomplete specification of the ontological state and hence it is possible for distinct ontological states to be assigned the same quantum state [16].
To see why epistemic interpretations are superior to absolute ontic interpretations, consider Leibniz’ principle as recently elaborated by Spekkens [21]:
“If an ontological theory implies the existence of two scenarios that are empirically indistinguishable in principle but ontologically distinct (where both the indistinguishability and distinctness are evaluated by the lights of the theory in question), then the ontological theory should be rejected and replaced with one relative to which the two scenarios are ontologically identical.”
Recall that the principle of Local Definiteness implies that it is empirically impossible to distinguish the reduced ensemble from the reduced state by any local operations. According to an absolute ontic interpretation, these situations represent two distinct ontological states of : in the former case the state is an ontological state corresponding to one of the (we just don’t know which), whereas in the latter case it is in the ontological state associated to the improper mixed state .
By contrast, on an epistemic interpretation the states and are both only expressions of an external observer’s knowledge or beliefs regarding the system, and so are compatible with the hypothesis that they refer to one and the same ontological state of affairs inside the laboratory.
This is a perfect use-case for Leibniz’ principle: since and are empirically indistinguishable, the principle says we ought to adopt an interpretation of quantum states according to which these states may refer to the same local ontological state of . An epistemic interpretation permits this; an absolute ontic interpretation does not.
One might worry that we have only defeated a straw man, for there may be more sophisticated ways to interpret the quantum state as ontic. We next consider a prominent class of such interpretations.
IV.2 Relational ontic models
**Definition 5. ** An interpretation is called relational ontic if every pair of distinct states and of a given system imply two distinct and mutually exclusive ontological relations between that system and a designated external “reference frame”, denoted .
The motivating thought behind this class of ontic interpretations is that whenever we assign a quantum state to some system in practice, we do so against the backdrop of the physical laboratory in which we happen to be situated, which determines (somehow) the relevant reference frame . On this view, the state assigned to a system includes not only information about the ontic state of the system, but also information about its ontological relation to the implicit reference frame . This means it is possible for a single ontological state of the system to be compatible with two distinct quantum states and , just in case this ontological state supports two distinct ontological relations between itself and the frame .
As an analogy, suppose professor Bertlmann, who prefers to wear mis-matched socks, has some chance of wearing matching socks, say, when he is running late for a lecture and has dressed in a hurry. If on a given day his left sock is red, for example, then let us take the sock as being analogous to a physical system and ‘the sock is red’ as analogous to an ontological fact about that system. This fact alone, however, does not fix the value of the ontological relation between it and the other sock, which under the given conditions may be either ‘matched’ or ‘mis-matched’. This illustrates how, in a relational ontic model, knowing the ontological state of a system does not fix its quantum state uniquely, unless we also specify the ontological relation between the system and the frame in which it is being considered.
It is noteworthy that relational ontic interpretations are not contemplated within the ‘ontological models’ framework of Harrigan & Spekkens. Some prominent interpretations do seem to fall within this category, including Healey’s pragmatist interpretation[22, 23], some readings of Rovelli’s relational interpretation (RQM)171717Rovelli interprets quantum states as equivalent to catalogs of probabilities, however since he remains agnostic about the interpretation of probability, it impossible to say whether he thereby endorses an ontic or epistemic reading of the quantum state [24]; indeed the literature contains examples of both [25, 26]. However, recent developments in RQM [27, 28] have arguably forced it towards an ontic position [29, 30]. Moreover, if we adopt Leifer’s definition of ontic as “ something that objectively exists in the world [] that would still exist if all intelligent beings were suddenly wiped out from the universe”, then the quantum state in RQM certainly qualifies. [31, 32, 25], and perhaps the ‘Contexts, Systems and Modalities’ (CSM) interpretation of Auffèves and Grangier [33, 34].
Unlike absolute ontic interpretations, relational ontic interpretations are not susceptible to the argument from Leibniz’ principle, because they allow two distinct quantum states to be associated with a single ontological state. Thus, the scientist’s inability to distinguish from could possibly be accounted for by the fact that, so long as they are confined inside their laboratory, they do not have access to information about the extrinsic relations between their laboratory and the ‘outer laboratory’ relative to which the state of the total system is defined.
However, there is another principle for evaluating the explanatory power of hypotheses according to which the relational ontic explanation is inferior to the epistemic explanation. Given competing hypotheses and which both purport to explain some empirical phenomenon , we shall judge an explanation superior if it is less able to explain a violation of . The intuition here is that a good explanation of is one that would be falsified (or very nearly so) if it were to transpire that does not occur after all. In the present example, we should ask: if future experiments were to find evidence of non-linear quantum dynamical effects, how easily could this be accounted for by an epistemic versus a relational ontic interpretation?
It is straightforward to see that the relational ontic interpretation would easily survive this hypothetical experimental development; for in any relational theory of mechanics, it is practically a rule that local phenomena of a system at one time depend on the extrinsic relations of that system to other systems at some earlier time. For example, consider the instantaneous relative distance (as defined in some conventionally chosen reference frame) between the Earth and the Sun. While it is true that I cannot determine this distance by any local experiment performed on Earth, I can determine what it was in the past, for instance by measuring the local temperature.
Thus, if non-linear dynamics were someday discovered, allowing us to tell the difference between and by local experiments on in our thought experiment, a relational-ontic interpretation could explain this without too much trouble by saying that the local experiment is merely revealing the difference between two ontic relations, say and respectively, defined between the local laboratory and some extrinsic frame , as they stood at some point in the causal past of the experiment.
Now consider how an epistemic interpretation would fare in light of such a discovery. Since the difference between and is in this case not a physical difference, but instead represents a difference in an external observer’s knowledge or beliefs about the laboratory and its contents, then it would be deeply disturbing to find that this difference could be detected by experiments within the laboratory. It would imply that some hitherto unknown force has breached the laboratory walls181818Admittedly, this alone would not be entirely unprecedented – just think of the discovery of X-rays., but more alarmingly, that this force conveys very specific information about other people’s mental states of knowledge or beliefs. To insist on an epistemic interpretation under these circumstances, in other words, would be tantamount to endorsing mental telepathy as the explanation. Even if one allows that there might be some phenomena for which telepathy is a plausible explanation, this is not one of them, because for this hypothetical case there is a perfectly sound physical explanation: the quantum state is ontic and its dynamics is non-linear. Putting it more bluntly, faced with compelling experimental evidence for non-linear dynamics, epistemic interpretations would be stone dead.
We conclude that while both epistemic and relational ontic interpretations can explain why the principle of Local Definiteness should hold (hence too the linearity of quantum dynamics), only the epistemic interpretation stands the risk of being falsified if experiments were to find evidence of non-linear effects; it is therefore the better explanation of the two.
IV.3 What ontology?
As already noted, according to epistemic interpretations, nothing prevents us from asserting that the real situation inside the laboratory is the same, regardless of whether or not it an ensemble or a superposition from the outside. A skeptic might therefore challenge us to specify just what remains of the quantum formalism that could possibly constitute this “real situation”, according to an epistemic account. As this question can only be answered in the context of a specific epistemic interpretation, a full answer lies beyond the scope of this article; we can however make some preliminary remarks.
One option is to adopt a psi-epistemic ontological model, according to which the state represents partial epistemic knowledge about some hidden variables that describe the true state of reality inside the laboratory [16]. However, a number of no-go theorems have shown that psi-epistemic ontological models cannot reproduce operational quantum theory in general, and they have therefore fallen out of favour (see Ref. [35] for a review).
The alternative is to adopt an interpretation in which quantum states are epistemic, but where there are no hidden variables. There are very few interpretations with this feature; two notable examples are the information-theoretic interpretation of Bub and Pitowsky [36, 37], and the subjective Bayesian interpretation known as QBism (see eg. Refs [38, 39, 40]). In both cases, a quantum state does not represent a state of affairs in reality (not even partially); instead the quantum state is regarded as equivalent to a catalog of probability assignments for the outcomes of possible measurements that could be performed on the system. The interpretations differ as to whether the probabilities are ‘objective’ or ‘subjective’, but they agree that probabilities (hence quantum states) are epistemic: they reflect knowledge or beliefs about anticipated measurement outcomes. It is then clear that the quantum state in such interpretations is fully epistemic; by itself it says nothing about the real situation inside the lab. But if the quantum state has no direct hold upon reality, then what does?
The first thing to note is that, within the epistemic interpretations under discussion, so long as the lab remains ‘sealed’, that is, so long as we do not ‘interrogate’ the scientist as to what value of they observed, we on the outside cannot make any claim about the ‘real situation’ inside the lab. Insofar as such a ‘real situation’ can be said to exist at all, it can only exist for the scientist encapsulated in the lab191919This conclusion follows from arguments laid out in detail in Refs. [41, 42, 43]; it amounts to rejecting the assumption of ‘Absoluteness of Observed Events’..
Consequently, to discuss the ‘real situation’ inside the lab, we must break a rule so far respected within this article and speculate about the point of view of the encapsulated scientist . If we do so, we can gain a clue as to what the ‘real situation’ might be for them, by recalling the definition of Local Definiteness: a scientist confined to an enclosed laboratory cannot determine by any local experiment whether the laboratory and all its contents are in a definite effectively classical state, or in a superposition of such states.
Within the epistemic interpretations we are considering, this implies that if observes some outcome (corresponding to some effectively classical state of affairs inside the lab), they can safely assume that – at minimum – this outcome is part of the invariant ‘real situation’ pertaining to the inside the lab. Somebody outside the lab could still assign it a superposition state, and they might have very good reasons for doing so – but this information is irrelevant to the encapsulated scientist, so long as their concerns remain localized to the interior202020One is reminded of the similar role played by the Principle of Relativity, which guarantees that a scientist inside an inertially moving laboratory can safely assume that all systems in the laboratory are stationary, so long as they restrict attention to experiments confined to the interior of the lab..
Thus, if there is to be a non-trivial ontology implied by an epistemic interpretation, our results suggest that it has the following minimal features: (i) elements of reality are associated to measurement outcomes; (ii) the latter are only ‘locally real’, in the sense that they are real relative to the observer or observers who measure them; (iii) due to Local Definiteness, the local reality of some outcome is secured for all operations that can be performed ‘locally’, that is, within the encapsulated laboratory.
V Conclusion
We have proven that for a wide class of physical theories, a scientist in an enclosed laboratory cannot determine whether they are inside a superposition or not, iff the dynamics of the theory is convex-linear. Consequently, if we choose to adopt the principle of Local Definiteness, then linearity of the dynamics follows automatically.
We have furthermore argued that if quantum states are ontic, then there is no strong reason to expect Local Definiteness to be true, except insofar as it is derivable from assumptions about the dynamics. By contrast, if quantum states are epistemic, then we can only reasonably consider theories that satisfy Local Definiteness. Hence epistemic interpretations can better explain why quantum dynamics has so far been observed to be linear.
We now mention some limitations of our argument and some questions that remain open.
Firstly, in Sec IV.3 we argued that the local real situation inside the lab can be identified with the measurement outcomes obtained by the scientist in the lab. However, this opens up many more questions: is this local situation in principle accessible to observers outside? If so, how and under what conditions?
Secondly, our argument makes a formal distinction between kinematics and dynamics throughout, but this is somewhat ironic since Spekkens has elsewhere applied Leibniz’ principle to argue that this distinction is unfounded [44]. Thus, if our argument really requires us to make a formal split between kinematics and dynamics, our appeal to Leibniz’ principle might actually undermine our argument. It would therefore be desirable to prove that our results still hold in a framework that does not separate kinematics from dynamics.
Finally, we have here only focused on one aspect of quantum theory, namely the linearity of the dynamics. However, a full evaluation of different interpretations should consider their ability to explain a wider range of quantum phenomena. The present results provide one more reason to take epistemic interpretations seriously, but much work still remains to show that they are decisively better than ontic alternatives.
Acknowledgements.
This publication was made possible through the support of Grant 62424 from the John Templeton Foundation. The opinions expressed in this publication are those of the author and do not necessarily reflect the views of the John Templeton Foundation. Thanks to Chris Fuchs, Blake Stacey, Matt Weiss, and Thomas Galley for feedback that led to improvements in the article.
Appendix A What about stochastic non-linear dynamics?
In our proof of Theorem III.2 we considered transformations that map physical states to (unique) physical states. One might worry that such maps exclude so-called stochastic non-linear maps, which include important classes of non-linear dynamical theories such as the Ghirardi–Rimini–Weber [45] and Continuous Spontaneous Localization [46] models (for recent overviews see Refs [2, 1, 3]). There is no reason for concern: our framework is general enough to encompass stochastic non-linear dynamics, as we now explain.
A transformation will be called stochastic in our framework if it transforms a pure state into a state which is not pure, that is, either into an improper mixed state (recall Sec III.1) or into an ensemble of states. The former case is explicitly included within the scope of transformations contemplated by Theorem III.2, so we need only concern ourselves with the latter case. Note that transformations of this kind act on a state by transforming it into an ensemble, which can be written as:
[TABLE]
where the component transformations may be assumed without loss of generality to be transformations in the ordinary sense, that is, they map physical states to unique physical states.
By definition, the ensemble (36) represents a situation of epistemic ignorance: it implies one of the transformations has actually occurred, we just don’t know which one. Note that this means each individually must be an allowed transformation of the theory, one that could in principle be implemented on its own.
Now recall that Theorem III.2 says that a transformation is convex-linear iff it does not permit us to distinguish any state and ensemble belonging to the same equivalence class . Note that if there exists a valid transformation that allows us to distinguish from , we can always contrive a valid stochastic transformation of the form (36) containing that retains this power – for instance, one in which the probability of implementing any other with is arbitrarily small. Conversely, if each individually cannot enable us to distinguish from , then applying a stochastic transformation that merely represents ignorance about which of the was implemented can hardly improve our chances.
In conclusion, stochastic transformations of the form (36) can distinguish from iff there is at least one valid map that can do so. It then follows by Theorem III.2 that such stochastic transformations are powerless to differentiate from iff all valid transformations are convex-linear.
Appendix B What about quasi-linear dynamics?
A special class of non-linear quantum dynamical transformations has recently been proposed, called quasi-linear dynamics [11, 12]. Formally, a transformation is called quasi-linear iff it satisfies the following two properties. First, it preserves the convex structure of density operators:
[TABLE]
where , , and . Secondly, the final state does not depend on which decomposition of is used to compute it. That is, if we have two different convex decompositions of ,
[TABLE]
then we must have
[TABLE]
Note that reduces to a linear transformation of density operators iff in (B).
The significance of this for our present work is that it is tempting to interpret the two convex decompositions (38) and (39) as representing different epistemic ensembles and respectively, where the measurement statistics of the states , are computed using the density matrices , respectively. The problem is that if we interpret the convex decompositions in this way, then it is clear that (i) the epistemic ensembles belong to the same statistical equivalence class (since they have the same total density matrix) and (ii) they cannot be distinguished by any quasi-linear transformation. But this would contradict our Theorem III.2, since the latter implies that transformations which cannot distinguish ensembles in the same equivalence class must be convex-linear, whereas quasi-linear transformations are non-convex-linear whenever . Something is clearly amiss!
To explain why there is no contradiction, we must stress that the word ‘ensemble’ is often abused in standard quantum theory, being used on one hand to refer to improper mixed states, and on the other hand to refer to ‘proper mixtures’ – that is, representations of epistemic uncertainty about which state actually obtains, which in our framework are exclusively called ‘ensembles’.
The reason why conflating these two notions does not normally lead to trouble is that standard quantum dynamics is linear, which makes it operationally impossible to distinguish an improper mixed state from an epistemic ensemble of states whose statistics are given by the same density matrix. The ontological difference between the two concepts is therefore normally operationally inconsequential and can be ignored.
The situation is drastically different when one is contemplating the possibility of non-linear quantum dynamics, as we explained in Sec II. In this case it is absolutely essential to make a formal distinction between proper and improper mixtures (see eg. remarks in Refs [3, 4, 47, 48] ). To avoid inconsistency, we must be explicit about how the mixed density matrices are to be interpreted in the defining equations (B) (40) of quasi-linear dynamics.
In Ref [11] the two decompositions (38) and (39) are referred to as different ‘ensembles’, but it is not specified whether this means they represent distinct epistemic mixtures of states, or whether each one simply refers to an improper mixed state at the ontological level. Fortunately we can fill in this gap ourselves, for a little thought shows that the probability weights in the definition (B) cannot after all be interpreted as representing epistemic ignorance. The reason is that this would imply that acts on an ensemble as follows:
[TABLE]
where , that is, where actively changes the probability weights. If the latter are epistemic, then unless somehow acts on our brains to change our beliefs in subtle ways, this should not happen. In different words, if merely represents our epistemic uncertainty as to whether the state really is prior to the transformation, then simply knowing that we have applied to the system should not make us more or less certain that the state we applied it to was in fact ; after the transformation the epistemic weight for the state to be should still be just (Indeed we explicitly assumed this in the first step (30) of our proof of Theorem III.2).
We therefore see that the premise leading to an apparent contradiction with our framework was false: we cannot after all interpret the two convex decompositions (38) and (39) as representing different epistemic ensembles and . Instead, within our framework, these two decompositions must represent two different improper mixed states , , which can each be thought of as being specified by the density matrix together with a choice of parameters (either or ) that specify the particular decomposition of being considered.
A general non-linear transformation as defined in our framework can therefore in principle distinguish between and by being sensitive to these parameters – for instance we could consider maps that obey (B) relative to the decomposition associated to the state, but which violate (40) by transforming states with initially the same into states associated with distinguishable density matrices. Understood in this way, we see that the quasi-linear maps are the non-trivial subset of non-linear maps for which states having the same density matrix are mapped to states having the same density matrix, even though their other ontological parameters (specifying their decompositions) may change.
Having thus brought quasi-linear transformations into concordance with our framework, we see that Theorem III.2 must apply to them as well: a quasi-linear transformation cannot distinguish a state from an epistemic ensemble belonging to the same equivalence class iff is convex-linear, that is, in (B).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Bassi et al. [2013] A. Bassi, K. Lochan, S. Satin, T. P. Singh, and H. Ulbricht, Models of wave-function collapse, underlying theories, and experimental tests, Rev. Mod. Phys. 85 , 471 (2013) . · doi ↗
- 2Ghirardi and Bassi [2020] G. Ghirardi and A. Bassi, Collapse Theories, in The Stanford Encyclopedia of Philosophy , edited by E. N. Zalta (Metaphysics Research Lab, Stanford University, 2020) summer 2020 ed.
- 3Hejlesen [2019] Z. L. Hejlesen, Nonlinear Quantum Mechanics , Master’s thesis , University of Oslo (2019).
- 4Pienaar [2013] J. Pienaar, Causality Violation and Nonlinear Quantum Mechanics , Ph.D. thesis , The University of Queensland (2013).
- 5Kent [2021] A. Kent, Quantum state readout, collapses, probes, and signals, Phys. Rev. D 103 , 064061 (2021) . · doi ↗
- 6Everett [1957] H. Everett, ”Relative State” Formulation of Quantum Mechanics, Rev. Mod. Phys. 29 , 454 (1957) . · doi ↗
- 7Barrett [2018] J. Barrett, Everett’s Relative-State Formulation of Quantum Mechanics, in The Stanford Encyclopedia of Philosophy , edited by E. N. Zalta (Metaphysics Research Lab, Stanford University, 2018) winter 2018 ed.
- 8d’Espagnat [1976] B. d’Espagnat, Conceptual Foundations of Quantum Mechanics (Addison-Wesley, 1976).
