Redundancy of einselected information in quantum Darwinism: The irrelevance of irrelevant environment bits
Michael Zwolak, Wojciech H. Zurek

TL;DR
This paper demonstrates that irrelevant environmental information does not significantly affect the redundancy of relevant information in quantum Darwinism, supporting the robustness of objective classical reality emergence.
Contribution
It shows that mixing relevant and irrelevant information in the environment does not diminish the redundancy of the relevant information, affirming the resilience of quantum Darwinism.
Findings
Redundant information of interest remains intact despite irrelevant bits.
Mixing relevant and irrelevant information does not significantly reduce redundancy.
Objective classical reality emerges robustly from quantum substrate.
Abstract
The objective, classical world emerges from the underlying quantum substrate via the proliferation of redundant copies of selected information into the environment, which acts as a communication channel, transmitting that information to observers. These copies are independently accessible, allowing many observers to reach consensus about the state of a quantum system via its imprints in the environment. Quantum Darwinism recognizes that the redundancy of information is thus central to the emergence of objective reality in the quantum world. However, in addition to the "quantum system of interest," there are many other systems "of no interest" in the Universe that can imprint information on the common environment. There is therefore a danger that the information of interest will be diluted with irrelevant bits, suppressing the redundancy responsible for objectivity. We show that mixing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Redundancy of einselected information in quantum Darwinism: The irrelevance
of irrelevant environment bits
Michael Zwolak
Department of Physics, Oregon State University, Corvallis, OR 97331
Wojciech H. Zurek
Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM 87545
Abstract
The objective, classical world emerges from the underlying quantum substrate via the proliferation of redundant copies of selected information into the environment, which acts as a communication channel, transmitting that information to observers. These copies are independently accessible, allowing many observers to reach consensus about the state of a quantum system via its imprints in the environment. Quantum Darwinism recognizes that the redundancy of information is thus central to the emergence of objective reality in the quantum world. However, in addition to the “quantum system of interest,” there are many other systems “of no interest” in the Universe that can imprint information on the common environment. There is therefore a danger that the information of interest will be diluted with irrelevant bits, suppressing the redundancy responsible for objectivity. We show that mixing of the relevant (the “wheat”) and irrelevant (the “chaff”) bits of information makes little quantitative difference to the redundancy of the information of interest. Thus, we demonstrate that it does not matter whether one separates the relevant information) from the (irrelevant) chaff: The large redundancy of the relevant information survives dilution, providing evidence of the objective, effectively classical world.
Amplification – already invoked by Bohr Bohr (1958) – is the central process by which the underlying quantum substrate gives rise to the objective, classical world Zwolak et al. (2014, 2016). Quantum Darwinism Zurek (2009, 2014) formalizes this notion into the concept of redundancy: When quantum systems are decohered Joos et al. (2003); Zurek (2003); Schlosshauer (2008), they transfer select information – information about their pointer states Zurek (1981) – to their environment. Many observers can then infer the state of the system indirectly by intercepting some small fragment of the environment Ollivier et al. (2004); Blume-Kohout and Zurek (2006). In other words, this select information is redundant, as any small fragment will do, and is thus objective: Many observers can independently deduce the pointer state of the system and reach consensus about it.
In our Universe there are many fragments of the environment that have no or nearly no information about any given quantum system of interest at any given time. Thus, in what way can one then apply the quantum Darwinist considerations? To begin addressing this question, we consider a spin model introduced in Ref. Blume-Kohout and Zurek (2005) with two types of spins in the environment , ones that acquire perfect (classical) information about the system and others that acquire no information. These are the “good” and “bad” environments, respectively. This can be represented by a state of the form
[TABLE]
Physically, this state is generated when a set of “good” environment components – the “wheat” – each perfectly decohere the system in the -basis and a set – the “chaff” – do not interact at all with the system. In what follows, the results are not limited to “diametrically opposed” good and bad environments (i.e., ones like in Eq. (1)), but rather extend to the case with partial information in both the good and bad environments, as well as mixed states, states without permutational invariance (e.g., in the environment ), and alternative measurements to extract the information Kincaid et al. (2017).
Pure decoherence – whether in a globally pure state or in mixed state – is the process by which the state structure in Eq. (1) arises. The Hamiltonian and initial states for pure decoherence by independent environment components Zwolak et al. (2009, 2010) are with and where specifies an environment subsystem. Under pure decoherence, no transitions occur between the pointer states (the eigenstates of Zurek (2003, 1981)). Up to unimportant local unitary rotations, the state, Eq. (1), develops via a pure decoherence process, e.g., one having a interaction in the Hamiltonian with for and for , and an initial state /2, where is a eigenstate. These models – which include run-of-the-mill, everyday photon environments Zwolak et al. (2014) – approximate the case where decoherence is strong compared to the natural dynamics of the system. Moreover, spin models of this type help elucidate the nature of redundancy in various settings Zwolak et al. (2016); Riedel et al. (2012), which is what we will do here.
Intuitively, we know the redundancy of information in the state in Eq. (1): There are “good” bits, which are in a GHZ state, and thus perfectly classically correlated with the pointer observable of the system, and there are “bad” bits, which are in a product state, and thus not correlated at all with the system. Hence, the redundancy is just . However, in a world where we are bombarded with good and bad bits alike, the question arises: What is the typical fragment size that we need to intercept from the total environment, , to get nearly complete information about ? Likewise, in what way should we define redundancy, with respect to or just ?
To answer these questions, we examine the mutual information, which quantifies the correlations between the system and some fragment of the environment,
[TABLE]
This can be divided into classical (the Holevo quantity Holevo (1973)) and quantum (the discord Zurek (2000); Ollivier and Zurek (2001); Henderson and Vedral (2001)) components Zwolak and Zurek (2013),
[TABLE]
where specifies a basis (or, more generally, a POVM) on . The Holevo quantity, , gives the maximum classical information available about in , while the quantum discord is what remains. The information most efficiently transmitted by the environment will be about the pointer basis, Ollivier et al. (2004); Zwolak and Zurek (2013). Thus, the Holevo quantity for that basis, , will be of interest.
To define redundant information, we need to know how much (what portion) of the “classical information of interest” is contained in a typical fragment (of size ) of the total environment, . Thus, we seek , the size of the fragments that contain all but the information deficit of the classical information,
[TABLE]
Above is the average over fragments of size and , i.e., the entropy of the pointer observable – the missing information about . Observers do not require (and usually cannot get) all the missing information. The information deficit, , is the amount of information that observers are prepared to forgo. The averaging can be done over the relevant environment, , or the total environment, . Alternatively, one can maximize the number of distinct fragments that give nearly complete information about , i.e., .
Each of these approaches may yield different results. We will show that all these procedures give the same value for the redundancy,
[TABLE]
up to an insignificant scaling factor, where is taken from Eq. (4) or from the maximization procedure.
The Holevo quantity will approach according to Zwolak et al. (2014, 2016)
[TABLE]
where , a function of , is the error probability for distinguishing the conditional states of the fragment. The latter are given by , where is a pointer state, is the probability that state occurs, and is the reduced state of the system and fragment. In the case of the state in Eq. (1), the conditional states are just and .
The asymptotic behavior of the error probability is given by
[TABLE]
where the exponent
[TABLE]
is the “typical” Chernoff information Zwolak et al. (2014), which generalizes the quantum Chernoff bound (QCB) Audenaert et al. (2007, 2008); Nussbaum and Szkoła (2009); Li (2016) to sources of quantum states that are not independent and identically distributed (i.i.d.). The error probability also depends on the , but only in a prefactor to the exponential, and thus it does not play a role as becomes large. The optimal value of (with ) is the one that maximizes . The latter is not an easy task for non-i.i.d states. However, for spin systems undergoing pure decoherence (of which the state in Eq. (1) is an example) the value of is Zwolak et al. (2016) (the value of can also be found for certain classes of photon environments Zwolak et al. (2014)).
This maps the understanding of classical information communicated by (and ) into a problem of understanding the distinguishability of conditional states, on individual environment components (i.e., for the case here, individual environment spins). Equations (4), (6), and (7) allow us to estimate the redundancy Zwolak et al. (2014, 2016), Eq. (5), as
[TABLE]
Indeed, this shows that macroscopic redundancy is unavoidable for pure decoherence – except for states of measure zero (i.e., completely mixed initial environment states or ones that commute with the Hamiltonian), redundancy is always present Zwolak et al. (2009, 2010, 2014, 2016). The worst scenario for the dilution of information is when is small and fixed, while is taken to be larger and larger. Taking , Eq. (8) is simple,
[TABLE]
Thus, the redundancy is
[TABLE]
We see that the calculation requires that , as redundancy can not be less than one. If , it means that the scenario is outside of the realm of validity of the QCB calculation, as an observer needs essentially the whole environment to approach bits of information about , if they can acquire that amount of information at all.
We note that one can also exactly solve for the average Holevo quantity or mutual information, which yields the same result as above: Given a fragment , will be one if a good bit is intercepted and zero otherwise (similarly for the quantum mutual information, , unless all the good bits are intercepted, but this happens with negligible probability). The probability that the observer will intercept bad bits – and have zero information about the system – is
[TABLE]
with . The probability to intercept at least one good bit is , giving
[TABLE]
with . Redundancy requires that
[TABLE]
and we can use Stirling’s approximation to get the limiting forms of this expression, This yields the redundancy
[TABLE]
in agreement with the QCB result. For small , one can similarly expand Eq. (14), finding for near 1 – e.g., for , – and showing that the QCB result can work well even in the non-asymptotic regime. While this exact calculation is simple, the QCB calculation is even simpler still and is easily extended to many other cases (see, e.g., Ref. Zwolak et al. (2016)).
If instead of the usual definition of redundancy (Eqs. (4) and (5)), one defined redundancy as the maximum number of disjoint fragments for which , the result would be . For this “perfect” good-bad state, Eq. (1), this is true for any 111When , , as every component of the environment contains at least zero information.. By the same token, if one averaged Eq. (4) only over the environment , then and : Only one spin from is necessary to acquire the requisite information (we note that for the perfect GHZ state of the good bits with the system, the QCB calculation is not valid when limiting to the relevant environment, as diverges, reflecting that for any ). Indeed, these two latter approaches agree with our intuition about the state in Eq. (1), as there are just copies of the pointer state information.
These definitions lead to and and are thus not equivalent. However, they only differ by a factor of (and this result is unaffected by the choice of , , the probabilities for and in the initial state of the system) 222We note that Eq. (9) is an asymptotic lower bound to the redundancy, but it was proven in Ref. Zwolak et al. (2014) that it is within a factor of 2 of the exact result. For all cases where exact results can be found, such as the case here, Eq. (9) is in agreement and is thus not an estimate. However, since the redundancy computed from the maximization procedure is larger than Eq. (9), this means that we are finding a worst case difference. Similarly, we expanded the logarithm in Eq. (10). This approximation decreases the value of the redundancy. Without it, the difference between the definitions of redundancy would be closer.. For any reasonable , the two definitions are practically the same 333For , the definitions differ only by a factor of . Thus, for a very small , e.g., or , Eq. (4) underestimates the redundancy by only a factor of, e.g., 10 or 20. In the opposite regime, when is close, but not equal, to 1 (i.e., requiring that the records hold very little information), the redundancy using Eq. (4) continuously approaches as . The maximization, though, changes discontinuously, jumping from to at (where the records hold no information, which is an unimportant case). When one is interested in being close (but not equal) to one, using the maximization is more appropriate, as it accurately reflects the number of imperfect copies of information, whereas using Eq. (4) will overestimate this redundancy.. The reason for the correspondence between the definitions is that – when an observer intercepts both relevant and irrelevant bits – the probability not to get any good bit drops exponentially with the size of the fragment . This only weakly affects the ability of the observer to capture a good bit and deduce the state of the system. This is remarkable, as it says that when states of the form in Eq. (1) arise we need not worry about distinguishing between parts of the larger environment – parts that interact with the system and parts that either do not or only weakly interact – for quantifying the redundancy of information. In other words, we need not worry about separating the wheat from the chaff.
This example can be extended to the case where the good and bad spins are not diametrically opposed, i.e., not perfectly good or bad. For instance, one can consider good spins that contribute to the decoherence factor and bad spins that contribute . In this case, the QCB gives immediately
[TABLE]
where we have made use of the relationship between decoherence and information in pure states, Zwolak et al. (2016).
As with the “perfect” good-bad state, this calculation can be done in an alternative manner. To find the averaged Holevo quantity, one makes use of the equality for pure states or its more general form for Zwolak et al. (2010). Expanding the binary entropy for near 1/2 shows that one just needs to find the average decoherence factor. For fragments of size , the latter is given by the sum of from 0 to of times the probability
[TABLE]
of intercepting bad spins and good spins in the fragment. Stirling’s approximation can be used on the three last factors in the probability and the sum performed. This yields the QCB result, Eq. (16), at much greater expense.
For simplicity, we now assume that the bad spins give , i.e., a worst case where those bits never interacted with the system (and thus have no information). Expanding the QCB result, Eq. (16), one finds,
[TABLE]
Maximizing the number of fragments that give bits of information is asymptotically equivalent to a QCB calculation (for permutationally invariant states) that simply ignores the bad spins:
[TABLE]
These results, Eqs. (18) and (19), for the redundancy look different and it is not readily apparent that the same conclusions hold as in the “perfect” good-bad model, as the difference depends on the contribution to the decoherence factor from the good spins. The ratio between the two results, Eqs. (18) and (19), is
[TABLE]
As we will now show, the smallest this ratio can be is , just like the perfect good-bad model above.
One first has to note that Eq. (20) is monotonically increasing to 1 as increases to 1 (giving a redundancy of zero and making the definitions of redundancy the same in this limit). This can be proven by taking its derivative and applying the inequality in footnote [25] of Ref. Zwolak et al. (2014) to show the derivate is always positive. The smallest value of Eq. (20) can then be found by taking the smallest value of allowed by a consistent application of the QCB. There are two calculations of redundancy – one with good and bad spins, and one with only good spins. The former, Eq. (18), allows for all (indeed, this just recovers Eq. (11)). The latter, Eq. (19), however, requires (i.e., Eq. (19) can not be larger than no matter how small is, or, in other words, can not be less than 1). When
[TABLE]
or , each good spin holds a sufficient record to immediately put the information to within of the plateau upon receipt of one spin 444This ignores finite size effects, which are nevertheless order 1.. When , the redundancy no longer depends on , it is just 555In this regime, , one has to perform computations in an alternative way, but the conclusion is the same: The redundancy using the maximum is and the redundancy from the other definition does not change (the presence of “regularizes” the QCB, so it does not matter if ). Thus, their minimum ratio is still Eq. (22).. Thus, the QCB result, Eq. (19), is valid only when . When this is the case, the ratio of the two computed redundancies is Eq. (20). This ratio is minimal when , giving
[TABLE]
and is also unaffected by the choice of (). Since we want to be small, this ratio is essentially . Thus, just as with the “perfect” good-bad model, the two definitions differ only by an insignificant factor.
In both cases, we see that – when examining symmetric (permutationally invariant) “good” environments, – there is no difference between redundancy defined as the maximization and redundancy defined with the averaging in Eq. (4) over only the good environment. To extend this calculation to mixed and/or non-permutationally invariant states is a straightforward matter. One only has to note the Chernoff Information is no longer directly related to the decoherence factor, , but rather takes on a different form (compare Eq. (16) and Eq. (19) in Ref. Zwolak et al. (2016)), but otherwise the calculation is formally identical. Thus, the correspondence between definitions of redundancy – i.e., a difference of at most – is a general feature of pure decoherence.
This example not only shows the ease of computation using the QCB, it also helps us understand the definition of redundancy itself. If redundancy was defined by maximizing the number of copies of information, rather than taking an average over all fragments of a given size, then one would obtain a different value. However, both definitions give , where the proportionality is different only by a factor of – i.e., a factor that is only weakly dependent on . This is the case even when taking an essentially arbitrarily large number of bad bits in the environment. The reason for such a close correspondence between definitions is that, as the observer intercepts a larger and larger fragment, the probability of not receiving a good bit decreases exponentially with fragment size. The observer is thus likely to always to receive a good bit. The definition of redundancy in Eqs. (4) and (5) applied to the total environment , therefore, gives reasonable estimates – lower than the maximal redundancy but different only by an insignificant factor – for the number of records proliferated into the environment. Thus, the emergence of the classical, objective reality is unavoidable Ollivier et al. (2004); Zwolak et al. (2014); Brandao et al. (2015) and there is no need to separate the wheat from the chaff to perceive objective states of the systems of interest.
Acknowledgements.
We would like to thank Marek Rams for helpful discussions and the Center for Integrated Quantum Science and Technology (IQST) and the University of Ulm, where part of this work was carried out. This research was supported in part by the U.S. Department of Energy through the LANL/LDRD Program and by the Foundational Questions Institute Grant No. 2015-144057 on “Physics of What Happens.”
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Bohr (1958) N. Bohr, Atomic Physics and Human Knowledge (Wiley, New York, 1958) p. 83.
- 2Zwolak et al. (2014) M. Zwolak, C. J. Riedel, and W. H. Zurek, Phys. Rev. Lett. 112 , 140406 (2014).
- 3Zwolak et al. (2016) M. Zwolak, C. J. Riedel, and W. H. Zurek, Sci. Rep. 6 , 25277 (2016).
- 4Zurek (2009) W. H. Zurek, Nat. Phys. 5 , 181 (2009).
- 5Zurek (2014) W. H. Zurek, Phys. Today 67 , 44 (2014).
- 6Joos et al. (2003) E. Joos, H. D. Zeh, C. Kiefer, D. Giulini, J. Kupsch, and I.-O. Stamatescu, Decoherence and the Appearance of a Classical World in Quantum Theory (Springer-Verlag, Berlin, 2003).
- 7Zurek (2003) W. H. Zurek, Rev. Mod. Phys. 75 , 715 (2003).
- 8Schlosshauer (2008) M. Schlosshauer, Decoherence and the Quantum-to-Classical Transition (Springer-Verlag, Berlin, 2008).
