Mermin inequalities for perfect correlations in many-qutrit systems
Jay Lawrence

TL;DR
This paper derives Mermin inequalities for many-qutrit systems based on GHZ contradictions, enabling experimental tests and showing exponential divergence of quantum and classical predictions as the number of qutrits increases.
Contribution
It introduces Mermin inequalities derived from GHZ proofs for qutrit systems, facilitating experimental verification of quantum nonlocality in higher-dimensional systems.
Findings
Quantum to classical ratio diverges exponentially with number of qutrits
Quantum prediction for M grows as 2^N/3
Classical value remains bounded, confirming nonlocality
Abstract
The existence of GHZ contradictions in many-qutrit systems was a long-standing theoretical question until it's (affirmative) resolution in 2013. To enable experimental tests, we derive Mermin inequalities from concurrent observable sets identified in those proofs. These employ a weighted sum of observables, called M, in which every term has the chosen GHZ state as an eigenstate with eigenvalue unity. The quantum prediction for M is then just the number of concurrent observables, and this grows asymptotically as 2^N/3 as the number of qutrits (N) goes to infinity. The maximum classical value falls short for every N, so that the quantum to classical ratio (starting at 1.5 when N=3), diverges exponentially (~ 1.064^N) as N goes to infinity, where the system is in a Schroedinger cat-like superposition of three macroscopically distinct states.
|
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Mermin inequalities for perfect correlations in many-qutrit systems
Jay Lawrence
Department of Physics and Astronomy, Dartmouth College, Hanover, NH 03755, USA
The James Franck Institute, University of Chicago, Chicago, IL 60637
(revised )
Abstract
The existence of GHZ contradictions in many-qutrit systems was a long-standing theoretical question until it’s (affirmative) resolution in 2013. To enable experimental tests, we derive Mermin inequalities from concurrent observable sets identified in those proofs. These employ a weighted sum of observables, called , in which every term has the chosen GHZ state as an eigenstate with eigenvalue unity. The quantum prediction for is then just the number of concurrent observables, and this grows asymptotically as as the number of qutrits . The maximum classical value falls short for every , so that the quantum to classical ratio (starting at 1.5 when ), diverges exponentially () as , where the system is in a Schrödinger cat-like superposition of three macroscopically distinct states.
pacs:
03.67-a, 03.65.Ta, 03.65.Ud
I Introduction
Bell’s inequality (Bell64, ) shows that no local hidden variable theory (HV) can duplicate the quantum predictions for the correlations of two distant spin-1/2 particles (in Bohm’s model Bohm51 of the original EPR scenario EPR35 ). Specifically, the maximum quantum value of a certain correlation operator exceeds the maximum value allowed by HVs, with both quantum and HV predictions being probabilistic. It was not known for another quarter century (1964-1989) whether a stronger theorem existed, allowing for a definite (non-probabilistic) quantum prediction, until Greenberger, Horne, and Zeilinger (GHZ) GHZ89 found one for a system of three spin-1/2 particles GHZ89footnote . Here, the product of three spin projections measured at distant points is predicted to take a single definite value, despite the randomness of the local measured values. The definiteness of the quantum prediction elevates HVs to the status of EPR elements of reality, since knowledge of local observables at two distant points allows prediction “with certainty” of that at the third point. On a practical level, this definiteness is essential in quantum information protocols such as quantum error correction Shor96 and quantum secret sharing BBH99 .
In 1990, Mermin Mermin90 generalized the GHZ proof and supplied a Bell inequality for all based on the perfect correlations predicted by quantum mechanics. This was done to enable future experimental tests of GHZ contradictions by accounting for inevitable uncertainty in actual measurements, despite their absence in principle. Experimental tests have indeed made use of such inequalities (now called Mermin inequalities) to demonstrate GHZ contradictions with a probability of many standard deviations. The first such test Pan00 came a decade later; a recent test Su17 describes the current state of the art.
Extensions of Mermin’s work within qubit sytems include GHZ contradictions based on stabilizer groups of particular error-correcting codes DiVP97 , and Mermin-like inequalities based on stabilizers for all graph states of Cabello08 . In the latter work (2008), Cabello et. al. defined a Mermin inequality as a Bell inequality for which (I) the Bell operator is a sum of stabilizing operators that represent the perfect correlations in their simultaneous eigenstate, and (II) the ratio of quantum to maximum classical value is a maximum for that state. We shall propose a modest extension below.
Equally interesting for us are the extensions to higher dimensions (), which differ for even and odd cases. Extensions to higher even dimensions include GHZ contradictions and Kochen-Specker identities CMP02 ; Mermin90b for odd (2002), then GHZ contradictions for odd LLK06 (2006), and more recently (2013), GHZ contradictions and corresponding Mermin inequalities Tang13 for systems of all (with even ), using GHZ-type graph states.
Regarding odd dimensions, Bell inequalities have been derived for systems of three Acin04 or more Alsina16 qutrits, as well as for higher odd Son06 . However, these are not Mermin inequalities; their quantum predictions are not definite, so they do not establish an underlying GHZ contradiction. In fact, prior to (2013), it was not known whether a GHZ contradiction existed for any odd . It is now known that they do exist Ryu13 ; JL14 , and their discovery led to the completion of the program to establish GHZ contradictions (theoretically) for all for every JL14 . However, it is also known that these odd- contradictions cannot be based on stabilizer sets Howard13 , as is typical in even dimensions – a conclusion drawn from studies of the discrete Wigner function for odd dimensions Gross ; Veitch .
The newly discovered GHZ contradictions in odd- systems Ryu13 ; JL14 are based on concurrent observables LLK06 - observables that are not compatible but have a common eigenstate. These are not the stabilizers usually associated with graph states, both because they lack compatibility and because local measurement bases are not exclusively those of the generalized Pauli operators. However, these aspects do not comprise experimental testing, for which the essential distinction between Bell and Mermin inequalities is the definiteness of the quantum predictions. Thus, it seems appropriate to broaden the definition of a Mermin inequality by deleting the word “stabilizing” from statement I (paragraph 3) above.
The purpose of this article is to construct Mermin inequalities, in this broader sense, for systems of qutrits, from sets of concurrent observables that share a GHZ eigenstate, violations of which would establish the perfect correlations of GHZ contradictions. Entangled multiple-qutrit states are now being investigated experimentally Anton16 , and such inequalities will enable experimental tests of GHZ contradictions. In the next section we present results for all ; the exceptional case of is presented in Sec. III, and in Sec. IV we discuss conclusions and open questions.
II Mermin inequalities for
It will be useful to consider three choices of GHZ state that differ by relative phases of components,
[TABLE]
where . Envisioning the qutrits as spin-1 particles, Fig. 1a illustrates that is obtained from (and from ) by rotations of . Such “rotations” refer to any combination of individual qutrit rotations about their respective axes, by increments adding up to . A defining symmetry of GHZ states is that the rotated state is independent of the distribution of these increments among qutrits JL14 .
The corresponding observable sets of which are joint eigenstates are also related by compound rotations. The starting point is the basic observable,
[TABLE]
where each factor is the qutrit Pauli matrix () that defines the first local measurement basis. The second local basis () is defined by a rotation of ,
[TABLE]
Compound rotations of the operator (Eq. 2) through generate tensor products in which factors of are distributed in all possible ways, \big{(}{k\atop N}\big{)}, among factors of . These operators appear at points in Fig. 1b.
Clearly, is an eigenstate of with eigenvalue 1. Rotational covariance of operators and states JL14 means that is an eigenstate of all operators at the point 1, and of all operators at point 2 - in all cases with eigenvalue 1. Points 3 and beyond are then governed by the periodicity property JL14 : Any rotation of an operator through (, from 0 to 3) preserves its eigenstates, but multiplies its eigenvalues by . Therefore, is a joint eigenstate of operators at points 0, 3, and 6 (black arrows in Fig. 1); of operators at 1, 4, and 7 (red arrows); and of operators at 2, 5, and 8 (green arrows). In each case, the eigenvalues are 1, , and , respectively. We shall refer to equilateral triangles (0,1,2) defined by each set of arrows, with concurrent operators at its vertices. (Not all vertices are occupied when .)
The case of is special because a third local measurement basis is required for GHZ contradictions. Hence we defer that case and proceed here with , which is the simplest case. Choosing the state , red arrows in Fig. 1b identify the concurrent subset of five observables – the four cyclic permutations of , each with eigenvalue 1, and , with eigenvalue . These observables produce a GHZ contradiction JL14 . For the corresponding inequality, we define the Mermin operator,
[TABLE]
of which is clearly an eigenstate with eigenvalue . This “quantum value” is to be compared with the maximum HV value. The general HV value, which we call , depends on the values (1, , or ) assigned to each of the local factors [thence called and ], which must be the same in each of the five tensor products. It is easy to see that depends only on the local ratios,
[TABLE]
where, hiding an irrelevant overall phase factor, , it is simply
[TABLE]
It is easy to verify by explicit calculations that the maximum value, , is obtained with either of two HV models: (i) uniform (, ), and (ii) a single departure from uniformity (, and all others ). This maximum value is
[TABLE]
so that the ratio of quantum to maximum HV values is
[TABLE]
Clearly the alternative choice, , together with operators at points 0 and 3, would result in the same values of and , while the choice , with operators only at point 2, would result in , showing no GHZ contradiction. Therefore, and are equally valid Mermin operators, according to the definition.
For arbitrary , we pick a state and identify its candidate Mermin operator as the sum all concurrent operators at the vertices of the corresponging triangle, with weighting factors 1, , and assigned to first, second, and third vertices traversed in counterclockwise order. As above, the results ( and ) depend on the choice of , and Table I shows those choices which maximize the ratio and produce Mermin operators. For even , by symmetry, there are two such choices; for odd , only one. The corresponding quantum eigenvalues are given by
[TABLE]
equal to the total number of concurrent observables on the th triangle.
The contrasting values are maxima of , given on the Table, over the local ratios (Eq. 5). In the following six paragraphs, we show how these are determined, including the proof of the following –
Theorem: Maximum values of , for values listed on Table I, are attained with uniform in all cases except , 7, and 9, where the simplest nonuniform model (, ) narrowly prevails.
Proof: A closed-form expression for candidate Mermin operators (valid for 1, or 2) is JMK16
[TABLE]
To verify, one can easily see that certain powers of arising in the binomial expansions cancel out because . With , for example, this cancellation leaves only the powers 0, 3, 6, …, exactly those terms residing on the vertices of the triangle in Fig. 1b; and similarly for and 2. One can also verify that the relative vertex weighting factors are (1, , and ), as required, once the higher powers of have been reduced (, , .). (Note that appears with overall multiplying factor .)
The main utility of 10 is to make the -dependence explicit in
[TABLE]
If the are uniform, then
[TABLE]
where the individual magnitudes are labeled so that :
[TABLE]
and the identity was used. If a single differs from the rest (, , others unity), then the effects on Eq. 12 are to permute the coefficients, and to rotate each vector in the complex plane:
[TABLE]
where the rotation angles [(), (), and (), respectively] are independent of which is chosen to be different.
Thus, introducing the nonuniformity decreases the two largest terms while increasing only the smallest. This can produce a net gain in only if the rotations bring the two largest terms into closer alignment. This unlikely scenario is actually realized in the few cases, , 7, and 9.
To demonstrate, first consider odd . With values listed on Table I, it is easy to see that the three vectors comprising in either 12 or 14 are collinear for every odd . In 12, the term is aligned opposite to the and terms, so that
[TABLE]
In 14, the -like term is aligned opposite to the others, and so
[TABLE]
The difference, (15 - 16), is an increasing function of with a zero at , so that is the larger for all odd , while is the larger for 5, 7, and 9. We still have to rule out more complex HV models – this is done below.
Now consider even : Again with values listed on Table I, one can easily see that the three vectors in Eq. 12 (uniform ) are minimally aligned in the complex plane (angular separations are ). Nonuniformity (14) shrinks the two longer vectors as above, while the induced rotations improve their alignment somewhat (to the smaller of the angular separations , , and ), but not enough to provide a net gain in : Keeping the two dominant terms in each of Eqs. 12 and 14, whose angular separations are and , respectively, it is easy to show formally that is positive for all . For , the exact calculations described above show that both HV models realize the maximum value. Hence provides the maximum for all even .
To rule out further HV models for all even and odd : First consider the alternate single departure, (, others unity). The largest term is reduced sharply (by ), the next largest is increased slightly (by ), while the relative alignment of these two remains unchanged. So this model is ruled out trivially. Multiple departures from uniformity may be viewed as sequences of single departures in which every step has the following properties: (i) Either it reduces the two longest vectors, or it reduces the longest by more than it increases the next-longest, and (ii) beyond the first step (which results in Eq. 14), it reproduces angular separations already seen in Eq. 12 or 14. Thus it cannot increase beyond the larger of and .
This concludes the proof of the theorem stated above. To evaluate , one may simply use Eq. 15 or 16 for odd ; for even use 12, knowing the the angular separations are for the listed -values. It is also instructive for smaller to write the Mermin operator directly from Fig. 1b and evaluate at determined by the theorem. Table I lists the exact maxima, , which are all integers or square roots thereof, along with rounded values of .
The asymptotic form of at large is given by the dominant term in 12, namely
[TABLE]
so that the quantum to classical ratio (Eq. 9 to 17) diverges as . This exponential divergence is slow compared with Mermin’s () for qubit systems Mermin90 ; nevertheless it represents a superposition of three macroscopically distinct states.
III The case of N = 3
This may be the most interesting case experimentally. It is singled out here because its GHZ contradictions require three local measurement bases Ryu14 ; JL14 . So, while we consider the same three GHZ states (1), the concurrent operator sets must now incorporate a third local basis, a natural choice being given by rotation of individual factors through :
[TABLE]
(compare 3). The observables generated by rotations of now include all combinations of , , and factors, and are classified in Fig. 2 according to total rotation angles, . Again these fall into three concurrent subsets, each associated with an equilateral triangle and its own joint eigenstate in Fig. 1a. In this case all three Mermin operators produce the same outcome: and . Let us demonstrate with the simplest example:
[TABLE]
with weight factors (1,) applied as required. Recall that the HV
magnitude depends only on ratios, defined here as and . So, hiding an overall irrelevant phase factor ,
[TABLE]
where the subscripts of and are implied by their positions, , . Now suppose the ratios are uniform, and . Then, . To increase this value, one would require an HV assignment that brought the first and/or last term into equality with the seven other terms, without losing an equal number (or more) of those terms. It is easy to see that there is no such assignment.
Finally, it is interesting to note that and yield the same maximum, but both require nonuniform HV assignments, eg., ( with all others unity) for the former, and, for the latter, ( with all others unity).
IV Conclusions and open questions
We have presented Mermin operators and associated inequalities for systems of qutrits. The exceptional case of requires three local measurement bases; all other cases require two. The eigenvalue of the Mermin operator (the definite quantum prediction of its measured value), is given by Eq. 9 and diverges as for large . The maximum HV values are illustrated in Table I and reflect optimal HV assignments derived in Sec. II. These diverge as 1.879N. The ratio of quantum to maximum HV values diverges as .
Ironically, the structure behind the inequalities derived here forms a close parallel with Mermin’s, despite the compatibility of his observable sets as compared with the mere concurrence of those used here. This is because his Pauli tensor products and their eigenstates are related by the same rotational covariance that forms the basis of the treatment given here. It is a simple exercise to write down two alternative compatible Pauli subsets (one of which is Mermin’s), and their corresponding joint GHZ eigenstates, on diagrams analogous to Fig. 1, in which the basic angular interval is rather than . Moreover, Mermin’s derivation of HV maxima is based on a formula like our Eq. 10, in particular
[TABLE]
obtained by replacing and . The two choices and 1 produce identical and values, resulting in (even ), and (odd ).
A comparison of Eqs. 10 and 21 suggests why our exponential growth is less dramatic than Mermin’s. The maximum length of any factor in the HV expression for qutrits (11) is , no matter how HV values are assigned. The analogous length factor in the qubit case is . This corresponds to the different angular resolutions of vector factors in Eqs. 10 and 21, showing minimum angles in the complex plane of vs . These differences reflect the greater freedom of qutrit HVs over qubit HVs in aiming for the quantum results.
The above comparison raises the question whether Mermin inequalities exist for systems of higher odd dimensions , where compatible observables do not produce GHZ contradictions. It seems plausibible that a similar construction would succeed for any higher prime , although one would expect still weaker violations of local realism for the reason given above. For higher composite dimension, a similar but more complex construction might succeed based on the smallest prime factor of .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1) J. S. Bell, Physics 1 , 195 (1964).
- 2(2) D. Bohm, Quantum Theory , New York: Prentice Hall (1951).
- 3(3) A. Einstein, B. Podolsky, and N. Rosen, Phys. Rev. 47 , 777 (1935).
- 4(4) D.M. Greenberger, M.A. Horne, and A. Zeilinger, in Bell’s Theorem, Quantum Theory and Conceptions of the Universe , edited by M. Kafatos (Kluwer Academic, Dordrecht, 1989), p. 69, and eprint ar Xiv:quant-ph/0712.0921(2007).
- 5(5) GHZ GHZ 89 actually derived their result with four spins-1/2, mentioning in a footnote that it also holds with three.
- 6(6) P. Shor, Phys. Rev. A 52 , R 2493 (1995).
- 7(7) V. Busek, A. Berthiaume, and M. Hillery, Phys. Rev. A 59 , 1829 (1999).
- 8(8) N. D. Mermin, Phys. Rev. Lett. 65 , 1838 (1990).
