Topological proofs of contextuality in quantum mechanics
Cihan Okay, Sam Roberts, Stephen D. Bartlett, Robert Raussendorf

TL;DR
This paper introduces a cohomological framework to analyze quantum contextuality, unifying various proofs and highlighting its role as a resource in measurement-based quantum computation.
Contribution
It develops a topological approach to quantum contextuality applicable to both state-dependent and state-independent cases, encompassing parity and symmetry-based proofs.
Findings
Provides a unified topological framework for contextuality proofs
Applies to both state-dependent and state-independent contextuality
Highlights the role of topological methods in quantum computation
Abstract
We provide a cohomological framework for contextuality of quantum mechanics that is suited to describing contextuality as a resource in measurement-based quantum computation. This framework applies to the parity proofs first discussed by Mermin, as well as a different type of contextuality proofs based on symmetry transformations. The topological arguments presented can be used in the state-dependent and the state-independent case.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Topological proofs of contextuality in quantum mechanics
Cihan Okay
Department of Mathematics, University of Western Ontario, London, Ontario, Canada
Sam Roberts
Centre for Engineered Quantum Systems, School of Physics, The University of Sydney, Sydney, NSW, Australia
Stephen D. Bartlett
Centre for Engineered Quantum Systems, School of Physics, The University of Sydney, Sydney, NSW, Australia
Robert Raussendorf
Department of Physics and Astronomy, University of British Columbia, Vancouver, BC, Canada
Abstract
We provide a cohomological framework for contextuality of quantum mechanics that is suited to describing contextuality as a resource in measurement-based quantum computation. This framework applies to the parity proofs first discussed by Mermin, as well as a different type of contextuality proofs based on symmetry transformations. The topological arguments presented can be used in the state-dependent and the state-independent case.
1 Introduction
Contextuality [1]-[5] is a feature that distinguishes quantum mechanics from classical physics. To describe it, let’s consider the question of whether it is possible to assign “pre-existing” outcomes to measurements of quantum observables which are merely revealed by measurement. If this were possible, it would amount to a description of quantum mechanics in terms of classical statistical mechanics. Assuming such a model, for any two different sets and of mutually compatible observables containing a given observable , it is reasonable to require that the value attached to the observable is a property of alone, and thus agrees in and . and are measurement contexts for , and the constraint on just described is called “context independence”. Can context-independent pre-assigned outcomes , or probabilistic combinations thereof, describe all of quantum mechanics?—This turns out not to be the case [1], [2], a fact which is often referred to as contextuality of quantum mechanics.
For quantum computation, contextuality is a resource. In quantum computation with magic states [6] and in measurement-based quantum computation (MBQC) [7], no quantum speedup can occur without it [8]–[10]; [11]–[13].
For the present work, the link between contextuality and quantum computation is the motivation to investigate the mathematical structures underlying contextuality. In this regard, Abramsky and coworkers have provided a sheaf-theoretic description of contextuality [4]. They have further identified cohomological obstructions to the existence of the classical models described above, so-called non-contextual hidden variable models [14], [15]. These methods, based on Čech cohomology, have a wide range of applicability, covering the Bell inequalities [2], Hardy’s model [16], and the Greenberger-Horne-Zeillinger setting [17].
Here, we provide a different cohomological framework for contextuality, involving group cohomology. It is designed to describe the form of contextuality required for the functioning of measurement-based quantum computation. The connection between contextuality and MBQC was first observed in the example of Mermin’s star [11], and subsequently extended to all MBQC on multi-qubit states [12], [13]. From the latter works it is known that all contextuality proofs relevant for MBQC are generalizations of Mermin’s star, in the sense that they invoke an algebraic contradiction to the existence of even a single non-contextual consistent value assignment. By its intended scope, the present framework only needs to apply to such kinds of proofs. But then there is an additional requirement: the cohomological framework in question needs to reproduce the original parity proofs in a topological guise. The reason for this requirement is that both the parity proofs and the classical side-processing required in every MBQC are based on the same linear relations (See Appendix A for a summary on contextuality in MBQC; also see [18]).
Next to the parity-based proofs of contextuality exemplified by Mermin’s square and star, we investigate a different type of contextuality proof which is based on symmetry. The central object in these proofs is the group of transformations that leave the set of observables involved in a parity-based contextuality proof invariant, up to phases. We show that nontrivial cohomology of the symmetry group implies contextuality. Furthermore, the parity-based and the symmetry-based contextuality proofs are related. Every symmetry-based proof implies a parity-based proof.
To summarize, we examine proofs of contextuality of quantum mechanics that have two attributes. They can either be parity-based or symmetry-based, and be state-independent or state-dependent. There are thus four combinations, and for each of these types of proofs we present a topological formulation. The parity-based contextuality proofs are discussed in Section 4 and the symmetry-based proofs in Section 5.
2 First example
To illustrate what “reproducing the original parity proofs in topological guise” means, we consider as a first example Mermin’s square [3] (also see [19]), one of the simplest proofs of contextuality of quantum mechanics. Mermin’s square, depicted in Fig. 1a, demonstrates that in Hilbert spaces of dimension it is impossible to consistently assign pre-existing values to all quantum mechanical observables.
Each row and each column of the square represents a measurement context, consisting of commuting observables. Furthermore, the observables in each context multiply to . For example, in the bottom row in Fig. 1a, we have , and in the right column . Now assume the nine Pauli observables in the square have pre-existing context-independent outcomes , with (the eigenvalues of the Pauli observables are ). Then, the product relations among the observables translate into constraints among the consistent value assignments. Continuing with the above-stated relations, we obtain the constraints , and . It is convenient to express these relations in terms of the value assignments rather than the measured eigenvalues . This leads to a system of linear equations,
[TABLE]
No assignment can satisfy these relations. To see this, add the above equations mod 2, and observe that each value appears twice on the left hand side. This results in the contradiction 0 = 1.
We now reproduce this contradiction in a topological fashion. For this purpose, the six observables are regarded as labeling the edges in a tessellation of a torus; See Fig. 1b. The value assignment is now a 1-cochain. Denote by any of the six elementary faces of the surface, such that , for three edges , , . Then there is a binary-valued function defined on the faces such that . As before, these product constraints among (commuting) observables induce constraints among the corresponding values, namely . By dialing through the six faces , we reproduce the six constraints of Eq. (1).
These constraints have a topological interpretation. Namely, is a 2-cochain, and, for any consistent context-independent value assignment it holds that
[TABLE]
Therein, the coboundary operator and the addition is . We can now show that for the present function , which evaluates to 0 on 5 faces and to 1 on one face, no consistent value assignment exists. To this end, we integrate over the whole surface which is a 2-cycle, . By Stokes’ theorem,
[TABLE]
where all integration is mod 2. In chain/cochain notation, this reads . This is the same contradiction as above in Eq. (1), but in cohomological form. As we show in Section 4 of this paper, all parity proofs consisting of a set of conflicting linear constraints of the form Eq. (1) can be given a similar cohomological interpretation.
To conclude this section, we remark that the above topological version of Mermin’s square, in its mathematical structure, resembles a certain aspect of electromagnetism [20]. First, consider the vector calculus question of whether a given vector field B can be written as the curl of some vector potential A, i.e., . This possibility is ruled out by the existence of a closed surface for which . Here, A is a 1-cochain (1-form) and B is a 2-cochain (2-form). They are the counterparts of the value assignment and the function , respectively. Now let B be a magnetic field. The statement for some closed surface —the counterpart of a contextuality proof —would indicate the presence of magnetic monopoles. However, in contrast to contextuality [21], magnetic monopoles—while being a theoretical possibility—have to date not been experimentally observed [22].
3 Measurement and contextuality
In this section we define our measurement setting and notion of contextuality.
3.1 Observables
In this paper, we consider observables with a restriction on their eigenvalues. Specifically, the eigenvalues are all of the form , where , for some , and . For , such observables are in general not Hermitian operators. However, that doesn’t matter. We may look at the measurement of these observables in two equivalent ways. (i) The observables are unitary, and their eigenvalues can thus be found by phase estimation. Further, due to the special form of the eigenvalues, phase estimation is exact. (ii) If , with all , one may instead measure , which is Hermitian and has the same eigenspaces as .—We note that non-Hermitian observables have found use in Bell inequalities with more than two outcomes per party [23], and also in contextuality proofs [24], [25].
Out of the set of observables , we identify an indexed set over a set . Every observable is related to an element from this indexed set by a phase for some . That is, is of the form
[TABLE]
For example can be taken to be all of the Pauli observables and corresponds to the set of Pauli observables up to a phase. The set has more structure which comes from the multiplicative structure of : We require that the product of two operators and belongs to if they commute, . For commuting operators the product will correspond to an operator up to a phase. We write for this unique element in . The operators satisfy the relation
[TABLE]
The function takes values in . To see this, consider the simultaneous eigenvalues of the operators , , . With Eq. (4) it holds that , and . Thus , as stated.
For any triple of observables satisfying the commutativity condition , the simultaneous eigenvalues can be measured. While individually random, the measurement outcomes are strictly correlated, . These correlations, which are predicted by quantum mechanics and are verifiable by experiment, form the basis of Mermin’s state-independent contextuality proofs [3]. The function is thus a central object in present discussion, summing up the physical properties of .
3.2 Definition of contextuality
We now define the notion of a non-contextual hidden-variable model (ncHVM) with definite value assignments. First, a measurement context is a commuting set . The set of all measurement contexts is denoted by .
Definition 1
Consider a quantum state and a set of observables grouping into contexts of simultaneously measurable observables. A non-contextual hidden variable model consists of a probability distributon over a set of internal states and a set of value assignment functions that meet the following criteria.
- (i)
Each is consistent with quantum mechanics: for any set of commuting observables there exists a quantum state such that
[TABLE]
- (ii)
The distribution satisfies
[TABLE]
Condition (i) in Definition 1 means that for every internal state of the non-contextual HVM the corresponding value assignment is consistent across measurement contexts.
We say that a physical setting is contextual if it cannot be described by any ncHVM .
Lemma 1
For any triple of simultaneously measurable observables and any internal state of an ncHVM it holds that
[TABLE]
The relation Eq. (7) was first used in [3] to rule out the existence of deterministic value assignments for Mermin’s square and star. In the same capacity it is also used in the present discussion.
Proof of Lemma 1. Consider a set of observables such that . This set qualifies as a possible in the sense of point (i) of Def. 1. Therefore, for any there exists a quantum state such that , , Furthermore, . By comparison, , which proves Eq. (7).
4 Parity-based contextuality proofs
The example of Section 2 is not special. As we show here, every parity-based contextuality proof—consisting of a set of conflicting linear constraints on the value assignments as in Eq. (1)—can be given a cohomological formulation. The main result of this section is Theorem 1.
4.1 The chain complex
We have two assumptions on the set of operators :
is closed under products of commuting operators i.e., if for then . 2. 2.
contains the identity operator.
Let denote the map given by
[TABLE]
with the index set introduced in Eq. (3). The set has more structure coming from Eq. (4). We say two elements commute if the corresponding operators commute . Given two commuting elements we define the sum to be the unique element which satisfies , cf. Eq. (4). We assume that there is an element in denoted by [math] corresponding to the identity operator in . Under this addition operation every maximal subset of commuting elements in has the structure of an abelian group.
Let us define the chain complex . A standard reference for chain complexes is [26]. It will suffice to describe this complex up to dimension three, i.e., . The geometric picture is as follows. The space we consider consists of a single vertex ([math]-cell). It has an edge (-cell) for each element of the set whose both boundary points attached to the single vertex. A face (–cell) is attached for every product relation among commuting operators. The set of faces is thus given by
[TABLE]
Thus, every face is bounded by three edges, namely , and .
Volumes (-cells) are constructed from triples of commuting observables (see Fig. 2 for an illustration). The set of volumes is
[TABLE]
Now comes the description of the chains:
since there is a single vertex. 2. 2.
, i.e., the elements of are linear combinations
[TABLE]
In other words, is freely generated as a -module by , where . 3. 3.
is freely generated as a -module by the pairs , where . 4. 4.
is freely generated as a -module by the triples , where .
In summary are freely generated by as -modules. We stop at dimension three although the definition can be continued for higher dimensions analogously, see [28]. The differentials in the complex
[TABLE]
are defined by
[TABLE]
Here the general pattern is as follows
[TABLE]
The homology groups of are defined by
[TABLE]
The dual notion of cochains gives a cochain complex
[TABLE]
where consists of -module maps . The differential is defined by where .
4.2 is a 2-cocycle
We may now formally extend the function introduced in Eq. (4) from to all of via the linear relations , , for all , . The function is thus a 2-cochain, .
The function is constrained in the following way. Consider three commuting elements , and expand the observable in two ways,
[TABLE]
and
[TABLE]
Comparing the two expressions, we find that
[TABLE]
whenever , , and .
The four faces , , , , with appropriate orientation (hence sign), bound a volume , i.e.,
[TABLE]
Geometrically, the situation looks as displayed in Fig. 2. We can follow the convention that denotes a face in the geometric sense and denotes an element of the chain complex. So . Therefore, with Eq. (11),
[TABLE]
Applying this relation to all volumes , we obtain
[TABLE]
Finally, there is an equivalence relation among the functions . To see this, recall the map which is defined by . There is a certain freedom in this definition which does not affect the commutation relations of the operators. Consider the following re-parametrization
[TABLE]
where . Then if and only if . From the perspective of contextuality, it does not matter which map we use to define the observables . Contextuality cannot be defined away by rephasing. However, the function is affected by the transformation Eq. (13). Namely, changing from to results in
[TABLE]
Therein, all addition is . The functions are thus subject to a restriction Eq. (12) and an identification Eq. (14). The various possible functions thus fall into equivalence classes , and hence .
4.3 Cohomological formulation of parity-based contextuality proofs
The function relates to the question of existence of non-contextual HVMs. We have the following result. First, a non-contextual value assignment , is such that . Again, by linearity, we can extend the assignment from to all of , and is thus a 1-cochain. We have the following relation.
Lemma 2
For every consistent non-contextual value assignment it holds that
[TABLE]
Proof of Lemma 2. Evaluating Eq. (15) on any given face reads
[TABLE]
As a consequence of Eq. (7), , for all and all . Now, with Lemma 1, setting and in Eq. (7), it holds that . This is precisely what Eq. (16) requires.
Theorem 1
Given set of observables, if then exhibits state-independent contextuality.
Proof of Theorem 1. If there were a value assignment it would satisfy . This means that is a boundary: . Hence .
Example: Mermin’s star. In addition to Mermin’s square, which we already discussed in Section 2, we now provide Mermin’s star [3] as a further example. Mermin’s star comes both in a state-independent and a state-dependent version, and is thus best suited as a running example for all topological constructions presented in this paper.
Here we consider the state-independent version; See Fig. 3. Denote by the surface displayed in Fig. 3c, consisting of the five smaller surfaces ,.., each corresponding to a measurement context in Fig. 3a. Each of the surfaces may be split up into two elementary faces; See Fig. 3b. satisfies . Since , we have , and for the other four measurement contexts it holds that . Hence, . If for some 1-cochain , then . Contradiction. Hence, . Then, by Theorem 1, Mermin’s star exhibits state-independent contextuality, in accordance with the original proof [3].
4.4 Squaring the star
It tuns out that, from the cohomological perspective developed above, Mermin’s square and star are equivalent contextuality proofs. Denote by the complex induced by the set , the Pauli observables on 3 qubits. Both Mermin’s square and star embed into it. The star provides a closed surface and the square provides a closed surface , such that and . Both facts thus equally demonstrate that .
What makes the star and the square equivalent is that there is a volume such that
[TABLE]
The surfaces and representing the respective contextuality proofs are elements of the same homology class in ; and therefore for any 2-cocycle .
The volume of Eq. (17) is depicted in Fig. 4a. The surfaces and are shown in Fig. 4b. They are obtained from another by adding the boundary . The Mermin square resulting from this procedure is locally rotated w.r.t. the standard convention, namely
[TABLE]
4.5 State-dependent parity proofs
Mermin’s star—whose state-independent version was discussed in Section 4.3—also exists in a state-dependent version [3]. We use it as an initial example, to illustrate the adaption of the topological argument to the state-dependent case and to motivate the definitions Eq. (18) and Def. 2 below. The state-dependent Mermin star contains a special set of observables and a special state, the Greenberger-Horne-Zeilinger state . The latter is a simultaneous eigenstate of the observables in , with eigenvalues , respectively. There is thus a value assignment , . From the perspective of non-contextual hidden variable models, the question is whether the value assignment can be extended in a consistent fashion to the local observables and .
Adapting the topological state-independent argument, we now demonstrate that this is not the case. We choose the mapping such that , and consider the surface displayed in Fig. 5b. For any consistent value assignment we thus have . On the other hand, , for . Thus, assuming the existence of a consistent value assignment , with (cf. Lemma 2) and with Stokes’ theorem, we arrive at the following contradiction (addition ):
[TABLE]
Hence our assumption that a consistent value assignment exists must be wrong.
We now turn to the general state-dependent scenario. Any state-dependent contextuality proof singles out a subset of observables of which a special state is an eigenstate. Namely,
[TABLE]
The set may or may not be a context. It is required of that the observables therein have at least one joint eigenstate, , but it is not required of them that they commute.
We want to integrate this extra bit of information into our topological description. By the definition of and Eq. (18), the set has the property that whenever for the product also lies in . We need this condition to be able to construct a subcomplex of . The corresponding labels determine a subset of edges and a subcomplex whose definition is analogous to .
Let us define via Eq. (18), i.e.,
[TABLE]
We can regard as an element of by extending it linearly. A consistent value assignment in the state-dependent case has to be compatible with the eigenvalues on the given state. This suggests the following definition.
Definition 2
A state-dependent consistent value assignment is a function that satisfies
[TABLE]
for all commuting , and its restriction to coincides with .
According to Eq. (20) only the commuting labels which are not contained in matters. Geometrically we can remove the edges in by contracting them. For the example of Mermin’s star, this process is depicted in Fig. 5. On the algebraic side, the chain complex of the contracted space is described by the relative complex defined by the quotient
[TABLE]
In this quotient edges, the faces, and volumes which come from are removed. Therefore we can think of this complex as having edges in the complement of the set . More explicitly, a -chain in this complex can be identified as a sum
[TABLE]
similarly -chains are linear combinations of commuting elements not contained in . We refer to the boundary operator of as the relative boundary operator and denote it by to distinguish it from . The boundary operator is the same as except that the edges, faces or volumes corresponding to are removed. For example, in Mermin’s star of Fig. 5b, , whereas . In general the relative boundary of a -chain is the sum of the edges in which lie in .
The relation between the chain complexes we defined so far can be expressed as a short exact sequence
[TABLE]
and the corresponding short exact sequence of cochain complexes is
[TABLE]
Note that can be characterized as cochains in whose restriction to vanishes. We will interpret Def. 2 using the cochain complex . In order to do this must be modified so that it vanishes on all faces whose boundary is in . We will denote the modified function by , and show that it is a cocycle in . We define
[TABLE]
where is regarded as a function by defining it to be zero on .
Theorem 2
If in then the pair exhibits state dependent contextuality.
Proof of Theorem 2. Given a -chain with boundary
[TABLE]
our definition yields
[TABLE]
Note that vanishes on faces whose boundary is in . To see this let be two commuting elements. Then,
[TABLE]
Therein, the first line is the definition of , Eq. (21). The second line follows by Lemma 2, and the third line by the second item of Def. 2. As a result, is an element of . Moreover it is a cocyle since . The remainder of the proof proceeds as in Theorem 1. There is a -cochain which satisfies Def. 2 if and only if the cohomology class vanishes.
Finally, we return to our initial example of the state-dependent Mermin star, and explain it in terms of the relative cocycle . Although the new argument is almost exactly the same as the former (which used and ), we give it here in order to invoke in an example the above-introduced notions of and . The chain complex corresponding to the state-dependent Mermin star has four elementary faces shown in Fig. 5c. evaluates to 1 on one of those faces, and to 0 on the other three. Thus, for the surface consisting of these four elementary faces, . We further have .
Now assume that a consistent non-contextual value assignment exists, . Then, . Contradiction.
5 Symmetry-based proofs of contextuality
The contextuality proofs in this section are based on invariance transformations. They lead the assumption of the existence of non-contextual value assignments into an algebraic contradiction, as did the parity-based proof encountered before. The new ingredient of these proofs is symmetry, and its representation in terms of group cohomology.
The main results of this section are Theorems 3, 4, 5 and 6 relating contextuality to the cohomology of the symmetry group. Also, we establish a relation between symmetry-based contextuality proofs and the parity-based proofs of Section 4.3; see Corollary 1.
5.1 First example based on Mermin’s square
To illustrate the concept of contextuality proofs based on a symmetry of a set of observables, we return to our earlier example of Mermin’s square. We find that it is invariant under certain symmetry transformations, for example the exchange of the two qubits, a Hadamard gate on qubit 1 or 2, or the CNOT gate between qubits 1 and 2; See Fig. 6. The square is mapped to itself under these transformations, with the observables in and the contexts being permuted, and observables possibly flipping signs. Consider, in particular, the transformation of the square under the Hadamard gate . In this case, the Pauli observable changes its sign under conjugation, whereas all the other observables in the square map to one another without incurring sign changes. As we discuss now, a contextuality proof can be extracted from this transformation behaviour. This proof is of a different kind than the earlier parity proof, since the parity proof does not invoke any symmetry transformation.
For this example, is
[TABLE]
and is the corresponding index set. The Hadamard gate on the first qubit is in the symmetry group for Mermin’s square, i.e. it maps the set to itself. For example, , , etc. The latter minus sign is important for the proof.
Assume that a consistent non-contextual value assignment exists. Then, from it, an new value assignment can be constructed that is obtained from by application of the Hadamard gate . Namely,
[TABLE]
We now consider the quantity
[TABLE]
With the above transformation , we observe that
[TABLE]
Now consider obtaining the value assignment from by flipping individual values. To preserve the product constraints in the square—which from the perspective of contextuality are the relevant information contained in —there must be an even number of flips in every row and column of the square, and hence
[TABLE]
This is in contradiction to Eq. (24), and our assumption that a consistent value assignment existed must be wrong.
5.2 The symmetries of
For our general setting, we consider transformations that satisfy the following two properties.
- (i)
The set is preserved under all transformations in . That is, there is an action of on and an induced action of on such that
[TABLE]
Therein, is the so-called phase function. It describes how observables in transform under the symmetry group .
- (ii)
Multiplication in all abelian subgroups of is preserved,
[TABLE]
for all pairs of commuting and all .
The conjugation by a Hadamard gate on qubit 1 described in Section 5.1, , is a special case of the transformations Eq. (26), (27).
The above transformations form a group under composition. Let denote the group of all symmetries of , that is all the transformations satisfying Eq. (26)-(27). An action of a group as defined above gives a group homomorphism
[TABLE]
which sends a group element to the transformation determined by Eq. (26).
Eq. (26) can be understood as a coordinate transformation. Commuting observables obey the same algebraic relations before and after the transformation. The constraint Eq. (27) enforces this property.
It is useful to restate Eq. (27) in terms of . It then reads
[TABLE]
Thus, for all , the function is the same before and after the transformation.
The phase function satisfies a further constraint resulting from the compatibility with the group structure of . Namely, we require that , for all and all .
To state the above two conditions in a convenient form, we develop further the underlying topological notions. The function assigns to a group element a function . Therefore we can think of as an element of , the group of -cochains which takes values in . We can also regard as an element of by identifying [math]-cochains with the coefficient group . To express the properties of in a compact way we introduce the more general object . These are -cochains on taking values in the group of -cochains in the complex . There are two types of differentials
[TABLE]
The vertical differential is induced by the differentials in , the horizontal differential is the group cohomology differential.
Lemma 3
For all phase functions defined through Eq. (26) it holds that
[TABLE]
The cocycle and the phase function , along with its “essence” introduced below, are the central physical objects in this paper. describes algebraic relations among commuting observables in , and describes the transformation behaviour of these observables under the symmetry group . Eq. (31) shows that these two quantities are linked.
Proof of Lemma 3. Regarding Eq. (31a), with the transformation rule Eq. (26) for observables, we find
[TABLE]
Alternatively, using group compatibility , we find
[TABLE]
Comparing the two expressions, we find the group compatibility condition
[TABLE]
which is Eq. (31a).
Eq. (31b) is a consequence of Eq. (29). We have
[TABLE]
and after rearranging it we obtain Eq. (31b).
The symmetry-based contextuality proofs discussed in this section will employ the phase function . Lemma 4 below is a first link between the phase function and consistent value assignments.
Lemma 4
If satisfies the consistency constraints Eq. (15) of Lemma 2, then so does defined for any given by
[TABLE]
This Lemma provides the formal justification for Eq. (22) in the contextuality proof of Section 5.1, namely the transformation of the value assignment into a new assignment under the Hadamard gate . Recall that all addition involving the phase function is .
Proof of Lemma 4. With Eqs. (33), (31b) and (15) we have
[TABLE]
Thus, the same constraints Eq. (15) satisfied by are also satisfied by .
5.3 The general state-independent case
Here we generalize the symmetry-based proof for Mermin’s square given in Section 5.1 to general sets of observables with a sufficiently large symmetry group . To begin, let’s analyze the inner workings of that proof.
First, consider the sum of value assignments. In cochain notation it reads , for some 1-chain (in the above case, where ). In order to permit the comparison of Eq. (24), i.e., in order to have the same summation on the lhs and rhs, the transformation ( in the proof of Section 5.1), needs to satisfy
[TABLE]
Further, in order to have definite values on either side of Eq. (25), needs to be a sum of constraints. In topological notation, we thus require that
[TABLE]
for some .
Finally, in order to have disagreement between the comparisons of Eq. (24) and (25), we must require that
[TABLE]
The conditions Eq. (34) - (36) are the central ingredients for the symmetry-based proofs. This leads us to the following result.
Lemma 5
Given a set of observables and the corresponding symmetry group , if there exist a and an such that and then has state-independent contextuality.
In addition to the above argument, we now give a formal proof for this Lemma.
Proof of Lemma 5. Eq. (31b) implies that
[TABLE]
Now under the assumption that there exists a value assignment satisfying Eq. (15) and there exists and such that this equation becomes
[TABLE]
Therefore if we get a contradiction.
Example: decorated Mermin star. We present a symmetry-based proof for the “decorated Mermin star”, depicted in Fig. 7a, based on the symmetry transformation
[TABLE]
where . We call this version of Mermin’s star “decorated”, because of the additional observable which is not included in the original star, but automatically included in the corresponding setting derived from a complex (cf. the first property of described in Section 4.1). This additional observable is of importance for the symmetry-based proof.
We show that for the two conditions of Lemma 5, namely such that and are met. Choose , with
[TABLE]
See Fig. 7b for illustration. It is now easily verified that . Furthermore, since , it holds that . For all other edges displayed in Fig. 7b, it holds that . Finally, since , it follows that . The conditions of Lemma 5 are thus met, and the decorated Mermin star is contextual.
5.4 Topological formulation
We now reformulate Lemma 5 in terms of cohomology groups, which are invariant objects in topology. The result is Theorem 3. To this end, we investigate the effect of the transformations Eq. (13) on . Changing from the map of Eq. (8) to induces the change
[TABLE]
The cohomological interpretation of this equation is
[TABLE]
The change of the map has no effect on contextuality, as was demonstrated in Section 4.2. The phase functions thus group into equivalence classes
[TABLE]
Together with Eq. (31a), this implies that .
To make contact with Lemma 5, we now restrict the 1-chains of on which the phase functions are evaluated. The boundaries are contained in as a subgroup. We can write this as a short exact sequence
[TABLE]
Now taking the duals of each group in this sequence gives an other short exact sequence. That is applying to each group in the above exact sequence gives
[TABLE]
where and . More explicitly, consists of -linear maps and is the set of 1-cocycles, i.e., the set of 1-cochains that vanish on boundaries,
[TABLE]
Let denote the composition of with the map in the short exact sequence in (39).
We still have the constraint
[TABLE]
and re-parametrizing by has the effect of
[TABLE]
and therefore
[TABLE]
We then have the following topological reformulation of Lemma 5.
Lemma 6
For a given set of observables and corresponding symmetry group , if then exhibits state-independent contextuality.
Proof of Lemma 6. The elements of are of the form for some –chain . By Lemma 3 we have
[TABLE]
As shown in the proof of Lemma 5 if there is a value assignment , that is , then
[TABLE]
where by definition of the horizontal differential. In other words is the coboundary of with respect to the group cohomology differential. So existence of a value assignment implies that .
We proceed to establish a further reformulation of Lemma 5, Theorem 3 below. It makes explicit the structure of the symmetry group , which is of relevance for MBQC. Namely, the symmetry group has a subgroup which fixes the edges. That is for all . We now make two observations:
(i) is normal in . Hence the set of equivalence classes forms a group .
(ii) A symmetry-based contextuality proof according to Lemma 5 works for a group element if and only if it works for any , with . That is, symmetry-based contextuality proofs are properties of equivalence classes , or, equivalently, of elements .
A proof of statement (ii) is as follows. We verify that the conditions of Lemma 5 are met for the pair if and only if they are met for the pair , with . We observe that , for all and all . Thus, first, .
Furthermore, by Eq. (31b) and since , it holds that . Then, by group compatibility Eq. (32), . Thus, second, .
Let denote the quotient map and be a section of i.e. for all . We define to be the composition of with . Then the observation that for all can be written as
[TABLE]
where . Moreover, this observation combined with Eq. (31a) in Lemma 3 implies that is a cocycle.
Theorem 3
For a given set of observables and corresponding symmetry group , if then exhibits state-independent contextuality.
This is our final result on symmetry-based contextuality proofs for the state-independent case.
Proof of Theorem 3. By Lemma 6 we need to show if and only if . By definition of , its class maps to under the map
[TABLE]
induced by the homomorphism . That is, implies . For the converse assume is a coboundary:
[TABLE]
for some . Since for and by definition of we have
[TABLE]
Therefore in . In other words implies .
5.5 Relation between parity-based and symmetry-based proofs
We have so far found two topological methods to prove Kochen-Specker theorems in the state-independent case, one involving the second cohomology group in a chain complex and the other involving the first cohomology group of a symmetry group . In this section we show that these proofs are related. It turns out that the symmetry-based proofs are at most as strong as the parity proofs. A proof of the former kind always implies a proof of the latter kind.
Corollary 1
Every symmetry-based proof of contextuality implies a parity-based proof.
Proof of Corollary 1. What we actually proved in Lemma 6 is implies . The other way around, implies .
5.6 Contextuality and the group extension problem
The group extension problem is concerned with the following question: “Given two groups and , with an action of on , what are the groups such that and ?”. Any such group is called an extension of and , which is expressed as a short exact sequence
[TABLE]
The simplest way to compose the groups and is via the semi-direct product, , but often there are additional possibilities. A semi-direct product is a twisted version of the direct product i.e. when multiplying two elements
[TABLE]
on the second factor is changed by an automorphism which depends on .
For example, the quaternion group has a normal subgroup and a quotient , but , which can be seen by counting the elements of order two.
The structure of the group extension has implications on the detection of contextuality by cohomology groups, as we now explain. The exact sequence (39) gives a short exact sequence of cochain complexes
[TABLE]
which gives long exact sequence of cohomology groups
[TABLE]
see [32, Proposition 6.1]. In general is defined by lifting the cocycle in to an element of which we denote by , and then applying the (group cohomology) differential . Then the coboundary is in . Its image under is zero since . Therefore actually lies in . The map sends to . Now let us describe the class . Recall that is defined by the composition . As the lift of this class we can take defined by the composition . Then . Therefore we have
[TABLE]
Theorem 4
For a given set of observables and corresponding symmetry group , if then exhibits state-independent contextuality.
Proof of Theorem 4. If then the class which maps to it cannot be zero. Now Theorem 3 implies that exhibits state-independent contextuality.
The gist of the above Theorems 1 – 4 is thus the chain of implications
[TABLE]
Thus, , , are successively stronger contextuality witnesses.
We conclude this section by showing that the weakest of these witnesses, , is indeed strictly weaker than . We demonstrate this by example. First, the following observation is helpful.
Lemma 7
If then .
Remark: Lemma 7 can be strengthened to an “if and only if” if the symmetry group is large enough. See Lemma 10 in Appendix C.
Proof of Lemma 7. The proof essentially follows from Eq. (43). We can choose the section to be a group homomorphism since splits as a semi-direct product. Then induces a map of chain complexes. In this case we have . By Eq. (43) we have .
Thus, if we do not have any hope for detecting contextuality by the cohomology class . We use this observation in the example of the decorated Mermin star, discussed at the end of Section 5.3. Consider as the symmetry group the group generated by (cf. Eq. (37)) and the set of all 3-qubit Pauli operators, . The Pauli operators form the normal subgroup , and . Note that , and
[TABLE]
We have in particular that , and thus . This means by Lemma 7, and the symmetry group under consideration does not provide a contextuality proof via Theorem 4. Yet, since , and a contextuality proof is provided by Theorem 3.
5.7 State-dependent contextuality proofs based on symmetry
State-dependent, symmetry-based proofs of contextuality have previously been constructed by Spekkens, Edwards and Coecke [29] and by J. Lawrence [30], for GHZ-scenarios. Here we describe general such contextuality proofs, and relate them to group cohomology.
The symmetry group of the state-independent case preserves and . It is now replaced by a subgroup which preserves , , and . For any that preserves the action on is as follows. The resource state is an eigenstate of any , , with eigenvalue . Thus, , for all . By Eq. (26), under the transformation the expectation value transforms as . Hence, the update rule for the values is
[TABLE]
for all such that and all . Now the extra condition on the subgroup is that , for all . Thus, in topological notation,
[TABLE]
It is useful to illustrate these symmetry constraints with the example of the state-dependent Mermin star; See Fig. 8. We consider the transformation that has a unitary projective representation , which acts on the observables in by conjugation. It preserves and as we have seen before, and it also preserves . But it does not preserve . For example, since , it holds that , whereas . Likewise, but . The values of and are thus flipped, while the values and remain unchanged. Therefore, .
However, we can find a related transformation , defined via . Namely, the extra operation flips and back, and leaves and unaffected. The action of also preserves , and . In total, preserves , , and , and is thus in the symmetry group .
We will formulate symmetry based state-dependent contextuality proofs using the symmetry group and the relative complex . The symmetry group preserves by definition. It acts on the chain complex by permuting the edges, faces, and volumes in each dimension. There is an induced action on the quotient . Geometrically we can think of this action as the permutation of the cells of the contracted space. The action on the chains gives an action on the cochains. By replacing and in Diag. (30) by and we consider the cochain complex with horizontal and vertical differentials. As before is induced by the group cohomology differential, and is induced by the relative boundary operator. The counterpart of Lemma 5 for the state-dependent case is the following.
Lemma 8
Given a set of observables, a quantum state and the corresponding symmetry group , if there exists an and an such that and then has state-dependent contextuality.
Example: The prototypical state-dependent contextuality scenario is the state-dependent version of Mermins’s star [3], depicted in Fig. 8. In this case, the set is a context. The state is the Greenberger-Horne-Zeilinger (GHZ) state [17]. The symmetry group is generated by permutation of the three particles, the transformation , and the GHZ stabilizer . Choose and , with the labeling referring to Fig. 8. Indeed, . Furthermore, and for all other it holds that . Hence, . The two conditions in Lemma 8 are thus satisfied, and the state-dependent version of Mermin’s star is contextual.
Let’s verify this statement at the elementary level, similar to the symmetry-based contextuality proof for Mermin’s square in Section 5.1. Assume a consistent value assignment exists. Then, with all addition mod 2,
[TABLE]
Contradiction. Hence no consistent value assignment exists. Above, in lines 1 and 6 we have used that . In lines 2 and 5 we have used the consistency of value assignments, in line 3 Lemma 4, and in line 4 the above stated values for .
In analogy with Lemma 6 a cohomological formulation of Lemma 8 can be achieved. Let denote the boundaries in . Let denote the -cochains defined on the boundaries , and denote -cochains which vanish on the boundaries . With these definitions we have a short exact sequence
[TABLE]
a state-dependent version of the short exact sequence (39).
Lemma 9
If in then the pair with symmetry group exhibits state-dependent contextuality.
Proof of this lemma is the same as Lemma 6 after is replaced by .
We are now in the position to obtain the state-dependent versions of Theorems 3 and 4. Recall that is the normal subgroup of which preserves operators in up to a scalar. Consider the intersection and the quotient group . Let denote the composition of a section of the map with the restricted map . Arguing as in the proof of Theorem 3 we obtain the following.
Theorem 5
If in then the pair with symmetry group exhibits state-dependent contextuality.
Similarly as in Theorem 4 second cohomology groups play a role in state-dependent case. The long exact sequence associated to (45) gives a map .
Theorem 6
If in then the pair with symmetry group exhibits state-dependent contextuality.
6 Conclusion
In this work we have discussed two kinds of contextuality proofs, based on parity and on symmetry respectively. Both types of proofs come in two flavours, state-independent and state-dependent. For each of the four resulting cases, we have established that the obstruction to the existence of non-contextual hidden variable models is topological.
Regarding the parity-based proofs (as in Mermin’s square and star), algebraic relations among the observables involved are captured by a 2-cocyle living in a suitably defined chain complex , and is a witness of contextuality.
The symmetry-based proofs invoke transformations that leave the complex and product relations among commuting observables invariant. Again, nontrivial cohomology of any such group is an obstruction to the viability of a non-contextual hidden variable model for the given setting.
The purpose of studying the above contextuality proofs is their relation to quantum computation. Contextuality has previously been established as a necessary resource for quantum computation, in both the models of quantum computation with magic states (see [8]–[10]) and measurement-based quantum computation (MBQC) (see [11]–[13]). The type of contextuality considered here is precisely what shows up in MBQC. The study of the mathematical structure underlying such contextuality proofs may thus lead to novel insights into the foundations of quantum computation.
Acknowledgments.
CO acknowledges funding from NSERC. SDB acknowledges support from the ARC via the Centre of Excellence in Engineered Quantum Systems (EQuS), project number CE110001013. RR is supported by NSERC and Cifar, and is scholar of the Cifar Quantum Information Processing program.
Appendix A Contextuality in measurement-based quantum computation
In this appendix we review measurement-based quantum computation and the role of contextuality in it. This section is based on [7], [11], [12] and [13]. We assemble this material here is to provide the background and motivation for the cohomological framework of contextuality developed in the main text.
We emphasize one point in particular: The classical processing relations of MBQC for determining the computational output from the individual measurement outcomes, when spelled out for all values of the computational input, are precisely the equations that give contextuality proofs based on the impossibility of non-contextual value assignments [3]. See Section A.2.
A.1 Quantum computation by local measurement
Measurement-based quantum computation [7] is a scheme of universal quantum computation in which the process of computation is driven by local measurements as opposed to unitary gates. The measurements are applied to a suitable entangled state, such as a cluster state or graph state. The pattern of measurements encodes the algorithm implemented. For reviews of measurement-based quantum computation, see [33], [34], [35].
Each MBQC consists of (i) a resource state whose entanglement is consumed by the process of computation, (ii) the set of observables measured to drive the computation, and (iii) rules for the classical side-processing of measurement outcomes.
(i) Resource state. The standard choices for the resource state are cluster states or graph states, which are stabilizer states where the stabilizer generators have a particular geometric interpretation; See [7].
(ii) Measured observables. The standard choice for the local measured observables is
[TABLE]
for all qubits . Therein, the angles are a property of the quantum algorithm to be implemented, and the binary numbers depend on the classical input to the computation, as well as an offset determined at runtime. The classical input may e.g. be the argument of a function to be evaluated.
(iii) Classical side-processing. The need for classical side-processing in MBQC arises because quantum-mechanical measurement is inherently random. In fact, in the standard scheme [7] of MBQC, every individual local measurement is completely random. This has two consequences. First, the classical output is represented by certain correlations of measurement outcomes; only they can be non-random. Second, to keep the computation on track in the presence of randomness, measurement bases need to be adapted according to outcomes obtained in earlier measurements. This boils down to adjusting the parameters .
In summary, both the bitwise output and the choice of measurement bases, are functions of the measurement outcomes . In addition, q is also a function of the classical input . Remarkably, in standard MBQC these functional relations are all mod 2 linear,
[TABLE]
Therein, the binary matrix encodes the temporal order in a given MBQC. If then the measurement basis at location depends on the measurement outcome at location , hence the qubit at must be measured before the qubit at . Therefore, for the measurement events to have a partial (temporal) ordering, the matrix must be lower triangular w.r.t. a suitable labeling of the qubits.
A.2 MBQC and Mermin’s star
The role of contextuality for measurement-based quantum computation was first noted in the example of Mermin’s star [11]. Here, we review this example.
The state-dependent Mermin star was already discussed in Section 4.5. In the state-dependent version, one of the five contexts of the star is taken up by a quantum state, namely the Greenberger-Horne-Zeilinger (GHZ) state [17]. The four non-local observables in this context, , , , , are stabilizer operators for the GHZ-state. The other four contexts remain for measurement. They are labeled by the elements of the input group ; See Fig. 5a.
We now describe the objects (i) - (iii) specifying an MBQC with Mermin’s star.
- (i)
The resource state is .
- (ii)
The local measurable observables are
[TABLE]
- (iii)
There are three qubits, two bits of input, , and one bit of output. The temporal order is flat, . The classical side-processing relations Eq. (47) are in this case
[TABLE]
Looping through the possible values for , with Eqs. (49j) and (48), Eq. (49a) becomes four equations, one for each value of ,
[TABLE]
Therein, is the outcome of the measurement of an observable with eigenvalues only, in the measurement context defined by the input .
One may look at Eq. (50) from the quantum mechanical and the HVM angle, which we will do in turn. The GHZ-state satisfies the eigenvalue equations
[TABLE]
Further, since the observables , and pairwise commute and obey the relation , it holds that . With the first of the above eigenvalue equations, , we thus have with certainty. By the same argument, . Thus, the quantum mechanical prediction is that the computation described evaluates the function
[TABLE]
This is of significance from the following fundamental point of view. The classical control computer of MBQC by itself is only capable of performing mod 2 addition, cf. Eq. (47). Hence it is not classically universal. If supplemented with quantum resources—GHZ states and the capability to measure local Pauli observables , —it can execute OR-gates in addition, and thereby becomes classically universal. The computational power of the control computer is thus significantly boosted.
Let’s now look at Eq. (50) from the perspective of a non-contextual HVM with deterministic value assignments. Non-contextual HVMs with definite value assignments invoke assumption the additional assumption that the “pre-existing” values of measurement outcomes are independent of the measurement context,
[TABLE]
Can there be a consistent non-contextual assignment of values on the r.h.s. of Eq. (50)?—This is quickly ruled out. Substituting Eq. (52) into Eq. (50) and adding the four resulting equations mod 2 leads to the familiar contradiction . Hence a consistent non-contextual HVM value assignment does not exist.
We observe that the above statements about computational power and contextuality do not require the function to be precisely an OR-gate. The classical control computer is boosted to classical universality whenever the function is non-linear, i.e., if and only if . The same relation is an obstruction to the existence of an ncHVM. To summarize, is both a witness of contextuality and a guarantee for boosting the a priori very limited classical control computer to classical universality.
A.3 Computational output and contextuality
The points made in the last paragraph about the MBQC based on Mermin’s star generalize to all MBQCs that satisfy the classical processing relations Eq. (47); See [12], [13]. When Eq. (47a), which defines the MBQC output, is spelled out for all input values and combined with the ncHVM assumption Eq. (52), those very equations rule out the existence of a corresponding non-contextual HVM. Furthermore, it is the non-linearity of the outputted function (and hence the boost in classical computational power) that represents the obstruction to the existence of non-contextual HVMs.
Our running example of the MBQC based on Mermin’s star misses two aspects of the general case. First, it is temporally flat, i.e., measurement bases are not influenced by the outcomes of measurements on other qubits, and second, it is deterministic. Both of these constraints can be relaxed while keeping the relation with contextuality. We have the following result.
Theorem 7
[13]* Be an MBQC with classical processing relations Eq. (47) evaluating a function . Then, is contextual if it succeeds with an average probability , where is the Hamming distance of from the closest linear function.*
Remark: The lowest contextuality thresholds are reached for bent functions. For even and bent, it holds that [36], and therefore the contextuality threshold for the average success probability approaches for large . An MBQC can thus be contextual even if its output is very close to completely random.
Appendix B Chain complexes
Throughout the text we work with modules over the ring . A chain complex of modules is a sequence
[TABLE]
such that the composition of any two successive maps gives zero i.e. . Homology groups of the chain complex are defined by
[TABLE]
A map of chain complexes is a sequence of module maps which commutes with the differential . Such a map induces a map in homology .
Dually, we can consider a cochain complex obtained from a chain complex. This is a sequence
[TABLE]
where consists of module maps , and is defined by for all in . Similarly we can talk about cohomology groups
[TABLE]
A map of chain complexes as above induces a map in cohomology in the reverse direction.
A simplicial complex with edges, faces, and volumes… naturally defines a chain complex. The modules , , , … in this complex consists of -linear combinations of labels representing vertices, edges, faces, volumes… Another source for a chain complex is group cohomology. Starting from a single vertex, one builds a space by glueing the boundary of an edge representing an element . The resulting space is a bouquet of circles where the circles are labelled by the elements of the group. Now continue to glue higher dimensional basic shapes which encode the structure of the group. For each pair of group elements glue a triangle whose edges are , , and . This process repeats for higher dimensional triangles which corresponds to an -tuple of group elements so that edges are products of these elements arranged in an organized way. The resulting space is called the classifying space of . The associated chain complex in dimension is a module which consists of -linear combinations of the representatives . The cochain complex consists of -linear combinations of set maps . It is a standard fact in group cohomology that it suffices to consider non-trivial -tuples i.e. for all . This is convenient for computational purposes.
In the text we introduce a complex constructed from commuting operators which imitates the construction of a classifying space. We will show that why is a chain complex i.e. . The proof is similar to the group cohomology case. Let be a basis element of . The -tuple consists of commuting elements in . Although our complex consists of dimensions we prove the result for all . Let us introduce the following maps , for , and . Then we can write . As a preliminary observation one checks that by definition for . Using this
[TABLE]
In the last sum we set hence . The first sum is indexed over and the second one is indexed over . Note that these sets are the same. Therefore two sums cancel each other when corresponding terms with different signs are matched together111As an example consider . In this case the first sum is and the second sum is . .
Appendix C A converse of Lemma 7
Lemma 7 has a converse if the symmetry group is large enough. We start with an observation which will lead to a structural relation between the symmetry group and the chain complex. Recall the definition of the sub-complex . In particular, this is an abelian group under addition. We define an action of on the set of operators by
[TABLE]
for each . Note that this gives a group action since . It also satisfies Eq. (29). Therefore this is a symmetry of the system. We can regard this symmetry as a group homomorphism
[TABLE]
which is in fact injective. We identify as a subgroup of .
In general given a symmetry group we defined as the subgroup which fixes each edge:
[TABLE]
Given a symmetry associated to the homomorphism in Eq. (28) the image of lies inside . That is the restriction of to gives a group homomorphism
[TABLE]
which sends to .
Lemma 10
Assume that is injective and is an isomorphism. Then splits as if and only if .
Proof of Lemma 10. If splits then is proved in Lemma 7. For the converse assume that that is there exists a such that
[TABLE]
Next, corresponding to the map we define a new map via , where is such that , for all and all . Under the assumption that , such an exists for all .
Now, the action of on the is
[TABLE]
where . We can now show that , namely
[TABLE]
Therein, in the third line we have used Eq. (53). Thus .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] S. Kochen and E. P. Specker, The problem of hidden variables in quantum mechanics , J. Math. Mech. 17, 59 (1967).
- 2[2] J. S. Bell, On the Einstein Podolsky Rosen Paradox , Physics 1, No. 3, 1995 (1964).
- 3[3] N. D. Mermin, Hidden variables and the two theorems of John Bell , Rev. Mod. Phys. 65 , 803 (1993).
- 4[4] Samson Abramsky and Adam Brandenburger, The Sheaf-Theoretic Structure Of Non-Locality and Contextuality , New J. Phys. 13 , 113036 (2011).
- 5[5] Adan Cabello, Simone Severini, Andreas Winter, Graph-Theoretic Approach to Quantum Correlations , Phys. Rev. Lett. 112 , 040401 (2014).
- 6[6] S. Bravyi and A. Kitaev, Universal quantum computation with ideal Clifford gates and noisy ancillas , Phys. Rev. A 71 , 022316 (2005).
- 7[7] R. Raussendorf and H.J. Briegel, A one-way quantum computer , Phys. Rev. Lett. 86 , 5188 (2001).
- 8[8] M. Howard, J.J. Wallman, V. Veitch, J. Emerson, Contextuality supplies the ‘magic’ for Quantum Computation , Nature (London) 510 , 351 (2014).
