Topological proofs of contextuality in quantum mechanics

Cihan Okay; Sam Roberts; Stephen D. Bartlett; Robert Raussendorf

arXiv:1701.01888·quant-ph·October 18, 2017

Topological proofs of contextuality in quantum mechanics

Cihan Okay, Sam Roberts, Stephen D. Bartlett, Robert Raussendorf

PDF

TL;DR

This paper introduces a cohomological framework to analyze quantum contextuality, unifying various proofs and highlighting its role as a resource in measurement-based quantum computation.

Contribution

It develops a topological approach to quantum contextuality applicable to both state-dependent and state-independent cases, encompassing parity and symmetry-based proofs.

Findings

01

Provides a unified topological framework for contextuality proofs

02

Applies to both state-dependent and state-independent contextuality

03

Highlights the role of topological methods in quantum computation

Abstract

We provide a cohomological framework for contextuality of quantum mechanics that is suited to describing contextuality as a resource in measurement-based quantum computation. This framework applies to the parity proofs first discussed by Mermin, as well as a different type of contextuality proofs based on symmetry transformations. The topological arguments presented can be used in the state-dependent and the state-independent case.

Equations223

\begin{array}[]{rcl}s(X_{1})+s(X_{2})+s(X_{1}X_{2})\mod 2&=&0,\\ s(Z_{2})+s(Z_{1})+s(Z_{1}Z_{2})\mod 2&=&0,\\ s(X_{1}Z_{2})+s(Z_{1}X_{2})+(Y_{1}Y_{2})\mod 2&=&0,\\ s(X_{1})+s(Z_{2})+s(X_{1}Z_{2})\mod 2&=&0,\\ s(Z_{1})+(X_{2})+s(Z_{1}X_{2})\mod 2&=&0,\\ s(X_{1}X_{2})+s(Z_{1}Z_{2})+s(Y_{1}Y_{2})\mod 2&=&1.\end{array}

\begin{array}[]{rcl}s(X_{1})+s(X_{2})+s(X_{1}X_{2})\mod 2&=&0,\\ s(Z_{2})+s(Z_{1})+s(Z_{1}Z_{2})\mod 2&=&0,\\ s(X_{1}Z_{2})+s(Z_{1}X_{2})+(Y_{1}Y_{2})\mod 2&=&0,\\ s(X_{1})+s(Z_{2})+s(X_{1}Z_{2})\mod 2&=&0,\\ s(Z_{1})+(X_{2})+s(Z_{1}X_{2})\mod 2&=&0,\\ s(X_{1}X_{2})+s(Z_{1}Z_{2})+s(Y_{1}Y_{2})\mod 2&=&1.\end{array}

d s = β .

d s = β .

1 = \int_{F} β = \int_{F} d s = \oint_{\partial F} s = \oint_{0} s = 0,

1 = \int_{F} β = \int_{F} d s = \oint_{\partial F} s = \oint_{0} s = 0,

O = {ω^{k} T_{a} ∣ a \in E, k \in Z_{d}} .

O = {ω^{k} T_{a} ∣ a \in E, k \in Z_{d}} .

T_{a + b} = ω^{β (a, b)} T_{a} T_{b}, \forall a, b \in E, s.th. [T_{a}, T_{b}] = 0.

T_{a + b} = ω^{β (a, b)} T_{a} T_{b}, \forall a, b \in E, s.th. [T_{a}, T_{b}] = 0.

A ∣ ψ ⟩ = λ_{ν} (A) ∣ ψ ⟩, \forall A \in M .

A ∣ ψ ⟩ = λ_{ν} (A) ∣ ψ ⟩, \forall A \in M .

tr (A ρ) = ν \in S \sum λ_{ν} (A) q_{ρ} (ν), \forall A \in O

tr (A ρ) = ν \in S \sum λ_{ν} (A) q_{ρ} (ν), \forall A \in O

λ_{ν} (A B) = λ_{ν} (A) λ_{ν} (B) .

λ_{ν} (A B) = λ_{ν} (A) λ_{ν} (B) .

η (a) = T_{a},

η (a) = T_{a},

F = {(a, b) \in E \times E ∣ [T_{a}, T_{b}] = 0} .

F = {(a, b) \in E \times E ∣ [T_{a}, T_{b}] = 0} .

V = {(a, b, c) \in E \times E \times E ∣ [T_{a}, T_{b}] = [T_{b}, T_{c}] = [T_{a}, T_{c}] = 0} .

V = {(a, b, c) \in E \times E \times E ∣ [T_{a}, T_{b}] = [T_{b}, T_{c}] = [T_{a}, T_{c}] = 0} .

a \in E \sum α_{a} [a] where α_{a} \in Z_{d} .

a \in E \sum α_{a} [a] where α_{a} \in Z_{d} .

C_{3} \to \partial C_{2} \to \partial C_{1} \to \partial C_{0}

C_{3} \to \partial C_{2} \to \partial C_{1} \to \partial C_{0}

\partial [a] = 0, \partial [a ∣ b] = [b] - [a + b] + [a], \partial [a ∣ b ∣ c] = [b ∣ c] - [a + b ∣ c] + [a ∣ b + c] - [a ∣ b] .

\partial [a] = 0, \partial [a ∣ b] = [b] - [a + b] + [a], \partial [a ∣ b ∣ c] = [b ∣ c] - [a + b ∣ c] + [a ∣ b + c] - [a ∣ b] .

\partial [a_{1} ∣ a_{2} ∣ \dots ∣ a_{n}] = [a_{2} ∣ a_{3} ∣ \dots ∣ a_{n}] + i = 1 \sum n - 1 (- 1)^{i} [a_{1} ∣ \dots ∣ a_{i} + a_{i + 1} ∣ \dots ∣ a_{n}] + (- 1)^{n} [a_{1} ∣ a_{2} ∣ \dots ∣ a_{n - 1}] .

\partial [a_{1} ∣ a_{2} ∣ \dots ∣ a_{n}] = [a_{2} ∣ a_{3} ∣ \dots ∣ a_{n}] + i = 1 \sum n - 1 (- 1)^{i} [a_{1} ∣ \dots ∣ a_{i} + a_{i + 1} ∣ \dots ∣ a_{n}] + (- 1)^{n} [a_{1} ∣ a_{2} ∣ \dots ∣ a_{n - 1}] .

H_{n} (C_{*}, Z_{d}) = \frac{ker ( \partial )}{im ( \partial )} .

H_{n} (C_{*}, Z_{d}) = \frac{ker ( \partial )}{im ( \partial )} .

C^{3} \leftarrow d C^{2} \leftarrow d C^{1} \leftarrow d C^{0}

C^{3} \leftarrow d C^{2} \leftarrow d C^{1} \leftarrow d C^{0}

\begin{array}[]{rcl}T_{a+b+c}&=&\displaystyle{T_{(a+b)+c}}\\ &=&\displaystyle{\omega^{\beta(a+b,c)}T_{a+b}T_{c}}\\ &=&\displaystyle{\omega^{\beta(a+b,c)+\beta(a,b)}T_{a}T_{b}T_{c}},\end{array}

\begin{array}[]{rcl}T_{a+b+c}&=&\displaystyle{T_{(a+b)+c}}\\ &=&\displaystyle{\omega^{\beta(a+b,c)}T_{a+b}T_{c}}\\ &=&\displaystyle{\omega^{\beta(a+b,c)+\beta(a,b)}T_{a}T_{b}T_{c}},\end{array}

\begin{array}[]{rcl}T_{a+b+c}&=&\displaystyle{T_{a+(b+c)}}\\ &=&\displaystyle{\omega^{\beta(a,b+c)}T_{a}T_{b+c}}\\ &=&\displaystyle{\omega^{\beta(a,b+c)+\beta(b,c)}T_{a}T_{b}T_{c}}.\end{array}

\begin{array}[]{rcl}T_{a+b+c}&=&\displaystyle{T_{a+(b+c)}}\\ &=&\displaystyle{\omega^{\beta(a,b+c)}T_{a}T_{b+c}}\\ &=&\displaystyle{\omega^{\beta(a,b+c)+\beta(b,c)}T_{a}T_{b}T_{c}}.\end{array}

β (a + b, c) + β (a, b) - β (a, b + c) - β (b, c) mod d = 0,

β (a + b, c) + β (a, b) - β (a, b + c) - β (b, c) mod d = 0,

\partial V = (a + b, c) + (a, b) - (a, b + c) - (b, c) .

\partial V = (a + b, c) + (a, b) - (a, b + c) - (b, c) .

\begin{array}[]{rcl}d\beta(V)=\beta(\partial V)&=&\beta((a+b,c)+(a,b)-(a,b+c)-(b,c))\\ &=&\beta(a+b,c)+\beta(a,b)-\beta(a,b+c)-\beta(b,c)\\ &=&0.\end{array}

\begin{array}[]{rcl}d\beta(V)=\beta(\partial V)&=&\beta((a+b,c)+(a,b)-(a,b+c)-(b,c))\\ &=&\beta(a+b,c)+\beta(a,b)-\beta(a,b+c)-\beta(b,c)\\ &=&0.\end{array}

d β \equiv 0.

d β \equiv 0.

η_{γ} (\cdot) = ω^{γ (\cdot)} η (\cdot),

η_{γ} (\cdot) = ω^{γ (\cdot)} η (\cdot),

\begin{array}[]{rcl}\beta(a,b)\longrightarrow\beta_{\gamma}(a,b)&=&\beta(a,b)-\gamma(a)-\gamma(b)+\gamma(a+b)\\ &=&\beta(a,b)-d\gamma(a,b).\end{array}

\begin{array}[]{rcl}\beta(a,b)\longrightarrow\beta_{\gamma}(a,b)&=&\beta(a,b)-\gamma(a)-\gamma(b)+\gamma(a+b)\\ &=&\beta(a,b)-d\gamma(a,b).\end{array}

d s = - β .

d s = - β .

s (a) + s (b) - s (a + b) = - β (a, b) .

s (a) + s (b) - s (a + b) = - β (a, b) .

F_{square} = F_{star} + \partial V .

F_{square} = F_{star} + \partial V .

0 = \int_{F} β = \int_{F} d s = \int_{\partial F} s = 1.

0 = \int_{F} β = \int_{F} d s = \int_{\partial F} s = 1.

O_{Ψ} := {O \in O ∣ \exists s_{O} \in Z_{d} such that O ∣Ψ ⟩ = ω^{s_{O}} ∣Ψ ⟩} .

O_{Ψ} := {O \in O ∣ \exists s_{O} \in Z_{d} such that O ∣Ψ ⟩ = ω^{s_{O}} ∣Ψ ⟩} .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Topological proofs of contextuality in quantum mechanics

Cihan Okay

Department of Mathematics, University of Western Ontario, London, Ontario, Canada

Sam Roberts

Centre for Engineered Quantum Systems, School of Physics, The University of Sydney, Sydney, NSW, Australia

Stephen D. Bartlett

Centre for Engineered Quantum Systems, School of Physics, The University of Sydney, Sydney, NSW, Australia

Robert Raussendorf

Department of Physics and Astronomy, University of British Columbia, Vancouver, BC, Canada

Abstract

We provide a cohomological framework for contextuality of quantum mechanics that is suited to describing contextuality as a resource in measurement-based quantum computation. This framework applies to the parity proofs first discussed by Mermin, as well as a different type of contextuality proofs based on symmetry transformations. The topological arguments presented can be used in the state-dependent and the state-independent case.

1 Introduction

Contextuality [1]-[5] is a feature that distinguishes quantum mechanics from classical physics. To describe it, let’s consider the question of whether it is possible to assign “pre-existing” outcomes to measurements of quantum observables which are merely revealed by measurement. If this were possible, it would amount to a description of quantum mechanics in terms of classical statistical mechanics. Assuming such a model, for any two different sets ${\cal{A}}$ and ${\cal{B}}$ of mutually compatible observables containing a given observable $A$ , it is reasonable to require that the value $\lambda(A)$ attached to the observable $A$ is a property of $A$ alone, and thus agrees in ${\cal{A}}$ and ${\cal{B}}$ . ${\cal{A}}$ and ${\cal{B}}$ are measurement contexts for $A$ , and the constraint on $\lambda(A)$ just described is called “context independence”. Can context-independent pre-assigned outcomes $\lambda$ , or probabilistic combinations thereof, describe all of quantum mechanics?—This turns out not to be the case [1], [2], a fact which is often referred to as contextuality of quantum mechanics.

For quantum computation, contextuality is a resource. In quantum computation with magic states [6] and in measurement-based quantum computation (MBQC) [7], no quantum speedup can occur without it [8]–[10]; [11]–[13].

For the present work, the link between contextuality and quantum computation is the motivation to investigate the mathematical structures underlying contextuality. In this regard, Abramsky and coworkers have provided a sheaf-theoretic description of contextuality [4]. They have further identified cohomological obstructions to the existence of the classical models described above, so-called non-contextual hidden variable models [14], [15]. These methods, based on Čech cohomology, have a wide range of applicability, covering the Bell inequalities [2], Hardy’s model [16], and the Greenberger-Horne-Zeillinger setting [17].

Here, we provide a different cohomological framework for contextuality, involving group cohomology. It is designed to describe the form of contextuality required for the functioning of measurement-based quantum computation. The connection between contextuality and MBQC was first observed in the example of Mermin’s star [11], and subsequently extended to all MBQC on multi-qubit states [12], [13]. From the latter works it is known that all contextuality proofs relevant for MBQC are generalizations of Mermin’s star, in the sense that they invoke an algebraic contradiction to the existence of even a single non-contextual consistent value assignment. By its intended scope, the present framework only needs to apply to such kinds of proofs. But then there is an additional requirement: the cohomological framework in question needs to reproduce the original parity proofs in a topological guise. The reason for this requirement is that both the parity proofs and the classical side-processing required in every MBQC are based on the same linear relations (See Appendix A for a summary on contextuality in MBQC; also see [18]).

Next to the parity-based proofs of contextuality exemplified by Mermin’s square and star, we investigate a different type of contextuality proof which is based on symmetry. The central object in these proofs is the group of transformations that leave the set of observables involved in a parity-based contextuality proof invariant, up to phases. We show that nontrivial cohomology of the symmetry group implies contextuality. Furthermore, the parity-based and the symmetry-based contextuality proofs are related. Every symmetry-based proof implies a parity-based proof.

To summarize, we examine proofs of contextuality of quantum mechanics that have two attributes. They can either be parity-based or symmetry-based, and be state-independent or state-dependent. There are thus four combinations, and for each of these types of proofs we present a topological formulation. The parity-based contextuality proofs are discussed in Section 4 and the symmetry-based proofs in Section 5.

2 First example

To illustrate what “reproducing the original parity proofs in topological guise” means, we consider as a first example Mermin’s square [3] (also see [19]), one of the simplest proofs of contextuality of quantum mechanics. Mermin’s square, depicted in Fig. 1a, demonstrates that in Hilbert spaces of dimension $\geq 4$ it is impossible to consistently assign pre-existing values to all quantum mechanical observables.

Each row and each column of the square represents a measurement context, consisting of commuting observables. Furthermore, the observables in each context multiply to $\pm I$ . For example, in the bottom row in Fig. 1a, we have $(X_{1}Z_{2})(Z_{1}X_{2})(Y_{1}Y_{2})=+I$ , and in the right column $(X_{1}X_{2})(Z_{1}Z_{2})(Y_{1}Y_{2})=-I$ . Now assume the nine Pauli observables $T_{a}$ in the square have pre-existing context-independent outcomes $\lambda(T_{a})=(-1)^{s(T_{a})}$ , with $s(T_{a})\in\mathbb{Z}_{2}$ (the eigenvalues of the Pauli observables are $\pm 1$ ). Then, the product relations among the observables translate into constraints among the consistent value assignments. Continuing with the above-stated relations, we obtain the constraints $\lambda(X_{1}Z_{2})\lambda(Z_{1}X_{2})\lambda(Y_{1}Y_{2})=1$ , and $\lambda(X_{1}X_{2})\lambda(Z_{1}Z_{2})\lambda(Y_{1}Y_{2})=-1$ . It is convenient to express these relations in terms of the value assignments $s(\cdot)$ rather than the measured eigenvalues $\lambda(\cdot)$ . This leads to a system of linear equations,

[TABLE]

No assignment $s$ can satisfy these relations. To see this, add the above equations mod 2, and observe that each value $s(T_{a})$ appears twice on the left hand side. This results in the contradiction 0 = 1.

We now reproduce this contradiction in a topological fashion. For this purpose, the six observables are regarded as labeling the edges in a tessellation of a torus; See Fig. 1b. The value assignment $s$ is now a 1-cochain. Denote by $f$ any of the six elementary faces of the surface, such that $\partial f=a+b+c$ , for three edges $a$ , $b$ , $c$ . Then there is a binary-valued function $\beta$ defined on the faces $f$ such that $T_{c}=(-1)^{\beta(a,b)}T_{a}T_{b}$ . As before, these product constraints among (commuting) observables induce constraints among the corresponding values, namely $s(a)+s(b)+s(c)\mod 2=\beta(f)$ . By dialing through the six faces $f$ , we reproduce the six constraints of Eq. (1).

These constraints have a topological interpretation. Namely, $\beta$ is a 2-cochain, and, for any consistent context-independent value assignment $s$ it holds that

[TABLE]

Therein, $d$ the coboundary operator and the addition is $\text{mod}\;2$ . We can now show that for the present function $\beta$ , which evaluates to 0 on 5 faces and to 1 on one face, no consistent value assignment $s$ exists. To this end, we integrate over the whole surface $F$ which is a 2-cycle, $\partial F=0$ . By Stokes’ theorem,

[TABLE]

where all integration is mod 2. In chain/cochain notation, this reads $1=\beta(F)=ds(F)=s(\partial F)=s(0)=0$ . This is the same contradiction as above in Eq. (1), but in cohomological form. As we show in Section 4 of this paper, all parity proofs consisting of a set of conflicting linear constraints of the form Eq. (1) can be given a similar cohomological interpretation.

To conclude this section, we remark that the above topological version of Mermin’s square, in its mathematical structure, resembles a certain aspect of electromagnetism [20]. First, consider the vector calculus question of whether a given vector field B can be written as the curl of some vector potential A, i.e., $\textbf{B}=\nabla\times\textbf{A}$ . This possibility is ruled out by the existence of a closed surface $F$ for which $\int_{F}d\textbf{F}\cdot\textbf{B}\neq 0$ . Here, A is a 1-cochain (1-form) and B is a 2-cochain (2-form). They are the counterparts of the value assignment $s$ and the function $\beta$ , respectively. Now let B be a magnetic field. The statement $\int_{F}d\textbf{F}\cdot\textbf{B}\neq 0$ for some closed surface $F$ —the counterpart of a contextuality proof $\beta(F)\neq 0$ —would indicate the presence of magnetic monopoles. However, in contrast to contextuality [21], magnetic monopoles—while being a theoretical possibility—have to date not been experimentally observed [22].

3 Measurement and contextuality

In this section we define our measurement setting and notion of contextuality.

3.1 Observables

In this paper, we consider observables with a restriction on their eigenvalues. Specifically, the eigenvalues are all of the form $\omega^{k}$ , where $\omega=e^{2\pi i/d}$ , for some $d\in\mathbb{Z}$ , and $k\in\mathbb{Z}_{d}$ . For $d>2$ , such observables are in general not Hermitian operators. However, that doesn’t matter. We may look at the measurement of these observables in two equivalent ways. (i) The observables are unitary, and their eigenvalues can thus be found by phase estimation. Further, due to the special form of the eigenvalues, phase estimation is exact. (ii) If $O=\sum_{i}\omega^{s_{i}}|i\rangle\langle i|$ , with all $s_{i}\in\mathbb{Z}_{d}$ , one may instead measure $\tilde{O}=\sum_{i}s_{i}|i\rangle\langle i|$ , which is Hermitian and has the same eigenspaces as $O$ .—We note that non-Hermitian observables have found use in Bell inequalities with more than two outcomes per party [23], and also in contextuality proofs [24], [25].

Out of the set of observables $\cal{O}$ , we identify an indexed set $\{T_{a},a\in E\}$ over a set $E$ . Every observable $O\in\cal{O}$ is related to an element $T_{a}$ from this indexed set by a phase $\omega^{k}$ for some $k$ . That is, $\cal{O}$ is of the form

[TABLE]

For example $\cal{O}$ can be taken to be all of the Pauli observables and $E$ corresponds to the set of Pauli observables up to a phase. The set $E$ has more structure which comes from the multiplicative structure of $\cal{O}$ : We require that the product of two operators $T_{a}$ and $T_{b}$ belongs to ${\cal O}$ if they commute, $[T_{a},T_{b}]=0$ . For commuting operators the product $T_{a}T_{b}$ will correspond to an operator $T_{c}$ up to a phase. We write $c=a+b$ for this unique element in $E$ . The operators $\{T_{a}\}_{a\in E}$ satisfy the relation

[TABLE]

The function $\beta$ takes values in $\mathbb{Z}_{d}$ . To see this, consider the simultaneous eigenvalues of the operators $T_{a}$ , $T_{b}$ , $T_{a+b}$ . With Eq. (4) it holds that $\omega^{k_{a+b}}=\omega^{\beta(a,b)+k_{a}+k_{b}}$ , and $k_{a+b},k_{a},k_{b}\in\mathbb{Z}_{d}$ . Thus $\beta(a,b)\in\mathbb{Z}_{d}$ , as stated.

For any triple $\{T_{a},T_{b},T_{a+b}\}$ of observables satisfying the commutativity condition $[T_{a},T_{b}]=0$ , the simultaneous eigenvalues can be measured. While individually random, the measurement outcomes are strictly correlated, $\lambda(a+b)/\lambda(a)\lambda(b)=\omega^{\beta(a,b)}$ . These correlations, which are predicted by quantum mechanics and are verifiable by experiment, form the basis of Mermin’s state-independent contextuality proofs [3]. The function $\beta$ is thus a central object in present discussion, summing up the physical properties of ${\cal{O}}$ .

3.2 Definition of contextuality

We now define the notion of a non-contextual hidden-variable model (ncHVM) with definite value assignments. First, a measurement context is a commuting set $M\subset{\cal{O}}$ . The set of all measurement contexts is denoted by ${\cal{M}}$ .

Definition 1

Consider a quantum state $\rho$ and a set ${\cal{O}}$ of observables grouping into contexts $M\in{\cal{M}}$ of simultaneously measurable observables. A non-contextual hidden variable model $(\mathcal{S},q_{\rho},\Lambda)$ consists of a probability distributon $q_{\rho}$ over a set $\mathcal{S}$ of internal states and a set $\Lambda=\{\lambda_{\nu}\}_{\nu\in\mathcal{S}}$ of value assignment functions $\lambda_{\nu}:{\cal{O}}\rightarrow\mathbb{C}$ that meet the following criteria.

(i)

Each $\lambda_{\nu}\in\Lambda$ is consistent with quantum mechanics: for any set $M\in{\cal{O}}$ of commuting observables there exists a quantum state $|{\psi}\rangle$ such that

[TABLE]

(ii)

The distribution $q_{\rho}$ satisfies

[TABLE]

Condition (i) in Definition 1 means that for every internal state $\nu$ of the non-contextual HVM the corresponding value assignment $\lambda_{\nu}$ is consistent across measurement contexts.

We say that a physical setting $(\rho,{\cal{O}})$ is contextual if it cannot be described by any ncHVM $(\mathcal{S},q_{\rho},\Lambda)$ .

Lemma 1

For any triple $A,B,AB\in{\cal{O}}$ of simultaneously measurable observables and any internal state $\nu\in{\cal{S}}$ of an ncHVM $(\mathcal{S},q_{\rho},\Lambda)$ it holds that

[TABLE]

The relation Eq. (7) was first used in [3] to rule out the existence of deterministic value assignments for Mermin’s square and star. In the same capacity it is also used in the present discussion.

Proof of Lemma 1. Consider a set $M=\{A,B,AB\}\subset{\cal{O}}$ of observables such that $[A,B]=0$ . This set qualifies as a possible $M$ in the sense of point (i) of Def. 1. Therefore, for any $\nu\in\Lambda$ there exists a quantum state $|\psi\rangle$ such that $A|\psi\rangle=\lambda_{\nu}(A)\,|\psi\rangle$ , $B|\psi\rangle=\lambda_{\nu}(B)\,|\psi\rangle$ , $AB|\psi\rangle=\lambda_{\nu}(AB)\,|\psi\rangle.$ Furthermore, $(AB)|\psi\rangle=A(B|\psi\rangle)=\lambda_{\nu}(A)\lambda_{\nu}(B)|\psi\rangle$ . By comparison, $\lambda_{\nu}(AB)=\lambda_{\nu}(A)\lambda_{\nu}(B)$ , which proves Eq. (7). $\Box$

4 Parity-based contextuality proofs

The example of Section 2 is not special. As we show here, every parity-based contextuality proof—consisting of a set of conflicting linear constraints on the value assignments as in Eq. (1)—can be given a cohomological formulation. The main result of this section is Theorem 1.

4.1 The chain complex ${\cal{C}}_{*}$

We have two assumptions on the set of operators $\cal{O}$ :

$\cal{O}$ is closed under products of commuting operators i.e., if $[O_{1},O_{2}]=0$ for $O_{1},O_{2}\in{\cal{O}}$ then $O_{1}O_{2}\in\cal{O}$ . 2. 2.

$\cal{O}$ contains the identity operator.

Let $\eta:E\rightarrow{\cal O}$ denote the map given by

[TABLE]

with $E$ the index set introduced in Eq. (3). The set $E$ has more structure coming from Eq. (4). We say two elements $a,b\in E$ commute if the corresponding operators commute $[T_{a},T_{b}]=0$ . Given two commuting elements $a,b\in E$ we define the sum $a+b\in E$ to be the unique element which satisfies $T_{a+b}=\omega^{\beta(a,b)}T_{a}T_{b}$ , cf. Eq. (4). We assume that there is an element in $E$ denoted by [math] corresponding to the identity operator $\eta(0)=I$ in $\cal{O}$ . Under this addition operation every maximal subset of commuting elements in $E$ has the structure of an abelian group.

Let us define the chain complex ${\cal{C}}_{*}={\cal{C}}_{*}(E)$ . A standard reference for chain complexes is [26]. It will suffice to describe this complex up to dimension three, i.e., ${\cal{C}}_{*}=\{C_{0},C_{1},C_{2},C_{3}\}$ . The geometric picture is as follows. The space we consider consists of a single vertex ([math]-cell). It has an edge ( $1$ -cell) for each element of the set $E$ whose both boundary points attached to the single vertex. A face ( $2$ –cell) is attached for every product relation among commuting operators. The set of faces is thus given by

[TABLE]

Thus, every face $(a,b)\in F$ is bounded by three edges, namely $a$ , $b$ and $a+b$ .

Volumes ( $3$ -cells) are constructed from triples of commuting observables $T_{a},T_{b},T_{c}$ (see Fig. 2 for an illustration). The set of volumes is

[TABLE]

Now comes the description of the chains:

$C_{0}=\mathbb{Z}_{d}$ since there is a single vertex. 2. 2.

$C_{1}=\mathbb{Z}_{d}E$ , i.e., the elements of $C_{1}$ are linear combinations

[TABLE]

In other words, $C_{1}$ is freely generated as a $\mathbb{Z}_{d}$ -module by $[a]$ , where $a\in E$ . 3. 3.

$C_{2}$ is freely generated as a $\mathbb{Z}_{d}$ -module by the pairs $[a|b]$ , where $(a,b)\in F$ . 4. 4.

$C_{3}$ is freely generated as a $\mathbb{Z}_{d}$ -module by the triples $[a|b|c]$ , where $(a,b,c)\in V$ .

In summary $C_{1},C_{2},C_{3}$ are freely generated by $E,F,V$ as $\mathbb{Z}_{d}$ -modules. We stop at dimension three although the definition can be continued for higher dimensions analogously, see [28]. The differentials in the complex

[TABLE]

are defined by

[TABLE]

Here the general pattern is as follows

[TABLE]

The homology groups of ${\cal{C}}_{*}$ are defined by

[TABLE]

The dual notion of cochains ${\cal C}^{*}$ gives a cochain complex

[TABLE]

where $C^{n}$ consists of $\mathbb{Z}_{d}$ -module maps $\phi:C_{n}\rightarrow\mathbb{Z}_{d}$ . The differential $d:C^{n}\rightarrow C^{n+1}$ is defined by $d\phi(\alpha)=\phi(\partial\alpha)$ where $\alpha\in C_{n+1}$ .

4.2 $\beta$ is a 2-cocycle

We may now formally extend the function $\beta$ introduced in Eq. (4) from $F$ to all of $C_{2}$ via the linear relations $\beta(u+v)=\beta(u)+\beta(v)$ , $\beta(ku)=k\beta(u)$ , for all $u,v\in C_{2}$ , $k\in\mathbb{Z}_{d}$ . The function $\beta$ is thus a 2-cochain, $\beta\in C^{2}$ .

The function $\beta$ is constrained in the following way. Consider three commuting elements $a,b,c\in E$ , and expand the observable $T_{a+b+c}$ in two ways,

[TABLE]

and

[TABLE]

Comparing the two expressions, we find that

[TABLE]

whenever $[T_{a},T_{b}]=0$ , $[T_{a},T_{c}]=0$ , and $[T_{b},T_{c}]=0$ .

The four faces $(a,b)$ , $(a+b,c)$ , $(a,b+c)$ , $(b,c)$ , with appropriate orientation (hence sign), bound a volume $V$ , i.e.,

[TABLE]

Geometrically, the situation looks as displayed in Fig. 2. We can follow the convention that $(a,b)$ denotes a face in the geometric sense and $[a|b]$ denotes an element of the chain complex. So $\partial V=[a+b|c]+[a|b]-[a|b+c]-[b|c]$ . Therefore, with Eq. (11),

[TABLE]

Applying this relation to all volumes $V\in C_{3}$ , we obtain

[TABLE]

Finally, there is an equivalence relation among the functions $\beta$ . To see this, recall the map $\eta:E\longrightarrow{\cal{O}}$ which is defined by $a\mapsto T_{a}$ . There is a certain freedom in this definition which does not affect the commutation relations of the operators. Consider the following re-parametrization

[TABLE]

where $\gamma:E\longrightarrow\mathbb{Z}_{d}$ . Then $[\eta(a),\eta(b)]=0$ if and only if $[\eta_{\gamma}(a),\eta_{\gamma}(b)]=0$ . From the perspective of contextuality, it does not matter which map $\eta_{\gamma}$ we use to define the observables $\{T_{a},a\in E\}$ . Contextuality cannot be defined away by rephasing. However, the function $\beta$ is affected by the transformation Eq. (13). Namely, changing from $\eta_{0}=\eta$ to $\eta_{\gamma}$ results in

[TABLE]

Therein, all addition is $\text{mod}\;d$ . The functions $\beta$ are thus subject to a restriction Eq. (12) and an identification Eq. (14). The various possible functions $\beta$ thus fall into equivalence classes $[\beta]=\{\beta+d\gamma,\forall\gamma\}$ , and hence $[\beta]\in H^{2}({\cal{C}},\mathbb{Z}_{d})$ .

4.3 Cohomological formulation of parity-based contextuality proofs

The function $\beta$ relates to the question of existence of non-contextual HVMs. We have the following result. First, a non-contextual value assignment $s:E\longrightarrow\mathbb{Z}_{d}$ , is such that $\lambda(T_{a})=\omega^{s(a)}$ . Again, by linearity, we can extend the assignment from $E$ to all of $C^{1}$ , and $s$ is thus a 1-cochain. We have the following relation.

Lemma 2

For every consistent non-contextual value assignment $s:E\longrightarrow\mathbb{Z}_{d}$ it holds that

[TABLE]

Proof of Lemma 2. Evaluating Eq. (15) on any given face $(a,b)\in F$ reads

[TABLE]

As a consequence of Eq. (7), $\lambda(\omega^{x}A)=\omega^{x}\lambda(A)$ , for all $x\in\mathbb{Z}_{d}$ and all $A\in{\cal{O}}$ . Now, with Lemma 1, setting $A=T_{a}$ and $B=T_{b}$ in Eq. (7), it holds that $\lambda(T_{a})\lambda(T_{b})=\lambda(T_{a}T_{b})=\lambda(\omega^{-\beta(a,b)}T_{a+b})=\omega^{-\beta(a,b)}\lambda(T_{a+b})$ . This is precisely what Eq. (16) requires. $\Box$

Theorem 1

Given set ${\cal{O}}$ of observables, if $H^{2}({\cal{C}},\mathbb{Z}_{d})\ni[\beta]\neq 0$ then ${\cal{O}}$ exhibits state-independent contextuality.

Proof of Theorem 1. If there were a value assignment $s$ it would satisfy $ds=-\beta$ . This means that $\beta$ is a boundary: $\beta=d(-s)$ . Hence $[\beta]=0$ . $\Box$

Example: Mermin’s star. In addition to Mermin’s square, which we already discussed in Section 2, we now provide Mermin’s star [3] as a further example. Mermin’s star comes both in a state-independent and a state-dependent version, and is thus best suited as a running example for all topological constructions presented in this paper.

Here we consider the state-independent version; See Fig. 3. Denote by $F_{\text{star}}$ the surface displayed in Fig. 3c, consisting of the five smaller surfaces $F_{1}$ ,.., $F_{5}$ each corresponding to a measurement context in Fig. 3a. Each of the surfaces $F_{i}$ may be split up into two elementary faces; See Fig. 3b. $F_{\text{star}}:=\sum_{i=1}^{5}F_{i}$ satisfies $\partial F_{\text{star}}=0$ . Since $(X_{1}X_{2}X_{3})(X_{1}Y_{2}Y_{3})(Y_{1}X_{2}Y_{3})(Y_{1}Y_{2}X_{3})=-I$ , we have $\beta(F_{5})=1$ , and for the other four measurement contexts it holds that $\beta(F_{i})=0$ . Hence, $\beta(F_{\text{star}})=1$ . If $\beta=ds$ for some 1-cochain $s$ , then $1=\beta(F_{\text{star}})=ds(F_{\text{star}})=s(\partial F_{\text{star}})=s(0)=0$ . Contradiction. Hence, $[\beta]\neq 0$ . Then, by Theorem 1, Mermin’s star exhibits state-independent contextuality, in accordance with the original proof [3].

4.4 Squaring the star

It tuns out that, from the cohomological perspective developed above, Mermin’s square and star are equivalent contextuality proofs. Denote by ${\cal{C}}_{*}(3)$ the complex induced by the set ${\cal{O}}=\mathbb{P}^{3}$ , the Pauli observables on 3 qubits. Both Mermin’s square and star embed into it. The star provides a closed surface $F_{\text{star}}\in C_{2}(3)$ and the square provides a closed surface $F_{\text{square}}\in C_{2}(3)$ , such that $\beta(F_{\text{star}})=1$ and $\beta(F_{\text{square}})=1$ . Both facts thus equally demonstrate that $\beta\neq 0\in H^{2}({\cal{C}}_{*}(3),\mathbb{Z}_{2})$ .

What makes the star and the square equivalent is that there is a volume $V\in C_{3}(3)$ such that

[TABLE]

The surfaces $F_{\text{square}}$ and $F_{\text{star}}$ representing the respective contextuality proofs are elements of the same homology class in $H_{2}({\cal{C}}_{*}(3),\mathbb{Z}_{2})$ ; and therefore $\beta(F_{\text{square}})=\beta(F_{\text{star}})$ for any 2-cocycle $\beta$ .

The volume $V$ of Eq. (17) is depicted in Fig. 4a. The surfaces $F_{\text{star}}$ and $F_{\text{square}}$ are shown in Fig. 4b. They are obtained from another by adding the boundary $\partial V$ . The Mermin square resulting from this procedure is locally rotated w.r.t. the standard convention, namely

[TABLE]

4.5 State-dependent parity proofs

Mermin’s star—whose state-independent version was discussed in Section 4.3—also exists in a state-dependent version [3]. We use it as an initial example, to illustrate the adaption of the topological argument to the state-dependent case and to motivate the definitions Eq. (18) and Def. 2 below. The state-dependent Mermin star contains a special set $S=\{X_{1}X_{2}X_{3},X_{1}Y_{2}Y_{3},Y_{1}X_{2}Y_{3},Y_{1}Y_{2}X_{3}\}$ of observables and a special state, the Greenberger-Horne-Zeilinger state $|\text{GHZ}\rangle=(|000\rangle+|111\rangle)/\sqrt{2}$ . The latter is a simultaneous eigenstate of the observables in $S$ , with eigenvalues $+1,-1,-1,-1$ , respectively. There is thus a value assignment $s(XXX)=0$ , $s(XYY)=s(YXY)=s(YYX)=1$ . From the perspective of non-contextual hidden variable models, the question is whether the value assignment $s$ can be extended in a consistent fashion to the local observables $X_{i}$ and $Y_{i}$ .

Adapting the topological state-independent argument, we now demonstrate that this is not the case. We choose the mapping $\eta$ such that $X_{1}X_{2}X_{3},X_{1}Y_{2}Y_{3},Y_{1}X_{2}Y_{3},Y_{1}Y_{2}X_{3},X_{i},Y_{i}\in\eta(E)$ , and consider the surface $F=\sum_{i=1}^{8}f_{i}$ displayed in Fig. 5b. For any consistent value assignment $s$ we thus have $s(\partial F)=s(XXX)+s(XYY)+s(YXY)+s(YYX)\mod 2=1$ . On the other hand, $\beta(f_{i})=0$ , for $i=1,..,8$ . Thus, assuming the existence of a consistent value assignment $s$ , with $ds=\beta$ (cf. Lemma 2) and with Stokes’ theorem, we arrive at the following contradiction (addition $\text{mod}\;2$ ):

[TABLE]

Hence our assumption that a consistent value assignment exists must be wrong.

We now turn to the general state-dependent scenario. Any state-dependent contextuality proof singles out a subset ${\cal{O}}_{\Psi}\subset{\cal O}$ of observables of which a special state $|\Psi\rangle$ is an eigenstate. Namely,

[TABLE]

The set ${\cal{O}}_{\Psi}$ may or may not be a context. It is required of ${\cal{O}}_{\Psi}$ that the observables therein have at least one joint eigenstate, $|\Psi\rangle$ , but it is not required of them that they commute.

We want to integrate this extra bit of information into our topological description. By the definition of ${\cal{O}}$ and Eq. (18), the set ${\cal{O}}_{\Psi}$ has the property that whenever $[O_{1},O_{2}]=0$ for $O_{1},O_{2}\in{\cal O}_{\Psi}$ the product $O_{1}O_{2}$ also lies in ${\cal O}_{\Psi}$ . We need this condition to be able to construct a subcomplex of ${\cal C}_{*}={\cal C}_{*}(E)$ . The corresponding labels determine a subset $E_{\Psi}\subset E$ of edges and a subcomplex ${\cal C}_{*}(E_{\Psi})$ whose definition is analogous to ${\cal C}_{*}$ .

Let us define $s_{\Psi}:E_{\Psi}\rightarrow\mathbb{Z}_{d}$ via Eq. (18), i.e.,

[TABLE]

We can regard $s_{\Psi}$ as an element of $C^{1}(E_{\Psi})$ by extending it linearly. A consistent value assignment in the state-dependent case has to be compatible with the eigenvalues on the given state. This suggests the following definition.

Definition 2

A state-dependent consistent value assignment is a function $s:E\rightarrow\mathbb{Z}_{d}$ that satisfies

[TABLE]

for all commuting $(a,b)\notin E_{\Psi}\times E_{\Psi}$ , and its restriction to $E_{\Psi}$ coincides with $s_{\Psi}$ .

According to Eq. (20) only the commuting labels which are not contained in $E_{\Psi}$ matters. Geometrically we can remove the edges in $E_{\Psi}$ by contracting them. For the example of Mermin’s star, this process is depicted in Fig. 5. On the algebraic side, the chain complex of the contracted space is described by the relative complex defined by the quotient

[TABLE]

In this quotient edges, the faces, and volumes which come from $E_{\Psi}$ are removed. Therefore we can think of this complex as having edges in the complement $E-E_{\Psi}$ of the set $E_{\Psi}$ . More explicitly, a $1$ -chain in this complex can be identified as a sum

[TABLE]

similarly $2$ -chains are linear combinations of commuting elements not contained in $E_{\Psi}\times E_{\Psi}$ . We refer to the boundary operator of ${\cal C}_{*}(E,E_{\Psi})$ as the relative boundary operator and denote it by $\partial_{R}$ to distinguish it from $\partial$ . The boundary operator $\partial_{R}$ is the same as $\partial$ except that the edges, faces or volumes corresponding to $E_{\Psi}$ are removed. For example, in Mermin’s star of Fig. 5b, $\partial_{R}(f_{1}+f_{2})=a_{X_{1}}+a_{X_{2}}+a_{X_{3}}$ , whereas $\partial(f_{1}+f_{2})=a_{X_{1}}+a_{X_{2}}+a_{X_{3}}+a_{XXX}$ . In general the relative boundary $\partial_{R}f$ of a $2$ -chain $f$ is the sum of the edges in $\partial f$ which lie in $E-E_{\Psi}$ .

The relation between the chain complexes we defined so far can be expressed as a short exact sequence

[TABLE]

and the corresponding short exact sequence of cochain complexes is

[TABLE]

Note that $\mathcal{C}^{*}(E,E_{\Psi})$ can be characterized as cochains in $\mathcal{C}^{*}$ whose restriction to $E_{\Psi}$ vanishes. We will interpret Def. 2 using the cochain complex $\mathcal{C}^{*}(E,E_{\Psi})$ . In order to do this $\beta$ must be modified so that it vanishes on all faces whose boundary is in $E_{\Psi}$ . We will denote the modified function by $\beta_{\Psi}$ , and show that it is a cocycle in $C^{2}(E,E_{\Psi})$ . We define

[TABLE]

where $s_{\Psi}$ is regarded as a function $E\rightarrow\mathbb{Z}_{d}$ by defining it to be zero on $E-E_{\Psi}$ .

Theorem 2

If $[\beta_{\Psi}]\neq 0$ in $H^{2}(\mathcal{C}(E,E_{\Psi}),\mathbb{Z}_{d})$ then the pair $(\mathcal{O},|\Psi\rangle)$ exhibits state dependent contextuality.

Proof of Theorem 2. Given a $2$ -chain $f\in C_{2}(E)$ with boundary

[TABLE]

our definition yields

[TABLE]

Note that $\beta_{\Psi}$ vanishes on faces whose boundary is in $E_{\Psi}$ . To see this let $a,b\in E_{\Psi}$ be two commuting elements. Then,

[TABLE]

Therein, the first line is the definition of $\beta_{\Psi}$ , Eq. (21). The second line follows by Lemma 2, and the third line by the second item of Def. 2. As a result, $\beta_{\Psi}$ is an element of $C^{2}(E,E_{\Psi})$ . Moreover it is a cocyle since $d\beta_{\Psi}=d\beta+dds_{\Psi}=0$ . The remainder of the proof proceeds as in Theorem 1. There is a $1$ -cochain $s$ which satisfies Def. 2 if and only if the cohomology class $[\beta_{\Psi}]$ vanishes. $\Box$

Finally, we return to our initial example of the state-dependent Mermin star, and explain it in terms of the relative cocycle $\beta_{\Psi}\in C^{2}(E,E_{\Psi})$ . Although the new argument is almost exactly the same as the former (which used $\beta$ and $s_{\Psi}$ ), we give it here in order to invoke in an example the above-introduced notions of $\beta_{\Psi}$ and ${\cal{C}}_{*}(E,E_{\Psi})$ . The chain complex ${\cal{C}}_{*}(E,E_{\Psi})$ corresponding to the state-dependent Mermin star has four elementary faces shown in Fig. 5c. $\beta_{\Psi}$ evaluates to 1 on one of those faces, and to 0 on the other three. Thus, for the surface $F$ consisting of these four elementary faces, $\beta_{\Psi}(F)=1$ . We further have $\partial_{R}F=0$ .

Now assume that a consistent non-contextual value assignment $s$ exists, $\beta_{\Psi}=-ds$ . Then, $1=\beta_{\Psi}(F)=ds(F)=s(\partial_{R}F)=s(0)=0$ . Contradiction.

5 Symmetry-based proofs of contextuality

The contextuality proofs in this section are based on invariance transformations. They lead the assumption of the existence of non-contextual value assignments into an algebraic contradiction, as did the parity-based proof encountered before. The new ingredient of these proofs is symmetry, and its representation in terms of group cohomology.

The main results of this section are Theorems 3, 4, 5 and 6 relating contextuality to the cohomology of the symmetry group. Also, we establish a relation between symmetry-based contextuality proofs and the parity-based proofs of Section 4.3; see Corollary 1.

5.1 First example based on Mermin’s square

To illustrate the concept of contextuality proofs based on a symmetry $G$ of a set ${\cal{O}}$ of observables, we return to our earlier example of Mermin’s square. We find that it is invariant under certain symmetry transformations, for example the exchange of the two qubits, a Hadamard gate on qubit 1 or 2, or the CNOT gate between qubits 1 and 2; See Fig. 6. The square is mapped to itself under these transformations, with the observables in ${\cal{O}}$ and the contexts being permuted, and observables possibly flipping signs. Consider, in particular, the transformation of the square under the Hadamard gate $H_{1}$ . In this case, the Pauli observable $Y_{1}Y_{2}$ changes its sign under conjugation, whereas all the other observables in the square map to one another without incurring sign changes. As we discuss now, a contextuality proof can be extracted from this transformation behaviour. This proof is of a different kind than the earlier parity proof, since the parity proof does not invoke any symmetry transformation.

For this example, $\eta(E)$ is

[TABLE]

and $E$ is the corresponding index set. The Hadamard gate $H_{1}$ on the first qubit is in the symmetry group $G$ for Mermin’s square, i.e. it maps the set $\mathcal{O}=\pm\eta(E)$ to itself. For example, $H_{1}:X_{1}\leftrightarrow Z_{1}$ , $Y_{1}Y_{2}\leftrightarrow-Y_{1}Y_{2}$ , etc. The latter minus sign is important for the proof.

Assume that a consistent non-contextual value assignment $s$ exists. Then, from it, an new value assignment $s^{\prime}$ can be constructed that is obtained from $s$ by application of the Hadamard gate $H_{1}$ . Namely,

[TABLE]

We now consider the quantity

[TABLE]

With the above transformation $s\longrightarrow s^{\prime}$ , we observe that

[TABLE]

Now consider obtaining the value assignment $s^{\prime}$ from $s$ by flipping individual values. To preserve the product constraints in the square—which from the perspective of contextuality are the relevant information contained in ${\cal{O}}$ —there must be an even number of flips in every row and column of the square, and hence

[TABLE]

This is in contradiction to Eq. (24), and our assumption that a consistent value assignment $s$ existed must be wrong.

5.2 The symmetries of ${\cal{O}}$

For our general setting, we consider transformations $g\in G$ that satisfy the following two properties.

(i)

The set ${\cal{O}}$ is preserved under all transformations in $G$ . That is, there is an action of $G$ on ${\cal{O}}$ and an induced action of $G$ on $E$ such that

[TABLE]

Therein, $\tilde{\Phi}$ is the so-called phase function. It describes how observables in ${\cal{O}}$ transform under the symmetry group $G$ .

(ii)

Multiplication in all abelian subgroups of ${\cal{O}}$ is preserved,

[TABLE]

for all pairs of commuting $O_{1},O_{2}\in{\cal{O}}$ and all $g\in G$ .

The conjugation by a Hadamard gate $H_{1}$ on qubit 1 described in Section 5.1, $H_{1}(T_{a})=H_{1}T_{a}H_{1}^{\dagger}$ , is a special case of the transformations Eq. (26), (27).

The above transformations $g$ form a group under composition. Let $\text{Sym}({\cal O})$ denote the group of all symmetries of ${\cal O}$ , that is all the transformations satisfying Eq. (26)-(27). An action of a group $G$ as defined above gives a group homomorphism

[TABLE]

which sends a group element $g$ to the transformation determined by Eq. (26).

Eq. (26) can be understood as a coordinate transformation. Commuting observables obey the same algebraic relations before and after the transformation. The constraint Eq. (27) enforces this property.

It is useful to restate Eq. (27) in terms of $\eta(E)\subset{\cal{O}}$ . It then reads

[TABLE]

Thus, for all $g\in G$ , the function $\beta:C_{2}\longrightarrow\mathbb{Z}_{d}$ is the same before and after the transformation.

The phase function $\tilde{\Phi}$ satisfies a further constraint resulting from the compatibility with the group structure of $G$ . Namely, we require that $(gh)(T_{a})=g(h(T_{a}))$ , for all $g,h\in G$ and all $a\in E$ .

To state the above two conditions in a convenient form, we develop further the underlying topological notions. The function $\tilde{\Phi}$ assigns to a group element $g\in G$ a function $\tilde{\Phi}_{g}:C_{1}\rightarrow\mathbb{Z}_{d}$ . Therefore we can think of $\tilde{\Phi}$ as an element of $C^{1}(G,C^{1})$ , the group of $1$ -cochains which takes values in $C^{1}$ . We can also regard $\beta$ as an element of $C^{0}(G,C^{2})$ by identifying [math]-cochains with the coefficient group $C^{2}$ . To express the properties of $\tilde{\Phi}$ in a compact way we introduce the more general object $C^{p}(G,C^{q})$ . These are $p$ -cochains on $G$ taking values in the group $C^{q}$ of $q$ -cochains in the complex $\cal{C}$ . There are two types of differentials

[TABLE]

The vertical differential $d^{v}$ is induced by the differentials in $\cal{C}$ , the horizontal differential $d^{h}$ is the group cohomology differential.

Lemma 3

For all phase functions $\tilde{\Phi}$ defined through Eq. (26) it holds that

[TABLE]

The cocycle $\beta$ and the phase function $\tilde{\Phi}$ , along with its “essence” $\Phi$ introduced below, are the central physical objects in this paper. $\beta$ describes algebraic relations among commuting observables in ${\cal{O}}$ , and $\tilde{\Phi}$ describes the transformation behaviour of these observables under the symmetry group $G$ . Eq. (31) shows that these two quantities are linked.

Proof of Lemma 3. Regarding Eq. (31a), with the transformation rule Eq. (26) for observables, we find

[TABLE]

Alternatively, using group compatibility $(gh)(T_{a})=g(h(T_{a}))$ , we find $\forall a\in E,\,\forall g,h\in G$

[TABLE]

Comparing the two expressions, we find the group compatibility condition

[TABLE]

which is Eq. (31a).

Eq. (31b) is a consequence of Eq. (29). We have

[TABLE]

and after rearranging it we obtain Eq. (31b). $\Box$

The symmetry-based contextuality proofs discussed in this section will employ the phase function $\tilde{\Phi}$ . Lemma 4 below is a first link between the phase function and consistent value assignments.

Lemma 4

If $\textbf{s}:E\longrightarrow\mathbb{Z}_{d}$ satisfies the consistency constraints Eq. (15) of Lemma 2, then so does $\textbf{s}^{\prime}:E\longrightarrow\mathbb{Z}_{d}$ defined for any given $g\in G$ by

[TABLE]

This Lemma provides the formal justification for Eq. (22) in the contextuality proof of Section 5.1, namely the transformation of the value assignment $s$ into a new assignment $s^{\prime}$ under the Hadamard gate $H_{1}$ . Recall that all addition involving the phase function is $\text{mod}\;d$ .

Proof of Lemma 4. With Eqs. (33), (31b) and (15) we have

[TABLE]

Thus, the same constraints Eq. (15) satisfied by $s$ are also satisfied by $s^{\prime}$ . $\Box$

5.3 The general state-independent case

Here we generalize the symmetry-based proof for Mermin’s square given in Section 5.1 to general sets ${\cal{O}}$ of observables with a sufficiently large symmetry group $G$ . To begin, let’s analyze the inner workings of that proof.

First, consider the sum $\chi(s)$ of value assignments. In cochain notation it reads $\chi(s)=s(e)$ , for some 1-chain $e$ (in the above case, $e=\sum_{a\in E}\alpha_{a}[a]$ where $\alpha_{a}\in\mathbb{Z}_{d}$ ). In order to permit the comparison of Eq. (24), i.e., in order to have the same summation on the lhs and rhs, the transformation $g$ ( $g=H_{1}$ in the proof of Section 5.1), needs to satisfy

[TABLE]

Further, in order to have definite values on either side of Eq. (25), $\chi(s)=s(e)$ needs to be a sum of constraints. In topological notation, we thus require that

[TABLE]

for some $f\in C_{2}$ .

Finally, in order to have disagreement between the comparisons of Eq. (24) and (25), we must require that

[TABLE]

The conditions Eq. (34) - (36) are the central ingredients for the symmetry-based proofs. This leads us to the following result.

Lemma 5

Given a set ${\cal{O}}$ of observables and the corresponding symmetry group $G$ , if there exist a $g\in G$ and an $f\in C_{2}$ such that $g\,\partial f=\partial f$ and $\tilde{\Phi}_{g}(\partial f)\neq 0$ then ${\cal{O}}$ has state-independent contextuality.

In addition to the above argument, we now give a formal proof for this Lemma.

Proof of Lemma 5. Eq. (31b) implies that

[TABLE]

Now under the assumption that there exists a value assignment $s$ satisfying Eq. (15) and there exists $g\in G$ and $f\in C_{2}$ such that $g\partial f=\partial f$ this equation becomes

[TABLE]

Therefore if $\tilde{\Phi}_{g}(\partial f)\not=0$ we get a contradiction. $\Box$

Example: decorated Mermin star. We present a symmetry-based proof for the “decorated Mermin star”, depicted in Fig. 7a, based on the symmetry transformation

[TABLE]

where $A:=(X+Y)/\sqrt{2}$ . We call this version of Mermin’s star “decorated”, because of the additional observable $I_{1}Z_{2}Z_{3}$ which is not included in the original star, but automatically included in the corresponding setting derived from a complex ${\cal{C}}$ (cf. the first property of ${\cal{C}}$ described in Section 4.1). This additional observable is of importance for the symmetry-based proof.

We show that for $g=A_{1}A_{2}$ the two conditions of Lemma 5, namely $\exists f\in C_{2}$ such that $A_{1}A_{2}\,\partial f=\partial f$ and $\tilde{\Phi}_{A_{1}A_{2}}(\partial f)\neq 0$ are met. Choose $f=f_{1}+f_{2}+f_{3}$ , with

[TABLE]

See Fig. 7b for illustration. It is now easily verified that $\partial f=A_{1}A_{2}\,\partial f$ . Furthermore, since $AZ=-ZA$ , it holds that $\tilde{\Phi}_{A_{1}A_{2}}(a_{IZZ})=1$ . For all other edges $a$ displayed in Fig. 7b, it holds that $\tilde{\Phi}_{A_{1}A_{2}}(a)=0$ . Finally, since $a_{IZZ}\in\{\partial f\}$ , it follows that $\tilde{\Phi}_{A_{1}A_{2}}(\partial f)=1\neq 0$ . The conditions of Lemma 5 are thus met, and the decorated Mermin star is contextual.

5.4 Topological formulation

We now reformulate Lemma 5 in terms of cohomology groups, which are invariant objects in topology. The result is Theorem 3. To this end, we investigate the effect of the transformations Eq. (13) on $\tilde{\Phi}$ . Changing from the map $\eta$ of Eq. (8) to $\eta_{\gamma}$ induces the change

[TABLE]

The cohomological interpretation of this equation is

[TABLE]

The change of the map $\eta$ has no effect on contextuality, as was demonstrated in Section 4.2. The phase functions $\tilde{\Phi}$ thus group into equivalence classes

[TABLE]

Together with Eq. (31a), this implies that $[\tilde{\Phi}]\in H^{1}(G,C^{1})$ .

To make contact with Lemma 5, we now restrict the 1-chains of $C_{1}$ on which the phase functions $\tilde{\Phi}_{g}$ are evaluated. The boundaries $B_{1}$ are contained in $C_{1}$ as a subgroup. We can write this as a short exact sequence

[TABLE]

Now taking the duals of each group in this sequence gives an other short exact sequence. That is applying $\text{Hom}(-,\mathbb{Z}_{d})$ to each group in the above exact sequence gives

[TABLE]

where $V=\text{Hom}(C_{1}/B_{1},\mathbb{Z}_{d})$ and $U=\text{Hom}(B_{1},\mathbb{Z}_{d})$ . More explicitly, $U$ consists of $\mathbb{Z}_{d}$ -linear maps $B_{1}\rightarrow\mathbb{Z}_{d}$ and $V$ is the set of 1-cocycles, i.e., the set of 1-cochains that vanish on boundaries,

[TABLE]

Let $\tilde{\Phi}|_{B_{1}}:G\rightarrow U$ denote the composition of $\tilde{\Phi}:G\rightarrow C^{1}$ with the map $C^{1}\rightarrow U$ in the short exact sequence in (39).

We still have the constraint

[TABLE]

and re-parametrizing by $\gamma$ has the effect of

[TABLE]

and therefore

[TABLE]

We then have the following topological reformulation of Lemma 5.

Lemma 6

For a given set ${\cal{O}}$ of observables and corresponding symmetry group $G$ , if $[\tilde{\Phi}|_{B_{1}}]\neq 0\in H^{1}(G,U)$ then ${\cal{O}}$ exhibits state-independent contextuality.

Proof of Lemma 6. The elements of $B_{1}$ are of the form $\partial f$ for some $2$ –chain $f$ . By Lemma 3 we have

[TABLE]

As shown in the proof of Lemma 5 if there is a value assignment $s$ , that is $d^{v}s=-\beta$ , then

[TABLE]

where $d^{v}s(g,f)=s(g,\partial f)$ by definition of the horizontal differential. In other words $\tilde{\Phi}|_{B_{1}}$ is the coboundary of $s|_{B_{1}}$ with respect to the group cohomology differential. So existence of a value assignment implies that $[\tilde{\Phi}|_{B_{1}}]=0$ . $\Box$

We proceed to establish a further reformulation of Lemma 5, Theorem 3 below. It makes explicit the structure of the symmetry group $G$ , which is of relevance for MBQC. Namely, the symmetry group $G$ has a subgroup $N$ which fixes the edges. That is $n(T_{a})=\omega^{\tilde{\Phi}_{n}(a)}T_{a}$ for all $n\in N$ . We now make two observations:

(i) $N$ is normal in $G$ . Hence the set of equivalence classes $\{gn,\;n\in N\}$ forms a group $Q:=G/N$ .

(ii) A symmetry-based contextuality proof according to Lemma 5 works for a group element $g\in G$ if and only if it works for any $gn$ , with $n\in N$ . That is, symmetry-based contextuality proofs are properties of equivalence classes $\{gn,\;n\in N\}$ , or, equivalently, of elements $q\in Q$ .

A proof of statement (ii) is as follows. We verify that the conditions of Lemma 5 are met for the pair $(g,f)$ if and only if they are met for the pair $(gn,f)$ , with $n\in N$ . We observe that $n\,a=a$ , for all $n\in N$ and all $a\in E$ . Thus, first, $g\partial f=\partial f\Longleftrightarrow gn\,\partial f=\partial f$ .

Furthermore, by Eq. (31b) and since $nf=f$ , it holds that $\tilde{\Phi}_{n}(\partial f)=d^{v}\tilde{\Phi}(n,f)=d^{h}\beta(n,f)=\beta(nf)-\beta(f)=0$ . Then, by group compatibility Eq. (32), $\tilde{\Phi}_{gn}(\partial f)=\tilde{\Phi}_{g}(\partial f)$ . Thus, second, $\tilde{\Phi}_{gn}(\partial f)\neq 0\Longleftrightarrow\tilde{\Phi}_{g}(\partial f)\neq 0$ . $\Box$

Let $\pi:G\rightarrow Q$ denote the quotient map and $\theta:Q\rightarrow G$ be a section of $\pi$ i.e. $\pi\theta(q)=q$ for all $q\in Q$ . We define $\Phi:Q\rightarrow U$ to be the composition of $\tilde{\Phi}|_{B_{1}}:G\rightarrow U$ with $\theta$ . Then the observation that $\tilde{\Phi}_{gn}(\partial f)=\tilde{\Phi}_{g}(\partial f)$ for all $n\in N$ can be written as

[TABLE]

where $q=\pi(g)$ . Moreover, this observation combined with Eq. (31a) in Lemma 3 implies that $\Phi$ is a cocycle.

Theorem 3

For a given set ${\cal{O}}$ of observables and corresponding symmetry group $G$ , if $[\Phi]\neq 0\in H^{1}(Q,U)$ then ${\cal{O}}$ exhibits state-independent contextuality.

This is our final result on symmetry-based contextuality proofs for the state-independent case.

Proof of Theorem 3. By Lemma 6 we need to show $[\Phi]\neq 0$ if and only if $[\tilde{\Phi}|_{B_{1}}]\neq 0$ . By definition of $\Phi$ , its class $[\Phi]$ maps to $[\tilde{\Phi}|_{B_{1}}]$ under the map

[TABLE]

induced by the homomorphism $\pi:G\rightarrow Q$ . That is, $[\tilde{\Phi}|_{B_{1}}]\neq 0$ implies $[\Phi]\neq 0$ . For the converse assume $\tilde{\Phi}|_{B_{1}}$ is a coboundary:

[TABLE]

for some $s:B_{1}\rightarrow\mathbb{Z}_{d}$ . Since $\theta(q)\partial f=g\,\partial f$ for $q=\pi(g)$ and by definition of $\Phi$ we have

[TABLE]

Therefore $[\Phi]=0$ in $H^{1}(Q,U)$ . In other words $[\Phi]\neq 0$ implies $[\tilde{\Phi}|_{B_{1}}]\neq 0$ . $\Box$

5.5 Relation between parity-based and symmetry-based proofs

We have so far found two topological methods to prove Kochen-Specker theorems in the state-independent case, one involving the second cohomology group $H^{2}({\cal{C}},\mathbb{Z}_{d})$ in a chain complex ${\cal{C}}$ and the other involving the first cohomology group $H^{1}(G,U)$ of a symmetry group $G$ . In this section we show that these proofs are related. It turns out that the symmetry-based proofs are at most as strong as the parity proofs. A proof of the former kind always implies a proof of the latter kind.

Corollary 1

Every symmetry-based proof of contextuality implies a parity-based proof.

Proof of Corollary 1. What we actually proved in Lemma 6 is $[\beta]=0$ implies $[\tilde{\Phi}|_{B_{1}}]=0$ . The other way around, $[\tilde{\Phi}|_{B_{1}}]\not=0$ implies $[\beta]\not=0$ . $\Box$

5.6 Contextuality and the group extension problem

The group extension problem is concerned with the following question: “Given two groups $Q$ and $N$ , with an action of $Q$ on $N$ , what are the groups $G$ such that $N\subset G$ and $Q=G/N$ ?”. Any such group $G$ is called an extension of $N$ and $Q$ , which is expressed as a short exact sequence

[TABLE]

The simplest way to compose the groups $Q$ and $N$ is via the semi-direct product, $G=Q\ltimes N$ , but often there are additional possibilities. A semi-direct product is a twisted version of the direct product $Q\times N$ i.e. when multiplying two elements

[TABLE]

on the second factor $n_{1}$ is changed by an automorphism which depends on $q_{2}$ .

For example, the quaternion group $Q_{8}$ has a normal subgroup $\mathbb{Z}_{4}$ and a quotient $\mathbb{Z}_{2}$ , but $Q_{8}\neq\mathbb{Z}_{2}\ltimes\mathbb{Z}_{4}$ , which can be seen by counting the elements of order two.

The structure of the group extension has implications on the detection of contextuality by cohomology groups, as we now explain. The exact sequence (39) gives a short exact sequence of cochain complexes

[TABLE]

which gives long exact sequence of cohomology groups

[TABLE]

see [32, Proposition 6.1]. In general $\sigma([\alpha])$ is defined by lifting the cocycle $\alpha$ in $C^{k}(Q,U)$ to an element of $C^{k}(Q,C^{1})$ which we denote by $\alpha^{\prime}$ , and then applying the (group cohomology) differential $d^{h}$ . Then the coboundary $d^{h}\alpha^{\prime}$ is in $C^{k+1}(Q,C^{1})$ . Its image under $j$ is zero since $j(d^{h}\alpha^{\prime})=d^{h}(j(\alpha^{\prime}))=d^{h}\alpha=0$ . Therefore $d^{h}\alpha^{\prime}$ actually lies in $C^{k+1}(Q,V)$ . The map $\sigma$ sends $[\alpha]$ to $[d^{h}\alpha^{\prime}]$ . Now let us describe the class $\sigma([\Phi])$ . Recall that $\Phi:Q\rightarrow U$ is defined by the composition $Q\stackrel{{\scriptstyle\theta}}{{\rightarrow}}G\stackrel{{\scriptstyle\tilde{\Phi}|_{B_{1}}}}{{\rightarrow}}U$ . As the lift of this class we can take $\Phi^{\prime}:Q\rightarrow C^{1}$ defined by the composition $Q\stackrel{{\scriptstyle\theta}}{{\rightarrow}}G\stackrel{{\scriptstyle\tilde{\Phi}}}{{\rightarrow}}C^{1}$ . Then $j(\Phi^{\prime})=\Phi$ . Therefore we have

[TABLE]

Theorem 4

For a given set ${\cal{O}}$ of observables and corresponding symmetry group $G$ , if $\sigma([\Phi])\neq 0\in H^{2}(Q,V)$ then ${\cal{O}}$ exhibits state-independent contextuality.

Proof of Theorem 4. If $\sigma([\Phi])\neq 0$ then the class $[\Phi]$ which maps to it cannot be zero. Now Theorem 3 implies that ${\cal{O}}$ exhibits state-independent contextuality. $\Box$

The gist of the above Theorems 1 – 4 is thus the chain of implications

[TABLE]

Thus, $\sigma([\Phi])$ , $[\Phi]$ , $[\beta]$ are successively stronger contextuality witnesses.

We conclude this section by showing that the weakest of these witnesses, $\sigma([\Phi])$ , is indeed strictly weaker than $[\Phi]$ . We demonstrate this by example. First, the following observation is helpful.

Lemma 7

If $G=Q\ltimes N$ then $\sigma([\Phi])=0$ .

Remark: Lemma 7 can be strengthened to an “if and only if” if the symmetry group $G$ is large enough. See Lemma 10 in Appendix C.

Proof of Lemma 7. The proof essentially follows from Eq. (43). We can choose the section $\theta:Q\rightarrow G$ to be a group homomorphism since $G$ splits as a semi-direct product. Then $\theta$ induces a map $\theta^{*}:C^{2}(G,C^{1})\rightarrow C^{2}(Q,C^{1})$ of chain complexes. In this case we have $\Phi^{\prime}=\theta^{*}(\tilde{\Phi})$ . By Eq. (43) we have $\sigma([\Phi])=[d^{h}\Phi^{\prime}]=[d^{h}\theta^{*}(\tilde{\Phi})]=[\theta^{*}(d^{h}\tilde{\Phi})]=0$ . $\Box$

Thus, if $G=Q\ltimes N$ we do not have any hope for detecting contextuality by the cohomology class $\sigma([\Phi])\in H^{2}(Q,V)$ . We use this observation in the example of the decorated Mermin star, discussed at the end of Section 5.3. Consider as the symmetry group $G$ the group generated by $u(g)=A_{1}A_{2}I_{3}$ (cf. Eq. (37)) and the set of all 3-qubit Pauli operators, ${\cal{P}}_{3}$ . The Pauli operators form the normal subgroup $N$ , and $Q=G/N\cong\mathbb{Z}_{2}$ . Note that $g^{2}=I$ , and

[TABLE]

We have in particular that $\langle g\rangle\cap({\cal{P}}_{3}=N)=I$ , and thus $G=\mathbb{Z}_{2}\ltimes N$ . This means $\sigma([\Phi])=0$ by Lemma 7, and the symmetry group under consideration does not provide a contextuality proof via Theorem 4. Yet, $[\Phi]\not=0$ since $[\tilde{\Phi}|_{B_{1}}]\not=0$ , and a contextuality proof is provided by Theorem 3.

5.7 State-dependent contextuality proofs based on symmetry

State-dependent, symmetry-based proofs of contextuality have previously been constructed by Spekkens, Edwards and Coecke [29] and by J. Lawrence [30], for GHZ-scenarios. Here we describe general such contextuality proofs, and relate them to group cohomology.

The symmetry group $G$ of the state-independent case preserves $E$ and $\beta$ . It is now replaced by a subgroup $H\subset G$ which preserves $E$ , $E_{\Psi}$ , $\beta$ and $s_{\Psi}$ . For any $g\in G$ that preserves $E_{\Psi}$ the action on $s_{\Psi}$ is as follows. The resource state $|\Psi\rangle$ is an eigenstate of any $T_{a}$ , $a\in E_{\Psi}$ , with eigenvalue $\omega^{s_{\Psi}(a)}$ . Thus, $\langle T_{a}\rangle_{\Psi}=\omega^{s_{\Psi}(a)}$ , for all $a\in E_{\Psi}$ . By Eq. (26), under the transformation $g$ the expectation value $\langle T_{a}\rangle_{\Psi}$ transforms as $\langle T_{a}\rangle_{\Psi}\longrightarrow\langle g(T_{a})\rangle_{\Psi}=\omega^{\tilde{\Phi}_{g}(a)}\omega^{s_{\Psi}(ga)}=\omega^{s_{\Psi}^{\prime}(a)}$ . Hence, the update rule for the values $s_{\Psi}$ is

[TABLE]

for all $g\in G$ such that $g(E_{\Psi})=E_{\Psi}$ and all $a\in E_{\Psi}$ . Now the extra condition on the subgroup $H\subset G$ is that $s_{\Psi}^{\prime}\equiv s_{\Psi}$ , for all $h\in H$ . Thus, in topological notation,

[TABLE]

It is useful to illustrate these symmetry constraints with the example of the state-dependent Mermin star; See Fig. 8. We consider the transformation $g\in G$ that has a unitary projective representation $u(g)=A_{1}A_{2}I_{3}$ , which acts on the observables in ${\cal{O}}$ by conjugation. It preserves $E$ and $\beta$ as we have seen before, and it also preserves $E_{\Psi}$ . But it does not preserve $s_{\Psi}$ . For example, since $\tilde{\Phi}_{g}(a_{XXX})=0$ , it holds that $s^{\prime}_{\Psi}(a_{XXX})=s_{\Psi}(a_{YYX})+0=1$ , whereas $s_{\Psi}(a_{XXX})=0$ . Likewise, $s^{\prime}_{\Psi}(a_{YYX})=0$ but $s_{\Psi}(a_{YYX})=1$ . The values of $s_{\Psi}(a_{XXX})$ and $s_{\Psi}(a_{YYX})$ are thus flipped, while the values $s_{\Psi}(a_{XYY})$ and $s_{\Psi}(a_{YXY})$ remain unchanged. Therefore, $g\not\in H$ .

However, we can find a related transformation $h\in H$ , defined via $u(h)=Y_{3}u(g)=A_{1}A_{2}Y_{3}$ . Namely, the extra operation $Y_{3}$ flips $s^{\prime}_{\Psi}(a_{XXX})$ and $s^{\prime}_{\Psi}(a_{YYX})$ back, and leaves $s^{\prime}_{\Psi}(a_{YXY})$ and $s^{\prime}_{\Psi}(a_{XYY})$ unaffected. The action of $Y_{3}$ also preserves $E$ , $E_{\Psi}$ and $\beta$ . In total, $h$ preserves $E$ , $E_{\Psi}$ , $\beta$ and $s_{\Psi}$ , and is thus in the symmetry group $H$ .

We will formulate symmetry based state-dependent contextuality proofs using the symmetry group $H$ and the relative complex ${\cal C}_{*}(E,E_{\Psi})$ . The symmetry group $H$ preserves $E_{\Psi}$ by definition. It acts on the chain complex ${\cal C}_{*}(E_{\Psi})$ by permuting the edges, faces, and volumes in each dimension. There is an induced action on the quotient ${\cal{C}}_{*}(E,E_{\Psi})$ . Geometrically we can think of this action as the permutation of the cells of the contracted space. The action on the chains gives an action on the cochains. By replacing $G$ and ${\cal C}_{*}(E)$ in Diag. (30) by $H$ and ${\cal C}_{*}(E,E_{\Psi})$ we consider the cochain complex $C^{p}(H,C^{q}(E,E_{\Psi}))$ with horizontal $d^{h}$ and vertical $d^{v}$ differentials. As before $d^{h}$ is induced by the group cohomology differential, and $d^{v}$ is induced by the relative boundary operator. The counterpart of Lemma 5 for the state-dependent case is the following.

Lemma 8

Given a set ${\cal{O}}$ of observables, a quantum state $|\Psi\rangle$ and the corresponding symmetry group $H$ , if there exists an $h\in H$ and an $f\in C_{2}$ such that $h\,\partial_{R}f=\partial_{R}f$ and $\tilde{\Phi}_{h}(\partial_{R}f)\neq 0$ then ${\cal{O}}$ has state-dependent contextuality.

Example: The prototypical state-dependent contextuality scenario is the state-dependent version of Mermins’s star [3], depicted in Fig. 8. In this case, the set $S=\{XXX,XYY,YXY,YYX\}$ is a context. The state $|\Psi\rangle$ is the Greenberger-Horne-Zeilinger (GHZ) state $|\text{GHZ}\rangle=(|000\rangle+|111\rangle)/\sqrt{2}$ [17]. The symmetry group $H$ is generated by permutation of the three particles, the transformation $A_{1}\otimes A_{2}\otimes Y_{3}$ , and the GHZ stabilizer $S$ . Choose $H\ni h=A_{1}A_{2}Y_{3}$ and $f=f_{1}+f_{2}$ , with the labeling referring to Fig. 8. Indeed, $\partial_{R}f=a_{X_{1}}+a_{X_{3}}+a_{Y_{1}}+a_{Y_{3}}=A_{1}A_{2}Y_{3}\,\partial_{R}f$ . Furthermore, $\tilde{\Phi}_{AAY}(a_{X_{3}})=1$ and for all other $a\in\{\partial_{R}f\}$ it holds that $\tilde{\Phi}_{AAY}(a)=0$ . Hence, $\tilde{\Phi}(\partial_{R}f)=1$ . The two conditions in Lemma 8 are thus satisfied, and the state-dependent version of Mermin’s star is contextual.

Let’s verify this statement at the elementary level, similar to the symmetry-based contextuality proof for Mermin’s square in Section 5.1. Assume a consistent value assignment exists. Then, with all addition mod 2,

[TABLE]

Contradiction. Hence no consistent value assignment exists. Above, in lines 1 and 6 we have used that $X_{1}X_{2}X_{3}\,|\text{GHZ}\rangle=-Y_{1}X_{2}Y_{3}\,|\text{GHZ}\rangle=|\text{GHZ}\rangle$ . In lines 2 and 5 we have used the consistency of value assignments, in line 3 Lemma 4, and in line 4 the above stated values for $\tilde{\Phi}_{AAY}$ .

In analogy with Lemma 6 a cohomological formulation of Lemma 8 can be achieved. Let $B_{1}^{\prime}$ denote the boundaries in $C_{2}(E,E_{\Psi})$ . Let $U_{\Psi}$ denote the $1$ -cochains defined on the boundaries $B_{1}^{\prime}$ , and $V_{\Psi}$ denote $1$ -cochains which vanish on the boundaries $B_{1}^{\prime}$ . With these definitions we have a short exact sequence

[TABLE]

a state-dependent version of the short exact sequence (39).

Lemma 9

If $[\tilde{\Phi}|_{B_{1}^{\prime}}]\neq 0$ in $H^{1}(H,U_{\Psi})$ then the pair $(\mathcal{O},|\Psi\rangle)$ with symmetry group $H\subset G$ exhibits state-dependent contextuality.

Proof of this lemma is the same as Lemma 6 after $\partial$ is replaced by $\partial_{R}$ .

We are now in the position to obtain the state-dependent versions of Theorems 3 and 4. Recall that $N$ is the normal subgroup of $G$ which preserves operators in $\cal{O}$ up to a scalar. Consider the intersection $N^{\prime}=H\cap N$ and the quotient group $Q^{\prime}=H/N^{\prime}$ . Let $\Phi^{\prime}:Q^{\prime}\rightarrow U_{\Psi}$ denote the composition of a section $\theta^{\prime}:Q^{\prime}\rightarrow H$ of the map $H\rightarrow H/N^{\prime}$ with the restricted map $\tilde{\Phi}|_{B_{1}^{\prime}}:G\rightarrow U_{\Psi}$ . Arguing as in the proof of Theorem 3 we obtain the following.

Theorem 5

If $[\Phi^{\prime}]\neq 0$ in $H^{1}(Q^{\prime},U_{\Psi})$ then the pair $(\mathcal{O},|\Psi\rangle)$ with symmetry group $H\subset G$ exhibits state-dependent contextuality.

Similarly as in Theorem 4 second cohomology groups play a role in state-dependent case. The long exact sequence associated to (45) gives a map $\sigma:H^{1}(Q^{\prime},U_{\Psi})\rightarrow H^{2}(Q^{\prime},V_{\Psi})$ .

Theorem 6

If $\sigma([\Phi^{\prime}])\neq 0$ in $H^{2}(Q^{\prime},V_{\Psi})$ then the pair $(\mathcal{O},|\Psi\rangle)$ with symmetry group $H\subset G$ exhibits state-dependent contextuality.

6 Conclusion

In this work we have discussed two kinds of contextuality proofs, based on parity and on symmetry respectively. Both types of proofs come in two flavours, state-independent and state-dependent. For each of the four resulting cases, we have established that the obstruction to the existence of non-contextual hidden variable models is topological.

Regarding the parity-based proofs (as in Mermin’s square and star), algebraic relations among the observables involved are captured by a 2-cocyle $\beta$ living in a suitably defined chain complex ${\cal{C}}_{*}$ , and $[\beta]\not\in H^{2}({\cal{C}^{*}},\mathbb{Z}_{d})$ is a witness of contextuality.

The symmetry-based proofs invoke transformations that leave the complex ${\cal{C}}_{*}$ and product relations among commuting observables invariant. Again, nontrivial cohomology of any such group is an obstruction to the viability of a non-contextual hidden variable model for the given setting.

The purpose of studying the above contextuality proofs is their relation to quantum computation. Contextuality has previously been established as a necessary resource for quantum computation, in both the models of quantum computation with magic states (see [8]–[10]) and measurement-based quantum computation (MBQC) (see [11]–[13]). The type of contextuality considered here is precisely what shows up in MBQC. The study of the mathematical structure underlying such contextuality proofs may thus lead to novel insights into the foundations of quantum computation.

Acknowledgments.

CO acknowledges funding from NSERC. SDB acknowledges support from the ARC via the Centre of Excellence in Engineered Quantum Systems (EQuS), project number CE110001013. RR is supported by NSERC and Cifar, and is scholar of the Cifar Quantum Information Processing program.

Appendix A Contextuality in measurement-based quantum computation

In this appendix we review measurement-based quantum computation and the role of contextuality in it. This section is based on [7], [11], [12] and [13]. We assemble this material here is to provide the background and motivation for the cohomological framework of contextuality developed in the main text.

We emphasize one point in particular: The classical processing relations of MBQC for determining the computational output from the individual measurement outcomes, when spelled out for all values of the computational input, are precisely the equations that give contextuality proofs based on the impossibility of non-contextual value assignments [3]. See Section A.2.

A.1 Quantum computation by local measurement

Measurement-based quantum computation [7] is a scheme of universal quantum computation in which the process of computation is driven by local measurements as opposed to unitary gates. The measurements are applied to a suitable entangled state, such as a cluster state or graph state. The pattern of measurements encodes the algorithm implemented. For reviews of measurement-based quantum computation, see [33], [34], [35].

Each MBQC consists of (i) a resource state $|\Phi\rangle$ whose entanglement is consumed by the process of computation, (ii) the set of observables measured to drive the computation, and (iii) rules for the classical side-processing of measurement outcomes.

(i) Resource state. The standard choices for the resource state $|\Phi\rangle$ are cluster states or graph states, which are stabilizer states where the stabilizer generators have a particular geometric interpretation; See [7].

(ii) Measured observables. The standard choice for the local measured observables is

[TABLE]

for all qubits $i$ . Therein, the angles $\{\phi_{i}\}$ are a property of the quantum algorithm to be implemented, and the binary numbers $\{q_{i}\}$ depend on the classical input to the computation, as well as an offset determined at runtime. The classical input may e.g. be the argument of a function to be evaluated.

(iii) Classical side-processing. The need for classical side-processing in MBQC arises because quantum-mechanical measurement is inherently random. In fact, in the standard scheme [7] of MBQC, every individual local measurement is completely random. This has two consequences. First, the classical output is represented by certain correlations of measurement outcomes; only they can be non-random. Second, to keep the computation on track in the presence of randomness, measurement bases need to be adapted according to outcomes obtained in earlier measurements. This boils down to adjusting the parameters $q_{i}$ .

In summary, both the bitwise output $\textbf{o}=(o_{1},o_{2}..,o_{k})$ and the choice of measurement bases, $\textbf{q}=(q_{1},q_{2},..,q_{N})$ are functions of the measurement outcomes $\textbf{s}=(s_{1},s_{2},..,s_{N})$ . In addition, q is also a function of the classical input $\textbf{i}=(i_{1},i_{2},..,i_{m})$ . Remarkably, in standard MBQC these functional relations are all mod 2 linear,

[TABLE]

Therein, the binary matrix $T$ encodes the temporal order in a given MBQC. If $T_{ij}=1$ then the measurement basis at location $i$ depends on the measurement outcome at location $j$ , hence the qubit at $j$ must be measured before the qubit at $i$ . Therefore, for the measurement events to have a partial (temporal) ordering, the matrix $T$ must be lower triangular w.r.t. a suitable labeling of the qubits.

A.2 MBQC and Mermin’s star

The role of contextuality for measurement-based quantum computation was first noted in the example of Mermin’s star [11]. Here, we review this example.

The state-dependent Mermin star was already discussed in Section 4.5. In the state-dependent version, one of the five contexts of the star is taken up by a quantum state, namely the Greenberger-Horne-Zeilinger (GHZ) state [17]. The four non-local observables in this context, $X_{1}X_{2}X_{3}$ , $X_{1}Y_{2}Y_{3}$ , $Y_{1}X_{2}Y_{3}$ , $Y_{1}Y_{2}X_{3}$ , are stabilizer operators for the GHZ-state. The other four contexts remain for measurement. They are labeled by the elements of the input group $Q=\mathbb{Z}_{2}\times\mathbb{Z}_{2}$ ; See Fig. 5a.

We now describe the objects (i) - (iii) specifying an MBQC with Mermin’s star.

(i)

The resource state is $|GHZ\rangle=(|000\rangle+|111\rangle)/\sqrt{2}$ .

(ii)

The local measurable observables are

[TABLE]

(iii)

There are three qubits, two bits of input, $\textbf{i}=(a,b)$ , and one bit $o$ of output. The temporal order is flat, $T=0$ . The classical side-processing relations Eq. (47) are in this case

[TABLE]

Looping through the possible values for $(a,b)$ , with Eqs. (49j) and (48), Eq. (49a) becomes four equations, one for each value of $(a,b)$ ,

[TABLE]

Therein, $s_{ij}(O)\in\mathbb{Z}_{2}$ is the outcome of the measurement of an observable $O$ with eigenvalues $\pm 1$ only, in the measurement context defined by the input $(i,j)$ .

One may look at Eq. (50) from the quantum mechanical and the HVM angle, which we will do in turn. The GHZ-state satisfies the eigenvalue equations

[TABLE]

Further, since the observables $X_{1}$ , $X_{2}$ and $X_{3}$ pairwise commute and obey the relation $X_{1}X_{2}X_{3}=(X_{1}I_{2}I_{3})(I_{1}X_{2}I_{3})(I_{1}I_{2}X_{3})$ , it holds that $s_{00}(X_{1})+s_{00}(X_{2})+s_{00}(X_{3})\mod 2=s_{00}(X_{1}X_{2}X_{3})$ . With the first of the above eigenvalue equations, $s_{00}(X_{1}X_{2}X_{3})=0$ , we thus have $o(0,0)=0$ with certainty. By the same argument, $o(0,1)=o(1,0)=o(1,1)=1$ . Thus, the quantum mechanical prediction is that the computation described evaluates the function

[TABLE]

This is of significance from the following fundamental point of view. The classical control computer of MBQC by itself is only capable of performing mod 2 addition, cf. Eq. (47). Hence it is not classically universal. If supplemented with quantum resources—GHZ states and the capability to measure local Pauli observables $X_{i}$ , $Y_{i}$ —it can execute OR-gates in addition, and thereby becomes classically universal. The computational power of the control computer is thus significantly boosted.

Let’s now look at Eq. (50) from the perspective of a non-contextual HVM with deterministic value assignments. Non-contextual HVMs with definite value assignments invoke assumption the additional assumption that the “pre-existing” values of measurement outcomes are independent of the measurement context,

[TABLE]

Can there be a consistent non-contextual assignment of values $s(X_{1}),..,s(Y_{3})$ on the r.h.s. of Eq. (50)?—This is quickly ruled out. Substituting Eq. (52) into Eq. (50) and adding the four resulting equations mod 2 leads to the familiar contradiction $1=0$ . Hence a consistent non-contextual HVM value assignment does not exist.

We observe that the above statements about computational power and contextuality do not require the function $o$ to be precisely an OR-gate. The classical control computer is boosted to classical universality whenever the function $o$ is non-linear, i.e., if and only if $\Sigma(o):=o(0,0)+o(0,1)+o(1,0)+o(1,1)\mod 2=1$ . The same relation is an obstruction to the existence of an ncHVM. To summarize, $\Sigma(o)=1$ is both a witness of contextuality and a guarantee for boosting the a priori very limited classical control computer to classical universality.

A.3 Computational output and contextuality

The points made in the last paragraph about the MBQC based on Mermin’s star generalize to all MBQCs that satisfy the classical processing relations Eq. (47); See [12], [13]. When Eq. (47a), which defines the MBQC output, is spelled out for all input values and combined with the ncHVM assumption Eq. (52), those very equations rule out the existence of a corresponding non-contextual HVM. Furthermore, it is the non-linearity of the outputted function (and hence the boost in classical computational power) that represents the obstruction to the existence of non-contextual HVMs.

Our running example of the MBQC based on Mermin’s star misses two aspects of the general case. First, it is temporally flat, i.e., measurement bases are not influenced by the outcomes of measurements on other qubits, and second, it is deterministic. Both of these constraints can be relaxed while keeping the relation with contextuality. We have the following result.

Theorem 7

[13]* Be ${\cal{M}}$ an MBQC with classical processing relations Eq. (47) evaluating a function $o:(\mathbb{Z}_{2})^{m}\longrightarrow\mathbb{Z}_{2}$ . Then, ${\cal{M}}$ is contextual if it succeeds with an average probability $p_{S}>1-d_{H}(o)/2^{m}$ , where $d_{H}(o)$ is the Hamming distance of $o$ from the closest linear function.*

Remark: The lowest contextuality thresholds are reached for bent functions. For $m$ even and $o$ bent, it holds that $d_{H}(o)=2^{m-1}-2^{m/2-1}$ [36], and therefore the contextuality threshold for the average success probability $p_{S}$ approaches $1/2$ for large $m$ . An MBQC can thus be contextual even if its output is very close to completely random.

Appendix B Chain complexes

Throughout the text we work with modules over the ring $\mathbb{Z}_{d}=\{0,1,\cdots,d-1\}$ . A chain complex of modules is a sequence

[TABLE]

such that the composition of any two successive maps gives zero i.e. $\partial\partial=0$ . Homology groups of the chain complex are defined by

[TABLE]

A map $f:{\cal C}\rightarrow{\cal D}$ of chain complexes is a sequence of module maps $C_{n}\rightarrow D_{n}$ which commutes with the differential $\partial$ . Such a map induces a map in homology $f_{*}:H_{n}({\cal C}_{*})\rightarrow H_{n}({\cal D})$ .

Dually, we can consider a cochain complex obtained from a chain complex. This is a sequence

[TABLE]

where $C^{n}$ consists of module maps $\alpha:C_{n}\rightarrow\mathbb{Z}_{d}$ , and $d_{n}$ is defined by $d_{n}(\alpha)(c)=\alpha(\partial_{n+1}(c))$ for all $c$ in $C_{n+1}$ . Similarly we can talk about cohomology groups

[TABLE]

A map $f$ of chain complexes as above induces a map in cohomology $f^{*}:H^{n}({\cal D}^{*})\rightarrow H^{n}({\cal C}^{*})$ in the reverse direction.

A simplicial complex with edges, faces, and volumes… naturally defines a chain complex. The modules $C_{0}$ , $C_{1}$ , $C_{2}$ , $C_{3}$ … in this complex consists of $\mathbb{Z}_{d}$ -linear combinations of labels representing vertices, edges, faces, volumes… Another source for a chain complex is group cohomology. Starting from a single vertex, one builds a space by glueing the boundary of an edge representing an element $g\in G$ . The resulting space is a bouquet of circles where the circles are labelled by the elements of the group. Now continue to glue higher dimensional basic shapes which encode the structure of the group. For each pair of group elements $(g_{1},g_{2})$ glue a triangle whose edges are $g_{1}$ , $g_{2}$ , and $g_{1}g_{2}$ . This process repeats for higher dimensional triangles which corresponds to an $n$ -tuple $(g_{1},g_{2},\cdots,g_{n})$ of group elements so that edges are products of these elements arranged in an organized way. The resulting space is called the classifying space of $G$ . The associated chain complex in dimension $n$ is a module which consists of $\mathbb{Z}_{d}$ -linear combinations of the representatives $[g_{1}|g_{2}|\cdots|g_{n}]$ . The cochain complex consists of $\mathbb{Z}_{d}$ -linear combinations of set maps $G^{n}\rightarrow\mathbb{Z}_{d}$ . It is a standard fact in group cohomology that it suffices to consider non-trivial $n$ -tuples i.e. $g_{i}\not=1$ for all $i$ . This is convenient for computational purposes.

In the text we introduce a complex ${\cal C}_{*}(E)$ constructed from commuting operators which imitates the construction of a classifying space. We will show that why ${\cal C}_{*}(E)$ is a chain complex i.e. $\partial\partial=0$ . The proof is similar to the group cohomology case. Let $[a_{1}|\cdots|a_{n}]$ be a basis element of $C_{n}(E)$ . The $n$ -tuple consists of commuting elements in $E$ . Although our complex consists of dimensions $n=0,1,2,3$ we prove the result for all $n$ . Let us introduce the following maps $d_{0}[a_{1}|\cdots|a_{n}]=[a_{2}|\cdots|a_{n}]$ , $d_{i}[a_{1}|\cdots|a_{n}]=[a_{1}|\cdots|a_{i}+a_{i+1}|\cdots|a_{n}]$ for $1\leq i\leq n-1$ , and $d_{n}[a_{1}|\cdots|a_{n}]=[a_{1}|\cdots|a_{n-1}]$ . Then we can write $\partial=\sum_{i=0}^{n}(-1)^{n}d_{i}$ . As a preliminary observation one checks that by definition $d_{i}d_{j}=d_{j-1}d_{i}$ for $i\leq j-1$ . Using this

[TABLE]

In the last sum we set $k=j-1$ hence $0\leq k\leq n-1$ . The first sum is indexed over $\{0\leq i\leq n-1\text{ and }0\leq k\leq n-1|\;i\leq k\}$ and the second one is indexed over $\{0\leq i\leq n-1\text{ and }0\leq j\leq n|\;i\geq j\}$ . Note that these sets are the same. Therefore two sums cancel each other when corresponding terms with different signs are matched together111As an example consider $\partial\partial:C_{2}\rightarrow C_{0}$ . In this case the first sum is $-d_{0}d_{1}+d_{0}d_{2}-d_{1}d_{2}$ and the second sum is $d_{0}d_{0}-d_{1}d_{0}+d_{1}d_{1}$ . .

Appendix C A converse of Lemma 7

Lemma 7 has a converse if the symmetry group $G$ is large enough. We start with an observation which will lead to a structural relation between the symmetry group and the chain complex. Recall the definition of the sub-complex $V=\{\alpha\in C^{1}|\;d^{v}\alpha=0\}$ . In particular, this is an abelian group under addition. We define an action of $V$ on the set of operators by

[TABLE]

for each $\alpha\in V$ . Note that this gives a group action since $(\alpha+\alpha^{\prime})(T_{a})=\omega^{\alpha(a)+\alpha^{\prime}(a)}T_{a}=\alpha(\alpha^{\prime}(T_{a}))$ . It also satisfies Eq. (29). Therefore this is a symmetry of the system. We can regard this symmetry as a group homomorphism

[TABLE]

which is in fact injective. We identify $V$ as a subgroup of $\text{Sym}({\cal O})$ .

In general given a symmetry group $G$ we defined $N$ as the subgroup which fixes each edge:

[TABLE]

Given a symmetry associated to the homomorphism $\xi:G\rightarrow\text{Sym}({\cal O})$ in Eq. (28) the image $\xi(N)$ of $N$ lies inside $V$ . That is the restriction of $\xi$ to $N$ gives a group homomorphism

[TABLE]

which sends $n$ to $\tilde{\Phi}_{n}$ .

Lemma 10

Assume that $\xi:G\rightarrow\text{Sym}({\cal O})$ is injective and $\xi|_{N}:N\rightarrow V$ is an isomorphism. Then $G$ splits as $Q\ltimes N$ if and only if $\sigma([\Phi])=0$ .

Proof of Lemma 10. If $G$ splits then $\sigma([\Phi])=0$ is proved in Lemma 7. For the converse assume that $\sigma([\Phi])=0$ that is there exists a $\chi:Q\longrightarrow V$ such that

[TABLE]

Next, corresponding to the map $\theta:Q\longrightarrow G$ we define a new map $\hat{\theta}$ via $\hat{\theta}(q)=\theta(q)n_{q}$ , where $n_{q}\in N$ is such that $n_{q}(T_{a})=\omega^{\chi_{q}(a)}T_{a}$ , for all $q\in Q$ and all $a\in E$ . Under the assumption that $N\cong V$ , such an $n_{q}$ exists for all $q\in Q$ .

Now, the action of $\hat{\theta}(Q)$ on the $T_{a}$ is

[TABLE]

where $\Phi^{\prime\prime}:=\Phi^{\prime}+\chi$ . We can now show that $\hat{\theta}(pq)=\hat{\theta}(p)\hat{\theta}(q)$ , namely

[TABLE]

Therein, in the third line we have used Eq. (53). Thus $G=Q\ltimes N$ . $\Box$

Bibliography36

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] S. Kochen and E. P. Specker, The problem of hidden variables in quantum mechanics , J. Math. Mech. 17, 59 (1967).
2[2] J. S. Bell, On the Einstein Podolsky Rosen Paradox , Physics 1, No. 3, 1995 (1964).
3[3] N. D. Mermin, Hidden variables and the two theorems of John Bell , Rev. Mod. Phys. 65 , 803 (1993).
4[4] Samson Abramsky and Adam Brandenburger, The Sheaf-Theoretic Structure Of Non-Locality and Contextuality , New J. Phys. 13 , 113036 (2011).
5[5] Adan Cabello, Simone Severini, Andreas Winter, Graph-Theoretic Approach to Quantum Correlations , Phys. Rev. Lett. 112 , 040401 (2014).
6[6] S. Bravyi and A. Kitaev, Universal quantum computation with ideal Clifford gates and noisy ancillas , Phys. Rev. A 71 , 022316 (2005).
7[7] R. Raussendorf and H.J. Briegel, A one-way quantum computer , Phys. Rev. Lett. 86 , 5188 (2001).
8[8] M. Howard, J.J. Wallman, V. Veitch, J. Emerson, Contextuality supplies the ‘magic’ for Quantum Computation , Nature (London) 510 , 351 (2014).

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Topological proofs of contextuality in quantum mechanics

Abstract

1 Introduction

2 First example

3 Measurement and contextuality

3.1 Observables

3.2 Definition of contextuality

Definition 1

Lemma 1

4 Parity-based contextuality proofs

4.1 The chain complex C∗{\cal{C}}_{*}C∗​

4.2 β\betaβ is a 2-cocycle

4.3 Cohomological formulation of parity-based contextuality proofs

Lemma 2

Theorem 1

4.4 Squaring the star

4.5 State-dependent parity proofs

Definition 2

Theorem 2

5 Symmetry-based proofs of contextuality

5.1 First example based on Mermin’s square

5.2 The symmetries of O{\cal{O}}O

Lemma 3

Lemma 4

5.3 The general state-independent case

Lemma 5

5.4 Topological formulation

Lemma 6

Theorem 3

5.5 Relation between parity-based and symmetry-based proofs

Corollary 1

5.6 Contextuality and the group extension problem

Theorem 4

Lemma 7

5.7 State-dependent contextuality proofs based on symmetry

Lemma 8

Lemma 9

Theorem 5

Theorem 6

6 Conclusion

Acknowledgments.

Appendix A Contextuality in measurement-based quantum computation

A.1 Quantum computation by local measurement

A.2 MBQC and Mermin’s star

A.3 Computational output and contextuality

Theorem 7

Appendix B Chain complexes

Appendix C A converse of Lemma 7

Lemma 10

4.1 The chain complex ${\cal{C}}_{*}$

4.2 $\beta$ is a 2-cocycle

5.2 The symmetries of ${\cal{O}}$