Phase space simulation method for quantum computation with magic states on qubits

Robert Raussendorf, Juani Bermejo-Vega, Emily Tyhurst, Cihan Okay, and Michael Zurel

arXiv:1905.05374·quant-ph·March 10, 2020

TL;DR

This paper introduces a classical simulation method for qubit quantum systems using quasiprobability distributions, enabling efficient simulation of certain quantum computations and extending previous results to all finite dimensions.

Contribution

It generalizes simulation techniques to all finite dimensions, including qubits, and introduces a robustness measure for simulation cost, surpassing stabilizer-based methods.

Findings

01 Efficient classical simulation for states with a non-negative quasiprobability distribution.

02 Extension of the simulation to negative quasiprobability distributions, where it provides probability estimation.

03 Identification of states outside the stabilizer polytope that are still efficiently simulable.

Tables

Table 2: Number of points in phase space as a function of $\{m\}$.

$m$	$0$	$1$	$\{1,2\}$
2 rebits	24	72	120
2 qubits	60	240	432

Table 3: Volume fraction of state space filled by the positively representable states, as a function of $\{m\}$; (top) two rebits, (bottom) two qubits. The volume fraction $V_{+}/V$ was obtained numerically, by sampling $10^{6}$ random states according to the Fubini-Study measure for pure states (second row) and the Hilbert-Schmidt measure for mixed states (third row). The first column, $m=0$, describes mixtures of stabilizer states, and the last column hyper-octahedral states Rall for comparison.

$m$	$0$	$1$	$\{1,2\}$	hy.oct.
$V_{+}/V$ [pure]	0	1	1	0
$V_{+}/V$ [mixed]	0.144	1	1	0.924

Table 4: Robustness values of selected magic states. For robustness of magic ($\mathfrak{R}_{S}$), also see Heinrich.

state	$\mathfrak{R}$	$\mathfrak{R}_{S}$
$|H\rangle^{\otimes 2}$	1.0	1.7472
$|T\rangle^{\otimes 2}$	1.0	2.23205
$|H\rangle^{\otimes 3}$	1.283	2.2189
$|T\rangle^{\otimes 3}$	1.385	3.09807
$|\text{Hoggar}\rangle$	1.80	3.8000

Equations

$$T_{a}=e^{i\phi(a)}X(a_{X})Z(a_{Z}),\quad\forall a=(a_{X},a_{Z})\in E:=\mathbb{Z}_{d}^{2n}.$$

$$A_{\Omega}^{\gamma}:=\frac{1}{d^{n}}\sum_{b\in\Omega}\omega^{\gamma(b)}T_{b},$$

$$\omega^{\gamma(0)}T_{0}=I.$$

$$\rho=\sum_{(\Omega,\gamma)\in{\cal{V}}}W_{\rho}(\Omega,\gamma)A_{\Omega}^{\gamma}.$$

$$T_{a}T_{b}=\omega^{\beta(a,b)}T_{a+b},\quad\forall a,b\in E:\;[T_{a},T_{b}]=0.$$

$$[a,b]:=a_{X}b_{Z}-a_{Z}b_{X}\;\text{mod}\;d,$$

$$\beta(a,b)+\beta(a+b,c)-\beta(b,c)-\beta(a,b+c)=0\;\text{mod}\;d,$$

$$\beta(a,b)=\beta(b,a),\quad\forall a,b\;\text{with}\;[a,b]=0.$$

$$a,b\in\Omega\wedge[a,b]=0\Longrightarrow a+b\in\Omega.$$

$$\gamma(a)+\gamma(b)-\gamma(a+b)=\beta(a,b),$$

$$d\beta(a,b,c):=\beta(a,b)+\beta(a+b,c)-\beta(b,c)-\beta(a,b+c).$$

$$d\gamma(a,b):=\gamma(a)+\gamma(b)-\gamma(a+b),$$

$$A_{E}^{\gamma}=\frac{1}{d^{n}}\sum_{a\in E}\omega^{\gamma(a)}T_{a},$$

$$V=\mathbb{Z}_{d}^{2n}\quad\text{(for odd $d$)}.$$

$$\Omega_{0}=\{0,x,y,z\},$$

$$\rho(x,y)=\frac{1}{4}I_{12}+x(X_{1}X_{2}+Z_{1}Z_{2}-Y_{1}Y_{2})+y(Z_{1}+Z_{2}),$$

$$|H(\phi)\rangle:=(|0\rangle+e^{-i\phi}|1\rangle)/\sqrt{2}$$

$$\Omega=\bigcup_{k=1}^{\xi}I_{k}$$

$$\begin{array}{rcl}C_{2j-1}&=&I_{1..j-1}X_{j}Z_{j+1}Z_{j+2}..Z_{m-1}Z_{m},\\ C_{2j}&=&I_{1..j-1}Y_{j}Z_{j+1}Z_{j+2}..Z_{m-1}Z_{m},\end{array}$$

$$C_{2m+1}=Z_{1}Z_{2}\cdots Z_{m-1}Z_{m}.$$

$$\Omega=\bigcup_{k=1}^{\xi}\langle p_{k},\tilde{I}\rangle$$

$$[p_{1},p_{2}]=[p_{1},p_{3}]=[p_{1},p_{4}]=[p_{2},p_{3}]=[p_{3},p_{4}]=0,\quad[p_{2},p_{4}]=1$$

$$[p_{1},p_{2}]=[p_{1},p_{3}]=[p_{2},p_{3}]=[p_{3},p_{4}]=0,\quad[p_{1},p_{4}]=[p_{2},p_{4}]=1.$$

$$[p_{1},p_{2}]=[p_{1},p_{3}]=[p_{2},p_{3}]=0,\quad[p_{1},p_{4}]=[p_{2},p_{4}]=[p_{3},p_{4}]=1.$$

$$\begin{array}{rcl}\{p_{1},\dots,p_{\xi}\}&=&\{p_{1,1},p_{1,2},\dots,p_{1,\xi_{1}}\}\cup\\ &&\{p_{2,1},p_{2,2},\dots,p_{2,\xi_{2}}\}\cup\cdots\cup\\ &&\{p_{\pi,1},p_{\pi,2},\dots,p_{\pi,\xi_{\pi}}\}\end{array}$$

$$[c,a_{k}]=1,\quad 1\leq k\leq 2m$$

$$I_{x}=\langle x,\tilde{I}\rangle,\quad I_{y}=\langle y,\tilde{I}\rangle,\quad I_{z}=\langle z,\tilde{I}\rangle,$$

$$A_{\Omega_{0}}^{\gamma_{0}}\otimes A_{\tilde{I}}^{\tilde{\gamma}}=A_{\Omega_{xyz}}^{\gamma},$$

$$\Omega\times a:=\Omega_{a}\cup\{a+b\mid b\in\Omega_{a}\},\quad\forall a\not\in\Omega.$$

$$\gamma\times s_{a}(b)$$

Full text

Phase space simulation method for quantum computation with magic states on qubits

$\text{Robert Raussendorf}^{1,2}$ , $\text{Juani Bermejo-Vega}^{3}$ , $\text{Emily Tyhurst}^{4}$ , $\text{Cihan Okay}^{1,2}$ , and $\text{Michael Zurel}^{1}$

1: Department of Physics & Astronomy, University of British Columbia, Vancouver, BC V6T1Z1, Canada

2: Stewart Blusson Quantum Matter Institute, University of British Columbia, Vancouver, BC, Canada

3: Freie Universität Berlin, Berlin, Germany

4: Department of Physics, University of Toronto, Toronto, ON, Canada

Abstract

We propose a method for classical simulation of finite-dimensional quantum systems, based on sampling from a quasiprobability distribution, i.e., a generalized Wigner function. Our construction applies to all finite dimensions, with the most interesting case being that of qubits. For multiple qubits, we find that quantum computation by Clifford gates and Pauli measurements on magic states can be efficiently classically simulated if the quasiprobability distribution of the magic states is non-negative. This provides the so far missing qubit counterpart of the corresponding result [V. Veitch et al., New J. Phys. 14, 113011 (2012)] applying only to odd dimension. Our approach is more general than previous ones based on mixtures of stabilizer states. Namely, all mixtures of stabilizer states can be efficiently simulated, but for any number of qubits there also exist efficiently simulable states outside the stabilizer polytope. Further, our simulation method extends to negative quasiprobability distributions, where it provides probability estimation. The simulation cost is then proportional to a robustness measure squared. For all quantum states, this robustness is smaller than or equal to robustness of magic.

I Introduction

How to mark the classical-to-quantum boundary is a question that dates back almost to the beginning of quantum theory. Ehrenfest’s theorem Ehrenfest provides an early insight, and the Einstein-Podolsky-Rosen paradox EPR and Schrödinger’s cat Scat are two early puzzles. The advent of quantum computation Feyn –Deutsch added a computational angle: When does it become hard to simulate a quantum mechanical computing device on a classical computer? Which quantum mechanical resource do quantum computers harness to generate a computational speedup?

One instructive computational model is quantum computation with magic states (QCM) BK . In QCM, both “traditional” indicators of quantumness (developed in the fields of quantum optics and foundations of quantum mechanics) and a computational indicator can be applied. From quantum optics and foundations, the indicators are the negativity of a Wigner function Wig –Buz , and the breakdown of non-contextual hidden variable models Bell –Merm . Computer science is concerned with the breakdown of efficient classical simulation.

In the particular setting of QCM, an important distinction arises between the cases of even and odd local Hilbert space dimension $d$ . If $d$ is odd, then all three of the above indicators for the classical-to-quantum boundary align NegWi –Howard . This is a very satisfying situation: the physicist, the philosopher and the computer scientist can have compatible notions of what is “quantum”.

In even local dimension, the situation differs starkly. Non-contextual hidden variable models for QCM are not viable regardless of computational power Howard , which voids the foundational indicator, and furthermore obstructs the view of contextuality as a computational resource. Also, the multi-qubit Wigner functions constructed to date do not support efficient classical simulation of QCM by sampling over phase space. Thus, the physics and computer-science based criteria for classicality differ, which is an unsatisfactory state of affairs compared to odd $d$ . The purpose of this paper is to align the perspectives of the physicist and the computer scientist on the classical-to-quantum transition in QCM on qubits.

To prepare for the subsequent discussion, we provide a short summary of QCM, and the role of the Wigner function in it. Quantum computation with magic states operates with a restricted set of instructions, the Clifford gates. These are unitary operations defined by the property that they map all Pauli operators onto Pauli operators under conjugation. Clifford gates are not universal, and, in fact, can be efficiently classically simulated Goma . This operational restriction is compensated for by invoking the “magic” states, which are special quantum states that cannot be created by Clifford gates and Pauli measurements. Suitable magic states restore quantum computational universality; and in fact QCM is a leading paradigm for fault-tolerant universal quantum computation. In sum, computational power is transferred from the quantum gates to the magic states, and one is thus led to ask: Which quantum properties give the magic states their computational power?

One such property is, for odd $d$ at least, negativity in the Wigner function. A quantum speedup can arise only if the Wigner function of the magic states assumes negative values. If, to the contrary, the Wigner function is positive, then the whole quantum computation can be efficiently classically simulated NegWi ,Mari . Further, a positive Wigner function is, for $n\geq 2$ quantum systems, equivalent to the existence of a non-contextual hidden variable model Howard , Delf2 . Both Wigner function negativity and contextuality of the magic states are therefore necessary quantum computational resources.

As we noted, this picture only applies if the local Hilbert space dimension is odd. This excludes the full multi-qubit case, which arguably is the most important. Approaches to the qubit case have been made, e.g. through the rebit scenario ReWi and multi-qubit settings with operational restrictions QuWi , Bermejo , or by invoking a Wigner function over Grassmann variables Love , or multiple Wigner functions at once Galvao . Common to these approaches is that, unlike for odd $d$ NegWi , they do not efficiently simulate the evolution under general Clifford gates and Pauli measurements by sampling, a.k.a. weak simulation VdN1 –BT .

An alternative approach to weak simulation is by defining a quasiprobability function over stabilizer states BK , RoM , Pashayan , bypassing Wigner functions. It has the advantage of efficiently simulating all Clifford circuits on positively represented states. For multi-qubit systems, it has so far been unknown how the stabilizer method relates to Wigner functions, but we clarify the relation here.

In this paper, we provide the thus far missing phase space picture for QCM on multi-qubit systems. Central to our discussion is a new quasi-probability function defined for all local Hilbert space dimensions $d$ and all numbers of subsystems $n$ . When applied to odd $d$ , it reproduces the known finite-dimensional adaptation Gross –Woott of the original Wigner function Wig ; but for even $d$ , in particular $d=2$ , it is different. Then, this quasiprobability function requires a phase space of increased size, in accordance with KarWalBart . Even in $d=2$ , the positivity of this quasiprobability is preserved under all Pauli measurements. This property is crucial for the efficient classical simulation of QCM on positively represented states. Also, this simulation contains the efficient classical simulation BK of stabilizer mixtures as a special case. We thus reproduce the essential features of the odd-dimensional scenario in $d=2$ .

Starting from the definition of the quasiprobability function $W$ , we treat the following subjects: characterization of phase space for $d=2$ , preservation of positivity of $W$ under Pauli measurements, covariance of $W$ under all Clifford unitaries, efficient classical simulation of QCM for $W\geq 0$ , relation to the qubit stabilizer formalism, hardness of classical simulation for $W<0$ , and a monotone under the free operations.

In summary, we arrive at a description that resembles the corresponding scenario in odd local dimension. Namely, negativity in the quasiprobability distribution $W$ for the initial magic state is a necessary precondition for quantum speedup. However, one difference between even and odd $d$ remains. In odd $d$ , every positive Wigner function is also a non-contextual hidden variable model. This is not so for even $d$ , due to the phenomenon of state-independent contextuality among Pauli observables.

II Results and outline

II.1 Summary of results

This paper addresses the full $n$ -qubit case of quantum computation with magic states, from the perspectives of the classical-to-quantum transition and quantum computational resources. For the case of local dimension $d=2$ we closely reproduce the relations between Wigner function and efficient classical simulation existing in odd $d$ . Central to our discussion is a novel quasiprobability function $W$ defined for all local Hilbert space dimensions $d$ . It has the following general properties:

(i) For all $n$ and $d$ , $W$ is Clifford-covariant and positivity-preserving under Pauli measurements.

(ii) If the local Hilbert space dimension $d$ is even, $W_{\rho}$ is non-unique for any given quantum state $\rho$ . The set of phase point operators corresponding to $W$ is over-complete.

(iii) If $d$ is odd and $n\geq 2$ , then $W$ reduces to the standard Wigner function Gross , Gross2 for odd finite dimension.

(iv) For all $n$ and $d$ , the stabilizer formalism is contained as a special case. All stabilizer states can be positively represented by $W$ , and efficiently updated under Clifford operations.

(v) The present description goes beyond the stabilizer formalism. In particular, for $d=2$ , for every number $n$ of qubits there exist non-mixtures of stabilizer states which are positively represented by $W$ . Furthermore, for any quantum state $\rho$ , the 1-norm of the optimal $W_{\rho}$ is smaller than or equal to the robustness of magic $\mathfrak{R}_{S}(\rho)$ . (Both robustness measures are instances of sum negativity Pashayan .)

The following properties of $W$ for special values of $n$ (and $d=2$ ) are also worth noting. (a) The Eight-state model EightState is a special case of $W$ , namely for $n=1$ . (b) For Mermin’s square Merm , the present simulation algorithm saturates the lower bound MemCo on the memory cost of classical simulation. (c) Up to two copies of magic $T$ and $H$ states are positively represented by $W$ .

We establish the following main results:

(I) The set of states positively represented by $W$ is closed under Pauli measurement (Theorem 2 in Section V).

(II) If a quantum state $\rho$ has a non-negative function $W_{\rho}$, and $W_{\rho}$ can be efficiently sampled, then, for every Clifford circuit applied to $\rho$, the corresponding measurement statistics can be efficiently sampled (Theorem 3 in Section VI). In this sense, $W\geq 0$ leads to efficient classical simulation of the corresponding quantum computation.

(III) For $d=2,n\geq 2$, the $n$-system phase space has a more complicated structure than in the case of odd $d$, reflecting the fact that the phase point operators are dependent. The points in generalized multi-qubit phase space are classified (Theorem 1 in Section IV).

(IV) There exists a robustness measure $\mathfrak{R}$ which bounds the hardness of classical simulation of quantum computation with magic states, when $W_{\rho_{\text{init}}}<0$ for the initial state $\rho_{\text{init}}$. $\mathfrak{R}$ is less than or equal to the robustness of magic (Lemma 9), and a monotone under Clifford unitaries and Pauli measurements (Theorem 4 in Section VII).

II.2 Outline

The remainder of this paper is organized as follows. In Section III we define a quasiprobability function $W$ . We show that it reduces to Gross’ Wigner function Gross whenever the local Hilbert space dimension $d$ is odd, but, more importantly, is different in even dimension. Specifically, $W$ represents all quantum states redundantly for even $d$ , which enables Clifford covariance and positivity preservation under Pauli measurement. In Section IV we analyze the structure of the phase space on which $W$ lives, for the case of multiple qubits. In particular, we classify the points of phase space. We also clarify the relation to the qubit stabilizer states and their mixtures.

In Sections V and VI we turn to dynamics. In Section V we discuss the update of $W$ under Pauli measurement, and in Section VI the efficient classical simulation of QCM for positive $W$ .

In Section VII we address the case of $W_{\rho}<0$ . We discuss hardness of classical simulation, as well as the elements of a resource theory based on $W$ .

In Section VIII we discuss the extent to which the quasiprobability function $W$ satisfies the Stratonovich-Weyl criteria, and its relation to hidden variable models. We conclude in Section IX.

III The quasiprobability function

In this section we introduce the generalized $n$ -qudit phase space ${\cal{V}}$ , for any local Hilbert space dimension $d$ , and a quasi-probability distribution $W:{\cal{V}}\longrightarrow\mathbb{R}$ living on it. In Section III.1 we define the phase point operators corresponding to $W$ , and in Section III.2 identify a minimal set of them. Section III.3 reveals the cohomological underpinning of our construction, which links the present subject to parity proofs of quantum contextuality Coho and contextuality in measurement-based quantum computation CohoMBQC .

III.1 Generalized phase space

We choose a phase convention for the Pauli operators,

$$T_{a}=e^{i\phi(a)}X(a_{X})Z(a_{Z}),\quad\forall a=(a_{X},a_{Z})\in E:=\mathbb{Z}_{d}^{2n}.\tag{1}$$

Therein, the function $\phi:E\longrightarrow\mathbb{R}$ has to satisfy the constraint that $(T_{a})^{d}=I$ , for all $a\in E$ . As a consequence of this condition, all eigenvalues of the operators $T_{a}$ are of the form $\omega^{k}$ , $k\in\mathbb{N}$ , with $\omega:=\exp(2\pi i/d)$ .

We now proceed to the definition of the phase point operators. We consider a subset $\Omega$ of $E$ , and a function $\gamma:\Omega\longrightarrow\mathbb{Z}_{d}$ , both subject to additional constraints that will be specified in Definitions 2–4 below. The pair $(\Omega,\gamma)$ specifies a corresponding phase point operator $A_{\Omega}^{\gamma}$ ,

$$A_{\Omega}^{\gamma}:=\frac{1}{d^{n}}\sum_{b\in\Omega}\omega^{\gamma(b)}T_{b},\tag{2}$$

with the constraint that

$$\omega^{\gamma(0)}T_{0}=I.\tag{3}$$

When comparing Eq. (2) to the phase point operators of the previously discussed qudit NegWi , rebit ReWi and restricted qubit QuWi cases, we note that the overall structure remains the same. In this case, the sets $\Omega$ are an additional varying parameter, and the phase space thereby becomes larger.

Based on the phase point operators $A_{\Omega}^{\gamma}$ of Eq. (2), we introduce the counterpart to the Wigner function that applies to our setting. The generalized phase space ${\cal{V}}$ consists of all admissible pairs $(\Omega,\gamma)$ , to be specified below. Any $n$ -system quantum state $\rho$ can be expanded in terms of a function $W_{\rho}:{\cal{V}}\longrightarrow\mathbb{R}$ ,

$$\rho=\sum_{(\Omega,\gamma)\in{\cal{V}}}W_{\rho}(\Omega,\gamma)A_{\Omega}^{\gamma}.\tag{4}$$

The reason for imposing Eq. (3) is that it implies $\text{Tr}A_{\Omega}^{\gamma}=1$ , for all $(\Omega,\gamma)\in{\cal{V}}$ . Hence, $W$ defined in Eq. (4) is a quasiprobability distribution. As we see shortly, it generalizes the Wigner function Gross for odd-dimensional qudits to qubits.
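Spelled out, the normalization follows in two steps. The derivation below is a sketch that assumes $0\in\Omega$ and that every $T_{b}$ with $b\neq 0$ is traceless (a standard property of Pauli operators that the text does not state explicitly):

$$\text{Tr}A_{\Omega}^{\gamma}=\frac{1}{d^{n}}\sum_{b\in\Omega}\omega^{\gamma(b)}\,\text{Tr}\,T_{b}=\frac{1}{d^{n}}\,\text{Tr}\big(\omega^{\gamma(0)}T_{0}\big)=\frac{1}{d^{n}}\,\text{Tr}\,I=1,$$

$$1=\text{Tr}\rho=\sum_{(\Omega,\gamma)\in{\cal{V}}}W_{\rho}(\Omega,\gamma)\,\text{Tr}A_{\Omega}^{\gamma}=\sum_{(\Omega,\gamma)\in{\cal{V}}}W_{\rho}(\Omega,\gamma).$$

The values of $W_{\rho}$ therefore sum to one, while individual values may be negative.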

We note that when $d$ is even, the quasiprobability distribution $W_{\rho}$ is non-unique because the set of phase point operators of Eq. (2) is overcomplete.

Definition 1

An $n$ -qudit quantum state $\rho$ is positively representable if it can be expanded in the form of Eq. (4), with $W_{\rho}(\Omega,\gamma)\geq 0$ , for all $(\Omega,\gamma)\in{\cal{V}}$ .

The efficient classical simulation algorithm described in Section VI applies to positively representable quantum states $\rho$ . The non-uniqueness of $W_{\rho}$ allows for more positively representable states than prior quasiprobability representations.

We now turn to the properties of admissible sets $\Omega$ and functions $\gamma$ that define points in the phase space ${\cal{V}}$ . To begin, we define a function $\beta$ which encodes how translation operators on phase space compose,

$$T_{a}T_{b}=\omega^{\beta(a,b)}T_{a+b},\quad\forall a,b\in E:\;[T_{a},T_{b}]=0.\tag{5}$$

We further define the symplectic product

$$[a,b]:=a_{X}b_{Z}-a_{Z}b_{X}\;\text{mod}\;d,\tag{6}$$

and hence $[a,b]=0\;\Longleftrightarrow\;[T_{a},T_{b}]=0$ .
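For qubits ($d=2$) this equivalence can be checked by brute force. The following minimal sketch compares the symplectic product $a_{X}\cdot b_{Z}-a_{Z}\cdot b_{X}$ (mod 2) with explicit matrix commutation for all pairs of two-qubit labels. The helper names (`matmul`, `kron`, `pauli`, `symplectic`) are illustrative and not taken from the paper, and the phase $e^{i\phi(a)}$ is omitted because it does not affect commutation.

```python
from itertools import product

I2 = [[1, 0], [0, 1]]
X = [[0, 1], [1, 0]]
Z = [[1, 0], [0, -1]]

def matmul(A, B):
    """Plain matrix product of two nested-list matrices."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]

def kron(A, B):
    """Kronecker (tensor) product of two nested-list matrices."""
    return [[A[i][j] * B[k][l]
             for j in range(len(A[0])) for l in range(len(B[0]))]
            for i in range(len(A)) for k in range(len(B))]

def pauli(a):
    """X(a_X) Z(a_Z) for a label a = (a_X, a_Z); the phase is dropped."""
    op = [[1]]
    for x, z in zip(*a):
        site = X if x else I2
        if z:
            site = matmul(site, Z)
        op = kron(op, site)
    return op

def symplectic(a, b, d=2):
    """[a, b] = a_X.b_Z - a_Z.b_X (mod d)."""
    (ax, az), (bx, bz) = a, b
    dot = lambda u, v: sum(p * q for p, q in zip(u, v))
    return (dot(ax, bz) - dot(az, bx)) % d

n = 2
bits = list(product((0, 1), repeat=n))
labels = [(ax, az) for ax in bits for az in bits]  # all of E = Z_2^{2n}

# The symplectic product vanishes exactly when the operators commute.
agree = all(
    (symplectic(a, b) == 0)
    == (matmul(pauli(a), pauli(b)) == matmul(pauli(b), pauli(a)))
    for a in labels for b in labels
)
print(agree)
```

Working with integer matrices keeps the comparison exact, so no floating-point tolerance is needed.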

The function $\beta$ satisfies the relation

$$\beta(a,b)+\beta(a+b,c)-\beta(b,c)-\beta(a,b+c)=0\;\text{mod}\;d,\tag{7}$$

for $a,b,c\in E$ . We state this relation for later reference. It is a consequence of the associativity of operator multiplication. Consider the operator product $T_{a}T_{b}T_{c}=T_{a}(T_{b}T_{c})=(T_{a}T_{b})T_{c}$ , and expand $T_{a}(T_{b}T_{c})=\omega^{\beta(a,b+c)+\beta(b,c)}T_{a+b+c}$ , $(T_{a}T_{b})T_{c}=\omega^{\beta(a,b)+\beta(a+b,c)}T_{a+b+c}$ . Comparing the two equivalent expressions yields Eq. (7).
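For qubits, this relation can be verified numerically once a phase convention is fixed. The sketch below assumes $\phi(a)=\frac{\pi}{2}\,a_{X}\cdot a_{Z}$, i.e. $T_{a}=i^{a_{X}\cdot a_{Z}}X(a_{X})Z(a_{Z})$, so that every $T_{a}$ is a tensor product of $I$, $X$, $Y$, $Z$. This convention and all helper names are assumptions made for illustration; the paper's own choice of $\phi$ may differ. Under this convention $T_{a}T_{b}=i^{e}\,T_{a+b}$ with $e=a_{X}\cdot a_{Z}+b_{X}\cdot b_{Z}-(a+b)_{X}\cdot(a+b)_{Z}+2\,a_{Z}\cdot b_{X}$, and $\beta=e/2$ for commuting pairs.

```python
from itertools import product

def dot(u, v):
    """Integer (not mod 2) inner product of two bit tuples."""
    return sum(p * q for p, q in zip(u, v))

def add(a, b):
    """Sum of two labels in E = Z_2^{2n}."""
    return tuple(tuple((p + q) % 2 for p, q in zip(u, v))
                 for u, v in zip(a, b))

def commute(a, b):
    """True iff the symplectic product [a, b] vanishes mod 2."""
    return (dot(a[0], b[1]) - dot(a[1], b[0])) % 2 == 0

def beta(a, b):
    """beta(a, b) in Z_2 with T_a T_b = (-1)^beta T_{a+b}, commuting a, b.

    Assumed convention: T_a = i^{a_X.a_Z} X(a_X) Z(a_Z).
    """
    assert commute(a, b)
    c = add(a, b)
    # T_a T_b = i^e T_{a+b}; e is even whenever a and b commute.
    e = (dot(*a) + dot(*b) - dot(*c) + 2 * dot(a[1], b[0])) % 4
    return e // 2

n = 2
bits = list(product((0, 1), repeat=n))
E = [(ax, az) for ax in bits for az in bits]

# Cocycle condition on all pairwise commuting triples (a, b, c).
cocycle_ok = all(
    (beta(a, b) + beta(add(a, b), c) - beta(b, c) - beta(a, add(b, c))) % 2 == 0
    for a in E for b in E for c in E
    if commute(a, b) and commute(a, c) and commute(b, c)
)
print(cocycle_ok)
```

As a spot check, `beta` evaluates to 1 on the pair $(X_{1}X_{2},Z_{1}Z_{2})$, reflecting $X_{1}X_{2}\cdot Z_{1}Z_{2}=-Y_{1}Y_{2}$.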

Then, it follows straightforwardly from the definition Eq. (5) of $\beta$ that

$$\beta(a,b)=\beta(b,a),\quad\forall a,b\;\text{with}\;[a,b]=0.\tag{8}$$

We constrain $\Omega$ by the following definitions:

Definition 2

A set $\Omega\subset E$ is closed under inference if it holds that

$$a,b\in\Omega\wedge[a,b]=0\Longrightarrow a+b\in\Omega.\tag{9}$$

The motivation for this definition is that if $T_{a}$ and $T_{b}$ can be simultaneously measured, then the value of $T_{a+b}$ can be inferred from the measurement outcomes, through relation (5). A consequence of the closedness under inference is that $0\in\Omega$ for all closed sets $\Omega$ .
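To make the definition concrete for qubits, the sketch below computes the smallest set that contains a given seed and is closed under inference, by repeatedly adding $a+b$ for commuting $a,b$. The function name `close_under_inference` and the label encoding $(a_{X},a_{Z})$ as bit tuples are illustrative choices, not notation from the paper.

```python
def add(a, b):
    """Sum of two labels in E = Z_2^{2n}."""
    return tuple(tuple((p + q) % 2 for p, q in zip(u, v))
                 for u, v in zip(a, b))

def commute(a, b):
    """True iff a_X.b_Z - a_Z.b_X vanishes mod 2."""
    dot = lambda u, v: sum(p * q for p, q in zip(u, v))
    return (dot(a[0], b[1]) - dot(a[1], b[0])) % 2 == 0

def close_under_inference(seed):
    """Smallest superset of `seed` that is closed under inference (d = 2)."""
    omega = set(seed)
    grew = True
    while grew:
        grew = False
        for a in list(omega):
            for b in list(omega):
                c = add(a, b)
                if commute(a, b) and c not in omega:
                    omega.add(c)
                    grew = True
    return omega

# Two-qubit labels written as (a_X, a_Z).
XI, IX = ((1, 0), (0, 0)), ((0, 1), (0, 0))
ZI, IZ = ((0, 0), (1, 0)), ((0, 0), (0, 1))

small = close_under_inference({XI, IX})          # isotropic: 0, XI, IX, XX
large = close_under_inference({XI, IX, ZI, IZ})  # identity plus Mermin's square
print(len(small), len(large))
```

Because every label commutes with itself and $a+a=0$, the zero label is added automatically for any non-empty seed, which matches the remark that $0\in\Omega$ for closed sets.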

Definition 3

A set $\Omega\subset E$ is non-contextual if there exists a value assignment $\gamma:\Omega\longrightarrow\mathbb{Z}_{d}$ that satisfies the condition

$$\gamma(a)+\gamma(b)-\gamma(a+b)=\beta(a,b),\tag{10}$$

for all $a,b\in\Omega$ , and $[a,b]=0$ .

To motivate the nomenclature, if the set $\Omega\subset E$ is non-contextual per the above definition, then it does not admit a parity-based contextuality proof Coho . Namely, Eq. (10) represents the constraints on non-contextual value assignments $\gamma$ that result from the operator constraints Eq. (5). If these constraints can be satisfied, then there is no parity-based contextuality proof.
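For small qubit examples, non-contextuality can be tested by exhaustive search over candidate value assignments. The sketch below is a brute-force illustration, not the paper's method. It assumes the phase convention $T_{a}=i^{a_{X}\cdot a_{Z}}X(a_{X})Z(a_{Z})$ and imposes the constraint $\gamma(a)+\gamma(b)-\gamma(a+b)=\beta(a,b)$ on commuting pairs whose sum lies in the set; the helper names are hypothetical.

```python
from itertools import product

def dot(u, v):
    return sum(p * q for p, q in zip(u, v))

def add(a, b):
    return tuple(tuple((p + q) % 2 for p, q in zip(u, v))
                 for u, v in zip(a, b))

def commute(a, b):
    return (dot(a[0], b[1]) - dot(a[1], b[0])) % 2 == 0

def beta(a, b):
    """T_a T_b = (-1)^beta T_{a+b} for commuting a, b (assumed convention)."""
    c = add(a, b)
    return ((dot(*a) + dot(*b) - dot(*c) + 2 * dot(a[1], b[0])) % 4) // 2

def value_assignments(omega):
    """All gamma: omega -> Z_2 satisfying the constraint on commuting pairs."""
    omega = sorted(omega)
    members = set(omega)
    pairs = [(a, b) for a in omega for b in omega
             if commute(a, b) and add(a, b) in members]
    found = []
    for values in product((0, 1), repeat=len(omega)):
        gamma = dict(zip(omega, values))
        if all((gamma[a] + gamma[b] - gamma[add(a, b)] - beta(a, b)) % 2 == 0
               for a, b in pairs):
            found.append(gamma)
    return found

bits = list(product((0, 1), repeat=2))
E = [(ax, az) for ax in bits for az in bits]

# Isotropic example: labels of I, X_1, X_2, X_1 X_2.
isotropic = [a for a in E if a[1] == (0, 0)]
# Real two-qubit Paulis (even number of Y): identity plus Mermin's square.
mermin = [a for a in E if dot(*a) % 2 == 0]

print(len(value_assignments(isotropic)), len(value_assignments(mermin)))
```

The isotropic set admits value assignments and is therefore non-contextual, whereas the search returns none for the set containing Mermin's square, in line with its role as a parity-based contextuality proof.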

Definition 4

The generalized phase space ${\cal{V}}$ consists of all pairs $(\Omega,\gamma)$ such that (i) $\Omega$ is closed under inference, (ii) $\Omega$ is non-contextual, (iii) $\gamma:\Omega\longrightarrow\mathbb{Z}_{d}$ satisfies the relation Eq. (10), and (iv) Eq. (3) holds.

Thus, for the generalized phase space ${\cal{V}}$ , the only sets $\Omega$ that matter are simultaneously closed and non-contextual. For short, we call such sets “cnc”.

III.2 Maximal sets $\Omega$

The cnc sets $\Omega$ partially specify the points in phase space, and it is thus desirable to eliminate possible redundancies among them. It turns out that only the “maximal” sets $\Omega$ need to be considered for ${\cal{V}}$ .

Definition 5

A cnc set $\Omega\subset E$ is maximal if there is no cnc set $\tilde{\Omega}\subset E$ such that $\Omega\subsetneq\tilde{\Omega}$ .

We denote by ${\cal{V}}_{M}$ the subset of ${\cal{V}}$ constructed only from the maximal cnc sets $\Omega$ . Then, any quantum state $\rho$ has expansions like Eq. (4), but with ${\cal{V}}$ replaced by ${\cal{V}}_{M}$ . If one of those expansions is non-negative, then we say that $\rho$ is positively representable w.r.t. ${\cal{V}}_{M}$ .

Lemma 1

For any $n$ and $d$ , a quantum state $\rho$ is positively representable w.r.t. ${\cal{V}}$ if and only if it is positively representable w.r.t. ${\cal{V}}_{M}$ .

From the perspective of positive representability, we may therefore shrink ${\cal{V}}$ to ${\cal{V}}_{M}$ without loss. We make use of this property when discussing the case of odd $d$ in Section IV.1 right below, and in the classification of cnc sets $\Omega$ for the multi-qubit case in Section IV.3. The proof of Lemma 1 is given in Appendix A.

III.3 The cohomological viewpoint

The above Definitions 3 and 4 have a cohomological underpinning, which connects the subject of the present paper to the topological treatment of parity-based contextuality proofs Coho , and of contextuality in measurement-based quantum computation CohoMBQC .

The cohomological picture arises as follows. The partial value assignments $\gamma$ and the function $\beta$ are cochains in a chain complex, with Eqs. (7) and (10) constraining them. Eq. (7) says that $\beta$ is a special cochain, namely a cocycle. Now, the basic reason for why the case of even $d$ is so much more involved than the case of odd $d$ is that, for even $d$ , the cocycle $\beta$ is non-trivial whereas for odd $d$ it is trivial Coho .

Eqs. (7) and (10) are frequently used in this paper, for example in the update rules of the phase point operators under Pauli measurements (proof of Lemma 5), the closedness of the generalized phase space ${\cal{V}}$ under update by Pauli measurement (proof of Lemma 7), and covariance of the quasiprobability function $W$ under Clifford unitaries (proof of Lemma 10). These are central properties for the phase-space description of quantum computation with magic states, and they are all matters of cohomology.

The cohomological formulation is based on a chain complex ${\cal{C}}_{n}$ constructed from the $n$ -qubit Pauli operators $T_{a}$ . The operator labels $a$ define the edges of this complex; the faces of ${\cal{C}}_{n}$ correspond to commuting pairs $(a,b)$ and volumes $(a,b,c)$ to commuting triples. For details, the interested reader is referred to Coho . Here, we only state two basic topological properties of the present scenario.

As already noted, the cochain $\beta$ defined in Eq. (5) is in fact a 2-cocycle, with the cocycle condition $d\beta=0$ enforced by Eq. (7). For any given volume $v=(a,b,c)$ , the coboundary $d\beta$ evaluates on $v$ to

$$d\beta(a,b,c):=\beta(a,b)+\beta(a+b,c)-\beta(b,c)-\beta(a,b+c).\tag{11}$$

Thus, Eq. (7) says that $d\beta(v)=0$ , for all volumes $v$ .

Eq. (10) in Definition 3 also has a cohomological interpretation, namely $d\gamma=\beta|_{\Omega\times\Omega}$ , with

$$d\gamma(a,b):=\gamma(a)+\gamma(b)-\gamma(a+b),\tag{12}$$

for any face $(a,b)$ spanned by commuting edges $a,b\in E$ .

Subsequently, we use evaluations of $d\beta$ and $d\gamma$ , defined in Eqs. (11) and (12), as a short-hand to express Eqs. (7) and (10). As outlined above, it is conceptually helpful to remember that $d\beta$ and $d\gamma$ denote coboundaries, but it is not required for the technical results presented in this paper.

IV Properties of the phase space ${\cal{V}}$

In this section, we look at the structure of the phase space ${\cal{V}}$ more closely, and make connections to previous phase space formulations. Namely, in Section IV.1 we address the relationship of this phase space to the usual qudit phase space, and in Section IV.2 we make clear the connections to the previously addressed rebit case. Further, in Section IV.3, we classify the cnc sets $\Omega$ , and for every $\Omega$ describe the sets $\Gamma(\Omega)$ of value assignments $\gamma$ . In Section IV.4 we clarify the relation to the stabilizer formalism.

IV.1 Qudits of odd dimension

This is the only place in the present paper where we consider the case of odd $d$ . The purpose of this section is to show that if $d$ is odd then for $n\geq 2$ qudits the generalized phase space ${\cal{V}}$ reduces to the standard phase space $V=\mathbb{Z}_{d}^{2n}$ . There, the quasiprobability function $W$ becomes the standard Wigner function Gross for odd finite-dimensional systems. Hence, the present quasiprobability function $W$ is a generalization of the finite odd dimensional Wigner function Gross , which in turn is a descendant of the original Wigner function Wig .

If $d$ is odd then the whole set $E$ is cnc. First, $E$ is closed under inference by definition. And second, it is known that in odd dimension Pauli observables have non-contextual deterministic value assignments Howard2 , QuWi . These yield the functions $\gamma$ , satisfying the condition Eq. (10). $E$ is thus non-contextual.

$E$ is furthermore the single maximal set, and, with Lemma 1, the only cnc set that needs to be considered for the phase space. Hence, the phase point operators are

$$A_{E}^{\gamma}=\frac{1}{d^{n}}\sum_{a\in E}\omega^{\gamma(a)}T_{a},\tag{13}$$

with the functions $\gamma$ satisfying Eqs. (3) and (10). The former condition ensures that the identity operator appears with weight $1/d^{n}$ in the expansion (real and positive). If $n\geq 2$, the latter condition has $d^{2n}$ solutions for the functions $\gamma$ if $d$ is odd Delf2. For a suitable choice of $\phi$ in Eq. (1), it holds that $\beta\equiv 0$ (odd $d$ only). The solutions for $\gamma$ then form a vector space

$$V=\mathbb{Z}_{d}^{2n}\quad\text{(for odd $d$)}.\tag{14}$$

We note that the case of a single qudit, $n=1$ , is an exception to the above behaviour. In this case, the set $\mathcal{V}$ has greater cardinality than $\mathbb{Z}_{d}^{2}$ Delf2 .

IV.2 Qubits and rebits

The remainder of this paper is about local Hilbert space dimension $d=2$ . This means mostly qubits, but we will occasionally also consider systems of rebits. The reason is that the major complication of the $d=2$ case stems from Mermin’s square and star Merm —two strikingly simple contextuality proofs. Those settings embed most efficiently in rebits rather than qubits, which warrants the inclusion of rebits here.

We remark that the present discussion of rebits is almost identical to the discussion of qubits, but very different from the earlier discussion of rebits in ReWi . In the latter, the physically measurable observables were restricted from real Pauli operators to CSS-type Pauli operators, and the real Clifford unitaries to CSS-ness preserving Clifford unitaries. No such restrictions are imposed here. If the restriction to CSS-ness preserving operations is imposed, then Mermin’s square and star, along with all other state-independent contextuality proofs based on Pauli observables, are effectively excised ReWi . Here, we retain those contextuality proofs, and consequently have to adjust to their presence. Notably, these contextuality proofs constrain quasiprobability distributions that preserve positivity under Pauli measurement.

We start the exploration of the $d=2$ case with two examples that illustrate the concept of generalized phase space ${\cal{V}}$ . The second example also illustrates the differences between contextuality, negative quasiprobability and quantum computational power for two-level systems.

Example 1: Eight-state model. It is known that every one-qubit quantum state can be positively represented by the so-called Eight-state-model EightState , which consists of two standard 1-qubit Wigner functions tagged together. The Eight-state-model is an instance of the state expansion Eq. (4), namely for $d=2$ , $n=1$ , and it contains only one set $\Omega$ ,

$$\Omega_{0}=\{0,x,y,z\},$$

with $T_{0}=I$ , $T_{x}=X$ , $T_{y}=Y$ and $T_{z}=Z$ . It is easily checked that $\Omega_{0}$ is non-contextual and closed under inference (no inference possible). The value assignments $\gamma$ are constrained by Eq. (3), hence $\gamma(0)=0$ , and no constraints arise from Eq. (7) due to the lack of non-trivial commuting elements in $\Omega_{0}$ . Thus, $\gamma(x)$ , $\gamma(y)$ and $\gamma(z)$ can be freely chosen. There are eight resulting functions, and they define the eight states of the model.

All one-qubit quantum states can be positively represented by this model, which is strictly more than all mixtures of one-qubit stabilizer states.
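To make the positivity claim concrete, the following Python sketch writes down one explicit non-negative expansion for an arbitrary one-qubit state in Bloch-vector coordinates. It assumes that, for $n=1$, Eq. (2) reads $A_{\Omega_{0}}^{\gamma}=(I+(-1)^{\gamma(x)}X+(-1)^{\gamma(y)}Y+(-1)^{\gamma(z)}Z)/2$, so that the eight phase points sit at the vertices of the cube $[-1,1]^{3}$. The product-form weights and all function names are illustrative choices, not taken from the paper, and the expansion is not unique.

```python
from itertools import product
from math import sqrt

def eight_state_weights(r):
    """Product-form weights W(gamma) for a Bloch vector r = (rx, ry, rz).

    gamma = (gx, gy, gz) labels the cube vertex ((-1)**gx, (-1)**gy, (-1)**gz).
    """
    return {
        gamma: (1 + (-1) ** gamma[0] * r[0]) / 2
             * (1 + (-1) ** gamma[1] * r[1]) / 2
             * (1 + (-1) ** gamma[2] * r[2]) / 2
        for gamma in product((0, 1), repeat=3)
    }

def bloch_from_weights(W):
    """Recombine the weighted cube vertices into a Bloch vector."""
    return tuple(
        sum(w * (-1) ** gamma[i] for gamma, w in W.items()) for i in range(3)
    )

def is_stabilizer_mixture(r, tol=1e-12):
    """One-qubit stabilizer mixtures fill the octahedron |rx| + |ry| + |rz| <= 1."""
    return sum(abs(c) for c in r) <= 1 + tol

# Example: a pure magic state on the (1,1,1) axis of the Bloch sphere.
r_magic = (1 / sqrt(3),) * 3
W_magic = eight_state_weights(r_magic)
```

For this state the weights are non-negative and reproduce the Bloch vector, although the state lies outside the stabilizer octahedron.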

Example 2: Mermin’s square. Mermin’s square is at the very root of the complications that arise for Wigner functions in even dimension. In particular, no $n$ -qubit Wigner function for which the corresponding phase point operators form an operator basis can preserve positivity under all Pauli measurements QuWi .

All observables appearing in Mermin’s square are real, and can thus be embedded in two rebits. Our formalism is easily adaptable to this slightly simpler scenario. Fig. 1 shows three distinct types of cnc sets $\Omega$ . Type (a) is the union of two non-trivially intersecting isotropic subspaces (9 sets), type (b) is isotropic subspaces (6 sets), and type (c) is triples of anti-commuting elements, i.e., one from each row and column of the square (6 sets). For each cnc set $\Omega$ of type (a), (b) and (c) of Fig. 1, the constraint Eq. (10) allows for $2^{3}$ , $2^{2}$ , $2^{3}$ functions $\gamma$ , respectively. The number of phase space points of each type therefore is $9\times 2^{3}=72$ , $6\times 2^{2}=24$ and $6\times 2^{3}=48$ , respectively.
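The set counts quoted above can be checked by brute force. The sketch below enumerates the nine non-trivial real two-qubit Pauli observables in symplectic notation and counts the three structural types. The label ordering $(x_{1},x_{2},z_{1},z_{2})$ and all names are assumptions made for this illustration. The numbers of value assignments per set ($2^{3}$, $2^{2}$, $2^{3}$) are taken from the text rather than derived, and the code does not itself verify non-contextuality.

```python
from itertools import combinations

def symp(a, b):
    """Symplectic product of a = (x1, x2, z1, z2) and b: 0 if T_a, T_b commute, 1 otherwise."""
    return (a[0] * b[2] + a[2] * b[0] + a[1] * b[3] + a[3] * b[1]) % 2

def add(a, b):
    return tuple((s + t) % 2 for s, t in zip(a, b))

def is_real(a):
    """A Pauli operator is real iff it contains an even number of Y factors."""
    return (a[0] * a[2] + a[1] * a[3]) % 2 == 0

paulis = [(x1, x2, z1, z2) for x1 in (0, 1) for x2 in (0, 1)
          for z1 in (0, 1) for z2 in (0, 1)]
real = [a for a in paulis if is_real(a) and any(a)]   # the 9 observables of the square

# Type (b): maximal isotropic subspaces, stored without the identity element.
contexts = {frozenset((a, b, add(a, b)))
            for a, b in combinations(real, 2) if symp(a, b) == 0}
# Type (a): unions of two contexts that share one non-trivial element.
type_a = {c1 | c2 for c1, c2 in combinations(contexts, 2) if len(c1 & c2) == 1}
# Type (c): triples of pairwise anticommuting observables.
type_c = [t for t in combinations(real, 3)
          if all(symp(a, b) == 1 for a, b in combinations(t, 2))]

counts = {"a": len(type_a), "b": len(contexts), "c": len(type_c)}
points = {"a": counts["a"] * 2**3, "b": counts["b"] * 2**2, "c": counts["c"] * 2**3}
```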

We make the following numerical observations about the two-rebit case: (i) Random sampling suggests that all 2-rebit states are positively representable; see Table 3. (ii) In Fig. 2 the region of positively representable density matrices of the form

[TABLE]

for $x,y\in\mathbb{R}$ , is displayed for three different methods; namely the stabilizer method RoM , the hyper-octahedral method Rall , and the present phase space method. We find that all quantum states in the plane spanned by the parameters $x$ , $y$ are positively represented by the present phase space method, and this is not the case for the stabilizer and hyper-octahedral methods.

Example 3: 2 qubits. Numerical analysis shows that two copies of the state

[TABLE]

can be positively represented, for all angles $\phi$ .

IV.3 Classification of multi-qubit phase space points

Denote by $\Gamma(\Omega)$ the set of functions $\gamma:\Omega\longrightarrow\mathbb{Z}_{2}$ that satisfy the constraints Eqs. (3) and (10). Then, the following statement holds.

Lemma 2

For all sets $\Omega$ of Def. 4, $\Gamma(\Omega)$ is the coset of a vector space $U(\Omega)$ .

Proof of Lemma 2. Write $\gamma=\gamma_{0}+\eta$ , where $\gamma_{0}\in\Gamma(\Omega)$ is some reference function. Then, the only condition on the functions $\eta\in U(\Omega)$ is $d\eta=0$ . Thus, if $\eta,\eta^{\prime}\in U(\Omega)$ then $c\eta+c^{\prime}\eta^{\prime}\in U(\Omega)$ , for all $c,c^{\prime}\in\mathbb{Z}_{2}$ . $\Box$

Lemma 2 reproduces a familiar feature. In infinite and finite odd dimension, the whole phase space is an orbit under the vector space of translations. There is an origin 0 of phase space, and all other phase space points are obtained from it by translation. In our present case of $d=2$ , the phase space ${\cal{V}}$ splinters into many fragments, each of which corresponds to a vector space $U$ attached to a cnc set $\Omega$ .

At this point, one question about the structure of ${\cal{V}}$ remains: Can the cnc sets $\Omega$ be classified? It is resolved by Lemma 3 and Theorem 1 below.

Lemma 3

For $n$ qubits, consider an isotropic subspace $\tilde{I}\subset E$ of dimension $n-m$ , with $m\leq n$ , and $\xi\leq 2m+1$ elements $a_{k}\in E$ that pairwise anti-commute but all commute with $\tilde{I}$ . Denote $I_{k}:=\langle a_{k},\tilde{I}\rangle$ for $k=1,..,\xi$ . For any number $n$ of qubits, the sets

$$\Omega=\bigcup_{k=1}^{\xi}I_{k}\qquad\text{(15)}$$

are non-contextual and closed under inference.

Proof of Lemma 3. Existence. The sets $\Omega$ of Eq. (15) exist for all $m$ , $n$ . To see this, consider the $m$ -qubit Jordan-Wigner transforms of the Majorana fermion operators acting on qubits 1 to $m$ ,

[TABLE]

for $j=1,..,m$ , and, if $m>0$ , the further observable

[TABLE]

Further, let $\tilde{I}$ be the isotropic subspace corresponding to a stabilizer state supported on the $n-m$ qubits numbered $m+1,..,n$ . Define $a_{k}$ via $C_{k}=T_{a_{k}}$ as in Eqs. (16),(17), for all $k=1,..,2m+1$ . These $a_{k}$ and $a\in\tilde{I}$ have the commutation relations required.
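The existence argument can be checked numerically. The sketch below builds $2m+1$ Jordan-Wigner operators in symplectic notation and verifies that they pairwise anticommute. The placement of the $Z$ strings (on the qubits preceding qubit $j$) is a common convention assumed here; it may differ from the one used in Eqs. (16) and (17), but any such choice yields the same commutation relations. All function names are hypothetical.

```python
from itertools import combinations

def pauli_string(m, letters):
    """Build the symplectic vector (x | z) of an m-qubit Pauli from {qubit: 'X'|'Y'|'Z'}."""
    x, z = [0] * m, [0] * m
    for q, p in letters.items():
        x[q] = 1 if p in "XY" else 0
        z[q] = 1 if p in "YZ" else 0
    return tuple(x + z)

def symp(a, b):
    """0 if the Paulis labelled a and b commute, 1 if they anticommute."""
    m = len(a) // 2
    return sum(a[i] * b[m + i] + a[m + i] * b[i] for i in range(m)) % 2

def majoranas(m):
    """Jordan-Wigner operators C_1, ..., C_{2m} plus the parity operator C_{2m+1}."""
    ops = []
    for j in range(m):
        tail = {q: "Z" for q in range(j)}   # Z string on qubits before j (assumed convention)
        ops.append(pauli_string(m, {**tail, j: "X"}))
        ops.append(pauli_string(m, {**tail, j: "Y"}))
    ops.append(pauli_string(m, {q: "Z" for q in range(m)}))
    return ops

def pairwise_anticommuting(ops):
    return all(symp(a, b) == 1 for a, b in combinations(ops, 2))
```

Calling `pairwise_anticommuting(majoranas(m))` for small `m` confirms that the construction supplies the $\xi=2m+1$ elements $a_{k}$ required by the lemma.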

Closedness. Consider a pair $c,d\in\Omega$ such that $[c,d]=0$ . There are two cases. (i) $c,d\in I_{k}$ , for some $k$ . Then, $c+d\in I_{k}$ , hence $c+d\in\Omega$ .

(ii) $c\in I_{k}$ and $d\in I_{l}$ , $k\neq l$ . We may write $c=\nu\,a_{k}+g$ , $d=\mu\,a_{l}+g^{\prime}$ , for some $\nu,\mu\in\mathbb{Z}_{2}$ and $g,g^{\prime}\in\tilde{I}$ . Since $a_{k}$ and $a_{l}$ anti-commute while both commute with $\tilde{I}$ , the commutation relation $[c,d]=0$ implies that $\nu\mu=0$ , hence either $\nu=0$ or $\mu=0$ . Wlog. assume that $\nu=0$ . Then, $c\in\tilde{I}$ , hence $c,d\in I_{l}$ . Thus, $c+d\in I_{l}\subset\Omega$ .

In both cases, $c,d\in\Omega$ and $[c,d]=0$ implies that $c+d\in\Omega$ . Hence, $\Omega$ is closed under inference.

Non-contextuality. There exists a function $\gamma|_{\tilde{I}}:\tilde{I}\longrightarrow\mathbb{Z}_{2}$ that satisfies Eq. (10) on $\tilde{I}$ . We now extend this function to $\Omega$ as follows. The values $\gamma(a_{k})$ , for $k=1,..,\xi$ can be freely chosen, and for all $a\in\tilde{I}$ and all $k$ , $\gamma(a_{k}+a):=\gamma(a_{k})+\gamma(a)+\beta(a_{k},a)$ . This fully defines $\gamma:\Omega\longrightarrow\mathbb{Z}_{2}$ . All commuting triples $c,d,c+d$ lie within one of the isotropic spaces $I_{k}$ forming $\Omega$ , and $d\gamma(a,b)=\beta(a,b)$ thus holds.

This establishes that the sets $\Omega$ of Eq. (15) exist for the maximum value of $\xi$ , $\xi=2m+1$ . One may always choose $\xi$ smaller, which neither affects closedness nor non-contextuality. $\Box$

Theorem 1

All maximal cnc sets $\Omega$ are of the form Eq. (15), with $\xi=2m+1$ and $1\leq m\leq n$ .

Proof of Theorem 1. Let $\Omega\subset E$ be closed under inference and non-contextual. We can partition the elements of $\Omega$ into two subsets, $\Omega=\{q_{1},\dots,q_{\mu}|g_{1},\dots,g_{\nu}\}$ , where $\tilde{I}=\{g_{1},\dots,g_{\nu}\}$ are the elements of $\Omega$ which commute with the whole set. $\tilde{I}$ is an isotropic subspace since if two elements, $a$ and $b$ , commute with $\Omega$ , then clearly their sum, $a+b$ , also commutes with $\Omega$ , and $\tilde{I}$ is isotropic by definition.

If all elements of $\Omega$ pair-wise commute then $\Omega=\tilde{I}$ is an isotropic subspace. Isotropic subspaces are not maximal cnc sets because they are always contained in Eq. (15) sets with parameter $m=1$ . If $\Omega$ is not an isotropic subspace then it can be written compactly as

[TABLE]

where $\xi\geq 2$ , the cosets $p_{1}+\tilde{I},\dots,p_{\xi}+\tilde{I}$ are distinct and $q_{1},\dots,q_{\mu}$ are in the cosets $p_{1}+\tilde{I},\dots,p_{\xi}+\tilde{I}$ . Note that in this form, there can be no element $p_{j}$ which commutes with all of $p_{1},\dots,p_{\xi}$ because $\tilde{I}$ is defined to contain all such elements. Now we consider the possible commutation relations that $p_{1},\dots,p_{\xi}$ can have if $\Omega$ is non-contextual.

The Mermin square is generated by products of commuting pairs of the two qubit Pauli operators $\{X_{1},X_{2},Z_{1},Z_{2}\}$ . This is a contextual set. Therefore, any set which is closed under inference and contains four elements $p_{1},p_{2},p_{3},p_{4}$ with the commutation relations like those of $\{X_{1},X_{2},Z_{1},Z_{2}\}$ :

[TABLE]

will necessarily contain the full Mermin square and therefore be contextual.
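The closure step of this argument can be reproduced in a few lines. The sketch below starts from $\{X_{1},X_{2},Z_{1},Z_{2}\}$, lists the commuting pairs (the edges of the commutativity graph), and closes the set under inference. The label ordering $(x_{1},x_{2},z_{1},z_{2})$ and the function names are assumptions of this illustration.

```python
def symp(a, b):
    """0 if the two-qubit Paulis a = (x1, x2, z1, z2) and b commute, 1 if they anticommute."""
    return (a[0] * b[2] + a[2] * b[0] + a[1] * b[3] + a[3] * b[1]) % 2

def add(a, b):
    return tuple((s + t) % 2 for s, t in zip(a, b))

def close_under_inference(generators):
    """Add a + b for every commuting pair a, b until nothing new appears."""
    omega = set(generators)
    changed = True
    while changed:
        changed = False
        for a in list(omega):
            for b in list(omega):
                c = add(a, b)
                if symp(a, b) == 0 and c not in omega:
                    omega.add(c)
                    changed = True
    return omega

X1, X2, Z1, Z2 = (1, 0, 0, 0), (0, 1, 0, 0), (0, 0, 1, 0), (0, 0, 0, 1)
p = [X1, X2, Z1, Z2]
# Commuting pairs among p1..p4 (1-based indices): the edges of the commutativity graph.
edges = {(i + 1, j + 1) for i in range(4) for j in range(i + 1, 4)
         if symp(p[i], p[j]) == 0}
omega = close_under_inference(p)
```

The edges form a chordless four-cycle, and the closure consists of the identity together with the nine observables of Mermin's square, including $Y_{1}Y_{2}$.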

Another sufficient condition for a closed under inference set to be contextual is that it contains four elements with the commutation relations

[TABLE]

The reason is that since the set is closed under inference, it will necessarily contain the elements $p_{1}+p_{2}$ and $p_{3}+p_{4}$ , and the elements $p_{1},p_{1}+p_{2},p_{3}+p_{4},p_{4}$ have the commutation relations of Eq. (19). Thus, it must contain a Mermin square.

A similar argument shows that another sufficient condition for a closed under inference set to be contextual is that it contains four elements with the commutation relations

[TABLE]

In this case, since the set is closed under inference, it must also contain the elements $p_{1}+p_{2}$ and $p_{2}+p_{3}$ and the elements $p_{1}+p_{2},p_{2},p_{2}+p_{3},p_{4}$ have the commutation relations of Eq. (19).

To determine the possible commutation relations of the elements $p_{1},\dots,p_{\xi}$ , we will look at their commutativity graph $\mathcal{G}$ . That is the undirected graph with a vertex for each of $p_{1},\dots,p_{\xi}$ and an edge connecting each pair of commuting vertices. Since $\Omega$ is non-contextual, the commutation relations of Eq. (19), Eq. (20) and Eq. (21) provide restrictions on the possible commutation relations of the elements $p_{1},\dots,p_{\xi}$ of $\Omega$ . In terms of the commutativity graph $\mathcal{G}$ , these are forbidden induced subgraphs. (An induced subgraph of a graph is the graph obtained by taking a subset of the vertices of the original graph and all of the edges connecting pairs of vertices in the subset.)

The restriction of Eq. (19) says that $\mathcal{G}$ cannot have a four vertex chordless cycle ( $C_{4}$ ) as an induced subgraph and the restriction from Eq. (20) says that $\mathcal{G}$ cannot have a four vertex path ( $P_{4}$ ) as an induced subgraph. These two forbidden induced subgraphs characterize the trivially perfect graphs Golumbic77 . I.e. $\mathcal{G}$ must be a trivially perfect graph.

Connected trivially perfect graphs have the property that they contain a universal vertex Golumbic77 , i.e., a vertex that is adjacent to every other vertex in the graph. If the commutativity graph $\mathcal{G}$ were connected then there would be an element $p_{j}$ which commutes with all other elements of $\{p_{1},\dots,p_{\xi}\}$ . This is also forbidden. Therefore, the graph $\mathcal{G}$ is disconnected.

Given that $\mathcal{G}$ is disconnected, Eq. (21) provides another restriction. Namely that each connected component of $\mathcal{G}$ cannot have a three vertex path ( $P_{3}$ ) as an induced subgraph. I.e. each connected component of $\mathcal{G}$ is a clique.

This means we can partition the elements $\{p_{1},\dots,p_{\xi}\}$ into disjoint subsets

[TABLE]

where two elements commute if and only if they are in the same subset in the partition. Since the set $\{p_{1},\dots,p_{\xi}\}$ is closed under inference, each subset in the partition must be closed under inference. Now suppose a subset in the partition contained at least two elements. Then since the subset is closed under inference it must also contain their sum. But each of the two elements anticommutes with the elements of all other subsets in the partition so their sum must commute with the elements of all other subsets in the partition. This is a contradiction. Therefore, each subset in the partition contains a single element. Thus, the elements $\{p_{1},\dots,p_{\xi}\}$ of Eq. (18) pair-wise anticommute.

Maximal cnc sets are sets of the form Eq. (18) for which $\xi$ is maximal for a given isotropic subspace $\tilde{I}$ . If the isotropic subspace $\tilde{I}$ has dimension $n-m$ where $n$ is the number of qubits and $1\leq m\leq n$ , then the pair-wise anticommuting elements $p_{k}$ which complete the set are elements of the symplectic complement $\tilde{I}^{\perp}$ . This is a $m$ dimensional symplectic subspace, therefore the maximal value of $\xi$ is the largest number of pair-wise anticommuting Pauli operators on $m$ qubits. The largest sets of pair-wise anticommuting Pauli operators on $m$ qubits have $2m+1$ elements. This can be seen as follows. Consider the elements $a_{k}\in E$ given by $T_{a_{k}}=C_{k}$ , with $C_{k}$ defined in Eq. (16). The set $\{a_{k}|\;1\leq k\leq 2m\}$ consists of pairwise anticommuting elements. There is an element $c$ , with $T_{c}=C_{2m+1}$ , cf. Eq. (17) that anticommutes with each one of the elements in this set. It is the only element in $E$ to do so, since the set of equations

[TABLE]

has a unique solution. Therefore together with this element we can construct a set of size $2m+1$ .

We would like to show any other set of pairwise anticommuting elements whose size is $2m$ can be mapped bijectively to the set we constructed. Suppose $\{\tilde{a}_{k}|\;1\leq k\leq 2m\}$ is such a set. By Witt’s lemma (Asch, §20) the function that sends $a_{k}$ to $\tilde{a}_{k}$ extends to a linear map $f:E\to E$ that satisfies $[f(v),f(w)]=[v,w]$ for all $v,w\in E$ (symplectic transformation). Therefore there is a unique element that anticommutes with all the $\tilde{a}_{k}$ , and it is given by $f(c)$ . In particular, $2m+1$ is the maximal number.
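For small $m$ the bound $2m+1$ can also be confirmed by exhaustive search, independently of the argument via Witt's lemma. The sketch below is such a brute-force check; it is only practical for $m\leq 2$, since the number of candidate subsets grows very quickly. The function names are hypothetical.

```python
from itertools import combinations, product

def symp(a, b):
    """0 if the m-qubit Paulis labelled a = (x | z) and b commute, 1 if they anticommute."""
    m = len(a) // 2
    return sum(a[i] * b[m + i] + a[m + i] * b[i] for i in range(m)) % 2

def max_anticommuting_size(m):
    """Largest k such that some k non-identity m-qubit Paulis pairwise anticommute."""
    paulis = [v for v in product((0, 1), repeat=2 * m) if any(v)]
    best = 0
    for k in range(1, 2 * m + 3):
        found = any(all(symp(a, b) for a, b in combinations(c, 2))
                    for c in combinations(paulis, k))
        if not found:
            break          # no set of size k exists, hence none of larger size either
        best = k
    return best
```

The early exit is justified because every subset of a pairwise anticommuting set is again pairwise anticommuting.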

To complete the proof we must show that maximal sets of pair-wise anticommuting elements on $m$ qubits with size less than $2m+1$ do not lead to maximal cnc sets. To see this note that by Witt’s lemma, for any maximal anticommuting set of size $2m^{\prime}+1$ ( $m^{\prime}<m$ ), there is a bijection $f:E\rightarrow E$ which maps the set to one of the form Eq. (16,17). Therefore, we can find $m-m^{\prime}$ independent elements which commute with the set. For example, if $g_{1},g_{2},\dots,g_{m-m^{\prime}}$ are the vectors corresponding to Pauli operators $X_{m^{\prime}+1},X_{m^{\prime}+2},\dots,X_{m}$ , then we could take $f^{-1}(g_{1}),f^{-1}(g_{2}),\dots,f^{-1}(g_{m-m^{\prime}})$ . Therefore, the $n-m$ dimensional isotropic subspace can be extended to one with dimension $n-m^{\prime}$ .

This completes the proof. Therefore, all maximal cnc sets have the form Eq. (15), with $\xi=2m+1$ . $\Box$

A result equivalent to the characterization of Eq. (22) is given in Theorem 3 of KirbyLove .

Tensor products of phase point operators are not typically phase point operators. Consider, for example, two phase point operators with $m=2$ and $n\geq 2$ . Their tensor product does not appear in the classification provided by Theorem 1, as the commutativity graph shows. Physically, such tensor products are not closed under inference, violating Def. 4. Upon closure, they cease to be non-contextual as they then contain a Mermin square. Hence the closures also violate Def. 4.

But there is an exception. If one of the two phase point operators in the tensor product corresponds to an isotropic subspace, i.e., has $m=0$ , then the tensor product is a valid phase point operator. See Appendix D for details.

IV.4 Relation to the stabilizer formalism

The purpose of this section is to describe the relation between positive representability by the quasiprobability distribution $W$ and qubit stabilizer states. We demonstrate that, for all $n$ , the set of positively $W$ -representable states contains the stabilizer mixtures as a strict subset. This is the content of Lemma 4 below. The lemma is based on two examples.

Example 4. Let $|\text{stab}\rangle$ be an $n$ -qubit stabilizer state, with isotropic subspace $\tilde{I}\subset E$ corresponding to its stabilizer. Then, it is easily verified that $\tilde{I}$ is non-contextual and closed under inference. Namely, $\tilde{I}$ is of form Eq. (15), with $m=0$ , $\xi=1$ .

The next example generalizes Example 1 to $n$ -qubit states.

Example 5. Every $n$ -qubit state of the form $\Psi=\rho_{1}\otimes|\text{stab}\rangle\langle\text{stab}|_{2,..,n}$ , with $\rho$ a general one-qubit state and $|\text{stab}\rangle$ an $n-1$ -qubit pure stabilizer state, is positively representable.

To prove this statement, for any number $n$ of qubits, consider an isotropic subspace $\tilde{I}\subset E$ of rank $n-1$ representing the stabilizer state $|\text{stab}\rangle_{2,..,n}$ , and three elements $x,y,z\in E$ , such that $T_{x}=X_{1}$ , $T_{y}=Y_{1}$ and $T_{z}=Z_{1}$ . Define the three isotropic subspaces $I_{x},I_{y},I_{z}\subset E$ ,

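$$I_{x}:=\langle x,\tilde{I}\rangle,\quad I_{y}:=\langle y,\tilde{I}\rangle,\quad I_{z}:=\langle z,\tilde{I}\rangle,$$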

and $\Omega_{xyz}:=I_{x}\cup I_{y}\cup I_{z}$ . $\Omega_{xyz}$ is of form Eq. (15), with $m=1$ , $\xi=3$ and $n\geq 2$ , hence cnc by Lemma 3.

We now apply this result to the state $\Psi=\rho_{1}\otimes|\text{stab}\rangle\langle\text{stab}|_{2,..,n}$ above. We can write the constituents as $\rho=\sum_{\gamma_{0}}W_{\rho}(\Omega_{0},\gamma_{0})A_{\Omega_{0}}^{\gamma_{0}}$ , with $W_{\rho}\geq 0$ (cf. Example 1), and $|\text{stab}\rangle\langle\text{stab}|=A_{\tilde{I}}^{\tilde{\gamma}}$ (cf. Example 4). We observe that

[TABLE]

with $\gamma(a):=\gamma_{0}(a|_{1})+\tilde{\gamma}(a|_{2,..,n})\mod 2$ . To see this, recall that $\Omega_{0}=\{0,x,y,z\}$ , and note that the set $\Omega_{xyz}$ can also be written as $\Omega_{xyz}=\tilde{I}\cup(\tilde{I}+x)\cup(\tilde{I}+y)\cup(\tilde{I}+z)$ , where the coset $\tilde{I}+x:=\{a+x,\;\forall a\in\tilde{I}\}$ , etc. Thus, $\Psi=\sum_{\gamma_{0}}W_{\rho}(\Omega_{0},\gamma_{0})A_{\Omega_{xyz}}^{\gamma}$ . Since $W_{\rho}\geq 0$ by Example 1, the states $\Psi$ are all positively representable. Yet not all these states are mixtures of stabilizer states. To see this, note that stabilizer mixedness is preserved under partial trace. Now assume that $\Psi$ is a stabilizer mixture for all $\rho$ . Then $\text{Tr}_{2..n}\Psi=\rho_{1}$ is also a stabilizer mixture for every one-qubit state $\rho$ , which contradicts the fact that some one-qubit states are not stabilizer mixtures (cf. Example 1).

We cast the combined conclusion of Examples 4 and 5 as a Lemma.

Lemma 4

For all $n\in\mathbb{N}$ , all mixtures of $n$ -qubit stabilizer states are positively representable, and furthermore there exist positively representable states that are not mixtures of stabilizer states.

V Quantum mechanical rules for state update under measurement

In the previous sections we have analyzed the generalized phase space ${\cal{V}}$ on which the quasiprobability function $W$ is defined. We now turn to dynamics.

For our setting of QCM this concerns evolution under the free operations, i.e., the Clifford unitaries and Pauli measurements. As already noted in ReWi and QuWi , the situation simplifies even further. If the goal is to sample from the joint probability distribution of measurement outcomes—which is the case in quantum computation—then only the update under Pauli measurements needs to be considered.

The Clifford unitaries can be propagated forward in time, thereby conjugating the Pauli measurements into other such measurements, past the final measurement and then discarded. (This redundancy notwithstanding, we will visit the update of $W$ under Clifford unitaries in Section VII.3, where we prove covariance.) The main results of this section are Theorem 2 and Lemma 5.

Theorem 2

For any $n\in\mathbb{N}$ , the set ${\cal{P}}_{n}$ of positively representable $n$ -qubit quantum states is closed under Pauli measurement.

To describe the dynamics under measurement, we need to set up some further notation. For every set $\Omega$ we introduce the derived set $\Omega\times a$ . Denoting $\text{Comm}(a):=\{b\in E|[a,b]=0\}$ and $\Omega_{a}:=\Omega\cap\text{Comm}(a)$ ,

[TABLE]

Likewise, we define an update on functions $\gamma$ invoking the measurement outcome $s_{a}$ of an observable $T_{a}$ , namely $(\cdot)\times s_{a}:\left(\gamma:\Omega\longrightarrow\mathbb{Z}_{2}\right)\mapsto\left(\gamma\times s_{a}:\Omega\times a\longrightarrow\mathbb{Z}_{2}\right)$ . We define this update only for $(\Omega,\gamma)\in{\cal{V}}$ , and only for $a\not\in\Omega$ . (The definitions of $\Omega\times a$ and $\gamma\times s_{a}$ can without modification be extended to $a\in\Omega$ . However, in that case the function values $\gamma\times s_{a}(b)$ can be determined both through Eq. (25a) and (25b), and we need to check consistency. These inferences are indeed consistent, as a consequence of Eq. (10). Since we do not need the case of $a\in\Omega$ subsequently, we skip the details of the argument.) The updated function $\gamma\times s_{a}:\Omega\times a\longrightarrow\mathbb{Z}_{2}$ is given by

[TABLE]

The rules of Eq. (25) are used to formulate the update rule for phase point operators of Eq. (2) under Pauli measurement.

Remark. Update rules similar to Eq. (25) have been used previously Lilly to construct a $\psi$ -epistemic model of the multi-qubit stabilizer formalism. Those rules update the value assignments in the same way but are applied under different conditions. Specifically, the update in Lilly does not refer to general sets $\Omega$ satisfying the conditions of Def. 4.

Lemma 5

Denote the projectors $P_{a}(s_{a}):=(I+(-1)^{s_{a}}T_{a})/2$ , and let $A_{\Omega}^{\gamma}$ be a phase point operator defined through Eq. (2), with $(\Omega,\gamma)\in{\cal{V}}$ satisfying the conditions of Definition 4. Then, the effect of a measurement of the Pauli observable $T_{a}$ with outcome $s_{a}$ on $A_{\Omega}^{\gamma}$ is

[TABLE]

Example 2, continued. Eq. (26) entails the update of both the sets $\Omega$ and the functions $\gamma$ . Here we only consider the former. Fig. 5 displays the update of the set $\Omega$ shown in Fig. 1 a, under the measurement of (a) the observable $X_{2}$ , with $a(X_{2})\in\Omega$ , and (b) the observable $X_{1}$ , with $a(X_{1})\not\in\Omega$ .

In preparation for the proof of Lemma 5 it is useful to state two relations of the function $\beta$ for $d=2$ . With the definition Eq. (5) of $\beta$ and Eq. (3), the operator identities $T_{a}T_{a}=I$ and $T_{b}I=T_{b}$ imply that

[TABLE]

Furthermore, evaluating $d\beta(a,a,0)=0$ (see Eqs. (7) and (11)), and using Eq. (27) yields

[TABLE]

To prove Lemma 5 we also need the following result.

Lemma 6

If $\Omega\subset E$ is non-contextual and closed under inference, then so is $\Omega_{a}$ , for all $a\in E$ .

Proof of Lemma 6. First consider closure. Assume that $c,d\in\Omega_{a}$ and $[c,d]=0$ . Then, $c,d\in\Omega$ , and also $c+d\in\Omega$ , since $\Omega$ is closed by assumption. Further, $[c,a]=[d,a]=0$ implies $[c+d,a]=0$ , and hence $c+d\in\Omega_{a}$ . $\Omega_{a}$ is thus closed.

Now consider non-contextuality. Since $\Omega$ is non-contextual, there exists a function $\gamma$ such that $d\gamma=\beta$ on $\Omega$ . Since $\Omega_{a}$ is closed, $\beta$ can be properly restricted to ${\cal{C}}(\Omega_{a})$ , and so can $\gamma$ . Hence, $d\gamma|_{{\cal{C}}(\Omega_{a})}=\beta|_{{\cal{C}}(\Omega_{a})}$ . Thus, $\Omega_{a}$ is non-contextual. $\Box$
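The closure part of Lemma 6 can be illustrated on a small example. The sketch below takes a two-qubit set of the form Eq. (15), namely the union of the isotropic subspaces $\{0,X_{1},X_{2},X_{1}X_{2}\}$ and $\{0,X_{1},Z_{2},X_{1}Z_{2}\}$, and checks that $\Omega_{a}$ is closed under inference for every Pauli label $a$. The example set, the label ordering $(x_{1},x_{2},z_{1},z_{2})$ and the function names are choices made for this illustration; non-contextuality is not tested here.

```python
from itertools import product

def symp(a, b):
    """0 if the two-qubit Paulis a = (x1, x2, z1, z2) and b commute, 1 if they anticommute."""
    return (a[0] * b[2] + a[2] * b[0] + a[1] * b[3] + a[3] * b[1]) % 2

def add(a, b):
    return tuple((s + t) % 2 for s, t in zip(a, b))

def is_closed(omega):
    """Closed under inference: a, b in omega and [a, b] = 0 imply a + b in omega."""
    return all(add(a, b) in omega for a in omega for b in omega if symp(a, b) == 0)

def restrict(omega, a):
    """Omega_a: the elements of omega that commute with a."""
    return {b for b in omega if symp(a, b) == 0}

# Example set: {0, XI, IX, XX} united with {0, XI, IZ, XZ}.
omega = {(0, 0, 0, 0), (1, 0, 0, 0), (0, 1, 0, 0), (1, 1, 0, 0),
         (0, 0, 0, 1), (1, 0, 0, 1)}
all_paulis = list(product((0, 1), repeat=4))
closed_everywhere = all(is_closed(restrict(omega, a)) for a in all_paulis)
```

For instance, restricting to the commutant of $Z_{1}Z_{2}$ leaves $\{0,X_{1}X_{2},Z_{2}\}$, whose two non-trivial elements anticommute, so no further inference is possible.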

Proof of Lemma 5. Under the measurement of $T_{a}$ with outcome $s_{a}\in\mathbb{Z}_{2}$ we have

[TABLE]

From here on we need to distinguish two cases, $a\in\Omega$ and $a\not\in\Omega$ .

Case I: $a\in\Omega$ . Focusing on the second term in the expansion Eq. (29),

[TABLE]

Therein, in the first line we have used Eq. (5), in the second line Eq. (10), in the third line the completeness of $\Omega_{a}$ under inference (Lemma 6), and the fourth line is just a relabeling of the elements in $\Omega_{a}$ . Inserting this result in the above expansion Eq. (29), we find

[TABLE]

and Eq. (26a) follows.

Case II: $a\not\in\Omega$ . Substituting $b\longrightarrow a+b$ in Eq. (25b) gives $\gamma\times s_{a}(a+b)=\gamma(b)+s_{a}+\beta(a,a+b)$ , for $b\in\Omega_{a}$ . With Eq. (28) we obtain

[TABLE]

With this, we now look at the second term in the expansion Eq. (29),

[TABLE]

The first line above follows with Eq. (5), and the second with Eq. (31).

Considering the first term in the expansion Eq. (29), with Eq. (25a) we have

[TABLE]

Inserting the above expressions for the two terms in Eq. (29), and using the definition Eq. (24) of $\Omega\times a$ , we obtain Eq. (26b). $\Box$

We have so far shown how the phase point operators can be updated under measurement once. We still need to show that this update can be iterated. This requires that the new phase point operators appearing on the r.h.s. of Eq. (26) satisfy the consistency constraints of Definition 4.

Lemma 7

If $(\Omega,\gamma)\in{\cal{V}}$ then $(\Omega,\gamma+[a,\cdot])\in{\cal{V}}$ , for all $a\in\Omega$ , and $(\Omega\times a,\gamma\times s_{a})\in{\cal{V}}$ , for all $a\not\in\Omega$ and $s_{a}\in\mathbb{Z}_{2}$ .

The proof of Lemma 7 is given in Appendix B.

Proof of Theorem 2. Consider a state $\rho\in{\cal{P}}_{n}$ , and a measurement of the Pauli observable $T_{a}$ on it. Assume that the measurement outcome $s_{a}$ can occur, $p_{a}(s_{a}):=\text{Tr}(P_{a}(s_{a})\rho)>0$ . We have to show that under these conditions, the post-measurement state

[TABLE]

is also contained in the set ${\cal{P}}_{n}$ .

Denote $\overline{\delta}_{a\in\Omega}:=1-\delta_{a\in\Omega}$ . Then, with Lemma 5 and the state expansion Eq. (4) of $\rho$ , we have

[TABLE]

Thus, $\rho^{\prime}$ can be represented by a quasiprobability distribution $W_{\rho^{\prime}}$ with elements

[TABLE]

The $W_{\rho^{\prime}}(\Omega^{\prime},\gamma^{\prime})$ are thus linear combinations of $W_{\rho}(\Omega,\gamma)$ with non-negative coefficients (0 or $1/2p_{a}(s_{a})$ ). Since the $W_{\rho}(\Omega,\gamma)$ are non-negative by assumption, it follows that $W_{\rho^{\prime}}(\Omega^{\prime},\gamma^{\prime})\geq 0$ , for all $(\Omega^{\prime},\gamma^{\prime})\in{\cal{V}}$ . $\Box$
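The structure of this proof can be traced exactly for a single qubit, where every Pauli label lies in $\Omega_{0}$ and only the case $a\in\Omega$ occurs. The sketch below assumes the update $A_{\Omega}^{\gamma}\mapsto\delta_{s_{a},\gamma(a)}(A_{\Omega}^{\gamma}+A_{\Omega}^{\gamma+[a,\cdot]})/2$ for $a\in\Omega$, which is how we read Lemma 5 together with Lemma 7 and the coefficients $0$ or $1/2p_{a}(s_{a})$ stated above. The product-form initial distribution and all names are illustrative assumptions. Exact rational arithmetic is used so that the checks involve no rounding.

```python
from fractions import Fraction
from itertools import product

AXES = {"X": 0, "Y": 1, "Z": 2}

def product_weights(r):
    """Product-form W(gamma) for a Bloch vector r with rational entries (an assumed choice)."""
    W = {}
    for gamma in product((0, 1), repeat=3):
        w = Fraction(1)
        for g, ri in zip(gamma, r):
            w *= (1 + (-1) ** g * ri) / 2
        W[gamma] = w
    return W

def update(W, axis, s):
    """Post-measurement weights after measuring the Pauli on `axis` with outcome s."""
    k = AXES[axis]
    p = sum(w for gamma, w in W.items() if gamma[k] == s)   # outcome probability
    W_new = {gamma: Fraction(0) for gamma in W}
    for gamma, w in W.items():
        if gamma[k] != s:
            continue                                        # delta_{s, gamma(a)} = 0
        # gamma + [a, .] flips the values on the two Paulis that anticommute with T_a.
        shifted = tuple(g if i == k else 1 - g for i, g in enumerate(gamma))
        W_new[gamma] += w / (2 * p)
        W_new[shifted] += w / (2 * p)
    return p, W_new

def bloch(W):
    """Bloch vector represented by the weights W."""
    return tuple(sum(w * (-1) ** gamma[i] for gamma, w in W.items()) for i in range(3))
```

Starting from the Bloch vector $(1/2,1/2,1/2)$ and measuring $Z$ with outcome $0$, the updated weights stay non-negative, sum to one, and represent the Bloch vector $(0,0,1)$, as the projection postulate requires.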

VI Classical simulation for $W_{\rho}\geq 0$

VI.1 Simulation algorithm

We now turn to the question of how hard it is to classically simulate the outcome statistics for a sequence of Pauli measurements on an initial quantum state. In this regard, we show that if the initial quantum state is positively represented and the corresponding probability distribution $W$ can be efficiently sampled from, then the statistics of the measurement outcomes can be efficiently simulated.

The classical simulation procedure in Table 1 describes weak simulation VdN1 –VdN2 , i.e., it outputs one sample from the joint probability distribution $p(s_{a_{1}},s_{a_{2}},..,s_{a_{N}})$ of outcomes corresponding to a sequence of measurements of Pauli operators $T_{a_{1}},T_{a_{2}},..,T_{a_{N}}$ ( $T_{a_{1}}$ is measured first, $T_{a_{N}}$ last). If more than one sample is desired, the procedure is simply repeated. We note that the measurement can be adaptive, i.e., it is not necessary for the simulation algorithm that a measurement sequence is committed to at the beginning. As a special case of this, the measured observables may depend on earlier measurement outcomes.
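Since Table 1 is not reproduced here, the following Python sketch shows what one run of such a sampler might look like in the simplest setting, a single qubit in the eight-state model. The steps are inferred from Lemmas 5 and 7: draw $(\Omega,\gamma)$ from $W_{\rho}$, and for each measured $T_{a}$ with $a\in\Omega$ output $\gamma(a)$ and then replace $\gamma$ by $\gamma+[a,\cdot]$ with probability $1/2$. The case $a\not\in\Omega$ never arises for one qubit and is omitted. The product-form $W_{\rho}$ and all function names are assumptions of this sketch.

```python
import random

# Symplectic labels (x, z) of the one-qubit Pauli observables.
PAULI = {"X": (1, 0), "Y": (1, 1), "Z": (0, 1)}

def symp(a, b):
    """[a, b] = 1 if T_a and T_b anticommute, 0 if they commute."""
    return (a[0] * b[1] + a[1] * b[0]) % 2

def sample_gamma(r, rng):
    """Draw a value assignment from the product-form W for Bloch vector r (assumed choice of W)."""
    return {name: 0 if rng.random() < (1 + ri) / 2 else 1
            for name, ri in zip("XYZ", r)}

def measure(gamma, name, rng):
    """Measure T_a for a in Omega_0: output gamma(a), then keep gamma or shift it by [a, .]."""
    outcome = gamma[name]
    if rng.random() < 0.5:
        for other in gamma:
            gamma[other] ^= symp(PAULI[name], PAULI[other])
    return outcome

def simulate(r, sequence, seed):
    """Return one sample of outcomes for the given sequence of Pauli measurements."""
    rng = random.Random(seed)
    gamma = sample_gamma(r, rng)
    return [measure(gamma, name, rng) for name in sequence]
```

For example, `simulate((0, 0, 1), "ZZ", seed)` returns `[0, 0]` for every seed, and repeating any measurement immediately reproduces its outcome, because the shift $[a,\cdot]$ leaves $\gamma(a)$ unchanged.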

We have the following result.

Theorem 3

If for an initial quantum state $\rho$ it holds that $W_{\rho}\geq 0$ and furthermore $W_{\rho}$ can be efficiently sampled from, then the output distribution of all sequences of Pauli measurements, possibly interspersed with Clifford gates, on $\rho$ can be classically efficiently sampled from.

As a first application of Theorem 3, we return to Example 2, Mermin’s square.

Example 2 continued. How much memory capacity is needed to classically simulate measurements of the observables in Mermin’s square? We first turn to the state-independent case, which was previously discussed in MemCo . The task is to devise a classical algorithm that outputs an outcome sequence for any given sequence of Pauli measurements, which can occur according to quantum mechanics. The measurement sequence can be of any length and the measurements therein may be commuting or anti-commuting. In MemCo , a lower bound on the memory cost of any such simulation was established, $\log_{2}24$ bits; and a specific model was constructed that attains it.

The classical simulation algorithm of Table 1 also saturates this limit. To show this, we use as cnc sets $\Omega$ the six maximal isotropic subspaces of two rebits, cf. Fig. 1 b. This set of sets $\Omega$ is closed under update by Pauli measurement, as described by Eq. (26). For each such set $\Omega$ , each value assignment $\gamma$ is specified by two evaluations (the other evaluations then follow via Eq. (10)). There are thus four functions $\gamma$ for each cnc set $\Omega$ , hence 24 combinations in total, which is the same as in MemCo .

We now turn to the state-dependent version of the problem. How much memory is needed to sample from the correct outcome statistics for arbitrary measurement sequences, for any two-rebit state $\rho$ with $W_{\rho}\geq 0$ , and given the capability to sample from $W_{\rho}$ ? This problem is harder than the former: Not only must the sequence of outcomes be internally consistent for all measurement sequences, but also it needs to represent the state $\rho$ .

Memory cost now depends on the state $\rho$ . If $\rho$ is a mixture of stabilizer states, i.e., the sets $\Omega$ can be limited to $m=0$ , then the classical simulation algorithm of Table 1 can still run on $\log_{2}24\approx 4.58$ bits.

If sets $\Omega$ with $m=1$ are included in the expansion, then more two-rebit states $\rho$ can be positively represented (among them, for example, $|T\rangle_{1}\otimes|T\rangle_{2}$ ) but on the other hand, memory consumption goes up. For $m=1$ , there are $3^{2}\times 2^{3}$ pairs $(\Omega,\gamma)$ , cf. Fig. 1a. Hence the memory consumption for configurations with $m=1$ is $\log_{2}72\approx 6.17$ bits. (Note that the sets $\Omega$ for $m=0$ are not maximal. If sets with $m=1$ are included, then sets with $m=0$ can be omitted without loss.) The size $|{\cal{V}}(\{m\})|$ of the phase space vs. the maximum value of $m$ is displayed in Table 2. The memory cost is $\log_{2}|{\cal{V}}(\{m\})|$ . The volume fraction of positively representable two-rebit and two-qubit states is displayed in Table 3, for various sets $\{m\}$ .
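The memory figures for the two-rebit example follow from counting pairs $(\Omega,\gamma)$ and taking a base-2 logarithm. A minimal sketch of that arithmetic, using the set counts quoted in the text (the variable names are illustrative):

```python
from math import log2

# Number of phase space points (Omega, gamma) for two rebits, as counted in the text.
points_m0 = 6 * 2**2        # six maximal isotropic subspaces, four assignments each
points_m1 = 3**2 * 2**3     # nine sets of type (a), eight assignments each

bits_m0 = log2(points_m0)
bits_m1 = log2(points_m1)
print(f"m = 0: {points_m0} points, {bits_m0:.3f} bits")
print(f"m = 1: {points_m1} points, {bits_m1:.3f} bits")
```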

VI.2 Correctness and efficiency of the classical simulation

In preparation for the proof of correctness of the classical simulation algorithm, we introduce the following notation. Given a probability distribution $W_{\rho}$ , there are two objects that the classical simulation algorithm needs to reproduce correctly, namely the probability $p_{a}(s_{a})$ for the outcomes $s_{a}\in\mathbb{Z}_{2}$ of the measurement of any Pauli observable $T_{a}$ , and the post-measurement state $\rho^{\prime}$ . There are two ways of obtaining these quantities, a quantum-mechanical one and a classical one using the simulation algorithm of Section VI.1.

Regarding the outcome probability $p_{a}(s_{a})$ given $W_{\rho}$ , the quantum mechanical way first obtains the corresponding quantum state $\rho$ from $W_{\rho}$ through Eq. (4). This is represented by a map $R:W_{\rho}\mapsto\rho$ . Second, from $\rho$ , the outcome probability $p_{a}(s_{a})$ is obtained via the Born rule, $p_{a}(s_{a})=\text{Tr}(P_{a}(s_{a})\rho)$ . This is represented by a map $\pi_{q}(a):\rho\mapsto p_{a}$ . The classical way uses the algorithm of Section VI.1 to obtain $p_{a}$ . This is represented by a map $\pi_{c}(a):W_{\rho}\mapsto p_{a}$ .

Likewise, the quantum mechanical way of obtaining the post-measurement state $\rho^{\prime}$ from $W_{\rho}$ proceeds by first applying the map $R$ (Eq. (4)) to obtain $\rho$ , and second by obtaining $\rho^{\prime}$ from $\rho$ through the Dirac projection postulate. The second step is represented by a map $\Pi_{q}(a)$ .

The classical way of obtaining $\rho^{\prime}$ from $W_{\rho}$ proceeds by first using the simulation algorithm to obtain $W_{\rho^{\prime}}$ , and second by mapping $W_{\rho^{\prime}}$ to $\rho^{\prime}$ using the map $R$ . The first step in this procedure is represented by the map $\Pi_{c}(a)$ .

The classical simulation algorithm of Section VI.1 is correct only if the quantum and the classical ways of computing $p_{a}(s_{a})$ and $\rho^{\prime}$ agree. That is, we require the diagrams in Fig. 6 to commute.

Lemma 8

The diagrams of Fig. 6 commute.

Proof of Lemma 8. We discuss the outcome probability and the post-measurement state separately.

Outcome probability $p_{a}(s_{a})$ . The quantum mechanical expression for $p_{a}(s_{a})$ is

[TABLE]

The classical expression $p_{a}^{(c)}(s_{a})$ for $p_{a}(s_{a})$ obtained through the algorithm of Section VI.1 is as follows. If $a\in\Omega$ , then the conditional probability for the outcome $s_{a}$ given the state $(\Omega,\gamma)$ is $\delta_{s_{a},\gamma(a)}$ . If $a\not\in\Omega$ then the conditional probability for the outcome $s_{a}$ is 1/2. Thus,

[TABLE]

By comparing the two expressions, we find that $p_{a}(s_{a})=p_{a}^{(c)}(s_{a})$ for all $a$ , $s_{a}$ , and the left diagram of Fig. 6 thus commutes.
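The agreement of the two routes can be checked numerically in the simplest case. The sketch below is an illustration under stated assumptions, not the paper's general algorithm: it treats a single qubit, assumes $\Omega=\{0,x,y,z\}$ so that every Pauli label lies in $\Omega$ (the branch $a\not\in\Omega$ with probability 1/2 is therefore not exercised), and assumes each phase point operator has Bloch vector $((-1)^{\gamma(x)},(-1)^{\gamma(y)},(-1)^{\gamma(z)})$ . The distribution `W` is hypothetical.

```python
AXES = ("x", "y", "z")

def bloch_vector(W):
    """Bloch vector of rho = sum_gamma W[gamma] * A_gamma (assumed one-qubit form)."""
    return tuple(sum(w * (-1) ** g[i] for g, w in W.items()) for i in range(3))

def born_probability(W, axis, s):
    """Quantum route: Tr(P_a(s) rho) = (1 + (-1)^s <T_a>) / 2."""
    r = bloch_vector(W)
    return (1 + (-1) ** s * r[AXES.index(axis)]) / 2

def classical_probability(W, axis, s):
    """Classical route: each phase point contributes delta_{s, gamma(a)}, weighted by W."""
    i = AXES.index(axis)
    return sum(w for g, w in W.items() if g[i] == s)

# Hypothetical non-negative distribution; keys are (gamma(x), gamma(y), gamma(z)).
W = {(0, 0, 0): 0.5, (0, 1, 1): 0.3, (1, 0, 1): 0.2}

max_gap = max(abs(born_probability(W, ax, s) - classical_probability(W, ax, s))
              for ax in AXES for s in (0, 1))
```

Here `max_gap` vanishes up to floating-point rounding, which is the one-qubit instance of the commuting left diagram.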

Post-measurement state $\rho^{\prime}$ . The quantum mechanical expression for the post-measurement state $\rho^{\prime}$ has already been given in Eq. (32), and we now derive the corresponding expression $\rho^{\prime}_{(c)}$ that follows from the classical simulation algorithm.

We consider the joint probability $p((\Omega^{\prime},\gamma^{\prime})\cap s_{a})$ of obtaining the outcome $s_{a}$ in the measurement of $T_{a}$ and ending up in the state $(\Omega^{\prime},\gamma^{\prime})$ . We may invoke conditional probabilities in two ways,

[TABLE]

Noting that $p((\Omega^{\prime},\gamma^{\prime})|s_{a})=W_{\rho^{\prime}}(\Omega^{\prime},\gamma^{\prime})$ , and equating the two above expressions we find

[TABLE]

We now infer the conditional probabilities $p((\Omega^{\prime},\gamma^{\prime})\cap s_{a}|(\Omega,\gamma))$ from the classical simulation algorithm of Section VI.1,

[TABLE]

Inserting this into Eq. (35), and using the resulting expression in Eq. (4), i.e. applying the map $R$ , we obtain

[TABLE]

Comparing the last expression with Eq. (32), we find that $\rho^{\prime}_{(c)}=\rho^{\prime}$ for all $a$ , $s_{a}$ , and the right diagram in Fig. 6 thus commutes. $\Box$

Proof of Theorem 3. As explained in Section V, we only need to discuss sequences of Pauli measurements. For those, we show that the algorithm of Table 1 is correct, and, if the initial $W_{\rho}$ can be efficiently sampled from, it is also computationally efficient. (i) Correctness. Denote by $\rho(t)$ the state before the $t$ -th measurement. With Lemma 8, by induction on the right diagram in Fig. 6, if $W_{\rho(1)}$ represents the initial state $\rho(1)$ , then $W_{\rho(t)}$ represents $\rho(t)$ for all time steps $t=1,..,N$ . Then, by the left diagram in Fig. 6, the outcome probabilities $p_{a_{t}}(s_{a_{t}}|\textbf{s}_{\prec t})$ , with $\textbf{s}_{\prec t}=(s_{a_{1}},..,s_{a_{t-1}})$ the measurement record prior to time $t$ , are also correct. Thus the joint outcome probability sampled from

[TABLE]

is also correct.

(ii) Efficiency. We recall that all cnc sets $\Omega$ are unions of $O(n)$ isotropic spaces $\Omega_{i}$ (Theorem 1). Further, each $\Omega_{i}$ defines a stabilizer group

[TABLE]

This allows us to describe $(\Omega,\gamma)\in\mathcal{V}$ using polynomial memory by storing $O(n)$ stabilizer tables of size $O(n^{2})$ Goma ; Gottesman99Heisenberg . Indeed, by Defs. 2-3 and Lemma 3, $T_{\Omega_{i}}^{\gamma}$ is a closed commutative group. Furthermore, with Def. 3, it holds that $T_{a}^{\gamma}T_{b}^{\gamma}=T_{a+b}^{\gamma},\forall a,b\in\Omega_{i}$ . This implies the existence of a non-trivial stabilized subspace: $P_{\Omega_{i}}^{\gamma}:=\sum_{a\in\Omega_{i}}T_{a}^{\gamma}/|\Omega_{i}|$ is a common +1-eigenprojector of every $T_{a}\in T_{\Omega_{i}}^{\gamma}$ as $T_{a}^{\gamma}P_{\Omega_{i}}^{\gamma}=\sum_{b\in\Omega_{i}}\frac{T_{a+b}^{\gamma}}{|\Omega_{i}|}=\sum_{b^{\prime}\in\Omega_{i}}\frac{T_{b^{\prime}}^{\gamma}}{|\Omega_{i}|}=P_{\Omega_{i}}^{\gamma},\forall T_{\Omega_{i}}^{\gamma}$ , which also implies ${P_{\Omega_{i}}^{\gamma}}^{2}=P_{\Omega_{i}}^{\gamma}$ .

We now note that the update rules in algorithm 1, namely (i) checking whether $a\in\Omega$ , (ii) evaluating $\gamma$ on $a\in\Omega$ , (iii) updating $\gamma\longrightarrow\gamma+[a,\cdot]$ , (iv) $\Omega\longrightarrow\Omega\times a$ and (v) $\gamma\longrightarrow\gamma\times s_{a}$ , implement tasks that admit efficient classical algorithms in the stabilizer formalism Goma ; Gottesman99Heisenberg . Rules (i) and (ii): To test $a\in\Omega$ , we check whether $a\in\Omega_{i}$ for $i=1,\ldots,O(n)$ . If $a\in\Omega_{j}$ for some value of $j$ , then $\gamma(a)$ is computed as the bit determining the phase of the stabilizer operator $T_{a}^{\gamma}\in T_{\Omega_{j}}^{\gamma}$ . Both tasks can be solved efficiently classically via Gaussian elimination given the stabilizer table data Goma ; Gottesman99Heisenberg . Rule (iii): $\gamma$ is updated to $\gamma^{\prime}=\gamma+[a,\cdot]$ by (classically efficiently) evaluating $\gamma(\cdot)+[a,\cdot]$ on the generators of every $\Omega_{i}$ . Rules (iv) and (v): For all $j$ , $T_{\Omega_{j}\times a}^{\gamma|_{\Omega_{j}}\times{s_{a}}}$ is the stabilizer group resulting from the measurement of $T_{a}$ with outcome $s_{a}$ on a state with stabilizer group $T_{\Omega_{j}}^{\gamma|_{\Omega_{j}}}$ . This update can be performed efficiently by applying the standard measurement update rule of Refs. Goma ; Gottesman99Heisenberg to every stabilizer table in the description of $(\Omega,\gamma)$ . Thus, all steps of the algorithm run in polynomial time. $\Box$
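Rule (i) reduces to a linear-algebra question over $\mathbb{Z}_{2}$ : is $a$ in the span of the generators of some $\Omega_{i}$ ? The following sketch shows only this membership test by Gaussian elimination. It is not the stabilizer-table machinery of the cited references: the encoding of a label $a\in\mathbb{Z}_{2}^{2n}$ as a Python integer bitmask and the function names are assumptions made for illustration, and phases ( $\gamma$ ) are not tracked.

```python
def _reduce(v, pivots):
    """Eliminate leading bits of v using the pivot vectors (arithmetic over GF(2))."""
    while v:
        top = v.bit_length() - 1        # position of the leading 1-bit
        if top not in pivots:
            break                       # no pivot can cancel this bit
        v ^= pivots[top]                # XOR is addition mod 2
    return v

def gf2_basis(generators):
    """Echelon basis for the span of `generators`, stored as {leading bit: vector}."""
    pivots = {}
    for g in generators:
        g = _reduce(g, pivots)
        if g:
            pivots[g.bit_length() - 1] = g
    return pivots

def in_span(generators, a):
    """True iff the bit-vector a lies in the GF(2) span of the generators."""
    return _reduce(a, gf2_basis(generators)) == 0

# Toy example with 4-bit labels (hypothetical generators of one isotropic space).
gens = [0b0011, 0b0101]
```

For example, `in_span(gens, 0b0110)` holds because `0b0110` is the XOR of the two generators, while `0b1000` is outside the span. With $2n$ -bit labels and at most $n$ generators per space, each test costs a number of word operations polynomial in $n$ .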

VII The case of $W_{\rho}<0$

As we have established in the previous sections, $W_{\rho}<0$ is a precondition for quantum speedup. When the initial state is represented by a quasiprobability rather than a true probability function, a standard problem of interest is estimating outcome probabilities for sequences of measurements. An established method for probability estimation is that of Ref. Pashayan , which utilizes the Hoeffding bound. Note that probability estimation is a different problem than weak simulation VdN1 , and is not efficiently adaptive.

VII.1 Robustness

In close analogy to the “robustness of magic” RoM $\mathfrak{R}_{S}$ (the subscript $S$ is for “stabilizer”), we define a phase space robustness $\mathfrak{R}$ , through

[TABLE]

with $\langle{\cal{A}},W\rangle:=\sum_{\alpha\in{\cal{V}}}W_{\alpha}A_{\alpha}$ .

Since the definitions of the robustness $\mathfrak{R}$ and of the robustness of magic $\mathfrak{R}_{S}$ RoM are so similar, one may wonder if there is a relation between them. This is indeed the case; namely, we have the following result.

Lemma 9

For all quantum states $\rho$ , of any number $n$ of qubits, the phase space robustness $\mathfrak{R}(\rho)$ and the robustness of magic $\mathfrak{R}_{S}(\rho)$ are related via

[TABLE]

Thus, the phase space robustness $\mathfrak{R}$ is never larger than the robustness of magic, but can only be moderately smaller. The proof of Lemma 9 is given in Appendix C.

VII.2 Hardness of classical simulation

The Hoeffding bound says that the number $N$ of samples required to estimate the output probability distribution up to an error $\epsilon$ scales as $N\sim{\cal{M}}^{2}/\epsilon^{2}$ , where ${\cal{M}}$ is a measure of the negativity contained in the quantum process. In our case, the operations are positivity-preserving, and all negativity comes from the initial state. The algorithm of Pashayan et al. Pashayan , when applied to our setting, says that the number $N$ of samples required to estimate the output probability scales as

[TABLE]

Thus, the robustness $\mathfrak{R}(\rho_{\text{init}})$ of the initial state $\rho_{\text{init}}$ is the critical parameter determining the classical hardness of probability estimation.
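To make the scaling concrete, the textbook Hoeffding inequality can be turned into an explicit sample count. The sketch below assumes that each single-shot estimate is bounded in $[-{\cal{M}},{\cal{M}}]$ with ${\cal{M}}$ identified with the 1-norm of the initial quasiprobability distribution; the constants are those of the standard two-sided Hoeffding bound and are not claimed to match the prefactors of the equation above. The failure probability `delta` is a parameter introduced here.

```python
import math

def hoeffding_samples(one_norm, epsilon, delta):
    """Number of i.i.d. samples sufficient for the empirical mean of estimates
    bounded in [-one_norm, +one_norm] to lie within epsilon of the true mean
    with probability at least 1 - delta (two-sided Hoeffding bound)."""
    return math.ceil(2 * one_norm**2 * math.log(2 / delta) / epsilon**2)

# Positive representation (1-norm equal to 1) versus a 1-norm of 2.
n_positive = hoeffding_samples(1.0, 0.1, 0.05)
n_negative = hoeffding_samples(2.0, 0.1, 0.05)
```

Doubling the 1-norm quadruples the sample count, which is the quadratic dependence on the robustness stated above.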

The same relation, with the robustness $\mathfrak{R}$ replaced by the robustness of magic $\mathfrak{R}_{S}$ holds for the classical simulation based on quasiprobability distributions over stabilizer states RoM . Lemma 9 above is therefore of interest for relating the operational costs of the two simulation methods.

Classical simulation also requires a quasiprobability function $W_{\mu^{\otimes n}}$ for $n$ copies of the magic state $\mu$ . Since the $n$ -qubit phase space is large, the numerical optimization to obtain the least-negative expansion $W_{\mu^{\otimes n}}$ is computationally costly. However, we can apply a similar splitting into smaller blocks of magic states as in the stabilizer case RoM . The computational cost for providing the expansion is then a function of block size rather than total number of copies $n$ . The 1-norm of the resulting expansion is smaller than that of the stabilizer expansion, by a factor that is constant in $n$ . Details are given in Appendix D.

VII.3 Elements of a resource theory based on $W$

It is illuminating to discuss QCM within the framework of resource theories. Every resource theory has three main operational components BG15 , (i) the resource(s), (ii) the non-resources, or free states, (iii) the free operations.

In the physical setting of our interest, the resources are quantum states which cannot be positively represented by $W$ (cf. Theorem 3). The free operations are Clifford unitaries and Pauli measurements. The free states are those that can be created by the free operations from a completely mixed state, i.e., all mixtures of stabilizer states.

We observe that there is a third class of states which are neither resources nor free, namely the positively representable states which are not mixtures of stabilizer states. Such states are called (iv) bound magic states. We have seen an example of them in Section IV.4, the general 1-qubit states tensored with a stabilizer state on arbitrarily many qubits.

The reason for calling those states “bound magic” is that they cannot be distilled into computationally useful ones by free operations. In our setting, by Theorem 2, positive representability is an invariant under the free operations. Hence, bound states can only be converted into other bound states or into free states by the free operations, but never into a resource.

The question of inter-convertibility may more generally be asked for resource states. To facilitate this discussion, one may identify monotones, i.e., real-valued functions on the state space that never increase under the free operations. The main result of this section is that the robustness $\mathfrak{R}$ , defined in Eq. (37) and already known to measure hardness of classical simulation by sampling, is a monotone.

Theorem 4

The robustness $\mathfrak{R}$ is a monotone under all Clifford unitaries and Pauli measurements.

As part of the proof of Theorem 4, we now discuss an important structural property of the quasiprobability function $W$ , namely its covariance under Clifford unitaries. Let $\text{Cl}_{n}$ be the $n$ -qubit Clifford group. It acts on the $n$ -qubit Pauli operators via

[TABLE]

This relation simultaneously defines the phase function $\Phi$ and the action of $\text{Cl}_{n}$ on $E$ . It implies an action of the Clifford group on the phase point operators $A_{\Omega}^{\gamma}$ , which in turn induces an action on the sets $\Omega$ and the functions $\gamma$ , via

[TABLE]

Therein, the set $\Omega^{\prime}$ is defined as $\Omega^{\prime}:=\{ha,\;a\in\Omega\}$ , and the function $\gamma^{\prime}:\Omega^{\prime}\longrightarrow\mathbb{Z}_{2}$ is given by

[TABLE]

Henceforth we denote $\Omega^{\prime}$ as $h\cdot\Omega$ and $\gamma^{\prime}$ as $h\cdot\gamma$ , to emphasize the dependence on $h\in\text{Cl}_{n}$ .

For use in the proof below we quote Lemma 3 from Coho which says that, for any face $(a,b)\in\Omega\times\Omega$ ,

[TABLE]

We then have the following result.

Lemma 10

${\cal{V}}$ is mapped to itself under $\text{Cl}_{n}$ , and the quasiprobability function $W$ transforms covariantly. That is, if the state $\rho$ can be described by $W_{\rho}$ through Eq. (4), then for any $h\in\text{Cl}_{n}$ the state $h\rho h^{\dagger}$ can be described by a quasiprobability function $W_{h\rho h^{\dagger}}$ defined by

[TABLE]

Remark 3: We say “the state $\rho$ can be described by $W_{\rho}$ ” rather than “is described” because $W_{\rho}$ is not unique.

Proof of Lemma 10. First, we show that the phase space ${\cal{V}}$ is closed under the action of $\text{Cl}_{n}$ , i.e., if $(\Omega,\gamma)\in{\cal{V}}$ then $(\Omega^{\prime},\gamma^{\prime})\in{\cal{V}}$ . The four items in Definition 4 need to be checked. (i) Closedness under inference. Assume that $c,d\in\Omega^{\prime}$ and $[c,d]=0$ . Then there exist $a,b\in\Omega$ such that $c=ha$ , $d=hb$ and $[a,b]=0$ . Then, $c+d=ha+hb=h(a+b)\in\Omega^{\prime}$ , since $a+b\in\Omega$ by the assumption of closedness. Hence $\Omega^{\prime}$ is closed under inference.

(iii) $\gamma^{\prime}$ satisfies Eq. (10). With the definition of $\gamma^{\prime}$ we have (all addition mod 2)

[TABLE]

Therein, in the second line we have used Eq. (10). Thus, $\gamma^{\prime}$ satisfies Eq. (10) on its domain.

(ii) $\Omega^{\prime}$ is non-contextual. With $\gamma^{\prime}$ we have just proved the existence of a function on $\Omega^{\prime}$ that satisfies Eq. (10).

(iv) $\gamma^{\prime}$ satisfies Eq. (3). Since $\gamma$ satisfies Eq. (3), it follows $I=h(I)=h\left((-1)^{\gamma(0)}T_{0}\right)=(-1)^{\gamma(0)+\Phi_{h}(0)}T_{0}=(-1)^{\gamma^{\prime}(0)}T_{0}$ . Eq. (3) is thus satisfied for $\gamma^{\prime}$ .

Hence, if $(\Omega,\gamma)\in{\cal{V}}$ then $(\Omega^{\prime},\gamma^{\prime})\in{\cal{V}}$ , as claimed.

Next we turn to the covariance of $W$ under $\text{Cl}_{n}$ . We have

[TABLE]

Comparing the last expression with the expansion Eq. (4) for $h\rho h^{\dagger}$ , we find that for all $h\in\text{Cl}_{n}$ , the quasiprobability distribution $W_{h\rho h^{\dagger}}$ defined by

[TABLE]

describes the state $h\rho h^{\dagger}$ . This is the covariance condition. $\Box$
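Covariance can be illustrated for a single qubit and the Hadamard gate. The sketch below again assumes the one-qubit labelling in which a phase point $(\gamma(x),\gamma(y),\gamma(z))$ has Bloch vector $((-1)^{\gamma(x)},(-1)^{\gamma(y)},(-1)^{\gamma(z)})$ ; the relabelling rule is derived from $HXH=Z$ , $HYH=-Y$ , $HZH=X$ , and the quasiprobability `W` is a hypothetical, non-optimal example with one negative entry.

```python
def bloch_vector(W):
    """Bloch vector of the operator sum_gamma W[gamma] * A_gamma."""
    return tuple(sum(w * (-1) ** g[i] for g, w in W.items()) for i in range(3))

def hadamard_on_bloch(r):
    """Action of rho -> H rho H on the Bloch vector: (rx, ry, rz) -> (rz, -ry, rx)."""
    rx, ry, rz = r
    return (rz, -ry, rx)

def hadamard_on_phase_point(g):
    """Induced relabelling of phase points: swap the x and z bits, flip the y bit."""
    gx, gy, gz = g
    return (gz, 1 - gy, gx)

def push_forward(W):
    """Covariant update: the new distribution assigns the weight W(alpha) to h.alpha."""
    return {hadamard_on_phase_point(g): w for g, w in W.items()}

def one_norm(W):
    return sum(abs(w) for w in W.values())

W = {(0, 0, 0): 0.5, (1, 1, 0): 0.3, (0, 1, 1): 0.3, (1, 0, 1): -0.1}
W_h = push_forward(W)
```

In this example `bloch_vector(W_h)` coincides with `hadamard_on_bloch(bloch_vector(W))`, and `one_norm` is unchanged because the relabelling is a permutation of phase points.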

We are now ready to prove the monotonicity of $\mathfrak{R}$ , as stated in Theorem 4.

Proof of Theorem 4. (a) Clifford unitaries. With Lemma 10, we have that for any $n$ -qubit Clifford gate $h$ applied to any $n$ -qubit state $\rho$ , the quasiprobability distribution $W_{h\rho h^{\dagger}}$ can be related to $W_{\rho}$ via the covariance condition Eq. (39). Since $W$ is non-unique, there may a priori be a representation $W^{\prime}_{h\rho h^{\dagger}}$ with smaller 1-norm, and thus it holds that

[TABLE]

(b) Pauli measurements. We consider the measurement of a Pauli observable $T_{a}$ on a quantum state $\rho$ . Denote by $\rho_{a,s_{a}}$ the normalized post-measurement states for the outcomes $s_{a}=0,1$ , respectively. We have to show that, for all $n$ , for all $a\in\mathbb{Z}_{2}^{n}\times\mathbb{Z}_{2}^{n}$ and all $n$ -qubit states $\rho$ it holds that

[TABLE]

With Eq. (33), we can write $p_{a}(0)\,W_{\rho_{a,0}}=W_{+}+\overline{W}_{+}$ , and $p_{a}(1)\,W_{\rho_{a,1}}=W_{-}+\overline{W}_{-}$ , where

[TABLE]

From now on, denote by $W_{\rho}$ the optimal representation for $\rho$ w.r.t. 1-norm, i.e., $\mathfrak{R}(\rho)=\left\|W_{\rho}\right\|_{1}$ . With the triangle inequality, and the fact that the functions $W_{\rho_{a,s_{a}}}$ induced from the optimal $W_{\rho}$ through Eq. (42) need not be optimal for the states $\rho_{a,s_{a}}$ w.r.t. their 1-norm, it holds that $p_{a}(0)\,\mathfrak{R}(\rho_{a,0})\leq\left\|W_{+}\right\|_{1}+\left\|\overline{W}_{+}\right\|_{1}$ , and $p_{a}(1)\,\mathfrak{R}(\rho_{a,1})\leq\left\|W_{-}\right\|_{1}+\left\|\overline{W}_{-}\right\|_{1}$ , hence

[TABLE]

With Eq. (42) we find that

[TABLE]

where in the second line we used the triangle inequality again. Furthermore, performing the summation over all $(\Omega^{\prime},\gamma^{\prime})\in{\cal{V}}$ first, we obtain

[TABLE]

Inserting the last two relations into Ineq. (43), we arrive at

[TABLE]

Since $\mathfrak{R}(\rho)=\left\|W_{\rho}\right\|_{1}$ by assumption, Eq. (41) follows. $\Box$

VII.4 Numerical results

In Table 4 and Fig. 7 we present numerical values for the robustness of various magic states, and compare them to the robustness of magic as defined by Howard and Campbell RoM . Our calculations use the software packages CVXPY cvxpy and GUROBI gurobi . Table 4 summarizes the robustness comparisons for the common magic states, as well as the maximal-robustness Hoggar state RoM . In Fig. 7 we plot the robustness against the stabilizer state robustness for three qubits, as a function of rotation angle. Note the wide and almost flat (though not perfectly flat) plateaus of robustness $\mathfrak{R}$ in the vicinity of stabilizer states.

VII.5 Curious resurgence of $4^{n}$ -dimensional phase space

Numerical calculations of robustness for various quantum states revealed an unexpected feature. Namely, the optimal quasiprobability distribution $W_{\rho}$ w.r.t. Eq. (37) for a given $n$ -qubit state $\rho$ was always non-zero on at most $4^{n}$ phase space points. $4^{n}$ is only a tiny fraction of the whole phase space ${\cal{V}}$ ; it is, furthermore, the phase space size one would naively expect if one were completely oblivious of the differences between even and odd $d$ . However, the support of the optimal $W_{\rho}$ depends on the state $\rho$ . We can now explain the initially puzzling upper bound on the size of the support, $4^{n}$ .

The robustness $\mathfrak{R}$ of a state $\rho$ defined in Eq. (37) is the solution to the convex optimization problem

[TABLE]

where $M_{i,j}=\text{Tr}(A_{\alpha_{j}}P_{i})$ , $b_{i}=\text{Tr}(\rho P_{i})$ , $\{\alpha_{j}:1\leq j\leq|\mathcal{V}|\}$ is an enumeration of the phase points and $P_{i}$ are the $n$ -qubit Pauli operators. For each variable $q_{j}$ in Eq. (44), define two new variables $q_{j}^{+}\coloneqq\max(0,q_{j})$ and $q_{j}^{-}\coloneqq\max(0,-q_{j})$ . Then the convex optimization problem of Eq. (44) is equivalent to the standard form linear program

[TABLE]

where $\tilde{M}=\begin{bmatrix}M&-M\end{bmatrix}$ and $\tilde{\textbf{q}}=\begin{bmatrix}(\textbf{q}^{+})^{T}&(\textbf{q}^{-})^{T}\end{bmatrix}^{T}$ . This doubles the number of variables but does not change the number of equality constraints. Since we know this problem is feasible (any physical state can be written as an affine combination of phase point operators) and bounded (no physical state can have robustness less than 1), by the fundamental theorem of linear programming, for any physical state, Eq. (45) has a solution at a vertex of the feasible polytope opt .

Since Eq. (45) has an equality constraint for each $n$ -qubit Pauli operator (including the identity), this means any state $\rho$ has a robustness-minimizing expansion in phase point operators with no more than $4^{n}$ non-zero coefficients.
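As a small worked instance, consider a single qubit under the assumption (not restated in this section) that the phase point operators are $A^{\gamma}=\frac{1}{2}\left(I+(-1)^{\gamma(x)}X+(-1)^{\gamma(y)}Y+(-1)^{\gamma(z)}Z\right)$ , one for each of the eight sign choices. With $P_{i}\in\{I,X,Y,Z\}$ and the columns ordered by $(\gamma(x),\gamma(y),\gamma(z))=000,001,\ldots,111$ , the constraint data would read

```latex
M =
\begin{bmatrix}
 1 &  1 &  1 &  1 &  1 &  1 &  1 &  1 \\
 1 &  1 &  1 &  1 & -1 & -1 & -1 & -1 \\
 1 &  1 & -1 & -1 &  1 &  1 & -1 & -1 \\
 1 & -1 &  1 & -1 &  1 & -1 &  1 & -1
\end{bmatrix},
\qquad
\textbf{b} =
\begin{bmatrix}
 1 \\ \text{Tr}(\rho X) \\ \text{Tr}(\rho Y) \\ \text{Tr}(\rho Z)
\end{bmatrix}.
```

The standard-form program then has 16 non-negative variables but only $4=4^{1}$ equality constraints, so a vertex solution has at most four non-zero entries, in line with the general bound.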

VIII Discussion

VIII.1 Stratonovich-Weyl correspondence

In the field of quantum optics, an important set of criteria for a proper quasiprobability distribution over a phase space is given by the Stratonovich-Weyl (SW) correspondence. Denote by $F_{A}^{(s)}:X\longrightarrow\mathbb{R}$ the quasiprobability distribution corresponding to the operator $A$ , with $X$ the phase space and $s$ a real parameter in the interval $[-1,1]$ . In the standard formalism for infinite-dimensional Hilbert spaces, $s=-1,0,1$ correspond to the Glauber-Sudarshan $P$ , Wigner, and Husimi $Q$ function, respectively. The SW correspondence is the following set of criteria on the $F_{A}^{(s)}$ Strato ; also see Brif ,

(0) Linearity: $A\longrightarrow F_{A}^{(s)}$ is a one-to-one linear map.

(1) Reality:

[TABLE]

(2) Standardization:

[TABLE]

(3) Covariance:

[TABLE]

with $G$ the dynamical symmetry group.

(4) Traciality:

[TABLE]

We now investigate to which extent these SW criteria apply to the present quasiprobability function $W$ . There are two deviations. First, the present quasiprobability function $W$ does not come with a parameter $s$ ; there is only a single function $W$ . This will affect the formulation of traciality. Second, the present mapping $A\longrightarrow W_{A}$ is one-to-many, as we have noted in Section III.1. The mapping is nonetheless linear in the sense that $A+B$ can be represented as $W_{A}+W_{B}$ .

The remaining SW conditions do apply. (1) Reality of $W$ follows directly from the definition Eq. (4), since all $A_{\Omega}^{\gamma}$ are Hermitian. (2) Standardization: The definition Eq. (2) and property Eq. (3) of the phase point operators imply $\text{Tr}\,A_{\Omega}^{\gamma}=1$ , for all $\Omega$ and $\gamma$ ; and standardization then follows from Eq. (4).

(3) Covariance holds for the entire Clifford group, as stated in Lemma 10. In fact, insisting on Clifford covariance leads to the non-uniqueness of $W$ . Namely, an over-complete set of phase point operators is necessary to achieve Clifford covariance Zhu .

(4) Traciality. In the absence of a continuously varying parameter $s$ , we define a dual quasiprobability function $\tilde{W}$ in addition to $W$ , to stand in for $F^{(-s)}$ . For all Pauli operators $T_{a}$ we have

[TABLE]

Since the $n$ -qubit Pauli operators form an operator basis, $\tilde{W}$ can be extended to all $n$ qubit operators by linearity. With Eq. (34) we then have

[TABLE]

We thus satisfy the SW criteria (1) - (4).

To conclude, we reiterate that for the present purpose of classically simulating QCM, a crucial property of $W$ is positivity preservation under Pauli measurement. This property has no counterpart in the Stratonovich-Weyl correspondence.

VIII.2 Probabilistic hidden variable model

In the case of odd $d$ NegWi , there is a third equivalent indicator of classicality, next to positivity of the initial Wigner function and the efficiency of classical simulation of QCM by sampling. Namely, a positive Wigner function is equivalent to a non-contextual hidden variable model (HVM) with deterministic value assignments Howard . This triple coincidence cannot be replicated in $d=2$ , because, for $n\geq 2$ all quantum states—even the completely mixed state—are contextual Howard .

One interpretation of this situation is that contextuality, i.e., the unviability of non-contextual HVMs, is not sufficiently tight a criterion to reveal genuine quantumness. A more stringent marker is required, which (i) classifies the present HVM as classical, and (ii) for QCM in odd $d$ reduces to contextuality. At present, we have no suggestion for this more restrictive notion of quantumness. However, we point to a hidden variable model that is illustrative of the shifted quantum-to-classical boundary in the multi-qubit case, and we propose it for further study.

Namely, when positive, the quasiprobability distribution $W$ can be considered an HVM. While classified as contextual by the common definitions, it shares many features with non-contextual HVMs.

This HVM consists of a triple $(\Lambda,\{h^{\lambda}\},p_{\lambda})$ where $\Lambda={\cal{V}}$ or ${\cal{V}}_{M}$ , $h^{\lambda}$ is a compatible family of distributions on the set of outcomes on contexts and $p_{\lambda}$ is a probability distribution on the set $\Lambda$ of hidden variables. For each $\alpha=(\Omega,\gamma)$ we define $h^{\alpha}$ by

[TABLE]

Therein, $I$ is any isotropic subspace, $s:I\to\mathbb{Z}_{2}$ is a function, and $P_{s}$ is the projector corresponding to the outcome. Note that $P_{s}=0$ if $ds\neq\beta$ .

It is useful to state the probability distributions $h_{I}^{\alpha}(\cdot)$ in their explicit form. Let $P_{s}$ denote the projector corresponding to the non-contextual value assignment $s:I\to\mathbb{Z}_{2}$ . Then we have

[TABLE]

From Eq. (47) we see that the value assignments in our HVM are generally probabilistic; only in the special case of $I\subset\Omega$ they become deterministic. Further, the $\{h_{I}^{\alpha}\}$ form compatible families,

[TABLE]

When applicable, this HVM reproduces the predictions of quantum mechanics (cf. Theorem 3) for measurements of Pauli observables, in single contexts or arbitrary measurement sequences.

We argue that the HVM of Eq. (46) is classical. It is an HVM with partial value assignments, with deterministic values for some observables and random values for others. The only resource this HVM uses beyond those required by non-contextual HVMs with deterministic value assignments is that of classical uniform randomness (in the evaluation of value assignments). Such use of randomness should not render the present HVM genuinely quantum.

And yet, the HVM of Eq. (46) is (a) contextual in the sense of Abramsky and Brandenburger AB11 , (b) preparation and transformation contextual, as well as measurement-non-contextual, in the sense of Spekkens Spekkens , and (c) contextual for sequences of transformations in the recently-introduced sense of Mansfield and Kashefi Mansfield18 .

To summarize, we have described a hidden variable model corresponding to positive quasiprobabilities $W$ . By this correspondence, the HVM is considered classical from the quantum optics perspective. It is also classical from the computational perspective, as it leads to efficient classical simulation of QCM (for applicable magic states). And yet this HVM is contextual, per the definitions commonly applied. As such, the present HVM may serve as a reference point for a refined foundational notion of quantumness that goes beyond contextuality.

IX Conclusion

We have introduced a quasiprobability distribution $W$ over generalized phase space, which is defined for any number $n$ of qudits with any number $d$ of levels. For multi-qudit systems with odd local dimension $d$ , $W$ reduces to the familiar Wigner function for finite-dimensional systems defined by Gross Gross . For even $d$ , the phase space is enlarged and $W$ becomes non-unique. Importantly, also for $d=2$ (the multi-qubit case), $W$ has the property that a positive quasiprobability function remains positive under all Pauli measurements. This property is crucial for classical simulation algorithms of quantum computation with magic states (QCM) by sampling.

Once this fundamental property is established, it is natural to investigate the efficiency (or non-efficiency) of classical simulation in the various regimes, and resource theories characterizing QCM. Here we have treated the canonical questions that arise in this context: we have devised an efficient classical simulation of QCM for $W\geq 0$ , and clarified the relation to the qubit stabilizer formalism. Namely, the present method for efficient classical simulation of QCM strictly contains the stabilizer method. It applies to all mixtures of stabilizer states, and in addition to certain states outside the stabilizer polytope. We have further characterized the hardness of classical simulation for $W<0$ in terms of a robustness measure, and established that this robustness is a monotone under the free operations of QCM.

In summary, we arrive at a resource perspective of QCM on qubits that closely resembles the corresponding picture for odd dimension $d$ . However, there are two deviations. First, the phase space on which the quasiprobability function $W$ is defined has a far more intricate structure for $d=2$ than for odd $d$ . Second, for $d=2$ the hidden variable model (HVM) induced by any non-negative quasiprobability function $W_{\rho}$ is contextual, as a consequence of Mermin’s square.

The latter observation leads to a puzzle. The HVM induced for positively representable states $\rho$ is classified as “classical” from the perspectives of quantum optics ( $W_{\rho}\geq 0$ ) and computer science (classical simulation is efficient), but it is classified as “quantum” from the perspective of contextuality.

In this regard, we have argued (also see Howard ) that in multi-qubit QCM, contextuality is not suitable as an indicator of genuine quantumness. We have proposed the notion of “HVM with partial non-contextual value assignments” in which classicality and contextuality coexist.

Acknowledgments. We thank Piers Lillystone (JBV, CO, RR, ET) and Shane Mansfield (JBV) for discussion. This work is funded by NSERC (CO, RR, ET, MZ), CIFAR (RR) and ERC (JBV).

Appendix A Proof of Lemma 1

Recall that, unlike most material in this paper, Lemma 1 holds for all local dimensions $d$ .

Proof of Lemma 1. Consider two sets, $\Omega,\tilde{\Omega}\in{\cal{V}}$ , such that $\Omega\subset\tilde{\Omega}$ , and the phase point operator $A_{\Omega}^{\gamma}$ according to Eq. (2). Furthermore, denote by $\tilde{\Gamma}$ the set of value assignments $\tilde{\gamma}:\tilde{\Omega}\longrightarrow\mathbb{Z}_{d}$ that satisfy the constraint

[TABLE]

Then, $\tilde{\Gamma}$ is the coset of a vector space $U$ . This is the first fact we prove. Write $\tilde{\gamma}=\tilde{\gamma}_{0}+\eta$ , where $\tilde{\gamma}_{0}\in\tilde{\Gamma}$ is some reference function, and the functions $\eta\in U$ all satisfy

[TABLE]

The condition of Eq. (48a) need only be satisfied for commuting pairs of elements in $\tilde{\Omega}$ . From Eq. (48) it follows that if $\eta,\eta^{\prime}\in U$ then $c\eta+c^{\prime}\eta^{\prime}\in U$ , for all $c,c^{\prime}\in\mathbb{Z}_{d}$ . Hence $U$ is indeed a vector space, as claimed.

Key is the relation

[TABLE]

which we now prove, armed with the previous observation. Using the definition of the phase point operators, we start expanding the r.h.s. of Eq. (49).

[TABLE]

Now we consider two cases. (i) $a\in\Omega$ . Then, with property Eq. (48b),

[TABLE]

Furthermore, note $|\tilde{\Gamma}|=|U|$ .

(ii) $a\in\tilde{\Omega}\backslash\Omega$ . There is at least one $\eta\in U$ with $\eta(a)\neq 0$ . Since $U$ is a vector space, it follows by character orthogonality that

[TABLE]
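The character-orthogonality step can be checked by brute force on a small example. In the sketch below, the elements $\eta\in U$ are modelled as tuples of values in $\mathbb{Z}_{d}$ (one entry per element of $\tilde{\Omega}$ ), $U$ is the $\mathbb{Z}_{d}$ -span of a few hypothetical generators, and $\omega=e^{2\pi i/d}$ ; none of these names or generators come from the paper.

```python
import cmath
from itertools import product

def character_sum(generators, a, d):
    """Return (sum over eta in U of omega^{eta(a)}, |U|), where U is the Z_d-span
    of `generators` and `a` indexes one coordinate of the tuples."""
    omega = cmath.exp(2j * cmath.pi / d)
    length = len(generators[0])
    span = set()
    for coeffs in product(range(d), repeat=len(generators)):
        eta = tuple(sum(c * g[k] for c, g in zip(coeffs, generators)) % d
                    for k in range(length))
        span.add(eta)
    return sum(omega ** eta[a] for eta in span), len(span)

# Hypothetical generators for d = 3: coordinate 1 vanishes on all of U.
gens = [(1, 0, 2), (0, 0, 1)]
total_0, size = character_sum(gens, 0, 3)   # some eta has eta(a) != 0
total_1, _ = character_sum(gens, 1, 3)      # eta(a) = 0 for every eta
```

Here `total_0` vanishes up to rounding, as in case (ii), whereas `total_1` equals $|U|=9$ , as in case (i).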

Inserting Eqs. (50) and (51) in the above expansion, and furthermore using property Eq. (48b), we find

[TABLE]

This proves Eq. (49). Now, wlog. we may choose $\tilde{\Omega}$ to be maximal. Since by definition any set $\Omega$ is contained in some maximal set $\tilde{\Omega}(\Omega)$ , we may convert any positive state expansion over ${\cal{V}}$ into a positive state expansion over ${\cal{V}}_{M}$ ,

[TABLE]

If the expansion coefficients on the l.h.s. are positive, then so are those on the r.h.s. $\Box$

Appendix B Proof of Lemma 7

Proof of Lemma 7. Statement (A): $(\Omega,\gamma+[a,\cdot])\in{\cal{V}}$ , $\forall a\in\Omega$ . The set $\Omega$ does not change, and we only need to check the properties in Def. 4 that concern the function update, i.e., Eqs. (10), (3).

Assume that $\gamma:\Omega\longrightarrow\mathbb{Z}_{2}$ satisfies $d\gamma=\beta$ on $\Omega$ , i.e. $d\gamma(f)=\beta(f)$ for all faces $f\in F(\Omega)$ . Consider any such face, with its boundary $\partial f$ consisting of the edges $c$ , $d$ and $c+d$ . By definition of $F(\Omega)$ it holds that $c,d,c+d\in\Omega$ . Then, with all addition mod 2,

[TABLE]

Thus, $\gamma+[a,\cdot]$ satisfies Eq. (10).

Furthermore, assume that $\gamma$ satisfies Eq. (3). Then, $\left(\gamma+[a,\cdot]\right)(0)=\gamma(0)+[a,0]=\gamma(0)$ . Hence, $\gamma+[a,\cdot]$ satisfies Eq. (3).
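The key step in Statement (A) is that the update $\gamma\mapsto\gamma+[a,\cdot]$ does not change $d\gamma$, because the symplectic form is linear in its second argument. The following sketch checks this exhaustively for two qubits. It assumes the convention $d\gamma(c,d)=\gamma(c)+\gamma(d)+\gamma(c+d)$ mod 2 and the ordering $a=(a_{X},a_{Z})$ for vectors in $\mathbb{Z}_{2}^{2n}$; the function names are hypothetical.

```python
from itertools import product

def symplectic(a, b, n):
    """Symplectic form [a, b] on Z_2^{2n}, with a = (a_X, a_Z)."""
    ax, az, bx, bz = a[:n], a[n:], b[:n], b[n:]
    return (sum(x * z for x, z in zip(ax, bz))
            + sum(z * x for z, x in zip(az, bx))) % 2

def add(a, b):
    """Component-wise addition mod 2."""
    return tuple((x + y) % 2 for x, y in zip(a, b))

def coboundary_shift(a, c, d, n):
    """Change in d(gamma)(c, d) caused by gamma -> gamma + [a, .]."""
    return (symplectic(a, c, n) + symplectic(a, d, n)
            + symplectic(a, add(c, d), n)) % 2

n = 2
points = list(product((0, 1), repeat=2 * n))

# The shift vanishes for every a and every pair c, d (commuting or not).
shift_is_zero = all(coboundary_shift(a, c, d, n) == 0
                    for a in points for c in points for d in points)
```

Since the shift is zero for all triples, it is in particular zero on every face of $F(\Omega)$, which is all that Eq. (10) requires.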

Statement (B): $(\Omega\times a,\gamma\times s_{a})\in{\cal{V}}$ , $\forall a\not\in\Omega$ and $s_{a}\in\mathbb{Z}_{2}$ . There are four items to check in Def. 4, namely (I) $\Omega\times a$ is closed under inference, (II) $\Omega\times a$ is non-contextual, (III) $\gamma\times s_{a}$ satisfies Eq. (10), and (IV) $\gamma\times s_{a}$ satisfies Eq. (3).

(I): Consider $c,d\in\Omega\times a$ , with $[c,d]=0$ , and denote $c^{\prime}=c+a$ , $d^{\prime}=d+a$ . There are three sub-cases. (i) $c,d\in\Omega_{a}$ . Then, $c+d\in\Omega_{a}$ , since $\Omega_{a}$ is closed under inference by Lemma 6. Thus, $c+d\in\Omega\times a$ .

(ii) $c\in\Omega_{a}$ , $d\not\in\Omega_{a}$ . By construction of $\Omega\times a$ , $d^{\prime}\in\Omega_{a}$ . Thus, $c+d=c+(d^{\prime}+a)=(c+d^{\prime})+a$ . Now, since $[c,d]=0$ by assumption and $[c,a]=0$ ( $c\in\Omega_{a})$ it follows that $[c,d^{\prime}]=0$ . Since $\Omega_{a}$ is closed by Lemma 6, it holds that $c+d^{\prime}\in\Omega_{a}$ . By construction of $\Omega\times a$ , $c+d=(c+d^{\prime})+a\in\Omega\times a$ .

(iii) $c,d\not\in\Omega_{a}$ . By construction of $\Omega\times a$ , $c^{\prime},d^{\prime}\in\Omega_{a}$ . Thus, $c+d=(c^{\prime}+a)+(d^{\prime}+a)=c^{\prime}+d^{\prime}$ , and further $[c^{\prime},d^{\prime}]=0$ . Since $\Omega_{a}$ is closed under inference by Lemma 6, $c^{\prime}+d^{\prime}=c+d\in\Omega_{a}$ . Thus, $c+d\in\Omega\times a$ .

Thus in all three cases, $c,d\in\Omega\times a$ , with $[c,d]=0$ , implies $c+d\in\Omega\times a$ . Hence, $\Omega\times a$ is closed under inference.
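As a concrete illustration of item (I), the sketch below builds $\Omega\times a$ for a small two-qubit example and checks closure under inference by brute force. The definitions used here, $\Omega_{a}=\{b\in\Omega:[a,b]=0\}$ and $\Omega\times a=\Omega_{a}\cup\{a+b:b\in\Omega_{a}\}$, are assumptions read off from the case analysis above rather than quoted from the main text, and the helper names are hypothetical.

```python
def symplectic(a, b, n):
    """Symplectic form [a, b] on Z_2^{2n}, with a = (a_X, a_Z)."""
    ax, az, bx, bz = a[:n], a[n:], b[:n], b[n:]
    return (sum(x * z for x, z in zip(ax, bz))
            + sum(z * x for z, x in zip(az, bx))) % 2

def add(a, b):
    """Component-wise addition mod 2."""
    return tuple((x + y) % 2 for x, y in zip(a, b))

def times(omega, a, n):
    """Assumed update: Omega x a = Omega_a union (a + Omega_a)."""
    omega_a = {b for b in omega if symplectic(a, b, n) == 0}
    return omega_a | {add(a, b) for b in omega_a}

def closed_under_inference(omega, n):
    """c, d in omega with [c, d] = 0 must imply c + d in omega."""
    return all(add(c, d) in omega
               for c in omega for d in omega
               if symplectic(c, d, n) == 0)

n = 2  # vectors are ordered (x1, x2, z1, z2)
zero, X1, X2 = (0, 0, 0, 0), (1, 0, 0, 0), (0, 1, 0, 0)
Z1, Y1, Z2 = (0, 0, 1, 0), (1, 0, 1, 0), (0, 0, 0, 1)

# Omega = <X1, X2> u <Z1, X2> u <Y1, X2>: three isotropic subspaces sharing {0, X2}.
omega = {zero, X2, X1, add(X1, X2), Z1, add(Z1, X2), Y1, add(Y1, X2)}

updated = times(omega, Z2, n)  # Z2 is not an element of omega
```

Here $\Omega_{a}=\{0,X_{1},Y_{1},Z_{1}\}$ for $a=Z_{2}$, so the updated set consists of these four elements and their translates by $Z_{2}$.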

(III): Assume that $d\gamma=\beta$ on $\Omega$, and consider a triple of edges $c,d,c+d\in\Omega\times a$ with $[c,d]=0$. Then either (i) all three or (ii) exactly one of these edges lie in the component $\Omega_{a}$.

(i) $c,d,c+d\in\Omega_{a}$ . Since $\Omega_{a}\subset\Omega$ and with Eq. (25a), it holds that $d(\gamma\times s_{a})(c,d)=d\gamma(c,d)=\beta(c,d)$ .

(ii) W.l.o.g. assume that $c\in\Omega_{a}$ and $d,c+d\not\in\Omega_{a}$ , and denote $c^{\prime}=c+a$ , $d^{\prime}=d+a$ as before. Then, for the face $f=(c,d)$ with boundary $\partial f$ consisting of the edges $c$ , $d$ and $c+d$ ,

[TABLE]

Therein, in the second line we have used Eq. (25), in the third line Eq. (10), in the fourth line Eq. (28), and in the fifth line $d\beta(a,d,c)=0$, cf. Eqs. (7) and (11).

(II): Per Def. 3, $\Omega\times a$ is non-contextual if there is a function $\tau:\Omega\times a\longrightarrow\mathbb{Z}_{2}$ that satisfies $d\tau=\beta$. We have explicitly constructed such a function in (III) above, namely $\tau:=\gamma\times s_{a}$.

(IV): Assume that $\gamma:\Omega\longrightarrow\mathbb{Z}_{2}$ satisfies Eq. (3). Since $0\in\Omega_{a}$ for all cnc sets $\Omega$ , with Eq. (25a) it follows that $\gamma\times s_{a}(0)=\gamma(0)$ , and hence $\gamma\times s_{a}$ also satisfies Eq. (3). $\Box$

Appendix C Proof of Lemma 9

Proof of Lemma 9. Recall from Lemma 3 that each set $\Omega$ can be written in the form $\Omega=\bigcup_{k=1}^{\xi(\Omega)}I_{k}$ , where each $I_{k}$ is an isotropic subspace, $I_{k}=\langle a_{k},\tilde{I}\rangle$ , $a_{k}\in E$ . Therefore, for all $(\Omega,\gamma)\in{\cal{V}}$ , it holds that

[TABLE]

Therein, the phase point operators appearing on the r.h.s. are all of the type $m=0$ , i.e., they correspond to stabilizer states. The Wigner function $\delta_{(\Omega,\gamma)}$ representing the operator $A_{\Omega}^{\gamma}$ can thus be expanded as

[TABLE]

Denote by $\|\cdot\|_{1}$ the 1-norm of the expansion in terms of phase point operators $A_{\Omega}^{\gamma}$ , and by $\|\cdot\|_{1,S}$ the 1-norm of the expansion in terms of (density matrices of) stabilizer states. With the last equation, the triangle inequality, $\|\delta_{(I_{k},\gamma|_{I_{k}})}\|_{1,S}=\|\delta_{(\tilde{I},\gamma|_{\tilde{I}})}\|_{1,S}=1$ , and $\xi(\Omega)\leq 2n+1$ for all cnc sets $\Omega$ (cf. Lemma 3), it follows that

[TABLE]

Now, for any given state $\rho$ consider the optimal representation $W_{\rho}$ , i.e., the one with minimal norm $\|W_{\rho}\|_{1}$ . Then,

[TABLE]

Therein, in the first line we have an inequality because the representation $W_{\rho}$ of $\rho$ is optimized for $\|W_{\rho}\|_{1}$ , not necessarily for $\|W_{\rho}\|_{1,S}$ . The third line follows by the triangle inequality and Eq. (52), and the fifth line holds as an equality because $W_{\rho}$ , per assumption, was chosen to minimize $\|W_{\rho}\|_{1}$ .

This proves the right half of Eq. (38). The left half, $\mathfrak{R}(\rho)\leq\mathfrak{R}_{S}(\rho)$ , follows from the fact that all stabilizer states correspond to phase point operators of type $m=0$ . Hence, an expansion in terms of stabilizer states induces an expansion in terms of phase point operators $A_{\Omega}^{\gamma}$ , with the same non-zero coefficients. $\Box$

Appendix D Computing $W$ -representations of many copies of magic states

Here we describe how to construct valid quasiprobabilities $W_{\mu^{\otimes n}}$ for $n$ copies of a magic state $\mu$, at bounded computational cost. As with the robustness of magic [RoM], we merge expansions for small numbers of magic states into valid expansions for larger numbers of copies.

Denote by $\Omega_{n}^{m}$ cnc sets $\Omega$ with parameters $n$ , $m\leq n$ , and choose the phases $\phi$ in Eq. (1) such that

[TABLE]

Here we identified $a$ and $b$ as elements of $\mathbb{Z}_{d}^{2(n_{1}+n_{2})}$ by writing $((a_{X},0),(a_{Z},0))$ and $((0,b_{X}),(0,b_{Z}))$ , respectively. We then have the following result.

Lemma 11

Let $\Omega_{n_{1}}^{m_{1}}$ and $\Omega_{n_{2}}^{0}$ be two cnc sets with parameters $n_{1}$, $m_{1}\leq n_{1}$, and $n_{2}$, $m_{2}=0$, respectively. Then, $A_{\Omega_{n_{1}}^{m_{1}}\oplus\Omega_{n_{2}}^{0}}^{\gamma}:=A_{\Omega_{n_{1}}^{m_{1}}}^{\gamma_{1}}\otimes A_{\Omega_{n_{2}}^{0}}^{\gamma_{2}}$, with the function $\gamma:\Omega_{n_{1}}^{m_{1}}\oplus\Omega_{n_{2}}^{0}\longrightarrow\mathbb{Z}_{2}$ defined by

[TABLE]

for all $a_{1}\in\Omega_{n_{1}}^{m_{1}}$, $a_{2}\in\Omega_{n_{2}}^{0}$, is a valid phase point operator on $n_{1}+n_{2}$ qubits.

Proof of Lemma 11. We need to verify the properties of Def. 4, namely (a) that $\Omega_{n_{1}}^{m_{1}}\oplus\Omega_{n_{2}}^{0}$ is cnc, and (b) that the function $\gamma$ defined in Lemma 11 satisfies Eq. (10), i.e., $d\gamma=\beta$ .

Regarding (a), with Eq. (15), $\Omega_{n_{1}}^{m_{1}}=\bigcup_{k=1}^{m_{1}}\langle a_{k},\tilde{I}\rangle$ , and $\Omega_{n_{2}}^{0}=I$ , with $\tilde{I}$ , $I$ isotropic subspaces. Then,

[TABLE]

and $\tilde{I}\oplus I$ is also an isotropic subspace. Hence the set $\Omega_{n_{1}}^{m_{1}}\oplus\Omega_{n_{2}}^{0}$ is cnc by Lemma 3.

Regarding (b), we need to show that for all $a,b\in\Omega_{n_{1}}^{m_{1}}\oplus\Omega_{n_{2}}^{0}$ with $[a,b]=0$ it holds that $\gamma(a+b)+\gamma(a)+\gamma(b)=\beta(a,b)$ . To this end, for any given commuting pair $a$ , $b$ , we split

[TABLE]

with $a_{1},b_{1}\in\Omega_{n_{1}}^{m_{1}}$ , $a_{2},b_{2}\in\Omega_{n_{2}}^{0}$ . The decompositions are unique. Since $a$ and $b$ commute, $[a_{1},b_{1}]+[a_{2},b_{2}]=0$ . Furthermore, $a_{2}$ commutes with $b_{2}$ , since $\Omega_{n_{2}}^{0}$ is an isotropic subspace. Thus,

[TABLE]

Further, Eq. (53) implies that

[TABLE]

Now we rewrite

[TABLE]

Therein, in the second line we have used the definition of $\gamma$ in Lemma 11, in the third line Eq. (55), and in the fourth line the definition of $\gamma$ again. In the fifth line we have used Eq. (11), Eq. (8), and Eq. (56) on $\beta(a_{1}+b_{1},a_{2}+b_{2})$. In the last line we used Eq. (7) ($d\beta=0$). $\Box$

Denote by $W^{(0)}_{\sigma}$ an expansion of the state $\sigma$ according to Eq. (4) that contains only phase point operators with parameter $m=0$, i.e., an expansion into stabilizer states, and let $W_{\rho}$ be a valid expansion of $\rho$ according to Eq. (4). Then, it follows from Lemma 11 that

[TABLE]

is a valid expansion of $\rho\otimes\sigma$ .

Let $k$ be the largest integer for which decompositions $W_{\mu^{\otimes k}}$ and $W_{\mu^{\otimes k}}^{(0)}$ are obtainable. Then, with Eq. (57), $W_{\mu^{\otimes n}}=W_{\mu^{\otimes k}}\otimes W_{\mu^{\otimes (n-k)}}^{(0)}$, and the second factor may be further decomposed as $W_{\mu^{\otimes (n-k)}}^{(0)}=\left(W_{\mu^{\otimes k}}^{(0)}\right)^{\otimes(n/k-1)}$, if $k$ divides $n$. Thus, we arrive at an explicit decomposition for $\mu^{\otimes n}$, with 1-norm

[TABLE]

Thus the reduction to blocks of $k$ magic states familiar from the stabilizer case [RoM] can be applied in the present setting as well. By Lemma 9, the resulting 1-norm is lower by a constant factor than that of the corresponding expansion into stabilizer states.
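The 1-norm of such a block decomposition is easy to evaluate, since the coefficients of a tensor-product expansion are products of the factors' coefficients and the 1-norm is therefore multiplicative. The sketch below assumes exactly this product structure, with one general block $W_{\mu^{\otimes k}}$ and $n/k-1$ stabilizer-only blocks $W_{\mu^{\otimes k}}^{(0)}$. The function name and the numerical values are hypothetical and serve only as illustration.

```python
def block_one_norm(norm_general, norm_stabilizer, n, k):
    """1-norm of W_{mu^k} (x) (W^{(0)}_{mu^k})^{(x)(n/k - 1)}.

    norm_general    -- 1-norm of the expansion W_{mu^{(x)k}}
    norm_stabilizer -- 1-norm of the stabilizer-only expansion W^{(0)}_{mu^{(x)k}}
    """
    if k <= 0 or n % k != 0:
        raise ValueError("k must be a positive divisor of n")
    # One general block, and n/k - 1 blocks expanded into stabilizer states.
    return norm_general * norm_stabilizer ** (n // k - 1)

# Hypothetical block norms, chosen only to show the scaling with n.
example = block_one_norm(norm_general=1.2, norm_stabilizer=1.5, n=6, k=2)
single_block = block_one_norm(norm_general=1.2, norm_stabilizer=1.5, n=2, k=2)
```

For fixed $k$ the norm grows exponentially in $n$, with a base set by the stabilizer-only block; the general block contributes only a single prefactor.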

Bibliography

(1) P. Ehrenfest, Bemerkung über die angenäherte Gültigkeit der klassischen Mechanik innerhalb der Quantenmechanik, Z. Physik 45, 455 (1927).
(2) A. Einstein, B. Podolsky, and N. Rosen, Can Quantum-Mechanical Description of Physical Reality be Considered Complete?, Phys. Rev. 47, 777 (1935).
(3) E. Schrödinger, Die gegenwärtige Situation in der Quantenmechanik, Naturwissenschaften 23, 807 (1935).
(4) R. Feynman, Simulating Physics with Computers, Int. J. Theor. Phys. 21, 467 (1982).
(5) E. Bernstein and U. Vazirani, Quantum complexity theory, SIAM J. Comput. 26(5), 1411–1473 (1997). First appeared in ACM STOC 1993.
(6) D. Deutsch, Quantum Theory, the Church-Turing Principle and the Universal Quantum Computer, Proc. Roy. Soc. London A 400, 97 (1985).
(7) S. Bravyi and A. Kitaev, Universal Quantum Computation with Ideal Clifford Gates and Noisy Ancillas, Phys. Rev. A 71, 022316 (2005).
(8) E. Wigner, On the Quantum Correction For Thermodynamic Equilibrium, Phys. Rev. 40, 749 (1932).
