Universality of EPR pairs in Entanglement-Assisted Communication   Complexity, and the Communication Cost of State Conversion

Matthew Coudron; Aram W. Harrow

arXiv:1902.07699·quant-ph·July 17, 2019·CCC

Universality of EPR pairs in Entanglement-Assisted Communication Complexity, and the Communication Cost of State Conversion

Matthew Coudron, Aram W. Harrow

PDF

TL;DR

This paper demonstrates that EPR pairs are universally sufficient for entanglement-assisted quantum communication, and provides bounds on the cost of converting one bipartite state into another using a new metric based on optimal transport.

Contribution

It proves the universality of EPR pairs in entanglement-assisted communication complexity and introduces the $ ext{EMD}$ metric to bound state conversion costs.

Findings

01

Any entanglement-assisted protocol can be approximated using only EPR pairs.

02

The quantum communication cost for state conversion is bounded by the $ ext{EMD}$ metric.

03

Lower bounds on state conversion costs are established via smoothed $ ext{EMD}$.

Abstract

Entanglement assistance is known to reduce the quantum communication complexity of evaluating functions with distributed inputs. But does the type of entanglement matter, or are EPR pairs always sufficient? This is a natural question because in several other settings maximally entangled states are known to be less useful as a resource than some partially entangled state. These include non-local games, tasks with quantum communication between players and referee, and simulating bipartite unitaries or communication channels. By contrast, we prove that the bounded-error entanglement-assisted quantum communication complexity of a function cannot be improved by more than a constant factor by replacing maximally entangled states with arbitrary entangled states. In particular, we show that every quantum communication protocol using $Q$ qubits of communication and arbitrary shared entanglement…

Equations173

⟨ ϕ ∣^{A B} U_{P} ∣ ψ ⟩^{A B} \leq 1 - \frac{1}{4} ϵ^{2} + 24 \cdot 2^{- \frac{1}{2} (d_{\infty}^{ϵ} (∣ ψ ⟩, ∣ ϕ ⟩) - 3 Q)}

⟨ ϕ ∣^{A B} U_{P} ∣ ψ ⟩^{A B} \leq 1 - \frac{1}{4} ϵ^{2} + 24 \cdot 2^{- \frac{1}{2} (d_{\infty}^{ϵ} (∣ ψ ⟩, ∣ ϕ ⟩) - 3 Q)}

∣ ⟨ ψ ∣ U ∣ ν ⟩ ∣ \leq 2^{\frac{3}{2} Q} \cdot r k_{S c hmi d t} (∣ ψ ⟩) λ_{m a x} ν_{m a x}

∣ ⟨ ψ ∣ U ∣ ν ⟩ ∣ \leq 2^{\frac{3}{2} Q} \cdot r k_{S c hmi d t} (∣ ψ ⟩) λ_{m a x} ν_{m a x}

∣ ⟨ ψ ∣ ν ⟩ ∣ \leq r k_{S c hmi d t} (∣ ψ ⟩) λ_{m a x} ν_{m a x}

∣ ⟨ ψ ∣ ν ⟩ ∣ \leq r k_{S c hmi d t} (∣ ψ ⟩) λ_{m a x} ν_{m a x}

⟨ ψ ∣ ν ⟩

⟨ ψ ∣ ν ⟩

= i = 0 \sum r - 1 j \sum λ_{i} ν_{j} ⟨ i_{A} ∣ j_{A} ⟩ \otimes ⟨ j_{B}^{*} ∣ i_{B}^{*} ⟩ = i = 0 \sum r - 1 λ_{i} ⟨ i_{A} ∣ (j \sum ν_{j} ∣ j ⟩_{A} \otimes ⟨ j ∣_{B}) ∣ i_{B}^{*} ⟩

= i = 0 \sum r - 1 λ_{i} ⟨ i_{A} ∣ M_{ν} ∣ i_{B}^{*} ⟩

∣ ⟨ ψ ∣ ν ⟩ ∣

∣ ⟨ ψ ∣ ν ⟩ ∣

\leq r λ_{m a x} ν_{m a x} = r k_{S c hmi d t} (∣ ψ ⟩) λ_{m a x} ν_{m a x}

d_{M K} (μ, ν) = in f {\int_{R \times R} c (x, y) d γ (x, y) ∣ γ \in Γ (μ, ν)} .

d_{M K} (μ, ν) = in f {\int_{R \times R} c (x, y) d γ (x, y) ∣ γ \in Γ (μ, ν)} .

d_{M K} (μ, ν) \equiv γ \in Γ (μ, ν) in f {\int_{R \times R} c (x, y) d γ (x, y)} = \int_{0}^{1} c (F_{μ}^{- 1} (s), F_{ν}^{- 1} (s)) d s

d_{M K} (μ, ν) \equiv γ \in Γ (μ, ν) in f {\int_{R \times R} c (x, y) d γ (x, y)} = \int_{0}^{1} c (F_{μ}^{- 1} (s), F_{ν}^{- 1} (s)) d s

d_{\infty} (∣ ψ ⟩, ∣ ϕ ⟩) \equiv q \in [0, 1] max ∣ F_{p_{ψ}}^{- 1} (q) - F_{p_{ϕ}}^{- 1} (q) ∣

d_{\infty} (∣ ψ ⟩, ∣ ϕ ⟩) \equiv q \in [0, 1] max ∣ F_{p_{ψ}}^{- 1} (q) - F_{p_{ϕ}}^{- 1} (q) ∣

d_{\infty}^{ϵ} (∣ ψ ⟩, ∣ ϕ ⟩) \equiv q \in [0, 1] max r \in [q - ϵ, q + ϵ] min ∣ F_{p_{ψ}}^{- 1} (q) - F_{p_{ϕ}}^{- 1} (r) ∣

d_{\infty}^{ϵ} (∣ ψ ⟩, ∣ ϕ ⟩) \equiv q \in [0, 1] max r \in [q - ϵ, q + ϵ] min ∣ F_{p_{ψ}}^{- 1} (q) - F_{p_{ϕ}}^{- 1} (r) ∣

⟨ ϕ ∣^{A B} U_{P} ∣ ψ ⟩^{A B} \leq 1 - \frac{1}{4} ϵ^{2} + 24 \cdot 2^{- \frac{1}{2} (d_{\infty}^{ϵ} (∣ ψ ⟩, ∣ ϕ ⟩) - 3 Q)}

⟨ ϕ ∣^{A B} U_{P} ∣ ψ ⟩^{A B} \leq 1 - \frac{1}{4} ϵ^{2} + 24 \cdot 2^{- \frac{1}{2} (d_{\infty}^{ϵ} (∣ ψ ⟩, ∣ ϕ ⟩) - 3 Q)}

r \in [p - ϵ, p + ϵ] min ∣ F_{p_{ψ}}^{- 1} (p) - F_{p_{ϕ}}^{- 1} (r) ∣ = d

r \in [p - ϵ, p + ϵ] min ∣ F_{p_{ψ}}^{- 1} (p) - F_{p_{ϕ}}^{- 1} (r) ∣ = d

ψ^{1} ⟩ \equiv U_{P} ∣ ψ ⟩_{\leq x}, ψ^{3} ⟩ \equiv ϕ^{3} ⟩ ⟨ ϕ^{3} U_{P} ∣ ψ ⟩_{> x}, ϕ^{3} ⟩ \equiv ∣ ϕ ⟩_{\geq x + d}, ϕ^{1} ⟩ \equiv ψ^{1} ⟩ ⟨ ψ^{1} ∣ ϕ ⟩_{< x + d}, ψ^{2} ⟩ \equiv (I - ϕ^{3} ⟩ ⟨ ϕ^{3}) U_{P} ∣ ψ ⟩_{> x} ϕ^{2} ⟩ \equiv (I - ψ^{1} ⟩ ⟨ ψ^{1}) ∣ ϕ ⟩_{< x + d}

ψ^{1} ⟩ \equiv U_{P} ∣ ψ ⟩_{\leq x}, ψ^{3} ⟩ \equiv ϕ^{3} ⟩ ⟨ ϕ^{3} U_{P} ∣ ψ ⟩_{> x}, ϕ^{3} ⟩ \equiv ∣ ϕ ⟩_{\geq x + d}, ϕ^{1} ⟩ \equiv ψ^{1} ⟩ ⟨ ψ^{1} ∣ ϕ ⟩_{< x + d}, ψ^{2} ⟩ \equiv (I - ϕ^{3} ⟩ ⟨ ϕ^{3}) U_{P} ∣ ψ ⟩_{> x} ϕ^{2} ⟩ \equiv (I - ψ^{1} ⟩ ⟨ ψ^{1}) ∣ ϕ ⟩_{< x + d}

U_{P} ∣ ψ ⟩ = ψ^{1} ⟩ + ψ^{2} ⟩ + ψ^{3} ⟩

U_{P} ∣ ψ ⟩ = ψ^{1} ⟩ + ψ^{2} ⟩ + ψ^{3} ⟩

∣ ϕ ⟩ = ϕ^{1} ⟩ + ϕ^{2} ⟩ + ϕ^{3} ⟩

1

1

1

∣ ⟨ ϕ ∣ P (ψ) ⟩ ∣

∣ ⟨ ϕ ∣ P (ψ) ⟩ ∣

\leq ⟨ ϕ^{1} ψ^{1} ⟩ + ⟨ ϕ^{2} ψ^{2} ⟩ + ⟨ ϕ^{3} ψ^{3} ⟩ + 6 \cdot h (Q, d)

\leq ϕ^{1} ⟩ ψ^{1} ⟩ + ϕ^{2} ⟩ ψ^{2} ⟩ + ϕ^{3} ⟩ ψ^{3} ⟩ + 6 \cdot h (Q, d)

ψ^{1} ⟩ = U_{P} ∣ ψ ⟩_{\leq x} = ∣ ψ ⟩_{\leq x} = p

ψ^{1} ⟩ = U_{P} ∣ ψ ⟩_{\leq x} = ∣ ψ ⟩_{\leq x} = p

ϕ^{3} ⟩ = ∣ ϕ ⟩_{\geq x + d} \geq 1 - p + ϵ

∣ ⟨ ϕ ∣ P (ψ) ⟩ ∣

∣ ⟨ ϕ ∣ P (ψ) ⟩ ∣

x_{1} y_{1} + x_{2} y_{2} + x_{3} y_{3} \leq cos (α - β) .

x_{1} y_{1} + x_{2} y_{2} + x_{3} y_{3} \leq cos (α - β) .

∣ ⟨ ϕ ∣ P (ψ) ⟩ ∣ \leq p - ϵ p + 1 - p 1 - p + ϵ + 6 \cdot h (Q, d) .

∣ ⟨ ϕ ∣ P (ψ) ⟩ ∣ \leq p - ϵ p + 1 - p 1 - p + ϵ + 6 \cdot h (Q, d) .

∣ ⟨ ϕ ∣ P (ψ) ⟩ ∣ \leq 1 - \frac{1}{4} ϵ^{2} + 6 \cdot h (Q, d) .

∣ ⟨ ϕ ∣ P (ψ) ⟩ ∣ \leq 1 - \frac{1}{4} ϵ^{2} + 6 \cdot h (Q, d) .

i = - 1 \sum ⌈ x ⌉ ∥ ∣ ρ ⟩_{i} ∥^{2} = ∥ ∣ ρ ⟩ ∥^{2} \leq 1

i = - 1 \sum ⌈ x ⌉ ∥ ∣ ρ ⟩_{i} ∥^{2} = ∥ ∣ ρ ⟩ ∥^{2} \leq 1

∣ ⟨ ϕ_{\geq x + d} ∣ U_{P} ∣ ρ ⟩_{i} ∣ \leq 2^{\frac{3}{2} Q} r k_{S c hmi d t} (∣ ρ ⟩_{i}) 2^{- (x + d)} 2^{- i} \leq 2^{\frac{3}{2} Q} \cdot 2^{i + 1} ∥ ∣ ρ ⟩_{i} ∥^{2} \cdot 2^{- (x + d)} 2^{- i}

∣ ⟨ ϕ_{\geq x + d} ∣ U_{P} ∣ ρ ⟩_{i} ∣ \leq 2^{\frac{3}{2} Q} r k_{S c hmi d t} (∣ ρ ⟩_{i}) 2^{- (x + d)} 2^{- i} \leq 2^{\frac{3}{2} Q} \cdot 2^{i + 1} ∥ ∣ ρ ⟩_{i} ∥^{2} \cdot 2^{- (x + d)} 2^{- i}

= 2 \cdot 2^{\frac{3}{2} Q} ∥ ∣ ρ ⟩_{i} ∥^{2} 2^{i - x - d} \leq 2 \cdot 2^{\frac{3}{2} Q} ∥ ∣ ρ ⟩_{i} ∥^{2} \cdot 2 \cdot 2^{- d /2} = 4 \cdot 2^{\frac{3 Q - d}{2}} ∥ ∣ ρ ⟩_{i} ∥^{2},

⟨ ϕ^{3} ψ^{1} ⟩ = ⟨ ϕ_{\geq x + d} ∣ U_{P} ∣ ψ ⟩_{\leq x} = i = - 1 \sum ⌈ x ⌉ ⟨ ϕ_{\geq x + d} ∣ U_{P} ∣ ρ ⟩_{i} \leq i = - 1 \sum ⌈ x ⌉ ∣ ⟨ ϕ_{\geq x + d} ∣ U_{P} ∣ ρ ⟩_{i} ∣

⟨ ϕ^{3} ψ^{1} ⟩ = ⟨ ϕ_{\geq x + d} ∣ U_{P} ∣ ψ ⟩_{\leq x} = i = - 1 \sum ⌈ x ⌉ ⟨ ϕ_{\geq x + d} ∣ U_{P} ∣ ρ ⟩_{i} \leq i = - 1 \sum ⌈ x ⌉ ∣ ⟨ ϕ_{\geq x + d} ∣ U_{P} ∣ ρ ⟩_{i} ∣

\leq 4 \cdot 2^{\frac{3 Q - d}{2}} i = - 1 \sum ⌈ x ⌉ ∥ ∣ ρ ⟩_{i} ∥^{2} = 4 \cdot 2^{\frac{3 Q - d}{2}} ∣ ψ ⟩_{\leq x}^{2} \leq 4 \cdot 2^{\frac{3 Q - d}{2}} = h (Q, d),

⟨ ψ^{3} ψ^{1} ⟩ = ⟨ ψ_{> x} ∣ U_{P}^{†} ϕ^{3} ⟩ ⟨ ϕ^{3} ψ^{1} ⟩ = ⟨ ψ_{> x} ∣ U_{P}^{†} ϕ^{3} ⟩ ⟨ ϕ^{3} ψ^{1} ⟩ \leq ⟨ ϕ^{3} ψ^{1} ⟩ \leq h (Q, d),

⟨ ψ^{3} ψ^{1} ⟩ = ⟨ ψ_{> x} ∣ U_{P}^{†} ϕ^{3} ⟩ ⟨ ϕ^{3} ψ^{1} ⟩ = ⟨ ψ_{> x} ∣ U_{P}^{†} ϕ^{3} ⟩ ⟨ ϕ^{3} ψ^{1} ⟩ \leq ⟨ ϕ^{3} ψ^{1} ⟩ \leq h (Q, d),

⟨ ψ^{2} ψ^{1} ⟩ = ⟨ ψ_{> x} ∣ U_{P}^{†} (I - ϕ^{3} ⟩ ⟨ ϕ^{3}) ψ^{1} ⟩ \leq ⟨ ψ_{> x} ∣ U_{P}^{†} ψ^{1} ⟩ + ⟨ ψ_{> x} ∣ U_{P}^{†} ϕ^{3} ⟩ ⟨ ϕ^{3} ψ^{1} ⟩

⟨ ψ^{2} ψ^{1} ⟩ = ⟨ ψ_{> x} ∣ U_{P}^{†} (I - ϕ^{3} ⟩ ⟨ ϕ^{3}) ψ^{1} ⟩ \leq ⟨ ψ_{> x} ∣ U_{P}^{†} ψ^{1} ⟩ + ⟨ ψ_{> x} ∣ U_{P}^{†} ϕ^{3} ⟩ ⟨ ϕ^{3} ψ^{1} ⟩

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Universality of EPR pairs in Entanglement-Assisted

Communication Complexity, and the Communication Cost of State Conversion

Matthew Coudron Aram W. Harrow Institute for Quantum Computing, University of Waterloo [email protected]. Center for Theoretical Physics, MIT. [email protected]

Abstract

In this work we consider the role of entanglement assistance in quantum communication protocols, focusing, in particular, on whether the type of shared entangled state can affect the quantum communication complexity of a function. This question is interesting because in some other settings in quantum information, such as non-local games, or tasks that involve quantum communication between players and referee, or simulating bipartite unitaries or communication channels, maximally entangled states are known to be less useful as a resource than some partially entangled states. By contrast, we prove that the bounded-error entanglement-assisted quantum communication complexity of a partial or total function cannot be improved by more than a constant factor by replacing maximally entangled states with arbitrary entangled states. In particular, we show that every quantum communication protocol using $Q$ qubits of communication and arbitrary shared entanglement can be $\epsilon$ -approximated by a protocol using $O(Q/\epsilon+\log(1/\epsilon)/\epsilon)$ qubits of communication and only EPR pairs as shared entanglement. This conclusion is opposite of the common wisdom in the study of non-local games, where it has been shown, for example, that the I3322 inequality has a non-local strategy using a non-maximally entangled state, which surpasses the winning probability achievable by any strategy using a maximally entangled state of any dimension [15]. We leave open the question of how much the use of a shared maximally entangled state can reduce the quantum communication complexity of a function.

Our second result concerns an old question in quantum information theory: How much quantum communication is required to approximately convert one pure bipartite entangled state into another? We give simple and efficiently computable upper and lower bounds. Given two bipartite states $\left|\chi\right\rangle$ and $\left|\upsilon\right\rangle$ , we define a natural quantity, $d_{\infty}(\left|\chi\right\rangle,\left|\upsilon\right\rangle)$ , which we call the $\ell_{\infty}$ Earth Mover’s distance, and we show that the communication cost of converting between $\left|\chi\right\rangle$ and $\left|\upsilon\right\rangle$ is upper bounded, up to a constant multiplicative factor, by $d_{\infty}(\left|\chi\right\rangle,\left|\upsilon\right\rangle)$ . Here $d_{\infty}(\left|\chi\right\rangle,\left|\upsilon\right\rangle)$ may be informally described as the minimum over all transports between the log of the Schmidt coefficients of $\left|\chi\right\rangle$ and those of $\left|\upsilon\right\rangle$ , of the maximum distance that any amount of mass must be moved in that transport. A precise definition is given in the introduction. Furthermore, we prove a complementary lower bound on the cost of state conversion by the $\epsilon$ -Smoothed $\ell_{\infty}$ -Earth Mover’s Distance, which is a natural smoothing of the $\ell_{\infty}$ -Earth Mover’s Distance that we will define via a connection with optimal transport theory.

1 Introduction

1.1 Entanglement-assisted communication complexity

Imagine that two cooperating players, Alice and Bob, are given the task of evaluating a function $f(x,y)$ ( $x,y\in\{0,1\}^{n}$ ), where $x$ is known only to Alice and $y$ is known only to Bob. The communication complexity of $f$ is the number of bits that Alice and Bob need to exchange in order to compute $f$ . Popular variations of this framework include allowing a small probability of error, allowing qubits to be communicated instead of classical bits, and allowing extra resources such as shared randomness or entanglement.

In classical communication complexity, Newman’s theorem states that arbitrarily large amounts of shared randomness in a protocol can be replaced by a distribution with $O(\log(n/\epsilon))$ bits of entropy while only reducing the success probability of that protocol by $\epsilon$ . (Here $n$ is the input size of each party.) Is there a quantum analogue to this result?

In one sense the answer is “no”. Given a two-party entanglement-assisted protocol for, say, computing the value of some function, we cannot replace the shared entanglement with some different, less entangled, state, without causing large errors [9, 1]. It is an open question whether it is possible to replace a large entangled state with a less entangled one while also changing the communication protocol.

However, while it remains a challenge to characterize the dimension of shared entanglement required for optimal entanglement-assisted quantum communication protocols, in this work we show that the type of shared entanglement required by such protocols can be neatly characterized. In Theorem 1 below, we establish that the bounded-error entanglement-assisted quantum communication complexity of a partial or total function cannot be improved by more than a constant factor by replacing maximally entangled states with arbitrary entangled states. This is accomplished by constructing an explicit protocol which allows two parties, who only share maximally entangled states, to simulate any entanglement-assisted quantum communication task regardless of the shared state that that task originally required.

Theorem 1.

Consider a quantum communication protocol $\mathcal{R}$ whose goal is to compute a joint function $f(x,y)\in\{0,1\}$ . Suppose that $\mathcal{R}$ uses an arbitrary bipartite entangled state $\left|\psi\right\rangle^{AB}$ (of unbounded dimension), as well as $Q$ qubits of communication total, in either direction (for sufficiently large $Q\geq 15$ ). Then, for every $\epsilon>0$ , there exists a quantum communication protocol $\mathcal{R^{\prime}}$ which simulates $\mathcal{R}$ with error $\epsilon$ , while using only a maximally entangled state as an entangled resource (rather than $\left|\psi\right\rangle^{AB}$ or any other state), and using $O(Q/\epsilon+\log(1/\epsilon)/\epsilon)$ qubits of communication. Thus, if $\mathcal{R}$ computes $f$ with error $\epsilon^{\prime}$ it follows that $\mathcal{R^{\prime}}$ computes $f$ with error $\epsilon+\epsilon^{\prime}$ .

Theorem 1 shows that, although the role of shared entanglement in quantum communication complexity is still not well understood, the type of shared entanglement does not drastically change communication complexity. This is true regardless of input size or promise, as long as we are in the constant-error regime and some communication is allowed between players (unlike, say, the simultaneous-message-passing model). This result sets quantum communication complexity apart from settings such as channel simulation [3], nonlocal games [10, 14], unitary gate simulation [6], and communication tasks involving quantum communication between referees and players [11]. In each of those cases the ratio between the EPR-assisted costs and the (unrestricted) entanglement-assisted costs can be made arbitrarily large. This suggests that the role of shared entanglement in quantum communication complexity may be fundamentally different than in these other settings. Furthermore, the result achieved in Theorem 1 may be useful in future work attempting to further bound the role of entanglement in quantum communication complexity, as it restricts the problem to the case of shared EPR pairs, without loss of generality.

It may be worth noting that the proof of Theorem 1 is nearly oblivious to the entanglement-assisted protocol being considered in the following sense: Given a protocol $\mathcal{P}$ using $Q$ qubits of communication and a shared entangled state $\left|\psi\right\rangle$ , we can replace $\left|\psi\right\rangle$ with a “consolidated” state $\rho$ at the cost of error $\epsilon$ . Moreover, $\rho$ can be prepared from a maximally entangled state using $O(Q/\epsilon+\log(1/\epsilon)/\epsilon)$ communication. Taking $\epsilon$ constant implies that the EPR-assisted communication complexity of a function is at most $O(1)$ times the (unrestricted) entanglement-assisted communication complexity of that function. It was not necessary to modify the protocol $\mathcal{P}$ to achieve this result, except to pre-compose it with a pre-processing protocol which starts with only EPR pairs, and prepares the state $\rho$ using only $O(Q/\epsilon+\log(1/\epsilon)/\epsilon)$ communication. $\mathcal{P}$ can then be run on $\rho$ directly. Such a protocol-agnostic preprocessing should not be taken for granted, since it is known that reducing the number of EPR pairs may in some cases require more than just pre-processing [9, 1].

1.2 Communication cost of state transformations

Our second contribution, which is related at the level of techniques to Theorem 1, is to provide upper and lower bounds for an old quantity studied in quantum information theory, the communication cost of state transformation.

Suppose that $\left|\chi\right\rangle^{AB}$ and $\left|\nu\right\rangle^{AB}$ are bipartite pure quantum states, with vectors of Schmidt coefficients denoted respectively by $\chi$ and $\nu$ . In this setting it is known that $\left|\chi\right\rangle$ can be exactly converted into $\left|\nu\right\rangle$ using LOCC if and only if $\chi$ is majorized by $\nu$ [13]. But the communication cost of this transformation is known only in a few special cases. If $\left|\chi\right\rangle=\left|\chi_{0}\right\rangle^{\otimes n}$ and $\left|\nu\right\rangle=\left|\nu_{0}\right\rangle^{\otimes n}$ for some states $\left|\chi_{0}\right\rangle,\left|\nu_{0}\right\rangle$ , then this cost is $O(\sqrt{n})$ or less in some special cases (e.g. $\left|\nu_{0}\right\rangle$ is maximally entangled). More generally there is, in principle, an exact characterization of the communication cost (either LOCC, or quantum communication) of state transformation using the Schubert calculus due to Daftuar and Hayden [4], but in practice it is difficult to extract concrete bounds from their main theorem.

In this work we identify a simple and efficiently computable quantity, which we call the $\ell_{\infty}$ Earth Mover’s (or Wasserstein) Distance, which tells us approximately how much quantum communication is required to transform $\left|\chi\right\rangle$ to $\left|\nu\right\rangle$ . Given its simple form, we believe that this quantity may be a useful tool in quantum information theory.

Definition 2 ( $\ell_{\infty}$ Earth Mover’s Distance ).

Let $\left|\chi\right\rangle^{AB}=\sum_{i\in X}\sqrt{\chi_{i}}\left|i\right\rangle^{A}\otimes\left|i\right\rangle^{B}$ and $\left|\upsilon\right\rangle^{AB}=\sum_{j\in Y}\sqrt{\upsilon_{j}}\left|j\right\rangle^{A}\otimes\left|j\right\rangle^{B}$ be two states. We define $d_{\infty}(\left|\chi\right\rangle,\left|\upsilon\right\rangle)$ to be the $\ell_{\infty}$ Earth Mover’s distance between $\left|\chi\right\rangle$ and $\left|\upsilon\right\rangle$ , which is equal to the minimum $\mu\geq 0$ for which there exists a joint distribution $\omega(x,y):X\times Y\to\mathbb{R}_{\geq 0}$ such that:

•

$\sum_{j\in Y}\omega(i,j)=\chi_{i}$ * $\forall i\in X$ *

•

$\sum_{i\in X}\omega(i,j)=\upsilon_{j}$ * $\forall j\in Y$ *

•

$\omega(i,j)=0$ * ** whenever ** $|\log(\chi_{i})-\log(\upsilon_{j})|>\mu$ *

We can think of $\chi$ as corresponding to placing $\chi_{i}$ mass at position $\log(\chi_{i})$ for each $i$ , and similarly for $\upsilon$ . Then $d_{\infty}(\left|\chi\right\rangle,\left|\upsilon\right\rangle)$ is the $\ell_{\infty}$ EMD (Earth Mover’s distance) between these distributions.

In Section 4 we will show that this quantity gives an intuitive upper bound on the amount of quantum communication required to transform one bipartite shared state into another. In particular we prove the following theorem.

Theorem 3.

Let $\left|\chi\right\rangle^{AB}$ and $\left|\upsilon\right\rangle^{AB}$ be two bipartite shared states. There is a protocol $\mathcal{M}_{\chi\rightarrow\upsilon}$ which can prepare $\left|\upsilon\right\rangle$ from $\left|\chi\right\rangle$ , using only $4\lceil d_{\infty}(\left|\chi\right\rangle,\left|\upsilon\right\rangle)\rceil+8$ qubits of communication.

In Section 3 we establish a complementary lower bound, showing that a “ $\epsilon$ -smoothed” version of the $\ell_{\infty}$ Earth Mover’s Distance, denoted by $d^{\epsilon}_{\infty}(\left|\chi\right\rangle,\left|\upsilon\right\rangle)$ , gives a lower bound on the cost of state transformation. That is:

Theorem 4.

Given any two bipartite shared states $\left|\psi\right\rangle^{AB}=\sum_{i}\sqrt{\psi_{i}}\left|i\right\rangle^{A}\otimes\left|i\right\rangle^{B}$ and $\left|\phi\right\rangle^{AB}=\sum_{i}\sqrt{\phi_{i}}\left|i\right\rangle^{A}\otimes\left|i\right\rangle^{B}$ , shared between two parties $A$ and $B$ , together with a unitary $U_{\mathcal{P}}$ which can be performed on the state $\left|\psi\right\rangle^{AB}$ via a quantum communication protocol $\mathcal{P}$ , that uses $Q$ qubits of communication between $A$ and $B$ , we have that, for every $\epsilon$ :

[TABLE]

In words: If two shared states cannot be brought within small $\ell_{\infty}$ Earth Mover’s Distance of each other by moving an $\epsilon$ quantity of mass of their Schmidt coefficients, then they also cannot be brought closer than $1-O(\epsilon^{2})$ fidelity with each other without using $\Omega(d^{\epsilon}_{\infty}(\left|\psi\right\rangle,\left|\phi\right\rangle))$ qubits of communication (for sufficiently large values of $d^{\epsilon}_{\infty}(\left|\psi\right\rangle,\left|\phi\right\rangle)$ ). Thus, the $\epsilon$ -smoothed $\ell_{\infty}$ Earth Mover’s Distance provides a lower bound on the communication cost of state conversion. On the other hand, from the definition of $d^{\epsilon}_{\infty}(\left|\psi\right\rangle,\left|\phi\right\rangle)$ , stated in Definition 10, we note here that one can use Theorem 3 to move $\left|\psi\right\rangle$ to within $1-\epsilon$ fidelity of $\left|\phi\right\rangle$ using only $O(d^{\epsilon}_{\infty}(\left|\psi\right\rangle,\left|\phi\right\rangle))$ qubits of communication. To do this, omit the $\epsilon$ mass of Schmidt coefficients on which the two states have large $\epsilon$ -smoothed $\ell_{\infty}$ distance, and apply Theorem 3 as one would do with the regular $\ell_{\infty}$ Earth Mover’s Distance. In this sense $d^{\epsilon}_{\infty}(\left|\psi\right\rangle,\left|\phi\right\rangle)$ gives both an upper and lower bound on the communication cost of state conversion.

To put these bounds in context: One could consider entanglement concentration and dilution to be the starting point for the study of state conversion. The original paper on entanglement concentration and dilution [2] concerned the many-copy limit and did not attempt to bound the amount of classical communication used. The first time the classical communication cost of state conversion was considered explicitly seems to have been in [12], which could be said to establish a version of our upper bound in the case where the starting state is maximally entangled. (Their result is not quite that general but contains many of the key ideas.) A version of our lower bound was established, again for the case of starting with maximally entangled states, in [7, 8]. These lower bounds could be applied to general state conversion but relied on Rènyi entropy inequalities that are clearly not tight in many cases. Finally, as noted earlier, a full characterization of the communication cost of general state conversion was given in [4] but the resulting formula is complicated and there is not an efficient algorithm known to evaluate it.

We conclude the section with two remarks about notation.

Remark 1.

In theorem statements above, and where appropriate, we have made use of superscripts $A$ and $B$ , as in $\left|\psi\right\rangle^{AB}=\sum_{i}\sqrt{\psi_{i}}\left|i\right\rangle^{A}\otimes\left|i\right\rangle^{B}$ to explicitly denote the two halves of the bipartite division of a state. However, since all of the shared entangled states considered in this paper are bipartite, and since the two components of the bipartite division are generally clear from context, we will usually omit this notation.

Remark 2.

When considering a bipartite state $\left|\psi\right\rangle$ , we will assume that the state has a Schmidt decomposition of the form $\left|\psi\right\rangle=\sum_{i}\sqrt{\psi_{i}}\left|i\right\rangle\otimes\left|i\right\rangle$ across the implicit bipartite division. This is done in the theorem statements above and everywhere in the paper. We can assume this WLOG because any state that has the same Schmidt coefficients as $\left|\psi\right\rangle$ can be moved to this canonical form (and vice versa) using only local unitary transformations, which can be implemented with no quantum communication between the two components of the bipartite division. Thus our analysis of communication costs is unaffected by assuming WLOG that, in any quantum communication protocol, shared entangled states start and end in this form.

2 Entanglement-Assisted Communication Complexity

In this section we will discuss the proof of our main result, Theorem 1, which shows that arbitrary entanglement-assisted quantum communication protocols can be simulated by quantum communication protocols that use only the maximally entangled state as an entangled resource. A basic fact we will need is that two bipartite pure states which are sufficiently different in the distribution of mass across their Schmidt coefficients must be nearly orthogonal. This fact is stated for our specific purposes in Lemma 6 below. Crucially, such states remain nearly orthogonal even after one of them is acted on by any unitary which can be implemented with a small amount of quantum communication, as we detail in Lemma 5.

Lemma 5.

Given two quantum states $\left|\psi\right\rangle$ and $\left|\nu\right\rangle$ on $\mathcal{H}_{A}\otimes\mathcal{H}_{B}$ , such that the Schmidt coefficients of $\psi$ are upper bounded by $\lambda_{\max}$ , and those of $\nu$ are upper bounded by $\nu_{\max}$ , and further given a unitary transformation $\mathcal{U}$ on $\mathcal{H}_{A}\otimes\mathcal{H}_{B}$ which can be implemented using at most $Q$ qubits of communication between the $\mathcal{H}_{A}$ and $\mathcal{H}_{B}$ components of the Hilbert space, it follows that:

[TABLE]

Proof.

If $\mathcal{U}$ is a unitary transform using $Q$ qubits of communication, then $rk_{Schmidt}(\mathcal{U}\left|\nu\right\rangle)\leq 2^{Q}rk_{Schmidt}(\left|\nu\right\rangle)$ [8]. We also know that the Schmidt coefficients of $\mathcal{U}\left|\nu\right\rangle$ are bounded above by $2^{Q}\nu_{max}$ [8]. The desired result now follows by Lemma 6. ∎

Lemma 6.

Given two quantum states $\left|\psi\right\rangle$ and $\left|\nu\right\rangle$ on $\mathcal{H}_{A}\otimes\mathcal{H}_{B}$ , such that the Schmidt coefficients of $\psi$ are upper bounded by $\lambda_{\max}$ , and those of $\nu$ are upper bounded by $\nu_{\max}$ , we have:

[TABLE]

Proof.

For brevity let $r=rk_{Schmidt}(\left|\psi\right\rangle)$ . Schmidt decompose $\left|\psi\right\rangle$ and $\left|\nu\right\rangle$ as $\left|\psi\right\rangle=\sum_{i=0}^{r-1}\sqrt{\lambda_{i}}\left|i\right\rangle_{A}\otimes\left|i\right\rangle_{B}$ , as $\left|\nu\right\rangle=\sum_{j}\sqrt{\nu_{j}}\left|j\right\rangle_{A}\otimes\left|j\right\rangle_{B}$ . Define the matrix $M_{\nu}=\sum_{j}\sqrt{\nu_{j}}\left|j\right\rangle_{A}\otimes\left\langle j\right|_{B}^{*}$ , and note that

[TABLE]

Now, by definition of a Schmidt Decomposition, we know that the maximum singular value of $M_{\nu}$ is $\sqrt{\nu_{max}}$ . Thus, for all $i$ we have that $|\left\langle i_{A}\right|M_{\nu}\left|i_{B}^{*}\right\rangle|\leq\sqrt{\nu_{max}}$ (since $\left|i_{A}\right\rangle$ and $\left|i_{B}\right\rangle$ are normalized vectors by definition). It then follows that:

[TABLE]

∎

Theorem 1 is the main result of this work. The proof is long enough that a high-level outline may be valuable. Therefore will now give a brief, intuitive outline of the proof of Theorem 1, restated below for the reader’s convenience, and include the complete proof in Section E of the Appendix.

Theorem (Restatement of Theorem 1).

Consider a quantum communication protocol $\mathcal{R}$ whose goal is to compute a joint function $g(x,y)\in\{0,1\}$ . Suppose that $\mathcal{R}$ uses an arbitrary bipartite entangled state $\left|\psi\right\rangle^{AB}$ (of unbounded dimension), as well as $Q$ qubits of communication total, in either direction (for sufficiently large $Q\geq 15$ ). Then, for every $\epsilon>0$ , there exists a quantum communication protocol $\mathcal{R^{\prime}}$ which simulates $\mathcal{R}$ with error $\epsilon$ , while using only a maximally entangled state as an entangled resource (rather than $\left|\psi\right\rangle^{AB}$ or any other state), and using $O(Q/\epsilon+\log(1/\epsilon)/\epsilon)$ qubits of communication. Thus, if $\mathcal{R}$ computes $f$ with error $\epsilon^{\prime}$ it follows that $\mathcal{R^{\prime}}$ computes $f$ with error $\epsilon+\epsilon^{\prime}$ .

Outline of the Proof of Theorem 1:

The proof of Theorem 1 has three main parts. First, the initial entangled state used by the protocol can be converted using a small amount of communication to a state $\varphi$ in which the Schmidt coefficients are grouped into evenly spaced groups. This is achieved using Theorem 3.

Second, we show that our new “grouped” entangled state can be divided into three “pieces” (more precisely termed subset-matrices in Definition 20), one piece which has small trace norm and can therefore be omitted, one piece called $\varphi_{\text{far}}$ which only has non-zero terms which are far from the diagonal in the appropriate basis, and one piece called $\varphi_{\text{block}}$ which is a block-diagonal mixed state that can be produced with small error and low communication cost from a maximally entangled state.

to show that, if one starts with a quantum communication protocol with an arbitrary shared entangled state, then that protocol can be modified, using a small amount of additional communication, to instead use an entangled state, $\varphi$ , (a property which will be useful later in the proof). Once we have reduced, without loss of generality, to appropriately “grouped” entangled state $\varphi$ in this way, the proof proceeds in two halves. In the first half, which is summed up in Lemma 22, we show that

In the second half of the proof, which is summed up in Lemma 23, we show that the $\varphi_{\text{far}}$ piece of $\varphi$ has very little effect on the outcome of the quantum communication protocol in question. This means that $\varphi$ can be replaced by $\varphi_{\text{block}}$ alone while incurring very little error in the outcome of the quantum communication protocol. Since $\varphi_{\text{block}}$ can be produced with low cost from a maximally entangled state, this then achieves the desired result. The full proof of Theorem 1 is included in Section E of the Appendix. The role of Lemma 5 in the proof is within this step for controlling the terms far from the diagonal, in Lemma 23.

3 The Cost of State Transformation: A Lower Bound

It is natural at this point to discuss the background and proof for Theorem 4, which establishes a lower-bound on the cost of State Transformation by the $\epsilon$ -Smoothed $\ell_{\infty}$ Earth Mover’s Distance, and to postpone the discussion of Theorem 3 until Section 4, for two reasons. First, the proof of Theorem 4 in this section shares key techniques in common with the proof of Theorem 1 in Section 2 above, and so this progression may provide the reader with some continuity of thought while also reiterating the usefulness of the techniques. Second, Theorem 4 in this section motivates the notion of the $\ell_{\infty}$ Earth Mover’s Distance by highlighting its, perhaps surprising, relevance to lower bounding the cost of state transformation. This prepares the reader with some motivation for why the upper bound proven in Theorem 3, in Section 4 below, is interesting and potentially useful. Thus, covering Theorem 4 at this point may provide the reader with a reason to accept the $\epsilon$ -smoothed $\ell_{\infty}$ Earth Mover’s Distance as a useful proxy for the cost of State Transformation.

Whereas the proof of Theorem 3 in the next section will make direct use of Definition 2, the proof of Theorem 4 in this section is elucidated by first establishing an equivalent formulation of the $\ell_{\infty}$ Earth Mover’s Distance which is derived by establishing the relationship between the $\ell_{\infty}$ Earth Mover’s Distance as defined in Definition 2, and the Monge-Kantorovich Transportation distance on the real line, as shown below. After translating to this equivalent definition, stated in Definition 9, the generalization to the $\epsilon$ -smoothed $\ell_{\infty}$ Earth Mover’s Distance in Definition 10 is straightforward and natural.

Definition 7.

Given two probability distributions $\mu$ and $\nu$ on the real line, and a function $c:\mathbb{R}\times\mathbb{R}\to[0,\infty]$ the corresponding Monge-Kantorovich distance, $d_{MK}(\mu,\nu)$ between $\mu$ and $\nu$ is defined as:

[TABLE]

Where $\Gamma(\mu,\nu)$ is defined to be the collection of all probability distributions on $X\times Y\equiv\mathbb{R}\times\mathbb{R}$ which have marginal on $X$ equal to $\mu$ and marginal on $Y$ equal to $\nu$ .

In order to translate into a statement about quantum states, we make the following definition in a similar style to Definition 2:

Definition 8.

Given a bipartite shared state $\left|\psi\right\rangle=\sum_{i\in X}\sqrt{\psi_{i}}\left|i\right\rangle\otimes\left|i\right\rangle$ let us define a random variable $V_{\psi}$ which takes value $\log(\psi_{i})$ with probability $\psi_{i}$ (note that, since the $\psi_{i}$ sum to one, this is a well defined random variable). We now define $p_{\psi}$ to be the probability distribution of this random variable.

It is clear that, for every $\psi$ , $p_{\psi}$ is a probability distribution on the real line. One may note the following simple relationship between Monge-Kantorovich distance and $\ell_{\infty}$ Earth Mover’s Distance:

For any $d>0$ , consider the Monge-Kantorovich distance, $d_{MK}$ where the function $c:\mathbb{R}\times\mathbb{R}\to[0,\infty]$ is defined by $c(x,y)=1$ if $|x-y|\geq d$ and $c(x,y)=0$ if $|x-y|<d$ . Then, for any two quantum states $\left|\psi\right\rangle$ and $\left|\phi\right\rangle$ , we have that $d_{\infty}(\left|\psi\right\rangle,\left|\phi\right\rangle)<d$ if and only if $d_{MK}(p_{\psi},p_{\phi})=0$ .

Given this concrete connection between $\ell_{\infty}$ Earth Mover’s Distance and the Monge-Kantorovich distance, we can now make use of the following characterization of Monge-Kantorovich distance for distributions on the real line, which is well known in optimal transport theory:

Fact.

Let $\mu$ and $\nu$ be probability distributions supported on the real line, and let $F_{\mu}$ and $F_{\nu}$ be their cumulative distribution functions, respectively. Then, for any $c:\mathbb{R}\times\mathbb{R}\to[0,\infty]$ :

[TABLE]

It follows from this Fact, combined with the discussion above, that an equivalent definition of the $\ell_{\infty}$ Earth Mover’s Distance is given by:

Definition 9.

[TABLE]

In the context of this equivalent formulation of $\ell_{\infty}$ Earth Mover’s Distance, we can succinctly introduce a “smoothed” version of the same distance. The reader may note that, since the above definition of $d_{\infty}(\left|\psi\right\rangle,\left|\phi\right\rangle)$ is evidently not robust against tiny changes of either distribution in the total variation distance it would be impossible to prove a lower bound of the form of Theorem 4 if stated using that definition. Hence the motivation for introducing a “smoothed” version of the distance measure, which has built-in robustness by definition.

Definition 10.

$\epsilon$ -Smoothed $\ell_{\infty}$ -Earth Mover’s Distance

[TABLE]

With this definition in place we can now state the lower bound.

Theorem (Restatement of Theorem 4).

Given any two bipartite shared states $\left|\psi\right\rangle^{AB}=\sum_{i}\sqrt{\psi_{i}}\left|i\right\rangle^{A}\otimes\left|i\right\rangle^{B}$ and $\left|\phi\right\rangle^{AB}=\sum_{i}\sqrt{\phi_{i}}\left|i\right\rangle^{A}\otimes\left|i\right\rangle^{B}$ , shared between two parties $A$ and $B$ , together with a unitary $U_{\mathcal{P}}$ which can be performed on the state $\left|\psi\right\rangle^{AB}$ via a quantum communication protocol $\mathcal{P}$ , that uses $Q$ qubits of communication between $A$ and $B$ , we have that, for every $\epsilon$ :

[TABLE]

Intuitively, Theorem 4 states that two bipartite shared states which are far apart in the $\epsilon$ -Smoothed $\ell_{\infty}$ -Earth Mover’s Distance, cannot be made equal via a quantum communication protocol unless it uses at least $c\cdot d^{\epsilon}_{\infty}(\left|\psi\right\rangle,\left|\phi\right\rangle)$ qubits of communication (for a particular constant $c$ which can be computed from the statement of Theorem 4).

Proof.

Suppose that two bipartite shared states $\left|\psi\right\rangle$ and $\left|\phi\right\rangle$ have $d^{\epsilon}_{\infty}(\left|\psi\right\rangle,\left|\phi\right\rangle)=d$ . By definition $\exists p\in[0,1]$ such that

[TABLE]

Suppose that $F_{p_{\psi}}^{-1}(p)<F_{p_{\phi}}^{-1}(r)$ (if the opposite is true then we simply switch the roles of $\psi$ and $\phi$ and continue with the same proof). Define $x\equiv F_{p_{\psi}}^{-1}(p)$ . Further define $\left|\psi\right\rangle_{\leq x}\equiv\sum_{\{i:|\log{1/\psi_{i}}|\leq x\}}\sqrt{\psi_{i}}\left|i\right\rangle\otimes\left|i\right\rangle$ , and $\left|\psi\right\rangle_{>x}\equiv\left|\psi\right\rangle-\left|\psi\right\rangle_{\leq x}$ . Similarly define $\left|\phi\right\rangle_{\geq x+d}\equiv\sum_{\{i:|\log{1/\phi_{i}}|\geq x+d\}}\sqrt{\phi_{i}}\left|i\right\rangle\otimes\left|i\right\rangle$ , and $\left|\phi\right\rangle_{<x+d}\equiv\left|\phi\right\rangle-\left|\phi\right\rangle_{\geq x+d}$ . Note that $\left|\psi\right\rangle_{\leq x}$ , and $\left|\psi\right\rangle_{>x}$ are orthogonal, as are $\left|\phi\right\rangle_{<x+d}$ and $\left|\phi\right\rangle_{\geq x+d}$ .

Since we have $x\equiv F_{p_{\psi}}^{-1}(p)$ it follows from the definitions that $||\left|\psi\right\rangle_{\leq x}||^{2}=p$ . Since $F_{p_{\psi}}(x)=p$ , and $F_{p_{\psi}}^{-1}(p)<F_{p_{\phi}}^{-1}(r)$ , it follows from Equation 1 that $F_{p_{\phi}}(x+d)\leq p-\epsilon$ . Therefore, $||\left|\phi\right\rangle_{<x+d}||^{2}\leq p-\epsilon$ and thus $||\left|\phi\right\rangle_{\geq x+d}||^{2}=1-||\left|\phi\right\rangle_{<x+d}||^{2}\geq 1-p+\epsilon$ .

The main idea in the proof of this theorem is that we can now partition $\left|\psi\right\rangle,\left|\phi\right\rangle$ into three nearly orthogonal parts, depending on $U_{\mathcal{P}}$ , as follows:

Definition 11.

[TABLE]

Lemma 12.

For $i,j\in\{1,2,3\}$ with $i\neq j$ , we have that $|\langle\phi^{i}|\psi^{j}\rangle|\leq h(Q,d)$ , $|\langle\psi^{i}|\psi^{j}\rangle|\leq h(Q,d)$ , and $|\langle\phi^{i}|\phi^{j}\rangle|\leq h(Q,d)$ , where $h(Q,d)\equiv 4\cdot 2^{\frac{3Q-d}{2}}$ .

The proof of Lemma 12 is given separately in the appendix. Within that proof is the key use of Lemma 5 which is the primary conceptual step in proving Theorem 4. Understanding the proof of Lemma 12 is also the best way of understanding the motivation behind Definition 11 above.

It follows from the definitions that:

[TABLE]

While the individual $\left|\psi^{i}\right\rangle$ and $\left|\phi^{i}\right\rangle$ are not necessarily all orthogonal we do have $\left|\psi^{2}\right\rangle\perp\left|\psi^{3}\right\rangle$ and $\left|\psi^{1}\right\rangle\perp\left|\psi^{2}\right\rangle+\left|\psi^{3}\right\rangle$ . Likewise $\left|\phi^{1}\right\rangle\perp\left|\phi^{2}\right\rangle$ and $\left|\phi^{3}\right\rangle\perp\left|\psi^{1}\right\rangle+\left|\psi^{2}\right\rangle$ . Together these imply

[TABLE]

From Lemma 12 it follows that:

[TABLE]

Now recall that

[TABLE]

We now return to Equation 5. Setting $x_{i}=\|\,|\psi^{i}\rangle\|$ and $y_{i}=\|\,|\phi^{i}\rangle\|$ for $i=1,2,3$ we have

[TABLE]

where $x_{1}=\sqrt{p}$ , $y_{3}\geq\sqrt{1-p+\epsilon}$ and $(x_{1},x_{2},x_{3}),(y_{1},y_{2},y_{3})$ are unit vectors. We claim that this quantity is maximized by setting $x_{2}=y_{2}=0$ and $y_{3}=\sqrt{1-p+\epsilon}$ . Indeed we can upper bound $\sqrt{p}y_{1}+x_{2}y_{2}\leq x_{12}y_{12}$ where $x_{12}\equiv\sqrt{x_{1}^{2}+x_{2}^{2}}$ and $y_{12}\equiv\sqrt{y_{1}^{2}+y_{2}^{2}}$ . Now define $x_{12}=\cos(\alpha),x_{3}=\sin(\alpha),y_{12}=\cos(\beta),y_{3}=\sin(\beta)$ and we have

[TABLE]

This is maximized by taking $(x_{1},x_{2},x_{3})=(\sqrt{p},0,\sqrt{1-p})$ and $(y_{1},y_{2},y_{3})=(\sqrt{p-\epsilon},0,\sqrt{1-p+\epsilon})$ . Thus

[TABLE]

Finally we would like an upper bound independent of $p$ . This maximization is performed in the proof of Fact 18 from Section B of the Appendix and yields the following.

[TABLE]

∎

4 The Cost of State Transformation: An Upper Bound

In this section we will give a proof of Theorem 3, which states that the quantum communication cost of converting between two bipartite entangled states is upper bounded by the $\ell_{\infty}$ Earth Mover’s Distance between those states. This upper bound represents the second half of our two sided argument (employing both Theorem 3 and Theorem 4) that the $\ell_{\infty}$ Earth Mover’s Distance is a simple and efficiently computable proxy for the cost of state conversion. The proof is divided into two parts which are proved separately in Lemma 15, and Lemma 16 together with Corollary 17. At a high level Lemma 15 tells us that, given bipartite states $\left|\chi\right\rangle$ and $\left|\upsilon\right\rangle$ , one can map the Schmidt coefficients of $\left|\chi\right\rangle$ directly onto the Schmidt coefficients of $\left|\upsilon\right\rangle$ using a series of bipartite “flows” that have small degree (where degree is a quantity defined below). Lemma 16 and Corollary 17 then tell us that any such “flow” which has small degree, can be implemented as an actual bipartite state transformation, with correspondingly small communication required.

Here we establish Lemmas 15 and 16 which, together, prove the desired theorem. We begin with a couple definitions establishing the concept of flows, as we use it here.

Definition 13 (Right (Left) Index-1 Flow ).

Fix two states $\left|\chi\right\rangle=\sum_{i\in X}\sqrt{\chi_{i}}\left|i\right\rangle\otimes\left|i\right\rangle$ and $\left|\upsilon\right\rangle=\sum_{j\in Y}\sqrt{\upsilon_{j}}\left|j\right\rangle\otimes\left|j\right\rangle$ . A Right Index-1 Flow from $\left|\chi\right\rangle$ to $\left|\upsilon\right\rangle$ is a bipartite graph $G_{X,Y}$ with vertices given by $X\cup Y$ , and edge set $E_{X,Y}$ , such that:

•

Each vertex in $j\in Y$ has index 1 in $G_{X,Y}$ .

•

For all $i\in X$ , $\chi_{i}=\sum_{j\in Y:(i,j)\in E_{X,Y}}\upsilon_{j}$

If the roles of $\left|\chi\right\rangle$ and $\left|\upsilon\right\rangle$ are reversed in the above, then we say that there is a Left Index-1 Flow from $\left|\upsilon\right\rangle$ to $\left|\chi\right\rangle$ . Equivalently, there is a Left Index-1 Flow from $\left|\upsilon\right\rangle$ to $\left|\chi\right\rangle$ exactly when there is a a Right Index-1 Flow from $\left|\chi\right\rangle$ to $\left|\upsilon\right\rangle$ .

Definition 14 (Degree of a Right (Left) Index-1 Flow ).

We define the degree of a Right (Left) Index-1 Flow from $\left|\chi\right\rangle=\sum_{i\in X}\sqrt{\chi_{i}}\left|i\right\rangle\otimes\left|i\right\rangle$ to $\left|\upsilon\right\rangle=\sum_{j\in Y}\sqrt{\upsilon_{j}}\left|j\right\rangle\otimes\left|j\right\rangle$ to be the maximum index of any vertex in the bipartite graph $G_{X,Y}$ .

The following lemma, which a key step in proving Theorem 3, establishes that bipartite states which are close to each other in the $\ell_{\infty}$ Earth Mover’s Distance of Definition 2, can be mapped to each other through a series of flows of bounded degree. This series of flows intuitively establishes a map for converting one bipartite state to the other using bounded quantum communication, in a manner that will be made rigorous in Lemma 16. The main step in the proof of Lemma 15 involves constructing a flow through a type of greedy algorithm whose analysis has a number of subtle cases. In order to concretely exhibit these cases the entire greedy algorithm, including every case, is written out in pseudocode in Algorithm 1.

Lemma 15.

Given two states $\left|\chi\right\rangle$ and $\left|\upsilon\right\rangle$ , there exist two “intermediate” states $\left|\gamma\right\rangle$ and $\left|\rho\right\rangle$ , such that there is a Right Index-1 Flow from $\left|\chi\right\rangle$ to $\left|\gamma\right\rangle$ of degree at most $2^{2\lceil d_{\infty}(\left|\chi\right\rangle,\left|\upsilon\right\rangle)\rceil+4}$ , a Left Index-1 Flow from $\left|\gamma\right\rangle$ to $\left|\rho\right\rangle$ of degree at most $2^{\lceil d_{\infty}(\left|\chi\right\rangle,\left|\upsilon\right\rangle)\rceil+2}$ , and a Left Index-1 Flow from $\left|\rho\right\rangle$ to $\left|\upsilon\right\rangle$ of degree at most $2^{\lceil d_{\infty}(\left|\chi\right\rangle,\left|\upsilon\right\rangle)\rceil+2}$ .

The Proof of Lemma 15 is included in the Appendix, section G.

Lemma 15, above, shows that two bipartite entangled states can be connected to each other by a series of flows which have a degree which is bounded in terms of the $\ell_{\infty}$ Earth Mover’s Distance between them. The next step is to establish that every flow can be implemented via a quantum communication protocol. Lemma 16 and Corollary 17, below, accomplish this by showing that, if two bipartite states can be connected by flows of small degree, then one state can be converted to the other (and vice versa) using a quantum communication protocol which only requires small amounts of communication.

Lemma 16.

Given two states $\left|\tau\right\rangle$ and $\left|\kappa\right\rangle$ such that there is a Right Index-1 Flow from $\left|\tau\right\rangle$ to $\left|\kappa\right\rangle$ with degree at most $2^{Q}$ , there exists a quantum communication protocol $\mathcal{P}$ , which uses $Q$ qubits of communication, and converts the shared state $\left|\tau\right\rangle$ to the shared state $\left|\kappa\right\rangle$ .

The idea of the proof is that if $\left|\tau\right\rangle=\sum_{i}\sqrt{\tau_{i}}\left|i\right\rangle\otimes\left|i\right\rangle$ then it suffices to define separately protocols for each $\left|i\right\rangle\otimes\left|i\right\rangle$ term. These protocols simply use quantum communication to create a shared entangled state, resulting in the state $\sum_{i}\tau_{i}\left|i\right\rangle_{A}\otimes\left|i\right\rangle_{B}\otimes\left|\psi_{i}\right\rangle_{A^{\prime}B^{\prime}}$ . Choosing the Schmidt coefficients according to the given Right Index-1 Flow yields the result. The details of this argument are in the Appendix xC.

Corollary 17 establishes the same result as Lemma 16, but in the reverse direction.

Corollary 17.

Given two states $\left|\tau\right\rangle$ and $\left|\kappa\right\rangle$ such that there is a Left Index-1 Flow from $\left|\kappa\right\rangle$ to $\left|\tau\right\rangle$ with degree at most $2^{Q}$ , then, for two parties sharing entangled state $\left|\kappa\right\rangle$ , there exists a quantum communication protocol $\mathcal{P}$ , which uses $Q$ qubits of communication, and converts the shared state $\left|\kappa\right\rangle$ to the shared state $\left|\tau\right\rangle$ .

The proof of Corollary 17 is straightforward and appears in Appendix D.

Theorem (Restatement of Theorem 3).

Let $\left|\chi\right\rangle^{AB}$ and $\left|\upsilon\right\rangle^{AB}$ be two bipartite shared states. There is a protocol $\mathcal{M}_{\chi\rightarrow\upsilon}$ which can prepare $\left|\upsilon\right\rangle$ from $\left|\chi\right\rangle$ , using only $4\lceil d_{\infty}(\left|\chi\right\rangle,\left|\upsilon\right\rangle)\rceil+8$ qubits of communication.

Proof.

The proof follows by applying Lemma 15, followed by Lemma 16 and Corollary 17. ∎

Appendix A Proof of Lemma 12

Proof.

First note that it is immediate from the definitions that $\left\langle\phi^{2}\middle|\psi^{1}\right\rangle=\left\langle\phi^{3}\middle|\psi^{2}\right\rangle=0$ , so the conditions of the lemma are automatically satisfied in those cases.

To bound the remaining inner products we will first prove a bound on the inner product $|\left\langle\phi^{3}\middle|\psi^{1}\right\rangle|$ and note that the remaining inner products are bounded as a consequence of this first bound. For notational convenience, while establishing the bound on $|\left\langle\phi^{3}\middle|\psi^{1}\right\rangle|$ , we set $\left|\rho\right\rangle\equiv\left|\psi\right\rangle_{\leq x}$ , and let $\rho_{j}$ be the non-zero Schmidt coefficients of $\left|\rho\right\rangle$ (which are just a renamed version of the non-zero Schmidt coefficients of $\left|\psi\right\rangle_{\leq x}$ ). Therefore, we know that, for all $j$ , $1\geq\rho_{j}\geq 2^{-x}$ , and $\left|\psi\right\rangle_{\leq x}=\left|\rho\right\rangle=\sum_{j}\sqrt{\rho_{j}}\left|j\right\rangle\otimes\left|j\right\rangle$ . The purpose of this renaming convention is that we can now cleanly make the following definition. For integers $i$ define $\left|\rho\right\rangle_{i}\equiv\sum_{\{j:i<|\log{1/\rho_{j}}|\leq i+1\}}\sqrt{\rho_{j}}\left|j\right\rangle\otimes\left|j\right\rangle$ , so that we have $\left|\psi\right\rangle_{\leq x}=\left|\rho\right\rangle=\sum_{i=-1}^{\lceil x\rceil}\left|\rho\right\rangle_{i}$ , and $\left\langle\rho_{k}\middle|\rho_{i}\right\rangle=0$ whenever $k\neq i$ . So,

[TABLE]

By definition, for any $1\leq i\leq\lceil x\rceil$ , the Schmidt coefficients of $\left|\rho\right\rangle_{i}$ are upper bounded by $2^{-i}$ , and lower bounded by $2^{-(i+1)}$ , and from the latter we have $rk_{Schmidt}(\left|\rho\right\rangle_{i})\leq 2^{i+1}\left\|\left|\rho\right\rangle\right\|^{2}$ . Furthermore, the Schmidt coefficients of $\left|\phi_{\geq x+d}\right\rangle$ are upper bounded by $2^{-(x+d)}$ , and thus, we have by Lemma 5 that:

[TABLE]

where the final inequality follows because $i\leq\lceil x\rceil$ by assumption. Thus,

[TABLE]

where the second inequality follows by Equation A and the subsequent equality follows by Equation 9. Having established this upper bound on $\left|\left\langle\phi^{3}\middle|\psi^{1}\right\rangle\right|$ we now proceed with bounding the other inner products in the Lemma statement:

[TABLE]

where both of the inequality steps follow by Equation 12 (the first of which also uses the triangle inequality).

[TABLE]

Now, as noted earlier, $\left\langle\phi^{2}\middle|\psi^{1}\right\rangle=\left\langle\phi^{3}\middle|\psi^{2}\right\rangle=0$ . Continuing with the cross terms we have:

[TABLE]

where the last inequality follows from Equation 12. And, since we already have $\left|\left\langle\phi^{3}\middle|\psi^{1}\right\rangle\right|\leq h(Q,d)$ from Equation A, the final inner product to bound is:

[TABLE]

where the last inequality follows by Equation 12.

∎

Appendix B Fact 18

Fact 18.

For $p\in[0,1]$ and $0\leq\epsilon\leq p$ , $\sqrt{p-\epsilon}\sqrt{p}+\sqrt{1-p}\sqrt{1-p+\epsilon}\leq 1-\frac{1}{8}\epsilon^{2}$

Proof.

Define $f(x)\equiv\sqrt{p-x}\sqrt{p}+\sqrt{1-p}\sqrt{1-p+x}$ . Note that $f^{\prime}(x)=-\frac{\sqrt{p}}{2\sqrt{p-x}}+\frac{\sqrt{1-p}}{2\sqrt{1-p+x}}$ , and $f^{\prime\prime}(x)=-1/4\left(\frac{\sqrt{p}}{(p-x)^{3/2}}+\frac{\sqrt{1-p}}{(1-p+x)^{3/2}}\right)$ . So, $f(0)=1$ , $f^{\prime}(0)=0$ , and

[TABLE]

for all $p\in[0,1]$ and $0\leq x\leq p$ . It follows by integration that:

[TABLE]

So,

[TABLE]

∎

Appendix C Proof of Lemma 16

Proof.

By assumption there is a Right Index-1 Flow from $\left|\tau\right\rangle$ to $\left|\kappa\right\rangle$ with degree at most $2^{Q}$ , so there exists a bipartite graph $G_{X,Y}$ with vertices given by $X\cup Y$ , and edge set $E_{X,Y}$ , such that:

•

Each vertex in $j\in Y$ has index 1 in $G_{X,Y}$ .

•

For all $i\in X$ , $\tau_{i}=\sum_{j\in Y:(i,j)\in E_{X,Y}}\kappa_{j}$ .

•

The maximum degree of any vertex $i\in X$ in $G_{X,Y}$ is $2^{Q}$ .

The protocol for Alice and Bob to start with shared state $\left|\tau\right\rangle$ and end up with shared state $\left|\kappa\right\rangle$ will proceed as follows: Beginning with the state $\left|\tau\right\rangle$ shared between Alice and Bob, we will refer to the register containing the Alice half of $\left|\tau\right\rangle$ as $A$ , and the register containing the Bob half as $B$ . Alice will append two additional registers, of $Q$ qubits each, and initialize each of them to the all zeros state. We will call these two new registers $C_{1}$ and $C_{2}$ respectively. Alice will then perform a controlled unitary operation between $A$ and the registers $C_{1}$ and $C_{2}$ . She will then pass the register $C_{2}$ to Bob using $Q$ qubits of quantum communication to do so. Bob will then perform a controlled unitary between $B$ and $C_{2}$ , Alice will perform a controlled unitary between $A$ and $C_{1}$ , and after that Alice and Bob will share the state $\left|\kappa\right\rangle$ .

To describe the protocol more precisely we will define the specific controlled unitaries performed by Alice and Bob at each step. Beginning with a shared state $\left|\tau\right\rangle$ , after Alice appends the two additional $Q$ -qubit registers to her side of $\left|\tau\right\rangle$ , the shared state looks as follows:

[TABLE]

Where, initially, Alice holds the registers $A$ , $C_{1}$ , and $C_{2}$ . Alice now performs a controlled unitary operation, acting on registers $C_{1}$ and $C_{2}$ and controlled on register $A$ . To describe this controlled unitary concisely we will need to imagine that there is some total order on the elements $j\in Y$ (any total order will do, one can simply imagine that the $j$ ’s are indexed by bit strings which encode integers), and we will define $s_{ij}\equiv|\{j^{\prime}\in Y:j^{\prime}<j,\text{ and }(i,j^{\prime})\in E_{X,Y}\}|$ . Note that, since every $i\in X$ has degree at most $2^{Q}$ , $s_{ij}$ is always an integer between [math] and $2^{Q}$ , so it can always be expressed in binary as a $Q$ -bit binary number. We will take this convention in the following argument.

Now to define Alice’s controlled unitary: When controlled on $\left|i\right\rangle_{A}$ Alice’s unitary moves the state $\left|0^{\otimes Q}\right\rangle_{C_{1}}\otimes\left|0^{\otimes Q}\right\rangle_{C_{2}}$ to the state $\left|i\text{-controlled}\right\rangle_{C_{1}C_{2}}\equiv\sum_{j\in Y:(i,j)\in E_{X,Y}}\sqrt{\kappa_{j}/\tau_{i}}\left|s_{ij}\right\rangle_{C_{1}}\otimes\left|s_{ij}\right\rangle_{C_{2}}$ . Note that since $s_{ij}$ is always a $Q$ -bit binary string, it can always be contained in the $Q$ -qubit registers $C_{1}$ and $C_{2}$ . Further note that, since $\tau_{i}=\sum_{j\in Y:(i,j)\in E_{X,Y}}\kappa_{j}$ by assumption, $\left|i\text{-controlled}\right\rangle_{C_{1}C_{2}}$ is a normalized pure state. Thus there exists a unitary operation that moves $\left|0^{\otimes Q}\right\rangle_{C_{1}}\otimes\left|0^{\otimes Q}\right\rangle_{C_{2}}$ to $\left|i\text{-controlled}\right\rangle_{C_{1}C_{2}}$ and Alice need only perform this specific unitary when the control register is in state $\left|i\right\rangle_{A}$ . So, when Alice applies this controlled unitary to her registers $C_{1}$ , $C_{2}$ and $A$ (where $A$ is the controlling register), the resulting new shared state between Alice and Bob is:

[TABLE]

At this point Alice uses $Q$ qubits of communication to pass the $Q$ -qubit register $C_{2}$ to Bob. The resulting shared state is:

[TABLE]

Where Alice owns registers $C_{1}$ and $A$ , and Bob owns registers $C_{2}$ and $B$ . Now it is not hard to see from the definition of $s_{ij}$ and the fact that every $j\in Y$ has degree exactly 1 in the graph $G_{X,Y}$ , that there is a bijection mapping each $j\in Y$ to the tuple $(i,s_{ij})$ . Alice and Bob both know this bijection since they know the description of $G_{X,Y}$ , and since bijections are invertible, Alice and Bob can now both apply a local unitary which relabels the basis element $\left|i\right\rangle\otimes\left|s_{ij}\right\rangle$ to the basis element $j$ . The resulting shared state is:

[TABLE]

Where the first equality follows because each $j\in Y$ appears in the initial sum exactly once (because $j$ has degree exactly one in $G_{X,Y}$ ).

This completes the protocol.

∎

Appendix D Proof of Corollary 17

Proof.

By definition, if there is a Left Index-1 Flow from $\left|\kappa\right\rangle$ to $\left|\tau\right\rangle$ , then there is a Right Index-1 Flow from $\left|\tau\right\rangle$ to $\left|\kappa\right\rangle$ (which is the starting assumption of Lemma 16). One can check that, in the proof Lemma 16, every operation performed by Alice and Bob was reversible. Therefore, the proof of this corollary is simply to start at the end of the proof of Lemma 16, and “reverse” every step of the proof in order from end to beginning (including the communication step…now communication goes from Bob to Alice rather than Alice to Bob). The result is the desired quantum communication protocol, which converts the shared state $\left|\kappa\right\rangle$ to the shared state $\left|\tau\right\rangle$ using $Q$ qubits of communication. ∎

Appendix E Proof of Theorem 1

A concept which will be useful in the proof of Theorem 1 is the notion of the spread of a state:

Definition 19 (Spread).

For a finite dimensional bipartite entangled state $\left|\psi\right\rangle^{AB}=\sum_{i}\sqrt{\psi_{i}}\left|i\right\rangle^{A}\otimes\left|i\right\rangle^{B}$ let $\lambda_{max}$ be the maximum of the Schmidt coefficients of $\psi$ , and let $\lambda_{min}$ be the minimum Schmidt coefficient. We define the spread of $\left|\psi\right\rangle$ to be the quantity $\log(\lambda_{max}/\lambda_{min})$ .

We note that the above definition of spread is given in the case of finite dimensional $\left|\psi\right\rangle$ , which is the only case we will need. There is also an $\epsilon$ -smoothed variant of the spread of a state [8, 5], but it will not be needed for this proof. Within the proof of Theorem 1 the spread of a bipartite state will be used as a proxy for the amount of communication required to create that state from a maximally entangled state. This intuition is formalized, for example, by Theorem 3, but in this case of converting from a maximally entangled state, is also an implication of earlier works, such as [7, 8].

Theorem (Restatement of Theorem 1).

Consider a quantum communication protocol $\mathcal{R}$ whose goal is to compute a joint function $g(x,y)\in\{0,1\}$ . Suppose that $\mathcal{R}$ uses an arbitrary bipartite entangled state $\left|\psi\right\rangle^{AB}$ (of unbounded dimension), as well as $Q$ qubits of communication total, in either direction (for sufficiently large $Q\geq 15$ ). Then, for every $\epsilon>0$ , there exists a quantum communication protocol $\mathcal{R^{\prime}}$ which simulates $\mathcal{R}$ with error $\epsilon$ , while using only a maximally entangled state as an entangled resource (rather than $\left|\psi\right\rangle^{AB}$ or any other state), and using $O(Q/\epsilon+\log(1/\epsilon)/\epsilon)$ qubits of communication. Thus, if $\mathcal{R}$ computes $f$ with error $\epsilon^{\prime}$ it follows that $\mathcal{R^{\prime}}$ computes $f$ with error $\epsilon+\epsilon^{\prime}$ .

Proof.

Given $\mathcal{R}$ , $g$ , and $\left|\psi\right\rangle$ as in the theorem statement, Schmidt decompose $\left|\psi\right\rangle$ as $\sum_{i}\sqrt{\lambda_{i}}\left|i,i\right\rangle$ (see Remark 2 for why we may assume WLOG that $\left|\psi\right\rangle$ has this form).

Let $N\geq 2$ be an integer, which will be specified later. Define a function $f:[0,1]\to\{0,1,\ldots,N\}$ given by

[TABLE]

and define a new state $\left|\varphi\right\rangle\equiv\sum_{i}\sum_{j\in\{1,...,f(\lambda_{i})\}}\sqrt{\nu_{i,j}}\left|(i,j),(i,j)\right\rangle$ , where $\nu_{i,j}\equiv\frac{\lambda_{i}}{f(\lambda_{i})}$ . Note that $\sum_{i,j}\nu_{i,j}=1$ , so that $\left|\varphi\right\rangle$ is a normalized pure state. Furthermore, every Schmidt coefficient $\nu_{i,j}$ of $\left|\varphi\right\rangle$ is within a multiple of $2$ of the integer power $2^{-\left\lceil\frac{\log(1/\lambda_{i})}{N}\right\rceil N}$ . This follows because

[TABLE]

Next, we can upper bound $d_{\infty}(\left|\psi\right\rangle,\left|\varphi\right\rangle)\leq N$ by considering the coupling in which each $\nu_{i,j}$ is moved to $\lambda_{i}$ . The largest distance obtained here is the maximum $\log f(\lambda_{i})$ for which $\lambda_{i}>0$ , and this in turn is $\leq N$ . Therefore, by Theorem 3, there is a protocol $\mathcal{M}$ by which Alice and Bob can prepare $\left|\varphi\right\rangle$ from $\left|\psi\right\rangle$ , using $4\lceil d_{\infty}(\left|\chi\right\rangle,\left|\upsilon\right\rangle)\rceil+8\leq 4N+8$ qubits of communication. (For this special case, of course a simpler protocol could also be used.)

Define $\mathcal{C}\equiv\mathcal{R}\circ\mathcal{M}$ to be the composed protocol in which Alice and Bob start with shared state $\left|\varphi\right\rangle$ , first use protocol $\mathcal{M}$ to convert $\left|\varphi\right\rangle$ to $\left|\psi\right\rangle$ , and then perform protocol $\mathcal{R}$ using shared state $\left|\psi\right\rangle$ and inputs $x$ and $y$ , to compute the joint function $g(x,y)$ . It is evident that $\mathcal{C}$ has exactly the same success probability as $\mathcal{R}$ . Since $\mathcal{M}$ uses at most $4N+8$ qubits of communication and $\mathcal{R}$ uses $Q$ qubits of communication, $\mathcal{C}$ can be performed with $Q+4N+8$ qubits of communication.

For $j$ a nonnegative integer, define $I_{j}:=\{i:2^{-jN+1}\geq\lambda_{i}>2^{-jN-1}\}$ and define the subnormalized state

[TABLE]

From Equation (15) and the surrounding discussion, we have that $\left|\varphi\right\rangle=\sum_{j}\left|\varphi_{j}\right\rangle$ . Furthermore, by the definition of $I_{j}$ , it follows that $\left|\varphi_{j}\right\rangle$ has spread at most $2$ ; note that the spread of $\left|\varphi_{j}\right\rangle$ does not depend on whether the state is normalized or not.

The idea of the proof is that different $\left|\varphi_{j}\right\rangle$ are not only orthogonal, but must remain approximately orthogonal even after a small amount of quantum communication. In particular, note that for any $j$ , $rk_{Schmidt}(\left|\varphi_{j}\right\rangle)\leq 2^{jN+1}\|\left|\varphi_{j}\right\rangle\|^{2}$ . Furthermore, for all $l$ we have, by definition, that the Schmidt coefficients of $\left|\varphi_{l}\right\rangle$ are bounded above by $2^{-lN+1}$ . Therefore, if $U$ is a unitary transform using $M$ qubits of communication, then, it follows by Lemma 5, that $\forall j,k$ ,

[TABLE]

To apply this to our problem, we first note that the protocol $\mathcal{C}$ depends, a priori, on the inputs $x,y$ to the function $g(x,y)$ that we wish to compute (just like the the protocol $\mathcal{R}$ ). We now fix any input pair $x,y$ and for the remainder of the proof of this theorem we will perform only transformations of the shared state which do not depend on the value of $x,y$ . We will therefore establish that our transformation to a maximally entangled shared state does not significantly impact the success probability of the quantum communication protocol regardless of the value of $x,y$ . The desired Theorem then follows.

With the input $x,y$ now fixed, we observe that the success probability of protocol $\mathcal{C}$ (which we have already established is equal to the success probability of the original protocol $\mathcal{R}$ ) can be expressed WLOG by performing $\mathcal{C}$ and then computing the probability of outcomes when measuring the first qubit in the computational basis. The probability that such a measurement on protocol $\mathcal{C}$ outputs $b\in\{0,1\}$ is

[TABLE]

where $I$ acts on all qubits except for the first, which is being measured. Define $\mathcal{P}\equiv\mathcal{C}^{\dagger}(\sigma_{z}\otimes I)\mathcal{C}=\mathcal{C}^{\dagger}(\left|0\right\rangle\left\langle 0\right|\otimes I)\mathcal{C}-\mathcal{C}^{\dagger}(\left|1\right\rangle\left\langle 1\right|\otimes I)\mathcal{C}$ . Then

[TABLE]

Observe, for later, that $\mathcal{P}$ is a unitary operator that can be implemented using $2Q+8N+16$ qubits of communication.

The proof will proceed as follows: In Lemma 22 we show that the density matrix $\varphi=\left|\varphi\right\rangle\left\langle\varphi\right|$ can be divided into three “pieces” (in a manner that does not depend on the inputs $x,y$ ), one piece which has small trace norm and can therefore be omitted, one piece called $\varphi_{\text{far}}$ which only has non-zero terms which are far from the diagonal in the appropriate basis, and one piece called $\varphi_{\text{block}}$ which is a block-diagonal mixed state that can be produced with small error and low communication cost from a maximally entangled state. Then, in Lemma 23, we show that the $\varphi_{\text{far}}$ piece of $\varphi$ has very little effect on the protocol $\mathcal{C}$ . This means that $\varphi$ can be replaced by $\varphi_{\text{block}}$ alone while incurring very little error in the success probability of $\mathcal{C}$ . Stated equivalently, via the equality in Equation 18 above, Lemma 23 shows that the quantity $\left|\mbox{\rm Tr}(\mathcal{P}(\varphi-\varphi_{\text{block}}))\right|$ is small. Since we know from Lemma 22 that $\varphi_{\text{block}}$ can be produced with low cost from a maximally entangled state, this leads us to the desired result. Since $\varphi_{\text{block}}$ does not depend on the inputs $x,y$ this same statement holds for every pair of inputs $x,y$ . From this point forward we will no longer specify the fixed inputs $x,y$ , as it will be clear that the state substitutions do not depend on these inputs, and thus that the argument holds for every input as discussed in this paragraph.

We now establish some notation which will be useful throughout the rest of the proof:

Definition 20 (subset-matrix).

Consider operators on the Hilbert space which is the span of the $\left|\varphi_{j}\right\rangle$ . We say that an operator $M^{\prime}$ is a subset-matrix of an operator $M$ , if it is the case that for all $l,k$ either $\left\langle\varphi_{l}\right|M^{\prime}\left|\varphi_{k}\right\rangle=\left\langle\varphi_{l}\right|M\left|\varphi_{k}\right\rangle$ , or $\left\langle\varphi_{l}\right|M^{\prime}\left|\varphi_{k}\right\rangle=0$ .

Definition 21 (Non-Zero Set).

For an operator $\theta$ on the Hilbert space which is the span of the $\left|\varphi_{j}\right\rangle$ , define the non-zero set of $\theta$ to be $T_{\theta}=\{(l,k):\left\langle\varphi_{k}\right|\theta\left|\varphi_{l}\right\rangle\neq 0\}$ .

Lemma 22.

Consider the density matrix $\varphi\equiv\sum_{k,l}\left|\varphi_{k}\right\rangle\left\langle\varphi_{l}\right|$ . For any $\epsilon>0$ , there exist subset-matrices, $\varphi_{\text{block}},\varphi_{\text{far}}$ , of $\varphi$ , such that

$\|\varphi-(\varphi_{\text{block}}+\varphi_{\text{far}})\|_{1}\leq 2\epsilon$ ** 2. 2.

$T_{\varphi_{\text{far}}}\subseteq\{(l,k):|k-l|>B\}$ , where $B\equiv 30+2\left\lceil\frac{\log(1/\epsilon)}{N}\right\rceil$ . 3. 3.

The bipartite shared state $\varphi_{\text{block}}$ can be prepared starting from EPR pairs with $O(N/\epsilon+\log(1/\epsilon)/\epsilon)$ bits of communication.

The proof of Lemma 22 is included in Section F of the Appendix.

We can now bound the difference between the protocol $\mathcal{C}$ acting on $\varphi$ versus $\mathcal{C}$ acting on $\varphi_{\text{block}}$ , following equation 18 as follows:

[TABLE]

Setting $N=2Q$ and recalling from the Theorem statement that $Q\geq 15$ by assumption, it follows by Lemma 23, stated below, that:

[TABLE]

This completes the proof of the Theorem as we now describe.

We know from Lemma 22 that there is a quantum communication protocol, call it $\mathcal{K}$ , which prepares the shared state $\varphi_{\text{block}}$ starting from just a maximally entangled state using at most $O(N/\epsilon+\log(1/\epsilon)/\epsilon)$ bits of communication. Now define the protocol $\mathcal{R^{\prime}}\equiv\mathcal{C}\circ\mathcal{K}$ . Since $\mathcal{C}$ uses at most $Q+4N+8$ qubits of communication, and since we have chosen to set $N=2Q$ (in the line above Equation 19), it follows that $\mathcal{R^{\prime}}$ uses at most $O(N/\epsilon+\log(1/\epsilon)/\epsilon)=O(Q/\epsilon+\log(1/\epsilon)/\epsilon)$ qubits of communication. Furthermore, the success probability of $\mathcal{R^{\prime}}$ with only the maximally entangled state as an entangled resource is the same, by construction, as the success probability of $\mathcal{C}$ with $\varphi_{\text{block}}$ as an entangled resource, which, by Equation 19 above and the original definition $\mathcal{C}\equiv\mathcal{R}\circ\mathcal{M}$ , is within $3\epsilon$ of the success probability of the original protocol $\mathcal{R}$ from the theorem statement when using the original shared state $\left|\psi\right\rangle$ as an entangled resource. This is the desired result. ∎

Lemma 23.

For $\varphi_{\text{block}}$ as constructed in Lemma 22, and for $N,Q$ as defined in the proof of Theorem 1 we have, $\left|\mbox{\rm Tr}(\mathcal{P}(\varphi-\varphi_{\text{block}}))\right|\leq 3\epsilon$ whenever $N\geq 2Q\geq 30$ .

Proof.

Following Lemma 22, we define $B\equiv 30+2\left\lceil\frac{\log(1/\epsilon)}{N}\right\rceil$ . Now, letting $\varphi_{\text{block}}$ and $\varphi_{\text{far}}$ be as in Lemma 22, and recalling that $\|\varphi-(\varphi_{\text{block}}+\varphi_{\text{far}})\|_{1}\leq 2\epsilon$ , we have:

[TABLE]

where the final inequality follows because $T_{\varphi_{\text{far}}}\subseteq\{(l,k):|k-l|>B\}$ by Lemma 22. Recalling that the unitary $\mathcal{P}$ can be implemented using 2Q+8N+16 qubits of communication, and applying equation 17 then gives that:

[TABLE]

So, recalling from the Lemma statement that $N\geq 2Q\geq 30$ by assumption:

[TABLE]

So,

[TABLE]

∎

Note, in the pre-processing step in the proof of Theorem 1, and again at a point within the proof of Lemma 22 we use our Theorem 3 in a setting where either the starting or ending state is very close to a maximally entangled state. It is helpful to observe, to avoid confusion, that in such cases Theorem 3 is not strictly necessary and could be replaced with previously known results from, for example, [7, 8]. In this manuscript we will use Theorem 3 in these cases in order to remain self-contained, and for the convenience of the reader, but we emphasize that the lines of the proof of Theorem 1 in which we use Theorem 3 could be replaced with known results.

Appendix F Proof of Lemma 22

Lemma (Restatement of Lemma 22).

Consider the density matrix $\varphi\equiv\sum_{k,l}\left|\varphi_{k}\right\rangle\left\langle\varphi_{l}\right|$ . For any $\epsilon>0$ , there exist subset-matrices, $\varphi_{\text{block}},\varphi_{\text{far}}$ , of $\varphi$ , such that

$\|\varphi-(\varphi_{\text{block}}+\varphi_{\text{far}})\|_{1}\leq 2\epsilon$ ** 2. 2.

$T_{\varphi_{\text{far}}}\subseteq\{(l,k):|k-l|>B\}$ , where $B\equiv 30+2\left\lceil\frac{\log(1/\epsilon)}{N}\right\rceil$ . 3. 3.

The bipartite shared state $\varphi_{\text{block}}$ can be prepared starting from EPR pairs with $O(N/\epsilon+\log(1/\epsilon)/\epsilon)$ bits of communication.

Proof.

Note: The terminology used in this proof is defined in the proof of Theorem 1 preceding the use of Lemma 22 there (Appendix E).

Fixing an $\epsilon>0$ we will now show how to “cut” $\varphi\equiv\sum_{k,l}\left|\varphi_{k}\right\rangle\left\langle\varphi_{l}\right|$ down into a mixture of states of small spread such that the cut only removes subset-matrices of the operator which are either far from the diagonal or small in the trace norm (less than $2\epsilon$ ).

Define a sequence of mutually orthogonal projectors $\{P_{i}\}$ , where each $P_{i}$ is the projection onto the span of $\{\left|\varphi_{l}\right\rangle\}_{2(i-1)B<l\leq 2i\cdot B}$ . Let

[TABLE]

Now, for $k\in[1,....,\lceil 1/\epsilon\rceil]$ define

[TABLE]

The $S_{k}$ are block-diagonal subset-matrices of $\varphi$ , which are disjoint in the sense that $T_{S_{k}}\cap T_{S_{k^{\prime}}}=\emptyset$ when $k\neq k^{\prime}$ . Additionally, $\sum_{k=1}^{\lceil 1/\epsilon\rceil}S_{k}=\sum_{i}M_{i}$ is a subset-matrix of $\varphi$ which contains the entire diagonal of $\varphi$ . Indeed $\sum_{k=1}^{\lceil 1/\epsilon\rceil}S_{k}$ can be obtained from $\varphi$ via the “pinching” TPCP which has Kraus operators given by the $\{P_{2i-1}+P_{2i}\}$ . Thus

[TABLE]

Choose $k^{\prime}$ such that $\operatorname{tr}[S_{k^{\prime}}]\leq 1/\lceil 1/\epsilon\rceil\leq\epsilon$ . Since the $S_{k}$ are all PSD we also have $\|S_{k^{\prime}}\|_{1}\leq\epsilon$ .

Our strategy now is to use something like $\varphi-S_{k^{\prime}}$ as a candidate for $\varphi_{\text{block}}+\varphi_{\text{far}}$ in the Lemma statement. However, subtracting all of $S_{k^{\prime}}$ removes some terms close to the diagonal, which, even though it is not a large fraction of all entries in $\varphi$ , would make the proof and statement of Lemma 22 somewhat awkward. So, in order to make the Lemma statement as clean as possible we will only subtract the “anti-diagonal” parts of $S_{k^{\prime}}$ , and leave the “diagonal” parts of $S_{k^{\prime}}$ in a manner made precise below.

Define the block matrices

[TABLE]

$D_{i}$ and $A_{i}$ are, respectively, the diagonal and off-diagonal blocks of $M_{i}$ .

Further define $K_{k^{\prime}}\equiv\sum_{i=0}^{\infty}A_{i\cdot\lceil 1/\epsilon\rceil+k^{\prime}}$ . We have that $K_{k^{\prime}}=S_{k^{\prime}}-\sum_{i=0}^{\infty}D_{i\cdot\lceil 1/\epsilon\rceil+k^{\prime}}$ , and that $\|\sum_{i=0}^{\infty}D_{i\cdot\lceil 1/\epsilon\rceil+k^{\prime}}\|_{1}=\|S_{k^{\prime}}\|_{1}$ since $\sum_{i=0}^{\infty}D_{i\cdot\lceil 1/\epsilon\rceil+k^{\prime}}$ is a block-diagonal subset-matrix of $S_{k^{\prime}}$ containing the entire diagonal of $S_{k^{\prime}}$ . Thus,

[TABLE]

We now define a “cut down” version of $\varphi$ by $\tilde{\varphi}\equiv\varphi-K_{k^{\prime}}$ . From this definition we have:

[TABLE]

Further, we define the projectors

[TABLE]

and define the block diagonal matrix $\varphi_{\text{block}}$ as:

[TABLE]

where the last equality follows because $\sum_{j}Q_{j}K_{k^{\prime}}Q_{j}=0$ because $K_{k^{\prime}}$ consists only of the “anti-diagonal” components $A_{i\cdot\lceil 1/\epsilon\rceil+k^{\prime}}$ which lie outside of the $Q_{j}$ . Note that $\varphi_{\text{block}}$ is a subset-matrix of $\tilde{\varphi}$ according to Definition 20. Now define $\varphi_{\text{far}}$ by:

[TABLE]

Therefore, $\varphi_{\text{far}}$ is also a subset-matrix of $\tilde{\varphi}$ according to Definition 20. Furthermore, it follows immediately using Equation 21 that:

[TABLE]

Second Claim: To establish the second claim in Lemma 22 we now show that $T_{\varphi_{\text{far}}}\subseteq\{(l,k):|k-l|>B\}$ (recall that $B\equiv 30+2\left\lceil\frac{\log(1/\epsilon)}{N}\right\rceil$ ). To see this, we consider the case that $|k-l|\leq B$ and show that in this case $(l,k)\notin T_{\varphi_{\text{far}}}$ . Assume WLOG that $k\geq l$ . When $|k-l|\leq B$ we know that either $\exists j$ such that:

[TABLE]

or $\exists j$ such that:

[TABLE]

In the first case, denoted by Equation 26, we have that the coordinates $(l,k)$ lie within the subset-matrix $\varphi_{\text{block}}$ of $\varphi$ , and thus that either $(l,k)\in T_{\varphi_{\text{block}}}$ or $(l,k)\notin T_{\varphi}$ by definition. In particular, either $(l,k)\in T_{Q_{j}\varphi Q_{j}}\subseteq T_{\varphi_{\text{block}}}$ as follows by Equation 23 and the definition of $Q_{j}$ in Equation 22, or $(l,k)\notin T_{\varphi}$ . If $(l,k)\in T_{\varphi_{\text{block}}}$ then we note that $T_{\varphi_{\text{block}}}\cap T_{\varphi_{\text{far}}}=\emptyset$ by definition (Equation 24), and this implies that $(l,k)\notin T_{\varphi_{\text{far}}}$ . If $(l,k)\notin T_{\varphi}$ , then $(l,k)\notin T_{\varphi_{\text{far}}}$ because $T_{\varphi_{\text{far}}}\subseteq T_{\varphi}$ .

On the other hand, in the case denoted by Equation 27, we have the coordinates $(l,k)$ lie within the subset-matrix $K_{k^{\prime}}$ of $\varphi$ , and thus that either $(l,k)\in T_{K_{k^{\prime}}}$ , or $(l,k)\notin T_{\varphi}$ . The reason for this is that we know that, in this case, the coordinates $(l,k)$ are within the subset-matrix $M_{j\lceil 1/\epsilon\rceil+k^{\prime}}$ of $\varphi$ . Furthermore, since we have already ruled out the case of Equation 26, we know that $(l,k)$ is not in $D_{j\lceil 1/\epsilon\rceil+k^{\prime}}$ , the block diagonal portion of $M_{j\lceil 1/\epsilon\rceil+k^{\prime}}$ . Therefore, the coordinates $(l,k)$ must lie in the block-anti-diagonal portion $A_{j\lceil 1/\epsilon\rceil+k^{\prime}}=M_{j\lceil 1/\epsilon\rceil+k^{\prime}}-D_{j\lceil 1/\epsilon\rceil+k^{\prime}}$ (this can also be determined directly from Equation 27 itself, and the definition of $A_{j\lceil 1/\epsilon\rceil+k^{\prime}}$ ). Since $K_{k^{\prime}}\equiv\sum_{i=0}^{\infty}A_{i\lceil 1/\epsilon\rceil+k^{\prime}}$ we know that the coordinates $(l,k)$ lie within the $K_{k^{\prime}}$ , or more precisely, either $(l,k)\in T_{K_{k^{\prime}}}$ , or $(l,k)\notin T_{\varphi}$ . Just as before, if $(l,k)\notin T_{\varphi}$ , then $(l,k)\notin T_{\varphi_{\text{far}}}\subseteq T_{\varphi}$ . On the other hand, in the case that $(l,k)\in T_{K_{k^{\prime}}}$ we know that $T_{K_{k^{\prime}}}\cap T_{\varphi_{\text{far}}}=\emptyset$ because $T_{\varphi_{\text{far}}}\subseteq T_{\tilde{\varphi}}$ by Equation 24, and $T_{\tilde{\varphi}}\cap T_{K_{k^{\prime}}}=\emptyset$ as follows from the definition $\tilde{\varphi}\equiv\varphi-K_{k^{\prime}}$ .

This establishes that $T_{\varphi_{\text{far}}}\subseteq\{(l,k):|k-l|>B\}$ .

Third Claim: To establish the third claim in Lemma 22, and complete the proof, we will show that $\varphi_{\text{block}}$ is a mixture of states of spread at most $O(N/\epsilon+\log(1/\epsilon)/\epsilon)$ , which means that $\varphi_{\text{block}}$ can be produced from a shared maximally entangled state with at most $O(N/\epsilon+\log(1/\epsilon)/\epsilon)$ bits of communication.

Recalling the definition of $\varphi_{\text{block}}$ in Equation 23, let us define $\rho^{\prime}_{j}\equiv Q_{j}\varphi Q_{j}$ , so that it is clear that $\varphi_{\text{block}}=\sum_{j}\rho^{\prime}_{j}$ . It is also clear that $\rho^{\prime}_{j}$ is not only PSD, but also an un-normalized pure state, because

[TABLE]

From the definition of $Q_{j}$ in Equation 22 we have that:

[TABLE]

Where the index limits are

[TABLE]

We know from the definition in Equation 16 that the $\left|\varphi_{l}\right\rangle$ are orthogonal to each other, and that each $\left|\varphi_{l}\right\rangle$ has Schmidt coefficients bounded by $2^{-lN+1}\geq\lambda_{i}>2^{-lN-1}$ . Thus, it is immediate that $\rho_{j}^{\prime}$ has spread at most $(B_{b}-B_{s})N+4=2\lceil 1/\epsilon\rceil BN+4=O(N/\epsilon+\log(1/\epsilon)/\epsilon)$ , where the last equality follows because $B=30+2\left\lceil\frac{\log(1/\epsilon)}{N}\right\rceil$ . Therefore $\varphi_{\text{block}}$ is a normalized mixture of states with spread at most $O(N/\epsilon+\log(1/\epsilon)/\epsilon)$ .

Consider the normalized version of $\rho_{j}^{\prime}$ , which is still a pure state of spread at most $O(N/\epsilon+\log(1/\epsilon)/\epsilon)$ it is clear that this state has Earthmover distance at most $O(N/\epsilon+\log(1/\epsilon)/\epsilon)$ from the nearest maximally entangled state (simply move all of the weight onto Schmidt coefficients of the size of the smallest Schmidt coefficient, which can be done by moving all the weight a distance less than or equal to the spread). It follows, by using Theorem 3 that there is a protocol which prepares the normalized version of $\rho_{i}^{\prime}$ from EPR pairs, with only $O(N/\epsilon+\log(1/\epsilon)/\epsilon)$ bits of communication (we note that this line of the proof could also have been established using result from [7, 8], for example). Now the state $\varphi_{\text{block}}\equiv\sum_{i}\rho_{i}^{\prime}$ can be prepared by applying this same protocol in superposition over $i$ (with the probability $\operatorname{tr}(\rho_{i}^{\prime})$ assigned to each $i$ ), and then tracing out over the $i$ register. Thus $\varphi_{\text{block}}$ can be prepared starting from EPR pairs with $O(N/\epsilon+\log(1/\epsilon)/\epsilon)$ bits of communication.

∎

Appendix G Proof of Lemma 15

Proof.

Given two states $\left|\chi\right\rangle=\sum_{i\in X}\sqrt{\chi_{i}}\left|i\right\rangle\otimes\left|i\right\rangle$ and $\left|\upsilon\right\rangle=\sum_{j\in Y}\sqrt{\upsilon_{j}}\left|j\right\rangle\otimes\left|j\right\rangle$ , and an arbitrary $\epsilon>0$ , let $\omega(i,j):X\times Y\to\mathbb{R}_{\geq 0}$ be the joint distribution on $X\times Y$ which satisfies the $\ell_{\infty}$ Earth Mover conditions for $\left|\chi\right\rangle$ and $\left|\upsilon\right\rangle$ , and acheives the optimal earth mover bound $d_{\infty}(\left|\chi\right\rangle,\left|\upsilon\right\rangle)$ . That is, for all $i\in X$ , $\sum_{j\in Y}\omega(i,j)=\chi_{i}$ , for all $j\in Y$ , $\sum_{i\in X}\omega(i,j)=\upsilon_{j}$ , and $\omega(i,j)=0$ whenever $|\log(\chi_{i})-\log(\upsilon_{j})|>d_{\infty}(\left|\chi\right\rangle,\left|\upsilon\right\rangle)$ .

Define $\left|\rho\right\rangle\equiv\sum_{j\in Y}\sum_{k\in[2^{\lceil d_{\infty}(\left|\chi\right\rangle,\left|\upsilon\right\rangle)\rceil}+2]}\sqrt{\rho_{j,k}}\left|j\right\rangle\otimes\left|k\right\rangle\otimes\left|j\right\rangle\otimes\left|k\right\rangle$ , where

[TABLE]

We now define the intermediate state

[TABLE]

where the Schmidt coefficients $\gamma_{j,k,r}$ are left unspecified for now.

In order to specify the Schmidt coefficients of the intermediate state $\left|\gamma\right\rangle$ as well as the Right Index-1 Flow from $\left|\chi\right\rangle$ to $\left|\gamma\right\rangle$ , and the Left Index-1 Flow from $\left|\gamma\right\rangle$ to $\left|\rho\right\rangle$ we will first define “bins” for the Schmidt coefficients of $\left|\upsilon\right\rangle$ as follows:

For $l\in\mathbb{N}\cup\{0\}$ let $\Upsilon_{l}\equiv\{j\in Y:2^{-l}\geq\upsilon_{j}\geq 2^{-(l+1)}\}$ , and $X_{l}\equiv\{i\in X:2^{-l}\geq\chi_{i}\geq 2^{-(l+1)}\}$ . Define $\omega(X_{m},\Upsilon_{l})\equiv\sum_{(i,j)\in X_{m}\times\Upsilon_{l}}\omega(i,j)$ .

Fact 24.

If $|m-l|>d_{\infty}(\left|\chi\right\rangle,\left|\upsilon\right\rangle)+1$ , then $\omega(X_{m},\Upsilon_{l})=0$

Proof.

Given $i\in X_{m}$ , and $j\in\Upsilon_{l}$ we have by definition that $2^{-l}\geq\upsilon_{j}\geq 2^{-(l+1)}$ , and $2^{-m}\geq\chi_{i}\geq 2^{-(m+1)}$ , and therefore that $|\log(\chi_{i})-\log(\upsilon_{j})|\geq|m-l|-1>d_{\infty}(\left|\chi\right\rangle,\left|\upsilon\right\rangle)$ , where the last equality follows by assumption. It follows by definition of $d_{\infty}(\left|\chi\right\rangle,\left|\upsilon\right\rangle)$ and of $\omega$ , that $\omega(i,j)=0$ . Since this is true for all $(i,j)\in X_{m}\times\Upsilon_{l}$ , the claim follows. ∎

We will now specify an iterative, “greedy” procedure to define the Schmidt coefficients $\gamma_{j,k,c}$ as a function of the $\left|\chi\right\rangle$ and $\left|\rho\right\rangle$ .

For each $(m,l)\in\mathbb{N}\cup\{0\}\times\mathbb{N}\cup\{0\}$ such that $\omega(X_{m},\Upsilon_{l})>0$ we first note that by Fact 24 that $|m-l|<d_{\infty}(\left|\chi\right\rangle,\left|\upsilon\right\rangle)+1$ . Thus, for each $(i,j)\in X_{m}\times\Upsilon_{l}$ ,

[TABLE]

for all $k\in[2^{\lceil d_{\infty}(\left|\chi\right\rangle,\left|\upsilon\right\rangle)\rceil+2}]$ .

One may check that Algorithm 1 defines Schmidt coefficients $\gamma_{j,k,r}$ , satisfying

[TABLE]

as well as a Right Index-1 Flow from $\left|\chi\right\rangle$ to $\left|\gamma\right\rangle$ , with degree at most $2^{\lceil d_{\infty}(\left|\chi\right\rangle,\left|\upsilon\right\rangle)\rceil+2}\cdot 2^{\lceil d_{\infty}(\left|\chi\right\rangle,\left|\upsilon\right\rangle)\rceil+2}=2^{2\lceil d_{\infty}(\left|\chi\right\rangle,\left|\upsilon\right\rangle)\rceil+4}$ . In particular the Right Index-1 Flow from $\left|\chi\right\rangle$ to $\left|\gamma\right\rangle$ is constructed in Algorithm 1 by iteratively adding edges to form the bipartite flow-graph $G_{X,Z}$ where $Z\equiv(Y,[2^{\lceil d_{\infty}(\left|\chi\right\rangle,\left|\upsilon\right\rangle)\rceil+2}],[2^{\lceil d_{\infty}(\left|\chi\right\rangle,\left|\upsilon\right\rangle)\rceil+2}])$ . Each line in the pseudocode which reads “Add an edge in the flow graph from $i_{m}$ to $(j,k,\text{overflow}+1)$ ”, or similar, adds a single edge to the graph $G_{X,Z}$ and the union of all these edges forms the bipartite flow $G_{X,Z}$ between $X$ and $Z$ . Furthermore, for the $\gamma_{j,k,r}$ defined by Algorithm 1,

[TABLE]

so that there is a Left Index-1 flow from $\left|\gamma\right\rangle$ to $\left|\rho\right\rangle$ defined by a bipartite graph between the Schmidt coefficients of $\left|\gamma\right\rangle$ and $\left|\rho\right\rangle$ respectively, in which, for every $(j,k,r)\in Y\times[2^{\lceil d_{\infty}(\left|\chi\right\rangle,\left|\upsilon\right\rangle)\rceil+2}]\times[2^{\lceil d_{\infty}(\left|\chi\right\rangle,\left|\upsilon\right\rangle)\rceil+2}]$ , there is an edge from $\gamma_{j,k,r}$ to $\rho_{j,k}$ of weight $\gamma_{j,k,r}$ . This Left Index-1 flow then clearly has degree $2^{\lceil d_{\infty}(\left|\chi\right\rangle,\left|\upsilon\right\rangle)\rceil+2}$ .

Finally, recall that,

[TABLE]

So, by very similar reasoning, there is a Left Index-1 flow from $\left|\rho\right\rangle$ to $\left|\nu\right\rangle$ with degree exactly $2^{\lceil d_{\infty}(\left|\chi\right\rangle,\left|\upsilon\right\rangle)\rceil+2}$ .

∎

Acknowledgments

AWH was funded by NSF grants CCF-1452616, CCF-1729369, PHY-1818914 and ARO contract W911NF-17-1-0433. MC was supported at MIT by an Akamai Fellowship, and at the IQC by Canada’s NSERC and the Canadian Institute for Advanced Research (CIFAR), and through funding provided to IQC by the Government of Canada and the Province of Ontario.

Bibliography15

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] D. Aharonov, A. W. Harrow, Z. Landau, D. Nagaj, M. Szegedy, and U. Vazirani. Local tests of global entanglement and a counterexample to the generalized area law. In Foundations of Computer Science (FOCS), 2014 IEEE 55th Annual Symposium on , pages 246–255, Oct 2014, ar Xiv:1410.0951 .
2[2] C. H. Bennett, H. J. Bernstein, S. Popescu, and B. Schumacher. Concentrating partial entanglement by local operations. Phys. Rev. A , 53:2046–2052, 1996, ar Xiv:quant-ph/9511030 .
3[3] C. H. Bennett, I. Devetak, A. W. Harrow, P. W. Shor, and A. Winter. The quantum reverse Shannon theorem and resource tradeoffs for simulating quantum channels. IEEE Trans. Inf. Theory , 60(5):2926–2959, May 2014, ar Xiv:0912.5537 .
4[4] S. Daftuar and P. Hayden. Quantum state transformations and the schubert calculus. Annals of Physics , 315:80–122, 2005, ar Xiv:quant-ph/0410052 .
5[5] A. W. Harrow. Entanglement spread and clean resource inequalities. In P. Exner, editor, XV Ith Int. Cong. on Math. Phys. , pages 536–540. World Scientific, 2009, ar Xiv:0909.1557 .
6[6] A. W. Harrow and D. W. Leung. A communication-efficient nonlocal measurement with application to communication complexity and bipartite gate capacities. IEEE Trans. Inf. Theory , 57(8):5504–5508, 2011, ar Xiv:0803.3066 .
7[7] A. W. Harrow and H.-K. Lo. A tight lower bound on the classical communication cost of entanglement dilution. IEEE Trans. Inf. Theory , 50(2):319–327, 2004, ar Xiv:quant-ph/0204096 .
8[8] P. Hayden and A. Winter. On the communication cost of entanglement transformations. Phys. Rev. A , 67:012306, 2003, ar Xiv:quant-ph/0204092 .

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Universality of EPR pairs in Entanglement-Assisted

Abstract

1 Introduction

1.1 Entanglement-assisted communication complexity

Theorem 1**.**

1.2 Communication cost of state transformations

Definition 2** (ℓ∞\ell_{\infty}ℓ∞​ Earth Mover’s Distance ).**

Theorem 3**.**

Theorem 4**.**

Remark 1**.**

Remark 2**.**

2 Entanglement-Assisted Communication Complexity

Lemma 5**.**

Proof.

Lemma 6**.**

Proof.

Theorem** (Restatement of Theorem 1).**

Outline of the Proof of Theorem 1:

3 The Cost of State Transformation: A Lower Bound

Definition 7**.**

Definition 8**.**

Fact**.**

Definition 9**.**

Definition 10**.**

Theorem** (Restatement of Theorem 4).**

Proof.

Definition 11**.**

Lemma 12**.**

4 The Cost of State Transformation: An Upper Bound

Definition 13** (Right (Left) Index-1 Flow ).**

Definition 14** (Degree of a Right (Left) Index-1 Flow ).**

Lemma 15**.**

Lemma 16**.**

Corollary 17**.**

Theorem** (Restatement of Theorem 3).**

Proof.

Appendix A Proof of Lemma 12

Proof.

Appendix B Fact 18

Fact 18**.**

Proof.

Appendix C Proof of Lemma 16

Proof.

Appendix D Proof of Corollary 17

Proof.

Appendix E Proof of Theorem 1

Definition 19** (Spread).**

Theorem** (Restatement of Theorem 1).**

Proof.

Definition 20** (subset-matrix).**

Definition 21** (Non-Zero Set).**

Lemma 22**.**

Lemma 23**.**

Proof.

Appendix F Proof of Lemma 22

Lemma** (Restatement of Lemma 22).**

Proof.

Appendix G Proof of Lemma 15

Proof.

Fact 24**.**

Proof.

Acknowledgments

Theorem 1.

Definition 2 ( $\ell_{\infty}$ Earth Mover’s Distance ).

Theorem 3.

Theorem 4.

Remark 1.

Remark 2.

Lemma 5.

Lemma 6.

Theorem (Restatement of Theorem 1).

Definition 7.

Definition 8.

Fact.

Definition 9.

Definition 10.

Theorem (Restatement of Theorem 4).

Definition 11.

Lemma 12.

Definition 13 (Right (Left) Index-1 Flow ).

Definition 14 (Degree of a Right (Left) Index-1 Flow ).

Lemma 15.

Lemma 16.

Corollary 17.

Theorem (Restatement of Theorem 3).

Fact 18.

Definition 19 (Spread).

Theorem (Restatement of Theorem 1).

Definition 20 (subset-matrix).

Definition 21 (Non-Zero Set).

Lemma 22.

Lemma 23.

Lemma (Restatement of Lemma 22).

Fact 24.