Universality of EPR pairs in Entanglement-Assisted Communication Complexity, and the Communication Cost of State Conversion
Matthew Coudron, Aram W. Harrow

TL;DR
This paper demonstrates that EPR pairs are universally sufficient for entanglement-assisted quantum communication, and provides bounds on the cost of converting one bipartite state into another using a new metric based on optimal transport.
Contribution
It proves the universality of EPR pairs in entanglement-assisted communication complexity and introduces the $ ext{EMD}$ metric to bound state conversion costs.
Findings
Any entanglement-assisted protocol can be approximated using only EPR pairs.
The quantum communication cost for state conversion is bounded by the $ ext{EMD}$ metric.
Lower bounds on state conversion costs are established via smoothed $ ext{EMD}$.
Abstract
Entanglement assistance is known to reduce the quantum communication complexity of evaluating functions with distributed inputs. But does the type of entanglement matter, or are EPR pairs always sufficient? This is a natural question because in several other settings maximally entangled states are known to be less useful as a resource than some partially entangled state. These include non-local games, tasks with quantum communication between players and referee, and simulating bipartite unitaries or communication channels. By contrast, we prove that the bounded-error entanglement-assisted quantum communication complexity of a function cannot be improved by more than a constant factor by replacing maximally entangled states with arbitrary entangled states. In particular, we show that every quantum communication protocol using qubits of communication and arbitrary shared entanglement…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Universality of EPR pairs in Entanglement-Assisted
Communication Complexity, and the Communication Cost of State Conversion
Matthew Coudron Aram W. Harrow Institute for Quantum Computing, University of Waterloo [email protected]. Center for Theoretical Physics, MIT. [email protected]
Abstract
In this work we consider the role of entanglement assistance in quantum communication protocols, focusing, in particular, on whether the type of shared entangled state can affect the quantum communication complexity of a function. This question is interesting because in some other settings in quantum information, such as non-local games, or tasks that involve quantum communication between players and referee, or simulating bipartite unitaries or communication channels, maximally entangled states are known to be less useful as a resource than some partially entangled states. By contrast, we prove that the bounded-error entanglement-assisted quantum communication complexity of a partial or total function cannot be improved by more than a constant factor by replacing maximally entangled states with arbitrary entangled states. In particular, we show that every quantum communication protocol using qubits of communication and arbitrary shared entanglement can be -approximated by a protocol using qubits of communication and only EPR pairs as shared entanglement. This conclusion is opposite of the common wisdom in the study of non-local games, where it has been shown, for example, that the I3322 inequality has a non-local strategy using a non-maximally entangled state, which surpasses the winning probability achievable by any strategy using a maximally entangled state of any dimension [15]. We leave open the question of how much the use of a shared maximally entangled state can reduce the quantum communication complexity of a function.
Our second result concerns an old question in quantum information theory: How much quantum communication is required to approximately convert one pure bipartite entangled state into another? We give simple and efficiently computable upper and lower bounds. Given two bipartite states and , we define a natural quantity, , which we call the Earth Mover’s distance, and we show that the communication cost of converting between and is upper bounded, up to a constant multiplicative factor, by . Here may be informally described as the minimum over all transports between the log of the Schmidt coefficients of and those of , of the maximum distance that any amount of mass must be moved in that transport. A precise definition is given in the introduction. Furthermore, we prove a complementary lower bound on the cost of state conversion by the -Smoothed -Earth Mover’s Distance, which is a natural smoothing of the -Earth Mover’s Distance that we will define via a connection with optimal transport theory.
1 Introduction
1.1 Entanglement-assisted communication complexity
Imagine that two cooperating players, Alice and Bob, are given the task of evaluating a function (), where is known only to Alice and is known only to Bob. The communication complexity of is the number of bits that Alice and Bob need to exchange in order to compute . Popular variations of this framework include allowing a small probability of error, allowing qubits to be communicated instead of classical bits, and allowing extra resources such as shared randomness or entanglement.
In classical communication complexity, Newman’s theorem states that arbitrarily large amounts of shared randomness in a protocol can be replaced by a distribution with bits of entropy while only reducing the success probability of that protocol by . (Here is the input size of each party.) Is there a quantum analogue to this result?
In one sense the answer is “no”. Given a two-party entanglement-assisted protocol for, say, computing the value of some function, we cannot replace the shared entanglement with some different, less entangled, state, without causing large errors [9, 1]. It is an open question whether it is possible to replace a large entangled state with a less entangled one while also changing the communication protocol.
However, while it remains a challenge to characterize the dimension of shared entanglement required for optimal entanglement-assisted quantum communication protocols, in this work we show that the type of shared entanglement required by such protocols can be neatly characterized. In Theorem 1 below, we establish that the bounded-error entanglement-assisted quantum communication complexity of a partial or total function cannot be improved by more than a constant factor by replacing maximally entangled states with arbitrary entangled states. This is accomplished by constructing an explicit protocol which allows two parties, who only share maximally entangled states, to simulate any entanglement-assisted quantum communication task regardless of the shared state that that task originally required.
Theorem 1**.**
Consider a quantum communication protocol whose goal is to compute a joint function . Suppose that uses an arbitrary bipartite entangled state (of unbounded dimension), as well as qubits of communication total, in either direction (for sufficiently large ). Then, for every , there exists a quantum communication protocol which simulates with error , while using only a maximally entangled state as an entangled resource (rather than or any other state), and using qubits of communication. Thus, if computes with error it follows that computes with error .
Theorem 1 shows that, although the role of shared entanglement in quantum communication complexity is still not well understood, the type of shared entanglement does not drastically change communication complexity. This is true regardless of input size or promise, as long as we are in the constant-error regime and some communication is allowed between players (unlike, say, the simultaneous-message-passing model). This result sets quantum communication complexity apart from settings such as channel simulation [3], nonlocal games [10, 14], unitary gate simulation [6], and communication tasks involving quantum communication between referees and players [11]. In each of those cases the ratio between the EPR-assisted costs and the (unrestricted) entanglement-assisted costs can be made arbitrarily large. This suggests that the role of shared entanglement in quantum communication complexity may be fundamentally different than in these other settings. Furthermore, the result achieved in Theorem 1 may be useful in future work attempting to further bound the role of entanglement in quantum communication complexity, as it restricts the problem to the case of shared EPR pairs, without loss of generality.
It may be worth noting that the proof of Theorem 1 is nearly oblivious to the entanglement-assisted protocol being considered in the following sense: Given a protocol using qubits of communication and a shared entangled state , we can replace with a “consolidated” state at the cost of error . Moreover, can be prepared from a maximally entangled state using communication. Taking constant implies that the EPR-assisted communication complexity of a function is at most times the (unrestricted) entanglement-assisted communication complexity of that function. It was not necessary to modify the protocol to achieve this result, except to pre-compose it with a pre-processing protocol which starts with only EPR pairs, and prepares the state using only communication. can then be run on directly. Such a protocol-agnostic preprocessing should not be taken for granted, since it is known that reducing the number of EPR pairs may in some cases require more than just pre-processing [9, 1].
1.2 Communication cost of state transformations
Our second contribution, which is related at the level of techniques to Theorem 1, is to provide upper and lower bounds for an old quantity studied in quantum information theory, the communication cost of state transformation.
Suppose that and are bipartite pure quantum states, with vectors of Schmidt coefficients denoted respectively by and . In this setting it is known that can be exactly converted into using LOCC if and only if is majorized by [13]. But the communication cost of this transformation is known only in a few special cases. If and for some states , then this cost is or less in some special cases (e.g. is maximally entangled). More generally there is, in principle, an exact characterization of the communication cost (either LOCC, or quantum communication) of state transformation using the Schubert calculus due to Daftuar and Hayden [4], but in practice it is difficult to extract concrete bounds from their main theorem.
In this work we identify a simple and efficiently computable quantity, which we call the Earth Mover’s (or Wasserstein) Distance, which tells us approximately how much quantum communication is required to transform to . Given its simple form, we believe that this quantity may be a useful tool in quantum information theory.
Definition 2** ( Earth Mover’s Distance ).**
Let and be two states. We define to be the Earth Mover’s distance between and , which is equal to the minimum for which there exists a joint distribution such that:
- •
* *
- •
* *
- •
* ** whenever ** *
We can think of as corresponding to placing mass at position for each , and similarly for . Then is the EMD (Earth Mover’s distance) between these distributions.
In Section 4 we will show that this quantity gives an intuitive upper bound on the amount of quantum communication required to transform one bipartite shared state into another. In particular we prove the following theorem.
Theorem 3**.**
Let and be two bipartite shared states. There is a protocol which can prepare from , using only qubits of communication.
In Section 3 we establish a complementary lower bound, showing that a “-smoothed” version of the Earth Mover’s Distance, denoted by , gives a lower bound on the cost of state transformation. That is:
Theorem 4**.**
Given any two bipartite shared states and , shared between two parties and , together with a unitary which can be performed on the state via a quantum communication protocol , that uses qubits of communication between and , we have that, for every :
[TABLE]
In words: If two shared states cannot be brought within small Earth Mover’s Distance of each other by moving an quantity of mass of their Schmidt coefficients, then they also cannot be brought closer than fidelity with each other without using qubits of communication (for sufficiently large values of ). Thus, the -smoothed Earth Mover’s Distance provides a lower bound on the communication cost of state conversion. On the other hand, from the definition of , stated in Definition 10, we note here that one can use Theorem 3 to move to within fidelity of using only qubits of communication. To do this, omit the mass of Schmidt coefficients on which the two states have large -smoothed distance, and apply Theorem 3 as one would do with the regular Earth Mover’s Distance. In this sense gives both an upper and lower bound on the communication cost of state conversion.
To put these bounds in context: One could consider entanglement concentration and dilution to be the starting point for the study of state conversion. The original paper on entanglement concentration and dilution [2] concerned the many-copy limit and did not attempt to bound the amount of classical communication used. The first time the classical communication cost of state conversion was considered explicitly seems to have been in [12], which could be said to establish a version of our upper bound in the case where the starting state is maximally entangled. (Their result is not quite that general but contains many of the key ideas.) A version of our lower bound was established, again for the case of starting with maximally entangled states, in [7, 8]. These lower bounds could be applied to general state conversion but relied on Rènyi entropy inequalities that are clearly not tight in many cases. Finally, as noted earlier, a full characterization of the communication cost of general state conversion was given in [4] but the resulting formula is complicated and there is not an efficient algorithm known to evaluate it.
We conclude the section with two remarks about notation.
Remark 1**.**
In theorem statements above, and where appropriate, we have made use of superscripts and , as in to explicitly denote the two halves of the bipartite division of a state. However, since all of the shared entangled states considered in this paper are bipartite, and since the two components of the bipartite division are generally clear from context, we will usually omit this notation.
Remark 2**.**
When considering a bipartite state , we will assume that the state has a Schmidt decomposition of the form across the implicit bipartite division. This is done in the theorem statements above and everywhere in the paper. We can assume this WLOG because any state that has the same Schmidt coefficients as can be moved to this canonical form (and vice versa) using only local unitary transformations, which can be implemented with no quantum communication between the two components of the bipartite division. Thus our analysis of communication costs is unaffected by assuming WLOG that, in any quantum communication protocol, shared entangled states start and end in this form.
2 Entanglement-Assisted Communication Complexity
In this section we will discuss the proof of our main result, Theorem 1, which shows that arbitrary entanglement-assisted quantum communication protocols can be simulated by quantum communication protocols that use only the maximally entangled state as an entangled resource. A basic fact we will need is that two bipartite pure states which are sufficiently different in the distribution of mass across their Schmidt coefficients must be nearly orthogonal. This fact is stated for our specific purposes in Lemma 6 below. Crucially, such states remain nearly orthogonal even after one of them is acted on by any unitary which can be implemented with a small amount of quantum communication, as we detail in Lemma 5.
Lemma 5**.**
Given two quantum states and on , such that the Schmidt coefficients of are upper bounded by , and those of are upper bounded by , and further given a unitary transformation on which can be implemented using at most qubits of communication between the and components of the Hilbert space, it follows that:
[TABLE]
Proof.
If is a unitary transform using qubits of communication, then [8]. We also know that the Schmidt coefficients of are bounded above by [8]. The desired result now follows by Lemma 6. ∎
Lemma 6**.**
Given two quantum states and on , such that the Schmidt coefficients of are upper bounded by , and those of are upper bounded by , we have:
[TABLE]
Proof.
For brevity let . Schmidt decompose and as , as . Define the matrix , and note that
[TABLE]
Now, by definition of a Schmidt Decomposition, we know that the maximum singular value of is . Thus, for all we have that (since and are normalized vectors by definition). It then follows that:
[TABLE]
∎
Theorem 1 is the main result of this work. The proof is long enough that a high-level outline may be valuable. Therefore will now give a brief, intuitive outline of the proof of Theorem 1, restated below for the reader’s convenience, and include the complete proof in Section E of the Appendix.
Theorem** (Restatement of Theorem 1).**
Consider a quantum communication protocol whose goal is to compute a joint function . Suppose that uses an arbitrary bipartite entangled state (of unbounded dimension), as well as qubits of communication total, in either direction (for sufficiently large ). Then, for every , there exists a quantum communication protocol which simulates with error , while using only a maximally entangled state as an entangled resource (rather than or any other state), and using qubits of communication. Thus, if computes with error it follows that computes with error .
Outline of the Proof of Theorem 1:
The proof of Theorem 1 has three main parts. First, the initial entangled state used by the protocol can be converted using a small amount of communication to a state in which the Schmidt coefficients are grouped into evenly spaced groups. This is achieved using Theorem 3.
Second, we show that our new “grouped” entangled state can be divided into three “pieces” (more precisely termed subset-matrices in Definition 20), one piece which has small trace norm and can therefore be omitted, one piece called which only has non-zero terms which are far from the diagonal in the appropriate basis, and one piece called which is a block-diagonal mixed state that can be produced with small error and low communication cost from a maximally entangled state.
to show that, if one starts with a quantum communication protocol with an arbitrary shared entangled state, then that protocol can be modified, using a small amount of additional communication, to instead use an entangled state, , (a property which will be useful later in the proof). Once we have reduced, without loss of generality, to appropriately “grouped” entangled state in this way, the proof proceeds in two halves. In the first half, which is summed up in Lemma 22, we show that
In the second half of the proof, which is summed up in Lemma 23, we show that the piece of has very little effect on the outcome of the quantum communication protocol in question. This means that can be replaced by alone while incurring very little error in the outcome of the quantum communication protocol. Since can be produced with low cost from a maximally entangled state, this then achieves the desired result. The full proof of Theorem 1 is included in Section E of the Appendix. The role of Lemma 5 in the proof is within this step for controlling the terms far from the diagonal, in Lemma 23.
3 The Cost of State Transformation: A Lower Bound
It is natural at this point to discuss the background and proof for Theorem 4, which establishes a lower-bound on the cost of State Transformation by the -Smoothed Earth Mover’s Distance, and to postpone the discussion of Theorem 3 until Section 4, for two reasons. First, the proof of Theorem 4 in this section shares key techniques in common with the proof of Theorem 1 in Section 2 above, and so this progression may provide the reader with some continuity of thought while also reiterating the usefulness of the techniques. Second, Theorem 4 in this section motivates the notion of the Earth Mover’s Distance by highlighting its, perhaps surprising, relevance to lower bounding the cost of state transformation. This prepares the reader with some motivation for why the upper bound proven in Theorem 3, in Section 4 below, is interesting and potentially useful. Thus, covering Theorem 4 at this point may provide the reader with a reason to accept the -smoothed Earth Mover’s Distance as a useful proxy for the cost of State Transformation.
Whereas the proof of Theorem 3 in the next section will make direct use of Definition 2, the proof of Theorem 4 in this section is elucidated by first establishing an equivalent formulation of the Earth Mover’s Distance which is derived by establishing the relationship between the Earth Mover’s Distance as defined in Definition 2, and the Monge-Kantorovich Transportation distance on the real line, as shown below. After translating to this equivalent definition, stated in Definition 9, the generalization to the -smoothed Earth Mover’s Distance in Definition 10 is straightforward and natural.
Definition 7**.**
Given two probability distributions and on the real line, and a function the corresponding Monge-Kantorovich distance, between and is defined as:
[TABLE]
Where is defined to be the collection of all probability distributions on which have marginal on equal to and marginal on equal to .
In order to translate into a statement about quantum states, we make the following definition in a similar style to Definition 2:
Definition 8**.**
Given a bipartite shared state let us define a random variable which takes value with probability (note that, since the sum to one, this is a well defined random variable). We now define to be the probability distribution of this random variable.
It is clear that, for every , is a probability distribution on the real line. One may note the following simple relationship between Monge-Kantorovich distance and Earth Mover’s Distance:
For any , consider the Monge-Kantorovich distance, where the function is defined by if and if . Then, for any two quantum states and , we have that if and only if .
Given this concrete connection between Earth Mover’s Distance and the Monge-Kantorovich distance, we can now make use of the following characterization of Monge-Kantorovich distance for distributions on the real line, which is well known in optimal transport theory:
Fact**.**
Let and be probability distributions supported on the real line, and let and be their cumulative distribution functions, respectively. Then, for any :
[TABLE]
It follows from this Fact, combined with the discussion above, that an equivalent definition of the Earth Mover’s Distance is given by:
Definition 9**.**
[TABLE]
In the context of this equivalent formulation of Earth Mover’s Distance, we can succinctly introduce a “smoothed” version of the same distance. The reader may note that, since the above definition of is evidently not robust against tiny changes of either distribution in the total variation distance it would be impossible to prove a lower bound of the form of Theorem 4 if stated using that definition. Hence the motivation for introducing a “smoothed” version of the distance measure, which has built-in robustness by definition.
Definition 10**.**
-Smoothed -Earth Mover’s Distance
[TABLE]
With this definition in place we can now state the lower bound.
Theorem** (Restatement of Theorem 4).**
Given any two bipartite shared states and , shared between two parties and , together with a unitary which can be performed on the state via a quantum communication protocol , that uses qubits of communication between and , we have that, for every :
[TABLE]
Intuitively, Theorem 4 states that two bipartite shared states which are far apart in the -Smoothed -Earth Mover’s Distance, cannot be made equal via a quantum communication protocol unless it uses at least qubits of communication (for a particular constant which can be computed from the statement of Theorem 4).
Proof.
Suppose that two bipartite shared states and have . By definition such that
[TABLE]
Suppose that (if the opposite is true then we simply switch the roles of and and continue with the same proof). Define . Further define , and . Similarly define , and . Note that , and are orthogonal, as are and .
Since we have it follows from the definitions that . Since , and , it follows from Equation 1 that . Therefore, and thus .
The main idea in the proof of this theorem is that we can now partition into three nearly orthogonal parts, depending on , as follows:
Definition 11**.**
[TABLE]
Lemma 12**.**
For with , we have that , , and , where .
The proof of Lemma 12 is given separately in the appendix. Within that proof is the key use of Lemma 5 which is the primary conceptual step in proving Theorem 4. Understanding the proof of Lemma 12 is also the best way of understanding the motivation behind Definition 11 above.
It follows from the definitions that:
[TABLE]
While the individual and are not necessarily all orthogonal we do have and . Likewise and . Together these imply
[TABLE]
From Lemma 12 it follows that:
[TABLE]
Now recall that
[TABLE]
We now return to Equation 5. Setting and for we have
[TABLE]
where , and are unit vectors. We claim that this quantity is maximized by setting and . Indeed we can upper bound where and . Now define and we have
[TABLE]
This is maximized by taking and . Thus
[TABLE]
Finally we would like an upper bound independent of . This maximization is performed in the proof of Fact 18 from Section B of the Appendix and yields the following.
[TABLE]
∎
4 The Cost of State Transformation: An Upper Bound
In this section we will give a proof of Theorem 3, which states that the quantum communication cost of converting between two bipartite entangled states is upper bounded by the Earth Mover’s Distance between those states. This upper bound represents the second half of our two sided argument (employing both Theorem 3 and Theorem 4) that the Earth Mover’s Distance is a simple and efficiently computable proxy for the cost of state conversion. The proof is divided into two parts which are proved separately in Lemma 15, and Lemma 16 together with Corollary 17. At a high level Lemma 15 tells us that, given bipartite states and , one can map the Schmidt coefficients of directly onto the Schmidt coefficients of using a series of bipartite “flows” that have small degree (where degree is a quantity defined below). Lemma 16 and Corollary 17 then tell us that any such “flow” which has small degree, can be implemented as an actual bipartite state transformation, with correspondingly small communication required.
Here we establish Lemmas 15 and 16 which, together, prove the desired theorem. We begin with a couple definitions establishing the concept of flows, as we use it here.
Definition 13** (Right (Left) Index-1 Flow ).**
Fix two states and . A Right Index-1 Flow from to is a bipartite graph with vertices given by , and edge set , such that:
- •
Each vertex in has index 1 in .
- •
For all ,
If the roles of and are reversed in the above, then we say that there is a Left Index-1 Flow from to . Equivalently, there is a Left Index-1 Flow from to exactly when there is a a Right Index-1 Flow from to .
Definition 14** (Degree of a Right (Left) Index-1 Flow ).**
We define the degree of a Right (Left) Index-1 Flow from to to be the maximum index of any vertex in the bipartite graph .
The following lemma, which a key step in proving Theorem 3, establishes that bipartite states which are close to each other in the Earth Mover’s Distance of Definition 2, can be mapped to each other through a series of flows of bounded degree. This series of flows intuitively establishes a map for converting one bipartite state to the other using bounded quantum communication, in a manner that will be made rigorous in Lemma 16. The main step in the proof of Lemma 15 involves constructing a flow through a type of greedy algorithm whose analysis has a number of subtle cases. In order to concretely exhibit these cases the entire greedy algorithm, including every case, is written out in pseudocode in Algorithm 1.
Lemma 15**.**
Given two states and , there exist two “intermediate” states and , such that there is a Right Index-1 Flow from to of degree at most , a Left Index-1 Flow from to of degree at most , and a Left Index-1 Flow from to of degree at most .
The Proof of Lemma 15 is included in the Appendix, section G.
Lemma 15, above, shows that two bipartite entangled states can be connected to each other by a series of flows which have a degree which is bounded in terms of the Earth Mover’s Distance between them. The next step is to establish that every flow can be implemented via a quantum communication protocol. Lemma 16 and Corollary 17, below, accomplish this by showing that, if two bipartite states can be connected by flows of small degree, then one state can be converted to the other (and vice versa) using a quantum communication protocol which only requires small amounts of communication.
Lemma 16**.**
Given two states and such that there is a Right Index-1 Flow from to with degree at most , there exists a quantum communication protocol , which uses qubits of communication, and converts the shared state to the shared state .
The idea of the proof is that if then it suffices to define separately protocols for each term. These protocols simply use quantum communication to create a shared entangled state, resulting in the state . Choosing the Schmidt coefficients according to the given Right Index-1 Flow yields the result. The details of this argument are in the Appendix xC.
Corollary 17 establishes the same result as Lemma 16, but in the reverse direction.
Corollary 17**.**
Given two states and such that there is a Left Index-1 Flow from to with degree at most , then, for two parties sharing entangled state , there exists a quantum communication protocol , which uses qubits of communication, and converts the shared state to the shared state .
The proof of Corollary 17 is straightforward and appears in Appendix D.
Theorem** (Restatement of Theorem 3).**
Let and be two bipartite shared states. There is a protocol which can prepare from , using only qubits of communication.
Proof.
The proof follows by applying Lemma 15, followed by Lemma 16 and Corollary 17. ∎
Appendix A Proof of Lemma 12
Proof.
First note that it is immediate from the definitions that , so the conditions of the lemma are automatically satisfied in those cases.
To bound the remaining inner products we will first prove a bound on the inner product and note that the remaining inner products are bounded as a consequence of this first bound. For notational convenience, while establishing the bound on , we set , and let be the non-zero Schmidt coefficients of (which are just a renamed version of the non-zero Schmidt coefficients of ). Therefore, we know that, for all , , and . The purpose of this renaming convention is that we can now cleanly make the following definition. For integers define , so that we have , and whenever . So,
[TABLE]
By definition, for any , the Schmidt coefficients of are upper bounded by , and lower bounded by , and from the latter we have . Furthermore, the Schmidt coefficients of are upper bounded by , and thus, we have by Lemma 5 that:
[TABLE]
where the final inequality follows because by assumption. Thus,
[TABLE]
where the second inequality follows by Equation A and the subsequent equality follows by Equation 9. Having established this upper bound on we now proceed with bounding the other inner products in the Lemma statement:
[TABLE]
[TABLE]
where both of the inequality steps follow by Equation 12 (the first of which also uses the triangle inequality).
[TABLE]
[TABLE]
Now, as noted earlier, . Continuing with the cross terms we have:
[TABLE]
[TABLE]
where the last inequality follows from Equation 12. And, since we already have from Equation A, the final inner product to bound is:
[TABLE]
where the last inequality follows by Equation 12.
∎
Appendix B Fact 18
Fact 18**.**
For and ,
Proof.
Define . Note that , and . So, , , and
[TABLE]
for all and . It follows by integration that:
[TABLE]
So,
[TABLE]
∎
Appendix C Proof of Lemma 16
Proof.
By assumption there is a Right Index-1 Flow from to with degree at most , so there exists a bipartite graph with vertices given by , and edge set , such that:
- •
Each vertex in has index 1 in .
- •
For all , .
- •
The maximum degree of any vertex in is .
The protocol for Alice and Bob to start with shared state and end up with shared state will proceed as follows: Beginning with the state shared between Alice and Bob, we will refer to the register containing the Alice half of as , and the register containing the Bob half as . Alice will append two additional registers, of qubits each, and initialize each of them to the all zeros state. We will call these two new registers and respectively. Alice will then perform a controlled unitary operation between and the registers and . She will then pass the register to Bob using qubits of quantum communication to do so. Bob will then perform a controlled unitary between and , Alice will perform a controlled unitary between and , and after that Alice and Bob will share the state .
To describe the protocol more precisely we will define the specific controlled unitaries performed by Alice and Bob at each step. Beginning with a shared state , after Alice appends the two additional -qubit registers to her side of , the shared state looks as follows:
[TABLE]
Where, initially, Alice holds the registers , , and . Alice now performs a controlled unitary operation, acting on registers and and controlled on register . To describe this controlled unitary concisely we will need to imagine that there is some total order on the elements (any total order will do, one can simply imagine that the ’s are indexed by bit strings which encode integers), and we will define . Note that, since every has degree at most , is always an integer between [math] and , so it can always be expressed in binary as a -bit binary number. We will take this convention in the following argument.
Now to define Alice’s controlled unitary: When controlled on Alice’s unitary moves the state to the state . Note that since is always a -bit binary string, it can always be contained in the -qubit registers and . Further note that, since by assumption, is a normalized pure state. Thus there exists a unitary operation that moves to and Alice need only perform this specific unitary when the control register is in state . So, when Alice applies this controlled unitary to her registers , and (where is the controlling register), the resulting new shared state between Alice and Bob is:
[TABLE]
At this point Alice uses qubits of communication to pass the -qubit register to Bob. The resulting shared state is:
[TABLE]
Where Alice owns registers and , and Bob owns registers and . Now it is not hard to see from the definition of and the fact that every has degree exactly 1 in the graph , that there is a bijection mapping each to the tuple . Alice and Bob both know this bijection since they know the description of , and since bijections are invertible, Alice and Bob can now both apply a local unitary which relabels the basis element to the basis element . The resulting shared state is:
[TABLE]
Where the first equality follows because each appears in the initial sum exactly once (because has degree exactly one in ).
This completes the protocol.
∎
Appendix D Proof of Corollary 17
Proof.
By definition, if there is a Left Index-1 Flow from to , then there is a Right Index-1 Flow from to (which is the starting assumption of Lemma 16). One can check that, in the proof Lemma 16, every operation performed by Alice and Bob was reversible. Therefore, the proof of this corollary is simply to start at the end of the proof of Lemma 16, and “reverse” every step of the proof in order from end to beginning (including the communication step…now communication goes from Bob to Alice rather than Alice to Bob). The result is the desired quantum communication protocol, which converts the shared state to the shared state using qubits of communication. ∎
Appendix E Proof of Theorem 1
A concept which will be useful in the proof of Theorem 1 is the notion of the spread of a state:
Definition 19** (Spread).**
For a finite dimensional bipartite entangled state let be the maximum of the Schmidt coefficients of , and let be the minimum Schmidt coefficient. We define the spread of to be the quantity .
We note that the above definition of spread is given in the case of finite dimensional , which is the only case we will need. There is also an -smoothed variant of the spread of a state [8, 5], but it will not be needed for this proof. Within the proof of Theorem 1 the spread of a bipartite state will be used as a proxy for the amount of communication required to create that state from a maximally entangled state. This intuition is formalized, for example, by Theorem 3, but in this case of converting from a maximally entangled state, is also an implication of earlier works, such as [7, 8].
Theorem** (Restatement of Theorem 1).**
Consider a quantum communication protocol whose goal is to compute a joint function . Suppose that uses an arbitrary bipartite entangled state (of unbounded dimension), as well as qubits of communication total, in either direction (for sufficiently large ). Then, for every , there exists a quantum communication protocol which simulates with error , while using only a maximally entangled state as an entangled resource (rather than or any other state), and using qubits of communication. Thus, if computes with error it follows that computes with error .
Proof.
Given , , and as in the theorem statement, Schmidt decompose as (see Remark 2 for why we may assume WLOG that has this form).
Let be an integer, which will be specified later. Define a function given by
[TABLE]
and define a new state , where . Note that , so that is a normalized pure state. Furthermore, every Schmidt coefficient of is within a multiple of of the integer power . This follows because
[TABLE]
Next, we can upper bound by considering the coupling in which each is moved to . The largest distance obtained here is the maximum for which , and this in turn is . Therefore, by Theorem 3, there is a protocol by which Alice and Bob can prepare from , using qubits of communication. (For this special case, of course a simpler protocol could also be used.)
Define to be the composed protocol in which Alice and Bob start with shared state , first use protocol to convert to , and then perform protocol using shared state and inputs and , to compute the joint function . It is evident that has exactly the same success probability as . Since uses at most qubits of communication and uses qubits of communication, can be performed with qubits of communication.
For a nonnegative integer, define and define the subnormalized state
[TABLE]
From Equation (15) and the surrounding discussion, we have that . Furthermore, by the definition of , it follows that has spread at most ; note that the spread of does not depend on whether the state is normalized or not.
The idea of the proof is that different are not only orthogonal, but must remain approximately orthogonal even after a small amount of quantum communication. In particular, note that for any , . Furthermore, for all we have, by definition, that the Schmidt coefficients of are bounded above by . Therefore, if is a unitary transform using qubits of communication, then, it follows by Lemma 5, that ,
[TABLE]
To apply this to our problem, we first note that the protocol depends, a priori, on the inputs to the function that we wish to compute (just like the the protocol ). We now fix any input pair and for the remainder of the proof of this theorem we will perform only transformations of the shared state which do not depend on the value of . We will therefore establish that our transformation to a maximally entangled shared state does not significantly impact the success probability of the quantum communication protocol regardless of the value of . The desired Theorem then follows.
With the input now fixed, we observe that the success probability of protocol (which we have already established is equal to the success probability of the original protocol ) can be expressed WLOG by performing and then computing the probability of outcomes when measuring the first qubit in the computational basis. The probability that such a measurement on protocol outputs is
[TABLE]
where acts on all qubits except for the first, which is being measured. Define . Then
[TABLE]
Observe, for later, that is a unitary operator that can be implemented using qubits of communication.
The proof will proceed as follows: In Lemma 22 we show that the density matrix can be divided into three “pieces” (in a manner that does not depend on the inputs ), one piece which has small trace norm and can therefore be omitted, one piece called which only has non-zero terms which are far from the diagonal in the appropriate basis, and one piece called which is a block-diagonal mixed state that can be produced with small error and low communication cost from a maximally entangled state. Then, in Lemma 23, we show that the piece of has very little effect on the protocol . This means that can be replaced by alone while incurring very little error in the success probability of . Stated equivalently, via the equality in Equation 18 above, Lemma 23 shows that the quantity is small. Since we know from Lemma 22 that can be produced with low cost from a maximally entangled state, this leads us to the desired result. Since does not depend on the inputs this same statement holds for every pair of inputs . From this point forward we will no longer specify the fixed inputs , as it will be clear that the state substitutions do not depend on these inputs, and thus that the argument holds for every input as discussed in this paragraph.
We now establish some notation which will be useful throughout the rest of the proof:
Definition 20** (subset-matrix).**
Consider operators on the Hilbert space which is the span of the . We say that an operator is a subset-matrix of an operator , if it is the case that for all either , or .
Definition 21** (Non-Zero Set).**
For an operator on the Hilbert space which is the span of the , define the non-zero set of to be .
Lemma 22**.**
Consider the density matrix . For any , there exist subset-matrices, , of , such that
** 2. 2.
, where . 3. 3.
The bipartite shared state can be prepared starting from EPR pairs with bits of communication.
The proof of Lemma 22 is included in Section F of the Appendix.
We can now bound the difference between the protocol acting on versus acting on , following equation 18 as follows:
[TABLE]
Setting and recalling from the Theorem statement that by assumption, it follows by Lemma 23, stated below, that:
[TABLE]
This completes the proof of the Theorem as we now describe.
We know from Lemma 22 that there is a quantum communication protocol, call it , which prepares the shared state starting from just a maximally entangled state using at most bits of communication. Now define the protocol . Since uses at most qubits of communication, and since we have chosen to set (in the line above Equation 19), it follows that uses at most qubits of communication. Furthermore, the success probability of with only the maximally entangled state as an entangled resource is the same, by construction, as the success probability of with as an entangled resource, which, by Equation 19 above and the original definition , is within of the success probability of the original protocol from the theorem statement when using the original shared state as an entangled resource. This is the desired result. ∎
Lemma 23**.**
For as constructed in Lemma 22, and for as defined in the proof of Theorem 1 we have, whenever .
Proof.
Following Lemma 22, we define . Now, letting and be as in Lemma 22, and recalling that , we have:
[TABLE]
where the final inequality follows because by Lemma 22. Recalling that the unitary can be implemented using 2Q+8N+16 qubits of communication, and applying equation 17 then gives that:
[TABLE]
So, recalling from the Lemma statement that by assumption:
[TABLE]
So,
[TABLE]
∎
Note, in the pre-processing step in the proof of Theorem 1, and again at a point within the proof of Lemma 22 we use our Theorem 3 in a setting where either the starting or ending state is very close to a maximally entangled state. It is helpful to observe, to avoid confusion, that in such cases Theorem 3 is not strictly necessary and could be replaced with previously known results from, for example, [7, 8]. In this manuscript we will use Theorem 3 in these cases in order to remain self-contained, and for the convenience of the reader, but we emphasize that the lines of the proof of Theorem 1 in which we use Theorem 3 could be replaced with known results.
Appendix F Proof of Lemma 22
Lemma** (Restatement of Lemma 22).**
Consider the density matrix . For any , there exist subset-matrices, , of , such that
** 2. 2.
, where . 3. 3.
The bipartite shared state can be prepared starting from EPR pairs with bits of communication.
Proof.
Note: The terminology used in this proof is defined in the proof of Theorem 1 preceding the use of Lemma 22 there (Appendix E).
Fixing an we will now show how to “cut” down into a mixture of states of small spread such that the cut only removes subset-matrices of the operator which are either far from the diagonal or small in the trace norm (less than ).
Define a sequence of mutually orthogonal projectors , where each is the projection onto the span of . Let
[TABLE]
Now, for define
[TABLE]
The are block-diagonal subset-matrices of , which are disjoint in the sense that when . Additionally, is a subset-matrix of which contains the entire diagonal of . Indeed can be obtained from via the “pinching” TPCP which has Kraus operators given by the . Thus
[TABLE]
Choose such that . Since the are all PSD we also have .
Our strategy now is to use something like as a candidate for in the Lemma statement. However, subtracting all of removes some terms close to the diagonal, which, even though it is not a large fraction of all entries in , would make the proof and statement of Lemma 22 somewhat awkward. So, in order to make the Lemma statement as clean as possible we will only subtract the “anti-diagonal” parts of , and leave the “diagonal” parts of in a manner made precise below.
Define the block matrices
[TABLE]
and are, respectively, the diagonal and off-diagonal blocks of .
Further define . We have that , and that since is a block-diagonal subset-matrix of containing the entire diagonal of . Thus,
[TABLE]
We now define a “cut down” version of by . From this definition we have:
[TABLE]
Further, we define the projectors
[TABLE]
and define the block diagonal matrix as:
[TABLE]
where the last equality follows because because consists only of the “anti-diagonal” components which lie outside of the . Note that is a subset-matrix of according to Definition 20. Now define by:
[TABLE]
Therefore, is also a subset-matrix of according to Definition 20. Furthermore, it follows immediately using Equation 21 that:
[TABLE]
Second Claim: To establish the second claim in Lemma 22 we now show that (recall that ). To see this, we consider the case that and show that in this case . Assume WLOG that . When we know that either such that:
[TABLE]
or such that:
[TABLE]
In the first case, denoted by Equation 26, we have that the coordinates lie within the subset-matrix of , and thus that either or by definition. In particular, either as follows by Equation 23 and the definition of in Equation 22, or . If then we note that by definition (Equation 24), and this implies that . If , then because .
On the other hand, in the case denoted by Equation 27, we have the coordinates lie within the subset-matrix of , and thus that either , or . The reason for this is that we know that, in this case, the coordinates are within the subset-matrix of . Furthermore, since we have already ruled out the case of Equation 26, we know that is not in , the block diagonal portion of . Therefore, the coordinates must lie in the block-anti-diagonal portion (this can also be determined directly from Equation 27 itself, and the definition of ). Since we know that the coordinates lie within the , or more precisely, either , or . Just as before, if , then . On the other hand, in the case that we know that because by Equation 24, and as follows from the definition .
This establishes that .
Third Claim: To establish the third claim in Lemma 22, and complete the proof, we will show that is a mixture of states of spread at most , which means that can be produced from a shared maximally entangled state with at most bits of communication.
Recalling the definition of in Equation 23, let us define , so that it is clear that . It is also clear that is not only PSD, but also an un-normalized pure state, because
[TABLE]
From the definition of in Equation 22 we have that:
[TABLE]
Where the index limits are
[TABLE]
We know from the definition in Equation 16 that the are orthogonal to each other, and that each has Schmidt coefficients bounded by . Thus, it is immediate that has spread at most , where the last equality follows because . Therefore is a normalized mixture of states with spread at most .
Consider the normalized version of , which is still a pure state of spread at most it is clear that this state has Earthmover distance at most from the nearest maximally entangled state (simply move all of the weight onto Schmidt coefficients of the size of the smallest Schmidt coefficient, which can be done by moving all the weight a distance less than or equal to the spread). It follows, by using Theorem 3 that there is a protocol which prepares the normalized version of from EPR pairs, with only bits of communication (we note that this line of the proof could also have been established using result from [7, 8], for example). Now the state can be prepared by applying this same protocol in superposition over (with the probability assigned to each ), and then tracing out over the register. Thus can be prepared starting from EPR pairs with bits of communication.
∎
Appendix G Proof of Lemma 15
Proof.
Given two states and , and an arbitrary , let be the joint distribution on which satisfies the Earth Mover conditions for and , and acheives the optimal earth mover bound . That is, for all , , for all , , and whenever .
Define , where
[TABLE]
We now define the intermediate state
[TABLE]
where the Schmidt coefficients are left unspecified for now.
In order to specify the Schmidt coefficients of the intermediate state as well as the Right Index-1 Flow from to , and the Left Index-1 Flow from to we will first define “bins” for the Schmidt coefficients of as follows:
For let , and . Define .
Fact 24**.**
If , then
Proof.
Given , and we have by definition that , and , and therefore that , where the last equality follows by assumption. It follows by definition of and of , that . Since this is true for all , the claim follows. ∎
We will now specify an iterative, “greedy” procedure to define the Schmidt coefficients as a function of the and .
For each such that we first note that by Fact 24 that . Thus, for each ,
[TABLE]
for all .
One may check that Algorithm 1 defines Schmidt coefficients , satisfying
[TABLE]
as well as a Right Index-1 Flow from to , with degree at most . In particular the Right Index-1 Flow from to is constructed in Algorithm 1 by iteratively adding edges to form the bipartite flow-graph where . Each line in the pseudocode which reads “Add an edge in the flow graph from to ”, or similar, adds a single edge to the graph and the union of all these edges forms the bipartite flow between and . Furthermore, for the defined by Algorithm 1,
[TABLE]
so that there is a Left Index-1 flow from to defined by a bipartite graph between the Schmidt coefficients of and respectively, in which, for every , there is an edge from to of weight . This Left Index-1 flow then clearly has degree .
Finally, recall that,
[TABLE]
So, by very similar reasoning, there is a Left Index-1 flow from to with degree exactly .
∎
Acknowledgments
AWH was funded by NSF grants CCF-1452616, CCF-1729369, PHY-1818914 and ARO contract W911NF-17-1-0433. MC was supported at MIT by an Akamai Fellowship, and at the IQC by Canada’s NSERC and the Canadian Institute for Advanced Research (CIFAR), and through funding provided to IQC by the Government of Canada and the Province of Ontario.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] D. Aharonov, A. W. Harrow, Z. Landau, D. Nagaj, M. Szegedy, and U. Vazirani. Local tests of global entanglement and a counterexample to the generalized area law. In Foundations of Computer Science (FOCS), 2014 IEEE 55th Annual Symposium on , pages 246–255, Oct 2014, ar Xiv:1410.0951 .
- 2[2] C. H. Bennett, H. J. Bernstein, S. Popescu, and B. Schumacher. Concentrating partial entanglement by local operations. Phys. Rev. A , 53:2046–2052, 1996, ar Xiv:quant-ph/9511030 .
- 3[3] C. H. Bennett, I. Devetak, A. W. Harrow, P. W. Shor, and A. Winter. The quantum reverse Shannon theorem and resource tradeoffs for simulating quantum channels. IEEE Trans. Inf. Theory , 60(5):2926–2959, May 2014, ar Xiv:0912.5537 .
- 4[4] S. Daftuar and P. Hayden. Quantum state transformations and the schubert calculus. Annals of Physics , 315:80–122, 2005, ar Xiv:quant-ph/0410052 .
- 5[5] A. W. Harrow. Entanglement spread and clean resource inequalities. In P. Exner, editor, XV Ith Int. Cong. on Math. Phys. , pages 536–540. World Scientific, 2009, ar Xiv:0909.1557 .
- 6[6] A. W. Harrow and D. W. Leung. A communication-efficient nonlocal measurement with application to communication complexity and bipartite gate capacities. IEEE Trans. Inf. Theory , 57(8):5504–5508, 2011, ar Xiv:0803.3066 .
- 7[7] A. W. Harrow and H.-K. Lo. A tight lower bound on the classical communication cost of entanglement dilution. IEEE Trans. Inf. Theory , 50(2):319–327, 2004, ar Xiv:quant-ph/0204096 .
- 8[8] P. Hayden and A. Winter. On the communication cost of entanglement transformations. Phys. Rev. A , 67:012306, 2003, ar Xiv:quant-ph/0204092 .
