When is the Chernoff Exponent for Quantum Operations finite?

Nengkun Yu; Li Zhou

arXiv:1705.01642·quant-ph·January 26, 2022·IEEE Trans. Inf. Theory

When is the Chernoff Exponent for Quantum Operations finite?

Nengkun Yu, Li Zhou

PDF

TL;DR

This paper investigates the conditions under which the Chernoff exponent for quantum operations is finite, establishing a clear criterion related to perfect distinguishability and providing bounds on the exponent.

Contribution

It provides a necessary and sufficient condition for the finiteness of the Chernoff exponent for quantum operations and offers upper bounds, clarifying the asymptotic error decay behavior.

Findings

01

Chernoff exponent is finite iff quantum operations cannot be perfectly distinguished with finite uses.

02

Error probability decays exponentially when operations are not perfectly distinguishable.

03

Super-exponential decay of error probability is ruled out.

Abstract

We consider the problem of testing two hypotheses of quantum operations in a setting of many uses where an arbitrary prior probability distribution is given. The Chernoff exponent for quantum operations is investigated to track the minimal average error probability of discriminating two quantum operations asymptotically. We answer the question, "When is the Chernoff exponent for quantum operations finite?" We show that either two quantum operations can be perfectly distinguished with finite uses, or the minimal discrimination error decays exponentially with respect to the number of uses asymptotically. That is, the Chernoff exponent is finite if and only if the quantum operations can not be perfectly distinguished with finite uses. This rules out the possibility of super-exponential decay of error probability. Upper bounds of the Chernoff exponent for quantum operations are provided.

Equations139

ξ_{E, F} = - n \to \infty \overline{lim} \frac{lo g P _{er r, min, n}}{n}

ξ_{E, F} = - n \to \infty \overline{lim} \frac{lo g P _{er r, min, n}}{n}

ρ = k = 1 \sum n p_{k} ∣ ψ_{k} ⟩ ⟨ ψ_{k} ∣,

ρ = k = 1 \sum n p_{k} ∣ ψ_{k} ⟩ ⟨ ψ_{k} ∣,

D (ρ, σ) \equiv \frac{1}{2} Tr ∣ ρ - σ ∣

D (ρ, σ) \equiv \frac{1}{2} Tr ∣ ρ - σ ∣

F (ρ, σ) \equiv Tr ρ σ ρ .

F (ρ, σ) \equiv Tr ρ σ ρ .

F (i \sum p_{i} ρ_{i}, i \sum q_{i} σ_{i}) \geq i = 0 \sum n p_{i} q_{i} F (ρ_{i}, σ_{i}) .

F (i \sum p_{i} ρ_{i}, i \sum q_{i} σ_{i}) \geq i = 0 \sum n p_{i} q_{i} F (ρ_{i}, σ_{i}) .

F (i \sum p_{i} ψ_{i}, i \sum q_{i} ϕ_{i}) \geq i = 0 \sum n p_{i} q_{i} F (ψ_{i}, ϕ_{i})

F (i \sum p_{i} ψ_{i}, i \sum q_{i} ϕ_{i}) \geq i = 0 \sum n p_{i} q_{i} F (ψ_{i}, ϕ_{i})

= i = 0 \sum n ∣ ⟨ p_{i} ψ_{i} ∣ q_{i} ϕ_{i} ⟩ ∣.

1 - F (ρ, σ) \leq D (ρ, σ) \leq 1 - F (ρ, σ)^{2} .

1 - F (ρ, σ) \leq D (ρ, σ) \leq 1 - F (ρ, σ)^{2} .

D (ϕ, ψ) = 1 - F (ϕ, ψ)^{2} = 1 - ∣ ⟨ ϕ ∣ ψ ⟩ ∣^{2} .

D (ϕ, ψ) = 1 - F (ϕ, ψ)^{2} = 1 - ∣ ⟨ ϕ ∣ ψ ⟩ ∣^{2} .

P_{er r} = Π_{0} Tr [E_{1} ρ_{0}] + Π_{1} Tr [E_{0} ρ_{1}] .

P_{er r} = Π_{0} Tr [E_{1} ρ_{0}] + Π_{1} Tr [E_{0} ρ_{1}] .

P_{er r, min} = \frac{1}{2} (1 - Tr ∣ Π_{1} ρ_{1} - Π_{0} ρ_{0} ∣) .

P_{er r, min} = \frac{1}{2} (1 - Tr ∣ Π_{1} ρ_{1} - Π_{0} ρ_{0} ∣) .

E (ρ) = i = 1 \sum k E_{i} ρ E_{i}^{†},

E (ρ) = i = 1 \sum k E_{i} ρ E_{i}^{†},

F (E (ρ), E (σ))

F (E (ρ), E (σ))

Tr [(I^{R} \otimes E^{Q}) (ψ^{R Q}) (I^{R} \otimes F^{Q}) (ϕ^{R Q})] = 0

Tr [(I^{R} \otimes E^{Q}) (ψ^{R Q}) (I^{R} \otimes F^{Q}) (ϕ^{R Q})] = 0

⟨ ψ^{R Q} ∣ ϕ^{R Q} ⟩ = 0.

⟨ ψ^{R Q} ∣ ϕ^{R Q} ⟩ = 0.

(I^{R} \otimes E_{i}^{Q}) ∣ ψ^{R Q} ⟩ ⟨ ψ^{R Q} ∣ (I^{R} \otimes E_{i}^{Q})^{†}

(I^{R} \otimes E_{i}^{Q}) ∣ ψ^{R Q} ⟩ ⟨ ψ^{R Q} ∣ (I^{R} \otimes E_{i}^{Q})^{†}

(I^{R} \otimes F_{j}^{Q}) ∣ ϕ^{R Q} ⟩ ⟨ ϕ^{R Q} ∣ (I^{R} \otimes F_{j}^{Q})^{†}],

(I^{R} \otimes F_{j}^{Q}) ∣ ϕ^{R Q} ⟩ ⟨ ϕ^{R Q} ∣ (I^{R} \otimes F_{j}^{Q})^{†}],

⟨ ψ^{R Q} ∣ ϕ^{R Q} ⟩ = 0.

⟨ ψ^{R Q} ∣ ϕ^{R Q} ⟩ = 0.

⟨ ψ^{R Q} ∣ (I^{R} \otimes E_{i}^{Q})^{†} (I^{R} \otimes F_{j}^{Q}) ϕ^{R Q} ⟩ = 0

⟨ ψ^{R Q} ∣ (I^{R} \otimes E_{i}^{Q})^{†} (I^{R} \otimes F_{j}^{Q}) ϕ^{R Q} ⟩ = 0

⟨ ψ^{R Q} ∣ ϕ^{R Q} ⟩ = 0.

⟨ ψ^{R Q} ∣ ϕ^{R Q} ⟩ = 0.

⟨ ψ^{R Q} ∣ (I^{R} \otimes E_{i}^{†} F_{j}) ϕ^{R Q} ⟩ = 0,

⟨ ψ^{R Q} ∣ (I^{R} \otimes E_{i}^{†} F_{j}) ϕ^{R Q} ⟩ = 0,

⟨ ψ^{R Q} ∣ ϕ^{R Q} ⟩ = 0.

⟨ ψ^{R Q} ∣ ϕ^{R Q} ⟩ = 0.

Tr (M E_{i}^{†} F_{j}) = ⟨ ψ^{R Q} ∣ (I^{R} \otimes E_{i}^{†} F_{j}) ϕ^{R Q} ⟩ = 0,

Tr (M E_{i}^{†} F_{j}) = ⟨ ψ^{R Q} ∣ (I^{R} \otimes E_{i}^{†} F_{j}) ϕ^{R Q} ⟩ = 0,

Tr M = 0.

Tr M = 0.

η > 0

η > 0

η := ρ \in D (H_{R Q}) in f 0 \leq X \leq (I^{R} \otimes E) (ρ^{R Q}), 0 \leq X \leq (I^{R} \otimes F) (ρ^{R Q}) sup Tr X

η := ρ \in D (H_{R Q}) in f 0 \leq X \leq (I^{R} \otimes E) (ρ^{R Q}), 0 \leq X \leq (I^{R} \otimes F) (ρ^{R Q}) sup Tr X

= ρ \in D (H_{R Q}) in f 0 \leq X \leq (I^{R} \otimes E) (ρ^{R Q}), 0 \leq X \leq (I^{R} \otimes F) (ρ^{R Q}) max Tr X .

0 \leq X \leq (I^{R} \otimes E) (ρ^{R Q}), 0 \leq X \leq (I^{R} \otimes F) (ρ^{R Q}) max Tr X

0 \leq X \leq (I^{R} \otimes E) (ρ^{R Q}), 0 \leq X \leq (I^{R} \otimes F) (ρ^{R Q}) max Tr X

⟨ I, X ⟩

⟨ I, X ⟩

Φ (X) \leq B,

X \in Pos (H_{R Q}) .

⟨ B, Y ⟩

⟨ B, Y ⟩

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

When is the Chernoff Exponent for Quantum Operations finite?

Nengkun Yu

Li Zhou

Centre for Quantum Software and Information,

Faculty of Engineering and Information Technology,

University of Technology Sydney, NSW 2007, Australia

Email: [email protected]

Max Planck Institute for Security and Privacy, Bochum, Germany

[email protected]

Abstract

We consider the problem of testing two hypotheses of quantum operations in a setting of many uses where an arbitrary prior probability distribution is given. The Chernoff exponent for quantum operations is investigated to track the minimal average error probability of discriminating two quantum operations asymptotically. We answer the question, “When is the Chernoff exponent for quantum operations finite?” We show that either two quantum operations can be perfectly distinguished with finite uses, or the minimal discrimination error decays exponentially with respect to the number of uses asymptotically. That is, the Chernoff exponent is finite if and only if the quantum operations can not be perfectly distinguished with finite uses. This rules out the possibility of super-exponential decay of error probability. Upper bounds of the Chernoff exponent for quantum operations are provided.

I Introduction

A fundamental problem in quantum information theory is to test a device that may be prepared for implementing one of many quantum operations. The testing treated in the framework of quantum mechanics is performed by inputting a quantum state and performing a quantum measurement. The general noncommutative feature and the complex structure of quantum operations make quantum statistics a much richer field than its classical counterpart.

In the degenerate case where the outputs of the quantum operations are fixed, the freedom of choosing input states becomes useless. The asymptotic behavior of the average error, in discriminating a set of quantum states $\{\rho_{1}^{\otimes n},\ldots,\rho_{r}^{\otimes n}\}$ with prior probability distribution $\{\Pi_{1},\ldots,\Pi_{r}\}$ is of great interest. In [26], Parthasarathy showed that the average error decays exponentially, asymptotically. Significant efforts have been made to identify the Chernoff exponent as the optimal error exponent. In two breakthrough papers, [3] and [25], the closed-form of the optimal error exponent was obtained, which can be regarded as the quantum generalization of the Chernoff bound in classical hypothesis testing [6]. Li proved that multiple Chernoff exponent equals the minimal mutual Chernoff exponent [22].

It is highly desirable to generalize the results of quantum states to quantum operations. Given that considerable experimental effort has been devoted to the field of quantum mechanics to prepare quantum systems and measure quantum states, it is of fundamental importance to develop a theory that can discriminate the different quantum operations. We note that for classical channel discrimination, the optimal exponential error rate problem has been well understood, where it is proven that adaptive choice does not improve the exponential error rate in these settings [13].

Where quantum operations are only allowed to be used once, the problem has been extensively studied, with fruitful results. By employing Holevo-Helstrom’s celebrated theorem on the one-copy quantum state discrimination [14, 17], a completely bounded trace norm, known as the diamond norm, was introduced to characterize the difference between quantum channels by Kitaev [19]. This norm becomes a fundamental tool in almost all aspects of quantum information science [2, 31, 32, 27] since it is the most physically meaningful notion of distance between quantum operations.

The problem becomes much more complicated when quantum operations are used multiple times [15, 7, 28]. Much effort has been devoted to characterizing the conditions of perfect distinguishability, in the sense that two quantum operations can be distinguished without error by a finite number of uses [1, 9, 37, 21, 20]. Unlike classical channel discrimination, unitary operations exist that cannot be distinguished without error for single-use, while multiple uses can help achieve perfect discrimination. A complete solution to this problem is obtained in [10] with a feasible, necessary, and sufficient condition.

In this paper, we investigate the concept of Chernoff exponent for quantum operations to characterize the asymptotic behavior of the average error probability of distinguishing given quantum operations under any prior probability distribution. Suppose we have a quantum device that is secretly chosen from $\{\mathcal{E},\mathcal{F}\}$ , and a known set of two quantum operations according to a prior probability distribution $\{\Pi_{0},\Pi_{1}\}$ . Our goal is to identify whether the device is $\mathcal{E}$ or $\mathcal{F}$ by using this device many times. We explore the Chernoff exponent for two quantum operations $\mathcal{E}$ and $\mathcal{F}$ to track the optimal error probability by using the following definition:

[TABLE]

where $P_{err,min,n}$ denotes the infimum discrimination error over all possible output states $\rho_{n}$ and $\sigma_{n}$ as illustrated in Figure 1 where the quantum device is used $n$ time.

Notice that all possible strategies can be described by, or translate to, the model showed in Figure 1 with a sufficiently large ancilla system. This model is the most general scheme, and quantum operations $\mathcal{G}_{i}$ are freely chosen. For instance, parallel uses of devices can always be simulated by sequential uses and employing swap operators.

We show that the Chernoff exponent for quantum operations is finite if and only if they cannot be distinguished perfectly with finite uses. More precisely, we show that the average error probability decays at most, according to exponential function for quantum operations, if the quantum operations can not be perfectly distinguishable. This indicates that the error probability can never decay super-exponentially, such as $\exp(-\alpha n^{2})$ . Computable upper bounds on the Chernoff exponent for quantum operations are provided. Finally, we generalize our results to deal with multiple quantum operations.

II Notations and Preliminaries

We use the symbols $\mathcal{H},\mathcal{X},\mathcal{Y},\mathcal{Z}$ to denote finite-dimensional Hilbert spaces over complex numbers and $\mathrm{L}\left(\mathcal{H}\right)$ to denote the set of linear operators mapping from $\mathcal{H}$ into itself. For Hermitian matrices $A,B$ , we use $\langle A,B\rangle=\operatorname{Tr}(A^{{\dagger}}B)=\operatorname{Tr}(AB)$ to denote their inner product. Let $\mathrm{Pos}(\mathcal{H})\subset\mathrm{L}\left(\mathcal{H}\right)$ be the set of positive (semidefinite) matrices, and $\mathcal{D}(\mathcal{H})\subset\mathrm{Pos}(\mathcal{H})$ is the set of positive matrices with trace one. A pure quantum state of $\mathcal{H}$ is just a normalized vector $\left|\psi\right\rangle\in\mathcal{H}$ , while a general quantum state is characterized by a density operator $\rho\in\mathcal{D}(\mathcal{H})$ . For simplicity, we use $\psi$ to represent the density operator of a pure state $\left|\psi\right\rangle$ which is just the projector $\psi=|\psi\rangle\langle\psi|$ . A density operator $\rho$ can always be decomposed into a convex combination of pure states:

[TABLE]

where the coefficients $p_{k}$ are strictly positive numbers and add up to one. The support of $\rho$ is defined as $\mathrm{supp}(\rho)=\mathrm{span}\{\left|\psi_{k}\right\rangle:1\leq k\leq n\}$ . We say two pure states $\left|\psi\right\rangle$ and $\left|\phi\right\rangle$ are orthogonal if and only if their inner product $\langle\psi,\phi\rangle$ is equal to zero, and the orthogonality of two density operators $\rho$ and $\sigma$ is defined by the orthogonality of their supports, namely, $\rho$ and $\sigma$ are orthogonal if and only if $\mathrm{supp}(\rho)\perp\mathrm{supp}(\sigma)$ . Two density operators $\rho$ and $\sigma$ are said to be disjoint if $\mathrm{supp}(\rho)\cap\mathrm{supp}(\sigma)=\{0\}$ and joint if the intersection of their support contains some non-zero vectors.

There are two commonly used measures to characterize the difference between the quantum states: trace distance and fidelity. The trace distance $D$ between two density operators $\rho$ and $\sigma$ is defined as

[TABLE]

where we define $|A|\equiv\sqrt{A^{\dagger}A}$ to be the positive square root of $A^{\dagger}A$ .

The fidelity of states $\rho$ and $\sigma$ is defined to be

[TABLE]

The strong concavity property for the fidelity is quite useful, which can be formalized as

Fact 1 ([24]).

For quantum states $\rho_{i}$ , $\sigma_{i}$ and probability distributions $(p_{0},p_{1},\cdots,p_{n})$ and $(q_{0},q_{1},\cdots,q_{n})$

[TABLE]

If $\rho_{i}=\psi_{i}$ and $\sigma_{i}=\phi_{i}$ are all pure states, we obtain

[TABLE]

Definition 1.

We say that a pure state $\left|\psi\right\rangle\in\mathcal{H}_{A}\otimes\mathcal{H}_{B}$ is a purification of some state $\rho$ if $\mathop{\mathrm{tr}}\nolimits_{A}({\psi})=\rho$ .

Fact 2 (Uhlmann’s theorem, [29]).

Given quantum states $\rho$ , $\sigma$ , and a purification $\left|\psi\right\rangle$ of $\rho$ , it holds that $F({\rho},{\sigma})=\max_{\left|\phi\right\rangle}|\langle\phi|\psi\rangle|$ , where the maximum is ranging over all purifications of $\sigma$ .

The following fact connects the trace distance and the fidelity between two states.

Fact 3 (Fuchs-van de Graaf inequalities [11]).

For quantum states $\rho$ and $\sigma$ , it holds that

[TABLE]

For pure states $\left|\phi\right\rangle$ and $\left|\psi\right\rangle$ , we have

[TABLE]

The trace distance is a static measure quantifying how close two quantum states are and is closely related to the discrimination of quantum states. Let us consider the two hypotheses, $H_{0}$ and $H_{1}$ . Hypothesis $H_{0}$ assumes that a given unknown quantum state is equal to $\rho_{0}$ , and Hypothesis $H_{1}$ assumes that a given unknown quantum state is equal to $\rho_{1}$ . We assume that the prior probability distribution of $\rho_{0}$ and $\rho_{1}$ are $\Pi_{0}$ and $\Pi_{1}$ , respectively, which add up to one.

A physical strategy to discriminate between these two hypotheses is to perform a positive-operator valued measure (POVM) on the quantum state with two outcomes, 0 and 1. Such a POVM has two elements $\{E_{0},E_{1}\}$ satisfying $E_{0},E_{1}\in\mathrm{Pos}(\mathcal{H})$ and $E_{0}+E_{1}=I$ , where $I$ is the identity matrix of $\mathcal{H}$ . The aim of quantum state discrimination is to find the elements $E_{0}$ and $E_{1}$ that minimize the total error $P_{err}$ , which is

[TABLE]

This optimal error has been identified by Helstrom as expressed in the following equation

[TABLE]

A quantum operation $\mathcal{E}$ from $\mathrm{L}\left(\mathcal{H}\right)$ to $\mathrm{L}\left(\mathcal{Z}\right)$ is a completely positive and trace-preserving map used to describe the evolution of an open quantum system. A quantum operation $\mathcal{E}$ can always be represented using the Kraus representation as

[TABLE]

where $\{E_{i}\}_{i=1,\cdots,k}$ are the Kraus operators of $\mathcal{E}$ satisfying $\sum_{i=1}^{k}E_{i}^{\dagger}E_{i}=I$ , the identity of $\mathcal{H}$ .

The following fact states that the fidelity between two states is non-decreasing under quantum operations.

Fact 4 ([24]).

For states $\rho$ , $\sigma$ , and quantum operation $\mathcal{E}(\cdot)$ , it holds that

[TABLE]

Quantum operations $\mathcal{E}$ and $\mathcal{F}$ are said to be perfectly distinguishable with finite uses if there exists a strategy illustrated as Figure 1 such that $\sigma_{n}$ and $\rho_{n}$ are orthogonal.

Two conditions introduced by [10] characterize the perfect distinguishability between quantum operations.

Definition 2.

Two quantum operations, $\mathcal{E}$ and $\mathcal{F}$ , acting on the same principal system, denoted by $Q$ , are said to be disjoint if there is an auxiliary system $R$ , and a pure state $\left|\psi^{RQ}\right\rangle$ , such that $(\mathcal{I}^{R}\otimes\mathcal{E}^{Q})(\psi^{RQ})$ and $(\mathcal{I}^{R}\otimes\mathcal{F}^{Q})(\psi^{RQ})$ are disjoint, where $\mathcal{I}^{R}$ is the identity operation on $R$ , and the superscripts only identify which systems the operations acted on. Otherwise, they are called joint.

Intuitively, this disjointness guarantees that the outputs do not have a common part with the carefully chosen input. This disjointness is necessary to achieve perfect distinguishability. Otherwise, according to an inductive argument, there is always a non-zero common part between the outputs for an arbitrary strategy with finite uses.

Another relationship is to ensure that non-orthogonal states, $\rho$ and $\sigma$ , exist such that $(\mathcal{I}^{R}\otimes\mathcal{E}^{Q})(\rho^{RQ})$ and $(\mathcal{I}^{R}\otimes\mathcal{F}^{Q})(\sigma^{RQ})$ become orthogonal, then one can distinguish $(\mathcal{I}^{R}\otimes\mathcal{E}^{Q})(\rho^{RQ})$ and $(\mathcal{I}^{R}\otimes\mathcal{F}^{Q})(\sigma^{RQ})$ without error. This is the final step of any strategy to achieve perfect distinguishability between $\mathcal{E}$ and $\mathcal{F}$ .

Interestingly, these two conditions are not only necessary but also sufficient for the perfect distinguishability between quantum operations [10] .

Proposition 1.

Two quantum operations $\mathcal{E}$ and $\mathcal{F}$ are perfectly distinguishable if and only if: 1). They are disjoint; 2). They can map some non-orthogonal states into orthogonal states.

We remark here that ancillary systems are also allowed to achieve perfect discrimination.

In the following, we give an analytical characterization of the negation of the second condition, i.e., that $\mathcal{E}$ and $\mathcal{F}$ cannot map some non-orthogonal states into orthogonal states even with the help of ancillary system.

Remark 1.

For $\mathcal{E}(\cdot)=\sum_{i}E_{i}\cdot E_{i}^{\dagger}$ and $\mathcal{F}(\cdot)=\sum_{j}F_{j}\cdot F_{j}^{\dagger}$ , Condition 2) of Proposition 1 is equivalent to $I\notin\mathrm{span}\{({E_{i}}^{{\dagger}}F_{j});1\leq i,j\leq m\}$ .

We want to emphasize this proof of the characterization is precisely the same as given in [10]. We provide the following argument for the readers’ convenience.

Without loss of generality, we assume that $\mathcal{E}$ and $\mathcal{F}$ have the same number of Kraus operators by adding zero Kraus operators, if necessary. That is, $\mathcal{E}(\cdot)=\sum_{i=1}^{m}E_{i}\cdot E_{i}^{\dagger}$ and $\mathcal{F}(\cdot)=\sum_{j=1}^{m}F_{j}\cdot F_{j}^{\dagger}$ , cannot make non-orthogonal states orthogonal. That is, if $\rho^{RQ}$ and $\sigma^{RQ}$ are not orthogonal, then $(\mathcal{I}^{R}\otimes\mathcal{E}^{Q})(\rho^{RQ})$ and $(\mathcal{I}^{R}\otimes\mathcal{F}^{Q})(\sigma^{RQ})$ are not orthogonal. Equivalently, if pure states $\rho^{RQ}=|\psi^{RQ}\rangle\langle\psi^{RQ}|$ and $\sigma^{RQ}=|\phi^{RQ}\rangle\langle\phi^{RQ}|$ are not orthogonal, then $(\mathcal{I}^{R}\otimes\mathcal{E}^{Q})(\rho^{RQ})$ and $(\mathcal{I}^{R}\otimes\mathcal{F}^{Q})(\sigma^{RQ})$ are not orthogonal. In other words,

[TABLE]

implies

[TABLE]

That is, if $\forall 1\leq i,j\leq m$ ,

[TABLE]

is orthgonal to

[TABLE]

then

[TABLE]

The above condition is equivalent to for $\left|\psi^{RQ}\right\rangle$ and $\left|\phi^{RQ}\right\rangle$ , if

[TABLE]

for all $1\leq i,j\leq m$ , then

[TABLE]

That is, if $\forall 1\leq i,j\leq m$

[TABLE]

then

[TABLE]

For any $M\in\mathrm{L}\left(\mathcal{H}_{Q}\right)$ , one can find $\left|\phi^{RQ}\right\rangle$ and $\left|\psi^{RQ}\right\rangle$ such that $M=\operatorname{Tr}_{R}|\phi^{RQ}\rangle\langle\psi^{RQ}|$ . We know that $\forall 1\leq i,j\leq m$

[TABLE]

implies

[TABLE]

That is satisfied if and only if $I^{RQ}\in\mathrm{span}\{(I^{R}\otimes{E_{i}}^{{\dagger}}F_{j});1\leq i,j\leq m\}$ , which in turn is equivalent to $I\in\mathrm{span}\{E_{i}^{{\dagger}}F_{j};1\leq i,j\leq m\}$ .

Therefore, $\mathcal{E}$ and $\mathcal{F}$ can map some non-orthogonal states into orthogonal states, Condition 2) of Proposition 1, is equivalent to $I\notin\mathrm{span}\{({E_{i}}^{{\dagger}}F_{j});1\leq i,j\leq m\}$ .

III Two useful Lemmas

The following lemma shows that if two quantum operations are joint, then for any input state, the output states always have a common semi-definite positive component whose size is positive and depends only on the operations.

Lemma 1.

If $\mathcal{E}$ and $\mathcal{F}$ are joint, there exists $\eta>0$ , depending only on $\mathcal{E}$ and $\mathcal{F}$ , such that for any quantum state $\rho$ on a potentially larger Hilbert space $RQ$ , there is a matrix $A$ , such that $0\leq A\leq(\mathcal{I}^{R}\otimes\mathcal{E})(\rho^{RQ}),(\mathcal{I}^{R}\otimes\mathcal{F})(\rho^{RQ})$ and $\operatorname{Tr}(A)\geq\eta$ .

Proof.

It is straightforward to verify that we only need to consider $\rho$ to be a pure state. Thus, according to Schmidt decomposition, we can assume that $\rho$ is a quantum state in $\mathcal{H}_{RQ}=\mathcal{H^{\prime}}\otimes\mathcal{H}$ with the dimension of $\mathcal{H}^{\prime}$ being equal to the dimension of $\mathcal{H}$ , where $\mathcal{H}$ is the Hilbert space of $Q$ .

Our goal is to show

[TABLE]

where $\eta$ is defined as

[TABLE]

To prove this, we notice that for a fixed input $\rho^{RQ}$ , the optimization problem

[TABLE]

can be formulated as the following semidefinite program [31]:

Primal problem

[TABLE]

Dual problem

[TABLE]

In the above formula, $\Phi$ is the super-operator

[TABLE]

as

[TABLE]

where the adjoint super-operator

[TABLE]

is given by

[TABLE]

and

[TABLE]

Choose $Y=I>0$ , then $\Phi^{\dagger}(Y)=2I>0$ . This dual program is strictly feasible. Thus, the primal value and dual value are the same [38].

Now we are going back to the original problem by considering the dual problem with $\rho^{RQ}$ ranging over all possible states.

Let $\mathcal{B}$ denote the following set

[TABLE]

$\mathcal{B}$ is a compact set because it is a closed bounded set in a finite-dimensional space.

According to the compactness of $\mathcal{B}$ , we have the following

[TABLE]

for some $B_{0}\in\mathcal{B}$ and $Y_{0}\in\mathrm{Pos}\left(\mathcal{H}_{RQ}\oplus\mathcal{H}_{RQ}\right)$ .

Let $\rho_{0}^{RQ}$ be such that

[TABLE]

For $\rho_{0}^{RQ}$ , the intersection of the supports of $(\mathcal{I}^{R}\otimes\mathcal{E})(\rho_{0}^{RQ})$ and $(\mathcal{I}^{R}\otimes\mathcal{F})(\rho_{0}^{RQ})$ has a non-zero element. It indicates that there exists a non-zero $0\leq G\leq(\mathcal{I}^{R}\otimes\mathcal{E})(\rho_{0}^{RQ}),(\mathcal{I}^{R}\otimes\mathcal{F})(\rho_{0}^{RQ})$ .

According to $\Phi^{\dagger}(Y_{0})\geq I$ , we have $M+N\geq I$ . Let

[TABLE]

We can conclude that this $\eta$ satisfies the wanted property. ∎

The following observation shows that if $I\in\mathrm{span}\{E_{i}^{{\dagger}}F_{j}\}$ , then $\mathcal{E}$ and $\mathcal{F}$ cannot change the fidelity of two quantum states significantly.

Lemma 2.

If $I\in\mathrm{span}\{E_{i}^{{\dagger}}F_{j}\}$ , then, there exists $\zeta>0$ , depending only on $\mathcal{E}$ and $\mathcal{F}$ , such that for all $\rho,\sigma$ on a potentially larger Hilbert space $RQ$ ,

[TABLE]

Proof.

The condition $I\in\mathrm{span}\{E_{i}^{{\dagger}}F_{j}\}$ leads us to the existence of $\chi_{i,j}\in\mathbb{C}$ such that

[TABLE]

Using polar decomposition of the coefficient matrix $\chi_{i,j}$ , we can always assume that

[TABLE]

and $\chi_{i}\geq 0$ . We use $\chi=\max_{i}\chi_{i}$ to denote the largest $\chi_{i}$ .

For any $\rho^{RQ}$ and $\sigma^{RQ}$ , by Uhlmann’s Theorem 2, there exist $\left|\psi^{RQT}_{\rho}\right\rangle$ and $\left|\psi^{RQT}_{\sigma}\right\rangle$ being $\rho$ and $\sigma$ ’s purifications respectively, and

[TABLE]

Now we can have the following

[TABLE]

The first inequality is due to Fact 4, the monotonicity of the fidelity under partial trace. The second inequality is due to Fact 1, the strong concavity of the fidelity, and positive homogeneity.

Therefore, we choose $\zeta=\frac{1}{\chi}$ . ∎

IV main results

Our main result is as follows

Theorem 1.

The Chernoff exponent for quantum operations, Eq. 1, is finite if and only if they cannot be distinguished perfectly.

For two distinct quantum operations, $\mathcal{E}$ and $\mathcal{F}$ , it is straightforward to verify that $P_{err}\leq\exp(-n\xi^{\prime})$ for some $\xi^{\prime}>0$ by observing the following process. First, one can always find an input state $\rho$ such that $\mathcal{E}(\rho)$ and $\mathcal{F}(\rho)$ are distinct. Then we feed $\rho$ as input through the device for $n$ times. After that, the problem becomes to distinguish $\mathcal{E}(\rho)^{\otimes n}$ and $\mathcal{F}(\rho)^{\otimes n}$ . Invoking the celebrated result on the Chernoff exponent for quantum states, we know that the error probability of distinguishing two different quantum states with identical copies decays according to an exponential function. Notice that this protocol only provides an upper bound on the minimal error probability of distinguishing $\mathcal{E}$ and $\mathcal{F}$ , so one can conclude that $P_{err}\leq\exp(-n\xi^{\prime})$ for some $\xi^{\prime}>0$ .

The above arguments show that the error decays at least exponentially. In other words, $\xi_{\mathcal{E},\mathcal{F}}$ is greater than [math]. However, this scheme can be far from optimal. Perfect discrimination between unknown processes chosen from a finite set is shown to be possible. For two quantum operations that can be distinguished perfectly, $\xi_{\mathcal{E},\mathcal{F}}=\infty$ , we prove that this is the only case where $\xi_{\mathcal{E},\mathcal{F}}=\infty$ . Moreover, we provide an easy computable upper bound of $\xi_{\mathcal{E},\mathcal{F}}$ for quantum operations that can not be distinguished perfectly, i.e., $P_{err}\geq\exp(-n\xi)$ , where the parameter $\xi$ is a positive constant that depends on the two operations only.

Proof.

The only if part of Theorem 1 is trivial. The if part follows Proposition 1 in Section II, and we prove it for prior probability distribution $\Pi_{0}=\Pi_{1}=1/2$ . Also, we prove the Chernoff exponent is independent of a prior distribution in Proposition 2.

To prove the only if part of Theorem 1 under distribution $\Pi_{0}=\Pi_{1}=1/2$ , we only need to show that when either condition in Proposition 1 is violated, the error probability is at least an exponential function of the number of channel uses.

First, we suppose $\mathcal{E}$ and $\mathcal{F}$ are joint, in the sense that the produced quantum states have non-zero overlapping supports for any common input state, we show that there exists $\eta>0$ such that $P_{err,n}\geq{\eta^{n}}/{2}$ in the following:

Refer to Figure 1 for our notations. By employing Lemma 1, we observe that there exists $0\leq A_{1}\leq\rho_{1},\sigma_{1}$ such that $\operatorname{Tr}A_{1}\geq\eta$ . Then, $0\leq A_{1}^{\prime}=\mathcal{G}_{1}(A_{1})\leq\rho_{1}^{\prime},\sigma_{1}^{\prime}$ such that $\operatorname{Tr}A_{1}^{\prime}=\operatorname{Tr}A_{1}\geq\eta$ . Then, there exists $0\leq A_{2}\leq\rho_{2},\sigma_{2}$ such that $\operatorname{Tr}A_{2}\geq\eta^{2}$ . Thus, $0\leq A_{2}^{\prime}=\mathcal{G}_{2}(A_{2})\leq\rho_{1}^{\prime},\sigma_{1}^{\prime}$ such that $\operatorname{Tr}A_{2}^{\prime}=\operatorname{Tr}A_{2}\geq\eta^{2}$ $\cdots$ There exists $0\leq A_{n}\leq\rho_{n},\sigma_{n}$ such that $\operatorname{Tr}A_{n}\geq\eta^{n}$ .

By Helstrom’s celebrated result on state discrimination [14], we know that the discrimination error satisfies the following

[TABLE]

The first inequality is according to the triangle inequality and $0\leq A_{n}\leq\rho_{n},\sigma_{n}$ .

Second, suppose two quantum operations $\mathcal{E}$ and $\mathcal{F}$ can not transform non-orthogonal states into orthogonal states. This is equivalent to $I\in\mathrm{span}\{E_{i}^{*}F_{j}\}$ , as illustrated in Remark 1 at the end of Section II. We show in the following that there exists $\mu>0$ such that $P_{err,n}\geq\mu^{n}/4$ . The proof of this part is according to the observation that if two quantum operations cannot make nonorthogonal states orthogonal, they cannot change their fidelity significantly.

Refer to Figure 1 for our notations. By employing Lemma 2, we observe that there exists $\zeta>0$ such that after $n$ uses of the unknown quantum operation, the possible outcome states $\rho_{n}$ and $\sigma_{n}$ satisfy the following:

[TABLE]

where $F(\cdot,\cdot)$ denotes the fidelity of quantum states.

The first inequality is due to Lemma 2. The second inequality is due to the monotonicity of fidelity under any quantum operation.

According to the relation between fidelity and trace distance, we have

[TABLE]

where the minimization ranges across all possible output $\rho_{n}$ and $\sigma_{n}$ .

Therefore, we can choose $\mu=\zeta^{2}$ .

Putting these two conditions together, we obtain that for indistinguishable quantum operations $\mathcal{E},\mathcal{F}$ under uniform distribution,

[TABLE]

∎

If we have more than two quantum operations, suppose we have a quantum device that is secretly chosen from $\{\mathcal{E}_{1},\cdots\mathcal{E}_{r}\}$ , a known set of quantum operations according to prior probability distribution $\{\Pi_{1},\cdots,\Pi_{r}\}$ . Our goal is to see which quantum operation the quantum device implements by using the device many times. The definition of the Chernoff exponent for two quantum operations Eq.(1) can be easily generalized into multiple quantum operations, where $P_{err,min,n}$ now is defined as the infimum error probability for distinguishing multiple quantum operations with $n$ uses. One can prove that the Chernoff exponent for multiple quantum operations shares the same properties as the Chernoff exponent for two quantum operations. This exponent does not depend on the prior probability distribution, and it is infinite if these quantum operations are mutually perfectly distinguishable. Moreover, it is, at most, the minimal mutual Chernoff exponent for quantum operations.

Proposition 2.

The Chernoff exponent for multiple quantum operations, Eq. 1, does not depend on the prior distribution. Moreover, the multi-channel Chernoff exponent is upper bounded by the smallest pairwise Chernoff exponent.

Proof.

For prior $(\Pi_{1},\Pi_{2},\cdots,\Pi_{r})$ , $\Pi_{1}\geq\Pi_{2}\geq\cdots\Pi_{r}>0$ and fixed $n$ , we use $P_{err,min,n,\Pi}$ to denote the infimum error probability. $P_{err,min,n}$ denotes the infimum error probability for uniform prior $(1/r,1/r,\cdots,1/r)$ . For any discrimination scheme, we let $1-p_{n,i}$ be the probability of correctly identifying the $i$ -th channel. Then we have

[TABLE]

for any $1\leq k,l\leq r$ .

Since this inequality holds for the error probabilities in any strategy, one can just take the infimum over all strategies in each term of the inequality and have

[TABLE]

Therefore,

[TABLE]

That is, the Chernoff exponent for multiple quantum operations does not depend on prior. ∎

According to the proof, we can also conclude that

Corollary 1.

The Chernoff exponent for multiple quantum operations, Eq. 1, is infinite if and only if the quantum operations are mutually perfectly distinguishable.

Proof.

Suppose we are given a quantum operation $\mathcal{E}$ being one of the quantum operations $\mathcal{E}_{1},\mathcal{E}_{2},\cdots,\mathcal{E}_{r}$ , and any two quantum operations can be distinguished perfectly. Let a protocol produce orthogonal quantum states $\rho_{1}$ and $\rho_{2}$ for quantum operations $\mathcal{E}_{1}$ and $\mathcal{E}_{2}$ , respectively.

Now we run the protocol on $\mathcal{E}$ and measure the output. We employ the measurement which can distinguish $\rho_{1}$ and $\rho_{2}$ perfectly. If the measurement outcome corresponds to $\rho_{1}$ , then we know $\mathcal{E}$ can not be $\mathcal{E}_{2}$ ; otherwise, it can not be $\mathcal{E}_{1}$ .

Therefore, via finite uses of $\mathcal{E}$ , we can eliminate one candidate. By repeating this procedure, we can conclude that the Chernoff exponent is $\infty$ .

Otherwise, if two quantum operations, say $\mathcal{E}_{1}$ and $\mathcal{E}_{2}$ , can not be distinguished perfectly. According to the proof of Proposition 2, the multi-channel Chernoff exponent is upper bounded by the smallest pairwise exponent which is no more than the Chernoff exponent of $\mathcal{E}_{1}$ and $\mathcal{E}_{2}$ , a finite number by Theorem 1. ∎

V Conclusion and Open Problems

In this paper, we introduce the Chernoff exponent for quantum operations. We show the Chernoff exponent is finite if and only if the operations are not perfectly distinguishable. More precisely, we provide computable upper bounds of the Chernoff exponent by proving lower bounds on the error probability of distinguishing quantum operations with $n$ uses. Our result is an asymptotic generalization of the diamond norm.

There are several open questions. One relates to the local operations and classical communication (LOCC)-Chernoff distance. Motivated by the quantum Chernoff theorem [3, 25], the LOCC-Chernoff exponent studies the distinguishability of two bipartite mixed states under the constraint of LOCC, in the limit of many copies [8, 23]. There is a significant difference between the LOCC Chernoff exponent and the standard Chernoff exponent. Orthogonality does not indicate perfect LOCC distinguishability. More precisely, there exist quantum states which cannot be locally distinguished but multicopy makes them perfectly distinguishable [35, 36]. This behavior is similar to the discrimination of quantum operations. A fundamental question regarding the LOCC Chernoff exponent is still not answered: For two quantum states that are not LOCC perfectly distinguishable, even in the limit of many copies, does the LOCC discrimination error always decay exponentially? The first difficulty is we do not have a characterization of LOCC distinguishability of quantum states, even though this problem has been studied for more than 20 years [34, 33, 12, 5, 30, 16, 4].

We thank the editor and the anonymous reviewers whose comments have greatly improved this manuscript. This work is supported by ARC Discovery Early Career Researcher Award DE180100156 and ARC Discovery Program DP210102449.

Bibliography38

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] A. Acin, ”Statistical distinguishability between unitary operations.” Physical Review Letters , 87 (17): 177901, 2001.
2[2] D. Aharonov, A. Kitaev, and N. Nisan, ”Quantum circuits with mixed states.” Proceeding of the Thirtieth Annual ACM Symposium on Theory of Computation , pp. 20-30, 1997.
3[3] K. M. R. Audenaert, J. Casamiglia, R. Munoz-Tapia, E. Bagan, Ll. Masanes, A. Acin, and F. Verstraete, ”Discriminating states: the quantum Chernoff bound.” Physical Review Letters , 98 (16): 160501, 2007.
4[4] S. Bandyopadhyay, A. Cosentino, N. Johnston, V. Russo, J. Watrous, and N. Yu, ”Limitations on separable measurements by convex optimization.” IEEE Transactions on Information Theory , 61 (6): 3593, 2015.
5[5] C. H. Bennett, D. P. Di Vincenzo, T. Mor, P. W. Shor, J. A. Smolin, and B. M. Terhal, ”Unextendible product bases and bound entanglement.” Physical Review Letters , 82 (26): 5385, 1999.
6[6] H. Chernoff, ”A measure of asymptotic efficiency for tests of a hypothesis based on the sum of observations.” The Annals of Mathematical Statistics , 23 (4): 493, 1952.
7[7] Tom Cooney, Milan Mosonyi, and Mark M. Wilde, ”Strong converse exponents for a quantum channel discrimination problem and quantum-feedback-assisted communication.” Communications in Mathematical Physics , 344 (3): 797-829, 2016.
8[8] J. Calsamiglia, J. I. de Vicente, R. Muñoz-Tapia, and E. Bagan, ”Local discrimination of mixed states.” Physical Review Letters , 105 (8): 080504, 2010.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

When is the Chernoff Exponent for Quantum Operations finite?

Abstract

I Introduction

II Notations and Preliminaries

Fact 1** ([24]).**

Definition 1**.**

Fact 2** (Uhlmann’s theorem, [29]).**

Fact 3** (Fuchs-van de Graaf inequalities [11]).**

Fact 4** ([24]).**

Definition 2**.**

Proposition 1**.**

Remark 1**.**

III Two useful Lemmas

Lemma 1**.**

Proof.

Lemma 2**.**

Proof.

IV main results

Theorem 1**.**

Proof.

Proposition 2**.**

Proof.

Corollary 1**.**

Proof.

V Conclusion and Open Problems

Fact 1 ([24]).

Definition 1.

Fact 2 (Uhlmann’s theorem, [29]).

Fact 3 (Fuchs-van de Graaf inequalities [11]).

Fact 4 ([24]).

Definition 2.

Proposition 1.

Remark 1.

Lemma 1.

Lemma 2.

Theorem 1.

Proposition 2.

Corollary 1.