When is the Chernoff Exponent for Quantum Operations finite?
Nengkun Yu, Li Zhou

TL;DR
This paper investigates the conditions under which the Chernoff exponent for quantum operations is finite, establishing a clear criterion related to perfect distinguishability and providing bounds on the exponent.
Contribution
It provides a necessary and sufficient condition for the finiteness of the Chernoff exponent for quantum operations and offers upper bounds, clarifying the asymptotic error decay behavior.
Findings
Chernoff exponent is finite iff quantum operations cannot be perfectly distinguished with finite uses.
Error probability decays exponentially when operations are not perfectly distinguishable.
Super-exponential decay of error probability is ruled out.
Abstract
We consider the problem of testing two hypotheses of quantum operations in a setting of many uses where an arbitrary prior probability distribution is given. The Chernoff exponent for quantum operations is investigated to track the minimal average error probability of discriminating two quantum operations asymptotically. We answer the question, "When is the Chernoff exponent for quantum operations finite?" We show that either two quantum operations can be perfectly distinguished with finite uses, or the minimal discrimination error decays exponentially with respect to the number of uses asymptotically. That is, the Chernoff exponent is finite if and only if the quantum operations can not be perfectly distinguished with finite uses. This rules out the possibility of super-exponential decay of error probability. Upper bounds of the Chernoff exponent for quantum operations are provided.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
When is the Chernoff Exponent for Quantum Operations finite?
Nengkun Yu
Li Zhou
Centre for Quantum Software and Information,
Faculty of Engineering and Information Technology,
University of Technology Sydney, NSW 2007, Australia
Email: [email protected]
Max Planck Institute for Security and Privacy, Bochum, Germany
Abstract
We consider the problem of testing two hypotheses of quantum operations in a setting of many uses where an arbitrary prior probability distribution is given. The Chernoff exponent for quantum operations is investigated to track the minimal average error probability of discriminating two quantum operations asymptotically. We answer the question, “When is the Chernoff exponent for quantum operations finite?” We show that either two quantum operations can be perfectly distinguished with finite uses, or the minimal discrimination error decays exponentially with respect to the number of uses asymptotically. That is, the Chernoff exponent is finite if and only if the quantum operations can not be perfectly distinguished with finite uses. This rules out the possibility of super-exponential decay of error probability. Upper bounds of the Chernoff exponent for quantum operations are provided.
I Introduction
A fundamental problem in quantum information theory is to test a device that may be prepared for implementing one of many quantum operations. The testing treated in the framework of quantum mechanics is performed by inputting a quantum state and performing a quantum measurement. The general noncommutative feature and the complex structure of quantum operations make quantum statistics a much richer field than its classical counterpart.
In the degenerate case where the outputs of the quantum operations are fixed, the freedom of choosing input states becomes useless. The asymptotic behavior of the average error, in discriminating a set of quantum states with prior probability distribution is of great interest. In [26], Parthasarathy showed that the average error decays exponentially, asymptotically. Significant efforts have been made to identify the Chernoff exponent as the optimal error exponent. In two breakthrough papers, [3] and [25], the closed-form of the optimal error exponent was obtained, which can be regarded as the quantum generalization of the Chernoff bound in classical hypothesis testing [6]. Li proved that multiple Chernoff exponent equals the minimal mutual Chernoff exponent [22].
It is highly desirable to generalize the results of quantum states to quantum operations. Given that considerable experimental effort has been devoted to the field of quantum mechanics to prepare quantum systems and measure quantum states, it is of fundamental importance to develop a theory that can discriminate the different quantum operations. We note that for classical channel discrimination, the optimal exponential error rate problem has been well understood, where it is proven that adaptive choice does not improve the exponential error rate in these settings [13].
Where quantum operations are only allowed to be used once, the problem has been extensively studied, with fruitful results. By employing Holevo-Helstrom’s celebrated theorem on the one-copy quantum state discrimination [14, 17], a completely bounded trace norm, known as the diamond norm, was introduced to characterize the difference between quantum channels by Kitaev [19]. This norm becomes a fundamental tool in almost all aspects of quantum information science [2, 31, 32, 27] since it is the most physically meaningful notion of distance between quantum operations.
The problem becomes much more complicated when quantum operations are used multiple times [15, 7, 28]. Much effort has been devoted to characterizing the conditions of perfect distinguishability, in the sense that two quantum operations can be distinguished without error by a finite number of uses [1, 9, 37, 21, 20]. Unlike classical channel discrimination, unitary operations exist that cannot be distinguished without error for single-use, while multiple uses can help achieve perfect discrimination. A complete solution to this problem is obtained in [10] with a feasible, necessary, and sufficient condition.
In this paper, we investigate the concept of Chernoff exponent for quantum operations to characterize the asymptotic behavior of the average error probability of distinguishing given quantum operations under any prior probability distribution. Suppose we have a quantum device that is secretly chosen from , and a known set of two quantum operations according to a prior probability distribution . Our goal is to identify whether the device is or by using this device many times. We explore the Chernoff exponent for two quantum operations and to track the optimal error probability by using the following definition:
[TABLE]
where denotes the infimum discrimination error over all possible output states and as illustrated in Figure 1 where the quantum device is used time.
Notice that all possible strategies can be described by, or translate to, the model showed in Figure 1 with a sufficiently large ancilla system. This model is the most general scheme, and quantum operations are freely chosen. For instance, parallel uses of devices can always be simulated by sequential uses and employing swap operators.
We show that the Chernoff exponent for quantum operations is finite if and only if they cannot be distinguished perfectly with finite uses. More precisely, we show that the average error probability decays at most, according to exponential function for quantum operations, if the quantum operations can not be perfectly distinguishable. This indicates that the error probability can never decay super-exponentially, such as . Computable upper bounds on the Chernoff exponent for quantum operations are provided. Finally, we generalize our results to deal with multiple quantum operations.
II Notations and Preliminaries
We use the symbols to denote finite-dimensional Hilbert spaces over complex numbers and to denote the set of linear operators mapping from into itself. For Hermitian matrices , we use to denote their inner product. Let be the set of positive (semidefinite) matrices, and is the set of positive matrices with trace one. A pure quantum state of is just a normalized vector , while a general quantum state is characterized by a density operator . For simplicity, we use to represent the density operator of a pure state which is just the projector . A density operator can always be decomposed into a convex combination of pure states:
[TABLE]
where the coefficients are strictly positive numbers and add up to one. The support of is defined as . We say two pure states and are orthogonal if and only if their inner product is equal to zero, and the orthogonality of two density operators and is defined by the orthogonality of their supports, namely, and are orthogonal if and only if . Two density operators and are said to be disjoint if and joint if the intersection of their support contains some non-zero vectors.
There are two commonly used measures to characterize the difference between the quantum states: trace distance and fidelity. The trace distance between two density operators and is defined as
[TABLE]
where we define to be the positive square root of .
The fidelity of states and is defined to be
[TABLE]
For pure states and , .
The strong concavity property for the fidelity is quite useful, which can be formalized as
Fact 1** ([24]).**
For quantum states , and probability distributions and
[TABLE]
If and are all pure states, we obtain
[TABLE]
Definition 1**.**
We say that a pure state is a purification of some state if .
Fact 2** (Uhlmann’s theorem, [29]).**
Given quantum states , , and a purification of , it holds that , where the maximum is ranging over all purifications of .
The following fact connects the trace distance and the fidelity between two states.
Fact 3** (Fuchs-van de Graaf inequalities [11]).**
For quantum states and , it holds that
[TABLE]
For pure states and , we have
[TABLE]
The trace distance is a static measure quantifying how close two quantum states are and is closely related to the discrimination of quantum states. Let us consider the two hypotheses, and . Hypothesis assumes that a given unknown quantum state is equal to , and Hypothesis assumes that a given unknown quantum state is equal to . We assume that the prior probability distribution of and are and , respectively, which add up to one.
A physical strategy to discriminate between these two hypotheses is to perform a positive-operator valued measure (POVM) on the quantum state with two outcomes, 0 and 1. Such a POVM has two elements satisfying and , where is the identity matrix of . The aim of quantum state discrimination is to find the elements and that minimize the total error , which is
[TABLE]
This optimal error has been identified by Helstrom as expressed in the following equation
[TABLE]
A quantum operation from to is a completely positive and trace-preserving map used to describe the evolution of an open quantum system. A quantum operation can always be represented using the Kraus representation as
[TABLE]
where are the Kraus operators of satisfying , the identity of .
The following fact states that the fidelity between two states is non-decreasing under quantum operations.
Fact 4** ([24]).**
For states , , and quantum operation , it holds that
[TABLE]
Quantum operations and are said to be perfectly distinguishable with finite uses if there exists a strategy illustrated as Figure 1 such that and are orthogonal.
Two conditions introduced by [10] characterize the perfect distinguishability between quantum operations.
Definition 2**.**
Two quantum operations, and , acting on the same principal system, denoted by , are said to be disjoint if there is an auxiliary system , and a pure state , such that and are disjoint, where is the identity operation on , and the superscripts only identify which systems the operations acted on. Otherwise, they are called joint.
Intuitively, this disjointness guarantees that the outputs do not have a common part with the carefully chosen input. This disjointness is necessary to achieve perfect distinguishability. Otherwise, according to an inductive argument, there is always a non-zero common part between the outputs for an arbitrary strategy with finite uses.
Another relationship is to ensure that non-orthogonal states, and , exist such that and become orthogonal, then one can distinguish and without error. This is the final step of any strategy to achieve perfect distinguishability between and .
Interestingly, these two conditions are not only necessary but also sufficient for the perfect distinguishability between quantum operations [10] .
Proposition 1**.**
Two quantum operations and are perfectly distinguishable if and only if: 1). They are disjoint; 2). They can map some non-orthogonal states into orthogonal states.
We remark here that ancillary systems are also allowed to achieve perfect discrimination.
In the following, we give an analytical characterization of the negation of the second condition, i.e., that and cannot map some non-orthogonal states into orthogonal states even with the help of ancillary system.
Remark 1**.**
For and , Condition 2) of Proposition 1 is equivalent to .
We want to emphasize this proof of the characterization is precisely the same as given in [10]. We provide the following argument for the readers’ convenience.
Without loss of generality, we assume that and have the same number of Kraus operators by adding zero Kraus operators, if necessary. That is, and , cannot make non-orthogonal states orthogonal. That is, if and are not orthogonal, then and are not orthogonal. Equivalently, if pure states and are not orthogonal, then and are not orthogonal. In other words,
[TABLE]
implies
[TABLE]
That is, if ,
[TABLE]
is orthgonal to
[TABLE]
then
[TABLE]
The above condition is equivalent to for and , if
[TABLE]
for all , then
[TABLE]
That is, if
[TABLE]
then
[TABLE]
For any , one can find and such that . We know that
[TABLE]
implies
[TABLE]
That is satisfied if and only if , which in turn is equivalent to .
Therefore, and can map some non-orthogonal states into orthogonal states, Condition 2) of Proposition 1, is equivalent to .
III Two useful Lemmas
The following lemma shows that if two quantum operations are joint, then for any input state, the output states always have a common semi-definite positive component whose size is positive and depends only on the operations.
Lemma 1**.**
If and are joint, there exists , depending only on and , such that for any quantum state on a potentially larger Hilbert space , there is a matrix , such that and .
Proof.
It is straightforward to verify that we only need to consider to be a pure state. Thus, according to Schmidt decomposition, we can assume that is a quantum state in with the dimension of being equal to the dimension of , where is the Hilbert space of .
Our goal is to show
[TABLE]
where is defined as
[TABLE]
To prove this, we notice that for a fixed input , the optimization problem
[TABLE]
can be formulated as the following semidefinite program [31]:
Primal problem
[TABLE]
Dual problem
[TABLE]
In the above formula, is the super-operator
[TABLE]
as
[TABLE]
where the adjoint super-operator
[TABLE]
is given by
[TABLE]
and
[TABLE]
Choose , then . This dual program is strictly feasible. Thus, the primal value and dual value are the same [38].
Now we are going back to the original problem by considering the dual problem with ranging over all possible states.
Let denote the following set
[TABLE]
is a compact set because it is a closed bounded set in a finite-dimensional space.
According to the compactness of , we have the following
[TABLE]
for some and .
Let be such that
[TABLE]
For , the intersection of the supports of and has a non-zero element. It indicates that there exists a non-zero .
According to , we have . Let
[TABLE]
We can conclude that this satisfies the wanted property. ∎
The following observation shows that if , then and cannot change the fidelity of two quantum states significantly.
Lemma 2**.**
If , then, there exists , depending only on and , such that for all on a potentially larger Hilbert space ,
[TABLE]
Proof.
The condition leads us to the existence of such that
[TABLE]
Using polar decomposition of the coefficient matrix , we can always assume that
[TABLE]
and . We use to denote the largest .
For any and , by Uhlmann’s Theorem 2, there exist and being and ’s purifications respectively, and
[TABLE]
Now we can have the following
[TABLE]
The first inequality is due to Fact 4, the monotonicity of the fidelity under partial trace. The second inequality is due to Fact 1, the strong concavity of the fidelity, and positive homogeneity.
Therefore, we choose . ∎
IV main results
Our main result is as follows
Theorem 1**.**
The Chernoff exponent for quantum operations, Eq. 1, is finite if and only if they cannot be distinguished perfectly.
For two distinct quantum operations, and , it is straightforward to verify that for some by observing the following process. First, one can always find an input state such that and are distinct. Then we feed as input through the device for times. After that, the problem becomes to distinguish and . Invoking the celebrated result on the Chernoff exponent for quantum states, we know that the error probability of distinguishing two different quantum states with identical copies decays according to an exponential function. Notice that this protocol only provides an upper bound on the minimal error probability of distinguishing and , so one can conclude that for some .
The above arguments show that the error decays at least exponentially. In other words, is greater than [math]. However, this scheme can be far from optimal. Perfect discrimination between unknown processes chosen from a finite set is shown to be possible. For two quantum operations that can be distinguished perfectly, , we prove that this is the only case where . Moreover, we provide an easy computable upper bound of for quantum operations that can not be distinguished perfectly, i.e., , where the parameter is a positive constant that depends on the two operations only.
Proof.
The only if part of Theorem 1 is trivial. The if part follows Proposition 1 in Section II, and we prove it for prior probability distribution . Also, we prove the Chernoff exponent is independent of a prior distribution in Proposition 2.
To prove the only if part of Theorem 1 under distribution , we only need to show that when either condition in Proposition 1 is violated, the error probability is at least an exponential function of the number of channel uses.
First, we suppose and are joint, in the sense that the produced quantum states have non-zero overlapping supports for any common input state, we show that there exists such that in the following:
Refer to Figure 1 for our notations. By employing Lemma 1, we observe that there exists such that . Then, such that . Then, there exists such that . Thus, such that There exists such that .
By Helstrom’s celebrated result on state discrimination [14], we know that the discrimination error satisfies the following
[TABLE]
The first inequality is according to the triangle inequality and .
Second, suppose two quantum operations and can not transform non-orthogonal states into orthogonal states. This is equivalent to , as illustrated in Remark 1 at the end of Section II. We show in the following that there exists such that . The proof of this part is according to the observation that if two quantum operations cannot make nonorthogonal states orthogonal, they cannot change their fidelity significantly.
Refer to Figure 1 for our notations. By employing Lemma 2, we observe that there exists such that after uses of the unknown quantum operation, the possible outcome states and satisfy the following:
[TABLE]
where denotes the fidelity of quantum states.
The first inequality is due to Lemma 2. The second inequality is due to the monotonicity of fidelity under any quantum operation.
According to the relation between fidelity and trace distance, we have
[TABLE]
where the minimization ranges across all possible output and .
Therefore, we can choose .
Putting these two conditions together, we obtain that for indistinguishable quantum operations under uniform distribution,
[TABLE]
∎
If we have more than two quantum operations, suppose we have a quantum device that is secretly chosen from , a known set of quantum operations according to prior probability distribution . Our goal is to see which quantum operation the quantum device implements by using the device many times. The definition of the Chernoff exponent for two quantum operations Eq.(1) can be easily generalized into multiple quantum operations, where now is defined as the infimum error probability for distinguishing multiple quantum operations with uses. One can prove that the Chernoff exponent for multiple quantum operations shares the same properties as the Chernoff exponent for two quantum operations. This exponent does not depend on the prior probability distribution, and it is infinite if these quantum operations are mutually perfectly distinguishable. Moreover, it is, at most, the minimal mutual Chernoff exponent for quantum operations.
Proposition 2**.**
The Chernoff exponent for multiple quantum operations, Eq. 1, does not depend on the prior distribution. Moreover, the multi-channel Chernoff exponent is upper bounded by the smallest pairwise Chernoff exponent.
Proof.
For prior , and fixed , we use to denote the infimum error probability. denotes the infimum error probability for uniform prior . For any discrimination scheme, we let be the probability of correctly identifying the -th channel. Then we have
[TABLE]
for any .
Since this inequality holds for the error probabilities in any strategy, one can just take the infimum over all strategies in each term of the inequality and have
[TABLE]
Therefore,
[TABLE]
That is, the Chernoff exponent for multiple quantum operations does not depend on prior. ∎
According to the proof, we can also conclude that
Corollary 1**.**
The Chernoff exponent for multiple quantum operations, Eq. 1, is infinite if and only if the quantum operations are mutually perfectly distinguishable.
Proof.
Suppose we are given a quantum operation being one of the quantum operations , and any two quantum operations can be distinguished perfectly. Let a protocol produce orthogonal quantum states and for quantum operations and , respectively.
Now we run the protocol on and measure the output. We employ the measurement which can distinguish and perfectly. If the measurement outcome corresponds to , then we know can not be ; otherwise, it can not be .
Therefore, via finite uses of , we can eliminate one candidate. By repeating this procedure, we can conclude that the Chernoff exponent is .
Otherwise, if two quantum operations, say and , can not be distinguished perfectly. According to the proof of Proposition 2, the multi-channel Chernoff exponent is upper bounded by the smallest pairwise exponent which is no more than the Chernoff exponent of and , a finite number by Theorem 1. ∎
V Conclusion and Open Problems
In this paper, we introduce the Chernoff exponent for quantum operations. We show the Chernoff exponent is finite if and only if the operations are not perfectly distinguishable. More precisely, we provide computable upper bounds of the Chernoff exponent by proving lower bounds on the error probability of distinguishing quantum operations with uses. Our result is an asymptotic generalization of the diamond norm.
There are several open questions. One relates to the local operations and classical communication (LOCC)-Chernoff distance. Motivated by the quantum Chernoff theorem [3, 25], the LOCC-Chernoff exponent studies the distinguishability of two bipartite mixed states under the constraint of LOCC, in the limit of many copies [8, 23]. There is a significant difference between the LOCC Chernoff exponent and the standard Chernoff exponent. Orthogonality does not indicate perfect LOCC distinguishability. More precisely, there exist quantum states which cannot be locally distinguished but multicopy makes them perfectly distinguishable [35, 36]. This behavior is similar to the discrimination of quantum operations. A fundamental question regarding the LOCC Chernoff exponent is still not answered: For two quantum states that are not LOCC perfectly distinguishable, even in the limit of many copies, does the LOCC discrimination error always decay exponentially? The first difficulty is we do not have a characterization of LOCC distinguishability of quantum states, even though this problem has been studied for more than 20 years [34, 33, 12, 5, 30, 16, 4].
We thank the editor and the anonymous reviewers whose comments have greatly improved this manuscript. This work is supported by ARC Discovery Early Career Researcher Award DE180100156 and ARC Discovery Program DP210102449.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. Acin, ”Statistical distinguishability between unitary operations.” Physical Review Letters , 87 (17): 177901, 2001.
- 2[2] D. Aharonov, A. Kitaev, and N. Nisan, ”Quantum circuits with mixed states.” Proceeding of the Thirtieth Annual ACM Symposium on Theory of Computation , pp. 20-30, 1997.
- 3[3] K. M. R. Audenaert, J. Casamiglia, R. Munoz-Tapia, E. Bagan, Ll. Masanes, A. Acin, and F. Verstraete, ”Discriminating states: the quantum Chernoff bound.” Physical Review Letters , 98 (16): 160501, 2007.
- 4[4] S. Bandyopadhyay, A. Cosentino, N. Johnston, V. Russo, J. Watrous, and N. Yu, ”Limitations on separable measurements by convex optimization.” IEEE Transactions on Information Theory , 61 (6): 3593, 2015.
- 5[5] C. H. Bennett, D. P. Di Vincenzo, T. Mor, P. W. Shor, J. A. Smolin, and B. M. Terhal, ”Unextendible product bases and bound entanglement.” Physical Review Letters , 82 (26): 5385, 1999.
- 6[6] H. Chernoff, ”A measure of asymptotic efficiency for tests of a hypothesis based on the sum of observations.” The Annals of Mathematical Statistics , 23 (4): 493, 1952.
- 7[7] Tom Cooney, Milan Mosonyi, and Mark M. Wilde, ”Strong converse exponents for a quantum channel discrimination problem and quantum-feedback-assisted communication.” Communications in Mathematical Physics , 344 (3): 797-829, 2016.
- 8[8] J. Calsamiglia, J. I. de Vicente, R. Muñoz-Tapia, and E. Bagan, ”Local discrimination of mixed states.” Physical Review Letters , 105 (8): 080504, 2010.
