Process estimation in qubit systems: a quantum decision theory approach
Ivan Maffeis, Seid Koudia, Abdelhakim Gharbi, Matteo G. A. Paris

TL;DR
This paper applies quantum decision theory to analyze process discrimination and estimation in qubit systems, exploring optimal measurement strategies, the role of entanglement, and noise effects to improve quantum process detection.
Contribution
It introduces a quantum decision theory framework for process discrimination and estimation in qubits, including optimal strategies and the impact of entanglement and noise.
Findings
Optimal measurement strategies for process discrimination are identified.
Entanglement enhances stability but not precision in estimation.
Background noise affects discrimination performance and strategies.
Abstract
We address quantum decision theory as a convenient framework to analyze process discrimination and estimation in qubit systems. In particular we discuss the following problems: i) how to discriminate whether or not a given unitary perturbation has been applied to a qubit system; ii) how to determine the amplitude of the minimum detectable perturbation. In order to solve the first problem, we exploit the so-called Bayes strategy, and look for the optimal measurement to discriminate, with minimum error probability, whether or not the unitary transformation has been applied to a given signal. Concerning the second problem, the strategy of Neyman and Pearson is used to determine the ultimate bound posed by quantum mechanics to the minimum detectable amplitude of the qubit transformation. We consider both pure and mixed initial preparations of the qubit, and solve the corresponding binary…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
∎
11institutetext: I. Maffeis and M. G. A. Paris 22institutetext: Quantum Technology Lab, Dipartimento di Fisica Aldo Pontremoli, Università degli Studi di Milano, I-20133 Milano, Italy 33institutetext: S. Koudia and A. Gharbi 44institutetext: Laboratoire de Physique Théorique, Faculté des Sciences Exacte, Université de Bejaia, 06000 Bejaia, Algeria
Process estimation in qubit systems: a quantum decision theory approach
Ivan Maffeis
Seid Koudia
Abdelhakim Gharbi
Matteo G. A. Paris
Abstract
We address quantum decision theory as a convenient framework to analyze process discrimination and estimation in qubit systems. In particular we discuss the following problems: i) how to discriminate whether or not a given unitary perturbation has been applied to a qubit system; ii) how to determine the amplitude of the minimum detectable perturbation. In order to solve the first problem, we exploit the so-called Bayes strategy, and look for the optimal measurement to discriminate, with minimum error probability, whether or not the unitary transformation has been applied to a given signal. Concerning the second problem, the strategy of Neyman and Pearson is used to determine the ultimate bound posed by quantum mechanics to the minimum detectable amplitude of the qubit transformation. We consider both pure and mixed initial preparations of the qubit, and solve the corresponding binary decision problems. We also analyze the use of entangled qubits in the estimation protocol and found that entanglement, in general, improves stability rather than precision. Finally, we take into account the possible occurrence of different kinds of background noise and evaluate the corresponding effects on the discrimination strategies.
††journal: Quantum Information Processing
1 Introduction
The existence of non-orthogonal quantum states is one of the fundamental traits of quantum mechanics, and has profound implications on its applications. On the one hand, non-orthogonality poses limitations to the fundamental challenge of quantum state discrimination yue75 ; hel67 ; hel68 ; hel76 ; iva87 ; die88 ; per88 ; bgu04 ; che04 ; tom08 ; bgu10 ; jex03 ; jex04 ; jex06 and, on the other hand, it may be exploited as a resource in quantum technologies, e.g. in quantum cryptography ben92 ; eke91 ; gol95 ; guo95 ; par00 ; luc05 ; ben14 ; sca14 ; ave10 ; bri12 ; mes0 ; mes1 ; mes2 . Non-orthogonality also influences other fundamental tasks in quantum information processing, and in particular process discrimination dar02 ; lai09 ; inv10 ; dev12 ; wit14 ; bae15 ; tra17 ; reh18 ; che18 , which itself represents a crucial ingredient for quantum simulations and quantum interferometry tak10 ; int97 ; ral00 .
In this paper, we address process discrimination in qubit systems and exploit results from quantum decision theory in order to optimize the discrimination strategy dua08 ; cao16 , i.e. to minimize the impact of non-orthogonality, and to derive the corresponding ultimate bounds to the probability of error in the detection of the perturbation, and the discrimination precision of the perturbation amplitude. The possible advantages resulting from the use of entanglement are also explored in details.
The scheme we are going to consider is the following: a single- or two-qubit system is prepared in a given initial state, then, with a certain unknown probability, it is subjected to the action of a given unitary operator , being the generator of the perturbation and its amplitude. Finally, the system is measured, in order to detect whether or not the unitary transformation has perturbed the system. The problem is equivalent to that of discriminating the unperturbed state of the system from the perturbed one, while accepting a probability of error. A second, complementary, goal is to determine the minimum detectable value of the amplitude which leads to discriminable outputs. The precision obtained by using one- or two-qubit will be compared in order to reveal whether the use of entanglement leads to some advantages, either in ideal conditions or in the presence of noise.
The paper is structured as follows. In Section 2 we establish notation and briefly review few fundamental results in quantum discrimination theory, also illustrating the different figures of merit employed in the so-called Bayes and Neyman-Pearson discrimination strategies. In Section 3 we employ Bayes strategy to distinguish between different processes with minimum error using one- and two-qubit probes, also when different kind of noise occurs. In Section 4, we exploit Neyman-Pearson strategy to find the ultimate bound to the minimum detectable perturbation, discrimination, also addressing the use of entanglement and the occurrence of noise. Finally, Section 5 is closing the paper with some concluding remarks.
2 Quantum decision theory
Let us consider a qubit system and assume it may be prepared in one of the two states with prior probabilities
such that , and .
Our goal is that of inferring the state of the qubit, basing our decision on the outcome of a measurement performed on the system. To this aim we should implement a detection scheme, i.e. a POVM (Positive Operator-Valued Measure) , where the ’s are positive semi-definite, , , and sum to the identity .
As we already mentioned, we cannot, even in principle, perfectly distinguish the two states, unless they have orthogonal supports. For non-orthogonal states, we cannot achieve perfect discrimination. However, we may seek for an optimal discrimination strategy, which minimise/maximise, in average, a given loss/gain function. In the following two paragraphs, we briefly review two relevant strategies used in quantum hypothesis testing, which will be also at the basis of our approach to process estimation in qubit systems.
The first paragraph is devoted to Bayes strategy, which aims to the POVM minimising the average probability of error in the decision process. Bayes strategy is suitable for those situations where the two possible outcomes of the measurement are equally important for the experimenter and thus the two error probabilities should be jointly minimised. The paradigmatic situation where the Bayes strategy is employed is binary quantum communication, in which two classical symbols are encoded onto quantum states of a physical system and quantum decision theory is exploited to determine the optimal receiver at the output of the communication channel.
On the other hand, there are several situations of interest where one of the two events, usually referred to the alternative hypothesis, is expected to occur more rarely with respect to the other one, called null hypothesis. In these cases, the detection of alternative hypothesis is the main task of the measurement. The so-called Neyman-Pearson (NP) strategy is relevant for this context, aiming at maximising the detection probability of the alternative hypothesis while accepting a possible false-alarm probability, which is the probability of inferring the alternative hypothesis when the null hypothesis is instead true. A possible paradigmatic situation where NP strategy may be successfully implemented is the interferometric detection of gravitational waves.
2.1 Bayes discrimination strategy
Bayes strategy aims at minimising the average probability of error of the decision process bgu10 , i.e. the quantity
[TABLE]
where is the probability of inferring when is the actual state of the system. In turn, Bayes strategy is also referred to as minimum-error discrimination strategy mes0 ; mes1 ; mes2 . Since we are here considering binary decisions, the probability of error simplifies to
[TABLE]
where
[TABLE]
This operator is hermitian, though not positive definite. It is sometimes referred to as the Bayesian characteristic operator.
As we can see from (1), the probability of error is minimized when the measurement operator is the projector on the positive part of . It is thus a projective valued measure (PVM) given by where and are the eigenvalues of . The resulting minimum probability of error may be written as:
[TABLE]
where the trace norm of an operator is defined as . The quantity in Eq. (3) is usually referred to as the Helstrom bound to the error probability in binary state discrimination. For pure states the Helstrom bound may be rewritten as
[TABLE]
where is the overlap between the two states. We can easily see that for vanishing overlap the probability of error is also vanishing as , whereas for large (), the discrimination process approaches pure guessing ().
2.2 Neyman-Pearson discrimination strategy
Neyman-Pearson strategy aims at maximising the detection probability of the alternative hypothesis, at a fixed value of the false-alarm probability. Let us refer to the two hypotheses as the null and the alternative hypothesis respectively, where corresponds to the qubit being in the state . The basis of NP strategy is in fact a trade-off between the probability of detection of alternative hypothesis, , and the false-alarm probability, . A maximum threshold of false-alarm probability is fixed as acceptable, given the nature of the physical problem where the decision strategy is implemented. The searching of the POVM maximising the detection probability corresponds to a Lagrange maximisation problem where the value of is taken as a constraint. The optimal POVM is then the one maximising the Lagrange functional hel76 :
[TABLE]
where is a Lagrange multiplier and
[TABLE]
is the Lagrange operator. In order to maximise , the POVM needs to be a projector on the positive eigenvalues of the operator . The optimal measurement scheme according to Neyman-Pearson strategy is thus a projective one where , being the eigenvalues of and the corresponding eigenvectors. We notice that, in solving the eigenvalue problem, different values of correspond to different values of the accepted false alarm probability and thus to different Neyman-Pearson strategies.
Once the eigenvalues of are found, the strategy becomes clear: when the measurement of gets a positive outcome, the alternative hypothesis is inferred as true, otherwise the null hypothesis is inferred. For pure states the detection probability can be written in terms of after eliminating the Lagrange multiplier , obtaining:
[TABLE]
Eq. (6) shows that we may have unit detection probability as far as we accept a false-alarm probability larger than the overlap between the two pure states involved in the problem. As a consequence, as the overlap is large, the detection of the alternative hypothesis becomes hard, as either the detection probability cannot achieve the unit value without accepting extremely large margins of error (false alarm).
3 Bayes approach to process detection in qubit systems
3.1 Single-qubit states
In this section we apply Bayes strategy to quantum binary discrimination to observe whether or not a given unitary perturbation has been applied to a qubit system. The detection scheme is schematically illustrated in Fig. 1.
A single-qubit system is initially prepared in an input quantum state and it is then let to evolve through a channel where it may undergo a transformation , being the perturbation amplitude. At the output, i.e .after the (possible) transformation we want to know whether we have or
[TABLE]
as output state. The problem reduces to discrimination between the two states, and since they do not have, in general, orthogonal supports, quantum decision theory is the natural framework to adopt. In the following, we will employ Bayes strategy for single-qubit signals, as well as two-qubit ones, in order to detect perturbations induced by a Pauli matrix and, without loss of generality, we will assume
[TABLE]
as the transformation which may take place into the channel. Upon writing the initial single-qubit state in a Bloch form , with , , the transformed state is given by where the transformed components are given by
[TABLE]
The characteristic operator in (2) becomes , with:
[TABLE]
In this context, and are respectively the prior probabilities for the unitary transformation to have effect on the initial quantum state or not (). Upon evaluating explicitly trace and determinant of , the probability of error has the following form:
[TABLE]
in which . If the state is pure, , we indeed recover Eq. (4) since we have . In the limiting cases where both eigenvalues are positive (negative) discrimination reduces to pure guessing, i.e. the inference is made without looking at data. We may summarize the situation as follows:
[TABLE]
Let us now assume (no a priori information about the occurrence of the perturbation) and rewrite (11) in terms of the initial state purity
[TABLE]
The probability of error achieves its minimum when the input signal is pure (), and its projection on the direction of the generator vanishes, i.e. for pure states lying in the orthogonal plane of the generator . In this case we have
[TABLE]
which only depends on the parameter of the transformation. This is the least possible error we can get. Any discrimination procedure in the Bayesian sense is thus preparation dependent, at least for the present single-qubit case. We also note that all the above results generalise, mutatis mutandis, to any generator ().
3.2 Two-qubit states
Let us now consider the scheme in Fig. 2, where we assume that the qubit which may possibly subject to the transformation is initially prepared in an entangled state with another qubit.
Our goal is to compare the performance of this scheme against those if the single-qubit one at fixed use of the resource, here intended as the device that may impose the perturbation. Starting with a singlet state , the density matrix of the corresponding perturbed state may be written as
[TABLE]
where the perturbation is acting on the first qubit in the singlet. The Bayesian characteristic operator is thus given by
[TABLE]
and the corresponding eigenvalues by \xi_{\pm}=\frac{1}{2}\Big{[}\left(z_{1}-z_{0}\right)\pm\sqrt{1-4z_{0}z_{1}\cos^{2}\lambda}\Big{]}. The error probability reads as follows:
[TABLE]
Using entanglement we may thus achieve the ultimate bound in Eq. (14) and, remarkably, the same results, i.e. the same value of in Eq. (17) is obtained for any initial two-qubit preparation of the form
[TABLE]
provided that . In addition, under the same condition, Eq. (17) also holds when the perturbation is generated by any of the , . The class of states in Eq. (18) includes maximally entangled states. Upon comparing Eq. (18) with Eq. (13), we see that entanglement improves the discrimination process in terms of stability, since the probability of error does not depend on the projection of the Bloch vector on the axis corresponding to the generator of the perturbation.
Let us now consider a generic mixture of Bell states:
[TABLE]
After lengthy but straightforward calculation the characteristic operator can be obtained (not shown here) and, in turn, the following probability of error:
[TABLE]
3.3 Bayes strategy in presence of noise
A possible generalization of the problem is considering a situation where some source of noise acts during propagation of the signal. In particular, we focus on the two-qubit case, assuming that noise occurs to both parties and takes place before the possible perturbation . A schematic diagram of the experimental protocol is shown in Fig. 3. We present here the analysis of the effects on the same initial entangled state of four different kind of noise: the three formalized by Pauli matrices and the depolarizing noise.
The analysis here presented considers as first case the bit-flip noise, which is described by the completely positive map
[TABLE]
where are the Kraus operators:
[TABLE]
In these equations, and are the probabilities for respectively the first qubit (which has probability to undergo the unitary transformation) and the second qubit to be subject to noise. The probability of occurring noise is assumed as independent on the two channels. If the initial state is , we get
[TABLE]
Using Eq. (19) we obtain:
[TABLE]
Similar considerations can be conducted for the phase-flip noise by replacing in Eq. (21) to . For the same initial state considered above we get:
[TABLE]
and
[TABLE]
No dependence on the noise is present, i.e. the discrimination power for the studied protocol is not affected by the presence of phase-flip noise. Analogue results are obtained when discussing the phase-bit-flip noise, which is described by the same previous set of Kraus operators by formally replacing with .
Considering as a last case the depolarizing noise
[TABLE]
where is the probability that the state is transformed to a maximally mixed state , results of Eq. (19) may be used, leading to
[TABLE]
As before, this is independent of the mean value of the generator of the perturbation. On the other hand, the probability of error depends on the noise parameter , and it increases with . To summarize: when the noise acts on an eigenspace of the perturbation generator, the decision process is affected, and accordingly, the probability of error is increased. On the other hand, when the noise acts on an orthogonal space of the generator of the perturbation , the probability of error of making a decision results unchanged and it achieves its minimal value given in Eq. (25).
4 Neyman-Pearson strategy and the minimum detectable perturbation
4.1 Pure states
In this Section we address the problem of evaluating the minimum detectable value of the perturbation amplitude , i.e. the minimum value of in making and discriminable. To this aim, we have to maximise the detection probability of the alternative hypothesis, which is the basis of the so-called Neyman-Pearson strategy, to optimise binary decision.
We start from a single qubit initially prepared in a pure state. In this case, as it was observed in Section 2, the optimal NP strategy may be analytically determined and the characteristic equation takes the form of the expression shown in Eq. (6), where is the overlap between the initial and the perturbed states. Given the expression of , it is clear that the eigenstates of are not modified by the action of the perturbation (up to an irrelevant phase), and thus they are not suitable to detect any value of .
In order to define the minimum detectable perturbation, it is necessary to define a criterion to discern the regimes for which the detection probability can be considered large. The first criterion to be employed is an “absolute” one: a perturbation amplitude is detectable if it leads to a detection probability . A lower detection rate would indeed make the dataset useless, as no reliable information may be extracted in that case. We also address a “relative” criterion, for which a perturbation is considered detectable when it leads to . In order to perform our analysis, we rewrite the expression of in terms of . Concerning the expression of the detection probability, it is clear from Eq. (6) that unit detection probability is reached whenever , i.e. . In principle, in this regime is obtained and thus we may detect arbitrarily small perturbation. However, the condition is -dependent and this constraint poses a lower bound on the detectable amplitude. This is illustrated in the left panel of Fig. 4, where a contour plot of as function of and is shown. The good area is the white one above the red straight line .
The other branch of the function is less trivial to analyze. Nevertheless, it is easy to observe that increases with and , and then the condition is satisfied in the area between the above mentioned line and the curve of equation , which is the red arc of Fig. 4 (both panels). The entire set of couples satisfying the absolute criterion can therefore be graphically represented as the gray-shadowed area in the right panel of Fig. 4. The above Equation may be inverted to make explicit, leading to . The minimum detectable perturbation can then be calculated upon recalling the expression of , leading to
[TABLE]
In order to make sense of the right-hand side of Eq. (28), the possible values of must be restricted to . This is a limitation on the state construction: if we fix a false-alarm probability , only states satisfying this condition may lead to a detection probability larger than . In the rest of our discussion we focus on those states, since for we have , and any state preparations corresponds to an arbitrarily small detectable perturbation.
As it was noted above, the optimal preparation of the input signal corresponds to . In this case, varying , we have . Accepting as instance no false alarm (), the minimum detectable perturbation parameter is calculated as , whereas all the unitary transformations with parameter smaller than cannot be detected. When instead the accepted false-alarm probability is fixed to (the largest possible value when considering this branch), all the possible transformation parameters are available: . This is yet a relative advantage, because of the trade-off between sensitivity and false-alarm probability. In particular, for values of we have . If takes its maximum allowed value, that is , then independently from the value of the false-alarm probability.
Let us now address the analysis of the second above-mentioned criterion, for which the minimum detectable perturbation leading to is targeted. As in previous cases, it is easy to address the situation when , for which . In this case, the criterion may be easily inverted and immediately leads to . The corresponding points in the plane are shown in Fig. 5. In the opposite case (), the inequality has to be solved. After some algebra, we may rewrite this condition as
[TABLE]
The entire area for which the detection probability can be considered large with respect to is gray coloured in Fig. 5. Upon increasing , the area is decreasing in size, the vertical line moving left.
The minimum detectable perturbation is thus given by
[TABLE]
depending on the fixed value of and on the state preparation with respect to the -axis, . The restriction on the possible state preparation reads as follows: .
4.2 Mixed states
Following the same steps for mixed states, we are led to diagonalise the Lagrange characteristic operator , which is expressed in terms of the Bloch representation of the initial and the perturbed states
[TABLE]
The detection and the false alarm probabilities are respectively given by:
[TABLE]
where and . After careful inspection of the domains of validity, we may invert the above Equations, obtaining the characteristic expression as follows:
[TABLE]
where , and the two critical values are given by
[TABLE]
The characteristic function is graphically represented in Fig. 6 for fixed values of and , which correspond to fix and , i.e. to tuning the initial preparation and the setup (remind that , where is the purity of the initial preparation). The corresponding minimum detectable perturbation is then given by
[TABLE]
with the condition
[TABLE]
Once again, this is a restriction on the initial preparation of the system. When the inequality is saturated, the only parameter that can be detected is , otherwise the best preparation is . Of course, vanishes as far as approaches the limiting value .
4.3 Pure two-qubit states
Here we address possible enhancement coming from coupling the qubit that might undergo a perturbation to another qubit which is left unperturbed. We will consider the same setup of the Bayesian analysis, reported in in Fig. 2. We obtained before that for a generic pure state described in the computational basis of two qubits by (18) and satisfying the overlap has a universal value independent of the preparation; , and thus . If we compare this result with what we found in the case of generic single qubit state, we see that they match up whenever the state is pure and the projection on the generator axis vanishes. The corresponding minimum detectable perturbation is given by
[TABLE]
which does not depend on the initial preparation of the system and ranges from [math] to for ranging from to [math]. As we already concluded for the Bayesian case, the effect of entanglement is to enhance the overall stability of the discrimination/estimation scheme.
4.4 Mixed two-qubit states
Let us consider here a generic Bell-diagonal mixed state , . The Lagrange operator for this preparation may be decomposed into a direct sum of two operators, each one acting on the orthogonal subspaces generated by and respectively. In particular, we have
[TABLE]
where
[TABLE]
and
[TABLE]
The two operators may be brought to a matrix form as
[TABLE]
with
[TABLE]
For the sake of clarity, let us discuss explicitly how to derive the characteristic function. The relevant quantities for discussing the eigenvalues of are trace and determinant of . We have
[TABLE]
where
[TABLE]
The two critical values of , corresponding to vanishing determinant, are
[TABLE]
Accordingly, three regimes are identified, summarized as follows:
[TABLE]
where are the eigenvalues of the operator . Upon carrying out the same analysis for , we identify the critical values for the Lagrange multiplier. Depending on the initial parameters , the intervals and are contained one in the other. Focusing on the case , it is observed that . The characteristic function may then be explicitly parameterized by as follows:
[TABLE]
An example of this characteristic function with a particular choice of the preparation parameters and the perturbation parameter is illustrated in Fig. 7. Similar calculations can be conducted in the other case for which: .
Using arguments similar to those developed in this Section we may deal with discrimination in the presence of noise and find the minimum detectable perturbation also in those cases. Finally, we notice that the use of entanglement can positively affect the discrimination process of quantum states by confirming the overall precision obtained in the single-qubit protocol, while enhancing the stability of the detection scheme by removing the dependence on the preparation of the initial state.
5 Conclusions
In this paper, we have applied concepts and methods from quantum decision theory to address process discrimination and estimation in qubit systems. In particular, we have discussed the problem of discriminating whether or not a given unitary perturbation was applied to a qubit system, as well as the complementary problem of evaluating the minimum detectable perturbation amplitude leading to discriminable outputs. Our approach has allowed us to obtain several results which are summarised in the following.
We have seen that entanglement may represent a resource, in particular for improving the stability of the discrimination strategies. Using single-qubit pure probes, the characteristic function for both Bayes and NP strategy does explicitly depend on the initial preparation, whereas using two-qubit probes this dependence disappears. Bell-state probes are shown to be optimal for both strategies. Nevertheless, they are not the only choice, since there exists an entire class of states, depending on the generator of the unitary perturbation, for which both strategies are optimised. In this sense, it has been shown that not only Bell states belong to this class, but there exists also a set of non-balanced states simultanously optimal for each generator.
A similar analysis has been performed for single- and two-qubit mixed states, for which we found novel analytic solutions for Bayes error probability and the NP characteristic function. In this way, we found that optimal preparations correspond to pure states with a vanishing value of the generator expectation value. We have then used our results to explicitly discuss the effects of possible background noises on the discrimination performance of the various strategies and schemes. Overall, besides the exact quantification of noise effects, we have the rather intuitive conclusion that when we bring noise into play, it has detrimental effects only when it acts on the eigenspaces of the generator, whereas it has no effects when acting on orthogonal subspaces.
Acknowledgements
MGAP is member of GNFM-INdAM.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1) H. P. Yuen, R. S. Kennedy, M. Lax, Optimum testing of multiple hypotheses in quantum detection theory , IEEE Trans. Inf. Theory, IT 21 , 125 (1975).
- 2(2) C. W. Helstrom, Detection Theory and Quantum Mechanics I , Inform. Control 10 , 254 (1967).
- 3(3) C. W. Helstrom, Detection Theory and Quantum Mechanics II , Inform. Control 13 , 156 (1968).
- 4(4) C. W. Helstrom, Quantum detection and estimation theory , (Academic Press, 1976).
- 5(5) I. D. Ivanovic, How to differentiate between non-orthogonal states , Phys. Lett. A 123 , 257 (1987).
- 6(6) D. Dieks, How to differentiate between non-orthogonal states , Phys. Lett. A 126 , 303 (1988).
- 7(7) A. Peres, How to differentiate between non-orthogonal states , Phys. Lett. A 128 , 19 (1988).
- 8(8) J. A. Bergou, U. Herzog and M. Hillery, Discrimination of Quantum States , Lect. Not. Phys. 649 , 415 (2004).
