Optimal discrimination of single-qubit mixed states
Graeme Weir, Stephen M. Barnett, Sarah Croke

TL;DR
This paper introduces an analytical algebraic method for optimally discriminating single-qubit mixed quantum states with minimum error, applicable to any number of states and prior probabilities, complementing geometric approaches.
Contribution
It provides a constructive, algebraic solution to the minimum-error discrimination problem for single-qubit mixed states, expanding beyond existing geometric methods.
Findings
Analytical Helstrom condition-based method developed
Applicable to any number of states and prior probabilities
Complements geometric approaches
Abstract
We consider the problem of minimum-error quantum state discrimination for single-qubit mixed states. We present a method which uses the Helstrom conditions constructively and analytically; this algebraic approach is complementary to existing geometric methods, and solves the problem for any number of arbitrary signal states with arbitrary prior probabilities.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Optimal discrimination of single-qubit mixed states
Graeme Weir
School of Physics and Astronomy, University of Glasgow, Glasgow G12 8QQ, United Kingdom
Stephen M. Barnett
School of Physics and Astronomy, University of Glasgow, Glasgow G12 8QQ, United Kingdom
Sarah Croke
School of Physics and Astronomy, University of Glasgow, Glasgow G12 8QQ, United Kingdom
Abstract
We consider the problem of minimum-error quantum state discrimination for single-qubit mixed states. We present a method which uses the Helstrom conditions constructively and analytically; this algebraic approach is complementary to existing geometric methods, and solves the problem for any number of arbitrary signal states with arbitrary prior probabilities. It has long been known that the minimum error probability is given by the trace of the Lagrange operator, . The remarkable feature of our approach is the central role played not by , but by its inverse.
I Introduction
A fundamental task in quantum theory, and quantum information in particular, is that of distinguishing between different possible preparation procedures: quantum state discrimination. In quantum theory measurement is an active, rather than a passive process: there are unavoidable restrictions on which observables may be jointly measured, and generically, measurement causes disturbance. The limits to state discrimination thus are inextricably linked with the ultimate limits of measurement in quantum theory. The problem is thus ubiquitous in the field of quantum information and indeed more broadly, with applications including, for example, discrimination of operations in quantum metrology bae2015quantum , bounding the dimension of a system Hilbert space given incomplete information brunner2013dimension , sharing information through imperfect cloning buvzek1996quantum ; bruss1998optimal ; bruss2000phase ; keyl1999optimal , and others bae2015quantum .
The problem of quantum state discrimination is perhaps most naturally expressed as a problem in quantum communications, in which it is stated as follows: the quantum states make up an “alphabet” which can be used to craft a message. A sender, Alice, sends a message to a receiver, Bob, in which each has some a priori probability of being sent. Unless the signal states are mutually orthogonal, there will be some error in any attempt to determine which state was sent. If Bob knows the set of signal states and their probabilities, what measurement strategy should he use in order to optimally decipher Alice’s message?
The study of quantum state discrimination has a long history, beginning with the pioneering work of Helstrom, Holevo, and others in the 1960s and 1970s helstrom1976quantum ; holevo1978asymptotically , who sought to understand the fundamental limits imposed by quantum theory on optical communications. Since then there has been much further development, with the construction of strategies based on various figures of merit ivanovic1987differentiate ; dieks1988overlap ; peres1988differentiate ; jaeger1995optimal ; chefles1998unambiguous ; chefles2001unambiguous ; sasaki1999accessible ; chitambar2013revisiting ; croke2006maximum ; barnett1997experimental ; andersson2002minimum ; chou2004minimum ; ha2013complete ; bae2013minimum ; rosati ; bruss1998optimal ; bruss2000phase ; keyl1999optimal ; yuen1975optimum ; helstrom1976quantum ; holevo2011probabilistic , as well as ones that interpolate between these herzog2015optimal ; fiuravsek2003optimal . We are interested here in the minimum error strategy, based on perhaps the most natural figure of merit: what is the highest possible probability of correctly identifying the state?
Necessary and sufficient conditions which an optimal measurement must satisfy are known helstrom1976quantum , and further, the problem may be cast as a semi-definite program, which may be solved efficiently numerically jevzek2002finding . If an analytical solution is desired, these conditions are most naturally used to check optimality of a candidate measurement procedure, but are not constructive. For some time therefore almost all known analytic solutions were in special cases in which a symmetry property of the states could be used to guide the design of optimal measurements andersson2002minimum ; mochon2006family ; ban1997optimum ; chou2004minimum . Some progress has since been made, however, in employing these conditions to construct solutions, and in the qubit case in particular, the Bloch sphere representation proves useful in providing a geometric picture hunter2004results ; deconinck2010qubit ; bae2013minimum ; ha2013complete ; samsonov2009minimum . Methods of constructing optimal measurements were given first for equi-probable pure qubit states hunter2004results , and later for mixed states and for arbitrary prior probabilities bae2013minimum ; ha2013complete ; ha2014discriminating ; deconinck2010qubit , the latter using a geometric approach.
In this paper, we give an alternative method of constructing optimal measurements from the minimum error conditions. Previous work has demonstrated that finding a single operator , sometimes referred to as the Lagrange operator, is equivalent to solving the minimum error discrimination problem: the trace of this operator gives the optimum probability of success, and optimal measurements may be readily constructed once it is known. We construct linear constraints on this operator and its inverse, which in the qubit case may be readily solved for and thereby the optimal measurement. Our algebraic approach is complementary to the geometric approach presented in bae2013minimum , the results of which were applied to explicitly calculate the optimal probability of correct discrimination for the case of three mixed qubit states in ha2013complete . In contrast to the existing approaches, our method reduces the problem of state discrimination to the simpler one of solving a series of linear equations.
II The minimum-error conditions
We begin by introducing the minimum-error conditions helstrom1976quantum . We wish to distinguish between states , with a priori probabilities . Any physically allowed measurement may be described mathematically as a probability operator measure (POM) barnett2009quantum , also known as a positive operator-valued measure (POVM) peres2006quantum , that is a set of operators satisfying:
[TABLE]
The Born rule, expressing the probability of obtaining outcome in a measurement on a system prepared in state is given by:
[TABLE]
and the above conditions ensure that these probabilities are positive and sum to 1. A “click” at the detector corresponding to element is taken to indicate that the state was sent. Bob’s probability of correctly guessing the state Alice sent is then given by , and it is this that we wish to maximise in the minimum error problem (Bob’s probability of error, of course, is given by ). Clearly to distinguish between states we therefore need a measurement with at most outcomes. The number of outcomes may be less than (or equivalently, some of the operators may be zero) if Bob’s measurement procedure is such that some states are never identified. Indeed in some cases the optimal measurement is simply to always guess the most likely state hunter2003measurement and never identify any other state.
The solution to the problem of minimum-error quantum state discrimination is equivalent to finding a POVM satisfying the conditions helstrom1976quantum ; holevo2011probabilistic ; yuen1975optimum :
[TABLE]
where . The first condition is both necessary and sufficient for to describe an optimal measurement procedure, and we note that the conditions are not independent; the second, which is necessary but not sufficient, follows from the first. It is useful, however, to give both conditions, because often the second is more convenient to use in practice. Note that is a Hermitian operator , which follows from condition (2), and may be seen explicitly by summing over both and in condition (3). An alternative condition is obtained by summing over in eqn. (3), giving:
[TABLE]
This is a necessary (but not sufficient) condition on any optimal measurement , and is central to our and other methods bae2013minimum ; ha2013complete , allowing us to construct operators satisfying once a candidate is given. Indeed, both and [according to inequality (2)] are positive operators, and thus eqn. (4) can hold only if they are orthogonal, that is, is entirely within the kernel (or the eigensubspace corresponding to zero eigenvalue) of . It follows that can be non-zero only if has at least one zero eigenvalue. We further note that
[TABLE]
Therefore if we can find , we find both the optimal probability of success, and a way of constructing the optimal measurement operators. The problem of finding the optimal measurement , a set of operators, is thus equivalent to finding a single positive operator satisfying the condition (2) and from which operators satisfying (4) and forming a POVM may be constructed. Indeed the so-called dual problem in the semi-definite programming approach consists of finding the operator with minimum trace that satisfies condition (2) for all . Further, as is stressed by Bae bae2013structure , the operator is unique for a given set of states, while the optimal measurement may not be - for example, in the case of symmetric states hunter2004results ; mochon2006family ; sasaki1999accessible .
III Qubit state discrimination
There has recently been much progress in using the Helstrom conditions constructively, to find optimal measurements, with particular success in the qubit case. Hunter was perhaps the first to attempt this in a systematic way, showing how to construct optimal measurements for all sets of equi-probable pure states hunter2004results . Samsonov later presented an algorithmic solution to the Helstrom conditions for pure qubits with arbitrary prior probabilities samsonov2009minimum . Deconinck and Terhal gave an elegant geometric interpretation of the Lagrange operator in the Bloch sphere picture as the minimum enclosing ball of a suitably defined set of balls. A similar interpretation can be given in higher dimensions tysonremark . This gives rise to a linear time algorithm for a set of arbitrary qubit states with arbitrary probabilities. Recently Bae used the so-called Karush-Kuhn-Tucker (KKT) conditions from semi-definite programming to define complementary states with weights such that
[TABLE]
The geometric structure of the complementary states may be deduced from the conditions and the geometric structure of the signal states , and in turn used to construct . Bae discusses the qubit case, in which the Bloch sphere provides a convenient geometric picture, and the full details for three mixed qubit states were later calculated by Ha and Kwon ha2013complete .
We begin with some general considerations concerning the qubit state discrimination problem, and then discuss our method, which constructs directly, without reference to complementary states. Firstly, we note that for each the operator can have two, one, or no zero eigenvalues, corresponding to the zero operator, a rank-one operator, and a positive-definite operator respectively:
If for some , then , which can only hold if for all . The no measurement strategy is then an optimal measurement, hunter2003measurement . 2. 2.
has a single zero eigenvalue. If is non-zero, it is a weighted projector onto the corresponding eigenstate. 3. 3.
If is positive definite (all eigenvalues strictly greater than zero), then according to condition (4) it follows that for every optimal measurement, and the corresponding state is never identified.
Given a set of qubit states with a priori probabilities , it is easily checked whether for some
[TABLE]
If this does hold for some , the optimal strategy is not to measure at all and simply guess . For all other ensembles, it follows that the optimal measurement is made up of rank-one weighted projectors,
[TABLE]
for some satisfying , and where is the eigenstate of corresponding to the zero eigenvalue. Note that this is completely general for qubits, and holds whether are pure or mixed states. Thus, for an optimal measurement each operator is uniquely defined, up to a multiplying factor. There may however be more than one way of choosing the coefficients such that the thus found sum to the identity.
Secondly, we note that for minimum error discrimination of an arbitrary set of qubit states there always exists an optimal measurement with at most four outcomes. Intuitively, the constraint contains only independent linear constraints, where is the dimension of our space: if a set of elements satisfies this, there is always a subset of these which, when appropriately weighted, also forms a resolution of the identity. A measurement with outcomes can always be decomposed as a probabilistic mixture of measurements with at most outcomes. If the mixture results in an optimal procedure, then any of the component measurements must also be optimal hunter2004results ; deconinck2010qubit . A more complete proof of this may be found in the literature, e.g., chiribella2007continuous .
Finally, note that the number of outcomes in our optimal measurement corresponds to the number of states that are identified with non-zero probability by the measurement: additional states are never identified. Denoting the number of outcomes , the cases and are well-known, as these correspond to the no-measurement strategy and the Helstrom two-state discrimination measurement respectively helstrom1976quantum ; barnettcroke2009quantum . In the cases and it is more difficult to find optimal measurements, although as discussed above, strategies for these cases have been recently suggested.
For qubits, the Pauli operators together with the identity form a convenient basis in which to express any operator on the space. Thus, for example, we can write
[TABLE]
where , is a real three-dimensional vector, and is the vector of Pauli operators: . It will be convenient in what follows to also use such a representation for the inverse , and it is easily verified that:
[TABLE]
Note that is a strictly positive operator in the space spanned by the states to be discriminated, and so the inverse is always well-defined, as is the square-root, which we will use later. We thus need four parameters to completely specify , and we discuss now how to construct four constraints, which are readily inverted to construct .
Suppose there is an optimal measurement which identifies states: we will show that for each of these we may obtain one constraint on the parameters of . It is of course not obvious a priori which states will be identified by an optimal measurement; however, we can construct a candidate , under the assumption that a particular subset of our states are identified in an optimal measurement, and then verify that this results in a physically allowed measurement procedure. We will return to this later. According to the discussion above therefore, for each of these states the operator has a single zero eigenvalue. Let us consider first the pure state case: . If we pre- and post-multiply condition (2) by we find:
[TABLE]
We further require that this operator has exactly one zero eigenvalue, which in turn requires that the second term has modulus 1:
[TABLE]
A similar relation was pointed out by Mochon mochon2006family , who discussed the inverse problem of characterising the sets of states and corresponding probabilities for which a given measurement procedure was optimal, although it does not seem to have been used constructively in the literature. Thus we find
[TABLE]
Alternatively, if has Bloch vector (a unit vector as is a pure state), , we may write:
[TABLE]
Each state gives rise to one such constraint, resulting in independent constraints on .
This procedure is readily adapted to apply also to mixed states for the qubit case. Note that for qubit states, every mixed state can be written as a mixture of a pure state and the identity: , where . The requirement that then becomes:
[TABLE]
and using the same reasoning as previously, if we require this operator have exactly one zero eigenvalue we obtain:
[TABLE]
Explicitly, this gives:
[TABLE]
and after a litte rearranging, again writing , we find
[TABLE]
If there are states identified by the optimal measurement this procedure, in both the pure state and mixed state case, gives equations for the parameters of . Clearly if this is enough to construct . We further note that in equations (11) and (13) the non-linear right hand side is independent of , thus we can easily take linear combinations to obtain linear equations. For these are readily solved to write all parameters in terms of a single one, e.g. , which is finally determined by solving one quadratic equation.
Thus we can construct optimal measurements with or outcomes. For we do not yet have enough constraints to determine ; a further constraint, however, is readily constructed, as we now discuss. We first note that for the special case in which three signal states lie in an equatorial plane of the Bloch sphere (as in andersson2002minimum ; myfirstpaper ), we know from symmetry that the POVM elements, and therefore also , must lie in the same plane as the signal states, thus giving us our final constraint. More generally, for the case of three equiprobable pure qubit states it is always possible to choose a representation in which the states sit at the same latitude of the Bloch sphere. The optimal measurement operators then lie in the equator of the sphere, and has the same latitude as the signal states hunter2004results .
We can generalize this idea to both pure and mixed states, and to non-equal prior probabilities. We first note that for a three outcome measurement, all three elements of the POVM must lie in an equatorial plane of the Bloch sphere in order to form a resolution of the identity. Without loss of generality we choose our axes so that this is the plane. That is, we can always choose our axes so that , with , . Referring now to condition 4, it follows that
[TABLE]
and thus . Finally we therefore require
[TABLE]
where as before . Thus if we define our -axis to be such that the -component of is the same for each of the three signal states identified, then also has the same -component, and the optimal measurement operators lie in the equatorial plane. Note that a similar discussion may be found in herzog2015optimal .
Thus for a given set of qubit states with arbitrary priors , if there exists an optimal minimum error measurement which identifies a subset of of these states, we have shown how to construct , which in turn allows us to construct the optimal measurement. We illustrate below in an example how this may be employed in practice to find optimal measurements, and discuss later the problem of how we can know in general which states are identified by an optimal measurement.
IV Example
To illustrate our method, we consider the problem of discriminating between three pure states which are mirror-symmetrically arranged on the equator of the Bloch sphere, previously investigated by Andersson, et. al. andersson2002minimum . The states are:
[TABLE]
and these occur with a priori probabilities , , with . The so-called trine ensemble occurs at myfirstpaper .
We begin by noting that as the states are all pure it is not possible to satisfy conditions (6) and the no-measurement solution is never optimal. We next check to see when a two outcome measurement is optimal. Note that due to the symmetry the only sensible two-outcome measurement is one distinguishing and : the optimal such measurement is a projective measurement in the eigenbasis of ; , where and . It is straight-forward to calculate , and we note that condition 2 is satisfied for by construction. In order to check this condition for , it is enough to verify that . We find as in andersson2002minimum , that this holds when
[TABLE]
The corresponding optimal probability of correctly identifying the state is given by
[TABLE]
When condition 14 does not hold, we know that a three outcome measurement is optimal, and can use the method outlined above to find this. We first note that for each of our signal states. Thus, as discussed above, must also have , and lies in the equatorial plane. Further, using equation 11, we obtain the following three constraints on , and :
[TABLE]
It is clear from the latter two that . The remaining equations are readily solved for and , giving:
[TABLE]
The corresponding probability of correctly identifying the state is then given by:
[TABLE]
which agrees with the solution provided in andersson2002minimum .
We finally note that we found the region in which a three outcome measurement was necessary by first finding the region in which a two-outcome measurement was optimal. If we use our method to find a candidate in the region where in fact the optimal measurement has only two outcomes, we find that even though it is possible to construct , it is not possible to construct a physically allowed measurement from the conditions 4, and the method fails. Further, it can sometimes return probabilities that are greater than 1, clearly indicating that something has gone wrong. This is illustrated in Fig 1.
V Discussion
We have presented a method to construct optimal minimum error measurements from the known necessary and sufficient conditions. If we know which of a set of states are identified by an optimal measurement, the method presented here allows us to construct four linear conditions on either or , from which we have enough information to reconstruct . The remaining problem we have not addressed, and which is common to other methods in the literature herzog2015optimal ; ha2013complete , is how to find which states our measurement should identify. We finish with some comments on this problem.
In the worst case, we can find the optimal measurement by exhaustive search: we first check if the no measurement solution is optimal. If yes then we are done, and if not then we know that . We then check whether any measurement identifying just two of the states is optimal. This consists of constructing optimal measurements for each pair of states, and checking the condition (2) for the remaining states in each case. There are such measurements. If none of these are optimal, then we know , and so on. This requires constructing [i.e. ] candidate operators, and for each one checking conditions, thus we require operations, in the worst case. Our detailed results for the case of three symmetric states with arbitrary priors, which we discuss elsewhere myfirstpaper , indicate that for almost all prior probabilities the optimal measurement has only two outcomes. Thus we expect that in many cases an optimal measurement will be found faster than operations.
For a given set of states, the method we present here allows us to characterise the entire parameter space of prior probabilities, beginning with the no-measurement solution, through those regions in which a two-outcome measurement is optimal, and constructing three- and then four-outcome solutions for the remaining regions, as shown in the example above. We note also that for specific cases numerical methods can also be used to determine which states are identified by an optimal measurement, and once this is known our method may be used to find an exact analytical solution for the optimal probability of correctly identifying the state and to find optimal measurements.
We have introduced an analytical method, complementary to the geometric approach in the literature, for constructing optimal measurements for minimum error state discrimination problems. Our method constructs linear constraints on the so-called Lagrange operator , and its inverse , from which the optimal may readily be found for any qubit state discrimination problem. Although the constraints we present appear elsewhere in the literature in a different context, it seems not to have been recognised that these together give enough information to construct optimal measurements. We have further shown that these are applicable to both pure and mixed states in the qubit case.
In this paper we have discussed the qubit case in detail. We expect that the linear constraints given on may also be applied in higher dimensions. The constraints on may be applied to pure states in higher dimensions, although the mixed state case appears less straight-forward, because it is no longer the case that any mixed state is a mixture of a pure state and the maximally mixed states. We leave a full discussion of the generalization to higher dimensions for future work.
Acknowledgements
This work was supported by the University of Glasgow College of Science and Engineering (S.C. and G.W.) and by the Royal Society Research Professorships (S.M.B., Grant No RP150122).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1) Joonwoo Bae and Leong-Chuan Kwek. Quantum state discrimination and its applications. Journal of Physics A: Mathematical and Theoretical , 48(8):083001, 2015.
- 2(2) Nicolas Brunner, Miguel Navascués, and Tamás Vértesi. Dimension witnesses and quantum state discrimination. Physical review letters , 110(15):150501, 2013.
- 3(3) Vladimir Bužek and Mark Hillery. Quantum copying: Beyond the no-cloning theorem. Physical Review A , 54(3):1844, 1996.
- 4(4) Dagmar Bruss, Artur Ekert, and Chiara Macchiavello. Optimal universal quantum cloning and state estimation. Physical review letters , 81(12):2598, 1998.
- 5(5) Dagmar Bruß, Mirko Cinchetti, G Mauro D’Ariano, and Chiara Macchiavello. Phase-covariant quantum cloning. Physical Review A , 62(1):012302, 2000.
- 6(6) Michael Keyl and Reinhard F Werner. Optimal cloning of pure states, testing single clones. Journal of Mathematical Physics , 40(7):3283–3299, 1999.
- 7(7) Carl W Helstrom. Quantum detection and estimation theory . Academic press, 1976.
- 8(8) Alexander Semenovich Holevo. On asymptotically optimal hypotheses testing in quantum statistics. Teoriya Veroyatnostei i ee Primeneniya , 23(2):429–432, 1978.
