Semi-device-independent self-testing of unsharp measurements
Nikolai Miklin, Jakub J. Borka{\l}a, and Marcin Paw{\l}owski

TL;DR
This paper introduces a semi-device-independent method to self-test unsharp quantum measurements, demonstrating their advantages over sharp measurements and providing robust characterization techniques using semidefinite programming.
Contribution
It presents a novel scheme for self-testing unsharp measurements and introduces a semidefinite programming method for bounding quantum correlations in sequential measurement scenarios.
Findings
Unsharp measurements outperform sharp ones in certain scenarios.
All two-outcome qubit measurements can be robustly characterized.
The method improves security analysis in quantum key distribution.
Abstract
Unsharp quantum measurements provide a resource in scenarios where one faces the trade-off between information gain and disturbance. In this work we introduce a prepare-transform-measure scenario in which two-outcome unsharp measurements outperform their sharp counterparts, as well as any stochastic strategy involving dichotomic projective measurements. Based on that, we propose a scheme for semi-device-independent self-testing of unsharp measurements and show that all two-outcome qubit measurements can be characterized in a robust way. Along with the main result, in this work we introduce a method, based on semidefinite programming, for bounding quantum correlations in scenarios with sequential measurements of length two. This method can also be applied to refine security analysis of the semi-device-independent one-way quantum key distribution. We also present new information…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Semi-device-independent self-testing of unsharp measurements
Nikolai Miklin
Institute of Theoretical Physics and Astrophysics, National Quantum Information Center, Faculty of Mathematics, Physics and Informatics, University of Gdansk, 80-952 Gdańsk, Poland
Jakub J. Borkała
Institute of Theoretical Physics and Astrophysics, National Quantum Information Center, Faculty of Mathematics, Physics and Informatics, University of Gdansk, 80-952 Gdańsk, Poland
Marcin Pawłowski
Institute of Theoretical Physics and Astrophysics, National Quantum Information Center, Faculty of Mathematics, Physics and Informatics, University of Gdansk, 80-952 Gdańsk, Poland
Abstract
Unsharp quantum measurements provide a resource in scenarios where one faces the trade-off between information gain and disturbance. In this work we introduce a prepare-transform-measure scenario in which two-outcome unsharp measurements outperform their sharp counterparts, as well as any stochastic strategy involving dichotomic projective measurements. Based on that, we propose a scheme for semi-device-independent self-testing of unsharp measurements and show that all two-outcome qubit measurements can be characterized in a robust way. Along with the main result, in this work we introduce a method, based on semidefinite programming, for bounding quantum correlations in scenarios with sequential measurements of length two. This method can also be applied to refine security analysis of the semi-device-independent one-way quantum key distribution. We also present new information gain–disturbance relation for pairs of dichotomic measurements.
I Introduction
Nonprojective, or generalized, measurements are known to provide an advantage in various quantum information processing tasks. Examples include quantum tomography renes2004symmetric ; bisio2009optimal , state discrimination chefles2000quantum ; barnett2009quantum , and randomness certification acin2016optimal , to name a few. However, there is one important property of generalized measurements that is discussed less often than the others. It is the trade-off between information gain and disturbance that this type of measurement may allow for.
In quantum measurement, if some information is obtained about the state of the measured system, that state will necessarily be perturbed. This principle, which was formalized in Ref. busch2009no , is central to quantum mechanics and it is also one of the pillars of quantum cryptography bennett2014quantum . Quantitatively the interplay between information gain and disturbance has been formulated in numerous trade-off inequalities fuchs1996quantum ; fuchs1998information ; kretschmann2008information ; buscemi2008global .
Sharp or “strong” measurement does not provide such a trade-off. The state of the system after sharp quantum measurement corresponds to one of the eigenstates of the measured observable. It means that the disturbance caused by sharp measurements is maximal and any subsequent measurement on the same system will not provide any additional information about its original state. On the other hand, maximally unsharp measurement, corresponding to an identity channel applied on the state, does not cause any disturbance, but at the same time it does not give any information about the state of the system. In the case of two-outcome qubit measurements, which we discuss in this paper, the above two types of measurements are precisely the two variants of projective measurements. Hence, the aforementioned trade-off is possible only if one has access to a device performing nonprojective, or generalized, measurements.
At this point we should clarify some terminology that we use in this paper. Throughout the text we use the terms “two-outcome”, “binary”, and “dichotomic” interchangeably. For two-outcome qubit measurements the terms “unsharp” and “nonprojective” mean the same thing except for the case of maximally unsharp measurements, which are trivial measurements and are not interesting to study. Keeping this in mind, we note that the term “nonprojective” is very often associated exclusively with three- or more-outcome measurements in the case of qubit systems. Hence, we chose to use the term “unsharp” to denote two-outcome nonprojective measurement throughout this work. Unsharp measurements that aim at bringing minimal disturbance to the system are often referred to as “weak measurements” aharonov1988result .
Unsharp and, in particular, weak measurements have been the subject of numerous studies busch1986unsharp ; busch1988surprising ; ritchie1991realizaiton ; fuchs1996quantum ; fuchs1998information ; kretschmann2008information ; buscemi2008global ; pryde2005measurement ; aharonov2014foundations . We would like to mention some of the recent results of Refs. silva2015multiple ; curchod2017unbounded ; curchod2018entangled showing that nondestructive sequential unsharp measurements can be utilized to establish nonlocal correlations among multiple parties from one entangled pair and generate an unbounded amount of certified randomness. Naturally, given their importance, unsharp quantum measurements were reportedly implemented by various experimental groups ritchie1991realizaiton ; pryde2005measurement . However, all of these results require one’s trust in the measuring or preparation devices.
In this paper we ask the following questions: Is it possible to certify implementation of an unsharp measurement without making any assumptions, or with minimal assumptions made about the inner workings of the devices? If so, can one characterize the performed measurement based solely on the observed statistics? The answer comes from the rapidly growing field of self-testing mayers2003self ; vsupic2019self . It turns out that in quantum experiments it is possible to certify implementation of the target measurement or preparation of the desired state from the statistics alone and with minimal assumptions made about the inner workings of the devices. The most prominent example is self-testing of a two-qubit Bell state in the case of maximal violation of Clauser-Horne-Shimony-Holt (CHSH) inequality clauser1969proposed ; braunstein1992maximal ; popescu1992states .
Self-testing was originally proposed in the framework of the Bell test mayers2003self , also known as device-independent (DI) characterization. However, it was soon generalized to a more experimentally appealing prepare-and-measure scenario tavakoli2018self , which followed the ideas of semi-device-independent (SDI) characterization of quantum systems pawlowski2011semi . In the SDI framework one assumes that the dimension of the degree of freedom of a physical system used for information processing is bounded from above. We would like to point out that the assumption on the dimension is not only natural for studying generalized measurements (due to the possibility of a dilation), but it is also necessary for obtaining nontrivial results in scenarios with sequential measurements, as widely discussed in the literature fritz2010quantum ; hoffmann2018structure ; budroni2019memory . Additionally, more and more works on self-testing are now considering the SDI framework mironowicz2018experimentally ; tavakoli2018self ; tavakoli2018self2 ; farkas2019self .
The majority of self-testing results, in both DI and SDI settings, consider target objects, e.g., an entangled state, that satisfy the property of extremality. In this case, one can hope to find a witness, e.g., a Bell inequality, that singles it out. In our case, the situation is somewhat more complicated. The reason is that statistics of an unsharp binary qubit measurement can be reproduced exactly by a statistical mixture of projective measurements on the same state space without any postprocessing oszmaniec2017simulating . As a consequence, if one considers a simple prepare-and-measure scenario, it will not be possible to distinguish whether the measuring device performs an actual unsharp measurement or its simulation.
In this work we overcome the above limitations by considering a tripartite prepare-transform-measure scenario. We introduce a game in which unsharp measurements outperform their sharp counterparts as well as any statistical mixture of projective measurements. This claim is formalized by two propositions in Section III. In Section IV we show that in the proposed scenario one can device-independently characterize all two-outcome unsharp qubit measurements, subject to an assumption that the physical system that parties use for information processing is a qubit. In the same section we show that this characterization is robust to noise. In our proofs regarding self-testing we use semidefinite-programming techniques that we describe in Section V. These techniques can be used to tackle many other problems involving sequential measurements outside of the current work. We demonstrate it by refining the security analysis of the SDI one-way quantum key distribution pawlowski2011semi in Section VI. Some proofs as well as minor technical details are left for the Appendix.
II Prepare-transform-measure scenario
Let us now introduce our scenario which is a generalization of quantum random access code (QRAC) ambainis2008quantum . Consider three parties, Alice, Bob, and Charlie, communicating in a sequential way as shown in Fig. 1. Alice receives two random bits . Bob and Charlie each receive a random bit, and respectively, which indicates the bit of Alice this party needs to guess. All the random bits , and are independent and uniformly distributed. We assume that the parties do not share any entangled states, however, they may have access to any amount of shared classical randomness. Depending on , Alice prepares a qubit state which she sends to Bob. Bob performs some quantum operation on , depending on his input , and obtains a classical outcome . Afterwards, Bob sends the postoperation state to Charlie, who performs a measurement depending on .
A figure of merit that we consider is the following average success probability:
[TABLE]
with and where summation is taken over all values of and , respectively. The parameter is announced prior to the game and remains unchanged throughout all the rounds. It dictates the parties whose guess contributes more to the overall success probability. As we show later in the text, the optimal operations that Bob performs in the case of correspond to unsharp measurements.
II.1 Preliminary notions
Measurements in quantum mechanics are formalized by the notion of positive-operator valued measure (POVM). For two-outcome measurements, which we consider in this paper, a POVM is an ordered pair of positive semidefinite linear operators , , satisfying , where is the identity operator. Projection-valued measure (PVM) is a special case that corresponds to . From here it is clear that the only two possibilities of two-outcome qubit PVMs are sharp measurements, with both being normalized rank- operators, and the maximally unsharp, or trivial, measurement with one equal to and the other being zero. We call a POVM unbiased if and biased otherwise. In this work we concentrate on the former case as it encapsulates the resources that are interesting to us for the trade-off between information gain and disturbance. The biased case allows for an additional flexibility with respect to probability distribution of the measurement outcomes.
In order to describe the operations of Bob we need to introduce the notion of a quantum instrument, which captures both measurement statistics and the state evolution (see, e.g., Ref. heinosaari2011mathematical ). By quantum instrument for the case of two measurement outcomes we mean an ordered pair of completely positive trace nonincreasing maps that sum to a channel, completely positive trace-preserving map. Let us denote the instruments of Bob by for each , where each and , in agreement with our SDI assumption. If we apply these maps to the state (for a given ), which Alice sends to Bob, we obtain a pair of positive operators , . Their traces form the probability distribution of Bob’s measurement outcomes , and, if normalized, these operators correspond to the postmeasurement states that Bob sends to Charlie, . We say that instrument is compatible with a POVM if for all states , .
Using the formalism of quantum instruments we can write the joint probability distribution of both parties’ outcomes as follows:
[TABLE]
where we denoted Charlie’s POVMs by , . Naturally, using this formula we can calculate the probabilities in our figure of merit in Eq. (1).
The action of an instrument on a state can be specified by its Kraus decomposition (see, e.g., Ref. heinosaari2011mathematical ). However, it will be more convenient for us to work with instruments in their Choi-Jamiołkowski (CJ) representation (see, e.g., Ref. heinosaari2011mathematical ). We obtain the CJ operators of Bob’s instruments by the following map:
[TABLE]
where is an identity map on elements of and is a non-normalized Bell state . The inverse map is given by the relation , where the partial trace is taken with respect to the first subsystem, and stands for matrix transpose with respect to the basis in which the CJ isomorphism is defined. From this relation it is clear that POVM , associated with the CJ operator , is simply a pair of reduced operators with the trace taken over the second (output) subsystem, i.e., , for each .
Using CJ representation of the instruments we can rewrite the conditional probabilities observed in our scenario as follows:
[TABLE]
One can immediately notice that the above formula is very similar to the one of tensor product measurements on an entangled state. This motivated us to try to apply the semidefinite programming (SDP) techniques of Ref. navascues2015bounding to bound correlations in Eq. (4). As discussed in Section V, these SDP techniques had to be modified for this purpose.
III Unsharpness as a resource
We have already mentioned in the Introduction that unsharp measurements are needed to access the trade-off between information gain and disturbance. In this section we show that the figure of merit in Eq. (1) allows for witnessing unsharpness of the performed measurement in an SDI way, i.e., with no assumptions about the measuring or preparation devices apart from the upper bound on the dimension of the underlined Hilbert space. We are ready to present our first part of the results, which is given by the following two propositions.
Proposition 1**.**
The average success probability for strategies involving only projective measurements and their probabilistic mixtures is bounded by the following expression:
[TABLE]
The reason that the function has three distinct segments is that Bob can choose to apply one of the three different optimal strategies depending on . The first one, which we call “unitary”, corresponds to the maximally unsharp measurement for both , which is compatible with Bob outputting and applying a unitary channel on Alice’s states. The third one is the “measure-and-prepare” strategy, in which Bob performs a sharp measurement for both and prepares some state that he sends to Charlie. The middle segment corresponds to Bob applying the unitary strategy for , and the prepare and measure strategy for (or the other way around), which we refer to as a “mixed” strategy. The proof of Proposition 1 is too lengthy to be included in the main text and is given in Appendix B.
Proposition 2**.**
The bound on the average success probability for the general strategy involving unsharp measurements is the following
[TABLE]
The results of both Propositions 1, and 2 are shown in Figure 2. In this figure we also show the classical bound for this game, which evidently equals irrespective of . We should point out that all the presented bounds are tight in the sense that there exist states and measurements reaching these average success probabilities.
The optimal quantum strategy that attains the bound in Proposition 2 is the following. The states of Alice have the same form as in the QRAC, i.e., independent of . Their explicit form is given below, where we denote :
[TABLE]
The optimal POVMs of Charlie can be set to be the following:
[TABLE]
and the optimal instruments of Bob (their CJ operators) can be taken to be of the form ,
[TABLE]
where we left the real parameter unspecified. It is clear that determines the sharpness of the corresponding measurement of Bob. We find that the optimal is the following function of :
[TABLE]
which leads to the expression of the optimal average success probability from Proposition 2.
The above analysis gives only a lower bound on the average success probability. In order to prove that this bound is also an upper bound we used the aforementioned SDP techniques, described in Section V. We compared the numerical results from the SDP for each value of with a step of with the lower bound values from Proposition 2 and the discrepancy was always less than or of the order of , which leads us to conclude that the obtained bound is in fact the correct one.
Having a gap between success probabilities achieved by projective measurements and unsharp measurements is necessary for the self-testing claims. However, already from the results of Propositions 1 and 2 it follows that if one experimentally obtains a value of average success probability that is greater than for some , one can conclude that Bob’s measuring device performed an unsharp measurement. In order to answer the question of exactly which measurements was performed by Bob we will need a stronger result, given in the next section.
IV SDI characterization of unsharp measurements
The core idea that we use in our paper is to certify implementation of a quantum instrument that realizes the target unsharp measurement. The set of quantum instruments realizing a given unsharp POVM is strictly larger than the set of probabilistic mixtures of instruments corresponding to PVMs that reproduce the same measurement statistics for Bob. It is surprising that a simple and intuitive figure of merit in Eq. (1) is sufficient to witness this fact. Although the target measurements that we would like to self-test are not extremal in the space of all POVMs, some instruments that realize these measurements are extremal and hence can potentially be witnessed by saturating the bound in Proposition 2.
In this paper we focus on certification of the unsharpness parameter of the performed measurement from the observed data. In this section we only consider two-outcome unbiased qubit POVMs. Hence, the unsharpness parameter, which can be taken to be the operator norm of one of the effects completely characterizes these measurements up to an isometry. Hence, this SDI characterization is equivalent to the SDI self-testing of individual measurements.
Since the figure of merit in Eq. (1), when written in terms of probabilities in Eq. (4), is linear with respect to CJ operators of Bob’s instruments, it is sufficient to consider only extremal instruments. In Appendix A we show that in our case the extremal instruments have the form as in Eq. (III) (with being dependent on , in general). Hence, the sharpness of the corresponding POVM is exactly the parameter in Eq. (12).
Now, we come to our main question of what can be said about the unsharpness parameter for each given the observed value of the average success probability in Eq. (1). We answer this question by setting the opposite problem: What are the maximal values of the average success probability attainable by POVMs with a given norm? It turns out that the optimal is always unbiased for the figure of merit in Eq. (1).
Our results on the robust SDI characterization of unsharp measurements are shown in Figure 3. The black line represents the case of the ideal statistics, in which case the norm of the target measurements is the same for both and is given by the expression in Eq. (12). As we can see, the above norm is a monotonically increasing function of and its image is the interval . This fact suggests that we can self-test any unsharp unbiased two-outcome qubit measurement by picking the corresponding value of .
In Fig. 3 each pair of lines of the same color represents upper and lower bounds on the norm of the POVM for each , given that the average success probability is above some fixed value. We would like to stress that we do not assume that both norms for and are equal. In Figure 3 we show four values for , where is the deviation of the observed value of from the optimal value . From the plot one can see that for going to [math], the region of the compatible norms shrinks monotonically to a point with given by Eq. (12) for both . This fact suggests that our SDI characterization is indeed robust.
We would like to give an example of how to read the plots in Fig. 3. Let us say we wish to self-test a measurement with the norm equal to . We would pick , for which POVMs with this norm are the optimal ones. If now our observed average success probability is equal to , which is smaller than the optimal value , we can conclude that the norm of the implemented POVM for both must lay within the interval . A more detailed description of the robustness analysis is given in Appendix C. In Appendix D we also give a sufficient numerical evidence that our scheme can be adapted to SDI characterization of biased unsharp measurements.
V Methods
The proof of Proposition 1 is almost entirely analytical (see Appendix B). For the proof of Proposition 2 as well as for the results of the previous section we used SDP techniques that we describe in this section. These SDP techniques are based on the hierarchy of Navascués and Vértesi navascues2015bounding . Below, we also provide an argument why the hierarchy of Ref. navascues2015bounding cannot be directly applied to our problem. Moreover, our SDP techniques can be applied to a variety of problems involving two sequential measurements on a single system. For example, in Section VI we refine the analysis of the SDI one-way quantum key distribution proposed in Ref. pawlowski2011semi . There we show that a secure key can be distilled whenever the success probability of a QRAC exceeds the classical limit of , which is a significant improvement compared to the original result pawlowski2011semi .
The SDP hierarchy of Navascués and Vértesi navascues2015bounding is very useful for bounding correlations in the Bell scenario with an additional restriction on the local dimension. One can also use this hierarchy to approximate the sets of quantum correlations in the prepare-and-measure scenario, where the dimension constraint is crucial. In particular, using Ref. navascues2015bounding one can directly obtain upper bounds on success probabilities of QRACs. We do not summarize the method of Ref. navascues2015bounding here, so the interested reader should first make themselves familiar with it before reading the following description of the proposed modification.
In our modification of the hierarchy of Ref. navascues2015bounding , which we formulate here for what is know as the “1+AB” level, we consider the following set of operators
[TABLE]
We then consider four moment matrices for each . For each we assume the measure ; i.e., the entries of the matrix correspond to , . It is clear that each must be positive semidefinite, i.e., , . Also, we can use the main idea of the hierarchy of Ref. navascues2015bounding and construct a basis for from randomly sampled operators of fixed dimension. However, we need to apply more constraints on in order to obtain nontrivial bounds on our figure of merit in Eq. (1).
In particular, we exploit the fact that , for each . From this we can derive the following set of constrains that has to be satisfied by all quantum probabilities
[TABLE]
The above constraints should be interpreted as follows. The first type fixes the normalization of the instruments, and the second fixes the normalization of states . Finally, the third type of constraint ensures that all the inner products between the states are the same for both choices of Bob’s operations. Although the above modification is formulated for our scenario with binary , and , it is straightforward to generalize it to other scenarios with an arbitrary number of settings and outputs.
Now, we would like to argue why the original hierarchy of Ref. navascues2015bounding , as it stands, could not be applied to our problem. One could, in principle, try to explicitly include the CJ operators to the set of operators and consider a single moment matrix. However, this would require to be projectors in some potentially larger Hilbert space, which is not in general the case, since we do not know the normalization of the CJ operator of each individual element of the instrument.
In this paper we have successfully used the proposed SDP method to confirm the bounds obtained with the inner approximation methods. It would be interesting to investigate the convergence of this modified hierarchy navascues2015characterizing .
VI Applications
VI.1 Trade-off relation
As an application of the results of Proposition 2, we can derive new trade-off relations in terms of QRAC success probabilities attainable in the described sequential scenario. Indeed, our figure of merit can be rewritten as . From the condition we can derive the following trade-off relation
[TABLE]
This relation can be used to calculate the disturbance caused by the measurement of Bob by subtracting from the maximally attainable value of .
VI.2 Security of the SDI one-way quantum key distribution
Here we demonstrate that the SDP techniques introduced in the previous section have applications outside of the current work. To do this we refine the analysis of the security in semi-device-independent one-way quantum key distribution (QKD), proposed in Ref. pawlowski2011semi .
Let us briefly introduce this QKD scheme. In this scheme two communicating parties, Alice and Bob are trying to establish a secret key without trusting their devices. As a measure of privacy of the generated key they use the figure of merit of the QRAC, which reads
[TABLE]
The eavesdropping party, Eve, is trying to guess the bit of Alice corresponding to Bob’s input bit , i.e. to maximize the probability , where is Eve’s output. We can then introduce a real parameter just like we did in the main text and consider the following objective function
[TABLE]
which resembles the figure of merit in Eq. (1). We then optimize the expression in Eq. (15) subject to the constraints of the SDP method described in Section V. Afterwards, by optimizing over we obtain the monogamy relations between and that we plot in Fig. 4.
As shown in Ref. pawlowski2011semi , Alice and Bob are able to distill a secret key from their QRAC correlations whenever . From this condition one can calculate the critical success probability of QRAC required for the task. In Ref. pawlowski2011semi the authors estimated this critical probability to be , which is rather close to the maximal attainable success probability of the QRAC. From our plot in Fig. 4 one can see that it is enough for the average success probability to be just above the classical limit of . The latter value was also reported in Ref.chaturvedi2018security where the security of the SDI QKD scheme was analyzed from the point of view of the critical detection efficiency of the receiver’s detectors. However, to the best of our knowledge, the monogamy relation in Fig. 4 has not appeared in the literature prior to this work.
It is clear that our SDP techniques can be used for the analysis of the security of one-way quantum key distribution in scenarios beyond the one in Ref. pawlowski2011semi . Furthermore, there is a way to extend the SDP techniques to analyze correlations in scenarios with sequential measurements of length three.
VII Conclusions
In this paper we have discussed the advantages that unsharp measurements can provide if one faces the trade-off between information gain and state disturbance. We have proposed a scheme in which this advantage can be manifested within the semi-device-independent framework. Using this scheme we have shown that all unsharp binary measurements on a qubit can be semi-device-independently characterized in a robust manner, which also implies SDI self-testing of individual measurements for the case of unbiased POVMs. This was possible due to certification of the instruments implementing the respective unsharp measurement.
In this work we considered only SDI characterization of individual two-outcome qubit measurements since the latter can be achieved by certifying a single real parameter, the unsharpness. However, it is clear that our analysis can be extended to more complicated cases. One generalization, which is definitely worth considering, is certification of trinary qubit measurements. We believe that our scheme can be easily adapted to these purposes.
Semidefinite-programming techniques proposed in this paper can be readily applied to analyze the security of more complex SDI quantum key distribution schemes. These techniques can also be applied to explore the set of sequential quantum correlations for the case of time-dependent operations and sequences of length three.
Note added
After completing this work, we have become aware of the related result in Ref. wagner2018device . In Ref. wagner2018device the authors discuss self-testing of binary unsharp qubit measurements in the framework of the Bell test. In Appendix E we give a quick comparison of the obtained results and the used frameworks. We would also like to draw the reader’s attention to work in Ref. mohan2019sequential .
Acknowledgements.
N.M. would like to thank Costantino Budroni, Nicolas Brunner, Mariami Gachechiladze, and Anubhav Chaturvedi for useful discussions. The work on this project was supported by First TEAM Grant No. 2016-1/5. The SDP optimization was performed with SDPT3 solver toh1999sdpt3 and YALMIP toolbox lofberg2004yalmip for MATLAB.
Appendix
In this Appendix we provide technical details omitted in the main text. In Appendix A we discuss the optimality of Lüders instruments, a fact regarding extremal instruments that we use in the main text. In Appendix B we give a proof of Proposition 1. Appendix C is dedicated to a more detailed description of our results on the robustness of SDI characterization of unsharp measurements. Apart from that, in Appendix C we derive an upper bound on the average success probability of QRAC for the case of a dephasing channel being applied on the states of Alice. These results are used for the robustness proofs. In Appendix D we present our observations regarding the possibility of SDI characterization of biased two-outcome qubit measurements. Finally, in Appendix E we provide a comparison of our results with Ref. wagner2018device .
We start by fixing the notations that we use in this Appendix. We denote the preparation states of Alice by , where . The POVMs of Charlie we denote by . We use the notation to denote the qubit state orthogonal to the given one, i.e. , which is not ambiguous in the qubit case. We also use the common shorthand notation , as well as , .
Appendix A Optimality of Lüders instruments
In this Appendix we show that the optimal instruments of Bob are of Lüders type, which is essentially equivalent to saying that the optimal instruments have rank- Choi-Jamiołkowski (CJ) operators. Using the formula for conditional probability from Eq. (4) we can write explicitly the expression for the average success probability (1) in terms of operators as follows
[TABLE]
Let us introduce new operators , , with . We can think of as a block-diagonal matrix for each and , with two blocks being and . In a similar fashion we can construct operators , . The expression in Eq. (16) can then be written simply as
[TABLE]
It is easy to see that each is a state, i.e., , . It is also clear that the set of is convex and closed. Moreover, for the given POVMs of Bob we can always construct a “measure-and-prepare” instrument of the form , where are states. The states can then be expressed as . Since can be chosen arbitrarily, we can conclude that the extreme elements of the set of are direct sums of pairs of pure states, multiplied by and . The proof can be obtained trivially by assuming the opposite and writing the spectral decomposition for each of , which is then nothing but a convex combination of rank-1 block-diagonal operators (which means that the corresponding states are pure). Of course, measure-and-prepare instruments will not be optimal, but it is sufficient to consider this type of instrument to prove the optimality of rank-1 operators .
Now we can show that from the optimality of pure states of Alice and rank-1 operators it follows that CJ operators of optimal Bob instruments are necessarily rank . Let us write the spectral decomposition of for each ,
[TABLE]
where are not normalized. We did not write the subscripts for to avoid unnecessary complication, but, of course, will be different for different . Each can be written in its Schmidt decomposition schmidt1989theorie as , where we can require to be normalized, and leave not normalized. If we denote , we can write the following:
[TABLE]
However, we have already concluded that optimal must be rank-1 operators, which means that all must coincide up to a phase, i.e., for some phases . Now we can use the argument that the optimal must necessarily be linearly independent, i.e., for at least two , . It is pretty clear that in the opposite case Alice is not sending any information to Bob. Since span the entire Hilbert space, we come to the conclusion that , , i.e., , . However, since Eq. (18) is a spectral decomposition, it must be that , , from which it follows that there is only one vector in the sum in Eq. (18). This completes the proof. We should mention that the above proof can be directly extended to the higher-dimensional case.
Appendix B Proof of Proposition 1
In this Appendix we prove the bounds given in Proposition 1 on the average success probability in Eq. (1) for the case of projective strategies of Bob. There are two types of projective measurements that we need to consider, and . The case is clearly equivalent to . Since we need to consider only instruments of Bob corresponding to rank-1 CJ operators , the latter can be taken of the form
[TABLE]
for POVM . In the main text of the paper we refer to this strategy as “unitary” – because the Kraus operator for this instrument is a unitary. Another possibility for is the following:
[TABLE]
which corresponds to POVM . We refer to this strategy as “measure and prepare”. In both cases all the vectors , are normalized. It is important to note that in Eq. (20) the relative phase between and should be taken into account.
Due to the same requirement on the rank of we can directly see that the probabilistic strategies in which Bob applies, let us say, the operation from Eq. (20) with some probability and the strategy from Eq. (21) in all other cases will necessarily be less optimal than one of these strategies chosen with probability . It means that in order to analyze the bounds on the average success probability for strategies of Bob that correspond to projective measurements and their probabilistic mixtures it is sufficient to compute the bounds for the extreme projective strategies. Thus, we need to analyze the following three cases: (a) Bob applies a unitary strategy for , (b) Bob applies measure-and-prepare strategy for all , and (c) Bob applies a unitary strategy for and a measure-and-prepare strategy for . In the main part of the text we refer to the last case as a “mixed” strategy. It is also clear that strategies, equivalent to the ones listed, e.g., unitary strategy with the outcome flipped, or the mixed strategy with flipped, will give the same bounds. In what follows we analyze in detail all three cases (a), (b), and (c). When combined, it gives the proof of Proposition 1.
(a) “Unitary” strategy
As mentioned above, the instrument from Eq. (20) has a single Kraus operator that is a unitary. Let us denote it as , . Let us now write the expression for average success probability in terms of :
[TABLE]
It is clear that the first sum is equal to . The second sum can be upper-bounded by and for the maximal we can introduce new . Maximization over and is a standard optimization problem for QRAC, and thus the upper bound on in this case is .
b) “Measure and prepare” strategy
In the proof we use two main facts regarding the optimality of certain relations between states and measurements. The first one concerns maximization of expectation values over states , for some , and being normalized vectors in , . The maximum is obtained when with being a normalized eigenvector of corresponding to its maximal eigenvalue. When there are no further constraints on , the maximum can always be attained and the following formula gives its explicit form:
[TABLE]
where we used the notation for the operator norm (operator’s maximal eigenvalue).
The second fact concerns optimization of expressions of type over and , with both being normalized vectors in , with and where is some linear, not necessarily positive, operator. It can be seen that for optimal and necessarily . To see that, let us write the spectral decomposition of the operator , with , and . Then, . Clearly, the maximum over is attained at the boundary, i.e., for . It is also clear that for , a simple relabeling gives the same expression as for . Without loss of generality we can take and , which proves our claim.
Let us now proceed to the actual derivation of the bound. We remind ourselves that the CJ operators of the instruments of Bob have the following form (Eq. (21))
[TABLE]
After removing dependency due to POVM normalization, we can rewrite the average success probability as follows
[TABLE]
where and . Now we can use the fact that the optimal states are pure. Here we take and, from what we have discussed in the beginning of this section, we can assume . This creates some redundancy in Eq. (25), which can be removed. As a result one obtains the following expression for the optimal :
[TABLE]
where we have grouped the terms with respect to . We have also used the fact that extremal effects in this case are necessarily rank and introduced the notation . Applying Eq. (23) to Eq. (26) for the expression enclosed in we obtain the following:
[TABLE]
Each of the square roots are monotonic functions of and their domain is , if all possible values of are considered. Depending on the signs of the bounds on each square root will be obtained for or . One can expect that this is not the case for the sum of the square roots, i.e., the total . However, one can upper-bound the maximum of sums of the square roots by the sum of the maxima. Thus, we need to consider two cases of for each of the square roots, i.e., four cases in total. By doing so we obtain the following:
[TABLE]
For each of the four cases we can group the terms with respect to and upper-bound each of these inner products by the norms of the respective operators that are expressions of and with coefficients depending on . It is interesting that in each of these four cases the final upper bound is , which completes the proof.
(c) “Mixed” strategy
In this proof we use similar techniques as in the previous case of the measure-and-prepare strategy. In particular, we use the formula from Eq. (23) and, whenever our success probability will be written as an affine function of operators of the form , we assume .
For the mixed strategy of Bob the formula for the average success probability takes the following form
[TABLE]
where we have introduced the notation . Since we optimize the expression in Eq. (30) over and and we have no restrictions on the basis in which we consider these operators, we can apply to the basis of and to the basis of . Clearly, this will not change the optimal value for , but it removes this unitary from the expression in Eq. (30). We again use normalization of all POVMs in Eq. (30) to simplify it, which gives the following expression:
[TABLE]
where we used the fact that the optimal is rank and used a slightly different notation for , which are in this case: , , with as before. Now we can make use of the orthogonality relations to obtain the following:
[TABLE]
From this, it was not obvious how to proceed with the proof, so we tried to maximize the expression in Eq. (32) numerically. We used a simple parametrization of each vector with two real parameters and performed an unconstrained heuristic optimization over real parameters with gradient method for each value of . From this numerical optimization we could infer that the maximum of is obtained when and . We prove the validity of these observations later, but for now we assume them true and finish the calculations.
Taking and reduces Eq. (32) to the following form:
[TABLE]
Now we can put an upper bound on the expression in Eq. (33) by substituting all expectation values of operators by their norms. Using Eq. (23) we write as a function of , which is . Finally, optimization over gives the result stated in the proposition, i.e., .
Now, let us verify the assumptions and . In order to do so, let us write all the states in Eq. (32) in Bloch vector form. Since no basis is fixed, we can always assume . We can also assume, without loss of generality, that , i.e., that lies in the - plane of the Bloch ball. The normalization condition for reads . For the rest of the states, in general, we will have all components. Let us take , , , and . The normalization condition states that , for all . Inserting this form of states to Eq. (32) gives the following expression:
[TABLE]
We can show now that for solution maximizing the expression above we have necessarily . In order to do so, let us consider the optimization problem
[TABLE]
where maximization is carried over vectors , and . For the fixed vectors and the inner products , this optimization problem is equivalent to the original maximization of the expression in Eq. (34). Let us now construct a Lagrangian of the problem in Eq. (35),
[TABLE]
where are the Lagrange multipliers. The system of equations for stationary points of Eq. (36) contains, among others, the following three systems of linear equations
[TABLE]
We can immediately notice that the first system in Eq. (37) has nontrivial solutions if and only if the determinant of the system’s matrix is [math]. However, if this is the case, and , the second two systems have no solutions. Since we know that the optimal solution to the problem in Eq. (35) has to be among the stationary points, we can conclude that necessarily . Now we just need to check the exceptions, which are the cases when . It is easy to see that in these cases the objective function becomes either or , i.e., the maximum attained in the problem in Eq. (35) is at most . However, we know from our numerical solution that the optimal value of the expression is greater than for all .
Proving the optimality of is even easier. We can simply remove it from consideration by substituting the constraint with . Since the expression in Eq. (34), which we need to maximize, is linear in and , it means that the maximum is attained at the boundary of the subspace of and , i.e., when . This means in practice that .
We can apply use the SDP relaxation of Lasserre to the polynomial optimization problems lasserre2006convergent . We considered the second level of the Lasserre hierarchy in which case the size of the moment matrix is and the number of variables in the dual SDP is exactly . An important note, which might be of interest to those who would like to reproduce the result or solve a similar problem, is that in the optimization it was crucial to add the normalization constraints of the type and , to the obvious constraints , . If the former are not included in the SDP, the objective function in Eq. (34) turns out to be unbounded.
By performing the above SDP optimization we were able to confirm the optimality of the conditions and up to numerical precision of as well as reproduce the value of of the bound up to the same precision. This completes our proof.
Appendix C Robustness of self-testing
In this Appendix we give more details concerning the robustness of our SDI characterization of unsharp measurements. The question that we would need to answer is the following: Given the experimentally obtained value of the average success probability for some , , what can we say about the unsharpness parameter of the implemented measurements? Here we again discuss the case of trace-1 POVMs of Bob. In order to answer the posed question we first try to answer the opposite question: Given the eigenvalues of the POVMs of Bob, what is the maximal average success probability that can be obtained?
We start by fixing the CJ operator of the first element of Bob’s instrument to be
[TABLE]
where now is the operator norm of the effects of the first POVM (its largest eigenvalue). We can then apply the seesaw method of Ref. werner2001bell in order to establish the optimal form of other instruments of Bob and measurements of Charlie. From these numerical results we infer that we can make the following assumptions:
[TABLE]
where now can be different from . We confirm later that these choices of Charlie’s measurements and Bob’s instruments are indeed optimal, but for now we assume that it is true and continue with the derivations of the bound.
Using Eq. (C) we can obtain the bound on the average success probability for given and . The states can be chosen to be the eigenstates of the respective operators corresponding to the operators’ maximal eigenvalues, as it is often done in such proofs. We get the following
[TABLE]
If we fix to be some value between and , and optimize the expression in Eq. (C) w.r.t. , we find that the optimal is in fact the one given by Eq. (12). The same goes for the optimal , if is fixed. It means that, if we want to find a rectangular area in the space of for which from Eq. (C) is greater than or equal to some observed value , we should fix one of the norms, let us say , to be one given by Eq. (12), and solve the inequality with respect to . Unfortunately, solution to this inequality is too unwieldy to be written explicitly, even for this Appendix. However, since it is a single-variable problem it can be solved with numerical methods up to an arbitrary precision. It is instructive to give the solution in graphical form, which we do in the main text (see Fig. 3).
Let us now discuss the optimality of the solution in Eq. (C). We further modified the SDP techniques, described in Section V in the main text, in order to account for the fixed values of and . We consider the vectors to be of the following form:
[TABLE]
which is the general form of the vectors of CJ operators of instruments corresponding to two unbiased POVMs with effects’ norms and .
As a first modification, we add two operators to the set of operators , where are meant to be the vectors in the Schmidt decompositions from Eq. (C). Considering these operators allows us to add the following constraints to Eq. (13):
[TABLE]
On top of that, we set the upper bound on the success probability of Charlie, if dephasing channels with parameters are applied to the states of Alice. In particular, we enforce the following:
[TABLE]
After adding the constraints from Eqs. (C) and (45) to the SDP in Eq. (13) we can compare the bounds with the predictions given in Eq. (C). Since we know that for deriving the bounds on the norms we need to take one of , let us say , to be of the optimal form in Eq. (12), we just need to make a comparison for each pair . What we found, is that the modified SDP gives the exact bound from Eq. (12) up to the numerical precision (), whenever , i.e., for all less than or equal to the optimal norm. For we obtained a slight discrepancy in the bounds.
We believe that the optimal solution is of the form in Eq. (C) and the proposed bound is tight. However, for the results to be rigorous, we need to estimate the “error” in the estimation of the norms due to the fact that we could not reproduce exactly the bounds with the SDP method. In order to do so, we can use the simple error estimation formula . We have included the estimated “errors” in Fig. 3 as crosses of the respective colors corresponding to different relative discrepancies in the observed average success probability.
As a final remark we give a proof of the bound used in Eq. (45). First of all, we can notice that for the channel corresponding to one of the settings of Bob we can take . This can be easily seen if we write down the CJ operator of the channel:
[TABLE]
We can now rewrite this expression taking into account the normalization conditions, which yields the following:
[TABLE]
We are interested in finding the maximum of the following average success probability of Charlie:
[TABLE]
We can easily see that \sum_{{\vec{x}},z}\text{tr}\Big{(}\varrho^{T}_{\vec{x}}\otimes C^{z}_{x_{z}}\mathbb{1}\otimes\Big{(}(1-\lambda_{y})|q^{y}_{0}\rangle\langle q^{y}_{0}|+\lambda_{y}|q^{y}_{1}\rangle\langle q^{y}_{1}|\Big{)}\Big{)}=4. From here it follows that the resulting expression is a linear function in and . Suppose now the optimal and one of the vectors gives a larger contribution to the value of the expression in Eq. (48). In this case taking both vectors equal to the optimal one will give a larger value of , which contradicts our assumptions. It can also happen that and the values of the expressions corresponding to both are equal. In that case, however, there is no difference in taking them equal or not.
Let us now consider the channel
[TABLE]
which is the same channel as in Eq. (C), but with . We now derive the bound for each value of independently. Since we have a freedom of fixing bases in this case, we can set . We can then see that the channel we are considering is nothing but a dephasing channel of the form with parameter . We derive the bound in the following Appendix.
Success probability of QRAC under a dephasing channel
Let us consider a QRAC with two parties Alice and Charlie, Alice’s preparation states being and Charlie’s measurements . Let us assume that the states of Alice undergo a transformation, corresponding to a dephasing channel with some parameter . The average success probability which we would like to maximize is the following expression:
[TABLE]
We now prove that . First, let us expand the expression in Eq. (50) and use the normalization condition for POVMs of Charlie. We obtain the following:
[TABLE]
Now we use the argument that we made already several times in this paper, that for optimal states of Alice we have and . This leaves us with the following function to maximize,
[TABLE]
where we used the fact that the dephasing channel is unital. Now let us write the states and POVM effects in Bloch form introducing the following notation:
[TABLE]
It is easy to see that the action of the dephasing channel is in multiplying the variables , and by the factor . The expression for in terms of the Bloch coefficients reads
[TABLE]
Without loss of generality we can take , . We are then left with the following optimization problem
[TABLE]
This problem can be easily solved using the Lagrange multipliers method, giving the bound of .
Appendix D “Biased” case
Here we discuss the “biased” case, i.e., the case when , . We use to denote the spectrum of the first effects of Bob’s POVMs. The second effects then have the spectrum . These, in principle, can depend on , however, in our case they do not.
The figure of merit can be adjusted to account for the “bias” and here we consider the following expression for the average success probability
[TABLE]
with being the probability distribution , with . Notice that we do not change the distribution of for Charlie. The case corresponds to the original figure of merit in Eq. (1), with which we provided the SDI characterization of the “unbiased” case, i.e., . For , however, we noticed that the optimal eigenvalues of the first effect of Bob’s POVMs do not sum to . We summarized our findings in Fig. 5.
Let us first discuss the values of that can occur. First of all, we know that , , since the second effects have to be positive. We can also consider the case when , since we have a freedom of denoting which effect is the first or the second. The corresponding region in space of is a triangle, that is depicted with black lines in Fig. 5.
The edge of this triangle that connects the vertices and corresponds to the “unbiased” case of . If we wish to self-test POVMs corresponding to this line we take and vary between [math] and , as discussed in the main text. If we would like to “move” towards the vertex of this triangle, we need to increase the parameter from to . In Fig. 5 we have depicted with color lines what happens with optimal values of , for the given value of , if we increase the parameter . These are the results of the optimization with the seesaw method of Ref. werner2001bell . As we can see from these plots, we can cover all the area of the triangle by choosing appropriate . This supports our claims of the fact that all binary measurements on a qubit can be characterized in the proposed scenario. The robustness analysis of this characterization should be reported elsewhere.
Appendix E Comparison between SDI framework and the one used in Ref. wagner2018device
In Ref. wagner2018device the authors discuss self-testing of unbiased binary qubit measurements in the Bell scenario. Normally, self-testing in the Bell scenario is a stronger result than the similar one in the SDI framework, because one does not assume the dimension of the underlined system. We would like to argue that, first of all, our scheme can also be adapted to the device-independent framework considered in Ref. wagner2018device . Second, we consider the SDI framework to be much more natural for self-testing of unsharp measurements.
Below we propose a general, quite intuitive way to map SDI scenarios to the one considered in Ref. wagner2018device . Let us say we are interested in optimizing a linear function of probabilities , where are preparation states and are POVM effects of the measuring party. In the SDI framework we assume that , where . At this point we do not restrict the number of outcomes or the number of settings. Since we can always represent two sequential measurements as a single POVM, the following reasoning also applies to the scenario considered in the main text of this paper.
Let us now consider the same linear function as before, but now we substitute the probabilities with , where are the effects of POVMs of Alice corresponding to outcome [math] and is the state shared between the parties. If we can certify in a separate setup, as it is done in Ref. wagner2018device , that is a maximally entangled state with local dimension then . We can then relabel and return to the original optimization problem (up to a global factor ). This simple reasoning suggests that the case of maximal violation of the CHSH inequality in the scheme of Ref. wagner2018device is equivalent to the assumption on the dimension in the SDI framework.
We would also like to argue that our scheme is the simplest and also more natural for self-testing of unsharp measurements. Indeed, as clearly stated in the main text, unsharp measurements cannot be differentiated from their probabilistic realizations with PVMs in a simple two-party scenario, where the information about the post-measurement states is discarded. This concerns both the prepare-and-measure and the Bell-type scenario considered in Ref. wagner2018device . We would like to argue that the SDI framework, corresponding to the prepare-and-measure scenario is much more natural, because there is simply no way to avoid the assumption on the dimension of the Hilbert space of states that Bob sends to Charlie. If we lift this assumption, one can think about Bob and Charlie as a single party. In this case it is not possible to distinguish between two sequential binary measurements and the four-outcome POVM that one obtains by applying the Heisenberg dual of Bob’s instruments to POVMs of Charlie. This assumption, although not explicitly stated, is also present in Ref. wagner2018device . From our perspective, if one has to make the assumption on the dimension of the Hilbert space in which are defined, there is no benefit in trying to relax this assumption for the states of Alice.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1) J. M. Renes, R. Blume-Kohout, A. J. Scott, and C. M. Caves, “Symmetric informationally complete quantum measurements,” Journal of Mathematical Physics , vol. 45, no. 6, pp. 2171–2180, 2004.
- 2(2) A. Bisio, G. Chiribella, G. D’Ariano, S. Facchini, and P. Perinotti, “Optimal quantum tomography of states, measurements, and transformations,” Physical review letters , vol. 102, no. 1, p. 010404, 2009.
- 3(3) A. Chefles, “Quantum state discrimination,” Contemporary Physics , vol. 41, no. 6, pp. 401–424, 2000.
- 4(4) S. M. Barnett and S. Croke, “Quantum state discrimination,” Advances in Optics and Photonics , vol. 1, no. 2, pp. 238–278, 2009.
- 5(5) A. Acín, S. Pironio, T. Vértesi, and P. Wittek, “Optimal randomness certification from one entangled bit,” Physical Review A , vol. 93, no. 4, p. 040102, 2016.
- 6(6) P. Busch, “No information without disturbance: Quantum limitations of measurement,” in Quantum Reality, Relativistic Causality, and Closing the Epistemic Circle , pp. 229–256, Springer, 2009.
- 7(7) C. H. Bennett and G. Brassard, “Quantum cryptography: public key distribution and coin tossing.,” Theor. Comput. Sci. , vol. 560, no. 12, pp. 7–11, 2014.
- 8(8) C. A. Fuchs and A. Peres, “Quantum-state disturbance versus information gain: Uncertainty relations for quantum information,” Physical Review A , vol. 53, no. 4, p. 2038, 1996.
