Quantum measurement optimization by decomposition of measurements into extremals
Esteban Mart\'inez-Vargas, Carlos Pineda, Pablo Barberis-Blostein

TL;DR
This paper introduces an efficient numerical method for optimizing quantum measurements by decomposing them into extremals, significantly improving parameter estimation strategies in quantum metrology.
Contribution
The paper presents a novel numerical approach that leverages the convex structure of quantum measurements to optimize parameter estimation, outperforming previous methods.
Findings
Strong numerical advantage for small target errors
Effective in qubit and harmonic oscillator systems
Improves quantum measurement strategies
Abstract
Using the convex structure of positive operator value measurements and of several quantities used in quantum metrology, such as quantum Fisher information or the quantum Van Trees information, we present an efficient numerical method to find the best strategy allowed by quantum mechanics to estimate a parameter. This method explores extremal measurements thus providing a significant advantage over previously used methods. We exemplify the method for different cost functions in a qubit and in a harmonic oscillator and find a strong numerical advantage when the desired target error is sufficiently small.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Quantum measurement optimization by decomposition of measurements into extremals
Esteban Martínez-Vargas1, Carlos Pineda2, Pablo Barberis-Blostein3
1 Física Teòrica: Informació i Fenòmens Quàntics, Departament de Física, Universitat Autònoma de Barcelona, 08193 Bellaterra (Barcelona), Spain
2Instituto de Física, Universidad Nacional Autónoma de México, Universidad Nacional Autónoma de México, México D. F. 01000, Mexico
3Instituto de Investigaciones en Matemáticas Aplicadas y Sistemas, Universidad Nacional Autónoma de México, México D. F. 01000, Mexico
Abstract
Using the convex structure of positive operator value measurements and of several quantities used in quantum metrology, such as quantum Fisher information or the quantum Van Trees information, we present an efficient numerical method to find the best strategy allowed by quantum mechanics to estimate a parameter. This method explores extremal measurements thus providing a significant advantage over previously used methods. We exemplify the method for different cost functions in a qubit and in a harmonic oscillator and find a strong numerical advantage when the desired target error is sufficiently small.
Contents
1 Introduction
The goal of quantum metrology is to find limits in the precision of parameter estimation of quantum systems [1, 2, 3, 4, 5]. Recent research includes theoretical and experimental advances [6, 7]. Knowing these limits allows to know if a given measurement strategy minimizes the estimation errors. The strategy that minimizes the error will be called the optimal measurement strategy. Given a quantum state that depends on a set of parameters, the optimal measurement strategy for estimating these parameters can be used to estimate the quantum state. These strategies has applications in quantum technologies, for example they can can be used together with quantum control for quantum state manipulation [8].
To estimate a parameter of a physical setup one acquires data through measurements and the estimation of the parameter is obtained by applying a function, known as the estimator, to the data. Data has a random component. The probability distribution of measurement outcomes can be modelled using a statistical model of the experiment: the probability distribution of outcomes conditioned to the value of the parameter. Given a mathematical model of a system using quantum mechanics, the statistical model is obtained once it is decided which operator is going to be measured. Notice that we can include classical noise in the quantum mechanical description. Going from the data to the estimation of the parameters that minimizes the error is not trivial.
Given a cost function that quantifies the error, the optimal measurement strategy consist of the quantum measurement and estimator that extremizes it. But, finding the extreme of a cost function over all possible quantum measurements and estimators is not simple. When a Cramér-Rao type inequality exists, the problem can be reduced to find the extreme of another cost function over all the quantum measurements. This simplifies the problem because it is not longer necessary to maximize over the space of estimators.
It is possible, though very costly, to numerically find the maximum over all quantum measurements of cost functions. The straightforward way is to randomly sample the space of all positive operator value measures (POVMs), evaluate the cost function on this sample, and keep the maximum value obtained. This method, that we will call the random sampling method (RSM), is very inefficient because the POVM space is large.
In this paper we show how using the algorithm proposed by Sentís et al. [9] we can numerically find the maximum over all quantum measurements of some cost functions, that are useful in quantum metrology, in a way that is orders of magnitude faster than using the RSM. This paper presents a direct application of this algorithm. We used it as a method to extract extremal POVMs from a general POVM. It is easy to produce efficiently a general POVM from a random unitary matrix, however, it is not trivial to produce randomly extremal POVMs, this is where the algorithm plays an essential role.
We will call our method the random extreme sampling method (RESM). The techniques here presented can be used, for example, to find numerically the quantum van Trees information [10] or the quantum Fisher information in the case that the initial state is not pure. Together with the value of the cost function maximum, the quantum measurement that maximizes it is obtained.
We start presenting the mathematical tools, including the Crámer-Rao inequality and its quantum extensions in section 2. In the following section we present the RESM in detail (section 3). We finish the bulk of this article comparing the performance of RESM against RSM in a two-level scenario, and benchmarking the accuracy of the method against other ansatz for a harmonic oscillator in two different physical situations. We close the article with some concluding remarks in section 5.
2 Mathematical tools
We discuss how to get the best parameter estimation given a statistical model. Then we discuss how to apply this ideas for a quantum system.
2.1 Crámer-Rao inequality
Now we introduce some basic quantities needed to develop further discussion. Let
[TABLE]
be the distribution probability of the outcomes , of the random variable , conditioned to a fixed value of the real parameter . We assume that each is a set of real numbers of fixed finite size. This is the statistical model. The function is called the estimator and gives and estimation of . The estimator is unbiased when is in average correct,
[TABLE]
The uncertainty of the estimator is given by the mean squared error
[TABLE]
We say that the measurement strategy is optimal if the estimator minimizes the uncertainty. Finally, let us define the Fisher information
[TABLE]
If the estimator is unbiased (i.e. if (2) holds), using the Cauchy-Schwartz inequality, one arrives to the Cramér-Rao inequality [11, 12]:
[TABLE]
Note that depends on the choice of the specific estimator (), whereas the Fisher information depends only on the probability density function of the random variable. From the Cramér-Rao inequality, we see that the inverse of the Fisher information bounds from below the mean squared error independently of the estimator we use; the larger the Fisher information is the smaller the error bound. The best measurement strategy saturates the Cramér-Rao inequality. Fisher showed that in the limit where the number of measurements goes to infinity, the maximum likelihood estimator saturates this inequality [13].
2.2 Bayesian Cramér-Rao inequality
In general each time a measurement is done, the parameter is different. Let us give two examples. The first is a length that changes due to thermal fluctuations. The second, a parameter (such as the phase of a phase shifter) characterizing an element of an ensemble of non-identical objects. Both situations can be modelled assuming that the parameter to be measured is a random variable (with its characteristic distribution). A third case, in which a random variable approach is suitable, is when the parameter is fixed, but we have some partial knowledge contained in an a priori distribution. The Bayesian Cramér-Rao inequality can be used to decide what is the best estimator in this situation.
Lets assume now that each time we make an experiment the parameter we want to estimate is different. We model the parameter as the random variable , with outcomes , and probability distribution . The outcomes of the experiment are modelled as the random variable , with outcomes , and probability distribution . The experiment is modelled in the following way: first we take a value, , from the random variable ; the outcome of the experiment is which is taken from the random variable with distribution probability ; using we can estimate by , After repeating the experiment times, we have estimations with . The error of experiment is . The mean squared error is, after performing the experiment times, . This is the cost function we want to minimize over all the estimators . In the limit it can be written as
[TABLE]
where . It can be shown that the error is bound from below by the Cramér-Rao type inequality[14],
[TABLE]
where the generalized Fisher information, , can be written as
[TABLE]
The first term of the sum is the expectation value of the Fisher information, the second term is the Fisher information of the probability distribution of the possible values of the parameter. Note that we already know something about the parameter; this a priory knowledge is given by . As can be seen from the previous equation, the generalized Fisher information is larger than the Fisher information of the knowledge we already have of the parameter. This has a simple interpretation: we can use to estimate the parameter and measuring the system necessarily diminishes the error in the estimation of the parameter. The best strategy for measuring the outcomes of a random variable is given by the estimator, , that saturates this inequality.
In this context, we found useful [15], a review of bayesian inference in physics.
2.3 Quantum Cramér-Rao inequality
We want to find the best measurement strategy to estimate a parameter, , that appears in the Hamiltonian of a quantum system. In order to estimate the parameter we proceed as follows: we start with an initial state and let the system evolve some time, after which the state of the system is . One then measures some observable of the system; we use the result to estimate . Fixing the Hamiltonian, time and initial state, we want to know if the strategy we are using minimizes the error in the parameter estimation.
Measurements in quantum mechanics are described by the positive operator valued measure (POVM), which we briefly recall in order to fix the notation. If is a POVM parametrized by the real parameter , for each value of , is a self-adjoint operator on the system Hilbert space, they satisfy
[TABLE]
and the probability of measuring the result is
[TABLE]
Notice that can also belong to a finite set (or a combination of several discrete and continuous indices), if the number of possible outcomes is finite. The expressions throughout this article generalize replacing by .
Fixing the POVM, and thinking of (10) as the distribution probability of the outcomes [as in (1)], one can use the tools introduced in section 2.1. In particular, we can calculate the Fisher information and use the Cramér-Rao inequality to know if a given estimator is optimal. Note that there exist a dependence of the Fisher information on the POVM we choose. In order to have the lowest bound for the error we maximize the Fisher information over all the possible measurements [16]
[TABLE]
The quantity is known as the quantum Fisher information and through the Cramér-Rao inequality,
[TABLE]
tells us the minimal possible error for the best measurement strategy for estimating a parameter appearing in the Hamiltonian of a quantum system. Equation (12) holds since Cramér-Rao inequality is valid for every POVM, therefore it is valid for the one in which the maximum Fisher information is attained. The POVM that maximizes is the one that should be used to get the smallest error in the parameter estimation [1]; we call this POVM the optimal POVM. If the quantum state is pure there are analytical formulas for finding , otherwise no general formulae are know and one must rely in numerical methods.
2.4 Bayesian quantum Crámer-Rao inequality
Assume now that each time we prepare the quantum system we are going to meaure, the parameter we want to estimate is taken from a random variable with probability distribution .
The POVM, , that maximizes the generalized Fisher information, , together with the appropiate estimator, saturates the Cramér-Rao type inequality
[TABLE]
where
[TABLE]
We call the quantum Van Trees information.
If we want to minimize the error in the parameter estimation, and we codify what we know about the parameter in the probability distribution , we have to implement the quantum measurement given by [10].
3 Numerical calculations
The calculation of cost functions as or is not easy, as it implies an optimization over all POVMs. In this section we present an efficient numerical procedure to calculate the maxima, over all POVMS, of convex cost functions.
3.1 Convexity
The quantum Van Trees information is convex; this follows directly noticing that set of POVMs [17, 18] and the Fisher information are convex. Fisher information can be rewritten as , where the prime indicates derivative with respect to . It then follows that
[TABLE]
provided that [19]. For a combination with other weights, continuity and a recursive procedure imply convexity of . But then, the Van Trees information is also convex, as the integral of convex functions is also convex.
Since the maximum of the convex cost functions lies on the extremal points of all POVMs, we only need to search in this subset simplifying greatly the optimization task. A way to sample randomly such a set is presented in the following paragraphs.
3.2 The algorithm
The outline of the algorithm is as follows: We produce a random POVM, and decompose it in extremals. We then evaluate the cost function using all its extremal POVMs and choose the one which reaches the highest value. We repeat the procedure several times and keep the optimal POVM. We provide an implementation in [20].
Random POVMs
To produce a random POVM, we use backwards the purification algorithm [21] which transforms a general POVM into a usual projective measurement in an enlarged space. We start from a unitary matrix acting on a Hilbert space resulting of the product of the original Hilbert space, and an ancilla space of dimension equal to the number of outcomes of our POVM. This matrix is chosen according to the Haar measure in the enlarged Hilbert space [22]. To generate a member from such ensemble we build a matrix with Gaussian complex numbers (with equal standard deviation and zero mean). The desired matrix, , is one that diagonalizes . Each of the operators of the POVM we want to generate are defined via its matrix elements as
[TABLE]
where we are using tensor index notation for the space in which acts, the first corresponding to the original space, and the second to the additional ancilla space. Since for all POVMs one can build a unitary transformation in the extended space such that (16) holds [21], sampling all unitaries in the extended space guaranties sampling all POVMs with the corresponding number of outcomes.
Conversion to a rank-1 POVM
To proceed further, we need a rank-1 POVM, so we must transform the aforementioned POVM accordingly. Recall that a rank-1 POVM is one whose elements are all rank-1 operators. For all operators of the POVM that are not rank-1, we apply standard procedures to decompose them, for example the spectral decomposition for normal operators. Notice that the number of elements can change after this step. Let us call the number of outcomes of the rank-1 POVM obtain.
Obtaining an element of the decomposition
Let us define , and with an orthonormal traceless base for hermitian matrices of the appropriate dimension. In our case, we used the Gell-Mann matrices. We also define , so that the completeness condition over POVMS reads
[TABLE]
if we define the -dimensional vector . We now propose the linear program
[TABLE]
Even though is a solution to the problem, the usual numerical algorithms provide an extremal point, which defines an extremal POVM [18, 9]. Notice that if an element of the solution is 0, it means that we do not include the operator in the POVM. Let this extremal solution be .
To obtain the extremal POVM we start by defining via
[TABLE]
with a scalar. Requiring that can be enforced letting
[TABLE]
which in turn implies that is a probability and that for some , . If we define and , we can write
[TABLE]
Indeed, Q is an extremal POVM [9], and since one of the elements of is null, is a output POVM for which we can iterate the algorithm until a single output POVM is obtained.
Notice that with this algorithm, all POVMs with a given number of outputs can in principle be sampled.
4 Examples
In this section we apply the method described in section 3 to estimate the quantum Fisher information and the quantum Van Trees information, . We observe a big advantage in terms of numerical effort using this method compared with finding the maximum using randomly chosen POVMs.
4.1 Qubit
We consider a spin particle in a superposition pure state (see [23]),
[TABLE]
where is the phase between the two basis states and is a known parameter that characterizes the weight of each element of the superposition. The problem is the following: we want to find the best estimate strategy for the phase when is known.
Through the rest of the article we shall consider the following sets of POVMs:
[TABLE]
so each POVM is parametrized by and has two elements that corresponds to the outcomes and . In this subsection we shall consider the particular case
[TABLE]
Using Eq. (10) we obtain that the probability of measuring outcome or for the POVM is given by
[TABLE]
The Fisher information for this probability distribution is
[TABLE]
which is a function of the parameter we want to estimate, i.e. . Notice that there is a dependence on the initial state, via , and on the POVM used, via . We make this dependence on the POVM explicit via a superscript. When the state is pure, the maximum quantum Fisher information can be analytically calculated [16]; in this example the quantum Fisher information is the maximum of with respect to :
[TABLE]
In order to know if the RESM is useful, we apply the RSM and RESM methods and compare their results with the exact result Eq. (25). We define the errors
[TABLE]
where and are the Fisher information numerically calculated using the RSM and RESM respectively. In Fig. 1(left) we plot running time vs error, for the two methods. It is clear from the plot that RESM is better and the longer the program runs the better the results using RESM compared with RSM. For this example, we obtain an error two orders of magnitude smaller running the program the same time.
Now we consider that is a random variable with probability distribution ; limits to the error in the estimation of its outcomes are given by the Cramér-Rao type inequality Eq. (13). First we consider the maximization of the generalized Fisher information over the family of POVMs, , given by (21) and (22)
[TABLE]
Because we are using a subset of all the POVMS , nevertheless this approach allow us to get an analytic approximation for the quantum van Trees information. We assume that has a uniform distribution in , i.e. in (26). For a uniform superposition (), the Fisher information becomes independent of ; in fact , see (24). This implies that any POVM from the family maximizes the Fisher information. In general we obtain
[TABLE]
so we can assert that if only POVMs of the family are allowed, the best estimation is in the case where .
Now we apply RESM to calculate and compare them with , see figure 1(right). The maximum of is obtained when . That means that the lowest error in the phase estimation is obtained when the weights of the superposition are the same. The figure suggest that .
4.2 Phase estimation
We want to estimate the phase difference between two paths that light can follow, see [10] for a similar calculation. We probe the system with a coherent state, such that one path yields the state (with a complex number) and the other
[TABLE]
where is the number operator.
Assume that the object that creates the phase difference is subject to fluctuations such that the phase difference between the two paths is different each time the experiment is done. One can model this assuming that is a random variable. We consider a Gaussian distribution centered at , with standard deviation and trimmed at the edges ([math], ). Using RESM we calculated the quantum van Trees information for different values of . The results are depicted in 2 (left) as red dots. The line is obtained using (21) with as an ansatz. The figure suggest that the family of POVMS proposed is a good ansatz.
4.3 Coherent plus thermal state
As a final example, we consider estimating a parameter, chosen from a given distribution, encoded in a non-pure state. In general, there are no analytical expressions for the quantum Fisher information for this case. We calculate in order to bound the error in estimating the parameter. We build upon the last example, considering a mixture of (28) and a thermal state. Let
[TABLE]
with
[TABLE]
be the state in which the parameter () is encoded. For the right side of fig. (2), we used a Gamma distribution of the form,
[TABLE]
with and .
In figure 2 (right) we show the numerical calculations of using the algorithm RESM. We compare it with the case of an initial coherent pure state and with the ansatz composed of the two outcome POVM (21) with , see (29). As expected, is larger for a pure state: a coherent pure state is better for estimating a phase than the mixture (29).
4.4 A note for reproducing results
The code implementation can be obtained in the repository [20]. To reproduce the results presented in section 4.1, set the flag -o to Qubit and vary the flag --EtaAngle from 0 to . For the results in sections 4.2 and 4.3, set the flag -o to CohPlusTher and to CohPlusTherGamma respectively. We also set the temperature with -T 0.001, the mixing constant --MixConstant 0.5, the number of times to sample the space with -s 150, the dimension of the Hilbert space to describe the system with --HilbertDim 7 and the number of outcomes of the POVM with --Outcomedim 10. For pure state, as in section 4.2, set --MixConstant 1. The squared norm of is set with the option --MeanPhotonNumb, which can be varied to reproduce the plots. The whole data set can be obtained with the command make all.
5 Conclusions
The random extreme sampling method (RESM) can be used to find efficiently the maximum of a cost function over all possible quantum measurements. Particularly it is useful to find limits in the precision of parameter estimation, through the cost function known as the quantum Fisher information, when the state to be measured is a mixed state. It can also be used to find the optimal measurement strategy by a given convex cost function by finding the POVM that maximizes it, at a considerable lower computational cost.
6 Acknowledgements
Support by PASPA-DGAPA, UNAM, projects CONACyT 285754 and UNAM-PAPIIT IG100518, IN-107414.
References
- [1]
B. M. Escher, R. L. Matos Filho, and L. Davidovich.
Quantum Metrology for Noisy Systems.
Brazilian Journal of Physics, 41(4-6):229–247, September 2011.
- [2]
Akio Fujiwara.
Strong consistency and asymptotic efficiency for adaptive quantum estimation problems.
Journal of Physics A: Mathematical and General, 39(40):12489, October 2006.
- [3]
Vittorio Giovannetti, Seth Lloyd, and Lorenzo Maccone.
Advances in quantum metrology.
Nat Photon, 5(4):222–229, Apr 2011.
- [4]
M. Tsang.
Conservative classical and quantum resolution limits for incoherent imaging.
ArXiv e-prints, May 2016.
- [5]
Marcin Jarzyna and Rafał Demkowicz-Dobrzański.
True precision limits in quantum metrology.
New Journal of Physics, 17(1):013010, 2015.
- [6]
Géza Tóth and Iagoba Apellaniz.
Quantum metrology from a quantum information science perspective.
Journal of Physics A: Mathematical and Theoretical, 47(42):424006, 2014.
- [7]
Giorgio Colangelo, Ferran Martin Ciurana, Lorena C. Bianchet, Robert J. Sewell, and Morgan W. Mitchell.
Simultaneous tracking of spin angle and amplitude beyond classical limits.
Nature, 543(7646):525–528, Mar 2017.
Letter.
- [8]
Jeremy L. O’Brien, Akira Furusawa, and Jelena Vuckovic.
Photonic quantum technologies.
Nat Photon, 3(12):687–695, Dec 2009.
- [9]
G Sentís, B Gendra, S D Bartlett, and A C Doherty.
Decomposition of any quantum measurement into extremals.
Journal of Physics A: Mathematical and Theoretical, 46(37):375302, 2013.
- [10]
Esteban Martínez-Vargas, Carlos Pineda, François Leyvraz, and Pablo Barberis-Blostein.
Quantum estimation of unknown parameters.
Phys. Rev. A, 95:012136, Jan 2017.
- [11]
Harald Cramér.
Mathematical Methods of Statistics.
Princeton University Press, 1945.
- [12]
C. Radhakrishna Rao.
Information and the accuracy attainable in the estimation of statistical parameters.
Bull. Calcutta Math. Soc., 37:81–91, 1945.
- [13]
R. A. Fisher.
On the Mathematical Foundations of Theoretical Statistics.
Philosophical Transactions of the Royal Society of London A: Mathematical, Physical and Engineering Sciences, 222(594-604):309–368, January 1922.
- [14]
Harry L. Van Trees.
Detection, Estimation, and Modulation Theory.
John Wiley & Sons, April 2004.
- [15]
Udo von Toussaint.
Bayesian inference in physics.
Rev. Mod. Phys., 83:943–999, Sep 2011.
- [16]
Samuel Braunstein and Carlton Caves.
Statistical distance and the geometry of quantum states.
Physical Review Letters, 72(22):3439–3443, May 1994.
- [17]
Giacomo Mauro D’Ariano, Paoloplacido Lo Presti, and Paolo Perinotti.
Classical randomness in quantum measurements.
Journal of Physics A: Mathematical and General, 38(26):5979, 2005.
- [18]
Erkka Haapasalo, Teiko Heinosaari, and Juha-Pekka Pellonpää.
Quantum measurements on finite dimensional systems: relabeling and mixing.
Quantum Information Processing, 11(6):1751–1763, 2012.
- [19]
F. Leyvraz, private communication, 2017.
- [20]
https://github.com/estebanmv/Random-Sampling-Extremal-POVMs.
- [21]
M.A. Nielsen and I.L. Chuang.
Quantum Computation and Quantum Information.
Cambridge Series on Information and the Natural Sciences. Cambridge University Press, 2000.
- [22]
M. L. Mehta.
Random Matrices.
Academic Press, San Diego, California, second edition, 1991.
- [23]
O. E. Barndorff-Nielsen and R. D. Gill.
Fisher information in quantum statistics.
Journal of Physics A: Mathematical and General, 33(24):4481, June 2000.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] B. M. Escher, R. L. Matos Filho, and L. Davidovich. Quantum Metrology for Noisy Systems. Brazilian Journal of Physics , 41(4-6):229–247, September 2011.
- 2[2] Akio Fujiwara. Strong consistency and asymptotic efficiency for adaptive quantum estimation problems. Journal of Physics A: Mathematical and General , 39(40):12489, October 2006.
- 3[3] Vittorio Giovannetti, Seth Lloyd, and Lorenzo Maccone. Advances in quantum metrology. Nat Photon , 5(4):222–229, Apr 2011.
- 4[4] M. Tsang. Conservative classical and quantum resolution limits for incoherent imaging. Ar Xiv e-prints , May 2016.
- 5[5] Marcin Jarzyna and Rafał Demkowicz-Dobrzański. True precision limits in quantum metrology. New Journal of Physics , 17(1):013010, 2015.
- 6[6] Géza Tóth and Iagoba Apellaniz. Quantum metrology from a quantum information science perspective. Journal of Physics A: Mathematical and Theoretical , 47(42):424006, 2014.
- 7[7] Giorgio Colangelo, Ferran Martin Ciurana, Lorena C. Bianchet, Robert J. Sewell, and Morgan W. Mitchell. Simultaneous tracking of spin angle and amplitude beyond classical limits. Nature , 543(7646):525–528, Mar 2017. Letter.
- 8[8] Jeremy L. O’Brien, Akira Furusawa, and Jelena Vuckovic. Photonic quantum technologies. Nat Photon , 3(12):687–695, Dec 2009.
