Optimal nonlinear filtering of quantum state
V.I. Man'ko, L.A. Markovich

TL;DR
This paper extends optimal filtering theory to quantum processes, proposing a model for indirect measurements on a qudit system to improve quantum state estimation.
Contribution
It introduces a quantum filtering equation based on an indirect measurement model for qudits, expanding classical filtering to quantum systems.
Findings
Derived a quantum filtering equation for indirect measurements
Proposed an observation model for single qudit systems
Enhanced quantum state estimation techniques
Abstract
We extend the optimal filtering equation known from the Stratonovich filtering theory on the quantum process case. The used observation model is based on an indirect measurement method, where the measurement is performed on an ancilla system that is interacted with an unknown one. Observation model for single qudit system is proposed.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Optimal nonlinear filtering of quantum state
V.I. Man’ko1,2 and L.A. Markovich2,3,4∗
1P. N. Lebedev Physical Institute, Russian Academy of Sciences
Leninskii Prospect 53, Moscow 119991, Russia
2Moscow Institute of Physics and Technology
Institutskii Per. 9, Dolgoprudny Moscow Region 141700, Russia
3Institute for information transmission problems, Moscow,
Bolshoy Karetny per. 19, build. 1, Moscow 127051, Russia
4V. A. Trapeznikov Institute of Control Sciences, Moscow,
Profsoyuznaya 65, 117997 Moscow, Russia
∗Corresponding author e-mail: [email protected]
Abstract
The aim of this work is to give an interpretation of the linear or nonlinear filtration problem as a quantization problem. Based on this, we extend the optimal filtering equation known from the Stratonovich filtering theory on the quantum process case. In contrast to the Kalman filter often used in such problems, the Stratonovich equation provides the optimal solution for nonlinear observation models. The observation model is based on an indirect measurement method, where the measurement is performed on an ancilla system that is interacted with an unknown one. The latter model is interpreted in case of a single - qudit system. Since the measured random variables and experiment are fundamentally different from each other, the obtained single - qudit observation model has a substantially different physical nature from the two-qubit one. However, in both problems, the Stratonovich optimal filter equation is equally applicable.
Keywords: Quantum state optimal filtering, quantum state estimation, Markov sequence, nonlinear process, Kalman filter, Stratonovich filtering
1 Introduction
With the growth of experimental base of quantum mechanics there is an increasing awareness of extension of classical probability theory, filtration and control results to the quantum problems. Measuring physical quantities, experimentalists face with such difficulties as the optimal measurement problem (see [6]) or the quantum filtering problem (see [7]). For example, the observable random variable may contain a mixture of the useful signal with noise. Also, the system of interest may not be available for the observation and the information about it is obtained through an observable system which is somehow coupled with the original one. In both cases, the knowledge about the system of interest must be ”filtered” from the measurements done on the observable one.
There has been recent interest in quantum filtering and state estimation problems in quantum information theory and quantum control (cf. [2, 4, 8]). Last experimental and theoretical advances in quantum technology provide a strong motivation for the quantum control including the engineering of quantum states, the stability theory, the quantum error correction, the robust control and quantum networks (cf. [3, 53, 5, 18, 22]). The quantum control plays a fundamental role in the development of new quantum technologies like the quantum computation. For example, the quantum filtering in coherent states is introduced in [19] and the tomographical approach is used for the signal recognition and denoising in a number of recent papers (cf. [1, 46, 43]).
For quantum state estimation, is known, one has to give the measurement strategy that is used to get information, and the estimator mapping the measurement data to the state space. In this paper, we consider the weak measurement model, where unobservable quantum system is coupled with an ancilla system (a probe system) that can be measured. A von Neumann measurement is used, described by a set of projection operators . Each operator describes what happens on one of possible outcomes of the measurement. The method is based on a set of copies of the same system being in the same state. The measurement scheme described above is based on two-qubit systems. This is due to a fast development of quantum technologies such as quantum computers, quantum cryptography and teleportation. These technologies promise in the future to lead to a revolution in computation and communication.
The quantum computing is built by elementary processing elements, namely, quantum bits (qubits). It is known that classical computers elements (bits) take only two values, logic zero and logic one. On the contrary, the qubits as quantum objects can be located also in a coherent superposition of these two states. Thus, they describe the intermediate state, between the logic zero and one. By measuring the qubit, we get zero or one with some probabilities. The quantum computers will be able in a finite time to solve problems, to the solution of which the classical supercomputers will take much more time. An implementation of such computer has several obstacles. Quantum states of ions, electrons and Josephson junctions used as qubits are extremely unstable and cannot be kept in the same state for a long time. For the implementation of computational algorithms one needs a set of interacted qubits in some particular state. In recent years, the stability of the states of qubits has been increased, but its improvement is still extremely complicated problem, especially for the multipartite systems with a large number of qubits.
One way to solve this problem is to use single - qudit systems as the quantum objects instead of qubits (cf. [28, 29]). Such single - qudit systems have more states than the two-qubit ones. Thus, the number of particles (or subsystems) in the system is greatly reduced. A wide number of papers exist devoted to the study of various kinds of characteristics of quantum correlations in systems with subsystems such as the two-qubit systems. Such quantum system can have a correlation responsible for the entanglement phenomenon (cf. [23]) and for the violation of Bell’s inequality (cf. [13]). These correlations may also be responsible for the quantum discord (cf. [40, 44]). However, in literature the systems without subsystems (i.e. one qutrit or qudit) are payed less attention. Recently, in [14, 15, 41, 42] it was shown that the quantum properties of systems without subsystems can be formulated using the method of an invertible mapping. In [42] the entanglement concept and correlations in the single - qudit state are discussed. Using the latter mapping, the notion of the separability and the entanglement is extended in [45] to the case of the single - qudit -state with .
Thus, the state space model for a particular quantum system based on the weak measurements can be extended to the case of a single - qudit system, where the ”artificial” unobservable qubit and ancilla qubit are introduced. The proposed observation models both for the qubits and for the one qudit systems are often nonlinear. Therefore, the widely known Kalman filtering method [25] is not applicable in this case. Hence, a method is required that performs good both in linear and nonlinear cases.
The problems related to the quantum state estimation and the quantum state filtering are fundamental for the quantum information theory and quantum control. In [51] the well-known procedure in the field of classical control theory, namely, the Kalman filtration (see [25]) has been applied to the quantum filtering area. The problem of the filtering of unknown signals from the mixture with noise is well studied in classical probability theory. The Kalman filter provides the optimal solution for the linear recursive model of the observation. However, for nonlinear models that appear often in practice the Kalman filter is not applicable.
As it was shown in [36, 20] the quantum states can be described by the fair probability distribution function called the quantum tomogram. It means that density matrices or vectors in the Hilbert spaces can be mapped onto probability distributions. In view of this filtration procedures known in classical probability theory can be applicable to the quantum problem.
1.1 Contributions of this paper
The paper is aimed as a conceptually simple and intuitive approach to the Stratonovich’s optimal filtration procedure. We deal with both the linear and nonlinear filtering, but final results concern the nonlinear case. First, we state the problem of filtering of a partially observable Markov quantum measurement process, where the optimal estimator of an unobserved component is based on observations. To this end, the general filtering equation introduced in [52, 16, 17] is used. In the author’s paper [37] it is proved that the general filtering equation that gives the optimal Bayesian estimator is the Kalman filter in the case of linear model. Note that the general filtering equation does not contain explicit probabilistic characteristics of the unknown unobservable sequence. We find the optimal state estimate by only observable quantities. Our second aim is to propose a quantum measurement model for the single - qudit state. The optimal filtering method proposed for the two-qubit models can be extended to the latter case. The construction of such type of observation models is useful in the light of possible practical use of the single - qudit systems. For example, one can measure the population on only certain levels of a multi-level atom, while other levels are not available to measure. This may be due to their short lifetime or a diversity in the frequency band reception. Then the measurement scheme described below allows to extract information about inaccessible levels by measuring only available ones.
The paper is organized as follows. In Sec. 2 we give a brief overview of results known for the classical nonlinear filtering problem. The Stratonovich’s optimal approach for nonlinear processes is discussed in details. In Sec. 3 we recall the notion of the weak measurement for the system of two qubits. We state the problem of filtering of the partially observable two-qubit system and the nonlinear measurement scheme is constructed. The Stratonovich’s approach used for the classical filtering problem is interpreted as a quantization problem. It is shown that by means of the optimal filtering equation one can find the optimal solution of the nonlinear quantum filtering problem without linearization procedures, see [51] among others. In the next section the method of the invertible mapping is used to obtain the non-linear measurement model for the single - qudit system. The physical meaning of correlations in such system and the construction of possible experiment are given.
2 Filtration problem
Here, we briefly present the results known for the classical nonlinear filtering problem (cf. [52, 16, 17]). Suppose that a partially observable Markov random process , where the sequence is unobservable and the sequence is observable, is statistically coupled by the conditional density . The conditional density of observations depends on the observation model and the distribution of the observation noises . Hence, depending on the form of the observation model and the noise distribution one has different classes of filtering problems. In this paper, we consider the exponential family of the conditional densities that are commonly used in practice, i.e.
[TABLE]
where is a normalization constant and are known functions.
We proceed to construct the filtering algorithm for an unobservable random sequence , based on observations of the sequence that are connected by the following nonlinear expression
[TABLE]
where is an independent identically distributed random sequence, is a Markov sequence and is some reversible function. Realizations of the random variables and are denoted by and , respectively. The assumption (1) imposes some restrictions on and . There must be a unique solution of (2), i.e.
[TABLE]
and the probability density of noise must be such, that after the substitution, the conditional density were from the exponential class.
In case when (2) has the recursive linear form
[TABLE]
where for all , and are mutually independent random variables with the standard Gaussian distribution,
[TABLE]
coefficients are given by real numbers and , then the Kalman filter is applied as an optimal filtration method [25]. The Kalman estimator is optimal with regard to the minimum of the risk function
[TABLE]
and is equal to the mathematical conditional expectation . However, for all the other cases, the Kalman filter doesn’t provide an optimal solution to the filtration problem and cannot be applied.
The optimal approach for nonlinear processes is proposed in [52]. To estimate the optimal Bayesian estimator in the form of the conditional mean
[TABLE]
has been used. The is the posterior probability density function that satisfies the Stratonovich’s recurrence equation (cf. [56, 31]) given by
[TABLE]
Here, we denote the transition probability density function of the Markov sequence as and , denote conditional densities.
Since the posterior density depends on the unknown prior distribution function and the transition probability of the Markov sequence , we cannot use formula the latter formula to estimate .
To overcome this problem the optimal filtering equation is proposed in [16, 17]. The equation is derived under the assumption that the conditional density belongs to the exponential family of distributions.
Remark 2.1**.**
Note that this assumption is not necessary. One can derive an analogical filtering equation for other classes of densities.
The optimal filtration equation is (cf. [17])
[TABLE]
It is easy to see that equation (7) does not contain explicit probabilistic characteristics and of the unknown sequence . This allows us to find the optimal estimator (6) (that minimizes the mean squared deviation from the true value of an unobservable ) knowing only observable quantities of . However, the latter equation contains the logarithmic derivative of the unknown conditional density which characterizes the signal. The logarithmic probability density function (pdf) derivative is the ratio of the derivative of the pdf to the pdf itself. To estimate them the gamma product kernel estimators by multivariate dependent data are introduced by Markovich in [39] (see example 4.3 on p.14).
One can take the Gaussian density
[TABLE]
as an example of exponential family (1). Then the observation model is defined by the linear equation
[TABLE]
where are independent identically distributed random variables with the Gaussian distribution and coefficients and are real numbers. Hence, equation (7) can be rewritten in the special form
[TABLE]
which is the Kalman filter.
Remark 2.2**.**
The exact coincidences of the triple: the optimal filtering equation (7) of the unobservable Markov sequence defined by a linear equation with a Gaussian noise, the Kalman filter and the conditional expectation defined by Theorem of normal correlation [32] is proved by the author in [37]. Thus, the optimal filtering equation is nothing else but the Kalman filter in case of linear model (4). However, the general filtering equation provides the optimal solution for nonlinear processes in contrast to the Kalman filter that cannot be applied to nonlinear models.
3 The weak measurement
Obviously, the approach introduced above can be applied to arbitrary Markovian pair, regardless of the nature of the measurements. Therefore, we want to apply the latter theory to quantum filtration. In this article, we consider the discrete time case of the indirect measurement (see [51]). We suppose that the unobservable and the observable measurements of quantum systems are quantum bits. Under the weak measurement it means that the projective measurements are done on the extra ancilla system (the probe or the measurement device) that is in state coupled with the system that we are interested in. Two Bloch’s vector representations of the latter states are
[TABLE]
where the and are symbolic vectors constructed from Pauli operators acting on Hilbert spaces and , respectively. The indirect measurement is proceeded by the following way. At the time step we prepare the ancilla qubit in a known state and couple it to an unknown system. The composite system is represented by the 4-dimensional square density matrix . Let us take it as a direct product of two latter states, i.e. . Both qubits evolve at sampling time according to bipartite dynamics. At the end we do the von Neumann measurement on the ancilla qubit. Generally, the von Neumann measurement is the measurement of Pauli operators. The algorithm is repeated at the next time step .
For example, if we are interested in the measurement of the observable , then possible outcomes are its eigenvalues . The probabilities of two different outcomes
[TABLE]
of the von Neumann measurement are the following
[TABLE]
where , . Similarly, if we consider the observable its eigenprojections are
[TABLE]
The evolution of the system is controlled by the unitary operator with the matrix . The state of the composite system after the interaction is the following
[TABLE]
and the reduced density matrix of the system we are interested in is
[TABLE]
The states after the measurement or are
[TABLE]
where are given by (16) or (21), respectively.
Example 3.1**.**
The evolution matrix can be taken as (see [51]), where is the coupling parameter and is the sampling time. For the measurement and , the probabilities of two different outcomes are
[TABLE]
The post measurement states are the following
[TABLE]
Since the probability of the new state depends on both measurements and we can retrieve the information about the useful state using only the observable measurement.
Example 3.2**.**
Let us observe the special case when , coupled with the system . Their density matrices are
[TABLE]
The composite system is represented by
[TABLE]
where , , , . Let us select the unitary evolution matrix of the following view
[TABLE]
Hence, after the evolution step
[TABLE]
for the observable we use (22) and write
[TABLE]
If one can rewrite the latter measurement as
[TABLE]
It is clear that if the initial state is decohered and the evolutionary matrix is chosen such that at each step the matrix is diagonal, then we get the distribution function and its evolution at each step of the measurement. We can describe it in classical manner and apply the results known from the filtration theory discussed in Sec. 2.
3.1 Filtering of unknown quantum signal
In the following, we assume that the Bloch’s vector of unobservable qubit is , where is the time step. In [51] the observable ancilla qubit was characterized by the constant parameter and, repeating the given example, the second row in (26) can be written as
[TABLE]
Let us rewrite the latter process under the assumption that is small enough. Then the process will be the following
[TABLE]
or if we are interested in the system change only after time steps we get
[TABLE]
where , hold. The plus and the minus outcomes are denoted as and , respectively.
Remark 3.1**.**
We can get the whole class of processes like (55) depending on the choice of the evolution matrix .
In [51] the latter equation was rewritten under the assumption that is small and
[TABLE]
holds. This means that we can rewrite the measurement process as
[TABLE]
where the measurement noise is .
It is easy to see that we have the partially observable Markov random pair , where the role of the unobservable sequence is played by the Bloch’s vector of unobservable qubit and the observable sequence is obtained from von Neumann measurements. The connection between these variables is given by the nonlinear equation (55). Thus, a quantum observation model (3.1), that is mathematically absolutely identical to (2), is constructed. Further, it is possible to solve the problem of filtering for a given observation model without thinking about the quantum nature of the observed and unobserved random sequences.
Statement 3.1**.**
If the quantum systems is characterized by the nonlinear state equation (2), where the observable sequence is obtained by means of the measurements performed on the ancilla qubit and is the unobservable sequence of measurements on the unknown system , the optimal Bayesian estimator of the unknown that minimizes the risk function (5) is provided by the optimal filtering equation (7).
Remark 3.2**.**
In [51] the distribution of is approximated by the Gaussian distribution and the Kalman-like filter is applied. However, in general this approach does not provide the optimal solution to the filtration problem. We have to use the optimal filtering equation (7) or its analogs for the non-exponential families of the densities since it gives the optimal solution of the filtration problem in both linear and non-linear cases as for linear as for non-linear problems.
Example 3.3**.**
Let the quantum systems be characterized by the nonlinear state equation of the following form (see [16])
[TABLE]
where is a known differentiable function and the measurement noise is . Then our process looks like
[TABLE]
As a mathematical example, we assign the observable distribution as
[TABLE]
where is a degree of freedom (a known number). Then the conditional density can be represented in the exponent form (1), where
[TABLE]
where is an invertible function and the optimal filtering equation (7) can be used to find the optimal Bayesian estimator .
4 Single - qudit observation model
The above scheme of measurements is constructed on the basis of two qubits, i.e. the system with subsystems. This is due to the fact that the quantum correlations and the entanglement phenomenon in the composite quantum systems are viewed as a promising resource for the quantum technologies. However, the development of a universal toolbox for efficient control for large quantum systems scalable with respect to number of subsystems is still a challenging problem of quantum science. Recently, possibilities of using noncomposite quantum systems as a potential resource for the quantum information theory and in realizations of quantum technologies have been discussed (cf. [41, 26]). The experimental demonstration of non-classical properties of noncomposite systems on the basis of a photonic qutrit is obtained in [47]. For the single - qudit system with the spin the information and entropic characteristics have been analyzed in [45].
Thereby the information properties of quantum states and their characteristics could be associated not only with the composite quantum systems but also with the noncomposite ones. Thus, all the quantum characteristics known for systems with subsystems can be mapped on the case of the systems without subsystems [41, 26, 45, 34, 35]. Then the single - qudit systems can be applicable in quantum technologies and therefore there is a need to construct the measurement model for such kind of systems.
Motivated by this problem we try to extend the weak measurement scheme based on two qubits to the system of one single - qudit. The model can be formulated as follows. A single - qudit is the system of interest. The measurements are still produced on the ancilla prepared qubit. However, such a model is similar to the two-qubit one and it is obvious that the Stratonovich’s optimal nonlinear filtering method is applicable. We will consider another case. Our experiment is built in such a way that there is a multi-level system in which some of the levels are accessible to measurements, and some are not. The question is, can we ”filter” the information somehow at the inaccessible levels by the accessible ones?
Let the quantum state in four dimensional Hilber space be described by the density matrix
[TABLE]
such that , and its eigenvalues are nonnegative. The density matrix can describe the single - qudit system. To this end we use the invertible mapping method introduced in [14, 15, 41]. It was observed that quantum properties of the systems without subsystems can be formulated using the invertible map of integers onto the pairs (triples, etc) of integers , (or semiintegers). Thus, we rewrite using the following invertible mapping of indexes , , , as
[TABLE]
The matrix saves the standard properties of the density matrix, i.e. , hold and its eigenvalues are nonnegative. Using the partial trace two ”artificial subsystems” (the ”artificial qubits”) can be introduced as
[TABLE]
[TABLE]
Then we think that the first ”artificial subsystems” corresponds to the unobserved qubit of interest and the second ”artificial subsystems” corresponds to the ancilla qubit.
Hence, one can think that the density matrix describes the single - qudit state. Applying the latter method, the two ”artificial subsystems” and can be constructed such that . Then one can construct the observation model as for the two-qubit system. However, it does not contain the real observable ancilla qubit and the unobservable one. The described method gives the possibility to extend many known quantum characteristics such as entanglement and steering to systems without subsystems (see [34, 35]). Note that we started our reasoning with the density matrix of a general form. Thus, a similar construction holds for any qudit density matrix. It is possible to construct other partitions into artificial subsystems and thus construct other measurement models for a system of one qudit. The difference is that in this case the role of the ancilla system is played by the accessible energy levels of the multilevel atom and the system of interest are unaccessible levels. The filtering method provides an elegant solution of getting information about unaccessible levels by accessible levels. Thus, applying the filtration method to the single - qudit system, we solve a completely different problem with another physical meaning and the observable random variables.
4.1 Physical meaning of the ”artificial qubits”
Let us start from the short example of two coins which can drop on the first () or on the second () side. Hence, there are two random variables , and four events with probabilities . Let be the joint probability function of these two random variables. Their marginal probability functions can be defined as
[TABLE]
Hence, the correlation between the two observations is given by
[TABLE]
However, we can be interested not in the whole system, but only in cases when the first coin falls on the first side and the second coin is not interesting for us. We have two new events with probabilities and . Analogically, if we are interested only in the second coin side, we have two new events with probabilities and . Outcomes and are correlated. The latter example shows the existence of correlations in the systems without subsystems.
If we have a single - qudit system with the spin , we can write the sample space with four events and values of spin projections with probabilities . Then we have one four-level atom, e.g., corresponds to the case when the highest (fourth) level is filled. If we are interested only in outcomes when the fourth or the second levels of the four-level atom are filled, then we can assume that we have a new set of two outcomes with probabilities , . If we are interested only in outcomes when the fourth and the third levels are filled, then we have another set of outcomes and their probabilities are , . The outcomes and are correlated. Hence, correlations in single - qudit systems are between different combinations of outcomes.
Remark 4.1**.**
With respect to our problem, the experiment could be designed so that we can measure the population only on certain levels, while others are not available to measure. This may be due to their short lifetimes or a diversity in the frequency band reception. For example, we can measure only the first and the second levels of the four-level atom and the third and the fourth levels are unobservable. Then we can think about the observable levels as about ”artificial ancilla qubit” and about other two levels as about ”artificial unobservable qubit”. Thus, we can construct the observation model just like for the real two-qubit system.
5 Conclusion
To conclude let us point out the main results of our work. Using the known state space model of a particular quantum system based on the weak measurements we demonstrated how to apply classical filtration methods in this case. Since the Kalman filter approach does not give the optimal solution for nonlinear models, we propose to use for nonlinear quantum models the general filtering equation. The latter gives the optimal Bayesian solution. In contrast to the known state estimation methods we do not need any simplifications or linearizations of the state space model.
Moreover, using the invertible map of indices we extend the observation model based on the indirect measurement known for the two-qubit system to the single - qudit system. This observation model is useful in case when it is experimentally impossible to do measurements at all levels of the multilevel system. Then we can ”filter” the information at inaccessible levels by conducting measurements on observable ones. Therefore, having the nonlinear observation model one can forget about its physical nature and apply the filtering method described above.
The main advantage of the proposed nonlinear filtration method in both classical and quantum models is its optimality. The general filtering equation gives better result in comparison with any linearization methods and all Kalman-like filters because of the proven optimality in the sense of minimal Bayesian risk. The proposed method has drawbacks, such as the dependence on the unknown conditional density . This can be estimated, for instance, using the nonparametric kernel density estimation methods (see [38, 39]). A possible development of the introduced ideas can be a consideration of qudits and the construction of a nonlinear observation model for such a system with the further application of a nonlinear filter.
Acknowledgements
L.A.M. was partly supported by the Russian Foundation for Basic Research, grant 13-08-00744 A
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Aguirre, C. and Mendes, R. V. (2014). Signal recognition and adapted filtering by non-commutative tomography. IET Signal Processing. 8(1), 67–75.
- 2[2] Armen, M. A., Au, J. K., Stockton, J. K., Doherty, A. C. and Mabuchi, H. (2002). Adaptive homodyne measurement of optical phase. Phys. Rev. Lett. 89, 133602.
- 3[3] Ahn, C., Doherty, A. C.and Landahl, A. J. (2002). Continuous quantum error correction via quantum feedback control. Phys. Rev. A. 65, 042301.
- 4[4] Barchielli, A. (1990). Direct and heterodyne detection and other applications of quantum stochastic calculus to quantum optics. Quantum Opt. 2, 423–441.
- 5[5] V. P. Belavkin (1983). Towards the theory of control in observable quantum systems. Automat. and Remote Control 44, 178–188.
- 6[6] S.C. Edwards, V.P. Belavkin (2005). Optimal Quantum Feedback Control via Quantum Dynamic Programming. quant-ph/0506018 University of Nottingham.
- 7[7] V. P. Belavkin (1980). Quantum filtering of Markov signals with white quantum noise. Radiotechnikai Electronika 25, 1445–1453.
- 8[8] Bouten, L., van Handel, R. and James, M. R. (2007). An introduction to quantum filtering. SIAM J.Control Optim 46, 2199–2241.
