On Parameter Estimation of Hidden Ergodic Ornstein-Uhlenbeck Process
Yury A. Kutoyants

TL;DR
This paper develops a two-step maximum likelihood estimator for unknown parameters in a partially observed Ornstein-Uhlenbeck process, demonstrating its consistency and asymptotic normality using Kalman-Bucy filtering.
Contribution
It introduces a novel two-step MLE process for joint estimation of the unobserved process and parameters in linear stochastic differential equations.
Findings
Estimator is consistent and asymptotically normal.
Recurrent estimators are constructed using Kalman-Bucy filtering.
Theoretical properties are established for large samples.
Abstract
We consider the problem of parameter estimation for the partially observed linear stochastic differential equation. We assume that the unobserved Ornstein-Uhlenbeck process depends on some unknown parameter and estimate the unobserved process and the unknown parameter simultaneously. We construct the two-step MLE-process for the estimator of the parameter and describe its large sample asymptotic properties, including consistency and asymptotic normality. Using the Kalman-Bucy filtering equations we construct recurrent estimators of the state and the parameter.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic processes and financial applications · Target Tracking and Data Fusion in Sensor Networks · Complex Systems and Time Series Analysis
On Parameter Estimation of Hidden Ergodic Ornstein-Uhlenbeck Process
Yury A. Kutoyants
Le Mans University, Le Mans, France
Abstract
We consider the problem of parameter estimation for the partially observed linear stochastic differential equation. We assume that the unobserved Ornstein-Uhlenbeck process depends on some unknown parameter and estimate the unobserved process and the unknown parameter simultaneously. We construct the two-step MLE-process for the estimator of the parameter and describe its large sample asymptotic properties, including consistency and asymptotic normality. Using the Kalman-Bucy filtering equations we construct recurrent estimators of the state and the parameter.
MSC 2000 Classification: 62M02, 62G10, 62G20.
Key words: Partially observed linear system, parameter estimation, hidden process, ergodic process, One-step MLE-process.
1 Introduction
We are given a partially observed linear system, defined by the equations
[TABLE]
where and are constants, and are two independent Wiener processes. The random variable is independent of and .
The system (1)-(2) is defined by the four parameters . Recall that the parameter can be estimated without error by continuous time observations as follows. By the Itô formula we can write
[TABLE]
Hence, for any , we have the estimator
[TABLE]
and this estimator equals the true value. Therefore we consider only the estimation of the three other parameters and . Note that the consistent estimation of the three-dimensional parameter is impossible because the observed process can be written as follows
[TABLE]
This means that the parameters and appear as product . We can have consistent estimation of two-dimensional parameters and . This possibility we discuss in the last section.
The observations are and the Ornstein-Uhlenbeck process is unobservable (hidden), i.e., we have partially observed linear model of observations.
We consider estimation of the one-dimensional parameters , and separately given the continuous time observations . The unknown parameter will be denoted by and we will assume that for some constants . In all the cases the set does not contain 0. Thus we are faced with three different problems: , and . In each problem we propose a two-step construction of asymptotically efficient estimator-process of recurrent nature. First we propose a preliminary consistent estimator based on the observations with . Then this estimator is used for construction of One-step MLE-process, which has recurrent structure. In the last section we discuss the possibilities of the joint estimation of two dimensional parameters and .
Equations (1)-(2) is a prototypical model in the Kalman-Bucy filtering theory, which provides a closed form system of equations for the conditional expectation ([1], [9],[18]). The statistical problems for discretely observed hidden Markov processes were studied by many authors (see [2], [3], [6], [7] and the references therein). However, the literature on continuous time models is limited. For the results in continuous time setup, we refer the interested reader to [13] (linear and non linear partially observed systems with small noise), [6] (continuous-time hidden Markov models estimation), [4] and [11] (hidden telegraph process observed in the white Gaussian noise).
In the present paper we are particularly interested in the asymptotic behavior of the maximum likelihood estimator (MLE) in the large sample asymptotic regime, i.e., when . The statistical problems for such observation models have been widely studied, motivated by the importance of the Kalman-Bucy filtering in engineering applications.
Let us now recall the definitions of the MLE in the case , when the other two parameters and are known. As the parameters of the model take finite values and , the measures induced by the observations (1) on the space of continuous functions on are equivalent. The likelihood ratio function ([18]) is given by the expression
[TABLE]
Then the MLE is defined by the equation
[TABLE]
This means that to calculate we need the values of the family of stochastic processes . The random process is solution of the Kalman-Bucy filtering equations (see [1], [9], [18])
[TABLE]
where . The function is the solution of the Ricatti equation
[TABLE]
Due to importance of this model in many applied problems, much engineering literature is concerned with identification of this model.
The behavior of the MLE was studied at least in three asymptotics:
- •
*Small noise in both equations * ( is fixed) [12], [13]
[TABLE]
- •
Large sample ( and are fixed) [14]
[TABLE]
- •
Small noise in observation only, , ( and are fixed) [16]
[TABLE]
In all three cases the Fisher information is different. It was also shown that the polynomial moments of the scaled estimation error converge and the MLE is asymptotically efficient.
It is evident that the numerical calculation of the MLE according to (3)-(6) is quite a difficult problem. The goal of this work is to suggest the new estimator, called One-step MLE-process , which has two advantages. First, its numerical calculation is much more simple than that of the MLE and, second, this estimator has a recurrent structure and can be used for the joint estimation of the hidden process and the parameter . Similar One-step MLE’s and Multi-step MLE-processes, introduced in [15], have been applied in the problem of parameter estimation of the hidden telegraph process [11], parameter estimation in diffusion processes by the discrete time observations [10], in the problem of frequency estimation [8], intensity parameter estimation for inhomogeneous Poisson processes [5], parameter estimation for the Markov sequences [17].
2 Preliminary estimator.
Following [11] One-step MLE process will be constructed in two steps. First we introduce a consistent and asymptotically normal preliminary estimator and then this estimator is used to define One-step MLE-process. Preliminary estimator is constructed using an asymptotically negligible amount of the observations , where .
Suppose that and introduce the statistic and the function :
[TABLE]
In the cases and the counterparts of the latter function are
[TABLE]
respectively.
In this section we consider the case only. Therefore
[TABLE]
Note that the function is strictly decreasing. Define the preliminary estimator , base the observations :
[TABLE]
Here is the root of equation and are the sets
[TABLE]
The asymptotic behavior of as is described in the following proposition.
Proposition 1
*The estimator is consistent, uniformly on compacts
, and*
[TABLE]
with some constant .
Proof. We have
[TABLE]
For the probabilities we have the estimates
[TABLE]
Therefore we have to study the asymptotics of the statistic as :
[TABLE]
where
[TABLE]
We have
[TABLE]
because and are independent.
The process can be written as
[TABLE]
Hence
[TABLE]
and
[TABLE]
Therefore
[TABLE]
Using similar calculations we obtain the estimate
[TABLE]
which allows us to prove the law of large numbers: for we have convergence in mean square
[TABLE]
and
[TABLE]
Hence
[TABLE]
The function is strictly decreasing. If we denote its inverse function as , then we have
[TABLE]
and
[TABLE]
We can write
[TABLE]
as .
If we put , then
[TABLE]
3 One-Step MLE-process. Case .
Suppose that the unknown parameter is and we have the model (7)-(8), where the process is observable and the Ornstein-Uhlenbeck process is “hidden”. We realize the asymptotically efficient estimation of the parameter in two steps. First we calculate the preliminary estimator and then using this estimator we construct the One-step MLE-process.
Recall that the equation (6) has explicit solution
[TABLE]
Here ,
[TABLE]
Therefore we have exponential convergence of to the stationary solution
[TABLE]
To simplify the exposition we suppose that ; then we have . The case with an arbitrary requires cumbersome calculations, but the main results remain intact.
The equation for in this case is
[TABLE]
Denote and , where is the true value. Then for the process we obtain the equation
[TABLE]
Here we used the innovation theorem (see [18], ??)
[TABLE]
The innovation Wiener process is defined by this equation and is independent on . With probability 1, the random process has continuous derivatives w.r.t. and derivative processes satisfy the equations
[TABLE]
The Fisher information for this model of observations is
[TABLE]
Note that has continuous bounded derivatives and is uniformly in separated from zero.
According to [15] the One-step MLE-process is introduced as follows
[TABLE]
Let us change the variables and denote .
Theorem 1
One-step MLE-process with is consistent: for any and any
[TABLE]
and asymptotically normal
[TABLE]
Proof. Consider the difference
[TABLE]
Note that as it follows from the equations (3)-(3), the Gaussian processes and have bounded variances and therefore for any we have
[TABLE]
where the constants do not depend on . We can write
[TABLE]
because
[TABLE]
Here .
Further, for the Fisher information we have
[TABLE]
This allows us to write
[TABLE]
By the law of large numbers
[TABLE]
and therefore by the central limit theorem
[TABLE]
The similar arguments allow us to write
[TABLE]
Recall that as we have stationary regime . Therefore
[TABLE]
where the integral (see, e.g., Proposition 1.23 in [14])
[TABLE]
Hence we obtained the representation
[TABLE]
Substitution of this relation into the initial representation (3) yields the final expression
[TABLE]
since .
Note that the process can be written in recurrent form
[TABLE]
and we can introduce the adaptive filtering equations as follows
[TABLE]
with the initial value . Here
[TABLE]
It will be interesting to see the behavior of the system (16)-(18) using numerical simulations.
Recall that if we put , then is One-step MLE with
[TABLE]
studied for ergodic diffusion processes in the Section 2.5 [14]. Therefore the estimator is asymptotically equivalent to the asymptotically efficient MLE defined by the equation (4). There is essential computational difference between these two estimators. The calculation of using (3)-(6) requires solving the differential equations (1)-(6) for numerous values of , which is computationally inefficient. To construct One-step MLE-process we have to calculate a simple preliminary estimator and then to solve the system (1)-(6) for just one value . The difference between these two approaches becomes even more significant in the case of multidimensional .
4 One-Step MLE-process. Case .
Suppose that the volatility is the unknown parameter and we have the equations
[TABLE]
As before all parameters do not vanish and . The volatility with and the function
[TABLE]
is strictly increasing.
The statistic , with the new notations, converges to this function
[TABLE]
Therefore we have the explicit expression for the preliminary estimator
[TABLE]
where
[TABLE]
Here the sets are defined by the similar relations
[TABLE]
As before, we have the consistency
[TABLE]
and
[TABLE]
We need the equation for and expression for Fisher information
[TABLE]
in this case. The filtering equations in the stationary regime are
[TABLE]
Therefore
[TABLE]
For
[TABLE]
where
[TABLE]
Since
[TABLE]
we obtain
[TABLE]
Therefore the Fisher information is
[TABLE]
Now we can write the One-step MLE-process as follows
[TABLE]
If we change the variables and denote , then we obtain the same assertions as in the Theorem 1:
Proposition 2
One-step MLE-process with is consistent: for any and any
[TABLE]
and asymptotically normal
[TABLE]
Proof. Similarly to (16), we have exactly the same representation for the estimator as in (14), with the only difference in the forms of and . Thus the previous proof works in this case as well.
It is possible to write the system of recurrent equations as in (16)-(18).
5 One-Step MLE-process. Case .
It is clear that the suggested estimation approach also works for the partially observed system
[TABLE]
where the unknown parameter is the drift .
The function
[TABLE]
is strictly increasing and the corresponding preliminary estimator admits the same asymptotic properties as in the preceding section.
The filtering equations are
[TABLE]
Therefore
[TABLE]
To calculate Fisher information we write the representations
[TABLE]
In the last integral we change the order of integration
[TABLE]
Introduce notations
[TABLE]
Then we can write as follows
[TABLE]
Hence
[TABLE]
Therefore the Fisher information in this problem is the function
[TABLE]
Having the preliminary estimator , expression for Fisher information and the equation for we can construct the One-step MLE-process of the same form as in (14), with replaced by .
This estimator has the same asymptotic properties: it is consistent and asymptotically normal
[TABLE]
The proof follows the same pattern as in the previous cases.
6 Discussion
The results, presented above, can be developed in several directions by means of already known approaches.
It is interesting to find preliminary estimator in the cases of unknown parameters . Of course, with one statistic it is impossible and we need at least three different statistics.
Consider the case of two-dimensional parameter or and two statistics
[TABLE]
The limits are
[TABLE]
Therefore
[TABLE]
The function
[TABLE]
is strictly increasing and , . Therefore, the parameter can be estimated with the help of the statistic :
[TABLE]
Having this estimator the second parameter, say, or can be obtained as solution of one of these equations
[TABLE]
with obvious notation. As soon as we have a consistent preliminary estimator, say, and explicit expression for the information matrix , then
[TABLE]
Recall that such processes were studied in [15]. 2. 2.
The One-step MLE-process has learning interval with . It can be interesting to have such process with shorter learning. This can be done with the help of another construction called Two-step MLE-process introduced in [15]. Let us recall this construction using the model of observation (7)-(8). The first preliminary estimator is constructed using the observations with (shorter learning interval). The second preliminary estimator-process is
[TABLE]
The Two-step MLE-process is
[TABLE]
Following the same arguments as in the proof of Theorem 2 in [15] it can be shown that
[TABLE]
where .
The learning interval can be made even shorter if . In this case we use Three-step MLE-process (see details in [15]). 3. 3.
Consider the model (7)-(8) and the estimator-process , where . Let us denote by the measure induced by the process
[TABLE]
in the measurable space of continuous on functions. It is possible to verify the weak convergence
[TABLE]
where corresponds to the Gaussian process with
[TABLE]
i.e. is a Wiener process on the interval .
The proof in similar situation can be found in [15], Theorem 1. It consists of proving convergence of the finite-dimensional distributions
[TABLE]
and the estimate
[TABLE]
where the constant does not depend on . The approach applied in the present work allows us the direct verification these two conditions.
Acknowledgment. I would like to thank P. Chigansky for useful comments.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Arato, M. (1983) Linear Stochastic Systems with Constant Coefficients A Statistical Approach. Lecture Notes in Control and Inform. Sci., 45, New York: Springer-Verlag.
- 2[2] Bickel, P.J., Ritov, Y. and Rydén, T. (1998) Asymptotic normality of the maximum likelihood estimator for general hidden Markov models. Ann. Statist. , 26, 4, 1614-1635.
- 3[3] Cappé, O., Moulines, E. and Rydén, T. (2005) Inference in Hidden Markov Models . Springer, N.Y.
- 4[4] Chigansky, P. (2009) Maximum likelihood estimation for hidden Markov models in continuous time. Statist. Inference Stoch. Processes , 12, 2, 139-163.
- 5[5] Dabye, A.S., Gounoung, A.A. and Kutoyants, Yu.A. (2018) Method of moments estimators and multu-step MLE for Poisson processes. J. Contemp. Math. Analysis , 53, 4, 187-196.
- 6[6] Elliott, R.J., Aggoun, L. and Moor, J.B. (1995) Hidden Markov Models. Springer, N.Y.
- 7[7] Ephraim, Y., Mehrav, N. (2002) Hidden Markov processes. IEEE Trans. Inform. Theory, 48, 6, 1518-1569.
- 8[8] Golubev, G.K. (1984) Fisher’s method of scoring in the problem of frequency estimation. J. of Soviet Math., 25, 3, 1125-1139.
