Consistency of the plug-in functional predictor of the Ornstein-Uhlenbeck process in Hilbert and Banach spaces
J. \'Alvarez-Li\'ebana, D. Bosq, M. D. Ruiz-Medina

TL;DR
This paper establishes the consistency of the maximum likelihood estimator and the plug-in predictor for the Ornstein-Uhlenbeck process in both Hilbert and Banach space frameworks, advancing functional prediction methods.
Contribution
It provides new theoretical results on the consistency of estimators and predictors for the Ornstein-Uhlenbeck process in infinite-dimensional spaces.
Findings
Consistency of the maximum likelihood estimator is proven in Hilbert and Banach spaces.
The plug-in predictor based on the estimator is shown to be consistent.
The results extend functional prediction theory to broader infinite-dimensional settings.
Abstract
New results on functional prediction of the Ornstein-Uhlenbeck process in an autoregressive Hilbert-valued and Banach-valued frameworks are derived. Specifically, consistency of the maximum likelihood estimator of the autocorrelation operator, and of the associated plug-in predictor is obtained in both frameworks.
| Parameter | ||||||
|---|---|---|---|---|---|---|
| Parameter | |||
|---|---|---|---|
| Parameter | ||||||
|---|---|---|---|---|---|---|
| Hilbert-valued case | Banach-valued case | |||||
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Consistency of the plug-in functional predictor of the Ornstein-Uhlenbeck process in Hilbert and Banach spaces
Javier Álvarez-Liébana1, Denis Bosq2 and M. Dolores Ruiz–Medina1
Summary
New results on functional prediction of the Ornstein-Uhlenbeck process in an autoregressive Hilbert-valued and Banach-valued frameworks are derived. Specifically, consistency of the maximum likelihood estimator of the autocorrelation operator, and of the associated plug-in predictor is obtained in both frameworks.
Published in Statistics & Probability Letters 117:12–22. DOI: doi.org/10.1016/j.spl.2016.04.023
1 Department of Statistics and O. R., University of Granada, Spain. 2 LSTA, Université Pierre et Marie Curie–Paris 6, Paris, France.
E-mail: [email protected]
Key words: Autoregressive Hilbertian processes; Banach-valued autoregressive processes; consistency; maximum likelihood parameter estimator; Ornstein-Uhlenbeck process.
1 Introduction
This paper derives new results in the context of linear processes in function spaces. An extensive literature has been developed in this context in the last few decades (see, for example, Bosq [2000], Ferraty and Vieu [2006], Ramsay and Silverman [2005]; among others). In particular, the problem of functional prediction of linear processes in Hilbert and Banach spaces has been widely addressed. We refer to the reader to the papers by Bensmain and Mourid [2001], Bosq [1996, 2002, 2004, 2007], Guillas [2000, 2001], Mas [2002, 2004, 2007], Mas and Menneteau [2003a], Menneteau [2005], Labbas and Mourid [2002], Mokhtari and Mourid [2003], Mourid [2002, 2004] Rachedi [2004, 2005], Rachedi and Mourid [2003], Dedecker and Merlevède [2003], Dehling and Sharipov [2005], Glendinning and Fleet [2007], Kargin and Onatski [2008], Ruiz-Medina [2012], Pumo [1998], Marion and Pumo [2004] and Turbillon et al. [2007, 2008]; and the references therein. In the above–mentioned papers, different projection methodologies have been adopted in the derivation of the main asymptotic properties of the formulated functional parameter estimators and predictors. Particularly, Bosq [2000], Bosq and Blanke [2007] apply Functional Principal Component Analysis (FPCA); Antoniadis and Sapatinas [2003], Antoniadis et al. [2006], Laukaitis and Vasilecas [2009] propose wavelet–bases–based estimation methods. Applications of these functional estimation results can be found in the papers by Damon and Guillas [2002], Antoniadis and Sapatinas [2003], Laukaitis [2008], Hörmann and Kokoszka [2011], Ruiz-Medina and Salmerón [2009]; among others.
We here pay attention to the problem of functional prediction of the Ornstein–Uhlenbeck (O.U.) process (see, for example, Uhlenbeck and Ornstein [1930], Wang and Uhlenbeck [1945], for its introduction and properties). See also Doob [1942] for the classical definition of O.U. process from the Langevin (linear) stochastic differential equation. We can find in Kutoyants [2004], Liptser and Shiraev [2001] an explicit expression of the maximum likelihood estimator (MLE) of the scale parameter characterizing its covariance function. Its strong consistency is proved, for instance, in Kleptsyna and Breton [2002]. We formulate here the O.U. process as an autoregressive Hilbertian process of order one (so–called ARH(1) process), and as an autoregressive Banach–valued process of order one (so–called ARB(1) process). Consistency of the MLE of is applied to prove the consistency of the corresponding MLE of the autocorrelation operator of the O.U. process. We adopt the methodology applied in Bosq [1991], since our interest relies on forecasting the values of the O.U. process over an entire time interval. Specifically, considering the O.U. process on the basic probability space we can define
[TABLE]
satisfying
[TABLE]
with
[TABLE]
for , where is a standard bilateral Wiener process (see Supplementary Material 5). Thus, satisfies the ARH(1) equation (2) (see also equation (4) below for its general definition). The real separable Hilbert space is given by where is the Borel -algebra generated by the subintervals in is the Lebesgue measure and is the Dirac measure at point The associated norm
[TABLE]
establishes the equivalent classes of functions given by the relationship if and only if
[TABLE]
with
[TABLE]
where, as before, is the Dirac measure at point . We will prove, in Lemma 1 below, that constructed in (1) from the O.U. process, satisfying equations (2)–(LABEL:A2:40), is the unique stationary solution to equation (2), in the space admitting a MAH() representation. Similarly, in Lemma 4 below, we will prove that , constructed in (1) from the O.U. process, satisfying equations (2)–(LABEL:A2:40), is the unique stationary solution to equation (2), admitting a MAB() representation, in the space the real separable Banach space of continuous functions, whose support is the interval with the supremum norm.
The main results of this paper provide the almost surely convergence to of its MLE , in the norm of the space of bounded linear operators in the Hilbert space (respectively, in the norm of the space of bounded linear operators in the Banach space ). The convergence in probability of the associated plug–in ARH(1) and ARB(1) predictors (i.e., the convergence in probability of to in and respectively) is proved as well.
The outline of this paper is as follows. In Appendix 2, the main results of this paper are obtained. Specifically, Appendix 2.1 provides the definition of an O.U. process as an ARH(1) process. Strong consistency in of the estimator of the autocorrelation operator is derived in Appendix 2.2. Consistency in of the associated plug–in ARH(1) predictor is then established in Appendix 2.3. The corresponding results in Banach spaces are given in Appendix 2.4. For illustration purposes, a simulation study is undertaken in Appendix 3. Final comments can be found in Appendix 4. The basic preliminary elements, applied in the proof of the main results of this paper, and the proof of Lemma 1, can be found in the Supplementary Material 5.
2 Prediction of O.U. processes in Hilbert and Banach spaces
In this section, we consider to be a real separable Hilbert space. Recall that a zero–mean ARH(1) process , on the basic probability space satisfies (see Bosq [2000])
[TABLE]
where denotes the autocorrelation operator of process Here, is assumed to be a strong–white noise; i.e., is a Hilbert–valued zero-mean stationary process, with independent and identically distributed components in time, with for all
O.U. processes as ARH(1) processes
As commented in Appendix 1, equations (1)–(LABEL:A2:40) provide the definition of an O.U. process as an ARH(1) process, with The norm in the space of with introduced in (LABEL:A2:40) and is given by
[TABLE]
for each . The following lemma provides, for each the exact value of the norm of in the space of bounded linear operators on As a direct consequence, the existence of an integer such that for is also derived for
Lemma 1
Let us consider and satisfying equations (1)–(LABEL:A2:40). For each the uniform norm of is given by
[TABLE]
Furthermore, for
[TABLE]
where denotes the closest upper integer of for every
The proof of this lemma can be found in the Supplementary Material 5 provided.
Remark 1
From equation (6), applying [Bosq, 2000, Theorem 3.1], Lemma 1 implies that constructed in (1) from an O.U. process, defines the unique stationary solution to equation (2) in the space admitting the MAH() representation
[TABLE]
Remark 2
Note that, for all and
Functional parameter estimation and consistency
We now prove the strong consistency of the estimator of operator in with, as before, and denoting the MLE of based on the observation of an O.U. process on the interval with Note that, from equation (LABEL:A2:40), for all and for a given sample size
[TABLE]
where the MLE of is given, for by
[TABLE]
with being the observed values of the O.U. process over the interval Thus, is introduced in an abstract way, since it can only be explicitly computed, for each particular function considered. However, the norm is explicitly computed in equation (8) below.
The following results will be applied in the proof of Proposition 1.
Lemma 2
If , it holds that
[TABLE]
The proof of this lemma is given in the Supplementary Material 5.
Theorem 1
(See also [Kleptsyna and Breton, 2002, Proposition 2.2] and [Kutoyants, 2004, p. 63 and p. 117]). The MLE of defined in equation (7) is strongly consistent; i.e.,
[TABLE]
The proof follows from the Ibragimov–Khasminskii’s Theorem.
Proposition 1
Let be the space Then, the estimator of operator based on the MLE of , is strongly consistent in the norm of ; i.e.,
[TABLE]
Proof. The following straightforward almost surely identities are obtained:
[TABLE]
[TABLE]
where the last identity is obtained in a similar way to equation (5) in Lemma 1 (see Supplementary Material 5).
From Lemma 2 and equation (8), for sufficiently large, we have
[TABLE]
The strong–consistency of in directly follows from Theorem 1 and equation (9).
Remark 3
From [Kleptsyna and Breton, 2002, Proposition 2.3] (see also Theorem 2 below), the MLE of satisfies
[TABLE]
In addition, from equation (9), considering
[TABLE]
[TABLE]
with
[TABLE]
Therefore, the functional parameter estimator is –consistent.
Consistency of the plug–in ARH(1) predictor
Let us consider the plug–in ARH(1) predictor constructed from the MLE of in Proposition 1, given by
[TABLE]
Corollary 1 below provides the consistency of given in equation (12), from Proposition 1 by applying the following lemma and theorem.
Lemma 3
Let be a sequence of random variables such that
[TABLE]
and let be another sequence of random variables such that
[TABLE]
Then,
[TABLE]
where, as usual, indicates convergence in probability.
The proof of this lemma can be found in the Supplementary Material 5.
Theorem 2
Let be the MLE of defined in equation (7), with Hence,
[TABLE]
In particular,
[TABLE]
The proof of this result is given in [Kleptsyna and Breton, 2002, Proposition 2.3].
Corollary 1
Let be the Hilbert space introduced above. Then, the plug–in ARH(1) predictor (12) of an O.U. process is consistent in ; i.e.,
[TABLE]
Proof. By definition,
[TABLE]
From equations (8)–(9) and (14), we then obtain, for sufficiently large,
[TABLE]
Let us set
[TABLE]
with for every From Theorem 1,
[TABLE]
Hence, to apply Lemma 3, we need to prove that
[TABLE]
From the Chebyshev’s inequality and Theorem 2, we get, for all
[TABLE]
Therefore, from Lemma 3, we obtain the convergence in probability of to zero.
Prediction of O.U. processes in
As before, let be now the Banach space of continuous functions, whose support is the interval , with the supremum norm, denoted as The following lemma states that for and for every with being the space of bounded linear operators on the Banach space and being introduced in equation (LABEL:A2:40). Consequently, from [Bosq, 2000, Theorem 6.1], constructed in (1) from the O.U. process, defines the unique stationary solution to equation (2), in the Banach space admitting a MAB() representation.
Lemma 4
Let introduced in (LABEL:A2:40), defined on Then, for with
Proof.
From
[TABLE]
for each and we have
[TABLE]
We now check the strong consistency of the MLE of in From equation (16),
[TABLE]
From Lemma 2, for sufficiently large, we then have
[TABLE]
Theorem 1 then leads to the desired result on strong consistency of the estimator of in Furthermore, from Theorem 2 , in a similar way to Remark 3, the –consistency of in also follows from equations (13) and (17).
Similarly to Corollary 1, in the following result, the consistency, in the Banach space of the plug–in predictor (12) is obtained.
Corollary 2
The ARB(1) plug–in predictor (12) of a zero–mean O.U. process is consistent in ; i.e., as
[TABLE]
Proof. From Lemma 2, for sufficiently large, and for each
[TABLE]
As derived in the proof of Corollary 1, from Theorem 2, the random sequence is such that
[TABLE]
Moreover, is such that Lemma 3 then leads, as to the desired convergence result from equation (18):
[TABLE]
3 Simulations
In this section, a simulation study is undertaken to illustrate the asymptotic results presented in this paper about the MLE of and the consistency of the ML functional parameter estimators of the autocorrelation operator, and the associated plug–in predictors, in the ARH(1) and ARB(1) frameworks.
Estimation of the scale parameter
On the simulation of the sample–paths of an O.U. process, an extension of the Euler’s method, the so–called Euler–Murayama’s method (see Kloeden and Platen [1992]) is applied, from the Langevin stochastic differential equation satisfied by the O.U. process
[TABLE]
Thus, let be a partition of the real interval Then, (19) can be discretized as
[TABLE]
where are i.i.d. Wiener increments; i.e.,
[TABLE]
In the following, we take as discretization step size, considering simulations of the O.U. process. In particular, Figure 1 shows some realizations of the discrete version of the solution to (19) generated from (20).
Let us first illustrate the asymptotic normal distribution of ; i.e., for sufficiently large, we can consider (see Theorem 3 in the Supplementary Material 5). From equation (7), we take
[TABLE]
(see also Supplementary material 5), to compute the following approximation of the MLE of for each one of the simulations performed, and for each one of the six values of parameter considered:
[TABLE]
where represents the –th discrete generation of the O.U. process, evaluated at time with covariance scale parameter for
[TABLE]
Table 1 displays the empirical probabilities of the error to be within the band from discrete simulations of the O.U. process, considering different sample sizes . Figure 2 displays the cases (at the top) and (at the bottom). It can be observed that, for each one of the sample sizes considered, , approximately a 99% of the realizations of lie within the band which supports the asymptotic Gaussian distribution.
Regarding asymptotic efficiency stated in Theorem 2, from simulations of the O.U. process over the interval for the corresponding empirical mean square errors
[TABLE]
are displayed in Figure 3. Here, , with represent the respective approximated values (21) of the MLE of computed from It can be observed, from the results displayed in Figure 3, that Theorem 2 holds for sufficiently large.
Consistency of in and
The strong–consistency of in is derived in Proposition 1 from the following almost surely upper bound
[TABLE]
Here, from simulations of the O.U. process on the interval with sample sizes the corresponding values of are computed, considering the cases Table 2 shows the empirical probability of to lie within the band for each one of sample sizes and cases regarded. It can be observed that for the sample sizes studied, in the case of the empirical probabilities are equal to one. Thus, the almost surely convergence to zero of the upper bound (22) holds, with an approximated convergence rate of Note that, for the other two cases, and the empirical probabilities are also very close to one (see also Table 1 for smaller sample sizes, where we can also observe the empirical probabilities very close to one for the same band). In particular, Figure 4 displays the cases (at the top) and (at the bottom).
It can be observed from Table 2 that a better performance is obtained for the largest values of which corresponds to the weakest dependent case. Furthermore, from the upper bound in (17), the strong consistency of in with, as before, is also illustrated from the results displayed in Table 2 and Figure 4.
Consistency of the ARH(1) and ARB(1) plug–in predictors for the O.U. process
Let us now consider the derived upper bounds in (15) and (18) in Corollaries 1–2, for the ARH(1) and ARB(1) predictors, respectively. From the generation of discrete realizations of an O.U. process over the interval for the upper bounds (15) and (18) are evaluated, for the cases The following empirical probabilities for are reflected in Table 3
[TABLE]
with , and , for the Hilbert–valued and Banach–valued (see (15) and (18)) frameworks (see also Figure 5). It can be observed that the empirical probabilities are equal to one in both frameworks for the largest sample sizes, in any of the cases considered.
The strong–consistency of the MLE of and of the autocorrelation operator of the O.U. process, in Banach and Hilbert spaces, has been first illustrated. The almost surely rate of convergence to zero is shown as well. The numerical results on the consistency of the associated ARH(1) and ARB(1) plug–in predictors then follow, from the computation of the corresponding empirical probabilities for the derived upper bounds. Note that the numerical results displayed in Appendix 3 are obtained under generation of sample sizes ranging from up to a million of time instants, considering repetitions for each one of such sample sizes. In all these simulations performed, the discretization step size considered has been
4 Final comments
The problem of functional prediction of the O.U. process could be of interest in several applied fields. For example, in finance, in the context of the Vasicek’s model (see Vasicek [1977]) the results derived allow to predict the curve representing the interest rate over a temporal interval, in a consistent way. Note that, in this context, the MLE computed for parameter provides a consistent approximation of the speed reversion, which definitely determines the proposed functional predictor of the interest rate.
Summarizing, this paper addresses the problem of functional prediction of the O.U. process from ARH(1) and ARB(1) perspectives. Specifically, considering the O.U. process as an ARH(1) and an ARB(1) process, new results on strong consistency (almost surely convergence to the true parameter value), in the spaces and of the MLE of its autocorrelation operator are derived. Consistency results (convergence in probability to the true value) of the associated plug–in predictors are obtained as well. The numerical results shown, in addition, the normality and the asymptotic efficiency of the MLE of the scale parameter of the covariance function of the O.U. process.
5 Supplementary Material
The definition and properties of an O.U. process are given here, as well as the proof of Lemma 1.
Ornstein–Uhlenbeck process
Let be a real–valued sample–path continuous stochastic process defined on the basic probability space with index set the real line As demonstrated in Doob [1942], process is an O.U. process if it provides the Gaussian solution to the following stochastic linear Langevin differential equation:
[TABLE]
where is a standard bilateral Wiener process; i.e.,
[TABLE]
with and being independent standard Wiener processes, and and respectively denoting the indicator functions over the positive and negative real line. Applying, in equation (25), the method of separation of variables, considering we obtain
[TABLE]
where the integral is understood in the Itô sense (see Ash and Gardner [1975], Sobczyk [1991] for more details). Particularizing to , the O.U. process is transformed into
[TABLE]
It is well–known that the solution to the stochastic differential equation
[TABLE]
has marginal probability density function satisfying the following Fokker–Planck’s scalar equation (see, for example, Kadanoff [2000]):
[TABLE]
In the case of O.U. process, the stationary solution (), under , adopts the form
[TABLE]
which corresponds to the probability density function of a Gaussian distribution with mean and variance i.e., which corresponds to the probability density function of a random variable such that
[TABLE]
From (26), the mean and covariance functions of O.U. process (see, for instance, Doob [1942], Uhlenbeck and Ornstein [1930]) can be computed as follows:
[TABLE]
where denotes the covariance between random variables and . Additionally, from (27), we obtain the following identities:
[TABLE]
where is a constant. In the subsequent development, we will consider and .
Maximum likelihood estimation of the covariance scale parameter
The MLE of in (28) is given by (see Graczyk and Jakubowski [2006]; [Kutoyants, 2004, p. 63]; [Liptser and Shiraev, 2001, p. 265])
[TABLE]
Thus, equation (29) becomes
[TABLE]
We will assume that is large enough such that almost surely. It is well–known that the MLE of is strongly consistent (see details in [Kleptsyna and Breton, 2002, Proposition 2.2]; [Kutoyants, 2004, p. 63 and p. 117]).
Theorem 3
The following limit in distribution sense holds for the MLE of given in equation (30):
[TABLE]
Results in [Jiang, 2012, Theorem 1.1 and Corollary 1.1] lead to the following almost surely identities (see also [Bosq, 2000, Theorem 2.10];[Ledoux and Talagrand, 2011, pp. 196–203], in relation to the law of the iterated logarithm)
[TABLE]
Preliminary inequalities and results
In this section we recall some inequalities and well–known convergence results on random variables, as well as basic deterministic inequalities, that have been applied in the derivation of the main results displayed above.
Lemma 5
Let be a zero–mean normal distributed random variable, i.e., with . Then,
[TABLE]
Proof. Let be such that Then,
[TABLE]
Let us set
[TABLE]
Function is monotone increasing over and is monotone decreasing over
From equations (31)–(LABEL:A2:Supp:32),
[TABLE]
Now, consider with then,
[TABLE]
5.3.1 Proof of Lemma 1
Proof.
Let us first consider the case from
[TABLE]
and
[TABLE]
we have
[TABLE]
Furthermore,
[TABLE]
Additionally, the function given by
[TABLE]
with denoting the indicator function of set belongs to since
[TABLE]
Thus, by definition of
[TABLE]
[TABLE]
We are now going to compute for Since, for all
[TABLE]
we obtain
[TABLE]
Considering function defined in equation (35), applying similar arguments to those given in the computation of we have
[TABLE]
Now, from equation (37),
[TABLE]
Furthermore, for ,
[TABLE]
since is a monotonically decreasing function on with if and , when . Hence, if ,
[TABLE]
which implies that , when .
5.3.2 Proof of Lemma 2
Proof. Let us first assume that .
From the Mean Value Theorem applied over , there exists such that
[TABLE]
Taking and , we get the following inequalities:
[TABLE]
Similar inequalities are obtained for the case by applying the Mean Value Theorem over the interval , instead of .
5.3.3 Proof of Lemma 3
Proof. Considering the indicator function it holds
[TABLE]
where is a sequence of positive numbers such that the event is equivalent to . From (38) and Lemma 5, if we take for all we get, for each ,
[TABLE]
For , with ,
[TABLE]
which implies that
[TABLE]
in equation (39).
On the other hand, since for every
[TABLE]
Thus, .
Acknowledgments
This work has been supported in part by projects MTM2012-32674 and MTM2015–71839–P (co-funded by Feder funds), of the DGI, MINECO, Spain.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Antoniadis et al. [2006] \NAT@biblabelnum Antoniadis et al. 2006 Antoniadis , A. ; Paparoditis , E. ; Sapatinas , T.: A functional wavelet-kernel approach for time series prediction. J. R. Stat. Soc. Ser. B. Stat. Methodol. 68 (2006), pp. 837–857. – DOI: doi.org/10.1111/j.1467-9868.2006.00569.x
- 2Antoniadis and Sapatinas [2003] \NAT@biblabelnum Antoniadis and Sapatinas 2003 Antoniadis , A. ; Sapatinas , T.: Wavelet methods for continuous-time prediction using Hilbert-valued autoregressive processes. J. Multivariate Anal. 87 (2003), pp. 133–158. – DOI: doi.org/10.1016/S 0047-259X(03)00028-9
- 3Ash and Gardner [1975] \NAT@biblabelnum Ash and Gardner 1975 Ash , R. B. ; Gardner , M. F.: Topics in stochastic processes. Bull. Amer. Math. Soc. 82 (1975), pp. 817–820. – URL https://www.sciencedirect.com/science/book/9780120652709
- 4Bensmain and Mourid [2001] \NAT@biblabelnum Bensmain and Mourid 2001 Bensmain , N. ; Mourid , T.: Estimateur ”sieve” de l’opérateur d’un processus ARH(1). C. R. Acad. Sci. Paris Sér. I Math. 332 (2001), pp. 1015–1018. – DOI: doi.org/10.1016/S 0764-4442(01)01954-1
- 5Bosq [1991] \NAT@biblabelnum Bosq 1991 Bosq , D.: Modelization, non-parametric estimation and prediction for continuous time processes. Nonparametric functional estimation and related topics, NATO, ASI Series 335 (1991), pp. 509–529
- 6Bosq [1996] \NAT@biblabelnum Bosq 1996 Bosq , D.: Limit theorems for Banach-valued autoregressive processes. Applications to real continuous time processes. Bull. Belg. Math. Soc. Simon Stevin 3 (1996), pp. 537–555. – URL https://projecteuclid.org/download/pdf_1/euclid.bbms/1105652783
- 7Bosq [2000] \NAT@biblabelnum Bosq 2000 Bosq , D.: Linear Processes in Function Spaces . Springer, New York, 2000. – ISBN 9781461211549
- 8Bosq [2002] \NAT@biblabelnum Bosq 2002 Bosq , D.: Estimation of mean and covariance operator of autoregressive processes in Banach spaces. Stat. Inference Stoch. Process. 5 (2002), pp. 287–306. – DOI: doi.org/10.1023/A:1021279131053
