Hurst index estimation in stochastic differential equations driven by fractional Brownian motion
Jan Gairing, Peter Imkeller, Radomyra Shevchenko, Ciprian A. Tudor

TL;DR
This paper develops new estimators for the Hurst index in stochastic differential equations driven by fractional Brownian motion, utilizing Malliavin calculus to analyze quadratic variations and their asymptotic behavior.
Contribution
It introduces a novel approach to estimate the Hurst index in SDEs driven by fractional Brownian motion using higher order increments and Malliavin calculus techniques.
Findings
Consistent estimators for the Hurst index are constructed.
Asymptotic properties of the estimators are analyzed.
Simulation results demonstrate estimator effectiveness.
Abstract
We consider the problem of Hurst index estimation for solutions of stochastic differential equations driven by an additive fractional Brownian motion. Using techniques of the Malliavin calculus, we analyze the asymptotic behavior of the quadratic variations of the solution, defined via higher order increments. Then we apply our results to construct and study estimators for the Hurst index.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic processes and financial applications · Financial Risk and Volatility Modeling · Nonlinear Differential Equations Analysis
Hurst index estimation in stochastic differential equations driven by fractional Brownian motion
Jan Gairing
*Mathematisches Institut
Ludwig-Maximilians-Universität München
Peter Imkeller
*Institut für Mathematik
Humboldt-Universität zu Berlin
Radomyra Shevchenko
*Fakultät für Mathematik
Technische Universität Dortmund
Ciprian Tudor
*Département de Mathématiques
Université de Lille 1 *
Abstract
We consider the problem of Hurst index estimation for solutions of stochastic differential equations driven by an additive fractional Brownian motion. Using techniques of the Malliavin calculus, we analyze the asymptotic behavior of the quadratic variations of the solution, defined via higher order increments. Then we apply our results to construct and study estimators for the Hurst index.
2010 AMS Classification Numbers: 60G15, 60H05, 60G18.
Key Words and Phrases: Hurst index estimation; Stochastic differential equation; fractional Brownian motion; Quadratic variation; Malliavin calculus; Central limit theorem.
1 Introduction
Fractional Brownian motion (fBm) is a widely used generalization of the standard Brownian motion that incorporates long-range dependence while preserving the self-similarity and Gaussianity. These features make it a popular stochastic processes in the mathematical modelling of various complex systems from financial applications to surface growth (e.g. [2, 1]). The need to detect and analyze such systems has led to the development of many, now classical, statistical instruments, including wavelet analysis, R/S estimators, generalized variations etc. and we refer to the monographs [3, 9, 18] for further reference.
Fractional Brownian motion is homogeneous in space. Many physical systems, however, are subject to an external force and are confined to certain locations with high probability. It may therefore be appropriate to consider stochastic differential equations (SDE) as a generalization of the fBm model. fBm then takes the role of a random forcing which heavily influences the underlying dynamics. Thus the identification of this random forcing is of particular interest.
In our work, we will consider an SDE with additive fractional Brownian noise, i.e.
[TABLE]
where is an interval in , is the initial state and is a measurable deterministic function satisfying suitable assumptions. Stochastic models as (1) appear in mathematical finance (for example, in [12] the log-volatility is assumed to satisfy an SDE of the form (1)). Other potential applications come from climatology, stochastic processes as in (1) (with Gaussian or non-Gaussian noise) are utilized as models for problems related to the earth’s energy balance, see [10, 8].
In fBm-related models particular interest lies in the estimation of the Hurst index (see Section 2 below) since it constitutes the characteristic parameter of the driving forcing.
The purpose of our endeavour is to estimate the Hurst index in (1) based on the discrete observations of the process . Here we will deploy a well-known method, based on the quadratic variations of . This method is known to work well for self-similar processes (see e.g. [18] and references therein).
We will show that it can be also applied to (1), although is not a self-similar process. The method exploits the fact that the absolutely continuous integral component of (1) does not affect the roughness captured by the quadratic variation of .
We will define the sequence (see equation (7) below) of the so-called quadratic -variations, defined in terms of higher order increments of over a filter . We give the limit behavior in distribution of this sequence making use of the Malliavin calculus to handle correlations and combine it with already known results concerning the variations of the fBm. Interestingly, in this setting the case is easier to handle than the case of . This is due to the fact that, when studying the limit of the sequence (7), one needs to take into account the correlations between the increments of the fBm, but also the correlations between the increments of the fBm and the increments of the Lebesque integral in (1). If these joint correlations are always dominated by those of the fBm, which is not the case for , when a supplementary assumption is needed on the mesh in (7).
By a standard procedure, we then construct a quadratic variation estimator for the Hurst index of the model (1). We prove its consistency and its asymptotic normality, using the limit behavior of the quadratic -variations.
Our work is structured as follows. In Section 2 we introduce the objects of study, fBm, filters and the corresponding quadratic -variations of a process. We also quote the underlying results on the behavior of the -variations of fBm. Section 3 studies the properties and the -variations of the solution to (1), via the techniques of the Malliavin calculus. Section 4 is devoted to the parameter estimation of the Hurst index from discrete observations of . In Section 5 we present simulations and discuss statistical properties of the derived estimators. In the Appendix we give a short review of the definitions and results from Malliavin calculus relevant to this work.
2 Preliminaries: Fractional Brownian motion and its variations
A fractional Brownian motion with Hurst index is a centered Gaussian process on the interval (in this work we consider or ) with covariance function
[TABLE]
From this definition it is apparent that for we have that and the fBm with is the standard Brownian motion.
For a natural number let us consider a -tuple of real numbers with zero sum, i.e.
[TABLE]
Such a -tuple will be called a filter of length . The order of a filter , denoted by , is defined to be the order of the first non-zero moment of the -tuple , i.e.
[TABLE]
We will frequently need the partial sum of the components of and use the notation
[TABLE]
Since by definition a filter has zero sum, clearly for any filter we have . For instance, is a filter of order 1 and of length 2, while is a filter of order 2 and of length 3.
For a process , a filter of length and a fixed mesh size we consider the sum
[TABLE]
with such that . For example, if , then while if then .
We will refer to as the increment of the process at time over the filter . Note that due to the zero sum condition on we can rewrite in the following way:
[TABLE]
We will refer to this form as differences representation. The correlation of increments of the fBm over a filter plays an important role in our calculations. From (2) we can see that for a zero sum vector and such that , are well defined one has
[TABLE]
For a filter and a fractional Brownian motion we define its (normalized) quadratic -variation in the following way:
[TABLE]
where with is the number of discrete observations on a grid of mesh size (that may depend on ) and
[TABLE]
In what follows, we will assume that for some whenever the quadratic -variation is considered. Observe that the choice and would yield the formula for the usual normalized quadratic variation of the fBm (see e.g. [18]).
Let us recall the main result concerning the asymptotic behavior of the quadratic -variation of fBm (see [6] and [11]). It states that the sequence (7) converges to zero almost surely for any filter and if then it satisfies a central limit theorem.
Theorem 1**.**
Let be a fractional Brownian motion with Hurst index and let be a filter of order and of length with . Then
* as * 2. 2.
If , then
[TABLE]
where is an explicit constant.
In the construction of the estimators in Section 4 we need a multidimensional version of this statement which can also be found in [11] and [6]:
Theorem 2**.**
Let be a fractional Brownian motion with Hurst index . For let be a filter of order . Then
[TABLE]
*where is a positive definite matrix. *
Notice that, in the case of a filter of order 1, we have the well-known restriction for the CLT of the sequence to hold. This restriction can be lifted by taking a filter of order greater than one, i.e. by considering higher-order increments of the fBm.
3 Stochastic differential equations driven by the fractional Brownian motion
This section introduces the main object of our study: stochastic differential equations driven by the fractional Brownian motion. We will then study how the central limit theorems of the previous section carry over to the -variation of the solution to the SDE.
3.1 Stochastic differential equations with fBm
For we consider the SDE
[TABLE]
where is a fractional Brownian motion with Hurst index and is a real number. We assume that (i.e., continuous in the time component and continuously differentiable in the space component as well as bounded, together with his partial derivative with respect to the second variable, in both components) with
[TABLE]
for a fixed constant . We denote by the derivative of with respect to its second variable while stands for the infinity norm.
In the course of the paper we will be interested in two cases:
- (S1)
: the SDE (9) with ,
- (S2)
: the SDE (9) with for .
The existence of pathwise solutions to (9) in both cases can be seen by considering the process and thus reducing (S) to an ordinary differential equation. We refer to [15], [4] or [13], among others, for the existence and uniqueness of the solution under assumption (10).
Clearly, solutions to (S2) also solve (S1), if restricted to the unit interval. Therefore statements made for (S2) also apply to (S1). We make this distinction mainly for notational reasons: in (S1) we consider only (where ), such that we do not leave the unit interval as the number of observations () grows. In (S2) is allowed to be less than one, and the observation window grows with .
We further denote by the absolutely continuous component of the equation (9), i.e.
[TABLE]
If is the solution to the SDE (9) (in either the setting (S1) or (S2)) and is a filter of length we define the quadratic -variation of to be
[TABLE]
where the number of observations satisfies and the mesh size is chosen to satisfy for some . Here is again the variance of the increment of along the filter .
3.2 Central limit theorems for the quadratic -variation of SDE
This section comprises the core results of this work, combining the limit results of the fBm from Section 2 with estimates for the solution of SDE obtained via Malliavin calculus. We obtain central limit theorems for the SDE (9) providing the basis for the estimation of to be discussed in Section 4. The results are presented in the Theorems 3 and 4 where the cases and are treated separately. The proof of Theorem 3 consists of direct estimates and does not require any assumptions on the order of the filter .
If these estimates are not sufficient and we rely on the Malliavin Calculus to provide a finer analysis of the correlation. The actual proof of Theorem 4 is postponed to Section 3.4 after some auxiliary results.
First we will consider the case . Notice that in this case we have for any filter .
Theorem 3**.**
Assume , let be a solution of the SDE (S2) and let a be a filter with components. For (with ) we have
[TABLE]
with from Theorem 1.
Proof.
By a binomial expansion we can write:
[TABLE]
with
[TABLE]
with some constants , . Due to Slutsky’s lemma and part (b) of Theorem 1 it is enough to show that for . Using the differences representation (5) we obtain with given by (4)
[TABLE]
since is uniformly bounded by (10). The notation means that there exists a strictly positive constant such that . Thus, we deduce for :
[TABLE]
Moreover, since for , is normally distributed, can be calculated directly:
[TABLE]
since by (8), Hence, we have
[TABLE]
as well as
[TABLE]
both of which converge to zero because of our assumption on . ∎
Remark 1**.**
*Note that for one can choose , thus considering the usual equidistant partition of the interval . But it is also possible to chose a mesh less that .
In the second theorem the case H\in\big{[}\frac{1}{2},\,1\big{)} is being considered. For this case we will assume , which means that the observations do not leave the interval . For the bounds on the correlation terms in the above proof fail to converge for , therefore, we need more elaborate techniques to handle the remainder in Section 3.4. The assumption is purely technical: it permits the use of Lemma 1 in Section 3.3, the core estimate in our Malliavin calculus approach.
Theorem 4**.**
Let be a fractional Brownian motion with Hurst index H\in\big{[}\frac{1}{2},\,1\big{)} and a filter satisfying . If the process solves (S1) and if , (with ), we have
[TABLE]
where is the constant from Theorem 1.
Remark 2**.**
All in all, for we have the convergence conditions and , both of which are satisfied for . When , we need to chose .
Remark 3**.**
The proofs of the Theorems 3 and 4 were based upon the demonstration that the differences converge to zero in . This implies that both theorems can be generalised to their multivariate versions, again by means of Slutsky’s lemma and Theorem 2.
3.3 Auxiliary results from Malliavin calculus
We postpone to the Appendix the technical definitions and results from Malliavin calculus. We refer to [4] for the fact that the solution of (S1) belongs to the Sobolev-space , implying that the process defined above is also Malliavin differentiable (compare also the proofs of Lemma 5.1 and Lemma 5.3 in [13] for a generalisation with respect to the time dependence of the drift). Moreover, the Malliavin derivative of (with respect to the fBm ) satisfies for and
[TABLE]
for . This linear equation has the explicit solution
[TABLE]
and similarly to the equation considered in [4] this provides the bound
[TABLE]
almost everywhere for , where is the bound on defined above in (10).
Moreover, we need the following technical result, which analyzes the correlation between the increments of the processes and . Below, is the canonical Hilbert space of the fBm (see Section 6).
Lemma 1**.**
Let and be defined by (11) for the SDE (S1). Then
for every with , , we have
[TABLE] 2. 2.
for every with , , .
[TABLE]
Proof.
First note that for
[TABLE]
This can be seen by approximating the integrals using the fact that the integrands are bounded. Since the derivative operator is closed, it carries over to the limit. The chain rule yields
[TABLE]
which results in the bound
[TABLE]
Now we will consider the case , in which is the standard Brownian motion. Since the space coincides with the space , we have
[TABLE]
applying in the last step the above bound (13) on . Part (a) is proven for . Part (b) is obtained similarly by the same arguments and bounds.
For the case we first calculate (recall that the norm in is defined in (21))
[TABLE]
consequently, , which enables us to use the scalar product representation given in (22). We get for the case (a), by (13)
[TABLE]
Moreover, it holds that
[TABLE]
since for and
[TABLE]
This proves part (a) of the lemma for . The proof of part (b) follows analogously, since
[TABLE]
∎
3.4 Proof of Theorem 4
The correlation estimates of Lemma 1 allow us to prove the theorem.
Proof.
For both cases we consider the sum defined as in the proof of Theorem 3. For the summand we get similarly to the previous theorem by the boundedness of
[TABLE]
which converges to zero for . We rewrite with the help of the differences representation (5). Then
[TABLE]
ehere denotes the Skorohod integral. This decomposition follows directly from the integration-by-parts formula (23) applied to for
[TABLE]
Notice that (see (5)) . We estimate the first summand in (14). We can write
[TABLE]
for any , where the last inequality follows from Lemma 1, point 1. Hence we obtain
[TABLE]
which goes to zero if . Let us turn to the first component and show that it vanishes in (implying convergence in probability).
[TABLE]
which follows from a direct application of (24) . An expansion of the differences representation (5) for and the definition of gives
[TABLE]
With the use of Lemma 1 applied to each expectation this is easily dominated by
[TABLE]
Analogously we obtain
[TABLE]
where we used the fact that the expectations are again bounded by .
Observe that for fixed , , , we will make the sum in (16) larger if we estimate
[TABLE]
where .
Now the inner product satisfies
[TABLE]
We note that for the indicator functions are orthogonal. On the diagonal we have for
[TABLE]
For also the off-diagonal elements contribute. For large we have since it uniformly approximates the second derivative of the function at . Indeed, a Taylor expansion shows that for
[TABLE]
We can now rearrange the off-diagonal part of the sum in (18) and use this approximation:
[TABLE]
Recall now that for the sum with and the following asymptotic equality stems from the Euler–Maclaurin formula.
[TABLE]
where is the Riemann zeta function. Hence,
[TABLE]
As a final step we verify that correctly scaled we have for the second summand in (14)
[TABLE]
Substituting gives us the condition as above for the vanishing of the first term and the aditional condition for the second. This bound combined with (15) and (14) will imply the conclusion. ∎
4 Estimation of the Hurst parameter
Here we will construct estimators for the Hurst parameter of the Brownian motion driving the SDE (S2) and derive their properties, using the previous results. In particular, we will transfer the almost sure convergence result from Theorem 1 to the quadratic -variation of the solution of the SDE (S2) and apply the central limit theorems from the previous section.
We shall fix in the sequel a filter is a filter of order .
For a stochastic process we define
[TABLE]
with such that and for some . Then
[TABLE]
holds, and we deduce from Theorem 1 that
[TABLE]
The following theorem shows the same result for the solution of (S1).
Theorem 5**.**
Let be a fractional Brownian motion with Hurst parameter and let be the solution of the SDE (S2) driven by the fBm . Then
[TABLE]
Proof.
We have
[TABLE]
so it is enough to show that the last two summands divided by () converge to zero almost surely. Recall that we have from (12) . It follows that
[TABLE]
which goes to [math] for every as tends to infinity.
To estimate we refer to [14] for the fact that almost all sample paths of a fractional Brownian motion with the Hurst index are Hölder continuous of order for any (it follows from the Kolmogorov’s continuity theorem, since is self-similar). We get for almost every trajectory, using the differences representation (5):
[TABLE]
Note that the constant can vary depending on the trajectory. Then we get
[TABLE]
We can ensure that it converges to zero by setting , and the claim follows. ∎
Note that for the special case and we can deduce directly from this result that the ”standard” estimator given by
[TABLE]
converges almost surely to .
Remark 4**.**
Since the relationship between and is the same as for the process , it follows from Theorem 5 that .
Now we are in a position to construct an estimator in the way it was done in [11] (see also [6]) for processes with stationary increments. We know from Theorem 5 that asymptotically equals , and a simple rearrangement argument gives us
[TABLE]
Let us consider for a family of filters . For each member the associated variance is a linear combination of the , , where , the numbers being not necessarily different. So if we consider a matrix defined by
[TABLE]
then tends almost surely to , where denotes the vector .
We choose the such that is of full rank and estimate by linear regression (preserving the almost sure convergence property): . Now we can consider , which tends to , and estimate by another simple linear regression:
[TABLE]
If we know the observations to be equidistant but do not have access to the actual mesh size , we can estimate in the same way (because does not directly depend on ) and then perform a regression with the intercept term . This allows the estimation
[TABLE]
To establish some properties of the estimators and , we first recall that, as a consequence of Theorem 5, we have the convergence result and therefore
[TABLE]
Then, evoking the continuous mapping theorem, we obtain , . In other words, both estimators are strongly consistent.
The question of asymptotic normality depends upon whether Theorems 3 and 4 are applicable. Choosing filters of order 2 one can ensure that the filter condition is always satisfied, but other assumptions require more care. In particular, if there is no prior knowledge about the true value of , one has to assume that the mesh size condition is satisfied. If a certain bound for is known, say, if , then under the assumption that (which includes the usual partition of ) we obtain asymptotic normality. The following theorem summarizes our observations.
Theorem 6**.**
Let be a solution of the SDE (S2) and let the mesh size be chosen such that the assumptions of Theorems 3 and 4 are satisfied. Moreover, let be filters of respective lengths (for ) such that . Then the sequences and converge weakly to normally distributed, centered random variables.
Proof.
This is an application of the delta method (mentioned, for example, in [7]) for the vector . Its almost sure convergence was shown in Remark 4 and the multivariate convergence in law of follows due to Remark 3. Since for the construction of , it underwent only linear and logarithmic transformations, the obtained estimators are indeed asymptotically normal. ∎
5 Simulation study
Similarly to [11] and [6] two filters and are considered, where is obtained by ”thinning” the filter (i.e., for and zero otherwise). In this case the estimator simplifies to
[TABLE]
Note that this estimator is independent of the scaling: its strong consistency has been shown for all , hence, it is enough to know that the observations are equidistant, and the mesh size can then be chosen appropriately. It is, moreover, by construction independent of deterministic multiplicative scaling factors of the fBm involved, which allows us to include the (slightly more general) case
[TABLE]
with some unknown and observed over an unknown interval in our simulation study. From these observations different settings for the simulations arise.
We simulate trajectories of a process defined by
[TABLE]
Some trajectories are displayed in Figure 1 for different values of . We calculate the MSE for , , and observations in the following settings:
- (S1)
on an interval with the ”standard” estimator , , 2. (S2)
on an interval with for , , 3. (S3)
on an interval with for , , 4. (S4)
on an interval with for , , 5. (S5)
on an interval with the ”standard” estimator , , 6. (S6)
on an interval with for , , 7. (S7)
on an interval with for , .
We obtain the following results:
[TABLE]
.
Indeed, one can see that the estimators perform well for a broad range of conditions. For , in case of known and on the unit interval the simplest estimator seems to perform best. However, for large its convergence becomes slower compared to that of for the order filter. It is a known result (see e.g. [5]) that for the filter the asymptotic rates of convergence for the quadratic variation of an fBm are in the case and for . This constraint seems to transfer to the process . Figures 1 and 2 provide a visualisation for the difference in the convergence rates.
6 Appendix: Definitions and standard results from Malliavin calculus
In the following we will talk about the main definitions of Malliavin calculus for the fractional Brownian motion case, following mainly [14]. Throughout this section we will consider .
For a real separable Hilbert space we call a stochastic process in a complete probability space a Gaussian process on if is a centred Gaussian family of random variables such that for all
[TABLE]
For a fractional Brownian motion with Hurst parameter let be the closure of the set of indicator functions with respect to the inner product
[TABLE]
Then is via by definition a Gaussian process on , and this is how the notation is used in this work.
For the space does not only contain functions (see [17]), so we consider the following definition:
[TABLE]
It has been shown in [16] that is a subspace of , which yields a sufficient condition for functions to belong to the space .
We have a useful representation of the inner product of two functions from the space . That is, if , then for (see e.g. [14])
[TABLE]
For (that is, if is the usual Brownian motion) the space is identical with the space endowed with the usual inner product.
For the Gaussian process on let denote the set of smooth random variables of the form
[TABLE]
where denotes the space of infinitely continuously differentiable functions such that and all its partial derivatives grow at most polynomially.
Then we can define the Malliavin derivative as an operator which acts on functions by
[TABLE]
which means that is a random variable with values in . It does not depend on the choice of the function and of .
For the subset of , whose functions have compact support, we will write .
is clearly linear and it holds: .
Moreover, is closable (this has been shown in [14]) and we will denote by the closure of with respect to the norm
[TABLE]
For a fixed we define the operator on the set by
[TABLE]
Since Malliavin derivatives are -valued random variables, one can identify them with stochastic processes if those values in are functions. In this case we will write for .
We have a chain rule for Malliavin derivatives, which is stated in [14]: Let be a function with for some () and let . Then and
[TABLE]
Let denote the set
[TABLE]
Then for we can define and consider the norm
[TABLE]
Now we can, just as for the space , consider the closure of with respect to this norm and call it .
The divergence operator, denoted by , is the adjoint of the derivative operator . Its domain is the set of random variables such that
[TABLE]
for all , where is a constant.
For satisfies the following characterising equation
[TABLE]
for all .
A result from [14] provides a more direct characterisation for a certain elementary subset of : let be an element of . Then belongs to and we have
[TABLE]
We need the following identity for the expectation of , where and belong to the domain of and have a certain specific form. In [14] a statement for more general and is given, however, in the present work we only need this special case: for , the variables , belong to the space and
[TABLE]
Acknowledgement: The financial support of the Collaborative Research Center ’Statistical modeling of non-linear dynamic processes’ (SFB 823) is gratefully acknowledged.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. Barabasi and H. Stanley (1995): Fractal Concepts in Surface Growth. Cambridge University Press.
- 2[2] E. Bayraktar, V. Poor and R. Sricar (2004): Estimating the fractional dimension of the SP 500 index using wavelet analysis. Int. J. Theor. Appl. Finance, 7 (5), 615-643.
- 3[3] J. Beran (1995): Statistics of Long-Memory Processes. Chapman & Hall, London.
- 4[4] M. Besalú, A. Kohatsu-Higa and S. Tindel (2016): Gaussian type lower bounds for the density of solutions of SD Es driven by fractional Brownian motion. Ann. Probab., 44 (1), 399-443.
- 5[5] P. Breuer and P. Major (1983): Central limit theorems for nonlinear functionals of Gaussian fields. J. Multivariate Analysis, 13 , 425-441.
- 6[6] J.F. Coeurjolly (2001): Estimating the parameters of a fractional Brownian motion by discrete variations of its sample paths. Statistical Inference for Stochastic Processes, 4 , 199-227.
- 7[7] D. Dacunha-Castelle and M. Duflo (1986): Probability and Statistics. Volume II. Springer, New York (1986).
- 8[8] P. D. Ditlevsen (1999): Observation of α 𝛼 \alpha -stable noise induced millenial climate changes from an ice record. Geophysical Research Letters, 26 (10), 1441-1444.
