Statistical inference for moving-average L\'evy-driven processes: Fourier-based approach
Denis Belomestny, Tatiana Orlova, and Vladimir Panov

TL;DR
This paper introduces a Fourier-based semiparametric estimation method for moving-average Lévy-driven processes, establishing optimal convergence rates and advancing statistical inference in continuous-time stochastic models.
Contribution
It presents a novel Fourier-based estimation approach for Lévy-driven processes with proven optimal convergence rates.
Findings
Estimation method achieves minimax optimal convergence rates.
Method effectively handles continuous-time moving-average Lévy processes.
Provides theoretical guarantees for the proposed estimators.
Abstract
We consider a new method of the semiparametric statistical estimation for the continuous-time moving average L\'evy processes. We derive the convergence rates of the proposed estimators, and show that these rates are optimal in the minimax sense.
|
|
|
|
|
|
|
|
|
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic processes and financial applications · Statistical Methods and Inference · Financial Risk and Volatility Modeling
Statistical inference for moving-average Lévy-driven processes: Fourier-based approach
Denis Belomestny1,2, Tatiana Orlova1, and Vladimir Panov1
1 Laboratory of Stochastic Analysis and its Applications
National Research University Higher School of Economics
Shabolovka, 26, 119049 Moscow, Russia
and
2 University of Duisburg-Essen
Thea-Leymann-Str. 9, 45127 Essen, Germany
Abstract
We consider a new method of the semiparametric statistical estimation for the continuous-time moving average Lévy processes. We derive the convergence rates of the proposed estimators, and show that these rates are optimal in the minimax sense.
keywords:
moving average , Lévy processes , low-frequency estimation , Fourier methods
††journal: Statistics & Probability Letters
1 Introduction
Generally speaking, continuous-time Lévy-driven moving average processes are defined as
[TABLE]
where is a deterministic kernel and is a two-sided Lévy process with Lévy triplet . The conditions which guarantee that this integral is well-defined are given in the pioneering work by Rajput and Rosinski [6]. For instance, if , it is sufficient to assume that . Some popular choices for the kernel are with and , (Gamma-kernels, see e.g. Barndorff-Nielsen and Schmiegel [1]), or (well-balanced Ornstein-Uhlenbeck process, see Schnurr and Woerner [7]).
Recently, Belomestny, Panov and Woerner [3] consider the statistical estimation of the Lévy measure from the low-frequency observations of the process . The approach presented in [3] is rather general - in particular, it works well under various choices of . Nevertheless, this approach is based on the superposition of the Mellin and Fourier transforms of the Lévy measure, and therefore its practical implementation can meet some computational difficulties.
In this paper, we present another method, which essentially uses the fact that in some cases there exists a direct relation between the characteristic exponent of the process and the characteristic function of the process Therefore, the characteristic exponent can be estimated from the observations of the process and further application of the Fourier techniques from Belomestny and Reiss [2] and Panov [5] leads to the construction of a consistent estimator of the Lévy triplet.
The paper is organised as follows. In the next session, we provide the specifications of our model. In Section 3, we present the key mathematical idea, which lies in the core of the estimation procedure presented in Section 4. The upper and lower error bounds for the proposed estimates are given in Section 5. Joint consideration of the corresponding results, Theorems 1 and 2, yields the optimality of the estimates. Finally, in Section 6, we illustrate our approach with some numerical examples. All proofs are collected in Section 7.
2 Set-up
In this work, we consider the integrals of the form (1), where is a symmetric kernel of the form:
[TABLE]
for some As a limiting case for we get the exponential kernel Here, for simplicity, we restrict our attention to a particular class of two-sided Lévy processes with jumps represented by a compound Poisson process ,
[TABLE]
[TABLE]
where is a drift, , is a Brownian motion, are 2 Poisson processes with intensity , and are i.i.d. r.v’s with absolutely continuous distribution, and all ’s, , are jointly independent. Due to the Lévy-Khintchine formula, the characteristic exponent of is given by
[TABLE]
where is a Lévy measure of , and stands for the Fourier transform of
It is important to note that the process is strictly stationary with the characteristic function of the form
[TABLE]
and therefore for any time points the r.v.’s are identically distributed (but dependent). Our aim is to estimate the Lévy triplet based on the equidistant observations of the process at the time points where is fixed (low-frequency set-up).
3 Main idea
The key observation is that under our choice of the kernel function we can represent the characteristic exponent of the process via the characteristic function of the process . More precisely, since
[TABLE]
we have
[TABLE]
Therefore, we derive
[TABLE]
since as provided that , see Lemma 1. Therefore, the characteristic exponent can be directly estimated from data via a plug-in estimator based on the empirical characteristic function of .
Moreover, returning to the representation (5), we conclude that the Lévy triplet can be estimated from In fact, since is absolutely continuous with an absolutely integrable density, then by the Riemann-Lebesgue lemma (see [4], p. 43) as and consequently can be viewed, at least for large as a second order polynomial with the coefficients This observation gives rise for the estimation procedure, which we present in the next session.
4 Estimation procedure
Assume that the process is observed on the equidistant time grid where is fixed.
Step 1: estimation of . Define
[TABLE]
and set
[TABLE]
where the branch of the complex logarithm is taken in such a way that is continuous on with and being the first zero of In fact, since does not vanish on , we have
Step 2: estimation of and . Let and
[TABLE]
where is a continuous function, supported on the interval with some Consider now the optimisation problem
[TABLE]
which has the solution
[TABLE]
with
[TABLE]
Note that the weighting function satisfies the property , and moreover,
[TABLE]
Analogously,
[TABLE]
holds with satisfying the properties
[TABLE]
Step 3: estimation of . Finally, the parameter can be estimated by considering the optimisation problem
[TABLE]
which leads to the estimate
[TABLE]
where fulfills All functions , and are supported on and bounded.
Step 4: estimation of the Lévy density. Note that under our assumptions on the Lévy process (see Section 2), the Levy measure possesses a density, which we denote, with a slight abuse of notation, also by . This Lévy density can be estimated as a regularised inverse Fourier transform of the remainder:
[TABLE]
where is a weight function supported on Note that if
5 Error bounds
Theorem 1**.**
Consider the model (1), where is a kernel in the form (2) and is a Lévy process in the form (3) with triplet . Assume that the Lévy density is -times weakly differentiable for some , and moreover the Lévy triplet belongs to the class
[TABLE]
*with some Assume also that the weighting functions satisfy the conditions *
[TABLE]
[TABLE]
Then it holds
[TABLE]
provided with some constant depending on and
As shown in the next theorem, the above rates are optimal in minimax sense.
Theorem 2**.**
For any there exists some such that
[TABLE]
where the infimums are taken over all possible estimates of the parameters and supremums - over all triplets from the class
6 Numerical example
Consider the integral (1) with the kernel from the class (2), and the Lévy process defined by (3)-(4). For simulation study, we take and , and aim to estimate these parameters under different choices of the parameter , namely and .
Simulation. For , denote the jump times of by , corresponding to the jump sizes Note that
[TABLE]
where
[TABLE]
Typical trajectory of the process is presented on Figure 1.
Estimation. Following the ideas from Section 4, we estimate the parameters under different choices of
To show the convergence properties of the considered estimates, we provide simulations with different values of . The boxplots of the corresponding estimation errors (differences) based on 25 simulation runs are presented on Figures 2, 3 and 4. Note that the parameter is chosen by numerical optimisation. The exact values are presented in Tables 1 and 2.
The simulation study illustrates our theoretical results on the rates of convergence given in Section 5. In fact, visual comparison of Figures 2, 3 and 4 shows that the proposed estimator for the parameter has the highest speed of convergence to the true value, whereas the corresponding speed for is lower, and for even more low (cf with the rates in Theorem 1). Moreover, the simulations results show that the convergence rates significantly depend on the parameter . More precisely, it turns out that the quality of estimation increases with growing , and the best rates correspond to the case when is close to 1. This can explained by the fact that observations become less independent as increases.
7 Proofs
7.1 Proof of Theorem 1
1. For the sake of clarity we focus our analysis on the estimate First note that by (10) and (12) the difference can be decomposed as follows:
[TABLE]
2. Let us first consider the bias term in (LABEL:eq:error_dec). Note that its order obviously depends on the decay of the Fourier transform which is related to the smoothness of , see [4]. Then by the Plancherel identity
[TABLE]
since , and (17).
2. As for the statistical error, we first note that
[TABLE]
Consider the event
[TABLE]
where and as Using the same techniques as in Theorem 2 from [3], one can show that from the condition if follows that
[TABLE]
provided with some depending on On the event , it holds
[TABLE]
because for any Moreover,
[TABLE]
and therefore on the event
[TABLE]
where we use the inequality for any Therefore, the statistical error can be further decomposed as follows:
[TABLE]
with the first order (linear) term
[TABLE]
and the remainder , which contains higher order powers of and On the event
[TABLE]
and we finally conclude that at least for large it holds
[TABLE]
where
[TABLE]
with some
4. The linear term can be analysed as follows. We have and
[TABLE]
It holds
[TABLE]
Let and compute
[TABLE]
Using the inequality which holds for any we get
[TABLE]
Due to the Lévy-Khintchine formula (5), we derive for any
[TABLE]
with As a result
[TABLE]
Hence
[TABLE]
where
[TABLE]
if and
[TABLE]
for In the limiting case we get
[TABLE]
As a result
[TABLE]
where the function is bounded provided
For further analysis of the terms in (7.1), we need also some asymptotic upper bound for the relation for large Combining (7) with (8), we get
[TABLE]
and therefore it holds as .
So we have for
[TABLE]
with some Analogously we get the upper bounds for for instance,
[TABLE]
Finally, taking into account that
[TABLE]
with any we conclude that due to Markov inequality,
[TABLE]
4. Joint consideration of (LABEL:bias), (21) and (22) concludes the proof. In fact, under the choice with we get that both and are of the polynomial order.
7.2 Proof of Theorem 2
Below we focus on the proof of the first statement.
A scheme for the proof of lower bounds is introduced in [2] and (more generally) in [8]. Shortly speaking, it is sufficient to construct two models from the class , say and (depending on ), such that
[TABLE]
and the -difference between the corresponding probability measures is bounded by
[TABLE]
where and are the probability densities for the first and the second models resp.
1. Let us first present the models. The first model has the triplet with and a Lévy density where is chosen to guarantee We now perturb such that for low frequencies the characteristic functions still coincide. For this reason, we take a flat-top kernel such that
[TABLE]
This kernel and its derivatives have polynomial decay of any order, that is, for any and any it holds at least for large . Introduce for some (bandwidth) .
Introduce the second model via the triplet , where
[TABLE]
with some which we will specify latter. Note that this model also belongs to the considered class when is small enough, provided since then as
[TABLE]
(uniformly over ) follows from the polynomial decay of of any order.
2. On the second step, we consider the difference between the models. For the corresponding characteristic exponents we obtain (note , ):
[TABLE]
which is zero for .
For further analysis, we need a lower bound for the marginal density of the process
[TABLE]
where is a Levy process with triplet Note that since the process is stationary, we can take any in particular, Taking into account the decomposition (3), we conclude that
[TABLE]
where and is the density of a random variable
[TABLE]
with being i.i.d random variables with density and being i.i.d. random variables with uniform law on . In view of the positivity of the summands, and the exponential decay of the Gaussian density (uniformly for and keeping fixed), we derive
[TABLE]
This yields the following upper bound for the difference between the models:
[TABLE]
and due to the Plancherel identity, we get
[TABLE]
With the inequality for with we can estimate the -norm between the characteristic functions and :
[TABLE]
where we use that
[TABLE]
since Analogously, we get the upper bound for the second summand in (23):
[TABLE]
Due to the definition of the class () and to the assumption , we get that and as Therefore, applying (6), we get the same asymptotics for the first and second derivatives of the function whereas and . Finally, we get
[TABLE]
3. To conclude the proof, we choose with fixed , and
[TABLE]
Then
[TABLE]
and
[TABLE]
with some constant depending on and . This observation completes the proof.
Acknowledgment
The study has been funded by the Russian Academic Excellence Project “5-100”.
Appendix. Some auxiliary results
Lemma 1**.**
Let be a characteristic exponent of in the form (5), and let . Then for any , it holds
Proof.
Note that
[TABLE]
since
[TABLE]
where the change of places between limit and integral is possible due to the Lebesque theorem. In fact,
[TABLE]
and due to the assumption. ∎
References
- [1]
Barndorff-Nielsen, Ole E. and Schmiegel, J.
Brownian semistationary processes and volatility/intermittency.
In Advanced financial modelling, volume 8 of Radon Ser. Comput. Appl. Math., pages 1–25. Walter de Gruyter, Berlin, 2009.
- [2]
Belomestny, D., and Reiss, M.
Estimation and Calibration of Lévy Models via Fourier Methods.
In Lévy Matters IV: Estimation for Discretely Observed Lévy Processes, pages 1–76. Springer International Publishing, Cham, 2015.
- [3]
Belomestny, D., Panov, V., and Woerner, J.
Low frequency estimation of continuous-time moving average Lévy processes.
arXiv: 1607.00896, 2016.
- [4]
Kawata, T.
Fourier analysis in probability theory.
Academic Press, 1972.
- [5]
Panov, V.
Abelian theorems for stochastic volatility models and semiparametric estimation of the signal space.
PhD thesis, Humboldt University, 2012.
- [6]
Rajput, B. and Rosiński, J.
Spectral representations of infinitely divisible processes.
Probability Theory and Related Fields, 82(3):451–487, 1989.
- [7]
Schnurr, A. and Woerner, J. H. C.
Well-balanced Lévy driven Ornstein-Uhlenbeck processes.
Stat. Risk Model., 28(4):343–357, 2011.
- [8]
Tsybakov, A.
Introduction to nonparametric estimation.
Springer, New York, 2009.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Barndorff-Nielsen, Ole E. and Schmiegel, J. Brownian semistationary processes and volatility/intermittency. In Advanced financial modelling , volume 8 of Radon Ser. Comput. Appl. Math. , pages 1–25. Walter de Gruyter, Berlin, 2009.
- 2[2] Belomestny, D., and Reiss, M. Estimation and Calibration of Lévy Models via Fourier Methods. In Lévy Matters IV: Estimation for Discretely Observed Lévy Processes , pages 1–76. Springer International Publishing, Cham, 2015.
- 3[3] Belomestny, D., Panov, V., and Woerner, J. Low frequency estimation of continuous-time moving average Lévy processes. ar Xiv: 1607.00896, 2016.
- 4[4] Kawata, T. Fourier analysis in probability theory . Academic Press, 1972.
- 5[5] Panov, V. Abelian theorems for stochastic volatility models and semiparametric estimation of the signal space . Ph D thesis, Humboldt University, 2012.
- 6[6] Rajput, B. and Rosiński, J. Spectral representations of infinitely divisible processes. Probability Theory and Related Fields , 82(3):451–487, 1989.
- 7[7] Schnurr, A. and Woerner, J. H. C. Well-balanced Lévy driven Ornstein-Uhlenbeck processes. Stat. Risk Model. , 28(4):343–357, 2011.
- 8[8] Tsybakov, A. Introduction to nonparametric estimation . Springer, New York, 2009.
