Statistical inference for moving-average L\'evy-driven processes:   Fourier-based approach

Denis Belomestny; Tatiana Orlova; and Vladimir Panov

arXiv:1702.02794·stat.ME·February 10, 2017

Statistical inference for moving-average L\'evy-driven processes: Fourier-based approach

Denis Belomestny, Tatiana Orlova, and Vladimir Panov

PDF

Open Access

TL;DR

This paper introduces a Fourier-based semiparametric estimation method for moving-average Lévy-driven processes, establishing optimal convergence rates and advancing statistical inference in continuous-time stochastic models.

Contribution

It presents a novel Fourier-based estimation approach for Lévy-driven processes with proven optimal convergence rates.

Findings

01

Estimation method achieves minimax optimal convergence rates.

02

Method effectively handles continuous-time moving-average Lévy processes.

03

Provides theoretical guarantees for the proposed estimators.

Abstract

We consider a new method of the semiparametric statistical estimation for the continuous-time moving average L\'evy processes. We derive the convergence rates of the proposed estimators, and show that these rates are optimal in the minimax sense.

Tables3

Table 1. Table 1: Optimal sequences for the estimation of parameter λ 𝜆 \lambda

α = 0.5

α = 0.8

α = 0.9

$n$	$U_{n}$
1000	1.2
2000	1.35
3000	1.4
5000	1.45
10000	1.55

$n$	$U_{n}$
1000	3.5
2000	3.5
3000	3.8
5000	4
10000	4.2

$n$	$U_{n}$
1000	4.5
2000	4.5
3000	4.6
5000	4.8
10000	5.1

Table 2. Table 2: Optimal sequences for the estimation of parameter γ 𝛾 \gamma

α = 0.5

α = 0.8

α = 0.9

$n$	$U_{n}$
1000	0.8
2000	0.85
3000	1
5000	1.1
10000	1.3

$n$	$U_{n}$
1000	2.7
2000	2.75
3000	2.75
5000	2.8
10000	3

$n$	$U_{n}$
1000	3
2000	3.2
3000	3.2
5000	3.3
10000	3.5

Table 3. Table 3: Optimal sequences for the estimation of parameter σ 𝜎 \sigma

α = 0.5

α = 0.8

α = 0.9

$n$	$U_{n}$
1000	8
2000	8
3000	8
5000	8.2
10000	8.5

$n$	$U_{n}$
1000	8.5
2000	8.55
3000	8.6
5000	8.7
10000	8.8

$n$	$U_{n}$
1000	8.75
2000	8.8
3000	8.8
5000	9
10000	9.2

Equations180

Z_{t} = \int_{- \infty}^{\infty} K (t - s) d L_{s}

Z_{t} = \int_{- \infty}^{\infty} K (t - s) d L_{s}

K_{α} (x) := (1 - α ∣ x ∣)^{\frac{1}{α}}, ∣ x ∣ \leq α^{- 1}

K_{α} (x) := (1 - α ∣ x ∣)^{\frac{1}{α}}, ∣ x ∣ \leq α^{- 1}

L_{t} = γ t + σ W_{t} + C P P_{t}^{(1)} \cdot I I {t \geq 0} + C P P_{t}^{(2)} \cdot I I {t < 0},

L_{t} = γ t + σ W_{t} + C P P_{t}^{(1)} \cdot I I {t \geq 0} + C P P_{t}^{(2)} \cdot I I {t < 0},

C P P_{t}^{(k)} := j = 1 \sum N_{t}^{(k)} Y_{j}^{(k)}, k = 1, 2,

C P P_{t}^{(k)} := j = 1 \sum N_{t}^{(k)} Y_{j}^{(k)}, k = 1, 2,

ψ (u) = lo g E [e^{i u L_{1}}]

ψ (u) = lo g E [e^{i u L_{1}}]

Φ (u) := E [e^{i u Z_{t}}] = exp (Ψ (u)), \mbox w h er e Ψ (u) := \int_{R} ψ (u K (s)) d s,

Φ (u) := E [e^{i u Z_{t}}] = exp (Ψ (u)), \mbox w h er e Ψ (u) := \int_{R} ψ (u K (s)) d s,

K_{α}^{'} (x) = - (1 - α x)^{\frac{1 - α}{α}} = - K_{α}^{1 - α} (x),

K_{α}^{'} (x) = - (1 - α x)^{\frac{1 - α}{α}} = - K_{α}^{1 - α} (x),

Φ (u)

Φ (u)

ψ (u) = \frac{1}{2} u^{1 - α} (u^{α} lo g (Φ (u)))^{'} = \frac{1}{2} (α lo g (Φ (u)) + u \frac{Φ ^{'} ( u )}{Φ ( u )}),

ψ (u) = \frac{1}{2} u^{1 - α} (u^{α} lo g (Φ (u)))^{'} = \frac{1}{2} (α lo g (Φ (u)) + u \frac{Φ ^{'} ( u )}{Φ ( u )}),

Φ_{n} (u) = \frac{1}{n} j = 1 \sum n e^{i u Z_{j Δ}},

Φ_{n} (u) = \frac{1}{n} j = 1 \sum n e^{i u Z_{j Δ}},

ψ_{n} (u) = \frac{1}{2} (α lo g (Φ_{n} (u)) + u \frac{Φ _{n}^{'} ( u )}{Φ _{n} ( u )}),

ψ_{n} (u) = \frac{1}{2} (α lo g (Φ_{n} (u)) + u \frac{Φ _{n}^{'} ( u )}{Φ _{n} ( u )}),

w^{U_{n}} (u) := (1/ U_{n}) w (u / U_{n}),

w^{U_{n}} (u) := (1/ U_{n}) w (u / U_{n}),

(σ_{n}^{2}, λ_{n}) := argmin_{(σ^{2}, λ)} \int_{0}^{\infty} w^{U_{n}} (u) (Re [ψ_{n} (u)] + σ^{2} u^{2} /2 + λ)^{2} d u,

(σ_{n}^{2}, λ_{n}) := argmin_{(σ^{2}, λ)} \int_{0}^{\infty} w^{U_{n}} (u) (Re [ψ_{n} (u)] + σ^{2} u^{2} /2 + λ)^{2} d u,

σ_{n}^{2} = \int_{0}^{\infty} w_{σ}^{U_{n}} (u) Re ψ_{n} (u) d u,

σ_{n}^{2} = \int_{0}^{\infty} w_{σ}^{U_{n}} (u) Re ψ_{n} (u) d u,

w_{σ}^{U_{n}} (u)

w_{σ}^{U_{n}} (u)

\int_{0}^{U_{n}} (- u^{2} /2) w_{σ}^{U_{n}} (u) d u = 1, \int_{0}^{U_{n}} w_{σ}^{U_{n}} (u) d u = 0.

\int_{0}^{U_{n}} (- u^{2} /2) w_{σ}^{U_{n}} (u) d u = 1, \int_{0}^{U_{n}} w_{σ}^{U_{n}} (u) d u = 0.

λ_{n} = \int_{0}^{\infty} w_{λ}^{U_{n}} (u) Re ψ_{n} (u) d u

λ_{n} = \int_{0}^{\infty} w_{λ}^{U_{n}} (u) Re ψ_{n} (u) d u

\int_{0}^{U_{n}} (- 1) w_{λ}^{U_{n}} (u) d u = 1, \int_{0}^{U_{n}} (- u^{2} /2) w_{λ}^{U_{n}} (u) d u = 0.

\int_{0}^{U_{n}} (- 1) w_{λ}^{U_{n}} (u) d u = 1, \int_{0}^{U_{n}} (- u^{2} /2) w_{λ}^{U_{n}} (u) d u = 0.

γ_{n} := argmin_{γ} \int_{0}^{\infty} w^{U_{n}} (u) (Im ψ_{n} (u) - γ u)^{2} d u,

γ_{n} := argmin_{γ} \int_{0}^{\infty} w^{U_{n}} (u) (Im ψ_{n} (u) - γ u)^{2} d u,

γ_{n} = \int_{0}^{\infty} w_{γ}^{U_{n}} (u) Im ψ_{n} (u) d u,

γ_{n} = \int_{0}^{\infty} w_{γ}^{U_{n}} (u) Im ψ_{n} (u) d u,

\nu_{n}(x):={\cal F}^{-1}\left[\Bigl{(}\psi_{n}(\cdot)+\tfrac{\sigma_{n}^{2}}{2}(\cdot)^{2}-i\gamma_{n}(\cdot)+\lambda_{n}\Bigr{)}w_{\nu}(\cdot/U_{n})\right](x),\quad x\in{\mathbb{R}},

\nu_{n}(x):={\cal F}^{-1}\left[\Bigl{(}\psi_{n}(\cdot)+\tfrac{\sigma_{n}^{2}}{2}(\cdot)^{2}-i\gamma_{n}(\cdot)+\lambda_{n}\Bigr{)}w_{\nu}(\cdot/U_{n})\right](x),\quad x\in{\mathbb{R}},

\displaystyle\mathcal{T}_{s}=\mathcal{T}_{s}(\sigma^{\circ},R)=\Biggl{\{}\sigma\in(0,\sigma^{\circ}),\;\;\int x^{2}\nu(dx)\leq R,\;\;\left\|\nu^{(s)}\right\|_{\infty}\leq R\Biggr{\}}

\displaystyle\mathcal{T}_{s}=\mathcal{T}_{s}(\sigma^{\circ},R)=\Biggl{\{}\sigma\in(0,\sigma^{\circ}),\;\;\int x^{2}\nu(dx)\leq R,\;\;\left\|\nu^{(s)}\right\|_{\infty}\leq R\Biggr{\}}

∥ F (w_{σ}^{1} (u) / u^{s}) ∥_{L^{1}} < \infty, ∥ F (w_{λ}^{1} (u) / u^{s}) ∥_{L^{1}} < \infty,

∥ F (w_{σ}^{1} (u) / u^{s}) ∥_{L^{1}} < \infty, ∥ F (w_{λ}^{1} (u) / u^{s}) ∥_{L^{1}} < \infty,

∥ F (w_{γ}^{1} (u) / u^{s}) ∥_{L^{1}} < \infty.

∥ F (w_{γ}^{1} (u) / u^{s}) ∥_{L^{1}} < \infty.

A \to + \infty lim n \to + \infty \overline{lim} (γ, σ, ν) \in T_{s} sup P {σ_{n}^{2} - σ^{2} \geq A \cdot U_{n}^{- (s + 3)}}

A \to + \infty lim n \to + \infty \overline{lim} (γ, σ, ν) \in T_{s} sup P {σ_{n}^{2} - σ^{2} \geq A \cdot U_{n}^{- (s + 3)}}

A \to + \infty lim n \to + \infty \overline{lim} (γ, σ, ν) \in T_{s} sup P {∣ γ_{n} - γ ∣ \geq A \cdot U_{n}^{- (s + 2)}}

A \to + \infty lim n \to + \infty \overline{lim} (γ, σ, ν) \in T_{s} sup P {∣ λ_{n} - λ ∣ \geq A \cdot U_{n}^{- (s + 1)}}

n \to + \infty \underline{lim} \overset{σ}{˘}_{n} in f (γ, σ, ν) \in T_{s} sup P {\overset{σ}{˘}_{n}^{2} - σ^{2} \geq A \cdot (lo g (n))^{- (s + 3) /2}}

n \to + \infty \underline{lim} \overset{σ}{˘}_{n} in f (γ, σ, ν) \in T_{s} sup P {\overset{σ}{˘}_{n}^{2} - σ^{2} \geq A \cdot (lo g (n))^{- (s + 3) /2}}

n \to + \infty \underline{lim} \overset{γ}{˘}_{n} in f (γ, σ, ν) \in T_{s} sup P {∣ \overset{γ}{˘}_{n} - γ ∣ \geq A \cdot (lo g (n))^{- (s + 2) /2}}

n \to + \infty \underline{lim} \overset{˘}{λ}_{n} in f (γ, σ, ν) \in T_{s} sup P {\overset{˘}{λ}_{n} - λ \geq A \cdot (lo g (n))^{- (s + 1) /2}}

Z_{t} = ⎩ ⎨ ⎧ \frac{2 γ}{1 + α} + k \in K^{(1)} \sum (1 - α ∣ t - s_{k}^{(1)} ∣)^{1/ α} Y_{k}^{(1)}, \frac{2 γ}{1 + α} + k \in K^{(2)} \sum (1 - α ∣ t - s_{k}^{(1)} ∣)^{1/ α} Y_{k}^{(1)} + k \in K^{(3)} \sum (1 - α ∣ t + s_{k}^{(2)} ∣)^{1/ α} Y_{k}^{(2)}, if t \geq \frac{1}{α} if t < \frac{1}{α},

Z_{t} = ⎩ ⎨ ⎧ \frac{2 γ}{1 + α} + k \in K^{(1)} \sum (1 - α ∣ t - s_{k}^{(1)} ∣)^{1/ α} Y_{k}^{(1)}, \frac{2 γ}{1 + α} + k \in K^{(2)} \sum (1 - α ∣ t - s_{k}^{(1)} ∣)^{1/ α} Y_{k}^{(1)} + k \in K^{(3)} \sum (1 - α ∣ t + s_{k}^{(2)} ∣)^{1/ α} Y_{k}^{(2)}, if t \geq \frac{1}{α} if t < \frac{1}{α},

K^{(1)}

K^{(1)}

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic processes and financial applications · Statistical Methods and Inference · Financial Risk and Volatility Modeling

Full text

Statistical inference for moving-average Lévy-driven processes: Fourier-based approach

Denis Belomestny1,2, Tatiana Orlova1, and Vladimir Panov1

1 Laboratory of Stochastic Analysis and its Applications

National Research University Higher School of Economics

Shabolovka, 26, 119049 Moscow, Russia

and

2 University of Duisburg-Essen

Thea-Leymann-Str. 9, 45127 Essen, Germany

Abstract

We consider a new method of the semiparametric statistical estimation for the continuous-time moving average Lévy processes. We derive the convergence rates of the proposed estimators, and show that these rates are optimal in the minimax sense.

keywords:

moving average , Lévy processes , low-frequency estimation , Fourier methods

††journal: Statistics & Probability Letters

1 Introduction

Generally speaking, continuous-time Lévy-driven moving average processes are defined as

[TABLE]

where $\mathcal{K}$ is a deterministic kernel and $L=(L_{t})_{t\in\mathbb{R}}$ is a two-sided Lévy process with Lévy triplet $(\gamma,\sigma,\nu)$ . The conditions which guarantee that this integral is well-defined are given in the pioneering work by Rajput and Rosinski [6]. For instance, if $\int x^{2}\nu(dx)<\infty$ , it is sufficient to assume that $\mathcal{K}\in\mathcal{L}^{1}({\mathbb{R}})\cap\mathcal{L}^{2}({\mathbb{R}})$ . Some popular choices for the kernel are $\mathcal{K}(t)=t^{\alpha}e^{-\lambda t}1_{[0,\infty)}(t)$ with $\lambda>0$ and $\alpha>-1/2$ , (Gamma-kernels, see e.g. Barndorff-Nielsen and Schmiegel [1]), or $\mathcal{K}(t)=e^{-\lambda|t|}$ (well-balanced Ornstein-Uhlenbeck process, see Schnurr and Woerner [7]).

Recently, Belomestny, Panov and Woerner [3] consider the statistical estimation of the Lévy measure $\nu$ from the low-frequency observations of the process $(Z_{t})$ . The approach presented in [3] is rather general - in particular, it works well under various choices of $\mathcal{K}$ . Nevertheless, this approach is based on the superposition of the Mellin and Fourier transforms of the Lévy measure, and therefore its practical implementation can meet some computational difficulties.

In this paper, we present another method, which essentially uses the fact that in some cases there exists a direct relation between the characteristic exponent of the process $L$ and the characteristic function of the process $Z.$ Therefore, the characteristic exponent can be estimated from the observations of the process $Z,$ and further application of the Fourier techniques from Belomestny and Reiss [2] and Panov [5] leads to the construction of a consistent estimator of the Lévy triplet.

The paper is organised as follows. In the next session, we provide the specifications of our model. In Section 3, we present the key mathematical idea, which lies in the core of the estimation procedure presented in Section 4. The upper and lower error bounds for the proposed estimates are given in Section 5. Joint consideration of the corresponding results, Theorems 1 and 2, yields the optimality of the estimates. Finally, in Section 6, we illustrate our approach with some numerical examples. All proofs are collected in Section 7.

2 Set-up

In this work, we consider the integrals of the form (1), where $\mathcal{K}$ is a symmetric kernel of the form:

[TABLE]

for some $\alpha\in(0,1).$ As a limiting case for $\alpha\searrow 0,$ we get the exponential kernel $\mathcal{K}_{0}(x)=\exp(-x).$ Here, for simplicity, we restrict our attention to a particular class of two-sided Lévy processes with jumps represented by a compound Poisson process $CPP_{t}$ ,

[TABLE]

where $\gamma\in{\mathbb{R}}$ is a drift, $\sigma\geq 0$ , $W_{t}$ is a Brownian motion, $N_{t}^{(1)},N_{t}^{(2)},$ are 2 Poisson processes with intensity $\lambda$ , $Y_{1}^{(1)},Y_{2}^{(1)},...$ and $Y_{1}^{(2)},Y_{2}^{(2)},...$ are i.i.d. r.v’s with absolutely continuous distribution, and all $Y$ ’s, $N^{(1)}_{t},$ $N^{(2)}_{t}$ , $W_{t}$ are jointly independent. Due to the Lévy-Khintchine formula, the characteristic exponent of $L$ is given by

[TABLE]

where $\nu$ is a Lévy measure of $(L_{t})$ , and $\mathcal{F}[\nu](u)=\int_{{\mathbb{R}}}e^{iux}\nu(dx)$ stands for the Fourier transform of $\nu.$

It is important to note that the process $\left(Z_{t}\right)_{t\in\mathbb{R}}$ is strictly stationary with the characteristic function of the form

[TABLE]

and therefore for any time points $t_{1},...,t_{n},$ the r.v.’s $Z_{t_{1}},...,Z_{t_{n}}$ are identically distributed (but dependent). Our aim is to estimate the Lévy triplet $(\gamma,\sigma,\nu)$ based on the equidistant observations of the process $Z_{t}$ at the time points $\Delta,2\Delta,..,n\Delta,$ where $\Delta>0$ is fixed (low-frequency set-up).

3 Main idea

The key observation is that under our choice of the kernel function $\mathcal{K},$ we can represent the characteristic exponent $\psi(\cdot)$ of the process $(L_{t})$ via the characteristic function $\Phi(\cdot)$ of the process $Z_{t}$ . More precisely, since

[TABLE]

we have

[TABLE]

Therefore, we derive

[TABLE]

since $\psi(u)u^{\alpha-1}\to 0$ as $u\to+0$ provided that $\int|x|\nu(dx)<\infty$ , see Lemma 1. Therefore, the characteristic exponent $\psi$ can be directly estimated from data via a plug-in estimator based on the empirical characteristic function of $Z$ .

Moreover, returning to the representation (5), we conclude that the Lévy triplet $(\gamma,\sigma,\nu)$ can be estimated from $\psi.$ In fact, since $\nu$ is absolutely continuous with an absolutely integrable density, then by the Riemann-Lebesgue lemma (see [4], p. 43) $\mathcal{F}[\nu](u)\to 0$ as $|u|\to\infty,$ and consequently $\psi(u)$ can be viewed, at least for large $|u|,$ as a second order polynomial with the coefficients $(-\lambda,i\gamma,-\sigma^{2}/2).$ This observation gives rise for the estimation procedure, which we present in the next session.

4 Estimation procedure

Assume that the process $(Z_{t})$ is observed on the equidistant time grid $t\in\left\{\Delta,2\Delta,...,n\Delta\right\},$ where $\Delta$ is fixed.

Step 1: estimation of $\psi$ . Define

[TABLE]

and set

[TABLE]

where the branch of the complex logarithm is taken in such a way that $\psi_{n}$ is continuous on $(-x_{0,n},x_{0,n})$ with $\psi_{n}(0)=0$ and $x_{0,n}$ being the first zero of $\Phi_{n}.$ In fact, since $\Phi$ does not vanish on $\mathbb{R}$ , we have $x_{0,n}\stackrel{{\scriptstyle a.s.}}{{\to}}\infty.$

Step 2: estimation of $\sigma$ and $\lambda$ . Let $U_{n}\to\infty$ and

[TABLE]

where $\widetilde{w}(u)$ is a continuous function, supported on the interval $[\varepsilon,1]$ with some $\varepsilon>0.$ Consider now the optimisation problem

[TABLE]

which has the solution

[TABLE]

with

[TABLE]

Note that the weighting function $w_{\sigma}^{U_{n}}(u)$ satisfies the property $w_{\sigma}^{U_{n}}(u)=U_{n}^{-3}w_{\sigma}^{1}(u/U_{n})$ , and moreover,

[TABLE]

Analogously,

[TABLE]

holds with $w_{\lambda}^{U_{n}}(u)=U_{n}^{-1}w_{\lambda}^{1}(u/U_{n})$ satisfying the properties

[TABLE]

Step 3: estimation of $\gamma$ . Finally, the parameter $\gamma$ can be estimated by considering the optimisation problem

[TABLE]

which leads to the estimate

[TABLE]

where $w_{\gamma}^{U_{n}}(u)=U_{n}^{-2}w_{\gamma}^{1}(u/U_{n})$ fulfills $\int_{0}^{U_{n}}u\,w_{\gamma}^{U_{n}}(u)\,du=1.$ All functions $w_{\sigma}^{1}$ , $w_{\gamma}^{1}$ and $w_{\lambda}^{1}$ are supported on $[\varepsilon,1]$ and bounded.

Step 4: estimation of the Lévy density. Note that under our assumptions on the Lévy process $(L_{t})$ (see Section 2), the Levy measure $\nu$ possesses a density, which we denote, with a slight abuse of notation, also by $\nu(x)$ . This Lévy density can be estimated as a regularised inverse Fourier transform of the remainder:

[TABLE]

where $w_{\nu}$ is a weight function supported on $[-1,1].$ Note that $\int_{{\mathbb{R}}}\nu_{n}(x)\,dx=\lambda_{n},$ if $w_{\nu}(0)=1.$

5 Error bounds

Theorem 1.

Consider the model (1), where $\mathcal{K}$ is a kernel in the form (2) and $(L_{t})$ is a Lévy process in the form (3) with triplet $\left(\gamma,\sigma,\nu\right)$ . Assume that the Lévy density $\nu$ is $s$ -times weakly differentiable for some $s\in\mathbb{N}$ , and moreover the Lévy triplet belongs to the class

[TABLE]

*with some $\sigma^{\circ},R>0.$ Assume also that the weighting functions satisfy the conditions *

[TABLE]

Then it holds

[TABLE]

provided $U_{n}=\sqrt{\kappa\log(n)}$ with some constant $\kappa>0$ depending on $\sigma^{\circ}$ and $R.$

As shown in the next theorem, the above rates are optimal in minimax sense.

Theorem 2.

For any $\sigma^{\circ},R>0,$ there exists some $A>0$ such that

[TABLE]

where the infimums are taken over all possible estimates $\breve{\sigma}_{n},\breve{\gamma}_{n},\breve{\lambda}_{n}$ of the parameters $\sigma,\gamma,\lambda,$ and supremums - over all triplets from the class $\mathcal{T}_{s}=\mathcal{T}_{s}(\sigma^{\circ},R).$

6 Numerical example

Consider the integral (1) with the kernel $\mathcal{K}(x)$ from the class (2), and the Lévy process $(L_{t})$ defined by (3)-(4). For simulation study, we take $\gamma=5,$ $\lambda=1$ and $\sigma=0$ , and aim to estimate these parameters under different choices of the parameter $\alpha$ , namely $\alpha=0.5,$ $0.8$ and $0.9$ .

Simulation. For $k=1,2$ , denote the jump times of $L_{t}^{(k)}$ by $s_{1}^{(k)},s_{2}^{(k)},....$ , corresponding to the jump sizes $Y_{1}^{(k)},Y_{2}^{(k)},...$ Note that

[TABLE]

where

[TABLE]

Typical trajectory of the process $Z_{t}$ is presented on Figure 1.

Estimation. Following the ideas from Section 4, we estimate the parameters $\gamma,\lambda,\sigma$ under different choices of $\alpha.$

To show the convergence properties of the considered estimates, we provide simulations with different values of $n$ . The boxplots of the corresponding estimation errors (differences) based on 25 simulation runs are presented on Figures 2, 3 and 4. Note that the parameter $U_{n}$ is chosen by numerical optimisation. The exact values are presented in Tables 1 and 2.

The simulation study illustrates our theoretical results on the rates of convergence given in Section 5. In fact, visual comparison of Figures 2, 3 and 4 shows that the proposed estimator for the parameter $\sigma$ has the highest speed of convergence to the true value, whereas the corresponding speed for $\gamma_{n}$ is lower, and for $\lambda_{n}$ even more low (cf with the rates in Theorem 1). Moreover, the simulations results show that the convergence rates significantly depend on the parameter $\alpha$ . More precisely, it turns out that the quality of estimation increases with growing $\alpha$ , and the best rates correspond to the case when $\alpha$ is close to 1. This can explained by the fact that observations become less independent as $\alpha$ increases.

7 Proofs

7.1 Proof of Theorem 1

1. For the sake of clarity we focus our analysis on the estimate $\sigma_{n}.$ First note that by (10) and (12) the difference $\sigma_{n}^{2}-\sigma^{2}$ can be decomposed as follows:

[TABLE]

2. Let us first consider the bias term in (LABEL:eq:error_dec). Note that its order obviously depends on the decay of the Fourier transform $\mathcal{F}[\nu](u),$ which is related to the smoothness of $\nu$ , see [4]. Then by the Plancherel identity

[TABLE]

since $w_{\sigma}^{U_{n}}(u)=U_{n}^{-3}w_{\sigma}^{1}(u/U_{n})$ , $\left\|\nu^{(s)}\right\|_{\infty}\leq R,$ and (17).

2. As for the statistical error, we first note that

[TABLE]

Consider the event

[TABLE]

where $D_{n,j}(u)=(\Phi_{n}^{(j)}(u)-\Phi^{(j)}(u))/\Phi(u),$ $j=0,1,$ and $\varepsilon_{n}\to 0$ as $n\to\infty.$ Using the same techniques as in Theorem 2 from [3], one can show that from the condition $\int_{|x|>1}x^{2}\nu(dx)<\infty,$ if follows that

[TABLE]

provided $\varepsilon_{n}=\sqrt{\log(n)/n}\exp\{C_{2}\sigma^{2}U_{n}^{2}\}$ with some $C_{1},C_{2}>0$ depending on $\alpha.$ On the event $\mathcal{B}_{n,A}$ , it holds

[TABLE]

because $\left|\log(1+z)-z\right|\leq 2|z|^{2}$ for any $|z|<1/2.$ Moreover,

[TABLE]

and therefore on the event $\mathcal{A},$

[TABLE]

where we use the inequality $|((1+z)^{-1}-1|\leq 2|z|$ for any $|z|<1/2.$ Therefore, the statistical error can be further decomposed as follows:

[TABLE]

with the first order (linear) term $L_{n}=\operatorname{Re}\breve{L}_{n},$

[TABLE]

and the remainder $R_{n}$ , which contains higher order powers of $D_{n,0}$ and $D_{n,1}.$ On the event $\mathcal{B}_{n,A},$

[TABLE]

and we finally conclude that at least for large $n$ it holds

[TABLE]

where

[TABLE]

with some $C_{3}>0.$

4. The linear term $L_{n}$ can be analysed as follows. We have ${\mathbb{E}}[\breve{L}_{n}]=0,$ $\operatorname{Var}L_{n}\leq\operatorname{Var}\breve{L}_{n},$ and

[TABLE]

It holds

[TABLE]

Let $t>s$ and compute

[TABLE]

Using the inequality $\left|e^{z}-e^{y}\right|\leq\left(\left|e^{z}\right|\vee\left|e^{y}\right|\right)\left|y-z\right|,$ which holds for any $z,y\in\mathbb{C},$ we get

[TABLE]

Due to the Lévy-Khintchine formula (5), we derive for any $u_{1},u_{2}\in\mathbb{R},$

[TABLE]

with $C=\sigma^{2}+\int_{\mathbb{R}}x^{2}\nu(dx)<\infty.$ As a result

[TABLE]

Hence

[TABLE]

where

[TABLE]

if $\left|h\right|>2/\alpha$ and

[TABLE]

for $\left|h\right|\leq 2/\alpha.$ In the limiting case $\alpha\searrow 0,$ we get

[TABLE]

As a result

[TABLE]

where the function $Q(u,v)$ is bounded provided $\sigma>0.$

For further analysis of the terms $I_{1}-I_{4}$ in (7.1), we need also some asymptotic upper bound for the relation $|\Phi^{\prime}(u)/\Phi(u)|$ for large $u.$ Combining (7) with (8), we get

[TABLE]

and therefore it holds $\left|\Phi^{\prime}(u)/\Phi(u)\right|\lesssim u$ as $|u|\to\infty$ .

So we have for $I_{1}$

[TABLE]

with some $Q^{*}>0.$ Analogously we get the upper bounds for $I_{2},I_{3},I_{4},$ for instance,

[TABLE]

Finally, taking into account that

[TABLE]

with any $C_{4}>\sigma^{2}/(4+2\alpha),$ we conclude that due to Markov inequality,

[TABLE]

4. Joint consideration of (LABEL:bias), (21) and (22) concludes the proof. In fact, under the choice $U_{n}=\sqrt{\kappa\log(n)}$ with $\kappa<\min\left(C_{4}^{-1},(2C_{2}\sigma^{2})^{-1}\right)$ we get that both $g_{n,1}$ and $g_{n,2}$ are of the polynomial order.

7.2 Proof of Theorem 2

Below we focus on the proof of the first statement.

A scheme for the proof of lower bounds is introduced in [2] and (more generally) in [8]. Shortly speaking, it is sufficient to construct two models from the class $\mathcal{T}_{s}(\sigma^{\circ},R)$ , say $(\gamma_{0},\sigma_{0},\nu_{0})$ and $(\gamma_{1},\sigma_{1},\nu_{1})$ (depending on $n$ ), such that

[TABLE]

and the $\chi^{2}$ -difference between the corresponding probability measures is bounded by $1/n:$

[TABLE]

where $p_{0}$ and $p_{1}$ are the probability densities for the first and the second models resp.

1. Let us first present the models. The first model has the triplet $(0,\sigma_{0},\nu_{0})\in\mathcal{T}_{s}(\sigma^{\circ},R)$ with $\sigma_{0}=\sigma=\sigma^{\circ}/2$ and a Lévy density $\nu_{0}(x)=\nu(x)=c(1+\left|x\right|)^{-4},$ where $c>0$ is chosen to guarantee $\left\|\nu^{(s)}\right\|_{\infty}\leq R.$ We now perturb $(\sigma,\nu)$ such that for low frequencies the characteristic functions still coincide. For this reason, we take a flat-top kernel $K$ such that

[TABLE]

This kernel and its derivatives have polynomial decay of any order, that is, for any $r=0,1,2,...$ and any $q=1,2,..$ it holds $K^{(r)}(x)\leq(1+|x|)^{-q}$ at least for large $|x|$ . Introduce $K_{h}(x)=h^{-1}K(h^{-1}x)$ for some (bandwidth) $h>0$ .

Introduce the second model via the triplet $(0,\sigma_{1},\nu_{1})$ , where

[TABLE]

with some $\delta>0,$ which we will specify latter. Note that this model also belongs to the considered class $\mathcal{T}_{s}(\sigma^{\circ},R)$ when $h$ is small enough, provided $\delta=o(h^{3})$ since then as $h\to 0$

[TABLE]

(uniformly over $x\in{\mathbb{R}}$ ) follows from the polynomial decay of $K^{\prime\prime}$ of any order.

2. On the second step, we consider the difference between the models. For the corresponding characteristic exponents we obtain (note ${\cal F}K_{h}^{\prime\prime}(u)=-u^{2}{\cal F}K_{h}(u)$ , $\int K_{h}^{\prime\prime}(u)du=0$ ):

[TABLE]

which is zero for $u\in[-h^{-1},h^{-1}]$ .

For further analysis, we need a lower bound for the marginal density $p_{0}$ of the process

[TABLE]

where $L_{0,t}$ is a Levy process with triplet $(0,\sigma_{0},\nu_{0}).$ Note that since the process $Z_{s}$ is stationary, we can take any $s,$ in particular, $s=0.$ Taking into account the decomposition (3), we conclude that

[TABLE]

where $\sigma_{\mathcal{K}}^{2}=\sigma_{0}^{2}\int\mathcal{K}_{\alpha}^{2}(s)\,ds$ and $q_{k}$ is the density of a random variable

[TABLE]

with $\xi_{1},\ldots,\xi_{k}$ being i.i.d random variables with density $\nu_{0}/\lambda$ and $U_{1}<\ldots<U_{k}$ being i.i.d. random variables with uniform law on $[-1/\alpha,1/\alpha]$ . In view of the positivity of the summands, $\nu_{0}\gtrsim\nu$ and the exponential decay of the Gaussian density (uniformly for $\Delta\lesssim 1$ and keeping $\lambda,\sigma_{0},\nu_{0}$ fixed), we derive

[TABLE]

This yields the following upper bound for the $\chi^{2}-$ difference between the models:

[TABLE]

and due to the Plancherel identity, we get

[TABLE]

With the inequality $\left|1-e^{-z}\right|\leq 2\left|z\right|$ for $z=x+iy\in\mathbb{C}$ with $x\geq 0$ we can estimate the $\mathcal{L}^{2}$ -norm between the characteristic functions $\Phi_{0}$ and $\Phi_{1}$ :

[TABLE]

where we use that

[TABLE]

since $|\operatorname{Re}\mathcal{F}[\nu](z)|<\lambda.$ Analogously, we get the upper bound for the second summand in (23):

[TABLE]

Due to the definition of the class ( $\int x^{2}\nu(dx)<\infty$ ) and to the assumption $\nu({\mathbb{R}})<\infty$ , we get that $|\psi^{\prime}(u)|\lesssim 1+|u|$ and $|\psi^{\prime\prime}(u)|\lesssim 1$ as $u\to\infty.$ Therefore, applying (6), we get the same asymptotics for the first and second derivatives of the function $\Psi_{0}(u),$ whereas $\Psi^{\prime}_{1}(u)\lesssim 1+|u|+\delta h^{-1}$ and $\Psi^{\prime\prime}_{1}(u)\lesssim 1+\delta h^{-2}$ . Finally, we get

[TABLE]

3. To conclude the proof, we choose $\delta=\delta^{\prime}h^{s+3}$ with fixed $\delta^{\prime}$ , and

[TABLE]

Then

[TABLE]

and

[TABLE]

with some constant $C$ depending on $\alpha$ and $\sigma^{\circ}$ . This observation completes the proof.

Acknowledgment

The study has been funded by the Russian Academic Excellence Project “5-100”.

Appendix. Some auxiliary results

Lemma 1.

Let $\psi(u)$ be a characteristic exponent of $L_{t}$ in the form (5), and let $\int|x|\nu(dx)<\infty$ . Then for any $\alpha>0$ , it holds $\lim_{u\to 0+}\psi(u)u^{\alpha-1}=0.$

Proof.

Note that

[TABLE]

since

[TABLE]

where the change of places between limit and integral is possible due to the Lebesque theorem. In fact,

[TABLE]

and $\int|x|\nu(dx)<\infty$ due to the assumption. ∎

References

[1]

Barndorff-Nielsen, Ole E. and Schmiegel, J.

Brownian semistationary processes and volatility/intermittency.

In Advanced financial modelling, volume 8 of Radon Ser. Comput. Appl. Math., pages 1–25. Walter de Gruyter, Berlin, 2009.

[2]

Belomestny, D., and Reiss, M.

Estimation and Calibration of Lévy Models via Fourier Methods.

In Lévy Matters IV: Estimation for Discretely Observed Lévy Processes, pages 1–76. Springer International Publishing, Cham, 2015.

[3]

Belomestny, D., Panov, V., and Woerner, J.

Low frequency estimation of continuous-time moving average Lévy processes.

arXiv: 1607.00896, 2016.

[4]

Kawata, T.

Fourier analysis in probability theory.

Academic Press, 1972.

[5]

Panov, V.

Abelian theorems for stochastic volatility models and semiparametric estimation of the signal space.

PhD thesis, Humboldt University, 2012.

[6]

Rajput, B. and Rosiński, J.

Spectral representations of infinitely divisible processes.

Probability Theory and Related Fields, 82(3):451–487, 1989.

[7]

Schnurr, A. and Woerner, J. H. C.

Well-balanced Lévy driven Ornstein-Uhlenbeck processes.

Stat. Risk Model., 28(4):343–357, 2011.

[8]

Tsybakov, A.

Introduction to nonparametric estimation.

Springer, New York, 2009.

Bibliography8

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Barndorff-Nielsen, Ole E. and Schmiegel, J. Brownian semistationary processes and volatility/intermittency. In Advanced financial modelling , volume 8 of Radon Ser. Comput. Appl. Math. , pages 1–25. Walter de Gruyter, Berlin, 2009.
2[2] Belomestny, D., and Reiss, M. Estimation and Calibration of Lévy Models via Fourier Methods. In Lévy Matters IV: Estimation for Discretely Observed Lévy Processes , pages 1–76. Springer International Publishing, Cham, 2015.
3[3] Belomestny, D., Panov, V., and Woerner, J. Low frequency estimation of continuous-time moving average Lévy processes. ar Xiv: 1607.00896, 2016.
4[4] Kawata, T. Fourier analysis in probability theory . Academic Press, 1972.
5[5] Panov, V. Abelian theorems for stochastic volatility models and semiparametric estimation of the signal space . Ph D thesis, Humboldt University, 2012.
6[6] Rajput, B. and Rosiński, J. Spectral representations of infinitely divisible processes. Probability Theory and Related Fields , 82(3):451–487, 1989.
7[7] Schnurr, A. and Woerner, J. H. C. Well-balanced Lévy driven Ornstein-Uhlenbeck processes. Stat. Risk Model. , 28(4):343–357, 2011.
8[8] Tsybakov, A. Introduction to nonparametric estimation . Springer, New York, 2009.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Statistical inference for moving-average Lévy-driven processes: Fourier-based approach

Abstract

keywords:

1 Introduction

2 Set-up

3 Main idea

4 Estimation procedure

5 Error bounds

Theorem 1**.**

Theorem 2**.**

6 Numerical example

7 Proofs

7.1 Proof of Theorem 1

7.2 Proof of Theorem 2

Acknowledgment

Appendix. Some auxiliary results

Lemma 1**.**

Proof.

References

Theorem 1.

Theorem 2.

Lemma 1.