An Inverse Problem for Infinitely Divisible Moving Average Random Fields

Wolfgang Karcher; Stefan Roth; Evgeny Spodarev; Corinna Walk

arXiv:1705.09542·math.ST·May 29, 2017

An Inverse Problem for Infinitely Divisible Moving Average Random Fields

Wolfgang Karcher, Stefan Roth, Evgeny Spodarev, Corinna Walk

PDF

TL;DR

This paper addresses the nonparametric estimation of Lévy characteristics in infinitely divisible moving average random fields using three different methods, providing theoretical error bounds and simulation comparisons.

Contribution

It introduces three novel estimation methods for Lévy densities in random fields and analyzes their theoretical performance and practical effectiveness.

Findings

01

All three methods provide consistent $L^2$-error bounds.

02

Numerical simulations compare the performance of the methods.

03

The Fourier-based approach shows promising accuracy in simulations.

Abstract

Given a low frequency sample of an infinitely divisible moving average random field ${\int_{R^{d}} f (x - t) Λ (d x); t \in R^{d}}$ with a known simple function $f$ , we study the problem of nonparametric estimation of the L\'{e}vy characteristics of the independently scattered random measure $Λ$ . We provide three methods, a simple plug-in approach, a method based on Fourier transforms and an approach involving decompositions with respect to $L^{2}$ -orthonormal bases, which allow to estimate the L\'{e}vy density of $Λ$ . For these methods, the bounds for the $L^{2}$ -error are given. Their numerical performance is compared in a simulation study.

Figures9

Click any figure to enlarge with its caption.

Tables2

Table 1. Table 1: Empirical mean and standard deviation of the mean square errors of our estimations based on 100 100 100 simulations.

		Method of estimation
		plug-in	Fourier	OnB
$Y_{1} \sim N (0, 1)$	mean	0.005291606	0.0005609035	0.02257974
$Y_{1} \sim N (0, 1)$	sd	0.0004369446	0.0003471337	0.001865197
$Y_{1} \sim 𝖤𝗑𝗉 (1)$	mean	0.1240124	0.1306668	0.1446655
$Y_{1} \sim 𝖤𝗑𝗉 (1)$	sd	0.004051844	0.005115684	0.007453711

Table 2. Table 2: Mean and standard deviation of the estimation CPU-times (in seconds). The computations were performed on a CPU Intel Xeon E5-2630v3, 2.4 GHz with 128 GB RAM.

		Method of estimation
		plug-in	Fourier	OnB
$Y_{1} \sim N (0, 1)$	mean	1031.18	74.95	2726.07
$Y_{1} \sim N (0, 1)$	sd	54.19	3.66	1120.59
$Y_{1} \sim 𝖤𝗑𝗉 (1)$	mean	1262.24	121.24	3165.08
$Y_{1} \sim 𝖤𝗑𝗉 (1)$	sd	13.93	18.14	721.25

Equations360

X (t) = \int_{R^{d}} f (x - t) Λ (d x), t \in R^{d},

X (t) = \int_{R^{d}} f (x - t) Λ (d x), t \in R^{d},

H^{δ} (R) = {f \in L^{2} (R) : \int_{R} ∣ F f ∣^{2} (x) (1 + x^{2})^{δ} d x < \infty}

H^{δ} (R) = {f \in L^{2} (R) : \int_{R} ∣ F f ∣^{2} (x) (1 + x^{2})^{δ} d x < \infty}

φ_{Λ (A)} (t) = exp {ν_{d} (A) K (t)}, A \in E_{0} (R^{d}),

φ_{Λ (A)} (t) = exp {ν_{d} (A) K (t)}, A \in E_{0} (R^{d}),

K (t) = i t a_{0} - \frac{1}{2} t^{2} b_{0} + R \int (e^{i t x} - 1 - i t x 1 I_{[- 1, 1]} (x)) v_{0} (x) d x,

K (t) = i t a_{0} - \frac{1}{2} t^{2} b_{0} + R \int (e^{i t x} - 1 - i t x 1 I_{[- 1, 1]} (x)) v_{0} (x) d x,

λ (A) = ν_{d} (A) ∣ a_{0} ∣ + b_{0} + R \int min {1, x^{2}} v_{0} (x) d x, A \in E_{0} (R^{d}) .

λ (A) = ν_{d} (A) ∣ a_{0} ∣ + b_{0} + R \int min {1, x^{2}} v_{0} (x) d x, A \in E_{0} (R^{d}) .

A \int f (x) Λ (d x) = j = 1 \sum n x_{j} Λ (A \cap A_{j}) .

A \int f (x) Λ (d x) = j = 1 \sum n x_{j} Λ (A \cap A_{j}) .

A \int f (x) Λ (d x) = m \to \infty P-lim A \int f^{(m)} (x) Λ (d x) .

A \int f (x) Λ (d x) = m \to \infty P-lim A \int f^{(m)} (x) Λ (d x) .

X (t) = R^{d} \int f (t - x) Λ (d x), t \in R^{d} .

X (t) = R^{d} \int f (t - x) Λ (d x), t \in R^{d} .

φ_{X (0)} (u) = exp ⎩ ⎨ ⎧ R^{d} \int K (u f (s)) d s ⎭ ⎬ ⎫,

φ_{X (0)} (u) = exp ⎩ ⎨ ⎧ R^{d} \int K (u f (s)) d s ⎭ ⎬ ⎫,

R^{d} \int K (u f (s)) d s = i u a_{1} - \frac{1}{2} u^{2} b_{1} + R \int (e^{i ux} - 1 - i ux 1 I_{[- 1, 1]} (x)) v_{1} (x) d x

R^{d} \int K (u f (s)) d s = i u a_{1} - \frac{1}{2} u^{2} b_{1} + R \int (e^{i ux} - 1 - i ux 1 I_{[- 1, 1]} (x)) v_{1} (x) d x

a_{1}

a_{1}

v_{1} (x)

U (u) = u a_{0} + R \int x [1 I_{[- 1, 1]} (ux) - 1 I_{[- 1, 1]} (x)] v_{0} (x) d x .

U (u) = u a_{0} + R \int x [1 I_{[- 1, 1]} (ux) - 1 I_{[- 1, 1]} (x)] v_{0} (x) d x .

∣∣ u - v ∣ ∣_{\infty} = 1 \leq i \leq d max ∣ u_{i} - v_{i} ∣ > m,

∣∣ u - v ∣ ∣_{\infty} = 1 \leq i \leq d max ∣ u_{i} - v_{i} ∣ > m,

\displaystyle\phi(\mathcal{U},\mathcal{V}):=\sup\bigl{\{}|P(V|U)-P(V)|:\,\,V\in\mathcal{V},\,U\in\mathcal{U},\,P(U)\neq 0\bigr{\}}

\displaystyle\phi(\mathcal{U},\mathcal{V}):=\sup\bigl{\{}|P(V|U)-P(V)|:\,\,V\in\mathcal{V},\,U\in\mathcal{U},\,P(U)\neq 0\bigr{\}}

\displaystyle\phi_{k,l}(r):=\sup\bigl{\{}\phi(\mathcal{F}_{\Gamma_{1}},\mathcal{F}_{\Gamma_{2}}):\,\,\text{card}(\Gamma_{1})\leq k,\,\text{card}(\Gamma_{2})\leq l,\,d(\Gamma_{1},\Gamma_{2})\geq r\bigr{\}},

\displaystyle\phi_{k,l}(r):=\sup\bigl{\{}\phi(\mathcal{F}_{\Gamma_{1}},\mathcal{F}_{\Gamma_{2}}):\,\,\text{card}(\Gamma_{1})\leq k,\,\text{card}(\Gamma_{2})\leq l,\,d(\Gamma_{1},\Gamma_{2})\geq r\bigr{\}},

r \to \infty lim ϕ_{k, l} (r) = 0

r \to \infty lim ϕ_{k, l} (r) = 0

\displaystyle\operatorname{\mathbb{E}}_{k}[f(X(t))]:=\operatorname{\mathbb{E}}\bigl{[}f(X(t))|\mathcal{F}_{V_{t}^{k}}\bigr{]}.

\displaystyle\operatorname{\mathbb{E}}_{k}[f(X(t))]:=\operatorname{\mathbb{E}}\bigl{[}f(X(t))|\mathcal{F}_{V_{t}^{k}}\bigr{]}.

\displaystyle\biggl{(}\operatorname{\mathbb{E}}\biggl{|}\sum_{t\in U}X(t)\biggr{|}^{p}\biggl{)}^{1/p}\leq\biggl{(}2p\sum_{t\in U}b_{t,p/2}(X)\biggr{)}^{1/2},

\displaystyle\biggl{(}\operatorname{\mathbb{E}}\biggl{|}\sum_{t\in U}X(t)\biggr{|}^{p}\biggl{)}^{1/p}\leq\biggl{(}2p\sum_{t\in U}b_{t,p/2}(X)\biggr{)}^{1/2},

\displaystyle P\Bigl{(}\Bigl{|}\sum_{t\in U}X(t)\Bigr{|}>x\Bigr{)}\leq\exp\biggl{\{}\frac{1}{e}-\frac{x^{2}}{4eb}\biggr{\}}.

\displaystyle P\Bigl{(}\Bigl{|}\sum_{t\in U}X(t)\Bigr{|}>x\Bigr{)}\leq\exp\biggl{\{}\frac{1}{e}-\frac{x^{2}}{4eb}\biggr{\}}.

r = 1 \sum \infty (r + 1)^{d (c - u + 1) - 1} [ϕ_{u, v} (r)]^{1/ c} < \infty

r = 1 \sum \infty (r + 1)^{d (c - u + 1) - 1} [ϕ_{u, v} (r)]^{1/ c} < \infty

\displaystyle\operatorname{\mathbb{E}}\biggl{|}\sum_{t\in U}X(t)\biggr{|}^{p}\leq C\cdot\max\biggl{\{}\sum_{t\in U}\operatorname{\mathbb{E}}|X(t)|^{p},\Bigl{(}\sum_{t\in U}\operatorname{\mathbb{E}}|X(t)|^{2}\Bigr{)}^{p/2}\biggr{\}}.

\displaystyle\operatorname{\mathbb{E}}\biggl{|}\sum_{t\in U}X(t)\biggr{|}^{p}\leq C\cdot\max\biggl{\{}\sum_{t\in U}\operatorname{\mathbb{E}}|X(t)|^{p},\Bigl{(}\sum_{t\in U}\operatorname{\mathbb{E}}|X(t)|^{2}\Bigr{)}^{p/2}\biggr{\}}.

B (ϕ) = j \in Z^{d} \0 \sum ϕ_{\infty, 1} (∣ j ∣) < \infty.

B (ϕ) = j \in Z^{d} \0 \sum ϕ_{\infty, 1} (∣ j ∣) < \infty.

\displaystyle P\Bigl{(}\Bigl{|}\sum_{t\in U}a_{t}X(t)\Bigr{|}>x\Bigr{)}\leq\exp\biggl{\{}\frac{1}{e}-\frac{x^{2}}{4(1+B(\phi))A(U)eh^{2}}\biggr{\}}.

\displaystyle P\Bigl{(}\Bigl{|}\sum_{t\in U}a_{t}X(t)\Bigr{|}>x\Bigr{)}\leq\exp\biggl{\{}\frac{1}{e}-\frac{x^{2}}{4(1+B(\phi))A(U)eh^{2}}\biggr{\}}.

X (t) = R^{d} \int f (t - x) Λ (d x) = k = 1 \sum n f_{k} Λ (t - Δ_{k}), t \in R^{d},

X (t) = R^{d} \int f (t - x) Λ (d x) = k = 1 \sum n f_{k} Λ (t - Δ_{k}), t \in R^{d},

a_{1}

a_{1}

v_{1} (x)

min {1, \cdot^{2}} g (\cdot) / h (\cdot) \in L^{1} (R) \mbox f or an y g \in L^{2} (R),

min {1, \cdot^{2}} g (\cdot) / h (\cdot) \in L^{1} (R) \mbox f or an y g \in L^{2} (R),

s (y) = x sup {∣ h (x) ∣/∣ h (y x) ∣)} < \infty \mbox f or an y y \neq = 0.

s (y) = x sup {∣ h (x) ∣/∣ h (y x) ∣)} < \infty \mbox f or an y y \neq = 0.

\int_{R} \frac{min { 1 , x ^{4} }}{h ^{2} ( x )} d x < \infty.

\int_{R} \frac{min { 1 , x ^{4} }}{h ^{2} ( x )} d x < \infty.

\int_{R} min {1, x^{2}} \frac{g ( x )}{h ( x )} d x \leq (\int_{R} \frac{min { 1 , x ^{4} }}{h ^{2} ( x )} d x)^{1/2} ∣∣ g ∣ ∣_{2} < \infty.

\int_{R} min {1, x^{2}} \frac{g ( x )}{h ( x )} d x \leq (\int_{R} \frac{min { 1 , x ^{4} }}{h ^{2} ( x )} d x)^{1/2} ∣∣ g ∣ ∣_{2} < \infty.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

An Inverse Problem for Infinitely Divisible Moving Average Random Fields

W. Karcher, S. Roth, E. Spodarev, C. Walk

Abstract

Given a low frequency sample of an infinitely divisible moving average random field $\{\int_{\mathbb{R}^{d}}f(x-t)\Lambda(dx);\ t\in\mathbb{R}^{d}\}$ with a known simple function $f$ , we study the problem of nonparametric estimation of the Lévy characteristics of the independently scattered random measure $\Lambda$ . We provide three methods, a simple plug-in approach, a method based on Fourier transforms and an approach involving decompositions with respect to $L^{2}$ -orthonormal bases, which allow to estimate the Lévy density of $\Lambda$ . For these methods, the bounds for the $L^{2}$ -error are given. Their numerical performance is compared in a simulation study.

Ulm University

Keywords: Infinitely divisible random measure; stationary random field; Lévy process, moving average; Lévy density; Fourier transform; Banach fixed–point theorem.

1 Introduction

Let $\Lambda$ be a stationary infinitely divisible independently scattered random measure with Lévy characteristics $(a_{0},b_{0},v_{0})$ , where $a_{0}\geq 0$ , $b_{0}\in\mathbb{R}$ and $v_{0}$ is a Lévy density. Let furthermore $X=\{X(t);\ t\in\mathbb{R}^{d}\}$ be a moving average infinitely divisible random field on $\mathbb{R}^{d}$ defined by

[TABLE]

with Lévy characteristics $(a_{1},b_{1},v_{1})$ , where $f=\sum_{k=1}^{n}f_{k}{1{\rm I}}_{\Delta_{k}}$ is a simple function. Suppose a sample $(X(t_{1}),\dots,X(t_{N}))$ from $X$ is available. The problem studied in this paper is the nonparametric estimation of $(a_{0},b_{0},v_{0})$ . For any simple function $f$ with congruent sets $\Delta_{k}$ , $X(t)$ in (1) has the same distribution as a linear combination of i.i.d. infinitely divisible random variables. Therefore, existence and uniqueness of a characteristic triplet $(a_{0},b_{0},v_{0})$ with the property that a certain linear combination of independent random variables with the corresponding infinitely divisible distribution leading to a random variable with Lévy characteristics $(a_{1},b_{1},v_{1})$ becomes a characterization problem for such distributions. For certain distributions, namely the Poisson and the Gaussian one as well as a mixture of both, all possible distributions for the summands in the linear combination can be described (see e.g. [1]). The disadvantage of those characterization theorems is that they do not give any information about the involved parameters (expectation and variance of each summand) and so it is not possible to derive sufficient conditions for the existence of a solution in terms of the kernel function $f$ . Therefore, to solve the inverse problem, we prefer to use concrete relations between the characteristic triplets of $X$ and $\Lambda$ (Section 3) given in terms of $f$ .

The recent preprint [2] covers the case $d=1$ estimating the Lévy density $v_{0}$ of the integrator Lévy process $\{L_{s}\}$ of a moving average process $X(t)=\int_{\mathbb{R}}f(t-s)\,dL_{s},$ $t\in\mathbb{R}$ . It is assumed that $\operatorname{\mathbb{E}}\,L^{2}_{0}<\infty$ . The estimate is based on the inversion of the Mellin transform of the second derivative of the cumulant of $X(0)$ . A uniform error bound as well as the consistency of the estimate are given. It is not assumed that $f$ is simple, however, main results are subject to a number of quite restricting integrability assumptions onto $x^{2}v_{0}(x)$ and $f$ as well as mixing properties of $\{L_{s}\}$ that are tricky to check. Additionally, the logarithmic convergence rate shown there (cf. [2, Corollary 1]) is too slow.

In our approach, we develop the ideas of [3] and use Banach fixed–point theorem combined with a recursive iteration procedure (Theorem 4.1) to give sufficient conditions for the existence of a (unique) solution of our (generally speaking, ill–posed) inverse problem $v_{1}\mapsto v_{0}$ . We consider simple functions $f$ since

in applications, $f$ is mainly discretely sampled, 2. 2.

any $f\in L^{1}(\mathbb{R}^{d})$ can be approximated in the $\|\cdot\|_{1}$ –norm by a sequence of simple $f^{(m)}\in L^{1}(\mathbb{R}^{d})$ (attaining a finite number of values) arbitrarily well, 3. 3.

this allows us to use relatively simple arguments in the proofs and to avoid complex assumptions that are not easy to verify, 4. 4.

the $L^{2}$ –convergence rate of our estimates of $v_{0}$ to its true value is $O(N^{-1})$ , cf. Corollaries 5.2 and 5.3.

The case of arbitrary integrable $f$ is considered in our forthcoming paper [4].

This paper is organized as follows: Section 2 gives an introduction to the theory of infinitely divisible random measures and stochastic integrals as well as a short overview on $m$ -dependent and $\phi$ -mixing random fields together with some moment inequalities (cf. Section 2.3). In Section 3, we describe the inverse problem in detail and give formulas for the relationship between the characteristics $(a_{0},b_{0},v_{0})$ and $(a_{1},b_{1},v_{1})$ . In Section 4, we obtain sufficient conditions for the existence and uniqueness of the solution of the direct problem, i.e. we propose conditions under which the mapping $(a_{0},b_{0},v_{0})\mapsto(a_{1},b_{1},v_{1})$ is a bijection. It turns out that this holds true if either one of the coefficients $f_{1},\dots,f_{n}$ dominates all the others or one of them repeats often enough in some sense.

Estimates for the characteristic Lévy triplet of $X$ are given in Section 5 for pure jump infinitely divisible random fields. Here we use the ideas of [5], [6] and [7] originally designed to estimate the Lévy density of Lévy processes. The main result of this section is the proof of the upper bound for the $L^{2}$ -error of the proposed estimator without the assumption of independence of observations $X(t_{1}),\dots,X(t_{N})$ . The estimation error remains of the same structure as in the Lévy process case if the random field $X$ is assumed to be $m$ -dependent or $\phi$ -mixing. For the ease of reading, long proofs of the results of this section are moved to Appendix. Section 6 provides three estimation approaches for the density $v_{0}$ of $\Lambda$ . The first method is a simple plug-in approach. The second one, the Fourier method, is based on the idea of estimating first the Fourier transform of $v_{0}$ followed by another plug-in procedure. The last method uses orthonormal bases in the Hilbert space $L^{2}[-A,A]$ , $A>0$ , for a representation of the solution $v_{0}$ of the inverse problem. After approximating $v_{0}$ by cutting off its expansion, the coefficients can be estimated by solving a system of linear equations. For all our methods, we propose upper bounds for the $L^{2}$ -estimation error. In the last section, the performance of the methods is compared by numerical simulations.

2 Preliminaries

Introduce some notation that will be used throughout this paper.

By $\mathcal{B}(\mathbb{R}^{d})$ we denote the Borel $\sigma$ -field on the d-dimensional Euclidean space $\mathbb{R}^{d}$ . The Lebesgue measure on $\mathbb{R}^{d}$ is denoted by $\nu_{d}$ . We briefly write $\nu_{d}(dx)=dx$ if we integrate w.r.t. $\nu_{d}$ on $\mathbb{R}^{d}$ . The collection of all bounded Borel sets in $\mathbb{R}^{d}$ will be denoted by $\mathcal{E}_{0}(\mathbb{R}^{d})$ . For any measurable space $(M,\mathcal{M},\mu)$ we denote by $L^{\alpha}(M)$ , $1\leq\alpha<\infty$ , the space of all $\mathcal{M}|\mathcal{B}(\mathbb{R})$ -mesurable functions $f:M\rightarrow\mathbb{R}$ with $\int_{M}|f|^{\alpha}(x)\mu(dx)<\infty$ . Equipped with the norm $||\cdot||_{\alpha}=\left(\int_{M}|f|^{\alpha}(x)\mu(dx)\right)^{1/\alpha}$ , $L^{\alpha}(M)$ becomes a Banach space and even in the case $\alpha=2$ a Hilbert space with scalar product $\left\langle f,g\right\rangle_{\alpha}=\int_{M}f(x)g(x)\mu(dx)$ , for any $f,g\in L^{2}(M)$ . With $L^{\infty}(M)$ (i.e. if $\alpha=\infty$ ) we denote the space of all real valued bounded functions on $M$ . In case $(M,\mathcal{M},\mu)=(\mathbb{R},\mathcal{B}(\mathbb{R}),\nu_{1})$ we denote by

[TABLE]

the Sobolev space of order $\delta>0$ equipped with the Sobolev norm $||f||_{H^{\delta}}=||\mathcal{F}f(\cdot)(1+\cdot^{2})^{\delta/2}||_{2}$ , where $\mathcal{F}$ is the Fourier transform on $L^{2}(\mathbb{R})$ . For $f\in L^{1}(\mathbb{R})$ , $\mathcal{F}f$ is defined by $\mathcal{F}f(x)=\int_{\mathbb{R}}e^{itx}f(t)dt$ , $x\in\mathbb{R}$ . If $(M,\mathcal{M},\mu)=(\mathbb{N},2^{\mathbb{N}},\mu)$ or $(M,\mathcal{M},\mu)=(\{1,\dots,n\},2^{\{1,\dots,n\}},\mu)$ , $n\in\mathbb{N}$ , with $\mu$ being the counting measure, then we write as usual $l^{\alpha}(M)$ instead of $L^{\alpha}(M)$ and all integrals above become sums. Throughout the rest of this paper $(\Omega,\mathcal{A},P)$ denotes a probability space. Note that in this case $L^{\alpha}(\Omega)$ is the space of all random variables with finite $\alpha$ -th moment as well as $||X||_{\alpha}=\left(\mathbb{E}|X|^{\alpha}\right)^{1/\alpha}$ , if $1\leq\alpha<\infty$ and $||X||_{\alpha}=\sup_{\omega\in\Omega}X(\omega)$ if $\alpha=\infty$ , for any $X\in L^{\alpha}(\Omega)$ . For an arbitrary set $A$ we introduce furthermore the notation $\textup{card}(A)$ for its cardinality. Let $\textup{supp}f=\{x\in\mathbb{R}^{d}:f(x)\neq 0\}$ be the support set of a function $f:\mathbb{R}^{d}\to\mathbb{R}$ . Denote by $\textup{diam}(A)=\sup\{\|x-y\|_{\infty}:x,y\in A\}$ the diameter of a bounded set $A\subset\mathbb{R}^{d}$ .

2.1 ID Random Measures and Fields

Recall some definitions and give a brief overview of infinitely divisible (ID) random measures and fields.

Let $\Lambda=\{\Lambda(A);\ A\in\mathcal{E}_{0}(\mathbb{R}^{d})\}$ be an ID random measure on some probability space $(\Omega,\mathcal{A},P)$ , i.e. a random measure such that

for each sequence $(E_{m})_{m\in\mathbb{N}}$ of disjoint sets in $\mathcal{E}_{0}(\mathbb{R}^{d})$ it holds

(a)

$\Lambda(\cup_{m=1}^{\infty}E_{m})=\sum_{m=1}^{\infty}\Lambda(E_{m})$ a.s., whenever $\cup_{m=1}^{\infty}E_{m}\in\mathcal{E}_{0}(\mathbb{R}^{d})$ , 2. (b)

$(\Lambda(E_{m}))_{m\in\mathbb{N}}$ is a sequence of independent random variables. 2. 2.

the random variable $\Lambda(A)$ has an ID distribution for any choice of $A\in\mathcal{E}_{0}(\mathbb{R}^{d})$ .

Due to the infinite divisibility of the random variable $\Lambda(A)$ , its characteristic function, which will be denoted by $\varphi_{\Lambda(A)}$ , has a Lévy-Khintchin representation which will assumed to be of the form

[TABLE]

with

[TABLE]

where $a_{0}\in\mathbb{R}$ , $0\leq b_{0}<\infty$ and $v_{0}$ is a Lévy density, i.e. $\int_{\mathbb{R}}\min\{1,x^{2}\}v_{0}(x)dx<\infty$ . The triplet $(a_{0},b_{0},v_{0})$ will be referred to as Lévy characteristic of $\Lambda$ . It uniquely determines the distribution of the process $\Lambda$ . A general form for the characteristic function of any ID random measure can be found in [8, p. 456]. The particular structure of the characteristic function in (2) means that the random measure $\Lambda$ is stationary with control measure $\lambda:\mathcal{B}(\mathbb{\mathbb{R}})\rightarrow[0,\infty)$ given by

[TABLE]

Now we can define the stochastic integral w.r.t. the ID random measure $\Lambda$ .

Let $f=\sum_{j=1}^{n}x_{j}{1{\rm I}}_{A_{j}}$ be a real simple function on $\mathbb{R}^{d}$ , where $A_{j}\in\mathcal{E}_{0}(\mathbb{R}^{d})$ are pairwise disjoint. Then for every $A\in\mathcal{B}(\mathbb{R}^{d})$ we define

[TABLE] 2. 2.

A measurable function $f:(\mathbb{R}^{d},\mathcal{B}(\mathbb{R}^{d}))\rightarrow(\mathbb{R},\mathcal{B}(\mathbb{R}))$ is said to be $\Lambda$ -integrable, if there exists a sequence $(f^{(m)})_{m\in\mathbb{N}}$ of simple functions as in

such that
(a)

$f^{(m)}\rightarrow f$ , $\lambda$ -a.e. 2. (b)

for every $A\in\mathcal{B}(\mathbb{R}^{d})$ , the sequence $\left(\int_{A}f^{(m)}(x)\Lambda(dx)\right)_{m\in\mathbb{N}}$ converges in probability as $m\rightarrow\infty$ . In this case we set

[TABLE]

A useful characterization of $\Lambda$ -integrability is given in [8, Theorem 2.7]. Now let $\{f(t-\cdot);\ t\in\mathbb{R}^{d}\}$ be a family of $\Lambda$ -integrable functions induced by the Borel measurable map $f:\mathbb{R}^{d}\rightarrow\mathbb{R}$ . Then we define the ID moving average random field $X=\{X(t);\ t\in\mathbb{R}^{d}\}$ by

[TABLE]

A random field is called ID if its finite dimensional distributions are ID. The random field $X$ defined in (4) is stationary and ID and the characteristic function of $\varphi_{X(0)}$ of $X(0)$ is given by

[TABLE]

with $K$ given in (3). It is easy to see that

[TABLE]

with

[TABLE]

where $a_{1}\in\mathbb{R}$ , $b_{1}\geq 0$ , $v_{1}$ is the Lévy density of $X(0)$ , $S=\textup{supp}(f)=\{s\in\mathbb{R}^{d}:\ f(s)\neq 0\}$ denotes the support of $f$ and the function $U$ is defined via

[TABLE]

The triplet $(a_{1},b_{1},v_{1})$ is again referred to as Lévy characteristic (of $X(0)$ ) and determines the distribution of $X(0)$ uniquely. Note that due to $\Lambda$ -integrability of $f$ all integrals above are finite. This immediately implies that $f\in L^{1}(\mathbb{R}^{d})\cap L^{2}(\mathbb{R}^{d})$ .

For details on the theory of infinitely divisible measures and fields with spectral representation as well as proofs for the above stated facts we refer the interested reader to [8].

2.2 $m$ -Dependent and $\phi$ -Mixing Random Fields

A random field $X=\{X(t),\ t\in T\}$ , $T\subseteq\mathbb{R}^{d}$ defined on $(\Omega,\mathcal{A},P)$ is called $m$ -dependent if for some $m\in\mathbb{N}$ and any finite subsets $U$ and $V$ of $T$ the random vectors $(X(u))_{u\in U}$ and $(X(v))_{v\in V}$ are independent, whenever

[TABLE]

for all $u=(u_{1},\dots,u_{d})^{\top}\in U$ and $v=(v_{1},\dots,v_{d})^{\top}\in V$ . Note that a random field $X$ as in (4) is $m$ -dependent, if the support $S$ of $f$ is bounded with $m\geq\textup{diam}(S)$ .

Besides, we define the notion of $\phi$ -mixing random fields. The mixing coefficient $\phi$ is defined as follows. For any $U\subset T$ , let $\mathcal{F}_{U}=\sigma(X(t),t\in U)$ be the $\sigma$ –field generated by random variables $X(t),t\in U$ . Let furthermore $\mathcal{U}$ and $\mathcal{V}$ be two sub- $\sigma$ -fields of $\mathcal{A}$ . Define

[TABLE]

and for $k,l,r\in\mathbb{N}$

[TABLE]

where $d(\Gamma_{1},\Gamma_{2}):=\min\{||i-j||_{\infty}:i\in\Gamma_{1},j\in\Gamma_{2}\}$ for $\Gamma_{1},\Gamma_{2}\subset T$ . A random field $X=\{X(t),t\in T\}$ on $(\Omega,\mathcal{A},P)$ is called $\phi$ -mixing or uniform mixing if

[TABLE]

for any $k,l\in\mathbb{N}$ . Equation (11) is called $\phi$ -mixing condition, see e.g. [9] for more details on mixing.

2.3 Moment and Exponential Inequalities for Random Fields

In the literature, one can find many moment and exponential inequalities for sums of independent and identically distributed random variables, e.g., the classical* Rosenthal inequality* [10] or the Bernstein inequality [11].

Similar inequalities hold true for random fields. For $i\in\mathbb{Z}^{d}$ define the set $V_{i}^{1}=\{j\in\mathbb{Z}^{d}:\,j<_{\text{lex}}i\}$ , where $<_{\text{lex}}$ denotes the lexicographic order. Let $V_{i}^{k}=V_{i}^{1}\cap\{j\in\mathbb{Z}^{d}:\,||i-j||_{\infty}\geq k\}$ for $k\geq 2$ . For $f(X(t))\in L^{1}(\Omega)$ set for $k\in\mathbb{N}$

[TABLE]

Figure 1 shows the sets $V_{t}^{1}$ and $V_{t}^{k}$ for some $t=(t_{1},t_{2})\in\mathbb{Z}^{2}$ . The following two results can be found in [12, pp. 12-14].

Theorem 2.1.

Let $X=\{X(t),{t\in\mathbb{Z}^{d}}\}$ be a centered and square-integrable random field. Let $U\subset\mathbb{Z}^{d}$ be a finite subset. Then for any $p\geq 2$ it holds

[TABLE]

where $b_{t,\alpha}(X)=\|X(t)^{2}\|_{\alpha}+\sum_{k\in V_{t}^{1}}\|X(k)\operatorname{\mathbb{E}}_{\|k-t\|_{\infty}}[X(t)]\|_{\alpha}$ , for $t\in U$ and for any $\alpha\geq 1$ .

Theorem 2.2.

Let $X=\{X(t),t\in\mathbb{Z}^{d}\}$ be a field of bounded and centered random variables. Set $b=\sum_{t\in U}b_{t,\infty}(X)$ . Then for any positive and real $x$ it holds

[TABLE]

Note that Theorem 2.1 and Theorem 2.2 are extensions of Burkholder’s [13] and Azuma’s [14] inequality for martingales. The next theorem [9, p. 32] states a Rosenthal-type inequality for $\phi$ -mixing random fields.

Theorem 2.3.

Let $X=\{X(t),t\in\mathbb{Z}^{d}\}$ be a random field. For $p\geq 2$ let $c$ be the smallest even integer such that $c\geq p$ . Assume

[TABLE]

for all $u,v\in\mathbb{N}$ with $u+v\leq c$ , $u,v\geq 2$ . Let $U$ be a finite subset of $\mathbb{Z}^{d}$ . If $X(t)$ belongs to $L^{p}(\Omega)$ and is centered for all $t\in U$ , then there exists a positive constant $C$ that depends on $p$ and on the mixing coefficient $\phi_{u,v}(r)$ of $X(t)$ such that

[TABLE]

Additionally, the following result can be found in [12, p. 15].

Theorem 2.4.

Let $X=\{X(t),t\in\mathbb{Z}^{d}\}$ be a strictly stationary field of bounded and centered random variables. Take $h\geq\|X(0)\|_{\infty}$ and set

[TABLE]

For any $a_{t}\in[-1,1]$ , $t\in\mathbb{Z}^{d}$ set $A(U):=\sum_{t\in U}|a_{t}|$ for $U\subset\mathbb{Z}^{d}$ . For any positive real $x$ we have

[TABLE]

3 Inverse Problem

In this section, we give a description of the inverse problem treated in this paper.

Let $\Lambda=\{\Lambda(A),\ A\in\mathcal{E}_{0}(\mathbb{R}^{d})\}$ be a homogeneous ID random measure with Lévy characteristics $(a_{0},b_{0},v_{0})$ . Consider $f=\sum_{k=1}^{n}f_{k}{1{\rm I}}_{\Delta_{k}}$ to be a simple function, where $f_{k}\in\mathbb{R}\backslash\{0\}$ and $\Delta_{k}\in\mathcal{E}_{0}(\mathbb{R}^{d})$ pairwise disjoint, $k=1,\dots,n$ . Assume furthermore $X=\{X(t),\ t\in\mathbb{R}^{d}\}$ to be an ID moving average random field of the form

[TABLE]

where $t-A=\{t-x:\ x\in A\}\subset\mathbb{R}^{d},\ t\in\mathbb{R}^{d}$ for an arbitrary set $A$ .

The Inverse Problem.

Given $N\in\mathbb{N}$ observations $X(t_{1}),\dots,X(t_{N})$ at points $t_{1},\dots,t_{N}\in\mathbb{R}^{d}$ of the random field $X$ , estimate the Lévy triplet $(a_{0},b_{0},v_{0})$ of the ID random measure $\Lambda$ .

Formulas (6) and (7) then become

[TABLE]

with $U$ defined in (8). For known $a_{1},$ $b_{1},$ $v_{0}$ , the above equations are easily solvable w.r.t. $a_{0}$ and $b_{0}$ , thus providing an estimation approach for $a_{0}$ and $b_{0}$ . So, given $v_{1}$ , the main point is now to find a solution $v_{0}$ of the last equation. In the next section, we give some sufficient conditions under which a solution exists and is unique.

4 Existence and Uniqueness of a Solution for $v_{0}$

In the following, we assume w.l.o.g. that $\nu_{d}(\Delta_{k})=1$ for all $k=1,\dots,n$ . Typically it is common to estimate $x^{n}v_{1}(x)$ rather than $v_{1}(x)$ itself, since many of the estimators for Lévy densities are based on derivatives of the Fourier transform (in the context of Lévy processes, see e.g. [5, 6, 7]). For this purpose let $h:\mathbb{R}\rightarrow\mathbb{R}$ be a measurable function such that

[TABLE]

A sufficient condition for (16) to hold is

[TABLE]

Indeed, the Cauchy-Schwarz inequality yields

[TABLE]

Examples of functions $h$ satisfying (16)–(18) are $h(x)=1$ , $h(x)=|x|^{\beta}$ , $\beta\in(1/2,5/2)$ and $h(x)=x^{\beta}$ , $\beta=1,2$ . Consider the modified equation

[TABLE]

It is understood in $L^{2}(\mathbb{R})$ -sense, where it is assumed that $g^{(h)}_{0}=hv_{0}$ and $g^{(h)}_{1}=hv_{1}$ are both in $L^{2}(\mathbb{R})$ . Let $Q=\{1\leq k\leq n:\ f_{k}=f_{1}\}$ be the set of all indices of coefficients $f_{k}$ that coincide with $f_{1}$ . Denote by $n_{1}=\text{card}(Q)$ its cardinality. Define

[TABLE]

The following theorem states conditions, under which equation (19) has a unique solution for fixed $g^{(h)}_{1}\in L^{2}(\mathbb{R})$ .

Theorem 4.1.

Let a function $h:\mathbb{R}\rightarrow\mathbb{R}$ be given as above. Then equation (19) has a unique solution $g^{(h)}_{0}\in L^{2}(\mathbb{R})$ for any $g^{(h)}_{1}\in L^{2}(\mathbb{R})$ if

[TABLE]

The solution is given by the formula

[TABLE]

Proof.

Let $g^{(h)}_{1}\in L^{2}(\mathbb{R})$ . Define the operator $\varphi_{g^{(h)}_{1}}:L^{2}(\mathbb{R})\rightarrow L^{2}(\mathbb{R})$ by

[TABLE]

Then formula (19) yields a fixed point of $\varphi_{g^{(h)}_{1}}$ , i.e., is a solution of equation

[TABLE]

It is straight forward to see that for any functions $u_{1},u_{2}\in L^{2}(\mathbb{R})$ it holds

[TABLE]

i.e. $\varphi_{g^{(h)}_{1}}$ is a contraction. By Banach fixed-point theorem there exists a unique solution $g^{(h)}_{0}\in L^{2}(\mathbb{R})$ to the equation (23) which shows the first part of the theorem. Relation (22) can easily be obtained by iterating equation (23) w.r.t. $g^{(h)}_{0}$ . ∎

Remark 4.2.

Note that the choice of $f_{1}$ in this setting is arbitrary. The statement of Theorem 4.1 does not depend on a certain order of the coefficients $f_{1},\dots,f_{n}$ . In particular, this means that $f_{1}$ in the definitions of $Q$ and $n_{1}$ can be replaced by any other coefficient $f_{j_{0}}$ , $j_{0}\in\{2,\dots,n\}$ . Consequently, substituting $f_{1}$ by $f_{j_{0}}$ in Theorem 4.1 leads to the same solution $g_{0}^{(h)}$ . Indeed, let $f_{j}$ , $j\neq 1$ be any other coefficient that fulfills the conditions of Theorem 4.1, and let $\bar{g}_{0}^{(h)}$ be the corresponding solution of (19). Then

[TABLE]

Due to Theorem 4.1, this equation has a unique solution. Since [math] is a solution it thus follows that $g_{0}^{(h)}-\bar{g}_{0}^{(h)}=0$ (in $L^{2}(\mathbb{R})$ -sense).

Remark 4.3.

Theorem 4.1 gives sufficient conditions for the existence and uniqueness of a solution (22) of equation (19). If condition (21) fails to hold, no solution as well as infinitely many solutions of (19) are possible. One can easily construct corresponding examples illustrating that. Consider e.g. $n=2$ , $f_{1}=1$ and $f_{2}=-1$ . Now choose $h$ to be any odd function satisfying (16)-(18). Clearly condition (21) is not fulfilled. Then (19) becomes

[TABLE]

Let $g_{1}^{(h)}\in L^{2}(\mathbb{R})$ be any even function, $g^{(h)}_{1}\neq 0$ a.e. Then (24) has no solution since its right–hand side is odd. 2. 2.

If, on the other hand, $g_{1}^{(h)}(x)=0$ a.e. then any even $L^{2}$ -function $g_{0}^{(h)}$ is a solution of (24).

Note that condition (17) ensures that $h(\cdot)g_{1}^{(h)}(f_{1}\cdot)/h(f_{1}\cdot)\in L^{2}(\mathbb{R})$ for any $g_{1}^{(h)}\in L^{2}(\mathbb{R})$ . This condition is necessary. Consider e.g. $g_{1}^{(h)}(x)=e^{-|x|/2}$ , $h(x)=e^{|x|}$ , $x\in\mathbb{R}$ , as well as $f_{1}=f_{2}=f_{3}=1/4$ , $f_{4}=1/16$ . Then, except for (17), all conditions of Theorem 4.1 are fulfilled, but $h(\cdot)g_{1}^{(h)}(f_{1}\cdot)/h(f_{1}\cdot)\not\in L^{2}(\mathbb{R})$ in this case. Thus (22) cannot be an $L^{2}$ -solution.

Remark 4.4.

Condition (21) is not necessary for the existence and uniqueness of a solution of equation (19). As a counterexample, consider $n=3$ , $f_{1}=e^{\alpha}$ , $f_{2}=e^{2\alpha}$ , $f_{3}=e^{3\alpha}$ , and $h(x)=x$ . If

[TABLE]

then none of the coefficients fulfills (21). In our paper [4] we prove necessary and sufficient conditions for existence and uniqueness of a solution of integral equation (7). It can be shown that $f=\sum_{k=1}^{3}e^{k\alpha}{1{\rm I}}_{\Delta_{k}}$ satisfies those conditions and hence there is a unique solution of (19) for any $g_{1}^{(h)}\in L^{2}(\mathbb{R})$ .

Condition (21) means that one of the coefficients (here $f_{1}$ ) dominates all others either in its magnitude $|f_{1}|$ or in its frequency $n_{1}$ . To illustrate this, consider any power function $h(x)=|x|^{\beta}$ with $\beta\in(1/2,5/2)$ and $|x|^{\beta}v_{1}(x)\in L^{2}(\mathbb{R})$ . Then $s_{k}=(|f_{k}|/|f_{1}|)^{\beta}$ , $k=1,\dots,n$ and the equation is solvable w.r.t. $|x|^{\beta}v_{0}(x)$ if

[TABLE]

In particular, if $n_{1}=1$ this means that $|f_{1}|>\max\{|f_{2}|,\dots,|f_{n}|\}$ . If $h$ is strictly positive and super-homogeneous of degree $\alpha$ , i.e.

[TABLE]

for all $c\geq 0$ and some $\alpha>0$ , then condition (17) is fulfilled if all the coefficients $f_{k}$ have the same sign. Then (21) holds if

[TABLE]

5 Estimation of $g^{(h)}_{1}$ for Pure Jump ID Random Fields

Modern statistical literature contains quite a number of methods to estimate the Lévy density $v_{1}$ of $X(0)$ if $d=1$ , i.e., $X$ is a Lévy process, see [15, 7, 5, 16, 6, 17, 18], [19] and references therein. They range from moment fitting and maximum likelihood ratio to inverse Fourier methods based on the empirical characterstic function of $X(0)$ . For simplicity, one often assumes that the drift and the Gaussian part of $X(0)$ vanish, thus letting $X$ be a pure jump Lévy process.

In the recent preprint [19], the problem of estimation of the Lévy measure of $X(0)$ was solved for compound Poisson Lévy processes $X$ using variational analysis on the cone of measures and the steepest descent method of minimizing of a certain risk functional implemented for the discrete (atomic) measures. The resulting estimate of $v_{1}$ can be obtained out of these measures by smoothing.

For all our estimation approaches in the next section, either estimators for $g^{(h)}_{1}$ or at least for its Fourier transform $\mathcal{F}[g^{(h)}_{1}]$ are required to proceed with the estimation of $v_{0}$ . Therefore we adopted an estimation procedure from [16, 5] for pure jump Lévy processes to estimate $v_{1}$ . The main difference to Lévy processes is in our case the assumption of independent increments which obviously is not given for random fields in arbitrary dimension $d$ . Nevertheless, assuming $X$ to be $m$ -dependent or $\phi$ -mixing allows us to use the same ideas for the estimation of $g^{(h)}_{1}$ .

Consider a stationary random field $X$ as in (14) with characteristic function $\varphi_{X(0)}(u)$ given by

[TABLE]

Note that its logarithm coincides with formula (5) by taking $a_{1}=\int_{-1}^{1}xv_{1}(x)dx$ and $b_{1}=0$ . Under the additional assumption $\int_{\mathbb{R}}|x|v_{1}(x)dx<\infty$ it holds

[TABLE]

that is equivalent to

[TABLE]

where $g_{1}(x):=g^{(h)}_{1}(x)=xv_{1}(x)$ (taking $h(x)=x$ ) and $\mathcal{F}[g_{1}]$ denotes the Fourier transform of $g_{1}$ . Now let $X$ be discretely observed on a regular grid $\Delta\mathbb{Z}^{d}$ with mesh size $\Delta>0$ , i.e. we consider the random field $Y=\{Y_{j};\,j\in\mathbb{Z}^{d}\},$ where

[TABLE]

For a finite nonempty set $W\subset\mathbb{Z}^{d}$ with cardinality $N=|W|$ let $(Y_{j})_{j\in W}$ be a sample from $Y$ . By taking the empirical counterparts

[TABLE]

of $\psi(u)$ and $\theta(u):=\psi^{\prime}(u)$ on the right–hand side of (25) an estimator for $\mathcal{F}[g_{1}]$ can be defined as

[TABLE]

where

[TABLE]

The indicator function on the right hand side of (29) ensures the stability of the estimator for small values of $|\hat{\psi}(u)|$ . Based on this idea Comte and Genon-Catalot [16] provided the estimator

[TABLE]

for $g_{1}$ . We make the following assumptions: for a $k\in\mathbb{N}$

(H1)

$g_{1}\in L^{1}(\mathbb{R})\cap L^{2}(\mathbb{R})$

${\rm\bf(H2)}_{k}$

$\int_{\mathbb{R}}|x|^{k-1}|g_{1}(x)|dx<\infty$

(H3)

$\exists\ c_{\psi},\ C_{\psi}>0$ and $\beta\geq 0$ such that for all $x\in\mathbb{R}$

[TABLE]

(H4)

$g_{1}\in H^{\beta}(\mathbb{R})$ where $\beta>0$ is as in (H3).

Assumptions (H1)– ${\rm\bf(H2)}_{k}$ are moment conditions for $X(0)$ . Assumptions (H3)– (H4) are used to compute $L^{2}$ –error bounds and rates of convergence of Lévy density estimates, cf. [5]. For the random field $Y$ we define

[TABLE]

where $u\in\mathbb{R}$ , $t\in\mathbb{Z}^{d}$ . Under condition ${\rm\bf(H2)}_{2}$ , it holds $\operatorname{\mathbb{E}}X^{2}(0)<\infty$ and hence $\operatorname{\mathbb{E}}\big{(}\xi_{t}^{(i)}(u)\big{)}^{2}<\infty$ , $\operatorname{\mathbb{E}}\big{(}\tilde{\xi}_{t}^{(i)}(u)\big{)}^{2}<\infty$ for $i=1,2$ , $t\in\mathbb{Z}^{d}$ and $u\in\mathbb{R}$ . Introduce the notation $\lVert\xi\rVert_{\cdot}=\left(\operatorname{\mathbb{E}}\lVert\xi\rVert_{2}^{2}\right)^{1/2}$ for any random function $\xi:\Omega\times\mathbb{R}\to\mathbb{C}$ s.t. $\xi\in L^{2}(\Omega\times\mathbb{R})$ .

The following $L^{2}$ -error bounds for $\hat{g}_{1,l}$ will be proven in Appendix.

Theorem 5.1.

Assume that (H1), ${\rm\bf(H2)}_{4}$ hold and that we observe the strictly stationary random field $Y=\{Y_{t},\ t\in\mathbb{Z}^{d}\}$ . Further assume that either

(i)

the field $Y$ is $m$ -dependent or

(ii)

the random field $Y$ is $\phi$ -mixing such that equations (12)–(13) hold.

Then for all $l\in\mathbb{N}$

[TABLE]

where $K>0$ is a constant, $g_{1,l}$ is given by $g_{1,l}(x)=\frac{1}{2\pi}\int_{-\pi l}^{\pi l}e^{-iux}\frac{\theta(u)}{\psi(u)}du$ for $x\in\mathbb{R}$ , and $N\in\mathbb{N}$ is the sample size.

Notice that random fields (14) are $m$ –dependent with $m=\textup{diam}(\textup{supp}f)$ since a simple function $f$ has a compact support. Introduce the notation $L:=\|g_{1}\|_{H^{\beta}}^{2}$ .

The following corollary is an immediate consequence of Theorem 5.1.

Corollary 5.2.

If additionally (H3) and (H4) hold then the bound in Theorem 5.1 can be improved to

[TABLE]

where $\tilde{K}>0$ is constant.

Corollary 5.3.

Under the assumptions of Corollary 5.2 it holds

[TABLE]

where $\bar{K}=2\pi Kc_{\psi}\left(\sqrt{\operatorname{\mathbb{E}}|Y_{0}|^{4}}+\|g_{1}\|_{1}^{2}\right)$ .

The upper bound (31) allows to choose the cut–off parameter $l>0$ optimally by minimizing the right–hand side expression in (31) numerically. Choosing $N,l\to+\infty$ such that $l^{1+2\beta}/N\to 0$ yields the $L^{2}$ –consistency of the estimate $\hat{g}_{1,l}$ .

6 Estimation of the Lévy Density $v_{0}$

In the following Section three different estimation approaches will be discussed. The plug-in and the Fourier method are both based on formula (22), whereas the third one, which uses orthonormal bases (OnB’s) in $L^{2}(\mathbb{R})$ , is totally different from them. For this reason, the problem will be reformulated in terms of $L^{2}$ –OnB’s there. Nevertheless it turns out that the sufficient conditions for the existence of a solution do not change essentially.

6.1 Plug-In Estimator

Let $\hat{g}_{1}^{(h)}$ be an estimator for $g_{1}^{(h)}=h\cdot v_{1}$ . We now consider a simple plug-in estimator $\hat{g}_{0}^{(h)}$ of $g_{0}^{(h)}=h\cdot g_{0}$ defined by

[TABLE]

where $N\in\mathbb{N}$ denotes the sample size and $n_{N}$ is a certain cut-off parameter depending on $N$ . The following theorem gives a bound for the mean square error $||g_{0}^{(h)}-\hat{g}_{0}^{(h)}||_{\cdot}$ .

Theorem 6.1.

Consider $g_{0}^{(h)}\in L^{2}(\mathbb{R})$ and let $\hat{g}_{1}^{(h)}\in L^{2}(\mathbb{R})$ be an estimator of $g_{1}^{(h)}$ . Let furthermore the conditions of Theorem 4.1 be fulfilled. Then with the notation given there it holds

[TABLE]

In particular, if $\hat{g}_{1}^{(h)}$ is an $L^{2}$ -consistent estimator for $g_{1}^{(h)}$ (i.e., $\|g_{1}^{(h)}-\hat{g}_{1}^{(h)}\|_{\cdot}\to 0$ as $N,n_{N}\to\infty$ ) then $\hat{g}_{0}^{(h)}$ is as well an $L^{2}$ -consistent estimator for $g_{0}^{(h)}$ .

Proof.

First of all, we observe that for each $k\in\mathbb{N}$ and $f_{i_{1}},\dots,f_{i_{k}}\neq f_{1}$ it holds

[TABLE]

By relation (19) and condition (17), $g_{1}^{(h)}\in L^{2}(\mathbb{R})$ as well, cf. Lemma 6.2. Using formula (22) it follows by triangle inequality and a simple integral substitution that

[TABLE]

Since $\frac{1}{n_{1}}\sum_{k:f_{k}\neq f_{1}}\left(\frac{|f_{1}|}{|f_{k}|}\right)^{1/2}s_{k}<1$ the consistency result follows immediately from this approximation. ∎

Lemma 6.2.

Let $g_{0}^{(h)}\in L^{p}(\mathbb{R})$ , $p\geq 1$ , and condition (17) hold. Then $g_{1}^{(h)}\in L^{p}(\mathbb{R})$ .

Proof.

Using relation (19), condition (17) and triangle inequality, we get

[TABLE]

∎

Using the estimator $\hat{g}^{(h)}_{0}$ in practice reveals that

the choice $n_{N}=1,2,3$ suffcies completely to get good results due to fast convergence of the geometric series in (33), 2. 2.

$\hat{g}^{(h)}_{0}$ oscillates much in a neighborhood of the origin.

Hence, one has to regularize it applying a usual smoothing procedure. Convolve $\hat{g}^{(h)}_{0}$ with a smoothing kernel $K_{b}:\mathbb{R}\to\mathbb{R}_{+}$ which depends on its bandwidth $b>0$ and satisfies the following assumptions:

(K1)

$K_{b}\in L^{1}(\mathbb{R})\cap L^{2}(\mathbb{R})$ , $\int_{\mathbb{R}}K_{b}(x)\,dx=1$ for all $b>0$

(K2)

$\sup_{x}|\mathcal{F}[K_{b}](x)|\leq C_{K}$ where $C_{K}\in(0,+\infty)$ is a constant independent of $b>0$

(K3)

$|1-\mathcal{F}[K_{b}](x)|\leq c_{1}\min\{1,b|x|\}$ for all $b>0$ , $x\in\mathbb{R}$ where $c_{1}>0$ is a constant.

For the resulting estimator

[TABLE]

we give an upper bound of its mean square error and prove its consistency as $N,n_{N}\to\infty$ and $b\to+0$ .

Theorem 6.3.

Let $g^{(h)}_{0}\in L^{1}(\mathbb{R})\cap H^{\delta}(\mathbb{R})$ for some $\delta>1/2$ , and let $\hat{g}^{(h)}_{1}\in L^{1}(\mathbb{R})\cap L^{2}(\mathbb{R})$ be an estimator of ${g}^{(h)}_{1}$ . For a kernel $K_{b}$ satisfying assumptions * (K1) –(K3), $b\in(0,1)$ it holds*

[TABLE]

where

[TABLE]

Proof.

By triangle inequality, Plancherel identity and convolution property of $\mathcal{F}$ we have

[TABLE]

since $\hat{g}^{(h)}_{0}\in L^{1}(\mathbb{R})\cap L^{2}(\mathbb{R})$ by relation (32). By assumption (K3) and Cauchy-Schwartz inequality, we have

[TABLE]

The rest of the proof follows by observing that for $b\in(0,1)$

[TABLE]

as $h\rightarrow 0$ . ∎

There are many examples of kernels satisfying assumptions (K1)–(K3), e.g., the Gaussian kernel $K_{b}(x)=\frac{1}{\sqrt{2\pi}b}e^{-x^{2}/(2b^{2})}$ . Since $\mathcal{F}[K_{b}](x)=e^{-b^{2}x^{2}/2}$ , (K1)–(K2) are trivial. Condition (K3) holds from the inequality

[TABLE]

Another class of examples is provided by $K_{b}(x)=K(x/b)/b$ , $x\in\mathbb{R}$ , where $K\in L^{1}(\mathbb{R})\cap L^{2}(\mathbb{R})$ is a nonnegative function such that $\int_{\mathbb{R}}K(x)\,dx=1$ , $\textup{supp}(\mathcal{F}[K])\subseteq[-1,1]$ and $\mathcal{F}[K]$ is a Lipschitz continuous function. While (K1)–(K2) trivially hold in this case, (K3) can be seen from the following lemma.

Lemma 6.4.

Let $K:\mathbb{R}\rightarrow\mathbb{R}_{+}$ be as above. Then

[TABLE]

where $c_{1}=\max\{1,L_{K}\}$ with $L_{K}>0$ being the Lipschitz constant of $K$ .

Proof.

Because of $\textup{supp}(\mathcal{F}[K])\subseteq[-1,1]$ and the Lipschitz continuity of $\mathcal{F}[K]$ it follows

[TABLE]

Thus $|1-\mathcal{F}[K_{b}](x)|\leq c_{1}\min\{1,b|x|\}$ . ∎

Corollary 6.5.

Choose $\hat{g}^{(h)}_{1}=\hat{g}_{1,l}$ , $h(x)=x$ as in Section 5. Under the assumptions of Theorem 4.1, Corollary 5.2 and Theorem 6.3 the estimator $\tilde{g}_{0}^{(h)}$ is $L^{2}$ -consistent for $g_{0}$ as $N,n_{N}\to\infty$ and $b\to+0$ .

Proof.

Applying Theorem 6.1 and Corollary 5.3 yields $\|g_{0}-\hat{g}^{(h)}_{0}||_{\cdot}\to 0$ as $N,l\to\infty$ for any sequence $n_{N}\to\infty$ . Relation $a_{\delta}(b)\to 0$ as $b\to+0$ finishes the proof. ∎

Remark 6.6.

The choice of bandwidth $b>0$ in (34) can be made by solving the following minimization problem numerically:

[TABLE]

which means that we are seeking for a sufficiently smooth estimate $\tilde{g}^{(h)}_{0}$ . Assuming that $K_{b}$ is a $C^{1}$ –smooth function of parameter $b>0$ and that the differentiation with respect to $b$ and the integral can be interchanged we get by Plancherel identity and convolution property of $\mathcal{F}$ that

[TABLE]

For easy particular functions $K_{b}$ the Fourier transform of $\frac{\partial K_{b}}{\partial b}$ can be usually calculated explicitly. In contrast, $\mathcal{F}\left[\hat{g}^{(h)}_{0}\right]$ has to be estimated from the data, compare Section 6.2 for $h(x)=x$ . There, we use the estimate $\widehat{\mathcal{F}[g_{0}]}$ to assess $\mathcal{F}\left[\hat{g}_{0}\right]$ .

6.2 Fourier Approach

A common strategy in the estimation of $g_{1}^{(h)}$ (e.g. in the case of Lévy processes) is first to estimate its Fourier transform $\mathcal{F}[g_{1}^{(h)}]$ and then to invert it. This causes an error in the estimation of the Fourier transform and additionally in the inversion procedure. Using plug-in estimators of Section 6.1, this may increase the estimation error for $g_{0}^{(h)}$ . For this reason, here we estimate $\mathcal{F}[g_{1}^{(h)}]$ directly to recover $g_{0}^{(h)}$ .

From now on, set $h(x)=x^{\beta}$ for some $\beta\in\mathbb{N}$ . In other words, equation (22) is of the form

[TABLE]

where $g_{0}(x)=x^{\beta}v_{0}(x)$ and $g_{1}(x)=x^{\beta}v_{1}(x)$ . Suppose that $g_{0}\in L^{1}(\mathbb{R})$ and the conditions of Theorem 4.1 are fulfilled. Then $g_{1}\in L^{1}(\mathbb{R})$ as well by Lemma 6.2.

The following construction of $\hat{g}_{0,l}(t)$ and $\hat{g}_{1,l}(t)$ is motivated by estimation approaches for the characteristic triplet of Lévy processes (see e.g. [7]). Taking Fourier transforms on both sides of (37) yields

[TABLE]

for $t\in\mathbb{R}$ . Let $\widehat{\mathcal{F}[g_{1}]}$ be any estimator for the Fourier transform of $g_{1}$ . Then we define the estimator $\widehat{\mathcal{F}[g_{0}]}$ for $\mathcal{F}[g_{0}]$ via

[TABLE]

$t\in\mathbb{R}$ . If $\widehat{\mathcal{F}[g_{1}]}$ is locally square integrable, an estimator $\hat{g}_{0,l}$ of $g_{0}$ is constructed for some $l>0$ as

[TABLE]

The last expression can be rewritten as

[TABLE]

with $\hat{g}_{1,l}(t)=\frac{1}{2\pi}\int_{-\pi l}^{\pi l}e^{-itu}\widehat{\mathcal{F}[g_{1}]}(u)du$ being an estimator of $g_{1}$ .

Remark 6.7.

The estimator (28) from Section 5 is locally square integrable. In this case an appropriate choice for the parameter $l>0$ can be achieved e.g. by minimizing the right-hand side of (31) for any fixed sample size $N$ (see also the discussion following Corollary 5.3).

Similar as in Theorem 6.1 one can obtain an upper bound for the $L^{2}$ -error. With the notation $g_{1,l}(t)=\frac{1}{2\pi}\int_{-\pi l}^{\pi l}e^{-itu}\mathcal{F}[g_{1}](u)du$ we get

[TABLE]

where $s_{k}=\left(|f_{k}|/|f_{1}|\right)^{\beta}$ , $k=2,\ldots,n$ . Assume

[TABLE]

Choose the estimator $\hat{g}_{1,l}$ of $g_{1}$ in an $L^{2}$ –consistent way. Then, as $N,l,n_{N}\to\infty$ in an appropriate manner, the above upper bound (39) tends to zero, and $\hat{g}_{0,l}$ is $L^{2}$ –consistent for $g_{0}$ . For instance, one can choose $\hat{g}_{1,l}$ from Section 5, which is $L^{2}$ –consistent under assumptions of Corollary 5.2.

Assume, in addition to (40), that $|f_{1}|>\max_{k:f_{k}\neq f_{1}}|f_{k}|.$ By (31), the upper bound of $\|g_{1}-\hat{g}_{1,l}\|_{\cdot}$ is monotonously non–decreasing in $l$ . Since

[TABLE]

we get by (39) and (31) that

[TABLE]

as $N,n_{N},l\to\infty$ such that $\frac{l^{2\beta+1}}{N}\to 0$ .

6.3 Orthonormal Basis Approach

Since the series representation (22) is sensitive to noise and bad estimates for $v_{1}$ , the aim is to obtain an estimation approach that uses (local) orthonormal bases (e.g., Haar wavelets) of $L^{2}$ . Moreover, from the numerical point of view it is much more convenient to find a solution only on a finite interval. For this reason, the problem of Section 4 should be reformulated for functions on $L^{2}(\mathbb{R})$ with support contained in a finite interval. For $0<A<\infty$ , consider

[TABLE]

to be the closed linear subspace of $L^{2}(\mathbb{R})$ equipped with the usual scalar product on $L^{2}(\mathbb{R})$ . Find a function $g_{0}^{(h)}\in U_{A}$ that fulfills equation (19) for fixed $g_{1}^{(h)}$ . Because of the scalings on the right hand side of this equation, we have to extend the assumptions on $g_{1}^{(h)}$ and the coefficients $|f_{j}|$ a bit. Let $|f_{1}|\geq\max_{k:f_{k}\neq f_{1}}|f_{k}|$ be the largest coefficient and define $M=\min\{1,|f_{1}|\}$ . Then for $g_{1}^{(h)}\in U_{AM}$ it follows that $g_{1}^{(h)}(f_{1}\cdot)\in U_{A}$ . Since $|f_{1}|$ is the largest coefficient, it holds moreover that $g_{0}^{(h)}(f_{1}/f_{j}x)=0$ , for all $|x|>A$ , i.e. $g_{0}^{(h)}(f_{1}/f_{j}\cdot)\in U_{A}$ for all $j=1,\dots,n$ . For this reason the restriciton $\varphi_{g_{1}^{(h)}}|_{U_{A}}$ of the function $\varphi_{g_{1}^{(h)}}$ from the proof of Theorem 4.1 is a map on $U_{A}$ . Then one can show the following theorem with the same arguments applied to $\varphi_{g_{1}^{(h)}}|_{U_{A}}$ .

Theorem 6.8.

Let $h:\mathbb{R}\rightarrow\mathbb{R}$ , $s_{k}$ be as in Theorem 4.1, and let $g_{1}^{(h)}\in U_{AM}$ with $M$ defined as before. Assume furthermore that $|f_{1}|\geq\max_{k:f_{k}\neq f_{1}}|f_{k}|$ and relation (21) holds. Then there exists a unique function $g_{0}^{(h)}\in U_{A}$ such that

[TABLE]

a.e. on $[-AM,AM]$ . The solution $g_{0}^{(h)}$ can be expressed as in (22).

Note that the solution $g_{0}^{(h)}$ fulfills the equation $\varphi_{g_{1}^{(h)}}|_{U_{A}}(g_{0}^{(h)})=g_{0}^{(h)}$ a.e. on the whole interval $[-A,A]$ , whereas (41) holds only on $[-AM,AM]$ , which is merely the same if $M=1$ , i.e. in the case $|f_{1}|\geq 1$ . Notice that $g_{1}^{(h)}\in{U_{AM}}$ means that the random field $X$ has a compound Poisson marginal distribution if $h(x)\equiv 1$ .

The last theorem stated the existence of a solution $g_{0}^{(h)}$ of the fixpoint equation $\varphi_{g_{1}^{(h)}}|_{U_{A}}(g_{0}^{(h)})=g_{0}^{(h)}$ or equivalently for

[TABLE]

Now let $(\psi_{n})_{n\in\mathbb{N}}$ be an orthonormal basis (OnB) of $U_{A}$ . Since $g_{0}^{(h)}=\sum_{j=1}^{\infty}\left\langle g_{0}^{(h)},\psi_{j}\right\rangle\psi_{j}$ it holds

[TABLE]

Note that because of $|f_{1}|\geq\max_{k:f_{k}\neq f_{1}}|f_{k}|$ the function $\psi_{j}(f_{1}/f_{k}\cdot)$ is in $U_{A}$ for all $k\in\mathbb{N}$ . Set

[TABLE]

Then we can conclude that there exists a solution $g_{0}^{(h)}\in U_{A}$ of (42) if and only if the function $\bar{g}_{1}$ admits a representation $\bar{g}_{1}=\sum_{j=1}^{\infty}x_{j}\eta_{j}$ with some $l^{2}$ -sequence $(x_{j})_{j\in\mathbb{N}}$ . In this case, a solution $g_{0}^{(h)}$ is given by $\sum_{j=1}^{\infty}x_{j}\psi_{j}$ . It is unique if and only if the scalar sequence $(x_{j})_{j\in\mathbb{N}}$ is unique. In other words, the problem is characterized by the operator $T:l^{2}\rightarrow U_{A}$ ,

[TABLE]

If $T$ is surjective there exists a solution. If it is bijective the solution is unique. It is clear now that under the conditions of Theorem 6.8 the operator $T$ is a bijection. Nevertheless, let us reformulate this theorem in terms of the OnB $(\psi_{l})_{l\in\mathbb{N}}$ and give another proof for it.

Theorem 6.9.

Let $(\psi_{l})_{l\in\mathbb{N}}$ be an OnB of $U_{A}$ , and let the conditions of Theorem 6.8 be fulfilled. Then there exists a unique sequence $x\in l^{2}$ such that the operator $T$ is one–to–one.

Proof.

We would like to show that the system $(\eta_{j})_{j\in\mathbb{N}}$ is a basis for $U_{A}$ . First we show, by contradiction, that

[TABLE]

Therefore assume that $V\subset U_{A}$ . Since $V$ is a closed subspace of $U_{A}$ it follows by Riesz lemma (see e.g. [20]) that for any $0<\delta<1$ there exists a function $g_{\delta}\in U_{A}$ with $\|g_{\delta}\|_{2}=1$ such that $\|g_{\delta}-v\|_{2}\geq 1-\delta$ , for all $v\in V$ . Now choose $\delta:=\left(1-e(f,h)\right)/2.$ Then we can write $g_{\delta}=\sum_{k=1}^{\infty}\left\langle g_{\delta},\psi_{k}\right\rangle\psi_{k}$ . Define the sequence $x=(x_{k})_{k\in\mathbb{N}}\in l^{2}$ via $x_{k}=|f_{1}|\left\langle g_{\delta},\psi_{k}\right\rangle/n_{1}$ , $k\in\mathbb{N}$ . Since $\|g_{\delta}\|_{2}=1$ it follows $\|x\|_{2}=|f_{1}|/n_{1}$ . Clearly, we have $\sum_{k=1}^{\infty}x_{k}\eta_{k}\in V$ . By triangle inequality, a substitution in the integral and the definition of $s_{k}$ it can be observed that

[TABLE]

which is a contradiction to the fact that $1-\delta>e(f,h)$ , i.e. $V=U_{A}$ .

In the second step of the proof, we use [21, Theorem 3.1.4] to show that $(\eta_{j})_{j\in\mathbb{N}}$ is a basis for $U_{A}$ . Therefore we have to verify the assumptions there. First of all, we observe that $\eta_{l}$ are non-zero functions, since

[TABLE]

where the latter is strictly positive, i.e. $(\eta_{j})_{j\in\mathbb{N}}$ is a sequence of non-zero functions in the Hilbert space $U_{A}$ . Now let $(c_{j})_{j\in\mathbb{N}}$ be an arbitrary real valued sequence and $m,l\in\mathbb{N}$ with $m\leq l$ . Show that there exists a constant $K$ such that $\|\sum_{j=1}^{m}c_{j}\eta_{j}\|_{2}\leq K\|\sum_{j=1}^{l}c_{j}\eta_{j}\|_{2}$ . If $c_{1}=c_{2}=\dots=c_{l}=0$ then this relation is obviously true for any choice of $K$ . Otherwise,

[TABLE]

Thus, we have

[TABLE]

This means $(\eta_{j})_{j\in\mathbb{N}}$ is a basis for $U_{A}$ , i.e. for any function $f\in U_{A}$ there is a unique scalar sequence $(c_{j}(f))_{j\in\mathbb{N}}$ with $f=\sum_{j=1}^{\infty}c_{j}(f)\eta_{j}$ . Since

[TABLE]

the sequence $(c_{j}(f))_{j\in\mathbb{N}}$ is furthermore an element of $l^{2}$ . Choosing

[TABLE]

completes the proof. ∎

Note that the proof of the last theorem shows that the system $(\eta_{j})_{j\in\mathbb{N}}$ is a basis for the $L^{2}$ -subspace $U_{A}$ . Therefore we can orthonormalize it by Gram–Schmidt method to an OnB $(e_{j})_{j\in\mathbb{N}}$ of $U_{A}$ given by $e_{1}=\eta_{1}/||\eta_{1}||_{2}$ and succesively

[TABLE]

Now let $\hat{\bar{g}}_{1}$ be any estimator for $\bar{g}_{1}\in U_{A}$ and let $P_{m}$ be the orthogonal projection of $U_{A}$ onto the $m$ -dimensional subspace $V_{m}=\text{span}\{\eta_{1},\dots,\eta_{m}\}=\text{span}\{e_{1},\dots,e_{m}\}$ which is given by $P_{m}f=\sum_{j=1}^{m}\left\langle f,e_{j}\right\rangle e_{j}$ . Define the sequence $(\hat{y}_{j})_{j\in\mathbb{N}}$ by

[TABLE]

Then the orthogonal projection of $\hat{\bar{g}}_{1}$ onto $V_{m}$ is

[TABLE]

Now, an estimator $\hat{g}_{0,m}^{(h)}$ for $g_{0}^{(h)}$ will be constructed as follows:

1.)

Let $(\hat{x}_{1,m},\dots,\hat{x}_{m,m})$ be the unique solution to

[TABLE]

Set

[TABLE] 2. 2.)

Then we define

[TABLE]

Equation (44) comes from the fact that for any $f\in V_{m}$ , $\sum_{i=1}^{m}\lambda_{i}\eta_{i}=\sum_{i=1}^{m}\left\langle f,e_{i}\right\rangle e_{i}$ if and only if $\left\langle f,e_{i}\right\rangle=\sum_{j=1}^{m}\lambda_{j}\left\langle\eta_{j},e_{i}\right\rangle$ . Note that $\left\langle e_{i},\eta_{j}\right\rangle=0$ whenever $i>j$ since $\eta_{j}$ is a linear combination of $e_{1},\dots,e_{j}$ . In particular, formula (44) stays true if $j>m$ . Due to that, the system of linear equations there becomes diagonal and can easily be solved by backward substitution.

Theorem 6.10.

Let $\bar{g}_{1}\in U_{A}$ and $\hat{\bar{g}}_{1}\in U_{A}$ be an estimator of $\bar{g}_{1}$ . Let furthermore $\hat{\bar{g}}_{1,m}:=P_{m}\hat{\bar{g}}_{1}$ be the orthogonal projection of $\hat{\bar{g}}_{1}$ onto $V_{m}$ . Then under the conditions of Theorem 6.9 it holds for $\hat{g}_{0,m}^{(h)}$ as in (45) that

[TABLE]

where $x_{j}=\left\langle{g}_{0}^{(h)},\psi_{j}\right\rangle$ , $j\in\mathbb{N}$ .

Proof.

First of all, it holds

[TABLE]

and therefore

[TABLE]

with $(y_{j})_{j\in\mathbb{N}}$ defined by $y_{j}=\left\langle\bar{g}_{1},e_{j}\right\rangle=\sum_{i=1}^{\infty}x_{i}\left\langle\eta_{i},e_{j}\right\rangle$ , $j\in\mathbb{N}$ , compare (43). Then

[TABLE]

By (6.3) together with the triangle inequality we get

[TABLE]

Taking into account that $\left\|\sum\limits_{i=m+1}^{\infty}x_{i}\eta_{i}\right\|_{2}\geq\frac{n_{1}}{|f_{1}|}\big{(}1-e(f,h)\big{)}\left\|\sum\limits_{j=m+1}^{\infty}x_{j}\psi_{j}\right\|_{2}$ the statement of the theorem follows by (48). ∎

Remark 6.11.

The term $\left\|\sum\limits_{i=m+1}^{\infty}x_{i}\eta_{i}\right\|_{2}$ in (46) is the approximation error of $\bar{g}_{1}=\sum_{i=1}^{\infty}x_{i}\eta_{i}$ by the first $m$ summands of its series. As $m\to\infty$ , the upper bound (46) tends to $\frac{|f_{1}|}{n_{1}\big{(}1-e(f,h)\big{)}}\|\bar{g}_{1}-\hat{\bar{g}}_{1}\|_{\cdot}$ . In order to estimate $\bar{g}_{1}$ , the method of Section 5 can be used if the random field $X$ satisfies the assumptions given there. In this case, Corollaries 5.2 and 5.3 yield an upper bound for $\|\bar{g}_{1}-\hat{\bar{g}}_{1}\|_{\cdot}$ leading to $L^{2}$ –consistent estimates of $g_{0}^{(h)}$ .

Since the estimator in (45) is strongly oscillating, a smoothed version $\tilde{g}_{0,m}^{(h)}=\hat{g}_{0,m}^{(h)}\ast K_{b}$ of $\hat{g}_{0,m}^{(h)}$ is considered here, where $K_{b}$ is a smoothing kernel with properties (K1)-(K3) from Section 6.1. It is clear that $g_{0}^{(h)}\in L^{1}(\mathbb{R})$ , $\hat{g}_{0,m}^{(h)}\in L^{1}(\mathbb{R})\cap L^{2}(\mathbb{R})$ , because both are in $U_{A}$ by assumption. If additionally $g_{0}^{(h)}\in H^{\delta}(\mathbb{R})$ for some $\delta>1/2$ then it immediately follows from the proof of Theorem 6.3 that

[TABLE]

with $a_{\delta}$ given in (36). The bandwidth $b>0$ can be chosen as in Remark 6.6.

7 Numerical Performance of the Estimators

In order to compare the three approaches of Section 6, we consider $\Lambda(\Delta)$ to be a compound Poisson random variable

[TABLE]

where $\{Y_{k}\}_{k\in\mathbb{N}}$ is a sequence of independent and identically distributed random variables, independent of $N\sim Poi(\nu_{d}(\Delta))$ . Then for any simple function $f=\sum_{k=1}^{n}f_{k}{1{\rm I}}_{\Delta_{k}}$ with $\nu_{d}(\Delta_{k})=\nu_{d}(\Delta)$ for all $k=1,\dots,n$ it holds

[TABLE]

where $W_{1},\dots,W_{n}$ are i.i.d. with $W_{1}\overset{d}{=}\Lambda(\Delta)$ .

In the following examples, we assumed $d=2$ , $n=4$ , $f_{1}=1.3$ , $f_{2}=0.2$ , $f_{3}=f_{4}=0.1$ as well as $\nu_{2}(\Delta)=1$ . Then $v_{0}$ is the density of the random variable $Y_{1}$ , and due to formula (15), $v_{1}$ is given by

[TABLE]

or, equivalently,

[TABLE]

where $g_{1}(x)=xv_{1}(x)$ and $g_{0}(x)=xv_{0}(x)$ , $h(x)=x$ . Note that the coefficients $f_{1},\dots,f_{4}$ fulfill conditions of Theorem 4.1, i.e. for given $g_{1}\in L^{2}(\mathbb{R})$ there exists a solution $g_{0}\in L^{2}(\mathbb{R})$ to the above equation. In our examples, we simulated the random field $X$ on an integer grid. The estimators for $g_{0}$ based on the corresponding sample with sample size $N=10000$ were compared to the original $g_{0}$ for the following examples:

[TABLE]

For the estimators based on the Fourier method from Section 6.2, the parameter $l=1$ is chosen due to Corollary 5.3, cf. Section 5. For both the plug-in (Section 6.1) and the Fourier method, we used furthermore the cut-off parameter $n_{N}=1$ . For the smoothing procedure, the Epanechnikov kernel

[TABLE]

with bandwidths $b=0.5$ and $b=1.0$ was used in examples (49) and (50) respectively, chosen according to Remark 6.6. For the OnB method, Haar wavelets $\{\psi_{j}\}$ on $[-A,A]$ for $A=6$ were used together with the cut–off parameter $m=7$ . The parameter $l>0$ and the bandwidth $b>0$ for the estimator in (45) (using Epanechnikov kernel $K_{b}$ ) were chosen based on a simulation study with different parameters. It turned out that visually the best choice for the example in (49) is $l=4.5$ , $b=0.7$ whereas for the example in (50) the parameters $l=4.0$ , $b=1.1$ turned out to be optimal. Figures 2 and 3 show realizations of the estimated $g_{0}$ (red) by our methods compared to the original $g_{0}$ (dashed) from examples (49) and (50).

The empirical mean and the standard deviation of the mean square errors of our estimation (assessed upon estimation results for $g_{0}$ out of $100$ simulations of $X$ ) are given in Table 1. It is seen there that plug-in and Fourier methods perform equally well whereas the mean error for the OnB method is significantly higher. Regarding their computation times (see Table 2), the Fourier approach outperforms the others since its algorithm is at least $10$ times faster. To summarize, we recommend the Fourier method for the estimation of $v_{0}$ unless the plug-in approach can be used under milder assumptions on $v_{0}$ and $v_{1}$ . This essentially depends on the estimator for $v_{1}$ which is chosen as a plug-in.

Appendix

Here we give a proof of Theorem 5.1 and its corollaries. Before doing so we prove auxiliary statements.

Lemma 7.1.

Let $Y=\{Y_{t},t\in\mathbb{Z}^{d}\}$ be a random field defined in (26) satisfying ${\rm\bf(H2)}_{2}$ such that $Y$ is either

(i)

$m$ -dependent or

(ii)

$\phi$ -mixing and condition (12) holds.

Furthermore, let $W\subset\mathbb{Z}^{d}$ be a finite subset, $N=\textup{card}(W)$ , and let $\hat{\theta}(u)=\frac{1}{N}\sum_{t\in W}Y_{t}e^{iuY_{t}}$ and $\theta(u)=\operatorname{\mathbb{E}}Y_{0}e^{iuY_{0}}$ . Then

[TABLE]

where $C>0$ is a constant.

Proof.

It holds that

[TABLE]

(i)

By Theorem 2.1 it holds for $p=4$ , $\alpha=1$ and $i=1,2$

[TABLE]

To determine expression $D$ it is useful to decompose it into two parts. The first part consists of all $k$ for which $\|k\|_{\infty}>m$ and the second part contains all other $k$ . Hence,

[TABLE]

For the first part it holds due to $m$ –dependence of $\{\xi^{(i)}_{t}(u)\}$ that

[TABLE]

since $\xi^{(i)}_{t}(u)$ is centered. Furthermore, for the second sum in expression $D$ it follows by Hölder inequality that

[TABLE]

Let $\tilde{V}_{t}^{1}:=\{k\in V_{t}^{1}:\|k-t\|_{\infty}\leq m\}$ and $n_{t}:=\textup{card}(\tilde{V}_{t}^{1})$ . This set is shown in Figure 4 for $d=2$ .

Note that for $i=1,2$ due to stationarity of $Y$

[TABLE]

Therefore, for all $t\in W$ and $i=1,2$ it holds that $\bigl{\|}\xi^{(i)}_{t}(u)\bigr{\|}_{2}\leq 2\|Y_{0}\|_{2}.$ Applying this to ((i)), we get $\bigl{\|}\xi^{(i)}_{t}(u)\bigr{\|}_{2}\sum_{k\in\tilde{V}_{t}^{1}}\bigl{\|}\xi^{(i)}_{k}(u)\bigr{\|}_{2}\leq 4n_{t}\|Y_{0}\|_{2}^{2}.$ Moreover, it follows

[TABLE]

with $n^{*}:=\underset{t\in W}{\max}\{n_{t}\}$ . By Ljapunov inequality, it holds

[TABLE]

(ii)

Using Theorem 2.3 with $p=4$ and applying the Ljapunov inequality we get

[TABLE]

for some constants $C_{i}>0$ , $i=1,2$ , where the last inequality follows by equation (52). Thus, we have

[TABLE]

where $C=2^{5}(C_{1}+C_{2})>0$ is constant.

∎

If assumption (i) holds then the constant $C$ is given by $C=2^{10}(1+2n^{*})^{2}$ , where $n^{*}\leq m^{d}$ is the maximum over the cardinalities of the sets $\tilde{V}_{t}^{1}$ for every $t\in W$ . Therefore, in the first case the constant $C$ depends on $m$ . In the second case the constant $C=2^{5}(C_{1}+C_{2})$ depends on the mixing coefficient $\phi_{u,v}(r)$ by Theorem 2.3.

Lemma 7.2.

Let $\hat{\psi}(u)=\frac{1}{N}\sum_{t\in W}e^{iuY_{t}}$ and $\psi(u)=\operatorname{\mathbb{E}}e^{iuY_{0}}$ where $N=\textup{card}(W)$ . Under the assumptions of Lemma 7.1 for $p\geq 2$ there exists a constant $C_{p}>0$ such that

[TABLE]

Proof.

Since $x\mapsto|x|^{p}$ , $p\geq 2$ is a convex function it holds

[TABLE]

(i)

Applying Theorem 2.1 with $\alpha=1$ we get for $i=1,2$

[TABLE]

Since

[TABLE]

it follows $\bigl{\|}\tilde{\xi}^{(i)}_{t}(u)\bigr{\|}_{2}^{2}\leq 4.$ Analogously to the calculations in the proof of Lemma 7.1 (i) we observe

[TABLE]

and hence

[TABLE]

So all in all we get from (54) that $\operatorname{\mathbb{E}}\bigl{|}\hat{\psi}(u)-\psi(u)\bigr{|}^{p}\leq\frac{C_{p}}{N^{p/2}}$ for the constant $C_{p}=2^{2p}p^{p/2}(1+n^{*})^{p/2}>0$ and $n^{*}\leq m^{d}$ .

(ii)

Using Theorem 2.3 and inequality (55) it follows for $p\geq 2$ and $i=1,2$ that

[TABLE]

By equation (54) it finally follows $\operatorname{\mathbb{E}}\bigl{|}\hat{\psi}(u)-\psi(u)\bigr{|}^{p}\leq\frac{C_{p}}{N^{p/2}},$ where $C_{p}=2^{\frac{3}{2}p-1}(C_{1}+C_{2})>0$ is a constant depending on $p$ and the mixing coefficient of $\{\tilde{\xi}_{t}^{(i)}\}$ , $i=1,2$ .

∎

The following lemma is a generalization of [22, Lemma 2.1] (proven there for independent random variables and $p=1$ ) to the case of weakly dependent random fields.

Lemma 7.3.

Under the assumptions of Lemma 7.1 together with condition (13) there exists a constant $C>0$ such that for $p\in\mathbb{N}$

[TABLE]

Proof.

1.)

Let $|\psi(u)|<2N^{-1/2}$ . Then it holds

[TABLE]

where the last inequality follows by Lemma 7.2 and the fact that an indicator is always smaller or equal than $1$ . In this case, we get for $|\psi(u)|<2N^{-1/2}$ that

[TABLE]

2.)

Let $|\psi(u)|\geq 2N^{-1/2}$ . Then we get

[TABLE]

To calculate this probability, we consider assumptions (i) and (ii) separately.

(i)

Here we can apply Theorem 2.2 and we get for $i=1,2$

[TABLE]

where

[TABLE]

and $\|Z\|_{\infty}:=\inf\{c>0:P(|Z|>c)=0\}$ for a random variable $Z$ . By inequality (55) and $m$ -dependence

[TABLE]

Therefore, $b_{i}$ can be estimated as $b_{i}\leq\sum_{t\in W}(4+4n_{t})\leq 4N(1+n^{*}),$ $i=1,2$ , with $n^{*}$ as in the proof of Lemma 7.1. For expression (57) we get

[TABLE]

(ii)

Apply Theorem 2.4 to $\{\tilde{\xi}_{t}^{(i)}(u)\}$ with $a_{t}=1$ for all $t\in W$ and $h=2$ . Then, $A(W)=N$ , and we have

[TABLE]

So we get in both cases

[TABLE]

It holds that

[TABLE]

Applying the binomial theorem and $|\hat{\psi}(u)|\geq N^{-1/2}$ we get

[TABLE]

Therefore,

[TABLE]

So all in all, it holds

[TABLE]

that concludes the proof.

∎

Now we can finalize the proof of Theorem 5.1.

Proof of Theorem 5.1.

Note that $g_{1}-g_{1,l}$ is orthogonal to $\hat{g}_{1,l}-g_{1,l}$ , since

[TABLE]

due to isometry property of $\mathcal{F}$ in $L^{2}(\mathbb{R})$ . By Pythagorean theorem we get

[TABLE]

and the second term can further be determined by

[TABLE]

Furthermore,

[TABLE]

First we calculate expression (I). Using the Cauchy-Schwarz inequality and applying Lemma 7.1 and Lemma 7.3 it holds

[TABLE]

where $c_{1},c_{2}>0$ are some constants. Now we get

[TABLE]

For the second term (II) we use again Lemma 7.3. Then it holds

[TABLE]

for some $c>0$ . Using this, we get

[TABLE]

Part (III) can be estimated by Ljapunov inequality and Lemma 7.1 as

[TABLE]

So putting all these results together, it follows for some constant $K>0$ that

[TABLE]

that completes the proof. ∎

Proof of Corollary 5.2.

Consider expression (58) in the proof of Theorem 5.1. Using assumptions (H3)– (H4) there, it holds

[TABLE]

∎

Proof of Corollary 5.3.

Since $\cal F$ is an isometry of $L^{2}(\mathbb{R})$ one has

[TABLE]

by assumption (H4). Using (H3) one gets

[TABLE]

Plugging this into (30) yields the result. ∎

Acknowledgement

W. Karcher and E. Spodarev are grateful to E. V. Jensen for her hospitality during their stay at Aarhus University in February 2011 where this research was initiated. The authors thank M. Reiß for the fruitful discussions on the subject of the paper. They also acknowledge the valuable help of O. Moreva in implementing the algorithms of Section 7.

Bibliography22

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] A.M. Kagan, Yu.V. Linnik, and R. C. Rao. Characterization Problems in Mathematical Statistics . John Wiley & Sons, New York, 1973.
2[2] D. Belomestny, V. Panov, and J. Woerner. Low frequency estimation of continuous–time moving average Lévy processes. ar Xiv: 1607.00896 v 1 , 2016.
3[3] W. Karcher. On Infinitely Divisible Random Fields with an Application in Insurance. Phd thesis, Ulm University, 2012.
4[4] J. Glück, S. Roth, and E. Spodarev. A solution of a linear integral equation with the application to statistics of infinitely divisible moving averages. Preprint , 2017.
5[5] M. H. Neumann and M. Reiß. Nonparametric estimation for Lévy processes from low-frequency observations. Bernoulli , 15(1):223–248, 2009.
6[6] S. Gugushvili. Nonparametric inference for discretely sampled Lévy processes. In Ann. Inst. H. Poincaré, Probab. Statist. , volume 48, pages 282–307, 2012.
7[7] F. Comte and V. Genon-Catalot. Nonparametric estimation for pure jump Lévy processes based on high frequency data. Stochastic Processes and their Applications , 119(12):4088–4123, 2009.
8[8] B. S. Rajput and J. Rosinski. Spectral representations of infinitely divisible processes. Probab. Th. Rel. Fields , (82):451–487, 1989.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

An Inverse Problem for Infinitely Divisible Moving Average Random Fields

Abstract

1 Introduction

2 Preliminaries

2.1 ID Random Measures and Fields

2.2 mmm-Dependent and ϕ\phiϕ-Mixing Random Fields

2.3 Moment and Exponential Inequalities for Random Fields

Theorem 2.1**.**

Theorem 2.2**.**

Theorem 2.3**.**

Theorem 2.4**.**

3 Inverse Problem

The Inverse Problem**.**

4 Existence and Uniqueness of a Solution for v0v_{0}v0​

Theorem 4.1**.**

Proof.

Remark 4.2**.**

Remark 4.3**.**

Remark 4.4**.**

5 Estimation of g1(h)g^{(h)}_{1}g1(h)​ for Pure Jump ID Random Fields

Theorem 5.1**.**

Corollary 5.2**.**

Corollary 5.3**.**

6 Estimation of the Lévy Density v0v_{0}v0​

6.1 Plug-In Estimator

Theorem 6.1**.**

Proof.

Lemma 6.2**.**

Proof.

Theorem 6.3**.**

Proof.

Lemma 6.4**.**

Proof.

Corollary 6.5**.**

Proof.

Remark 6.6**.**

6.2 Fourier Approach

Remark 6.7**.**

6.3 Orthonormal Basis Approach

Theorem 6.8**.**

Theorem 6.9**.**

Proof.

Theorem 6.10**.**

Proof.

Remark 6.11**.**

7 Numerical Performance of the Estimators

Appendix

Lemma 7.1**.**

Proof.

Lemma 7.2**.**

Proof.

Lemma 7.3**.**

Proof.

Proof of Theorem 5.1.

Proof of Corollary 5.2.

Proof of Corollary 5.3.

Acknowledgement

2.2 $m$ -Dependent and $\phi$ -Mixing Random Fields

Theorem 2.1.

Theorem 2.2.

Theorem 2.3.

Theorem 2.4.

The Inverse Problem.

4 Existence and Uniqueness of a Solution for $v_{0}$

Theorem 4.1.

Remark 4.2.

Remark 4.3.

Remark 4.4.

5 Estimation of $g^{(h)}_{1}$ for Pure Jump ID Random Fields

Theorem 5.1.

Corollary 5.2.

Corollary 5.3.

6 Estimation of the Lévy Density $v_{0}$

Theorem 6.1.

Lemma 6.2.

Theorem 6.3.

Lemma 6.4.

Corollary 6.5.

Remark 6.6.

Remark 6.7.

Theorem 6.8.

Theorem 6.9.

Theorem 6.10.

Remark 6.11.

Lemma 7.1.

Lemma 7.2.

Lemma 7.3.