Optimal sampling design for global approximation of jump diffusion SDEs

Pawe{\l} Przyby{\l}owicz

arXiv:1701.08311·math.NA·October 6, 2020

TL;DR

This paper analyzes the optimal sampling strategies for accurately approximating jump diffusion SDEs driven by Poisson and Wiener processes, establishing convergence rates and constructing asymptotically optimal methods.

Contribution

It determines the exact convergence rate of minimal errors for approximating jump diffusion SDEs and constructs optimal Milstein-based methods using various sampling schemes.

Findings

01

Nonequidistant sampling is more efficient than equidistant sampling.

02

Optimal methods asymptotically attain the minimal possible errors.

03

The convergence rate of approximation errors is precisely characterized.

Abstract

The paper deals with strong global approximation of SDEs driven by two independent processes: a nonhomogeneous Poisson process and a Wiener process. We assume that the jump and diffusion coefficients of the underlying SDE satisfy the jump commutativity condition. We establish the exact convergence rate of minimal errors that can be achieved by arbitrary algorithms based on a finite number of observations of the Poisson and Wiener processes. We consider classes of methods that use equidistant or nonequidistant sampling of the Poisson and Wiener processes. We provide a construction of optimal methods, based on the classical Milstein scheme, which asymptotically attain the established minimal errors. The analysis implies that methods based on a nonequidistant mesh are more efficient than those based on an equidistant mesh.

Equations

\left\{\begin{array}[]{ll}dX(t)=a(t,X(t))dt+b(t,X(t))dW(t)+c(t,X(t-))dN(t),&t\in[0,T],\\ X(0)=x_{0},\end{array}\right.

\limsup\limits_{n\to+\infty}n^{1/2}\cdot\inf\limits_{\bar{X}_{n}\in\chi^{\rm noneq}}\Bigl{(}\mathbb{E}\|X-\bar{X}_{n}\|^{2}_{L^{2}([0,T])}\Bigr{)}^{1/2}\leq\frac{1}{\sqrt{6}}\int\limits_{0}^{T}\Bigl{(}\mathbb{E}(\mathcal{Y}(t))\Bigr{)}^{1/2}dt,

\liminf\limits_{n\to+\infty}n^{1/2}\cdot\inf\limits_{\bar{X}_{n}\in\chi^{\rm noneq}}\Bigl{(}\mathbb{E}\|X-\bar{X}_{n}\|^{2}_{L^{2}([0,T])}\Bigr{)}^{1/2}\geq\frac{1}{\sqrt{12}}\int\limits_{0}^{T}\Bigl{(}\mathbb{E}(\mathcal{Y}(t))\Bigr{)}^{1/2}dt,

\lim\limits_{n\to+\infty}n^{1/2}\cdot\inf\limits_{\bar{X}_{n}\in\chi^{\rm noneq*}}\Bigl{(}\mathbb{E}\|X-\bar{X}_{n}\|^{2}_{L^{2}([0,T])}\Bigr{)}^{1/2}=\frac{1}{\sqrt{6}}\int\limits_{0}^{T}\Bigl{(}\mathbb{E}(\mathcal{Y}(t))\Bigr{)}^{1/2}dt,

\lim\limits_{n\to+\infty}n^{1/2}\cdot\inf\limits_{\bar{X}_{n}\in\chi^{\rm eq}}\Bigl{(}\mathbb{E}\|X-\bar{X}_{n}\|^{2}_{L^{2}([0,T])}\Bigr{)}^{1/2}=\sqrt{\frac{T}{6}}\Biggl{(}\int\limits_{0}^{T}\mathbb{E}(\mathcal{Y}(t))dt\Biggr{)}^{1/2}.

\tilde{N} (t) = N (t) - m (t), t \in [0, T],

L_{1} f (t, y) = b (t, y) \cdot \frac{\partial f}{\partial y} (t, y),

L_{-1}f(t,y)=f(t,y+c(t,y))-f(t,y),\quad(t,y)\in[0,T]\times\mathbb{R}.

|L_{1}f(t,y)-L_{1}f(t,z)|\leq K|y-z|.

L_{- 1} b (t, y) = L_{1} c (t, y),

|f(t,y)|\leq K_{1}(1+|y|),

\Bigl{|}\frac{\partial^{j}f}{\partial y^{j}}(t,y)\Bigl{|}\leq K,\quad j=1,2.

\max\{|L_{-1}f(t,y)|,|L_{1}f(t,y)|\}\leq K_{2}(1+|y|),

\Bigl\|\sup\limits_{t\in[0,T]}X(t)\Bigr\|_{L^{4}(\Omega)}\leq C_{1},

\|X(t)-X(s)\|_{L^{2}(\Omega)}\leq C_{2}|t-s|^{1/2}.

\mathcal{Y}(t)=|b(t,X(t))|^{2}+\lambda(t)\cdot|c(t,X(t))|^{2},\quad t\in[0,T].

\lim\limits_{h\to 0+}\frac{\|X(t+h)-X(t)\ |\ X(t)\|_{L^{2}(\Omega)}}{h^{1/2}}=(\mathcal{Y}(t))^{1/2},

\lim\limits_{h\to 0+}\frac{\|X(t+h)-X(t)\|_{L^{2}(\Omega)}}{h^{1/2}}=\Bigl{(}\mathbb{E}(\mathcal{Y}(t))\Bigr{)}^{1/2}.

\varphi_{n}:\mathbb{R}^{2n}\to L^{2}([0,T]),

\Delta_{n}^{Z}=\{t_{0,n}^{Z},t_{1,n}^{Z},\ldots,t_{n,n}^{Z}\},

0 = t_{0, n}^{Z} < t_{1, n}^{Z} < \dots < t_{n, n}^{Z} = T,

\bar{\mathcal{N}}(N,W)=\{\mathcal{N}_{n}(N,W)\}_{n\in\mathbb{N}},

\mathcal{N}_{n}(N,W)

\bar{X}_{n}=\varphi_{n}(\mathcal{N}_{n}(N,W)).

cost_{n}(\bar{X})=\left\{\begin{array}[]{ll}2n,&\hbox{if}\ b\not\equiv 0\ \hbox{and}\ c\not\equiv 0,\\ n,&\hbox{if}\ (b\not\equiv 0\ \hbox{and}\ c\equiv 0)\ \hbox{or}\ (b\equiv 0\ \hbox{and}\ c\not\equiv 0),\\ 0,&\hbox{if}\ b\equiv 0\ \hbox{and}\ c\equiv 0.\end{array}\right.

\chi^{\rm noneq*}=\{\bar{X}\in\chi^{\rm noneq}\ |\ \exists_{n_{0}^{*}=n_{0}^{*}(\bar{X})}:\forall_{n\geq n_{0}^{*}}\ \Delta_{n}^{N}=\Delta_{n}^{W}\},

\chi^{\rm eq}=\{\bar{X}\in\chi^{\rm noneq}\ |\ \exists_{n_{0}^{*}=n_{0}^{*}(\bar{X})}:\forall_{n\geq n_{0}^{*}}\ \Delta_{n}^{N}=\Delta_{n}^{W}=\{iT/n\ :\ i=0,1,\ldots,n\}\}.

e_{n}(\bar{X})=\|X-\bar{X}_{n}\|_{2}=\Biggl{(}\mathbb{E}\int\limits_{0}^{T}|X(t)-\bar{X}_{n}(t)|^{2}dt\Biggr{)}^{1/2}.

e^{\diamond}(n)=\inf\limits_{\bar{X}\in\chi^{\diamond}}e_{n}(\bar{X}),\quad\diamond\in\{{\rm eq},{\rm noneq*},{\rm noneq}\}.

0 = t_{0} < t_{1} < \dots < t_{m} = T,

\Delta Z_{i}=Z(t_{i+1})-Z(t_{i}),\quad i=0,1,\ldots,m-1,

Full text

**Optimal sampling design for global approximation of jump diffusion SDEs**

This research was partly supported by the Polish NCN grant - decision No. DEC-2013/09/B/ST1/04275 and by AGH local grant.

Paweł Przybyłowicz

*AGH University of Science and Technology,

Faculty of Applied Mathematics,

Al. Mickiewicza 30, 30-059 Krakow, Poland,

E-mail:* [email protected]

Abstract

The paper deals with strong global approximation of SDEs driven by two independent processes: a nonhomogeneous Poisson process and a Wiener process. We assume that the jump and diffusion coefficients of the underlying SDE satisfy the jump commutativity condition (see Chapter 6.3 in [21]). We establish the exact convergence rate of minimal errors that can be achieved by arbitrary algorithms based on a finite number of observations of the Poisson and Wiener processes. We consider classes of methods that use equidistant or nonequidistant sampling of the Poisson and Wiener processes. We provide a construction of optimal methods, based on the classical Milstein scheme, which asymptotically attain the established minimal errors. The analysis implies that methods based on a nonequidistant mesh are more efficient than those based on an equidistant mesh.

Key words: nonhomogeneous Poisson process, Wiener process, jump commutativity condition, standard information, minimal strong error, asymptotically optimal algorithm

Mathematics Subject Classification: 68Q25, 65C30.

1 Introduction

We investigate the global approximation for the following jump diffusion stochastic differential equations (SDEs)

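$$\left\{\begin{array}{ll}dX(t)=a(t,X(t))dt+b(t,X(t))dW(t)+c(t,X(t-))dN(t),&t\in[0,T],\\ X(0)=x_{0},\end{array}\right.\tag{1}$$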

driven by two independent processes: a nonhomogeneous one-dimensional Poisson process $N=\{N(t)\}_{t\in[0,T]}$ with intensity function $\lambda=\lambda(t)>0$ and a one-dimensional Wiener process $W$. We assume, without loss of generality, that $x_{0}\in\mathbb{R}$. Jump diffusion SDEs (1) appear in various fields such as physics, biology, engineering and mathematical finance, see, for example, [1], [9], [29], [30] and pages 43-44 in [21]. We are interested in efficient algorithms that approximate whole trajectories of $X$ and use only discrete values of the driving Poisson and Wiener processes.

Approximation of stochastic differential equations driven only by a Wiener process has been widely investigated in the literature. In that case, upper bounds on the errors of the proposed methods were established, see, for example, [14]. Lower bounds were also investigated for the strong approximation in the Wiener (Gaussian) case, see, for example, [8], [12], [18]-[20] and [23]-[25].

In the jump diffusion case suitable approximation schemes were provided and upper bounds on their errors discussed, for example, in the monograph [21] and in the articles [3], [6], [9]-[11] and [16]. However, to the best of the author's knowledge, so far only one paper deals with asymptotic lower bounds and the exact rate of convergence of the minimal errors for the global approximation of SDEs with jumps, see [26]. In that paper the author considered pure jump SDEs (1), i.e., $b\equiv 0$ and $c=c(t)$. We can also mention [4], where the authors investigated the optimal rate of convergence for the problem of approximating stochastic integrals of regular functions with respect to a homogeneous Poisson process. Here, we extend the approach used in [26] in order to cover more general SDEs of the form (1).

The purpose of this paper is to find lower bounds on the error and to define optimal methods solving (1). In the purely Gaussian case, similar questions were considered, for example, in [12] and [18]. In order to study jump diffusion equations (1) driven by the Poisson and the Wiener processes a new technique is necessary. The main difference, compared to the Gaussian case, is that we have to use some facts from the theory of stochastic integration with respect to càdlàg, square integrable martingales, see, for example, [15], [17], [22] and [29]. Moreover, when establishing the exact asymptotic constants we have to face the fact that the intensity of the process $N$ depends on time. This problem does not appear in [26], where the intensity is constant. In addition, we assume that the coefficients $b$ and $c$ satisfy the jump commutativity condition. This condition is described and discussed at length in, for example, Chapter 6.3 in [21]. Roughly speaking, it ensures that the construction of the Itô-Taylor schemes does not require knowledge of the exact location of the jump times of the Poisson process $N$. In this paper we use this condition extensively when establishing asymptotic lower and upper bounds.

We consider three classes of approximation schemes denoted by $\chi^{\rm eq}$, $\chi^{\rm noneq*}$ and $\chi^{\rm noneq}$, depending on the sampling method for trajectories of the processes $N$ and $W$. The class $\chi^{\rm eq}$ contains methods based on the equidistant discretization of $[0,T]$. Methods using the same (but not necessarily equidistant) evaluation points for $N$ and $W$ belong to a wider class $\chi^{\rm noneq*}$. Methods that can use different, and again not necessarily equidistant, sampling points for the processes $N$ and $W$ belong to $\chi^{\rm noneq}$. We have $\chi^{\rm eq}\subset\chi^{\rm noneq*}\subset\chi^{\rm noneq}$.

The main result of the paper, Theorem 4.2, states that for fixed $a$ , $b$ , $c$ , $\lambda$ , $x_{0}$ and in the case when the underlying SDE (1) is driven by two processes $N$ and $W$ (i.e., $b\not\equiv 0$ and $c\not\equiv 0$ ) the following holds

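$$\limsup\limits_{n\to+\infty}n^{1/2}\cdot\inf\limits_{\bar{X}_{n}\in\chi^{\rm noneq}}\Bigl(\mathbb{E}\|X-\bar{X}_{n}\|^{2}_{L^{2}([0,T])}\Bigr)^{1/2}\leq\frac{1}{\sqrt{6}}\int\limits_{0}^{T}\Bigl(\mathbb{E}(\mathcal{Y}(t))\Bigr)^{1/2}dt,\tag{2}$$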

and

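$$\liminf\limits_{n\to+\infty}n^{1/2}\cdot\inf\limits_{\bar{X}_{n}\in\chi^{\rm noneq}}\Bigl(\mathbb{E}\|X-\bar{X}_{n}\|^{2}_{L^{2}([0,T])}\Bigr)^{1/2}\geq\frac{1}{\sqrt{12}}\int\limits_{0}^{T}\Bigl(\mathbb{E}(\mathcal{Y}(t))\Bigr)^{1/2}dt,\tag{3}$$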

where $\mathcal{Y}(t)=|b(t,X(t))|^{2}+\lambda(t)\cdot|c(t,X(t))|^{2}$, $t\in[0,T]$. In (2) and (3) the method $\bar{X}_{n}$ uses at most $n$ evaluations of $N$ and $W$. By taking the infimum we mean that we choose mappings $\{\bar{X}_{n}\}_{n\in\mathbb{N}}$ along with discretization points in the best possible way. For the subclass $\chi^{\rm noneq*}$ of $\chi^{\rm noneq}$ we have

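$$\lim\limits_{n\to+\infty}n^{1/2}\cdot\inf\limits_{\bar{X}_{n}\in\chi^{\rm noneq*}}\Bigl(\mathbb{E}\|X-\bar{X}_{n}\|^{2}_{L^{2}([0,T])}\Bigr)^{1/2}=\frac{1}{\sqrt{6}}\int\limits_{0}^{T}\Bigl(\mathbb{E}(\mathcal{Y}(t))\Bigr)^{1/2}dt,\tag{4}$$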

while in $\chi^{\rm eq}$ we have that

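$$\lim\limits_{n\to+\infty}n^{1/2}\cdot\inf\limits_{\bar{X}_{n}\in\chi^{\rm eq}}\Bigl(\mathbb{E}\|X-\bar{X}_{n}\|^{2}_{L^{2}([0,T])}\Bigr)^{1/2}=\sqrt{\frac{T}{6}}\Biggl(\int\limits_{0}^{T}\mathbb{E}(\mathcal{Y}(t))dt\Biggr)^{1/2}.\tag{5}$$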

In (5) the infimum means that we only choose mappings $\{\bar{X}_{n}\}_{n\in\mathbb{N}}$ in the best possible way, while the discretization of $[0,T]$ is fixed and uniform. As we can see, the order of convergence is $n^{-1/2}$ , but the asymptotic constant in (5) may be considerably larger than that in (2), (3) and (4). In the class $\chi^{\rm noneq}$ we have a small gap between the upper and lower asymptotic constants. We conjecture that the exact rate of convergence of the minimal errors in $\chi^{\rm noneq}$ is the same as for $\chi^{\rm noneq*}$ . Note also that if $b\equiv 0$ and $c=c(t)$ then we arrive at results known from [26], while if $c\equiv 0$ and $b\not\equiv 0$ then, for the classes $\chi^{\rm eq}$ and $\chi^{\rm noneq*}$ , we restore the results known from [12], see Remark 4.2.
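The gap between the asymptotic constants in (4) and (5) can be made concrete numerically. The following is a minimal sketch, assuming a hypothetical profile $g(t)=\mathbb{E}(\mathcal{Y}(t))=e^{4t}$ on $[0,1]$ (not taken from the paper); the helper names `integrate` and `asymptotic_constants` are illustrative only. By the Cauchy-Schwarz inequality the first constant never exceeds the second.

```python
import math

def integrate(f, a, b, steps=10_000):
    """Composite midpoint rule for the integral of f over [a, b]."""
    h = (b - a) / steps
    return h * sum(f(a + (k + 0.5) * h) for k in range(steps))

def asymptotic_constants(g, T):
    """Return the right-hand sides of (4) and (5) for g(t) = E(Y(t)).

    c_noneq_star = (1/sqrt(6)) * int_0^T sqrt(g(t)) dt
    c_eq         = sqrt(T/6)   * (int_0^T g(t) dt) ** (1/2)
    """
    c_noneq_star = integrate(lambda t: math.sqrt(g(t)), 0.0, T) / math.sqrt(6.0)
    c_eq = math.sqrt(T / 6.0) * math.sqrt(integrate(g, 0.0, T))
    return c_noneq_star, c_eq

# Hypothetical profile (an assumption for illustration): E(Y(t)) = exp(4 t).
T = 1.0
g = lambda t: math.exp(4.0 * t)
c_star, c_eq = asymptotic_constants(g, T)
print(c_star, c_eq, c_eq / c_star)
```

For a constant profile the two values coincide, which matches the observation that equidistant sampling loses nothing when $\mathbb{E}(\mathcal{Y}(t))$ does not vary in time.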

The asymptotically optimal scheme is defined by a piecewise linear interpolation of the classical Milstein steps, performed at suitably selected discretization points. The discretization points are chosen as quantiles of a distribution corresponding to a density $\psi:[0,T]\to\mathbb{R}_{+}$. It turns out that in the class $\chi^{\rm noneq*}$ the optimal density $\psi_{0}$ is proportional to $(\mathbb{E}(\mathcal{Y}(t)))^{1/2}$. The main disadvantage of using such regular sampling is the need for exact values of the quantiles of $(\mathbb{E}(\mathcal{Y}(t)))^{1/2}$, which might be hard to compute in general. In Section 4.1 we present the exact computation of sampling points in the linear case (Merton's model).
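When exact quantiles are not available, they can be approximated numerically. The sketch below is one possible way to do this, not the construction used in the paper: it accumulates a strictly positive, unnormalised density `psi` on a fine grid and inverts the cumulative integral by linear interpolation. The function name `quantile_mesh`, the parameter `fine` and the example profile $e^{2t}$ are assumptions made for illustration.

```python
import math

def quantile_mesh(psi, T, n, fine=20_000):
    """Return 0 = t_0 < ... < t_n = T such that every [t_{i-1}, t_i]
    carries the same share 1/n of the integral of psi over [0, T].
    psi is assumed to be strictly positive on [0, T]."""
    h = T / fine
    # Cumulative integral of psi on the fine grid (midpoint rule).
    cum = [0.0]
    for k in range(fine):
        cum.append(cum[-1] + h * psi((k + 0.5) * h))
    total = cum[-1]
    mesh, k = [0.0], 0
    for i in range(1, n):
        target = total * i / n
        while cum[k + 1] < target:
            k += 1
        # Linear interpolation inside the fine cell [k*h, (k+1)*h].
        frac = (target - cum[k]) / (cum[k + 1] - cum[k])
        mesh.append((k + frac) * h)
    mesh.append(T)
    return mesh

# Hypothetical profile: (E Y(t))^{1/2} proportional to exp(2 t) on [0, 1].
mesh = quantile_mesh(lambda t: math.exp(2.0 * t), T=1.0, n=8)
steps = [t - s for s, t in zip(mesh, mesh[1:])]
print(steps)
```

The step sizes shrink towards the end of the interval, where the assumed density is largest, so more observations are spent where the solution is locally rougher.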

The paper is organized as follows. In Section 2 we give basic notions and definitions. Asymptotic lower bounds on the minimal errors are established in Section 3, while asymptotically optimal methods are defined in Section 4. We chose this order of presentation because the technique used to prove the lower bounds in Section 3 suggests the definitions of the optimal methods in Section 4. Finally, the Appendix contains proofs of auxiliary results used in the paper.

2 Preliminaries

Let $T>0$ be a given real number. We denote $\mathbb{N}=\{1,2,\ldots\}$ and $\mathbb{N}_{0}=\mathbb{N}\cup\{0\}$ . Let $(\Omega,\mathcal{F},\mathbb{P})$ be a complete probability space. We consider on it two independent processes: a one-dimensional Wiener process $W=\{W(t)\}_{t\in[0,T]}$ and a one–dimensional nonhomogeneous Poisson process $N=\{N(t)\}_{t\in[0,T]}$ with continuous intensity function $\lambda=\lambda(t)>0$ . Let $\{\mathcal{F}_{t}\}_{t\in[0,T]}$ denote the complete filtration generated by the driving processes $N$ and $W$ . We set $\displaystyle{m(t)=\int\limits_{0}^{t}\lambda(s)ds}$ and $\Lambda(t,s)=m(t)-m(s)$ for $t,s\in[0,T]$ . The process $N$ has independent increments where the increment $N(t)-N(s)$ has Poisson law with parameter $\Lambda(t,s)$ and $\mathbb{E}(N(t))=m(t)$ for $0\leq s\leq t$ , see [7] or [21]. The compensated Poisson process $\tilde{N}=\{\tilde{N}(t)\}_{t\in[0,T]}$ is defined as follows

$$\tilde{N}(t)=N(t)-m(t),\quad t\in[0,T],\tag{6}$$

which is a zero mean, square integrable $\{\mathcal{F}_{t}\}_{t\in[0,T]}$-martingale with càdlàg paths. For a random variable $X:\Omega\to\mathbb{R}$ we write $\|X\|_{L^{q}(\Omega)}=(\mathbb{E}|X|^{q})^{1/q}$, $q\in\{2,4\}$, and $\|X\ |\ \mathcal{G}\|_{L^{2}(\Omega)}=(\mathbb{E}(|X|^{2}\ |\ \mathcal{G}))^{1/2}$, where $\mathcal{G}$ is a sub-$\sigma$-field of $\mathcal{F}$. We say that a continuous function $f:[0,T]\times\mathbb{R}\to\mathbb{R}$ belongs to $C^{0,2}([0,T]\times\mathbb{R})$ if for $j\in\{1,2\}$ the partial derivatives $\partial^{j}f/\partial y^{j}=\partial^{j}f(t,y)/\partial y^{j}$ exist and are continuous on $(0,T)\times\mathbb{R}$, and can be continuously extended to $[0,T]\times\mathbb{R}$. For a continuous function $f:[0,T]\to\mathbb{R}$ its modulus of continuity is $\bar{\omega}(f,\delta)=\sup\limits_{t,s\in[0,T],|t-s|\leq\delta}|f(t)-f(s)|$, $\delta\in[0,+\infty)$. If $Y=\{Y(t)\}_{t\in[0,T]}$ is a right-continuous process with left-hand limits then we can define $Y(t-):=\lim\limits_{s\to t-}Y(s)$ for all $t\in(0,T]$. We have that $Y(t-)=Y(t)$ if and only if $Y$ is continuous at $t$. For further properties of càdlàg mappings used in this paper see, for example, Chapter 2.9 in [1]. For $f\in\{b,c\}$ we use the following notation

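$$\begin{aligned}L_{1}f(t,y)&=b(t,y)\cdot\frac{\partial f}{\partial y}(t,y),\\ L_{-1}f(t,y)&=f(t,y+c(t,y))-f(t,y),\quad(t,y)\in[0,T]\times\mathbb{R}.\end{aligned}\tag{7}$$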

We impose the following assumption on the mappings $a:[0,T]\times\mathbb{R}\to\mathbb{R}$ , $b:[0,T]\times\mathbb{R}\to\mathbb{R}$ , $c:[0,T]\times\mathbb{R}\to\mathbb{R}$ and on the intensity function $\lambda:[0,T]\to\mathbb{R}_{+}$ :

(A)

$f\in C^{0,2}([0,T]\times\mathbb{R})$ for $f\in\{a,b,c\}$ .

(B)

There exists $K>0$ such that for $f\in\{a,b,c\}$ , for all $t,s\in[0,T]$ and all $y,z\in\mathbb{R}$

(B1)

$|f(t,y)-f(t,z)|\leq K|y-z|$ ,

(B2)

$|f(t,y)-f(s,y)|\leq K(1+|y|)|t-s|$ ,

(B3)

$\Bigl{|}\frac{\partial f}{\partial y}(t,y)-\frac{\partial f}{\partial y}(t,z)\Bigl{|}\leq K|y-z|$ .

(C)

There exists $K>0$ such that for $f\in\{b,c\}$ , for all $t\in[0,T]$ and all $y,z\in\mathbb{R}$

$$|L_{1}f(t,y)-L_{1}f(t,z)|\leq K|y-z|.\tag{8}$$

(D)

The diffusion and the jump coefficients satisfy the following jump commutativity condition

$$L_{-1}b(t,y)=L_{1}c(t,y),\tag{9}$$

for all $(t,y)\in[0,T]\times\mathbb{R}$ . (We refer to Chapter 6.3 in [21] where the condition (9) is widely discussed.)

(E)

The intensity $\lambda:[0,T]\to\mathbb{R}_{+}$ is continuous in $[0,T]$ .
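Condition (D) can be checked numerically for given coefficients. The sketch below is only an illustration: the helpers `L1`, `Lm1` and `commutativity_gap` are hypothetical names, the derivative in $L_{1}$ is replaced by a central difference, and both coefficient pairs are assumptions chosen for the example. For linear coefficients $b(t,y)=\sigma y$ and $c(t,y)=\gamma y$ both sides of the condition equal $\sigma\gamma y$.

```python
import math

def L1(f, b, t, y, eps=1e-6):
    """L_1 f(t, y) = b(t, y) * df/dy(t, y), with a central difference for df/dy."""
    dfdy = (f(t, y + eps) - f(t, y - eps)) / (2.0 * eps)
    return b(t, y) * dfdy

def Lm1(f, c, t, y):
    """L_{-1} f(t, y) = f(t, y + c(t, y)) - f(t, y)."""
    return f(t, y + c(t, y)) - f(t, y)

def commutativity_gap(b, c, points):
    """Largest value of |L_{-1} b - L_1 c| over the given (t, y) points."""
    return max(abs(Lm1(b, c, t, y) - L1(c, b, t, y)) for t, y in points)

grid = [(0.1 * i, -2.0 + 0.5 * j) for i in range(11) for j in range(9)]

# Hypothetical linear coefficients: the gap vanishes up to rounding.
sigma, gamma = 0.3, 0.5
b_lin = lambda t, y: sigma * y
c_lin = lambda t, y: gamma * y
gap_linear = commutativity_gap(b_lin, c_lin, grid)

# Hypothetical pair for which the two sides differ: c(t, y) = sin(y).
c_sin = lambda t, y: math.sin(y)
gap_sin = commutativity_gap(b_lin, c_sin, grid)
print(gap_linear, gap_sin)
```

A gap that is clearly away from zero on such a grid indicates that the commutativity condition fails for that pair of coefficients.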

The assumptions (B1) and (B2) imply for $f\in\{a,b,c\}$ and all $(t,y)\in[0,T]\times\mathbb{R}$ that

$$|f(t,y)|\leq K_{1}(1+|y|),\tag{10}$$

where $K_{1}>0$ depends only on $f(0,0)$ , $K$ and $T$ . Moreover, by (B1) and (B3) we have for $f\in\{a,b,c\}$ and all $(t,y)\in[0,T]\times\mathbb{R}$ that

$$\Bigl|\frac{\partial^{j}f}{\partial y^{j}}(t,y)\Bigr|\leq K,\quad j=1,2.\tag{11}$$

From (B1), (10) and (11) we get for $f\in\{b,c\}$ and all $(t,y)\in[0,T]\times\mathbb{R}$ that

$$\max\{|L_{-1}f(t,y)|,|L_{1}f(t,y)|\}\leq K_{2}(1+|y|),\tag{12}$$

where $K_{2}=KK_{1}$ .

Unless otherwise stated, all unspecified constants appearing in this paper may only depend on the constant $K$ from the assumptions (B)-(C), $x_{0}$ , $\|\lambda\|_{\infty}$ , $\|1/\lambda\|_{\infty}$ , $a(0,0)$ , $b(0,0)$ , $c(0,0)$ and $T$ . Moreover, the same symbol might be used to denote different constants.

The assumptions (A)-(E) are rather standard compared to those known from the literature concerning approximation of jump diffusion SDEs, see the comment before Theorem 6.1. Only in Section 4.1 do we impose an additional assumption on the coefficients which, in fact, turns out to be necessary in order to define an optimal sampling from a probability density function.

For $a$, $b$, $c$ and $\lambda$ satisfying (B1), (B2) and (E) the equation (1) has a unique strong solution $X=\{X(t)\}_{t\in[0,T]}$ that is adapted to $\{\mathcal{F}_{t}\}_{t\in[0,T]}$ and has càdlàg paths, see [21], [22] or [30]. We also have the following moment estimates for the solution $X$, see, for example, [22] or [21].

Lemma 2.1

Let us assume that the mappings $a$ , $b$ , $c$ and $\lambda$ satisfy the assumptions (B1), (B2) and (E). Then there exist positive constants $C_{1}$ , $C_{2}$ such that

$$\Bigl\|\sup\limits_{t\in[0,T]}X(t)\Bigr\|_{L^{4}(\Omega)}\leq C_{1},\tag{13}$$

and for all $t,s\in[0,T]$

$$\|X(t)-X(s)\|_{L^{2}(\Omega)}\leq C_{2}|t-s|^{1/2}.\tag{14}$$

The following result characterizes the local mean square smoothness of the solution $X$ in the terms of the process $\mathcal{Y}=\{\mathcal{Y}(t)\}_{t\in[0,T]}$ defined as follows

$$\mathcal{Y}(t)=|b(t,X(t))|^{2}+\lambda(t)\cdot|c(t,X(t))|^{2},\quad t\in[0,T].\tag{15}$$

Of course $\mathcal{Y}$ has càdlàg paths and it is adapted to $\{\mathcal{F}_{t}\}_{t\in[0,T]}$ . (See Fact 6.1 in Appendix for the further properties of $\mathcal{Y}$ used in the paper.)

Proposition 2.1

Let us assume that the mappings $a$ , $b$ , $c$ and $\lambda$ satisfy the assumptions $(B1)$ , $(B2)$ and $(E)$ . Then for the solution $X$ of (1) we have that for all $t\in[0,T)$

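$$\lim\limits_{h\to 0+}\frac{\|X(t+h)-X(t)\ |\ X(t)\|_{L^{2}(\Omega)}}{h^{1/2}}=(\mathcal{Y}(t))^{1/2},\tag{16}$$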

almost surely and, in particular,

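$$\lim\limits_{h\to 0+}\frac{\|X(t+h)-X(t)\|_{L^{2}(\Omega)}}{h^{1/2}}=\Bigl(\mathbb{E}(\mathcal{Y}(t))\Bigr)^{1/2}.\tag{17}$$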

Proof. See the Appendix. $\blacksquare$

By Proposition 2.1 the square root of $\mathcal{Y}$ can be interpreted as a conditional Hölder constant of $X$. This local smoothness is reflected in the exact rate of convergence of the minimal errors established in Section 4. A result similar to Proposition 2.1 for SDEs driven by a multiplicative Wiener process has been obtained in [12], while for SDEs driven by an additive fractional Brownian motion with the Hurst parameter $H\in(0,1)$ it has been shown in Proposition 1 in [19].
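The unconditional limit in Proposition 2.1 can be verified by hand in the simplest driven case. The sketch below assumes $a\equiv 0$, $b\equiv 1$, $c\equiv 1$, $x_{0}=0$ and a constant intensity $\lambda$ (all choices made here for illustration), so that $X=W+N$ and $\mathcal{Y}\equiv 1+\lambda$. The increment $X(t+h)-X(t)$ is then the sum of an independent $N(0,h)$ variable and a Poisson variable with parameter $\lambda h$, whose second moment is $h+\lambda h+(\lambda h)^{2}$.

```python
import math

def increment_l2_norm(h, lam):
    """||X(t+h) - X(t)||_{L^2} for X = W + N with constant intensity lam.

    E|dW + dN|^2 = Var(dW) + E(dN^2) = h + lam*h + (lam*h)**2,
    since dW is centred and independent of dN.
    """
    return math.sqrt(h + lam * h + (lam * h) ** 2)

lam = 2.0
limit = math.sqrt(1.0 + lam)   # (E Y(t))^{1/2} with Y = |b|^2 + lam * |c|^2
for h in (1e-1, 1e-2, 1e-3, 1e-4):
    ratio = increment_l2_norm(h, lam) / math.sqrt(h)
    print(h, ratio, ratio - limit)
```

The difference in the last column decreases linearly in $h$, in line with the term $\lambda^{2}h$ under the square root.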

The problem considered in the paper is to find an optimal strong global approximation of the solution $X=\{X(t)\}_{t\in[0,T]}$ of (1). For any fixed $(a,b,c,\lambda,x_{0})$ an approximation of $X=X(a,b,c,\lambda,x_{0})$ is given by a method $\bar{X}=\bar{X}(a,b,c,\lambda,x_{0})$ . The method computes the approximation by using some information about the functions $a$ , $b$ , $c$ and $\lambda$ , the Poisson process $N$ and the Wiener process $W$ . We consider methods that are based on a finite number of observations of trajectories of the driving processes $N$ and $W$ at suitably chosen points from the interval $[0,T]$ . The cost of the method is measured by the total number of evaluations of the processes $N$ and $W$ .

We fix $(a,b,c,\lambda,x_{0})$ and we consider the corresponding equation (1). Any approximation method $\bar{X}=\{\bar{X}_{n}\}_{n\in\mathbb{N}}$ is defined by three sequences $\bar{\varphi}=\{\varphi_{n}\}_{n\in\mathbb{N}}$ , $\bar{\Delta}^{Z}=\{\Delta^{Z}_{n}\}_{n\in\mathbb{N}}$ , $Z\in\{N,W\}$ , where

$$\varphi_{n}:\mathbb{R}^{2n}\to L^{2}([0,T]),\tag{18}$$

is a measurable mapping and

$$\Delta_{n}^{Z}=\{t_{0,n}^{Z},t_{1,n}^{Z},\ldots,t_{n,n}^{Z}\},\tag{19}$$

is a partition of $[0,T]$ with

$$0=t_{0,n}^{Z}<t_{1,n}^{Z}<\ldots<t_{n,n}^{Z}=T,\tag{20}$$

for $Z\in\{N,W\}$. We have that $\{0,T\}\subset\Delta^{N}_{n}\cap\Delta^{W}_{n}$ for all $n$ and, in particular, we might have $\Delta^{N}_{n}\cap\Delta^{W}_{n}=\{0,T\}$ for some $n$, i.e., the two discretizations need not share any interior point. The sequences $\bar{\Delta}^{N}$, $\bar{\Delta}^{W}$ provide (not necessarily equidistant) discretizations of $[0,T]$ used for $N$ and $W$, respectively. In the literature one mostly has $\bar{\Delta}^{N}=\bar{\Delta}^{W}$, see, for example, Chapter 6 in [21]. Here, mainly for the lower bound, we allow more general discretizations. By

$$\bar{\mathcal{N}}(N,W)=\{\mathcal{N}_{n}(N,W)\}_{n\in\mathbb{N}},\tag{21}$$

we denote a sequence of vectors $\mathcal{N}_{n}$ of size $2n$ , which provides standard information with $n$ evaluations of the Poisson process and $n$ evaluations of the Wiener process at the discrete points from $\Delta^{N}_{n}\cup\Delta^{W}_{n}$ , i.e.,

[TABLE]

where $\mathcal{N}_{n}(Z)=[Z(t^{Z}_{1,n}),Z(t^{Z}_{2,n}),\ldots,Z(t^{Z}_{n,n})]$ for $Z\in\{N,W\}$ . Recall that $N(0)=W(0)=0$ . In particular, the sequences $\bar{\varphi}$ , $\bar{\Delta}$ may depend on functions $a$ , $b$ , $c$ , $\lambda$ and on $x_{0}$ but not on trajectories of the processes $N$ and $W$ . (Information (22) uses the same evaluation points for all trajectories of the Poisson and Wiener processes.) Therefore, information (21) about the processes $N$ and $W$ is nonadaptive. Moreover, since $\mathcal{N}_{n}(N,W)$ does not have to be contained in $\mathcal{N}_{n+1}(N,W)$ , the information (22) is called nonexpanding, see [24]. We stress that our model of computation covers the regular strong Taylor approximations and it excludes the jump-adapted time discretizations, since we do not assume the knowledge of the jump times for $N$ (see Chapters 6 and 8 in [21]). This restriction reflects our assumption that only nonadaptive standard information is available for the process $N$ .
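Nonadaptive standard information of this kind can be simulated directly from the increment laws recalled in Section 2. The following is a minimal sketch under stated assumptions: the function names, the Poisson sampler (which counts unit-rate exponential arrivals) and the example intensity $\lambda(t)=1+t$ with $m(t)=t+t^{2}/2$ are all illustrative choices, not part of the paper. Both meshes are fixed before any path is sampled, which is what makes the information nonadaptive.

```python
import math
import random

def poisson(mean, rng):
    """Poisson variate: number of unit-rate exponential arrivals in [0, mean]."""
    count, clock = 0, rng.expovariate(1.0)
    while clock <= mean:
        count += 1
        clock += rng.expovariate(1.0)
    return count

def standard_information(mesh_N, mesh_W, m, rng):
    """Return ([N(t_1), ..., N(t_n)], [W(t_1), ..., W(t_n)]).

    mesh_N and mesh_W are increasing lists starting at 0 and ending at T;
    m is the integrated intensity, so N(t) - N(s) is Poisson(m(t) - m(s)).
    """
    info_N, value = [], 0
    for s, t in zip(mesh_N, mesh_N[1:]):
        value += poisson(m(t) - m(s), rng)
        info_N.append(value)
    info_W, value = [], 0.0
    for s, t in zip(mesh_W, mesh_W[1:]):
        value += rng.gauss(0.0, math.sqrt(t - s))
        info_W.append(value)
    return info_N, info_W

# Hypothetical intensity lambda(t) = 1 + t on [0, 1], so m(t) = t + t^2 / 2.
m = lambda t: t + 0.5 * t * t
rng = random.Random(0)
mesh_N = [0.0, 0.4, 0.7, 0.9, 1.0]     # different interior points for N and W
mesh_W = [0.0, 0.25, 0.5, 0.75, 1.0]
info_N, info_W = standard_information(mesh_N, mesh_W, m, rng)
print(info_N, info_W)
```

Here each process is observed at $n=4$ points, and the two meshes share only the endpoints, which is allowed in the widest class of methods.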

After computing the information $\mathcal{N}_{n}(N,W)$ , we apply the mapping $\varphi_{n}$ in order to obtain the $n$ th approximation $\bar{X}_{n}=\{\bar{X}_{n}(t)\}_{t\in[0,T]}$ in the following way

$$\bar{X}_{n}=\varphi_{n}(\mathcal{N}_{n}(N,W)).\tag{23}$$

The $n$ th cost of the method $\bar{X}$ is the total number of evaluations of $N$ and $W$ used by the $n$ th approximation $\bar{X}_{n}$ , defined as follows

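$${\rm cost}_{n}(\bar{X})=\left\{\begin{array}{ll}2n,&\hbox{if}\ b\not\equiv 0\ \hbox{and}\ c\not\equiv 0,\\ n,&\hbox{if}\ (b\not\equiv 0\ \hbox{and}\ c\equiv 0)\ \hbox{or}\ (b\equiv 0\ \hbox{and}\ c\not\equiv 0),\\ 0,&\hbox{if}\ b\equiv 0\ \hbox{and}\ c\equiv 0.\end{array}\right.\tag{24}$$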

(If $b\equiv 0$ then we take formally $\mathcal{N}_{n}(W)$ to be a zero vector and the sequence $\bar{\Delta}^{W}$ can be arbitrary; we use analogous convention in the case when $c\equiv 0$ .) The set of all methods $\bar{X}=\{\bar{X}_{n}\}_{n\in\mathbb{N}}$ , defined as above, is denoted by $\chi^{\rm noneq\it}$ . Moreover, we consider the following subclasses of $\chi^{\rm noneq\it}$

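$$\chi^{\rm noneq*}=\{\bar{X}\in\chi^{\rm noneq}\ |\ \exists_{n_{0}^{*}=n_{0}^{*}(\bar{X})}:\forall_{n\geq n_{0}^{*}}\ \Delta_{n}^{N}=\Delta_{n}^{W}\},\tag{25}$$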

and

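$$\chi^{\rm eq}=\{\bar{X}\in\chi^{\rm noneq}\ |\ \exists_{n_{0}^{*}=n_{0}^{*}(\bar{X})}:\forall_{n\geq n_{0}^{*}}\ \Delta_{n}^{N}=\Delta_{n}^{W}=\{iT/n\ :\ i=0,1,\ldots,n\}\}.\tag{26}$$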

Methods based on the sequence of equidistant discretizations (19) belong to the class $\chi^{\rm eq\it}$, while the class $\chi^{\rm noneq*\it}$ contains methods that evaluate $N$ and $W$ at the same, possibly nonuniform, sampling points. We have that $\chi^{\rm eq\it}\subset\chi^{\rm noneq*\it}\subset\chi^{\rm noneq\it}$.
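The mesh constraints behind the three classes and the cost convention can be written down in a few lines. This is a sketch with hypothetical helper names; note that the class definitions only require the constraint for all sufficiently large $n$, so the check below concerns a single pair of meshes and not membership of a whole method in a class.

```python
def is_equidistant(mesh, T, tol=1e-12):
    """True if mesh equals {i*T/n : i = 0, ..., n} up to tol."""
    n = len(mesh) - 1
    return all(abs(t - i * T / n) <= tol for i, t in enumerate(mesh))

def smallest_class(mesh_N, mesh_W, T):
    """Name of the smallest class whose mesh constraint holds for this pair."""
    if mesh_N == mesh_W:
        return "eq" if is_equidistant(mesh_N, T) else "noneq*"
    return "noneq"

def cost(n, b_is_zero, c_is_zero):
    """n-th cost: number of evaluations of N and W that are actually needed."""
    if not b_is_zero and not c_is_zero:
        return 2 * n          # both W and N are observed n times
    if b_is_zero and c_is_zero:
        return 0              # deterministic equation, no observations
    return n                  # only one driving process matters

print(smallest_class([0.0, 0.5, 1.0], [0.0, 0.5, 1.0], 1.0))    # eq
print(smallest_class([0.0, 0.25, 1.0], [0.0, 0.25, 1.0], 1.0))  # noneq*
print(smallest_class([0.0, 0.25, 1.0], [0.0, 0.5, 1.0], 1.0))   # noneq
```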

The $n$ th error of a method $\bar{X}=\{\bar{X}_{n}\}_{n\in\mathbb{N}}$ is defined as

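$$e_{n}(\bar{X})=\|X-\bar{X}_{n}\|_{2}=\Biggl(\mathbb{E}\int\limits_{0}^{T}|X(t)-\bar{X}_{n}(t)|^{2}dt\Biggr)^{1/2}.\tag{27}$$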

The $n$ th minimal error, in the respective class of methods under consideration, is defined by

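$$e^{\diamond}(n)=\inf\limits_{\bar{X}\in\chi^{\diamond}}e_{n}(\bar{X}),\quad\diamond\in\{{\rm eq},{\rm noneq*},{\rm noneq}\}.\tag{28}$$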

We will investigate the exact rate of convergence of the $n$ th minimal errors (28) together with asymptotic constants. Moreover, we wish to determine (asymptotically) optimal methods $\bar{X}^{\diamond}$ , $\diamond\in\{\rm eq\it,\rm noneq*\it,\rm noneq\it\}$ , such that the $n$ th errors $e_{n}(\bar{X}^{\diamond})$ tend to zero as fast as $e^{\diamond}(n)$ when $n\to+\infty$ .
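To see what the $n$th error looks like in a case that can be computed by hand, consider the degenerate situation $X=W$ (that is, $a\equiv 0$, $b\equiv 1$, $c\equiv 0$, $x_{0}=0$, an assumption made only for this illustration) together with piecewise linear interpolation of the observed values of $W$. On each subinterval $[s,t]$ the error is a Brownian bridge with variance $(u-s)(t-u)/(t-s)$ at time $u$, whose integral over $[s,t]$ equals $(t-s)^{2}/6$. The function name below is hypothetical.

```python
import math

def error_linear_interpolation_of_W(mesh):
    """e_n for X = W when W is interpolated linearly between the mesh points.

    Each subinterval [s, t] contributes (t - s)**2 / 6 to the squared error.
    """
    return math.sqrt(sum((t - s) ** 2 for s, t in zip(mesh, mesh[1:])) / 6.0)

T, n = 2.0, 400
equidistant = [i * T / n for i in range(n + 1)]
scaled_error = math.sqrt(n) * error_linear_interpolation_of_W(equidistant)
print(scaled_error, T / math.sqrt(6.0))
```

The scaled error equals $T/\sqrt{6}$, which is also the value obtained by formally inserting $\mathcal{Y}\equiv 1$ into the right-hand side of (5); this case lies outside the hypothesis $b\not\equiv 0$ and $c\not\equiv 0$ under which (5) is stated, so it serves only as a sanity check of the order $n^{-1/2}$ and of the shape of the constant.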

3 Asymptotic lower bounds

In this section we investigate asymptotic lower bounds for the problem (1) in the classes of methods $\chi^{\diamond}$ , $\diamond\in\{\rm eq,\ noneq*,\ noneq\}$ . In the next section we give a construction of approximation methods which are asymptotically optimal. Their definitions will be inspired by the technique used for establishing lower bounds given in this section.

We give the definition of the continuous Milstein approximation and we state its properties that we use in order to establish the lower bounds. Moreover, in the next section we use it in order to construct asymptotically optimal methods.

Let $m\in\mathbb{N}$ and

$$0=t_{0}<t_{1}<\ldots<t_{m}=T,\tag{29}$$

be an arbitrary discretization of $[0,T]$ . We denote by

$$\Delta Z_{i}=Z(t_{i+1})-Z(t_{i}),\quad i=0,1,\ldots,m-1,\tag{30}$$

for $Z\in\{N,W\}$ . The continuous Milstein approximation $\tilde{X}^{M}_{m}=\{\tilde{X}^{M}_{m}(t)\}_{t\in[0,T]}$ based on (29) is defined as follows. We denote

[TABLE]

and we set

[TABLE]

and

[TABLE]

for $t\in[t_{i},t_{i+1}]$ , $i=0,1,\ldots,m-1$ , where

[TABLE]

for $Y,Z\in\{N,W\}$ . It is well-known that

[TABLE]

where $\tau_{k}$ is the $k$ th jump time of $N$ , and

[TABLE]

Moreover, $I_{t_{i},t}(W,W)$ , $I_{t_{i},t}(N,N)$ , $I_{t_{i},t}(N,W)$ , $I_{t_{i},t}(W,N)$ are independent of $\mathcal{F}_{t_{i}}$ , see Fact 6.2 in Appendix.

The main properties of $\tilde{X}^{M}_{m}$ are as follows. For every $m\in\mathbb{N}$ the process $\{\tilde{X}^{M}_{m}(t)\}_{t\in[0,T]}$ is adapted to $\{\mathcal{F}_{t}\}_{t\in[0,T]}$ and has càdlàg paths. The upper bounds on the error of $\tilde{X}^{M}_{m}$ are given in Theorem 6.1. Furthermore, under the commutativity condition (9) the random variables $\{\tilde{X}^{M}_{m}(t_{i})\}_{i=0}^{m}$ are measurable with respect to the sigma field

[TABLE]

In particular, this and independence of $N$ and $W$ imply that for all $t\in[t_{i},t_{i+1}]$ , $i=0,1,\ldots,m-1$

[TABLE]

where

[TABLE]

The conditional expectations appearing above can be computed explicitly. Namely, from Lemma 8 in [8] and Lemma 6.2 in Appendix we get by direct calculations

[TABLE]

We stress that for any $m$ the approximation $\{\tilde{X}^{M}_{m}(t)\}_{t\in[0,T]}$ is not an implementable numerical scheme in our model of computation (even under the commutativity condition (9)), since computation of a trajectory of $\tilde{X}^{M}_{m}$ requires complete knowledge of the corresponding trajectories of $N$ and $W$. However, if the condition (9) holds, by (35), (36) and (38), we can compute values of $\tilde{X}^{M}_{m}$ at the discrete points (29) using only function evaluations of $W$ and $N$ at (29).
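The displayed formulas defining the scheme are not reproduced in this text, so the following sketch relies on the standard form of the order 1.0 strong Taylor (Milstein) step for jump diffusions under the commutativity condition, as found in the literature on such schemes; it is an assumption and not a transcription of (32)-(33). The signature of `milstein_step` and the derivative argument `db_dy` are hypothetical. The point of the sketch is that one step uses only the increments of $W$ and $N$ over the subinterval, not the jump times.

```python
def milstein_step(a, b, c, db_dy, t, y, dt, dW, dN):
    """One Milstein step from (t, y) using only the increments dW and dN.

    Assumed standard form under L_{-1} b = L_1 c:
      I(W,W)          = (dW**2 - dt) / 2
      I(N,N)          = dN * (dN - 1) / 2
      I(N,W) + I(W,N) = dN * dW
    """
    b_val, c_val = b(t, y), c(t, y)
    y_jump = y + c_val
    return (y
            + a(t, y) * dt
            + b_val * dW
            + c_val * dN
            + 0.5 * b_val * db_dy(t, y) * (dW * dW - dt)     # L_1 b * I(W,W)
            + (b(t, y_jump) - b_val) * dN * dW               # L_{-1} b * mixed integrals
            + 0.5 * (c(t, y_jump) - c_val) * dN * (dN - 1))  # L_{-1} c * I(N,N)

# Hypothetical pure jump linear case: a = b = 0, c(t, y) = gamma * y.
gamma = 0.5
zero = lambda t, y: 0.0
c_lin = lambda t, y: gamma * y
after_two_jumps = milstein_step(zero, zero, c_lin, zero, 0.0, 1.0, 0.1, 0.0, 2)
print(after_two_jumps, (1.0 + gamma) ** 2)
```

In this pure jump linear case two jumps inside one step multiply the state by $(1+\gamma)^{2}$, and the step reproduces that value exactly without knowing where in the subinterval the jumps occurred.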

In order to characterize asymptotic lower bounds we define

[TABLE]

where the process $\{\mathcal{Y}(t)\}_{t\in[0,T]}$ is defined in (15). We have that

(i)

$0\leq C^{\rm noneq}\leq C^{\rm eq}$ ,

(ii)

$C^{\rm noneq}=C^{\rm eq}$ iff there exists $\gamma\geq 0$ such that for all $t\in[0,T]$

[TABLE]

(iii)

$C^{\rm eq}=0$ iff $C^{\rm noneq}=0$ iff $b(t,X(t))=0=c(t,X(t))$ for all $t\in[0,T]$ and almost surely.

We have the following result.

Theorem 3.1

Let us assume that the mappings $a$ , $b$ , $c$ and $\lambda$ satisfy the assumptions $(A)$ - $(E)$ .

(i)

Let $\bar{X}$ be an arbitrary method from $\chi^{\rm noneq}$ . Then

[TABLE]

(ii)

Let $\bar{X}$ be an arbitrary method from $\chi^{\rm noneq*}$ . If $b\not\equiv 0$ and $c\not\equiv 0$ then

[TABLE]

(iii)

Let $\bar{X}$ be an arbitrary method from $\chi^{\rm eq}$ . If $b\not\equiv 0\ \hbox{and}\ c\not\equiv 0$ then

[TABLE]

else

[TABLE]

Proof. We start by showing (49) in the case when $b\not\equiv 0$ and $c\not\equiv 0$ . Let $\bar{X}=\{\bar{X}_{n}\}_{n\in\mathbb{N}}\in\chi^{\rm noneq}$ be a method based on an arbitrary sequence of discretizations $\bar{\Delta}^{N}=\{\Delta^{N}_{n}\}_{n\in\mathbb{N}}$ and $\bar{\Delta}^{W}=\{\Delta^{W}_{n}\}_{n\in\mathbb{N}}$ , where each $\Delta^{N}_{n}$ and $\Delta^{W}_{n}$ is of the form (19). Every $\bar{X}_{n}$ uses information (22) about the processes $N$ and $W$ . Take any sequence $\{m_{n}\}_{n\in\mathbb{N}}$ of positive integers such that

[TABLE]

By $\hat{\Delta}=\{\hat{\Delta}_{n}\}_{n\in\mathbb{N}}$ we denote a sequence of discretizations given by $\{\hat{\Delta}_{n}\}_{n\in\mathbb{N}}=\{\Delta^{N}_{n}\cup\Delta^{W}_{n}\cup\Delta_{n}^{\rm eq\it}\}_{n\in\mathbb{N}}$ , where every set $\Delta_{n}^{\rm eq\it}$ of equidistant points is defined by $\Delta_{n}^{\rm eq\it}=\{jT/m_{n}\ |\ j=0,1,\ldots,m_{n}\}$ . Hence, for all $n\in\mathbb{N}$ ,

[TABLE]

with

[TABLE]

and

[TABLE]

Therefore, from (53) and (56) we have that

[TABLE]

and, since $\Delta^{\rm eq\it}_{n}\subset\hat{\Delta}_{n}$ for all $n\in\mathbb{N}$ ,

[TABLE]

We denote by $\mathcal{\hat{N}}(N,W)=\{\mathcal{\hat{N}}_{n}(N,W)\}_{n\in\mathbb{N}}$ , where each vector $\mathcal{\hat{N}}_{n}(N,W)$ consists of the values of $N$ and $W$ at $\hat{\Delta}_{n}$ , i.e.,

[TABLE]

Since $\Delta^{N}_{n}\cup\Delta^{W}_{n}\subset\hat{\Delta}_{n}$ for all $n\in\mathbb{N}$ , we have that

[TABLE]

Let us denote by $\{\tilde{X}^{M}_{k_{n}}\}_{n\in\mathbb{N}}$ the sequence of continuous Milstein approximations (32)-(33) based on the sequence of discretizations $\hat{\Delta}$ and which use the information $\mathcal{\hat{N}}(N,W)$ about the processes $N$ and $W$ . From Theorem 6.1 and (58) we have that

[TABLE]

where the positive constant $C$ does not depend on $n$ . Moreover, let

[TABLE]

for $Z\in\{N,W\}$ and $t\in[0,T]$ . Note that for any $t\in[\hat{t}_{i,n},\hat{t}_{i+1,n}]$ the random variable $\hat{Z}_{n}(t)$ is a convex combination of $Z(t)-Z(\hat{t}_{i,n})$ and $-(Z(\hat{t}_{i+1,n})-Z(t))$ . Hence, $\hat{Z}_{n}(t)$ is independent of $\mathcal{F}_{\hat{t}_{i,n}}$ for all $t\in[\hat{t}_{i,n},\hat{t}_{i+1,n}]$ and the processes $\{\hat{N}_{n}(t)\}_{t\in[0,T]}$ , $\{\hat{W}_{n}(t)\}_{t\in[0,T]}$ are independent. From (58), (60), (61), (39) and Lemma 6.3 we get

[TABLE]

Now, we analyze the asymptotic behavior of the first term in (64). From Lemma 8 in [8] we have that

[TABLE]

For $i=0,1,\ldots,k_{n}-1$ and $t\in(\hat{t}_{i,n},\hat{t}_{i+1,n})$ we define

[TABLE]

Of course $H_{i,n}\in C((\hat{t}_{i,n},\hat{t}_{i+1,n}))$ and it can be continuously extended to $[\hat{t}_{i,n},\hat{t}_{i+1,n}]$ , since $H(\hat{t}_{i,n}+)=\lambda(\hat{t}_{i,n})\cdot\Lambda(\hat{t}_{i+1,n},\hat{t}_{i,n})/(\hat{t}_{i+1,n}-\hat{t}_{i,n})$ and $H(\hat{t}_{i+1,n}-)=\lambda(\hat{t}_{i+1,n})\cdot\Lambda(\hat{t}_{i+1,n},\hat{t}_{i,n})/(\hat{t}_{i+1,n}-\hat{t}_{i,n})$ are finite. Therefore, by Lemma 6.2 and from the mean value theorems we get

[TABLE]

for some $\hat{d}_{i,n},\hat{\alpha}_{i,n},\hat{\beta}_{i,n},\hat{\gamma}_{i,n}\in[\hat{t}_{i,n},\hat{t}_{i+1,n}]$ , $i=0,1,\ldots,k_{n}-1$ . Next, for $f\in\{b,c\}$ we have from Theorem 6.1 that

[TABLE]

Therefore, for $(f,Z)\in\{(b,W),(c,N)\}$ we have by (65), (3) and (3) that

[TABLE]

This together with the Hölder inequality implies

[TABLE]

We have that

[TABLE]

where

[TABLE]

and, by (13),

[TABLE]

since $\displaystyle{|\sqrt{x}-\sqrt{y}|\leq\sqrt{|x-y|}}$ for all $x,y\geq 0$ . From the uniform continuity of $\lambda$ we get

[TABLE]

Hence, by (70), (71), (73) and Fact 6.1 (ii) we have

[TABLE]

Therefore, by (53), (64), (3) and (74) we obtain

[TABLE]

which ends the proof of (49) in the case when $b\not\equiv 0$ and $c\not\equiv 0$ . If $(b\not\equiv 0\ \hbox{and}\ c\equiv 0)$ or $(b\equiv 0\ \hbox{and}\ c\not\equiv 0)$ then $cost_{n}(\bar{X})=n$ ,

[TABLE]

and

[TABLE]

which yield

[TABLE]

For $b\equiv 0$ and $c\equiv 0$ we obtain a trivial lower bound. Finally, if $\bar{X}\in\chi^{\rm noneq*}$ , $b\not\equiv 0$ and $c\not\equiv 0$ then $cost_{n}(\bar{X})=2n$ and by (77) we get

[TABLE]

which completes the proof of (49). The proofs of (52) and (52) are straightforward modifications of the proofs of (49) and (50). Hence, we skip them here. $\blacksquare$

Remark 3.1

Theorem 3.1 gives nontrivial lower bounds only in the case when

[TABLE]

In this case the presented lower bounds still hold even if we allow methods to use arbitrary information about $a$ , $b$ and $c$ , for example, values of partial derivatives or values of arbitrary linear functionals. If $\mathbb{E}(\mathcal{Y}(t))=0$ for all $t\in[0,T]$ then (1) becomes (almost surely) a deterministic ODE. Then different lower bounds hold, see, for example, [13]. $\square$

4 Asymptotically optimal methods

We provide definitions of methods that are asymptotically optimal. The construction is inspired by the technique used for establishing the lower bounds in the previous section. We restrict our consideration to approximation methods based on the regular sequences of discretizations generated by a probability density function $\psi$ , see [28]. For the density $\psi$ we assume that

(P1)

$\psi\in C([0,T])$ and $\psi(t)>0$ for all $t\in[0,T]$ .

We will use the notation $\bar{\Delta}_{\psi}=\{\Delta_{\psi,n}\}_{n\in\mathbb{N}}$ for a sequence of discretizations generated by a density $\psi$ . The knots

[TABLE]

of the $n$ th discretization are given by

[TABLE]

Hence, by choosing such a density $\psi$ one gets a whole sequence of discretizations $\bar{\Delta}_{\psi}$ . For instance, the sequence of equidistant discretizations is obtained by taking $\psi\equiv 1/T$ . Since $0<\|\psi\|_{\infty}^{-1}\leq||1/\psi||_{\infty}<+\infty$ , we have for all $n\in\mathbb{N}$ and $i=0,1,\ldots,n-1$

[TABLE]
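The construction of a regular sequence of discretizations can be illustrated numerically. The following is a minimal sketch, assuming the usual definition of the knots as quantiles of the density, $\int_{0}^{t_{i,n}}\psi(s)ds=i/n$ ; the function name `knots_from_density`, the fine-grid size and the trapezoidal quadrature are illustrative choices and not part of the paper.

```python
import numpy as np

def knots_from_density(psi, T, n, grid_size=20001):
    """Approximate knots t_0 < ... < t_n with int_0^{t_i} psi(s) ds = i/n.

    psi must accept a NumPy array and be strictly positive on [0, T]
    (assumption (P1)); the quantile definition itself is an assumption here.
    """
    s = np.linspace(0.0, T, grid_size)
    vals = psi(s)
    # Cumulative trapezoidal integral of psi on the fine grid.
    increments = 0.5 * (vals[1:] + vals[:-1]) * np.diff(s)
    cdf = np.concatenate(([0.0], np.cumsum(increments)))
    cdf /= cdf[-1]  # renormalise so that the last level is exactly 1
    levels = np.arange(n + 1) / n
    # Invert the strictly increasing distribution function by interpolation.
    t = np.interp(levels, cdf, s)
    t[0], t[-1] = 0.0, T
    return t

# Equidistant mesh: psi == 1/T.
T = 2.0
uniform = knots_from_density(lambda s: np.full_like(s, 1.0 / T), T, 8)
# A non-uniform example: psi(s) = (1 + s) / 1.5 on [0, 1].
graded = knots_from_density(lambda s: (1.0 + s) / 1.5, 1.0, 4)
```

For the second density the knots can be checked by hand, since the distribution function is $(s+s^{2}/2)/1.5$ and hence $t_{i,n}=-1+\sqrt{1+3i/n}$ .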

We now provide a construction of asymptotically optimal approximation methods. The definition of this method is inspired by the dominating term in the estimation (64). Denote by $\tilde{X}^{M}_{\psi}=\{\tilde{X}^{M}_{\psi,n}\}_{n\in\mathbb{N}}$ the sequence of continuous Milstein approximations (32)-(33) based on the sequence of discretizations $\bar{\Delta}_{\psi}$ . For a given density $\psi$ , we define the method $\bar{X}_{\psi}^{cM}=\{\bar{X}_{\psi,n}^{cM}\}_{n\in\mathbb{N}}$ by

[TABLE]

where $\mathcal{N}_{\psi,n}(N,W)$ consists of values of the processes $N$ and $W$ at the points $\Delta_{\psi,n}$ . (Hence, we formally take $\bar{\Delta}^{W}=\bar{\Delta}^{N}=\bar{\Delta}_{\psi}$ .) We call (83) the conditional Milstein method. We have that $\bar{X}_{\psi,n}^{cM}\in\chi^{\rm noneq*}$ . We present an explicit formula for the algorithm (83) in order to show that it has a form that is allowed in our model of computation. By (9), (33), (83) and (41)-(45) each term $\bar{X}_{\psi,n}^{cM}$ can be written as

[TABLE]

for $t\in[t_{i,n},t_{i+1,n}]$ , $i=0,1,\ldots,n-1$ and $\bar{X}^{cM}_{\psi,n}(0)=x_{0}$ . Note that $\bar{X}_{\psi,n}^{cM}$ has continuous trajectories and coincides with $\tilde{X}^{M}_{\psi,n}$ at the discretization points. In general, the method $\bar{X}_{\psi,n}^{cM}$ is not equal to the piecewise linear interpolation $\bar{X}_{\psi,n}^{Lin-M}$ of the classical Milstein steps, defined as

[TABLE]

for $t\in[t_{i,n},t_{i+1,n}]$ , $i=0,1,\ldots,n-1$ , see Remark 4.3. However, we use the method $\bar{X}_{\psi,n}^{cM}$ in order to investigate the error of $\bar{X}_{\psi,n}^{Lin-M}$ and we show in the sequel that they behave asymptotically in the same way. Moreover, for a fixed discretization $\Delta_{\psi,n}$ the method $\bar{X}_{\psi,n}^{Lin-M}$ does not evaluate $\Lambda$ and its implementation, at least in the case when $\psi\equiv 1/T$ , is straightforward.
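To make the piecewise linear method concrete, here is a minimal sketch for the scalar case. It assumes the standard form of the Milstein step under jump commutativity, with $L_{1}f=b\,\partial f/\partial y$ , $L_{-1}f(t,y)=f(t,y+c(t,y))-f(t,y)$ , $I(W,W)=(\Delta W^{2}-h)/2$ , $I(N,N)=(\Delta N^{2}-\Delta N)/2$ and $I(N,W)+I(W,N)=\Delta N\cdot\Delta W$ ; these formulas are an assumption about the scheme and should be checked against (32)-(33). The names `milstein_knots`, `lin_milstein` and `db_dy` are hypothetical.

```python
import numpy as np

def milstein_knots(a, b, c, db_dy, x0, t, W, N):
    """Milstein values at the knots t, using only W(t_i) and N(t_i).

    Assumed step (jump-commutative, scalar case); not taken verbatim
    from the paper. db_dy is the partial derivative of b in y.
    """
    t = np.asarray(t, dtype=float)
    x = np.empty(len(t))
    x[0] = x0
    for i in range(len(t) - 1):
        h = t[i + 1] - t[i]
        dW = W[i + 1] - W[i]
        dN = N[i + 1] - N[i]
        ti, y = t[i], x[i]
        L1b = b(ti, y) * db_dy(ti, y)            # L_1 b
        Lm1c = c(ti, y + c(ti, y)) - c(ti, y)    # L_{-1} c
        Lm1b = b(ti, y + c(ti, y)) - b(ti, y)    # L_{-1} b (= L_1 c if commutative)
        x[i + 1] = (y + a(ti, y) * h + b(ti, y) * dW + c(ti, y) * dN
                    + L1b * 0.5 * (dW ** 2 - h)
                    + Lm1c * 0.5 * (dN ** 2 - dN)
                    + Lm1b * dN * dW)
    return x

def lin_milstein(tq, t, x):
    """Piecewise linear interpolation of the Milstein values at the knots."""
    return np.interp(tq, t, x)

# Illustrative linear coefficients a = r*y, b = sigma*y, c = y.
r, sigma = 0.05, 0.2
a = lambda t, y: r * y
b = lambda t, y: sigma * y
c = lambda t, y: y
db = lambda t, y: sigma
x = milstein_knots(a, b, c, db, 1.0, [0.0, 0.01], [0.0, 0.1], [0, 1])
mid = lin_milstein(0.005, [0.0, 0.01], x)
```

Note that the sketch never evaluates the intensity $\lambda$ or $\Lambda$ , in line with the remark above.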

In the following theorem we give the exact convergence rate of the errors for the methods $\bar{X}_{\psi}^{cM}$ and $\bar{X}_{\psi}^{Lin-M}$ in terms of the following asymptotic constant

[TABLE]

The strategy of the proof goes as follows. First, we analyze the error of the conditional Milstein method $\bar{X}_{\psi}^{cM}$ . Due to its definition given by the conditional expectation (83) this can be done by using some estimates already established in the proof of Theorem 3.1. Then we show that $\bar{X}_{\psi}^{Lin-M}$ is sufficiently close to $\bar{X}_{\psi}^{cM}$ . This will give us the asymptotic error for the piecewise linear interpolation method $\bar{X}_{\psi}^{Lin-M}$ .

Theorem 4.1

Let us assume that the mappings $a$ , $b$ , $c$ , $\lambda$ and $\psi$ satisfy the assumptions $(A)$ - $(E)$ and $(P1)$ , and let $\bar{X}_{\psi}\in\{\bar{X}^{cM}_{\psi},\bar{X}^{Lin-M}_{\psi}\}$ . Then if $b\not\equiv 0$ and $c\not\equiv 0$

[TABLE]

else

[TABLE]

Proof. From Theorem 6.1 and (82) we get

[TABLE]

where a constant $C>0$ does not depend on $n$ . Moreover, the equality (81) and the integral mean value theorem yield

[TABLE]

As in the proof of Theorem 3.1, we use the notation

[TABLE]

for $Z\in\{N,W\}$ . From (89), (90), Lemma 6.3 and by proceeding analogously as in the proof of Theorem 3.1 we arrive at

[TABLE]

for some $d_{i,n},\alpha_{i,n},\beta_{i,n},\gamma_{i,n}\in[t_{i,n},t_{i+1,n}]$ , $i=0,1,\ldots,n-1$ . Moreover, we have

[TABLE]

where

[TABLE]

By Fact 6.1 (i), (13) and (82) we get

[TABLE]

and

[TABLE]

This and the uniform continuity of $\lambda$ imply

[TABLE]

By (4), (4), (97) and Fact 6.1 (ii) we obtain

[TABLE]

which ends the proof of (87) for $\bar{X}_{\psi}=\bar{X}^{cM}_{\psi}$ .

We now analyze the error of $\bar{X}_{\psi,n}^{Lin-M}$ . Note that

[TABLE]

for $t\in[t_{i,n},t_{i+1,n}]$ , $i=0,1,\ldots,n-1$ . In addition

[TABLE]

for $t\in[t_{i,n},t_{i+1,n}]$ , $i=0,1,\ldots,n-1$ . For $f\in\{b,c\}$ and $j\in\{-1,1\}$ the random variable $L_{j}f(t_{i,n},\tilde{X}^{M}_{\psi,n}(t_{i,n}))$ is $\mathcal{F}_{t_{i,n}}$ -measurable and the estimate (180) holds for $U_{i}:=(t_{i,n},\tilde{X}^{M}_{\psi,n}(t_{i,n}))$ . Hence, it is independent of $I_{t_{i,n},t_{i+1,n}}(N,N)$ , $I_{t_{i,n},t_{i+1,n}}(W,W)$ and $\Delta N_{i,n}\cdot\Delta W_{i,n}$ . Therefore, by (82) we have that

[TABLE]

for $t\in[t_{i,n},t_{i+1,n}]$ , $i=0,1,\ldots,n-1$ . Since, from (100)

[TABLE]

we obtain (87) for $\bar{X}_{\psi}=\bar{X}^{Lin-M}_{\psi}$ . This ends the proof. $\blacksquare$

Let us now assume that the following additional assumption is satisfied:

(P2)

$\inf\limits_{t\in[0,T]}\mathbb{E}(\mathcal{Y}(t))>0$ .

The methods $\bar{X}^{cM}_{\psi}$ and $\bar{X}^{Lin-M}_{\psi}$ obtain the exact rate of convergence $n^{-1/2}$ , with the asymptotic constant $C_{\psi}$ which depends on $\psi$ . The best density $\psi_{0}$ , which is unique and minimizes $C_{\psi}$ among all positive mappings $\psi\in C([0,T])$ such that $\displaystyle{\int\limits_{0}^{T}\psi(t)dt=1}$ , is

[TABLE]

(The minimization property of $\psi_{0}$ follows from the application of the Hölder inequality.) We stress that $\psi_{0}$ is strictly positive in $[0,T]$ under the additional assumption (P2). Furthermore,

[TABLE]
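The effect of the density on the asymptotic constant can be explored numerically. The sketch below assumes that $C_{\psi}=\frac{1}{\sqrt{6}}\bigl(\int_{0}^{T}\mathbb{E}(\mathcal{Y}(t))/\psi(t)\,dt\bigr)^{1/2}$ , so that $\psi\equiv 1/T$ gives $C^{\rm eq}=\sqrt{T/6}\,\bigl(\int_{0}^{T}\mathbb{E}(\mathcal{Y}(t))dt\bigr)^{1/2}$ and $\psi_{0}\propto(\mathbb{E}(\mathcal{Y}(t)))^{1/2}$ gives $C^{\rm noneq}=\frac{1}{\sqrt{6}}\int_{0}^{T}(\mathbb{E}(\mathcal{Y}(t)))^{1/2}dt$ ; this reading is consistent with the constants quoted in the text but is stated here as an assumption. The function `EY` (a user-supplied model of $t\mapsto\mathbb{E}(\mathcal{Y}(t))$ ) and all other names are hypothetical.

```python
import numpy as np

def trapezoid(y, x):
    """Composite trapezoidal rule on a (possibly non-uniform) grid."""
    return float(np.sum(0.5 * (y[1:] + y[:-1]) * np.diff(x)))

def optimal_density_and_constants(EY, T, grid_size=10001):
    """Return the grid, psi_0 on the grid, C^noneq and C^eq.

    EY must accept a NumPy array and be strictly positive (assumption (P2)).
    """
    s = np.linspace(0.0, T, grid_size)
    ey = EY(s)
    root = np.sqrt(ey)
    norm = trapezoid(root, s)
    psi0 = root / norm                      # integrates to one by construction
    c_noneq = norm / np.sqrt(6.0)
    c_eq = np.sqrt(T / 6.0) * np.sqrt(trapezoid(ey, s))
    return s, psi0, c_noneq, c_eq

# Exponentially growing local variability: the graded mesh should win.
s, psi0, c_noneq, c_eq = optimal_density_and_constants(lambda u: np.exp(2.0 * u), 1.0)
# Constant local variability: both constants coincide.
_, _, k_noneq, k_eq = optimal_density_and_constants(lambda u: np.full_like(u, 4.0), 3.0)
```

By the Cauchy-Schwarz inequality the first constant never exceeds the second, with equality exactly when $\mathbb{E}(\mathcal{Y}(t))$ is constant in $t$ .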

The following fact characterizes the case when the equidistant sampling is the optimal one.

Fact 4.1

Let us assume that the mappings $a$ , $b$ , $c$ and $\lambda$ satisfy the assumptions $(A)$ - $(E)$ and $(P2)$ . Then the following assertions are equivalent.

(i)

$\psi_{0}\equiv 1/T$ .

(ii)

$\displaystyle{\mathbb{E}(\mathcal{Y}(t))=\frac{1}{T}\int\limits_{0}^{T}\Bigl{(}\mathbb{E}(\mathcal{Y}(s))\Bigr{)}^{1/2}ds}$ for all $t\in[0,T]$ .

(iii)

$C^{\rm noneq}=C^{\rm eq}>0$ .

Proof. The assertion can easily be shown by proving the implications $(i)\Rightarrow(ii)\Rightarrow(iii)\Rightarrow(i)$ and we leave it to the reader. $\blacksquare$

From Theorem 4.1 we directly obtain the following result.

Corollary 4.1

Let us assume that the mappings $a$ , $b$ , $c$ and $\lambda$ satisfy the assumptions $(A)$ - $(E)$ .

(i)

Let us moreover assume that the assumption (P2) is satisfied. If $b\not\equiv 0$ and $c\not\equiv 0$ then for $\bar{X}_{\psi_{0}}\in\{\bar{X}^{cM}_{\psi_{0}},\bar{X}^{Lin-M}_{\psi_{0}}\}$ it holds

[TABLE]

else

[TABLE]

(ii)

Let $\bar{X}_{1/T}\in\{\bar{X}^{cM}_{1/T},\bar{X}^{Lin-M}_{1/T}\}$ . If $b\not\equiv 0$ and $c\not\equiv 0$ then it holds

[TABLE]

else

[TABLE]

Theorem 3.1 and Corollary 4.1 imply the main result of the paper.

Theorem 4.2

Let us assume that the mappings $a$ , $b$ , $c$ and $\lambda$ satisfy the assumptions $(A)$ - $(E)$ .

(i)

Let us additionally assume that the assumption (P2) is satisfied. If ( $b\not\equiv 0$ and $c\equiv 0$ ) or ( $b\equiv 0$ and $c\not\equiv 0$ ) then

[TABLE]

and the methods $\bar{X}^{cM}_{\psi_{0}},\bar{X}^{Lin-M}_{\psi_{0}}$ , where $\psi_{0}$ is defined in (102), are asymptotically optimal in the class $\chi^{\rm noneq}$ . If $b\not\equiv 0$ and $c\not\equiv 0$ then

[TABLE]

(ii)

If the assumption (P2) is satisfied, $b\not\equiv 0$ and $c\not\equiv 0$ then

[TABLE]

and the methods $\bar{X}^{cM}_{\psi_{0}},\bar{X}^{Lin-M}_{\psi_{0}}$ are asymptotically optimal in the class $\chi^{\rm noneq*}$ .

(iii)

We have that

[TABLE]

and the methods $\bar{X}^{cM}_{1/T},\bar{X}^{Lin-M}_{1/T}$ are asymptotically optimal in the class $\chi^{\rm eq}$ .

As we can see, the optimal rate of convergence of the minimal errors in the classes $\chi^{\rm eq}$ and $\chi^{\rm noneq*}$ is proportional to $n^{-1/2}$ , where $n$ is the total number of evaluations of $N$ and $W$ . In the class $\chi^{\rm noneq}$ we have a gap between upper and lower asymptotic constants. We conjecture that (108) holds also if $b\not\equiv 0$ and $c\not\equiv 0$ .

We end this section with the following remarks.

Remark 4.1

Theorem 4.2 implies that the error can be reduced asymptotically by the factor

[TABLE]

if we use the optimal discretization instead of the equidistant one. However, the optimal density $\psi_{0}$ and the optimal sampling $\{t_{i,n}\}_{i=0}^{n}$ , defined by

[TABLE]

can be computed explicitly only in particular cases, see, for example, Section 4.1. Moreover, the additional assumption (P2) is required. We plan to overcome these difficulties in future work. $\square$

Remark 4.2

If $c\equiv 0$ , $b\not\equiv 0$ and $T=1$ then, for the classes $\chi^{\rm eq}$ and $\chi^{\rm noneq}$ , Theorem 4.2 recovers the results of Theorem 2 (iii) and Proposition 2 from [12] in the Gaussian case, while if $c=c(t)$ , $b\equiv 0$ and $\lambda=const$ then we get Theorem 4.2 from [26] for the pure jump case with an additive Poisson noise. Beyond the results of the present paper, the author of [26] also established a method based on an adaptive stepsize control that does not depend on the knowledge of $\lambda$ . The problem of defining such methods for SDEs of the general type (1) will be the topic of our future work. $\square$

Remark 4.3

We have $\bar{X}^{cM}_{\psi}\equiv\bar{X}^{Lin-M}_{\psi}$ , if $\lambda=const$ , $b=b(t)$ and $c=c(t)$ . $\square$

4.1 Linear case - Merton's jump diffusion model

Let us consider the following SDE

[TABLE]

that models the stock price in Merton's model, see [21]. We assume $\lambda$ to be a constant function and $r\in\mathbb{R}$ , $\sigma>0$ . The solution of (111) is

[TABLE]

We denote $\gamma=r+\sigma^{2}/2+3\lambda/2$ and we have

[TABLE]

If $\gamma=0$ then the optimal sampling is the equidistant one and $C^{\rm noneq}=C^{\rm eq}=Tx_{0}\sqrt{\frac{\sigma^{2}+\lambda}{6}}$ . If $\gamma\neq 0$ then we obtain the following optimal sampling for (111)

[TABLE]

We have that $t_{i,n}\to iT/n$ for $\gamma\to 0$ . Since $C^{\rm noneq}/C^{\rm eq}$ behaves as $\sqrt{2/(\gamma T)}$ when $\gamma T\to+\infty$ , the nonequidistant mesh yields a substantially smaller error when $\gamma T$ is large.
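The statements of this subsection can be checked with a few lines of code. The sketch below assumes that $\mathbb{E}(\mathcal{Y}(t))$ is proportional to $e^{2\gamma t}$ , which gives $\psi_{0}(t)=\gamma e^{\gamma t}/(e^{\gamma T}-1)$ , the knots $t_{i,n}=\gamma^{-1}\ln\bigl(1+\tfrac{i}{n}(e^{\gamma T}-1)\bigr)$ and the ratio $C^{\rm noneq}/C^{\rm eq}=\frac{e^{x}-1}{x}\sqrt{\frac{2x}{e^{2x}-1}}$ with $x=\gamma T$ ; these closed forms are derived under that assumption and should be compared with (113)-(114). The function names are hypothetical.

```python
import math

def merton_optimal_knots(gamma, T, n):
    """Knots t_{i,n}, i = 0..n, for psi_0 proportional to exp(gamma * t)."""
    if gamma == 0.0:
        return [i * T / n for i in range(n + 1)]
    g = math.expm1(gamma * T)  # exp(gamma*T) - 1, accurate for small gamma*T
    return [math.log1p(i / n * g) / gamma for i in range(n + 1)]

def merton_constant_ratio(gamma, T):
    """Ratio C^noneq / C^eq as a function of x = gamma * T (assumed form)."""
    x = gamma * T
    if x == 0.0:
        return 1.0
    # Both factors are positive for every x != 0; expm1(2x) overflows for huge x.
    return math.expm1(x) / x * math.sqrt(2.0 * x / math.expm1(2.0 * x))

knots = merton_optimal_knots(1.5, 2.0, 10)
ratio_small = merton_constant_ratio(1e-6, 1.0)
ratio_large = merton_constant_ratio(50.0, 1.0)
```

For $\gamma>0$ the knots cluster near $t=T$ , where the second moment of the solution is largest, and `ratio_large` is close to the limit value $\sqrt{2/50}=0.2$ .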

5 Conclusions

We investigated the minimal asymptotic errors for strong global approximation of SDEs driven by the Poisson and Wiener processes. We considered the cases of equidistant and nonequidistant sampling of $N$ and $W$ . In both cases, we showed that the minimal error tends to zero like $Cn^{-1/2}$ , where $C$ is an average in time of a local Hölder constant of $X$ and $n$ is the number of evaluations of $N$ and $W$ . However, the asymptotic constant $C$ in the case of equidistant sampling can be considerably larger than the asymptotic constant when nonuniform mesh is used. We provided a construction of methods that asymptotically achieve the established minimal errors.

In this paper, we addressed the case when sampling points for the processes $N$ and $W$ are chosen only in the nonadaptive way with respect to $N$ and $W$ . Moreover, we assumed that the diffusion and jump coefficients satisfy the jump commutativity condition. For the adaptive sampling and non-commutative case preliminary considerations indicate that the direct application of methods developed in this paper is not possible. Further extension of the presented analysis is needed in that case and we postpone this problem to our future work.

Acknowledgments

Part of this work was done at Banff International Research Station for Mathematical Innovation and Discovery (BIRS), Alberta, Canada, where the author participated in the workshop "Approximation of High-Dimensional Numerical Problems - Algorithms, Analysis and Applications", Fall 2015. I would like to thank the Staff of the BIRS for great hospitality.

6 Appendix

We use the following version of the Itô formula for semimartingales with jumps, see, for example, [29] or [22].

Lemma 6.1

Let us assume that the mappings $a$ , $b$ , $c$ and $\lambda$ satisfy the assumptions $(B1)$ , $(B2)$ and $(E)$ . Let a function $U:\mathbb{R}\to\mathbb{R}$ belong to $C^{2}(\mathbb{R})$ . Then for the solution $X$ of (1) it holds

[TABLE]

The proof of the following fact is straightforward.

Fact 6.1

Let the mappings $a,b,c$ and $\lambda$ satisfy the assumptions $(B1)$ , $(B2)$ and $(E)$ .

(i)

There exists a constant $C_{1}>0$ such that for all $f\in\{b,c\}$ and $t,s\in[0,T]$ we have

[TABLE]

(ii)

The mapping

[TABLE]

is continuous.

(iii)

There exists a constant $C_{2}>0$ such that

[TABLE]

Fact 6.2

(i)

There exists $C>0$ such that for all $0\leq s\leq t\leq T$ and $Y,Z\in\{N,W\}$ we have

[TABLE]

(ii)

For all $0\leq s\leq t\leq T$ and $Y,Z\in\{N,W\}$ the stochastic integral $I_{s,t}(Y,Z)$ is independent of $\mathcal{F}_{s}$ .

Proof. The proof of (i) can be derived in a straightforward way from (6), (38), the isometry for stochastic integrals driven by martingales and the independence of $W$ and $N$ . Hence, we skip it.

For the proof of (ii) note that directly from (35) and (36) we get that $I_{s,t}(Y,Y)$ , $Y\in\{N,W\}$ , is independent of $\mathcal{F}_{s}$ . So the only case of interest is when $(Y,Z)\in\{(N,W),(W,N)\}$ .

Fix $s,t\in[0,T]$ , $s\leq t$ , and let $\Delta_{m}=\{\alpha_{0,m},\alpha_{1,m},\ldots,\alpha_{m,m}\}$ , $m\in\mathbb{N}$ , be a sequence of discretizations of $[s,t]$ such that $s=\alpha_{0,m}<\alpha_{1,m}<\ldots<\alpha_{m,m}=t$ and $\lim\limits_{m\to+\infty}\|\Delta_{m}\|=0$ , where $\|\Delta_{m}\|=\max\limits_{0\leq i\leq m-1}(\alpha_{i+1,m}-\alpha_{i,m})$ . Moreover, let

[TABLE]

We have that

[TABLE]

Therefore, the sequence $\{I_{s,t}^{m}(N,W)\}_{m\in\mathbb{N}}$ converges also in probability and, by the independence of the increments of $N$ and $W$ , every random variable $I_{s,t}^{m}(N,W)$ is independent of $\mathcal{F}_{s}$ . Hence, the limit $I_{s,t}(N,W)$ is also independent of $\mathcal{F}_{s}$ . By (38) we get that also $I_{s,t}(W,N)$ is independent of $\mathcal{F}_{s}$ . $\blacksquare$

The proof of Proposition 2.1. By the Markov property of the solution $X$ we have that $\|X(t+h)-X(t)\ |\ X(t)\|_{L^{2}(\Omega)}=\|X(t+h)-X(t)\ |\ \mathcal{F}_{t}\|_{L^{2}(\Omega)}$ . For all $t\in[0,T)$ and $h>0$ such that $0\leq t<t+h\leq T$ we have

[TABLE]

From (10) and (E) we obtain that

[TABLE]

almost surely. By Theorem 88 in [29] we obtain for all $t\in[0,T)$ and almost surely

[TABLE]

and

[TABLE]

since $(W(t)\cdot\tilde{N}(t),\mathcal{F}_{t})_{t\in[0,T]}$ is a martingale. Therefore, by Minkowski's inequality for conditional expectations (see [5]), we have that

[TABLE]

almost surely. From (13), Fact 6.1 (iii) and Lebesgue's dominated convergence theorem for conditional expectations (see [5]) we have for all $t\in[0,T)$ and almost surely that

[TABLE]

and

[TABLE]

since $X$ and $\mathcal{Y}$ have càdlàg paths and $\mathcal{Y}(t)$ is $\mathcal{F}_{t}$ -measurable. This together with (6) yields (16). Now, (17) follows from (16) and Lebesgue's dominated convergence theorem. $\blacksquare$

Lemma 6.2

Let $m\in\mathbb{N}$ and let

[TABLE]

be an arbitrary discretization of the interval $[0,T]$ and

[TABLE]

Then for all $i=0,1,\ldots,m-1$ and $t\in[t_{i},t_{i+1}]$

(i)

[TABLE]

almost surely,

(ii)

[TABLE]

almost surely and, in particular,

[TABLE]

Proof. For $t=t_{i}$ , $i=0,1,\ldots,m$ , we directly get (131), (132) and (133). By the results of [2], from the fact that the process $N$ has independent increments and by direct calculations we obtain that, conditioned on $\mathcal{N}_{m}(N)$ and for $t\in(t_{i},t_{i+1})$ , $i=0,1,\ldots,m-1$ , the increment $N(t)-N(t_{i})$ is a binomial random variable with the number of trials $N(t_{i+1})-N(t_{i})$ and with the probability of success in each trial equal to $\displaystyle{\frac{\Lambda(t,t_{i})}{\Lambda(t_{i+1},t_{i})}}$ . Now, the rest of the proof proceeds analogously to the proof of Lemma 3.1 in [26]. $\blacksquare$
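The binomial structure used in the proof can be illustrated by simulation. The following is a minimal sketch, assuming $\Lambda(t,s)=\int_{s}^{t}\lambda(u)du$ and the illustrative intensity $\lambda(u)=1+u$ ; the function names and the chosen numbers are hypothetical. It computes the success probability, the resulting conditional mean $N(t_{i})+(N(t_{i+1})-N(t_{i}))\cdot\Lambda(t,t_{i})/\Lambda(t_{i+1},t_{i})$ , and draws samples of $N(t)$ given the two endpoint values.

```python
import numpy as np

def Lambda(t, s):
    """Integrated intensity over [s, t] for the example lambda(u) = 1 + u."""
    return (t - s) + 0.5 * (t ** 2 - s ** 2)

def bridge_probability(t_i, t, t_next):
    """Success probability Lambda(t, t_i) / Lambda(t_next, t_i)."""
    return Lambda(t, t_i) / Lambda(t_next, t_i)

def conditional_mean(n_i, n_next, p):
    """Mean of N(t) given N(t_i) = n_i and N(t_next) = n_next."""
    return n_i + (n_next - n_i) * p

def sample_bridge(rng, n_i, n_next, p, size=None):
    """Draw N(t) given the endpoint values: n_i plus a binomial increment."""
    return n_i + rng.binomial(n_next - n_i, p, size=size)

p = bridge_probability(0.0, 0.5, 1.0)          # 0.625 / 1.5
mean = conditional_mean(3, 9, p)               # 3 + 6 * p
rng = np.random.default_rng(0)
draws = sample_bridge(rng, 3, 9, p, size=200_000)
```

For a constant intensity the probability reduces to $(t-t_{i})/(t_{i+1}-t_{i})$ , so the conditional mean is the linear interpolant of the two observed values.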

We provide a result concerning an upper bound on the error for the continuous Milstein approximation $\tilde{X}^{M}_{m}$ . A similar result has been shown in Theorem 6.4.1 in [21], however, under slightly stronger assumptions. In particular, we do not assume the existence of a continuous partial derivative $\partial f/\partial t$ for $f\in\{a,b,c\}$ and we do not impose here any Lipschitz conditions on the second partial derivative of $f=f(t,y)$ , $f\in\{a,b,c\}$ , with respect to $y$ . Moreover, we consider here a nonstationary Poisson process, while in [21] Theorem 6.4.1 has been proven for stationary point processes.

Theorem 6.1

Let us assume that the mappings $a$ , $b$ , $c$ and $\lambda$ satisfy the assumptions $(A)$ , $(B)$ , $(C)$ and $(E)$ . Let $m\in\mathbb{N}$ and let (29) be an arbitrary discretization of the interval $[0,T]$ . Then for the continuous Milstein approximation $\tilde{X}^{M}_{m}$ , based on the mesh (29), we have that

[TABLE]

and

[TABLE]

where $C_{1},C_{2}>0$ do not depend on $m$ .

Proof. Recall that $U_{i}=(t_{i},\tilde{X}^{M}_{m}(t_{i}))$ and $L_{j}f(U_{i})$ is $\mathcal{F}_{t_{i}}$ -measurable for $f\in\{b,c\}$ , $j\in\{-1,1\}$ . First, we show that

[TABLE]

We proceed by induction. Let us assume that $\|\tilde{X}^{M}_{m}(t_{k})\|_{L^{2}(\Omega)}<+\infty$ for $k=0,1,\ldots,i$ and some $i$ . (The assumption is fulfilled for $i=0$ .) By (10), (12), (33) and Fact 6.2 we have for all $k=0,1,\ldots,i$ and $t\in[t_{k},t_{k+1}]$ that

[TABLE]

Hence, $\sup\limits_{t\in[t_{i},t_{i+1}]}\|\tilde{X}^{M}_{m}(t)\|_{L^{2}(\Omega)}<+\infty$ and, in particular, $\|\tilde{X}^{M}_{m}(t_{i+1})\|_{L^{2}(\Omega)}<+\infty$ . Therefore, we get $\max\limits_{i=0,1,\ldots,n}\|\tilde{X}^{M}_{m}(t_{i})\|_{L^{2}(\Omega)}<+\infty$ and (136).

We now justify (135). The solution $X=X(t)$ of (1) and the continuous Milstein approximation $\tilde{X}^{M}_{m}=\tilde{X}^{M}_{m}(t)$ can be written as

[TABLE]

where

[TABLE]

and

[TABLE]

We have for all $t\in[0,T]$ that

[TABLE]

where

[TABLE]

We get from the Hölder inequality and Lemma 2.1 for all $t\in[0,T]$ that

[TABLE]

From Lemma 6.1 applied to $U(x)=a(t_{i},x)$ and (6) we have that for $s\in[t_{i},t_{i+1}]$

[TABLE]

We denote for $f\in\{a,b,c\}$ and $u\in(t_{i},t_{i+1}]$

[TABLE]

We have

[TABLE]

where

[TABLE]

for all $t\in[0,T]$ . By the Hölder inequality, (10), (11) and Lemma 2.1 we have

[TABLE]

By Theorem 6.5.8 in [17] and Theorem 88 (iii) in [29] we obtain for all $t\in[0,T]$

[TABLE]

We estimate (151) analogously as (151) and we get for all $t\in[0,T]$ that

[TABLE]

Hence, by (148), (153), (154) and (156) we arrive at

[TABLE]

for all $t\in[0,T]$ . For (146) we have by the Hölder inequality and (A2) that

[TABLE]

for all $t\in[0,T]$ . Hence, (143), (147), (157) and (158) yield for all $t\in[0,T]$ that

[TABLE]

We have for all $t\in[0,T]$ that

[TABLE]

where

[TABLE]

From the Itô isometry and the Hölder inequality we obtain for $t\in[0,T]$

[TABLE]

By the Itô isometry together with the Itô formula we get

[TABLE]

where

[TABLE]

for all $t\in[0,T]$ . From the Hölder inequality we get

[TABLE]

for all $t\in[0,T]$ . Since we have for $u\in[t_{i},t_{i+1}]$ that

[TABLE]

we obtain

[TABLE]

Moreover, for $s\in[t_{i},t_{i+1}]$ we have that

[TABLE]

which implies

[TABLE]

Hence, for all $t\in[0,T]$ we get

[TABLE]

Therefore, by (160), (161), (173) and (174) we obtain for all $t\in[0,T]$

[TABLE]

Now

[TABLE]

where

[TABLE]

for all $t\in[0,T]$ . Next, we use the decomposition $dN(t)=d\tilde{N}(t)+dm(t)=d\tilde{N}(t)+\lambda(t)dt$ and the martingale isometry. Then the estimation of the above terms proceeds analogously to that of $\mathbb{E}|B(t)-\tilde{B}^{M}_{m}(t)|^{2}$ , hence, we skip it. We get for all $t\in[0,T]$ that

[TABLE]

Combining (159), (175) and (177) we get for all $t\in[0,T]$

[TABLE]

By (13) and (136) the mapping $[0,T]\ni t\to\sup\limits_{0\leq s\leq t}\mathbb{E}|X(s)-\tilde{X}^{M}_{m}(s)|^{2}\in\mathbb{R}_{+}\cup\{0\}$ is bounded and Borel measurable. Hence, by Gronwall's lemma we get (135). The estimate (134) is a consequence of (13) and (135). This ends the proof. $\blacksquare$

Lemma 6.3

Let us assume that the mappings $a$ , $b$ , $c$ and $\lambda$ satisfy the assumptions $(A)$ - $(E)$ . For all $t\in[t_{i},t_{i+1}]$ , $i=0,1,\ldots,m-1$

[TABLE]

where $C>0$ does not depend on $m$ nor $i$ .

Proof. From (12) and Theorem 6.1 we have for $f\in\{b,c\}$ and $j\in\{-1,1\}$ that

[TABLE]

where $C>0$ does not depend on $m$ nor $i$ . Moreover, for $f\in\{b,c\}$ and $j\in\{-1,1\}$ the random variable $L_{j}f(U_{i})$ is $\mathcal{F}_{t_{i}}$ -measurable. From Fact 6.2 (ii) and by (41)-(45) we have that $I_{t_{i},t}(N,N)-\mathbb{E}(I_{t_{i},t}(N,N)\ |\ \mathcal{N}_{m}(N))$ , $I_{t_{i},t}(W,W)-\mathbb{E}(I_{t_{i},t}(W,W)\ |\ \mathcal{N}_{m}(W))$ and $I_{t_{i},t}(N,W)+I_{t_{i},t}(W,N)-\mathbb{E}(I_{t_{i},t}(N,W)+I_{t_{i},t}(W,N)\ |\ \mathcal{N}_{m}(N,W))$ are independent of $\mathcal{F}_{t_{i}}$ . Hence, by (40), Fact 6.2 (i) and (180) we get

[TABLE]

for $t\in[t_{i},t_{i+1}]$ , which ends the proof of (179). $\blacksquare$

7 References


[1] Applebaum, D., Lévy Processes and Stochastic Calculus. 2nd ed., Cambridge University Press, 2011.
[2] Bonet, E., Nualart, D., Interpolation and forecasting in Poisson's processes. Stochastica 2 (1977), 1–5.
[3] Bruti-Liberati, N., Platen, E., Strong approximations of stochastic differential equations with jumps. J. Comput. Appl. Math. 205 (2007), 982–1001.
[4] Debowski, J., Przybyłowicz, P., Optimal approximation of stochastic integrals with respect to a homogeneous Poisson process, submitted.
[5] Doob, J. L., Measure Theory. Springer-Verlag New York, 1994.
[6] Gardoń, A., The order of approximations for solutions of Itô-type stochastic differential equations with jumps. Stoch. Anal. Appl. 22 (2004), 679–699.
[7] Graham, C., Talay, D., Stochastic Simulation and Monte Carlo Methods. Springer-Verlag, Berlin, Heidelberg, 2013.
[8] Hertling, P., Nonlinear Lebesgue and Itô integration problems of high complexity. J. Complexity 17 (2001), 366–387.