Overcoming the curse of dimensionality in the approximative pricing of financial derivatives with default risks

Martin Hutzenthaler, Arnulf Jentzen, and Philippe von Wurstemberger

arXiv:1903.05985·math.NA·October 5, 2020

TL;DR

This paper extends multilevel Picard algorithms to high-dimensional semilinear Black-Scholes PDEs, demonstrating polynomial computational effort and overcoming the curse of dimensionality in derivative pricing with default risks.

Contribution

It introduces a new MLP algorithm for semilinear Black-Scholes equations and proves polynomial complexity, a first in this context.

Findings

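01 MLP algorithms overcome the curse of dimensionality for these PDEs.

02 Under a globally Lipschitz continuous nonlinearity, the computational effort grows at most polynomially in both the dimension and the reciprocal of the prescribed approximation accuracy.

03 To the best of the authors' knowledge, this is the first result showing that the approximation of solutions of semilinear Black-Scholes PDEs is polynomially tractable.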

Equations

\big{(}\tfrac{\partial u_{d}}{\partial t}\big{)}(t,x)+\left[\sum_{i=1}^{d}\tfrac{|\beta|^{2}|x_{i}|^{2}}{2}\big{(}\tfrac{\partial^{2}u_{d}}{\partial(x_{i})^{2}}\big{)}(t,x)\right]+\left[\sum_{i=1}^{d}\alpha x_{i}\big{(}\tfrac{\partial u_{d}}{\partial x_{i}}\big{)}(t,x)\right]+f(u_{d}(t,x))=0,

X^{d,\theta,x,i}_{t,s}=x_{i}\exp\!\big{(}\big{(}\alpha-\tfrac{\beta^{2}}{2}\big{)}(s-t)+\beta\big{(}W^{d,\theta,i}_{s}-W^{d,\theta,i}_{t}\big{)}\big{)},

\begin{split}V^{d,\theta}_{M,n}(t,x)&=\sum_{k=0}^{n-1}\frac{(T-t)}{M^{n-k}}\Bigg{[}\sum_{m=1}^{M^{n-k}}f\Big{(}V^{d,(\theta,k,m)}_{M,k}\big{(}R^{(\theta,k,m)}_{t},X^{d,(\theta,k,m),x}_{t,R^{(\theta,k,m)}_{t}}\big{)}\Big{)}\\ &\quad-\mathbbm{1}_{\mathbb{N}}(k)f\Big{(}V^{d,(\theta,k,-m)}_{M,k-1}\big{(}R^{(\theta,k,m)}_{t},X^{d,(\theta,k,m),x}_{t,R^{(\theta,k,m)}_{t}}\big{)}\Big{)}\Bigg{]}+\Bigg{[}\sum_{m=1}^{M^{n}}\frac{g_{d}(X^{d,(\theta,n,-m),x}_{t,T})}{M^{n}}\Bigg{]},\end{split}

\big{(}\mathbb{E}\big{[}|u_{d}(0,\xi_{d})-V^{d,0}_{N_{d,\varepsilon},N_{d,\varepsilon}}(0,\xi_{d})|^{2}\big{]}\big{)}^{\nicefrac{{1}}{{2}}}\leq\varepsilon.

u_{d}(t,x)=\mathbb{E}\!\left[g_{d}\big{(}X^{d,\theta,x}_{t,T}\big{)}+(T-t)f\big{(}u_{d}(R^{\theta}_{t},X^{d,\theta,x}_{t,R^{\theta}_{t}})\big{)}\right].

\begin{split}\mathbb{E}\big{[}V^{d,\theta}_{M,n}(t,x)\big{]}&=\sum_{k=0}^{n-1}(T-t)\mathbb{E}\Big{[}f\big{(}V^{d,(\theta,1)}_{M,k}\big{(}R^{(\theta,1)}_{t},X^{d,(\theta,1),x}_{t,R^{(\theta,1)}_{t}}\big{)}\big{)}\\ &\qquad-\mathbbm{1}_{\mathbb{N}}(k)f\big{(}V^{d,(\theta,-1)}_{M,k-1}\big{(}R^{(\theta,1)}_{t},X^{d,(\theta,1),x}_{t,R^{(\theta,1)}_{t}}\big{)}\big{)}\Big{]}+\mathbb{E}\big{[}g_{d}(X^{d,\theta,x}_{t,T})\big{]}\\ &=\mathbb{E}\!\left[g_{d}(X^{d,\theta,x}_{t,T})+(T-t)f\big{(}V^{d,\theta}_{M,n-1}\big{(}R^{\theta}_{t},X^{d,\theta,x}_{t,R^{\theta}_{t}}\big{)}\big{)}\right].\end{split}

\epsilon_{n}\leq\alpha+\left[\sum_{k=0}^{n-1}\beta_{k}\epsilon_{k}\right].

\epsilon_{n}\leq\alpha\left[\prod_{k=0}^{n-1}(1+\beta_{k})\right]\leq\alpha\exp\!\left(\sum_{k=0}^{n-1}\beta_{k}\right)<\infty.

u_{n}=\alpha+\left[\sum_{k=0}^{n-1}\beta_{k}u_{k}\right].

u_{n}=\alpha\left[\prod_{k=0}^{n-1}(1+\beta_{k})\right].

u_{0}=\alpha.

u_{n}=\alpha+\left[\sum_{k=0}^{n-1}\beta_{k}u_{k}\right]=\alpha+\left[\sum_{k=0}^{n-2}\beta_{k}u_{k}\right]+\beta_{n-1}u_{n-1}=u_{n-1}+\beta_{n-1}u_{n-1}=(1+\beta_{n-1})u_{n-1}=\alpha\left[\prod_{k=0}^{n-1}(1+\beta_{k})\right].

\epsilon_{n}\leq u_{n}.

\epsilon_{n}\leq\alpha\left[\prod_{k=0}^{n-1}(1+\beta_{k})\right].

\epsilon_{n}\leq\alpha\left[\prod_{k=0}^{n-1}(1+\beta_{k})\right]\leq\alpha\left[\prod_{k=0}^{n-1}\exp(\beta_{k})\right]=\alpha\exp\!\left(\sum_{k=0}^{n-1}\beta_{k}\right).

\epsilon_{n}\leq\alpha+\beta\left[\sum_{k=0}^{n-1}\epsilon_{k}\right].

\epsilon_{n}\leq\alpha(1+\beta)^{n}\leq\alpha e^{\beta n}<\infty.

\max\!\big\{\langle x,\mu(t,x)\rangle,{\left|\kern-0.75346pt\left|\kern-0.75346pt\left|\sigma(t,x)\right|\kern-0.75346pt\right|\kern-0.75346pt\right|}^{2}\big\}\leq C_{1}+C_{2}\left\|x\right\|^{2}\qquad\text{and}\qquad V_{p}(x)=(1+\left\|x\right\|^{2})^{\nicefrac{p}{2}}.

\begin{split}&\tfrac{1}{2}\operatorname{Trace}\!\big{(}\sigma(t,x)[\sigma(t,x)]^{\ast}(\operatorname{Hess}V_{p})(x)\big{)}+\langle\mu(t,x),(\nabla V_{p})(x)\rangle\\ &\leq\tfrac{p(p+1)}{2}\big{(}\tfrac{p-2}{p}+C_{2}\big{)}V_{p}(x)+(p+1)|C_{1}|^{\nicefrac{{p}}{{2}}}.\end{split}

\sigma(t,x)=\begin{pmatrix}\sigma_{1,1}(t,x)&\sigma_{1,2}(t,x)&\dots&\sigma_{1,m}(t,x)\\ \sigma_{2,1}(t,x)&\sigma_{2,2}(t,x)&\dots&\sigma_{2,m}(t,x)\\ \vdots&\vdots&\ddots&\vdots\\ \sigma_{d,1}(t,x)&\sigma_{d,2}(t,x)&\dots&\sigma_{d,m}(t,x)\end{pmatrix}\in\mathbb{R}^{d\times m}.

(\nabla V_{p})(x)=\frac{p}{2}(1+\left\|x\right\|^{2})^{\frac{p}{2}-1}\cdot(2x)=pV_{p}(x)\left[\frac{1}{1+\left\|x\right\|^{2}}\right]x

\begin{split}\big(\tfrac{\partial^{2}V_{p}}{\partial x_{i}\partial x_{j}}\big)(x)&=\tfrac{\partial}{\partial x_{i}}\Big[p(1+\left\|x\right\|^{2})^{\frac{p}{2}-1}x_{j}\Big]\\ &=p\Big[\tfrac{\partial}{\partial x_{i}}(1+\left\|x\right\|^{2})^{\frac{p}{2}-1}\Big]x_{j}+p(1+\left\|x\right\|^{2})^{\frac{p}{2}-1}\Big[\tfrac{\partial}{\partial x_{i}}x_{j}\Big]\\ &=p\big(\tfrac{p}{2}-1\big)(1+\left\|x\right\|^{2})^{\frac{p}{2}-2}\cdot(2x_{i})x_{j}+p(1+\left\|x\right\|^{2})^{\frac{p}{2}-1}\mathbbm{1}_{\{i\}}(j)\\ &=p(p-2)V_{p}(x)\tfrac{x_{i}x_{j}}{(1+\left\|x\right\|^{2})^{2}}+pV_{p}(x)\tfrac{\mathbbm{1}_{\{i\}}(j)}{1+\left\|x\right\|^{2}}\\ &=pV_{p}(x)\Big[(p-2)\tfrac{x_{i}x_{j}}{(1+\left\|x\right\|^{2})^{2}}+\tfrac{\mathbbm{1}_{\{i\}}(j)}{1+\left\|x\right\|^{2}}\Big].\end{split}

\begin{split}&\tfrac{1}{2}\operatorname{Trace}\!\big{(}\sigma(t,x)[\sigma(t,x)]^{*}(\operatorname{Hess}V_{p})(x)\big{)}+\langle\mu(t,x),(\nabla V_{p})(x)\rangle\\ &=\tfrac{1}{2}\left[\sum_{k=1}^{m}\sum_{i,j=1}^{d}\sigma_{i,k}(t,x)\sigma_{j,k}(t,x)(\tfrac{\partial^{2}V_{p}}{\partial x_{i}\partial x_{j}})(t,x)\right]+\left\langle\mu(t,x),(\nabla V_{p})(x)\right\rangle\\ &=\tfrac{pV_{p}(x)}{2}\left(\left[\sum_{k=1}^{m}\sum_{i,j=1}^{d}\sigma_{i,k}(t,x)\sigma_{j,k}(t,x)\left((p-2)\tfrac{x_{i}x_{j}}{(1+\left\|x\right\|^{2})^{2}}+\tfrac{\mathbbm{1}_{\{i\}}(j)}{1+\left\|x\right\|^{2}}\right)\right]+\tfrac{2\langle\mu(t,x),x\rangle}{1+\left\|x\right\|^{2}}\right)\\ &=\tfrac{pV_{p}(x)}{2}\left(\tfrac{(p-2)}{(1+\left\|x\right\|^{2})^{2}}\left[\sum_{k=1}^{m}\left[\sum_{i=1}^{d}\sigma_{i,k}(t,x)x_{i}\right]^{2}\right]+\tfrac{{\left|\kern-0.75346pt\left|\kern-0.75346pt\left|\sigma(t,x)\right|\kern-0.75346pt\right|\kern-0.75346pt\right|}^{2}}{1+\left\|x\right\|^{2}}+\tfrac{2\langle\mu(t,x),x\rangle}{1+\left\|x\right\|^{2}}\right).\end{split}

\sum_{k=1}^{m}\left[\sum_{i=1}^{d}\sigma_{i,k}(t,x)x_{i}\right]^{2}\leq\sum_{k=1}^{m}\left[\sum_{i=1}^{d}|\sigma_{i,k}(t,x)|^{2}\right]\left[\sum_{i=1}^{d}|x_{i}|^{2}\right]={\left|\kern-0.75346pt\left|\kern-0.75346pt\left|\sigma(t,x)\right|\kern-0.75346pt\right|\kern-0.75346pt\right|}^{2}\left\|x\right\|^{2}\leq{\left|\kern-0.75346pt\left|\kern-0.75346pt\left|\sigma(t,x)\right|\kern-0.75346pt\right|\kern-0.75346pt\right|}^{2}(1+\left\|x\right\|^{2}).

\begin{split}&\tfrac{1}{2}\operatorname{Trace}\!\big{(}\sigma(t,x)[\sigma(t,x)]^{\ast}(\operatorname{Hess}V_{p})(x)\big{)}+\langle\mu(t,x),(\nabla V_{p})(x)\rangle\\ &\leq\tfrac{p}{2}\left[\tfrac{(p-2){\left|\kern-0.75346pt\left|\kern-0.75346pt\left|\sigma(t,x)\right|\kern-0.75346pt\right|\kern-0.75346pt\right|}^{2}}{1+\left\|x\right\|^{2}}+\tfrac{{\left|\kern-0.75346pt\left|\kern-0.75346pt\left|\sigma(t,x)\right|\kern-0.75346pt\right|\kern-0.75346pt\right|}^{2}}{1+\left\|x\right\|^{2}}+\tfrac{2\langle\mu(t,x),x\rangle}{1+\left\|x\right\|^{2}}\right]V_{p}(x)\\ &\leq\tfrac{p}{2}(p-2+1+2)\tfrac{(C_{1}+C_{2}\left\|x\right\|^{2})}{1+\left\|x\right\|^{2}}V_{p}(x)\\ &\leq\tfrac{p(p+1)}{2}\left(C_{1}\left[\tfrac{V_{p}(x)}{1+\left\|x\right\|^{2}}\right]+C_{2}V_{p}(x)\right)=\tfrac{p(p+1)}{2}\left(C_{1}(1+\left\|x\right\|^{2})^{\nicefrac{{p}}{{2}}-1}+C_{2}V_{p}(x)\right).\end{split}

\begin{split}&\tfrac{1}{2}\operatorname{Trace}\!\big{(}\sigma(t,x)[\sigma(t,x)]^{\ast}(\operatorname{Hess}V_{p})(x)\big{)}+\langle\mu(t,x),(\nabla V_{p})(x)\rangle\\ &\leq\tfrac{p(p+1)}{2}\left(\frac{|C_{1}|^{\nicefrac{{p}}{{2}}}}{\nicefrac{{p}}{{2}}}+\frac{\left|(1+\left\|x\right\|^{2})^{\nicefrac{{p}}{{2}}-1}\right|^{\nicefrac{{p}}{{{(p-2)}}}}}{\nicefrac{{p}}{{{(p-2)}}}}+C_{2}V_{p}(x)\right)\\ &=(p+1)|C_{1}|^{\nicefrac{{p}}{{2}}}+\left(\tfrac{p(p+1)}{2}\big{(}\tfrac{p-2}{p}+C_{2}\big{)}\right)V_{p}(x).\end{split}

\begin{split}&\tfrac{1}{2}\operatorname{Trace}\!\big{(}\sigma(t,x)[\sigma(t,x)]^{\ast}(\operatorname{Hess}V_{2})(x)\big{)}+\langle\mu(t,x),(\nabla V_{2})(x)\rangle\leq 3\left(C_{1}+C_{2}V_{2}(x)\right).\end{split}

\tfrac{1}{2}\operatorname{Trace}\!\big{(}\sigma(t,x)[\sigma(t,x)]^{*}(\operatorname{Hess}V)(x)\big{)}+\langle\mu(t,x),(\nabla V)(x)\rangle\leq\rho,

X_{t}=\xi+\int_{0}^{t}\mu(r,X_{r})\,dr+\int_{0}^{t}\sigma(r,X_{r})\,dW_{r}.

\mathbb{E}[V(X_{t})]\leq V(\xi)+t\rho.

Abstract

Parabolic partial differential equations (PDEs) are widely used in the mathematical modeling of natural phenomena and man-made complex systems. In particular, parabolic PDEs are a fundamental tool to determine fair prices of financial derivatives in the financial industry. The PDEs appearing in financial engineering applications are often nonlinear (e.g., PDE models which take into account the possibility of a defaulting counterparty) and high dimensional since the dimension typically corresponds to the number of considered financial assets. A major issue is that most approximation methods for nonlinear PDEs in the literature suffer from the so-called curse of dimensionality in the sense that the computational effort to compute an approximation with a prescribed accuracy grows exponentially in the dimension of the PDE or in the reciprocal of the prescribed approximation accuracy; moreover, for nearly all approximation methods it has not been shown that they do not suffer from the curse of dimensionality. Recently, a new class of approximation schemes for semilinear parabolic PDEs, termed full history recursive multilevel Picard (MLP) algorithms, was introduced and it was proven that MLP algorithms do overcome the curse of dimensionality for semilinear heat equations. In this paper we extend those findings to a more general class of semilinear PDEs including as special cases semilinear Black-Scholes equations used for the pricing of financial derivatives with default risks. More specifically, we introduce an MLP algorithm for the approximation of solutions of semilinear Black-Scholes equations and prove, under the assumption that the nonlinearity is globally Lipschitz continuous, that the computational effort of our method grows at most polynomially both in the dimension and the reciprocal of the prescribed approximation accuracy. This is, to the best of our knowledge, the first result showing that the approximation of solutions of semilinear Black-Scholes equations is a polynomially tractable approximation problem.

1 Introduction
2 On a distributional flow property for stochastic differential equations (SDEs)
2.1 Time-discrete Gronwall inequalities
2.2 A priori moment bounds for solutions of SDEs
2.3 Temporal regularity properties for solutions of SDEs
2.4 Strong error estimates for Euler-Maruyama approximations
2.5 On identically distributed random variables
2.6 On random evaluations of random fields
2.7 Brownian motions and right-continuous filtrations
2.8 On a distributional flow property for solutions of SDEs
3 Full history recursive multilevel Picard (MLP) approximation algorithms
3.1 Stochastic fixed point equations and MLP approximations
3.2 A priori bounds for solutions of stochastic fixed point equations
3.3 Properties of MLP approximations
3.4 Analysis of approximation errors of MLP approximations
3.4.1 Expectations of MLP approximations
3.4.2 Biases of MLP approximations
3.4.3 Estimates for the variances of MLP approximations
3.4.4 On a geometric time-discrete Gronwall inequality
3.4.5 Error estimates for MLP approximations
3.5 Complexity analysis for MLP approximation algorithms
3.6 MLP approximations for semilinear partial differential equations (PDEs)
3.6.1 MLP approximations in fixed space dimensions
3.6.2 MLP approximations in variable space dimensions
4 MLP approximations for PDE models
4.1 MLP approximations for semilinear heat equations
4.2 MLP approximations for semilinear Black-Scholes equations
4.3 MLP approximations for the pricing of financial derivatives with default risks

1 Introduction

Parabolic partial differential equations (PDEs) are widely used in the mathematical modeling of natural phenomena and man-made complex systems. In particular, parabolic PDEs are a fundamental tool to determine fair prices of financial derivatives in the financial industry. The use of PDEs for option pricing originated in the work of Black, Scholes, & Merton (see [9, 76]) which suggested that the price of a financial derivative satisfies a linear parabolic PDE, nowadays known as the Black-Scholes equation. The derivation of their theory is based on several assumptions which are not met in financial practice and consequently various changes and extensions to the original pricing model have been developed. One key modification of the initial Black-Scholes model is to include the possibility of a defaulting counterparty (cf., e.g., Burgard & Kjaer [14], Crépey et al. [24], Duffie et al. [33], and Henry-Labordère [53]). Such extended models suggest that the price process of a financial derivative satisfies a certain semilinear PDE (cf. (1) in Theorem 1.1 below and Subsections 4.2–4.3 below). Typically, such PDEs cannot be solved explicitly and it is therefore a very active topic of research to solve such PDEs approximatively; cf., e.g., [30, 94, 95, 97] for deterministic approximation methods for PDEs, cf., e.g., [2, 6, 7, 10, 12, 13, 17, 18, 19, 20, 25, 26, 27, 28, 38, 39, 31, 32, 40, 41, 42, 43, 44, 45, 46, 57, 69, 70, 71, 72, 73, 74, 78, 79, 82, 83, 84, 85, 89, 90, 91, 96, 102, 103, 104] for probabilistic approximation methods for PDEs using discretizations of the associated backward stochastic differential equations (BSDEs), cf., e.g., [11, 21, 37, 49, 68, 105] for probabilistic approximation methods for PDEs using temporal discretizations of the associated second-order BSDEs, cf., e.g., [16, 53, 55, 56, 75, 88, 93, 98, 101] for probabilistic approximation methods for PDEs using branching diffusion processes, cf., e.g., [99, 100] for probabilistic approximation methods for PDEs using nested Monte Carlo simulations, cf., e.g., [35, 36, 59, 60] for full history recursive multilevel Picard (MLP) approximation methods for PDEs, and cf., e.g., [3, 4, 8, 15, 34, 51, 52, 54, 58, 65, 80, 87, 92] for approximation methods for PDEs which are based on reformulations of PDEs as deep learning problems.

The PDEs appearing in financial engineering applications are often high dimensional since the dimension corresponds to the number of financial assets (such as stocks, commodities, interest rates, or exchange rates) in the involved hedging portfolio. A major issue is that most approximation methods suffer from the so-called curse of dimensionality (see Bellman [5]) in the sense that the computational effort to compute an approximation with a prescribed accuracy $\varepsilon>0$ grows exponentially in the dimension $d\in\mathbb{N}$ of the PDE or in the reciprocal $\nicefrac{{1}}{{\varepsilon}}$ of the prescribed approximation accuracy (cf., e.g., E et al. [36, Section 4] for a discussion of the curse of dimensionality in the PDE approximation literature); moreover, for nearly all approximation methods it has not been shown that they do not suffer from the curse of dimensionality. Recently, a new class of approximation schemes for semilinear parabolic PDEs, termed full history recursive multilevel Picard (MLP) algorithms, was introduced in E et al. [35, 36] and it was proven, under restrictive assumptions on the regularity of the solution of the PDE, that they overcome the curse of dimensionality for semilinear heat equations. Building on this work, [59] proposed for semilinear heat equations an adaptation of the original MLP scheme in [35, 36]. Under the assumption that the nonlinearity in the PDE is globally Lipschitz continuous, [59, Theorem 1.1] proves that the proposed scheme does indeed overcome the curse of dimensionality in the sense that the computational effort to compute an approximation with a prescribed accuracy $\varepsilon>0$ grows at most polynomially in both the dimension $d\in\mathbb{N}$ of the PDE and the reciprocal $\nicefrac{{1}}{{\varepsilon}}$ of the prescribed approximation accuracy.

In this paper we generalize the MLP algorithm of [59]. The main result of this article, Theorem 3.20 below, proves that the MLP algorithm proposed in this paper overcomes the curse of dimensionality for a more general class of semilinear PDEs which includes as special cases the important examples of semilinear Black-Scholes equations used for the pricing of financial derivatives with default risks. In particular, we show for the first time that the solution of a semilinear Black-Scholes PDE with a globally Lipschitz continuous nonlinearity can be approximated with a computational effort which grows at most polynomially in both the dimension and the reciprocal of the prescribed approximation accuracy. Put differently, we show that the approximation of solutions of such semilinear Black-Scholes equations is a polynomially tractable approximation problem (cf., e.g., Novak & Woźniakowski [81]). To illustrate Theorem 3.20, we present in Theorem 1.1 below a special case of it. Theorem 1.1 demonstrates that the MLP algorithm proposed in this article overcomes the curse of dimensionality for the approximation of solutions of certain semilinear Black-Scholes equations.

Theorem 1.1.

Let $T\in(0,\infty)$, $p,\mathfrak{P},q\in[0,\infty)$, $\alpha,\beta\in\mathbb{R}$, $\Theta=\cup_{n=1}^{\infty}\mathbb{Z}^{n}$, let $f\colon\mathbb{R}\to\mathbb{R}$ be a Lipschitz continuous function, let $\xi_{d}\in\mathbb{R}^{d}$, $d\in\mathbb{N}$, and $g_{d}\in C^{2}(\mathbb{R}^{d},\mathbb{R})$, $d\in\mathbb{N}$, satisfy that $\sup_{d\in\mathbb{N},x\in\mathbb{R}^{d}}\big{(}\tfrac{|g_{d}(x)|}{d^{\mathfrak{P}}(1+\left\|x\right\|_{\mathbb{R}^{d}}^{p})}+\frac{\left\|\xi_{d}\right\|_{\mathbb{R}^{d}}}{d^{q}}\big{)}<\infty$, let $u_{d}\in C^{1,2}([0,T]\times\mathbb{R}^{d},\mathbb{R})$, $d\in\mathbb{N}$, be polynomially growing functions which satisfy for all $d\in\mathbb{N}$, $t\in(0,T)$, $x=(x_{1},x_{2},\ldots,x_{d})\in\mathbb{R}^{d}$ that $u_{d}(T,x)=g_{d}(x)$ and

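$$\big{(}\tfrac{\partial u_{d}}{\partial t}\big{)}(t,x)+\left[\sum_{i=1}^{d}\tfrac{|\beta|^{2}|x_{i}|^{2}}{2}\big{(}\tfrac{\partial^{2}u_{d}}{\partial(x_{i})^{2}}\big{)}(t,x)\right]+\left[\sum_{i=1}^{d}\alpha x_{i}\big{(}\tfrac{\partial u_{d}}{\partial x_{i}}\big{)}(t,x)\right]+f(u_{d}(t,x))=0,\tag{1}$$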

let $(\Omega,\mathcal{F},\mathbb{P})$ be a probability space, let $\mathcal{R}^{\theta}\colon\Omega\to[0,1]$, $\theta\in\Theta$, be independent $\mathcal{U}_{[0,1]}$-distributed random variables, let $R^{\theta}=(R^{\theta}_{t})_{t\in[0,T]}\colon[0,T]\times\Omega\to[0,T]$, $\theta\in\Theta$, be the stochastic processes which satisfy for all $t\in[0,T]$, $\theta\in\Theta$ that $R^{\theta}_{t}=t+(T-t)\mathcal{R}^{\theta}$, let $W^{d,\theta}=(W^{d,\theta,i})_{i\in\{1,2,\ldots,d\}}\colon[0,T]\times\Omega\to\mathbb{R}^{d}$, $\theta\in\Theta$, $d\in\mathbb{N}$, be independent standard Brownian motions, assume that $(W^{d,\theta})_{d\in\mathbb{N},\theta\in\Theta}$ and $(\mathcal{R}^{\theta})_{\theta\in\Theta}$ are independent, for every $d\in\mathbb{N}$, $\theta\in\Theta$, $t\in[0,T]$, $s\in[t,T]$, $x=(x_{1},x_{2},\ldots,x_{d})\in\mathbb{R}^{d}$ let $X^{d,\theta,x}_{t,s}=(X^{d,\theta,x,i}_{t,s})_{i\in\{1,2,\ldots,d\}}\colon\Omega\to\mathbb{R}^{d}$ be the function which satisfies for all $i\in\{1,2,\ldots,d\}$ that

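$$X^{d,\theta,x,i}_{t,s}=x_{i}\exp\!\big{(}\big{(}\alpha-\tfrac{\beta^{2}}{2}\big{)}(s-t)+\beta\big{(}W^{d,\theta,i}_{s}-W^{d,\theta,i}_{t}\big{)}\big{)},\tag{2}$$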

let $V^{d,\theta}_{M,n}\colon[0,T]\times\mathbb{R}^{d}\times\Omega\to\mathbb{R}$, $M,n\in\mathbb{Z}$, $\theta\in\Theta$, $d\in\mathbb{N}$, be functions which satisfy for all $d,M,n\in\mathbb{N}$, $\theta\in\Theta$, $t\in[0,T]$, $x\in\mathbb{R}^{d}$ that $V^{d,\theta}_{M,-1}(t,x)=V^{d,\theta}_{M,0}(t,x)=0$ and

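$$\begin{split}V^{d,\theta}_{M,n}(t,x)&=\sum_{k=0}^{n-1}\frac{(T-t)}{M^{n-k}}\Bigg{[}\sum_{m=1}^{M^{n-k}}f\Big{(}V^{d,(\theta,k,m)}_{M,k}\big{(}R^{(\theta,k,m)}_{t},X^{d,(\theta,k,m),x}_{t,R^{(\theta,k,m)}_{t}}\big{)}\Big{)}\\ &\quad-\mathbbm{1}_{\mathbb{N}}(k)f\Big{(}V^{d,(\theta,k,-m)}_{M,k-1}\big{(}R^{(\theta,k,m)}_{t},X^{d,(\theta,k,m),x}_{t,R^{(\theta,k,m)}_{t}}\big{)}\Big{)}\Bigg{]}+\Bigg{[}\sum_{m=1}^{M^{n}}\frac{g_{d}(X^{d,(\theta,n,-m),x}_{t,T})}{M^{n}}\Bigg{]},\end{split}\tag{3}$$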

and for every $d,n,M\in\mathbb{N}$, $t\in[0,T]$, $x\in\mathbb{R}^{d}$ let $\mathcal{C}_{d,M,n}\in\mathbb{N}_{0}$ be the number of realizations of standard normal random variables which are used to compute one realization of $V^{d,0}_{M,n}(t,x)$ (see (336) below for a precise definition). Then there exist functions $N=(N_{d,\varepsilon})_{d\in\mathbb{N},\varepsilon\in(0,1]}\colon\mathbb{N}\times(0,1]\to\mathbb{N}$ and $C=(C_{\delta})_{\delta\in(0,\infty)}\colon(0,\infty)\to(0,\infty)$ such that for all $d\in\mathbb{N}$, $\varepsilon\in(0,1]$, $\delta\in(0,\infty)$ it holds that $\mathcal{C}_{d,N_{d,\varepsilon},N_{d,\varepsilon}}\leq C_{\delta}\,d^{1+(\mathfrak{P}+qp)(2+\delta)}\varepsilon^{-(2+\delta)}$ and

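$$\big{(}\mathbb{E}\big{[}|u_{d}(0,\xi_{d})-V^{d,0}_{N_{d,\varepsilon},N_{d,\varepsilon}}(0,\xi_{d})|^{2}\big{]}\big{)}^{\nicefrac{{1}}{{2}}}\leq\varepsilon.\tag{4}$$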
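The two random ingredients of Theorem 1.1 can be sampled exactly, so no time discretization of the SDE is needed in this special case. The following Python sketch draws one realization of the geometric Brownian motion in (2) and of the random time $R^{\theta}_{t}=t+(T-t)\mathcal{R}^{\theta}$. The function names `sample_gbm` and `sample_time`, the use of NumPy, and the argument order are illustrative assumptions and are not taken from the paper.

```python
import numpy as np


def sample_gbm(x, t, s, alpha, beta, rng):
    """Draw one realization of X_{t,s}^{x} from the closed form in (2).

    Every coordinate uses its own independent Brownian increment
    W_s - W_t ~ N(0, s - t), as in the uncorrelated setting of Theorem 1.1.
    """
    x = np.asarray(x, dtype=float)
    dw = np.sqrt(s - t) * rng.standard_normal(x.shape)
    return x * np.exp((alpha - 0.5 * beta**2) * (s - t) + beta * dw)


def sample_time(t, T, rng):
    """Draw R_t = t + (T - t) * U with U uniformly distributed on [0, 1]."""
    return t + (T - t) * rng.uniform()
```

Because the coordinates are independent, calling `sample_gbm` with a vector of ones of length $n$ returns $n$ independent samples started at 1, and their empirical mean should be close to $e^{\alpha(s-t)}$, the mean of a geometric Brownian motion with drift parameter $\alpha$.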

Theorem 1.1 is an immediate consequence of Theorem 4.4 below. Theorem 4.4 in turn is a consequence of Theorem 3.20 below, the main result of this paper. We now provide some explanations for Theorem 1.1. In Theorem 1.1 we present a stochastic approximation scheme (cf. $(V^{d,0}_{M,n})_{M,n,d\in\mathbb{N}}$ in Theorem 1.1 above) which is able to approximate in the strong $L^{2}$-sense the initial value $u_{d}(0,\xi_{d})$ of the solution of an uncorrelated semilinear Black-Scholes equation (cf. (1) in Theorem 1.1 above) with a computational effort which grows at most polynomially in both the dimension $d\in\mathbb{N}$ and the reciprocal $\nicefrac{{1}}{{\varepsilon}}$ of the prescribed approximation accuracy $\varepsilon>0$. The time horizon $T\in(0,\infty)$, the drift parameter $\alpha\in\mathbb{R}$, the diffusion parameter $\beta\in\mathbb{R}$, as well as the Lipschitz continuous nonlinearity $f\colon\mathbb{R}\to\mathbb{R}$ of the semilinear Black-Scholes equations in Theorem 1.1 above (cf. (1) in Theorem 1.1 above) are fixed over all dimensions (cf. Theorem 4.3 for a more general result with dimension-dependent drift and diffusion coefficients and dimension-dependent nonlinearities which may additionally depend on the time and the space variable). The approximation points $(\xi_{d})_{d\in\mathbb{N}}$ and the terminal conditions $(g_{d})_{d\in\mathbb{N}}$ of the PDE (1) in Theorem 1.1 above are both allowed to grow in a certain polynomial fashion determined by the constants $p,\mathfrak{P},q\in[0,\infty)$. The idea for the full history multilevel Picard scheme (cf. $(V^{d,\theta}_{M,n})_{M,d\in\mathbb{N},n\in\mathbb{N}_{0},\theta\in\Theta}$ in Theorem 1.1 above) is based on a reformulation of the semilinear PDE in (1) as a stochastic fixed point equation. For this we consider the independent solution fields $(X^{d,\theta})_{d\in\mathbb{N},\theta\in\Theta}$ of the stochastic differential equation (SDE) associated to the PDE in (1) and for every $t\in[0,T]$ we consider independent $\mathcal{U}_{[t,T]}$-distributed random variables $(R^{\theta}_{t})_{\theta\in\Theta}$. As a consequence of the Feynman-Kac formula we obtain that $(u_{d})_{d\in\mathbb{N}}$ are the unique at most polynomially growing functions which satisfy for all $d\in\mathbb{N}$, $\theta\in\Theta$, $t\in[0,T]$, $x\in\mathbb{R}^{d}$ that

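$$u_{d}(t,x)=\mathbb{E}\!\left[g_{d}\big{(}X^{d,\theta,x}_{t,T}\big{)}+(T-t)f\big{(}u_{d}(R^{\theta}_{t},X^{d,\theta,x}_{t,R^{\theta}_{t}})\big{)}\right].\tag{5}$$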

Note that for all $d,M,n\in\mathbb{N}$, $\theta\in\Theta$, $t\in[0,T]$, $x\in\mathbb{R}^{d}$ it holds that

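$$\begin{split}\mathbb{E}\big{[}V^{d,\theta}_{M,n}(t,x)\big{]}&=\sum_{k=0}^{n-1}(T-t)\mathbb{E}\Big{[}f\big{(}V^{d,(\theta,1)}_{M,k}\big{(}R^{(\theta,1)}_{t},X^{d,(\theta,1),x}_{t,R^{(\theta,1)}_{t}}\big{)}\big{)}\\ &\qquad-\mathbbm{1}_{\mathbb{N}}(k)f\big{(}V^{d,(\theta,-1)}_{M,k-1}\big{(}R^{(\theta,1)}_{t},X^{d,(\theta,1),x}_{t,R^{(\theta,1)}_{t}}\big{)}\big{)}\Big{]}+\mathbb{E}\big{[}g_{d}(X^{d,\theta,x}_{t,T})\big{]}\\ &=\mathbb{E}\!\left[g_{d}(X^{d,\theta,x}_{t,T})+(T-t)f\big{(}V^{d,\theta}_{M,n-1}\big{(}R^{\theta}_{t},X^{d,\theta,x}_{t,R^{\theta}_{t}}\big{)}\big{)}\right].\end{split}\tag{6}$$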

Thus for every $d,M\in\mathbb{N}$, $\theta\in\Theta$ the sequence of random fields $(V^{d,\theta}_{M,n})_{n\in\mathbb{N}_{0}}$ behaves, in expectation, like Picard iterations for the stochastic fixed point equation in (5) above. In each iteration in (3) the expectation of the Picard iteration for the stochastic fixed point equation in (5) is approximated with a multilevel Monte Carlo approach on a telescope expansion over the full history of the previous iterations. According to the multilevel Monte Carlo paradigm the number of samples in each level is chosen such that computationally inexpensive summands (corresponding to small $k\in\{0,1,2,\ldots,n\}$ in (6)) of the telescope expansion get sampled more often than computationally expensive ones (corresponding to large $k\in\{0,1,2,\ldots,n\}$ in (6)). Roughly speaking, the conclusion of Theorem 1.1 above states (cf. Theorem 1.1 above for the precise formulation) that for every $d\in\mathbb{N}$, $\varepsilon\in(0,1]$ there exists a natural number $N\in\mathbb{N}$ such that $V^{d,0}_{N,N}(0,\xi_{d})$ approximates $u_{d}(0,\xi_{d})$ in the $L^{2}$-sense with accuracy $\varepsilon$ and such that the computational effort to compute $V^{d,0}_{N,N}(0,\xi_{d})$ is essentially of the order $d^{1+2(\mathfrak{P}+pq)}\varepsilon^{-2}$. Remarkably, this is exactly the computational complexity of the standard Monte Carlo approximation of the solution of the PDE (1) in the case that the nonlinearity $f$ vanishes (cf., e.g., Graham & Talay [47]).
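To make the recursion (3) concrete, the following is a minimal Python sketch of one realization of $V^{d,0}_{M,n}(t,x)$ for the uncorrelated Black-Scholes dynamics in (2). It is an illustration under stated assumptions and not the authors' implementation: the function name `mlp`, its argument order, and the use of a single shared NumPy random generator (whose successive draws stand in for the independent families indexed by $\theta\in\Theta$) are choices made here for brevity.

```python
import numpy as np


def mlp(n, M, t, x, T, f, g, alpha, beta, rng):
    """Return one realization of V_{M,n}(t, x) following the recursion (3).

    f is the nonlinearity, g the terminal condition, and rng a NumPy
    Generator; all names are hypothetical and only serve this sketch.
    """
    if n <= 0:
        return 0.0  # V_{M,-1} = V_{M,0} = 0
    x = np.asarray(x, dtype=float)

    def gbm(s):
        # Exact sample of X_{t,s}^{x} from (2) with fresh Brownian increments.
        dw = np.sqrt(s - t) * rng.standard_normal(x.shape)
        return x * np.exp((alpha - 0.5 * beta**2) * (s - t) + beta * dw)

    # Terminal-condition term: average of g over M^n samples of X_{t,T}.
    total = sum(g(gbm(T)) for _ in range(M**n)) / M**n

    # Telescoping Picard corrections over the full history k = 0, ..., n-1.
    for k in range(n):
        acc = 0.0
        for _ in range(M ** (n - k)):
            r = t + (T - t) * rng.uniform()  # R_t, uniform on [t, T]
            y = gbm(r)                       # X_{t,R_t}^{x}
            acc += f(mlp(k, M, r, y, T, f, g, alpha, beta, rng))
            if k > 0:
                # Independent copy of the previous iterate at the same point.
                acc -= f(mlp(k - 1, M, r, y, T, f, g, alpha, beta, rng))
        total += (T - t) * acc / M ** (n - k)
    return total
```

One way to sanity-check the sketch: for the linear choice $f(v)=cv$ with a constant $c\in\mathbb{R}$ and $g_{d}\equiv 1$, the PDE (1) has the explicit solution $u_{d}(t,x)=e^{c(T-t)}$, so a call such as `mlp(4, 4, 0.0, np.ones(2), 1.0, lambda v: 0.5 * v, lambda x: 1.0, 0.05, 0.2, np.random.default_rng(1))` can be compared with $e^{0.5}$. For $f\equiv 0$ all correction terms vanish and the sketch reduces to the standard Monte Carlo average of $g_{d}(X_{t,T})$.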

The remainder of this paper is structured as follows. In Section 2 we prove a well-known distributional flow property for the composition of independent solution fields of a stochastic differential equation (SDE) (see Lemma 2.19 below), which will be a key assumption in the abstract treatment of stochastic fixed point equations in Section 3. Several auxiliary results which are needed for the proof of the flow property (see Lemma 2.19 below) in Section 2 will be used again in Section 3. Section 3 introduces the MLP algorithm, provides a complexity analysis in the setting of stochastic fixed point equations in Subsections 3.1–3.5, and then carries over those results to semilinear Kolmogorov PDEs in Subsection 3.6 leading to Theorem 3.20 below, the main result of this article. In the last section, Section 4, we apply the result for general semilinear Kolmogorov PDEs of Theorem 3.20 to semilinear heat equations (see Subsection 4.1) and semilinear Black-Scholes equations (see Subsection 4.2 and Subsection 4.3) which are notably used to compute prices for financial derivatives in the presence of counterparty credit risks (see Subsection 4.3).

2 On a distributional flow property for stochastic differential equations (SDEs)

In our analysis of the proposed MLP algorithm in Section 3 below, we will make use of random fields which satisfy a certain flow-type condition (see (154) in Setting 3.1 below). The main intent of this section is to establish that solution processes of SDEs enjoy, under suitable conditions (see Lemma 2.19 below for details), this flow-type property. To rigorously prove this result we need a series of elementary and well-known results, presented in Subsections 2.1–2.7 below, many of which will be reused in Section 3.

2.1 Time-discrete Gronwall inequalities

In this subsection we present elementary and well-known Gronwall inequalities (cf., e.g., Agarwal [1]).

Lemma 2.1.

Let $N\in\mathbb{N}$ , $\alpha\in[0,\infty)$ , $(\beta_{n})_{n\in\{0,1,2,\ldots,N-1\}}\subseteq[0,\infty)$ , $(\epsilon_{n})_{n\in\{0,1,2,\ldots,N\}}\subseteq[0,\infty]$ satisfy for all $n\in\{0,1,2,\ldots,N\}$ that

[TABLE]

Then it holds for all $n\in\{0,1,2,\ldots,N\}$ that

[TABLE]

Proof of Lemma 2.1.

Throughout this proof let $(u_{n})_{n\in\{0,1,2,\ldots,N\}}\subseteq[0,\infty]$ be the extended real numbers which satisfy for all $n\in\{0,1,2,\ldots,N\}$ that

[TABLE]

We claim that for all $n\in\{0,1,2,\ldots,N\}$ it holds that

[TABLE]

We now prove (10) by induction on $n\in\{0,1,2,\ldots,N\}$ . For the base case $n=0$ observe that (9) ensures that

[TABLE]

This proves (10) in the base case $n=0$ . For the induction step $\{0,1,2,\ldots,N-1\}\ni(n-1)\to n\in\{1,2,\ldots,N\}$ observe that (9) implies that for all $n\in\{1,2,\ldots,N\}$ with $u_{n-1}=\alpha\left[\prod_{k=0}^{n-2}(1+\beta_{k})\right]$ it holds that

[TABLE]

Induction thus establishes (10). Moreover, note that (7), (9), and induction prove that for all $n\in\{0,1,2,\ldots,N\}$ it holds that

[TABLE]

This and (10) establish that for all $n\in\{0,1,2,\ldots,N\}$ it holds that

[TABLE]

The fact that for all $x\in\mathbb{R}$ it holds that $(1+x)\leq\exp(x)$ therefore ensures that for all $n\in\{0,1,2,\ldots,N\}$ it holds that

[TABLE]

The proof of Lemma 2.1 is thus completed. ∎
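The induction argument in the proof above can be checked numerically. The following Python sketch assumes that the hypothesis of Lemma 2.1 has the standard form $\epsilon_{n}\leq\alpha+\sum_{k=0}^{n-1}\beta_{k}\epsilon_{k}$ (the displayed formula is not reproduced here, so this form is an assumption); the function names are hypothetical. It computes the extremal sequence $(u_{n})$ for which equality holds and compares it with the product bound and the exponential bound.

```python
import math

def gronwall_extremal(alpha, betas):
    """Extremal sequence u_n = alpha + sum_{k<n} beta_k * u_k for n = 0, ..., N."""
    u = []
    for n in range(len(betas) + 1):
        u.append(alpha + sum(betas[k] * u[k] for k in range(n)))
    return u

def gronwall_bounds(alpha, betas):
    """Product bound alpha * prod(1 + beta_k) and exponential bound alpha * exp(sum beta_k)."""
    prod_bound, exp_bound = [], []
    for n in range(len(betas) + 1):
        prod_bound.append(alpha * math.prod(1.0 + b for b in betas[:n]))
        exp_bound.append(alpha * math.exp(sum(betas[:n])))
    return prod_bound, exp_bound

u = gronwall_extremal(2.0, [0.5, 0.1, 0.3, 0.0])
prod_bound, exp_bound = gronwall_bounds(2.0, [0.5, 0.1, 0.3, 0.0])
```

In this example the extremal sequence coincides with the product bound, as the induction step suggests, and the exponential bound is weaker because $1+x\leq\exp(x)$.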

Corollary 2.2.

Let $N\in\mathbb{N}\cup\{\infty\}$ , $\alpha,\beta\in[0,\infty)$ , $(\epsilon_{n})_{n\in\mathbb{N}_{0}\cap[0,N]}\subseteq[0,\infty]$ satisfy for all $n\in\mathbb{N}_{0}\cap[0,N]$ that

[TABLE]

Then it holds for all $n\in\mathbb{N}_{0}\cap[0,N]$ that

[TABLE]

Proof of Corollary 2.2.

Note that Lemma 2.1 establishes Corollary 2.2. The proof of Corollary 2.2 is thus completed. ∎

2.2 A priori moment bounds for solutions of SDEs

In this subsection we establish, in the elementary result in Lemma 2.6 below, for every $p\in[0,\infty)$ a bound on the $p$ -th absolute moment of the solution of an SDE with deterministic initial value, a one-sided linear growth condition on the drift coefficient of the SDE, and a linear growth condition on the diffusion coefficient of the SDE (cf. (43) in Lemma 2.6 below). Our proof of Lemma 2.6 employs standard Lyapunov-type techniques from the literature to establish the desired a priori moment bound (cf., e.g., Cox et al. [22, Section 2.2]).

Lemma 2.3.

Let $d,m\in\mathbb{N}$ , $T,C_{1},C_{2}\in[0,\infty)$ , let $\left<\cdot,\cdot\right>\colon\mathbb{R}^{d}\times\mathbb{R}^{d}\to\mathbb{R}$ be the Euclidean scalar product on $\mathbb{R}^{d}$ , let $\left\|\cdot\right\|\colon\mathbb{R}^{d}\to[0,\infty)$ be the Euclidean norm on $\mathbb{R}^{d}$ , let ${\left|\kern-1.07639pt\left|\kern-1.07639pt\left|\cdot\right|\kern-1.07639pt\right|\kern-1.07639pt\right|}\colon\mathbb{R}^{d\times m}\to[0,\infty)$ be the Frobenius norm on $\mathbb{R}^{d\times m}$ , and let $\mu\colon[0,T]\times\mathbb{R}^{d}\to\mathbb{R}^{d}$ , $\sigma\colon[0,T]\times\mathbb{R}^{d}\to\mathbb{R}^{d\times m}$ , and $V_{p}\colon\mathbb{R}^{d}\to(0,\infty)$ , $p\in[2,\infty)$ , be functions which satisfy for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ , $p\in[2,\infty)$ that

[TABLE]

Then

(i) it holds for all $p\in[2,\infty)$ that $V_{p}\in C^{\infty}(\mathbb{R}^{d},(0,\infty))$ and

(ii) it holds for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ , $p\in[2,\infty)$ that

[TABLE]

Proof of Lemma 2.3.

Throughout this proof let $\sigma_{i,j}\colon[0,T]\times\mathbb{R}^{d}\to\mathbb{R}$ , $i\in\{1,2,\ldots,d\}$ , $j\in\{1,2,\ldots,m\}$ , be the functions which satisfy for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ that

[TABLE]

Note that the chain rule, the fact that the function $\mathbb{R}^{d}\ni x\mapsto 1+\left\|x\right\|^{2}\in(0,\infty)$ is infinitely often differentiable, and the fact that for every $p\in[2,\infty)$ the function $(0,\infty)\ni s\mapsto s^{\frac{p}{2}}\in(0,\infty)$ is infinitely often differentiable establish item (i). It thus remains to prove item (ii). For this, observe that the chain rule ensures that for all $x=(x_{1},\ldots,x_{d})\in\mathbb{R}^{d}$ , $i,j\in\{1,2,\ldots,d\}$ , $p\in[2,\infty)$ it holds that

[TABLE]

and

[TABLE]

This implies that for all $t\in[0,T]$ , $x=(x_{1},\ldots,x_{d})\in\mathbb{R}^{d}$ , $p\in[2,\infty)$ it holds that

[TABLE]

In addition, note that the Cauchy-Schwarz inequality assures that for all $t\in[0,T]$ , $x=(x_{1},\ldots,x_{d})\in\mathbb{R}^{d}$ it holds that

[TABLE]

This, (18), and (23) demonstrate that for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ , $p\in[2,\infty)$ it holds that

[TABLE]

Young’s inequality (with $p=\nicefrac{{p}}{{2}}$ , $q=\nicefrac{{p}}{{(p-2)}}=\frac{\nicefrac{{p}}{{2}}}{\nicefrac{{p}}{{2}}-1}$ for $p\in(2,\infty)$ in the usual notation of Young’s inequality) hence proves that for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ , $p\in(2,\infty)$ it holds that

[TABLE]

Moreover, note that (25) ensures that for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ it holds that

[TABLE]

Combining this and (26) establishes item (ii). The proof of Lemma 2.3 is thus completed. ∎
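The chain rule computations in the proof above can be made concrete as follows. The Python sketch assumes that the Lyapunov-type functions have the form $V_{p}(x)=(1+\|x\|^{2})^{p/2}$ , which is suggested by the proof of item (i) but not displayed here; the function names are hypothetical. It implements the gradient, the Hessian, and the resulting generator expression $\langle(\nabla V_{p})(x),\mu\rangle+\tfrac{1}{2}\operatorname{Trace}(\sigma\sigma^{*}(\operatorname{Hess}V_{p})(x))$ for given values of the coefficients.

```python
import numpy as np

def V(x, p):
    """Assumed Lyapunov-type function V_p(x) = (1 + ||x||^2)^(p/2)."""
    return (1.0 + x @ x) ** (p / 2.0)

def grad_V(x, p):
    """Gradient obtained from the chain rule."""
    return p * (1.0 + x @ x) ** (p / 2.0 - 1.0) * x

def hess_V(x, p):
    """Hessian obtained from the chain rule and the product rule."""
    s = 1.0 + x @ x
    return (p * s ** (p / 2.0 - 1.0) * np.eye(len(x))
            + p * (p - 2.0) * s ** (p / 2.0 - 2.0) * np.outer(x, x))

def generator_V(x, p, mu, sigma):
    """<grad V_p(x), mu> + 0.5 * trace(sigma sigma^T Hess V_p(x)) for a drift value mu and a d x m matrix sigma."""
    return grad_V(x, p) @ mu + 0.5 * np.trace(sigma @ sigma.T @ hess_V(x, p))
```

A finite-difference comparison is a simple way to validate the two derivative formulas before they are used in a bound of the type stated in item (ii).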

Lemma 2.4.

Let $d,m\in\mathbb{N}$ , $T,\rho\in[0,\infty)$ , $\xi\in\mathbb{R}^{d}$ , let $\left<\cdot,\cdot\right>\colon\mathbb{R}^{d}\times\mathbb{R}^{d}\to\mathbb{R}$ be the Euclidean scalar product on $\mathbb{R}^{d}$ , let $\mu\in C([0,T]\times\mathbb{R}^{d},\mathbb{R}^{d})$ , $\sigma\in C([0,T]\times\mathbb{R}^{d},\mathbb{R}^{d\times m})$ , $V\in C^{2}(\mathbb{R}^{d},(0,\infty))$ satisfy for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ that

[TABLE]

let $(\Omega,\mathcal{F},\mathbb{P},(\mathbb{F}_{t})_{t\in[0,T]})$ be a filtered probability space which satisfies the usual conditions, let $W\colon[0,T]\times\Omega\to\mathbb{R}^{m}$ be a standard $(\Omega,\mathcal{F},\mathbb{P},(\mathbb{F}_{t})_{t\in[0,T]})$ -Brownian motion, and let $X\colon[0,T]\times\Omega\to\mathbb{R}^{d}$ be an $(\mathbb{F}_{t})_{t\in[0,T]}$ / $\mathcal{B}(\mathbb{R}^{d})$ -adapted stochastic process with continuous sample paths which satisfies that for all $t\in[0,T]$ it holds $\mathbb{P}$ -a.s. that

[TABLE]

Then it holds for all $t\in[0,T]$ that

[TABLE]

Proof of Lemma 2.4.

Throughout this proof assume w.l.o.g. that $T>0$ and let $\mathbb{V}\colon[0,T]\times\mathbb{R}^{d}\to(0,\infty)$ be the function which satisfies for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ that

[TABLE]

Note that the fact that $V\in C^{2}(\mathbb{R}^{d},(0,\infty))$ ensures that for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ it holds that

(I) $\mathbb{V}\in C^{2}([0,T]\times\mathbb{R}^{d},(0,\infty))$ ,

(II) $(\tfrac{\partial\mathbb{V}}{\partial t})(t,x)=-\rho$ ,

(III) $(\nabla_{x}\mathbb{V})(t,x)=(\nabla V)(x)$ , and

(IV) $(\operatorname{Hess}_{x}\mathbb{V})(t,x)=(\operatorname{Hess}V)(x)$ .

Observe that items (II)–(IV) and (28) show that for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ it holds that

[TABLE]

Combining this with Itô’s formula demonstrates that for all $t\in[0,T]$ it holds that

[TABLE]

Therefore, we obtain that for all $t\in[0,T]$ it holds that

[TABLE]

The proof of Lemma 2.4 is thus completed. ∎

Lemma 2.5.

Let $d,m\in\mathbb{N}$ , $T,\rho_{1},\rho_{2}\in[0,\infty)$ , $\xi\in\mathbb{R}^{d}$ , let $\left<\cdot,\cdot\right>\colon\mathbb{R}^{d}\times\mathbb{R}^{d}\to\mathbb{R}$ be the Euclidean scalar product on $\mathbb{R}^{d}$ , let $\mu\in C([0,T]\times\mathbb{R}^{d},\mathbb{R}^{d})$ , $\sigma\in C([0,T]\times\mathbb{R}^{d},\mathbb{R}^{d\times m})$ , $V\in C^{2}(\mathbb{R}^{d},(0,\infty))$ satisfy for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ that

[TABLE]

let $(\Omega,\mathcal{F},\mathbb{P},(\mathbb{F}_{t})_{t\in[0,T]})$ be a filtered probability space which satisfies the usual conditions, let $W\colon[0,T]\times\Omega\to\mathbb{R}^{m}$ be a standard $(\Omega,\mathcal{F},\mathbb{P},(\mathbb{F}_{t})_{t\in[0,T]})$ -Brownian motion, and let $X\colon[0,T]\times\Omega\to\mathbb{R}^{d}$ be an $(\mathbb{F}_{t})_{t\in[0,T]}$ / $\mathcal{B}(\mathbb{R}^{d})$ -adapted stochastic process with continuous sample paths which satisfies that for all $t\in[0,T]$ it holds $\mathbb{P}$ -a.s. that

[TABLE]

Then it holds for all $t\in[0,T]$ that

[TABLE]

Proof of Lemma 2.5.

Throughout this proof assume w.l.o.g. that $\rho_{1}>0$ (cf. Lemma 2.4) and that $T>0$ and let $\mathbb{V}\colon[0,T]\times\mathbb{R}^{d}\to(0,\infty)$ be the function which satisfies for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ that

[TABLE]

Note that the fact that $V\in C^{2}(\mathbb{R}^{d},(0,\infty))$ ensures that for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ it holds that

(I) $\mathbb{V}\in C^{2}([0,T]\times\mathbb{R}^{d},(0,\infty))$ ,

(II) $(\tfrac{\partial\mathbb{V}}{\partial t})(t,x)=-\rho_{1}e^{-\rho_{1}t}(V(x)+\tfrac{\rho_{2}}{\rho_{1}})$ ,

(III) $(\nabla_{x}\mathbb{V})(t,x)=e^{-\rho_{1}t}(\nabla V)(x)$ , and

(IV) $(\operatorname{Hess}_{x}\mathbb{V})(t,x)=e^{-\rho_{1}t}(\operatorname{Hess}V)(x)$ .

Observe that items (II)–(IV) and (35) assure that for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ it holds that

[TABLE]

Combining this with Itô’s formula demonstrates that for all $t\in[0,T]$ it holds that

[TABLE]

Therefore, we obtain that for all $t\in[0,T]$ it holds that

[TABLE]

The fact that for all $a\in\mathbb{R}$ it holds that $e^{a}-1\leq ae^{a}$ hence ensures that for all $t\in[0,T]$ it holds that

[TABLE]

The proof of Lemma 2.5 is thus completed. ∎

Lemma 2.6.

Let $d,m\in\mathbb{N}$ , $T,C_{1},C_{2}\in[0,\infty)$ , $\xi\in\mathbb{R}^{d}$ , let $\left<\cdot,\cdot\right>\colon\mathbb{R}^{d}\times\mathbb{R}^{d}\to\mathbb{R}$ be the Euclidean scalar product on $\mathbb{R}^{d}$ , let $\left\|\cdot\right\|\colon\mathbb{R}^{d}\to[0,\infty)$ be the Euclidean norm on $\mathbb{R}^{d}$ , let ${\left|\kern-1.07639pt\left|\kern-1.07639pt\left|\cdot\right|\kern-1.07639pt\right|\kern-1.07639pt\right|}\colon\mathbb{R}^{d\times m}\to[0,\infty)$ be the Frobenius norm on $\mathbb{R}^{d\times m}$ , let $\mu\in C([0,T]\times\mathbb{R}^{d},\mathbb{R}^{d})$ , $\sigma\in C([0,T]\times\mathbb{R}^{d},\mathbb{R}^{d\times m})$ satisfy for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ that

[TABLE]

let $(\Omega,\mathcal{F},\mathbb{P},(\mathbb{F}_{t})_{t\in[0,T]})$ be a filtered probability space which satisfies the usual conditions, let $W\colon[0,T]\times\Omega\to\mathbb{R}^{m}$ be a standard $(\Omega,\mathcal{F},\mathbb{P},(\mathbb{F}_{t})_{t\in[0,T]})$ -Brownian motion, and let $X\colon[0,T]\times\Omega\to\mathbb{R}^{d}$ be an $(\mathbb{F}_{t})_{t\in[0,T]}$ / $\mathcal{B}(\mathbb{R}^{d})$ -adapted stochastic process with continuous sample paths which satisfies that for all $t\in[0,T]$ it holds $\mathbb{P}$ -a.s. that

[TABLE]

Then it holds for all $p\in[0,\infty)$ , $t\in[0,T]$ that

[TABLE]

Proof of Lemma 2.6.

Throughout this proof let $(\rho_{1}^{(p)})_{p\in[2,\infty)},(\rho_{2}^{(p)})_{p\in[2,\infty)}\subseteq[0,\infty)$ satisfy for all $p\in[2,\infty)$ that

[TABLE]

and let $V_{p}\colon\mathbb{R}^{d}\to(0,\infty)$ , $p\in[2,\infty)$ , be the functions which satisfy for all $p\in[2,\infty)$ , $x\in\mathbb{R}^{d}$ that

[TABLE]

Observe that Lemma 2.3 and (43) assure that for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ , $p\in[2,\infty)$ it holds that $V_{p}\in C^{\infty}(\mathbb{R}^{d},(0,\infty))$ and

[TABLE]

Lemma 2.5 hence implies that for all $t\in[0,T]$ , $p\in[2,\infty)$ it holds that

[TABLE]

This, Jensen’s inequality, and the fact that for all $p\in[0,2]$ it holds that $3^{\nicefrac{{p}}{{2}}}\leq p+1$ assure that for all $t\in[0,T]$ , $p\in[0,2)$ it holds that

[TABLE]

Combining this with (49) implies (45). The proof of Lemma 2.6 is thus completed. ∎

2.3 Temporal regularity properties for solutions of SDEs

For the proof of our strong $L^{2}$ -error estimates for Euler-Maruyama approximations in Subsection 2.4 we need Lemma 2.8 below, which asserts that, under suitable conditions (see Lemma 2.8 below for details), solutions of SDEs have a certain temporal regularity property. To prove Lemma 2.8 we employ (without providing a proof) a well-known temporal regularity property for solutions of SDEs from the literature stated in Lemma 2.7 below (cf., e.g., Da Prato et al. [29, Proposition 3], Cox et al. [23, Corollary 3.8], and Jentzen et al. [63, Proposition 5.1]). Additionally, we offer in Lemma 2.10 below a self-contained proof of an explicit temporal regularity estimate for solutions of SDEs with deterministic initial values which will be used in Subsection 2.8.

Lemma 2.7 (Temporal regularity of solutions of time-homogeneous SDEs).

Let $d,m\in\mathbb{N}$ , $T\in(0,\infty)$ , let $\left\|\cdot\right\|\colon\mathbb{R}^{d}\to[0,\infty)$ be the Euclidean norm on $\mathbb{R}^{d}$ , let $(\Omega,\mathcal{F},\mathbb{P},(\mathbb{F}_{t})_{t\in[0,T]})$ be a filtered probability space which satisfies the usual conditions, let $W\colon[0,T]\times\Omega\to\mathbb{R}^{m}$ be a standard $(\Omega,\mathcal{F},\mathbb{P},(\mathbb{F}_{t})_{t\in[0,T]})$ -Brownian motion, let $\mu\colon\mathbb{R}^{d}\to\mathbb{R}^{d}$ , $\sigma\colon\mathbb{R}^{d}\to\mathbb{R}^{d\times m}$ be globally Lipschitz continuous functions, and let $X\colon[0,T]\times\Omega\to\mathbb{R}^{d}$ be an $(\mathbb{F}_{t})_{t\in[0,T]}$ / $\mathcal{B}(\mathbb{R}^{d})$ -adapted stochastic process with continuous sample paths which satisfies that $\mathbb{E}\!\left[\left\|X_{0}\right\|^{2}\right]<\infty$ and which satisfies that for all $t\in[0,T]$ it holds $\mathbb{P}$ -a.s. that

[TABLE]

Then it holds that

[TABLE]

Lemma 2.8 (Temporal regularity of solutions of time-inhomogeneous SDEs).

Let $d,m\in\mathbb{N}$ , $T\in(0,\infty)$ , $L\in[0,\infty)$ , let $\left\|\cdot\right\|\colon\mathbb{R}^{d}\to[0,\infty)$ be the Euclidean norm on $\mathbb{R}^{d}$ , let $(\Omega,\mathcal{F},\mathbb{P},(\mathbb{F}_{t})_{t\in[0,T]})$ be a filtered probability space which satisfies the usual conditions, let $W\colon[0,T]\times\Omega\to\mathbb{R}^{m}$ be a standard $(\Omega,\mathcal{F},\mathbb{P},(\mathbb{F}_{t})_{t\in[0,T]})$ -Brownian motion, let $\mu\colon[0,T]\times\mathbb{R}^{d}\to\mathbb{R}^{d}$ and $\sigma\colon[0,T]\times\mathbb{R}^{d}\to\mathbb{R}^{d\times m}$ be globally Lipschitz continuous functions, and let $X\colon[0,T]\times\Omega\to\mathbb{R}^{d}$ be an $(\mathbb{F}_{t})_{t\in[0,T]}$ / $\mathcal{B}(\mathbb{R}^{d})$ -adapted stochastic process with continuous sample paths which satisfies that $\mathbb{E}\!\left[\left\|X_{0}\right\|^{2}\right]<\infty$ and which satisfies that for all $t\in[0,T]$ it holds $\mathbb{P}$ -a.s. that

[TABLE]

Then it holds that

[TABLE]

Proof of Lemma 2.8.

Throughout this proof let ${\left|\kern-1.07639pt\left|\kern-1.07639pt\left|\cdot\right|\kern-1.07639pt\right|\kern-1.07639pt\right|}\colon\mathbb{R}^{d+1}\to[0,\infty)$ be the Euclidean norm on $\mathbb{R}^{d+1}$ , let $Y\colon[0,T]\times\Omega\to\mathbb{R}^{d+1}$ be the stochastic process which satisfies for all $t\in[0,T]$ that

[TABLE]

and let $\tilde{\mu}\colon\mathbb{R}^{d+1}\to\mathbb{R}^{d+1}$ and $\tilde{\sigma}\colon\mathbb{R}^{d+1}\to\mathbb{R}^{(d+1)\times m}$ be the functions which satisfy for all $y=(y_{1},y_{2},\ldots,y_{d+1})\in\mathbb{R}^{d+1}$ that

[TABLE]

Observe that the hypothesis that $\mu$ and $\sigma$ are globally Lipschitz continuous functions and the fact that $\mathbb{R}\ni y\mapsto\min\{\max\{y,0\},T\}\in\mathbb{R}$ is a globally Lipschitz continuous function assure that $\tilde{\mu}$ and $\tilde{\sigma}$ are globally Lipschitz continuous functions. Moreover, note that it holds for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ that

[TABLE]

This and (53) assure that for all $t\in[0,T]$ it holds $\mathbb{P}$ -a.s. that

[TABLE]

The fact that $\tilde{\mu}$ and $\tilde{\sigma}$ are globally Lipschitz continuous functions and Lemma 2.7 (with $d=d+1$ , $m=m$ , $T=T$ , $\mu=\tilde{\mu}$ , $\sigma=\tilde{\sigma}$ , $X=Y$ in the notation of Lemma 2.7) hence prove that

[TABLE]

Hence, we obtain that

[TABLE]

The proof of Lemma 2.8 is thus completed. ∎
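The reduction used in the proof above, which appends the time variable to the state so that a time-inhomogeneous SDE becomes a time-homogeneous one, can be sketched in Python as follows. The displayed definitions of $Y$ , $\tilde{\mu}$ , and $\tilde{\sigma}$ are not reproduced here, so the layout chosen below (time stored in the last coordinate, unit drift and no noise in that coordinate) and the helper name `augment` are assumptions for illustration.

```python
import numpy as np

def augment(mu, sigma, T):
    """Build time-homogeneous coefficients on R^(d+1) from mu(t, x) and sigma(t, x).

    Assumed layout: y = (x_1, ..., x_d, s), where s plays the role of time.
    """
    def clip(s):
        # The map s -> min(max(s, 0), T) keeps the time argument inside [0, T]
        # and is globally Lipschitz continuous.
        return min(max(s, 0.0), T)

    def mu_tilde(y):
        x, s = y[:-1], clip(y[-1])
        return np.append(mu(s, x), 1.0)  # the time coordinate moves with unit speed

    def sigma_tilde(y):
        x, s = y[:-1], clip(y[-1])
        sig = sigma(s, x)
        return np.vstack([sig, np.zeros((1, sig.shape[1]))])  # no noise in the time coordinate

    return mu_tilde, sigma_tilde
```

With this construction the augmented coefficients agree with the original ones along $(X_{t},t)$ for $t\in[0,T]$ , and the clipping is what makes them well defined and globally Lipschitz continuous on all of $\mathbb{R}^{d+1}$ .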

The following very elementary and well-known result will be helpful in the proof of Lemma 2.10 below and will be repeatedly used throughout this paper.

Lemma 2.9 (A consequence of Hölder's inequality).

Let $(\Omega,\mathcal{F},\mu)$ be a measure space and let $f\colon\Omega\to[0,\infty]$ be an $\mathcal{F}/\mathcal{B}([0,\infty])$ -measurable function. Then

[TABLE]

Proof of Lemma 2.9.

Note that Hölder's inequality demonstrates that

[TABLE]

The proof of Lemma 2.9 is thus completed. ∎

Lemma 2.10 (Explicit temporal regularity for solutions of SDEs with deterministic initial values).

Let $d,m\in\mathbb{N}$ , $T\in(0,\infty)$ , $L\in[0,\infty)$ , $\xi\in\mathbb{R}^{d}$ , let $\left\|\cdot\right\|\colon\mathbb{R}^{d}\to[0,\infty)$ be the Euclidean norm on $\mathbb{R}^{d}$ , let ${\left|\kern-1.07639pt\left|\kern-1.07639pt\left|\cdot\right|\kern-1.07639pt\right|\kern-1.07639pt\right|}\colon\mathbb{R}^{d\times m}\to[0,\infty)$ be the Frobenius norm on $\mathbb{R}^{d\times m}$ , let $(\Omega,\mathcal{F},\mathbb{P},(\mathbb{F}_{t})_{t\in[0,T]})$ be a filtered probability space which satisfies the usual conditions, let $W\colon[0,T]\times\Omega\to\mathbb{R}^{m}$ be a standard $(\Omega,\mathcal{F},\mathbb{P},(\mathbb{F}_{t})_{t\in[0,T]})$ -Brownian motion, let $\mu\colon[0,T]\times\mathbb{R}^{d}\to\mathbb{R}^{d}$ , $\sigma\colon[0,T]\times\mathbb{R}^{d}\to\mathbb{R}^{d\times m}$ be functions which satisfy for all $t,s\in[0,T]$ , $x,y\in\mathbb{R}^{d}$ that

[TABLE]

and let $X\colon[0,T]\times\Omega\to\mathbb{R}^{d}$ be an $(\mathbb{F}_{t})_{t\in[0,T]}$ / $\mathcal{B}(\mathbb{R}^{d})$ -adapted stochastic process with continuous sample paths which satisfies that for all $t\in[0,T]$ it holds $\mathbb{P}$ -a.s. that

[TABLE]

Then it holds that

[TABLE]

Proof of Lemma 2.10.

Throughout this proof let $\left<\cdot,\cdot\right>\colon\mathbb{R}^{d}\times\mathbb{R}^{d}\to\mathbb{R}$ be the Euclidean scalar product on $\mathbb{R}^{d}$ and let $C\in(0,\infty)$ be given by

[TABLE]

Note that (64) and the triangle inequality assure that for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ it holds that

[TABLE]

This assures that for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ it holds that

[TABLE]

In addition, note that (69) implies that for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ it holds that

[TABLE]

Moreover, observe that (65), Lemma 2.9, Tonelli’s theorem, and Itô’s isometry demonstrate that for all $t\in[0,T]$ , $s\in[t,T]$ it holds that

[TABLE]

The triangle inequality, (68), and (69) therefore ensure that for all $t\in[0,T]$ , $s\in[t,T]$ it holds that

[TABLE]

Furthermore, note that (70), (71), (65), and Lemma 2.6 (with $d=d$ , $m=m$ , $T=T$ , $C_{1}=C$ , $C_{2}=C$ , $\xi=\xi$ , $\mu=\mu$ , $\sigma=\sigma$ , $X=X$ in the notation of Lemma 2.6) assure that for all $t\in[0,T]$ it holds that

[TABLE]

This, (73), the fact that $C\geq 1$ , the fact that for all $x\in[0,\infty)$ it holds that $\max\{x,1+x\}\leq e^{x}$ , and the fact that for all $x,y\in[0,\infty)$ it holds that $\sqrt{x+y}\leq\sqrt{x}+\sqrt{y}$ demonstrate that for all $t\in[0,T]$ , $s\in[t,T]$ it holds that

[TABLE]

This implies (66). The proof of Lemma 2.10 is thus completed. ∎

2.4 Strong error estimates for Euler-Maruyama approximations

Our proof of the flow-type property of solutions of SDEs in Subsection 2.8 below makes use of Euler-Maruyama approximations of solutions. For that reason we present in this subsection explicit strong $L^{2}$ -error estimates for Euler-Maruyama approximations in Proposition 2.11 and Corollary 2.12 below. The results in this subsection are essentially well-known (cf., e.g., Kloeden & Platen [67, Chapter 10] and Milstein [77]).

Proposition 2.11 (Strong convergence of the Euler-Maruyama method).

Let $d,m,N\in\mathbb{N}$ , $T\in(0,\infty)$ , $L\in[0,\infty)$ , let $\left\|\cdot\right\|\colon\mathbb{R}^{d}\to[0,\infty)$ be the Euclidean norm on $\mathbb{R}^{d}$ , let ${\left|\kern-1.07639pt\left|\kern-1.07639pt\left|\cdot\right|\kern-1.07639pt\right|\kern-1.07639pt\right|}\colon\mathbb{R}^{d\times m}\to[0,\infty)$ be the Frobenius norm on $\mathbb{R}^{d\times m}$ , let $(\Omega,\mathcal{F},\mathbb{P},(\mathbb{F}_{t})_{t\in[0,T]})$ be a filtered probability space which satisfies the usual conditions, let $W\colon[0,T]\times\Omega\to\mathbb{R}^{m}$ be a standard $(\Omega,\mathcal{F},\mathbb{P},(\mathbb{F}_{t})_{t\in[0,T]})$ -Brownian motion, let $\zeta\colon\Omega\to\mathbb{R}^{d}$ be an $\mathbb{F}_{0}$ / $\mathcal{B}(\mathbb{R}^{d})$ -measurable function which satisfies that $\mathbb{E}\!\left[\left\|\zeta\right\|^{2}\right]<\infty$ , let $\mu\colon[0,T]\times\mathbb{R}^{d}\to\mathbb{R}^{d}$ , $\sigma\colon[0,T]\times\mathbb{R}^{d}\to\mathbb{R}^{d\times m}$ be functions which satisfy for all $t,s\in[0,T]$ , $x,y\in\mathbb{R}^{d}$ that

[TABLE]

let $X\colon[0,T]\times\Omega\to\mathbb{R}^{d}$ be an $(\mathbb{F}_{t})_{t\in[0,T]}$ / $\mathcal{B}(\mathbb{R}^{d})$ -adapted stochastic process with continuous sample paths which satisfies that $\mathbb{E}\!\left[\left\|X_{0}\right\|^{2}\right]<\infty$ and which satisfies that for all $t\in[0,T]$ it holds $\mathbb{P}$ -a.s. that

[TABLE]

let $t_{0},t_{1},\ldots,t_{N}\in[0,T]$ satisfy that

[TABLE]

and let $\mathcal{X}\colon\{0,1,\ldots,N\}\times\Omega\to\mathbb{R}^{d}$ be the stochastic process which satisfies for all $n\in\{1,2,\ldots,N\}$ that

[TABLE]

Then it holds that

[TABLE]

Proof of Proposition 2.11.

Throughout this proof assume w.l.o.g. that $t_{0}<t_{1}<t_{2}<\ldots<t_{N}$ , let $(h_{n})_{n\in\{1,2,\ldots,N\}}\subseteq(0,T]$ , $H\in(0,T]$ , $K\in[0,\infty]$ satisfy for all $n\in\{1,2,\ldots,N\}$ that

[TABLE]

let $\mathfrak{t}\colon[0,T]\to\{t_{0},t_{1},t_{2},\ldots,t_{N}\}$ be the function which satisfies for all $s\in[0,T]$ that

[TABLE]

and let $\mathfrak{n}\colon[0,T]\to\{0,1,2,\ldots,N\}$ be the function which satisfies for all $s\in[0,T]$ that

[TABLE]

Note that the hypothesis that $\mathbb{E}\!\left[\left\|X_{0}\right\|^{2}\right]<\infty$ , the fact that $\mu$ and $\sigma$ are globally Lipschitz continuous functions, (77), and Lemma 2.8 imply that $K<\infty$ . Next observe that (79) and induction assure that for all $n\in\{0,1,2,\dots,N\}$ it holds $\mathbb{P}$ -a.s. that

[TABLE]

This and (77) imply that for all $n\in\{0,1,2,\dots,N\}$ it holds $\mathbb{P}$ -a.s. that

[TABLE]

The triangle inequality hence proves that for all $n\in\{0,1,2,\dots,N\}$ it holds that

[TABLE]

Lemma 2.9, Tonelli’s Theorem, and Itô’s isometry therefore imply that for all $n\in\{0,1,2,\dots,N\}$ it holds that

[TABLE]

This and (76) show that for all $n\in\{0,1,2,\dots,N\}$ it holds that

[TABLE]

This, the triangle inequality, and the fact that for all $s\in[0,T]$ it holds that $|s-\mathfrak{t}(s)|\leq H$ imply that for all $n\in\{0,1,2,\dots,N\}$ it holds that

[TABLE]

The fact that for all $x,y\in[0,\infty)$ it holds that $(x+y)^{2}\leq 2x^{2}+2y^{2}$ hence proves that for all $n\in\{0,1,2,\dots,N\}$ it holds that

[TABLE]

The discrete Gronwall-type inequality in Lemma 2.1 (with $N=N$ , $\alpha=2\big{(}\left(\mathbb{E}\!\left[\|X_{0}-\zeta\|^{2}\right]\right)^{\!\nicefrac{{1}}{{2}}}\allowbreak+L(1+\sqrt{T})[\sqrt{T}H+(\int_{0}^{T}\mathbb{E}\!\left[\|X_{s}-X_{\mathfrak{t}(s)}\|^{2}\right]ds)^{\!\nicefrac{{1}}{{2}}}]\big{)}^{2}$ , $(\beta_{n})_{n\in\{0,1,2,\ldots,N-1\}}=(2L^{2}(1+\sqrt{T})^{2}h_{n+1})_{n\in\{0,1,2,\ldots,N-1\}}$ , $(\epsilon_{n})_{n\in\{0,1,2,\ldots,N\}}=(\mathbb{E}\!\left[\left\|X_{t_{n}}-\mathcal{X}_{n}\right\|^{2}\right])_{n\in\{0,1,2,\ldots,N\}}$ in the notation of Lemma 2.1) and the fact that $\sum_{k=1}^{N}h_{k}=T$ therefore show that

[TABLE]

This and the fact that for all $s\in[0,T]$ it holds that $|s-\mathfrak{t}(s)|\leq H$ imply that

[TABLE]

The fact that $H\leq\sqrt{T}\sqrt{H}$ hence assures that

[TABLE]

This implies (80). The proof of Proposition 2.11 is thus completed. ∎
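The Euler-Maruyama recursion analyzed in Proposition 2.11 can be sketched in Python as follows. The sketch assumes the standard form of the scheme, $\mathcal{X}_{n}=\mathcal{X}_{n-1}+\mu(t_{n-1},\mathcal{X}_{n-1})(t_{n}-t_{n-1})+\sigma(t_{n-1},\mathcal{X}_{n-1})(W_{t_{n}}-W_{t_{n-1}})$ with $\mathcal{X}_{0}=\zeta$ , since the displayed definition is not reproduced here. The helper `strong_error` and its geometric Brownian motion test problem (parameters `a`, `b`, `x0`) are hypothetical and serve only to estimate a root mean square error at the final time.

```python
import numpy as np

def euler_maruyama(mu, sigma, zeta, grid, dW):
    """Euler-Maruyama approximation at the last grid point.

    mu(t, x) returns a vector in R^d, sigma(t, x) a d x m matrix, grid holds
    t_0 < t_1 < ... < t_N, and dW[n-1] is the Brownian increment W_{t_n} - W_{t_{n-1}}.
    """
    x = np.array(zeta, dtype=float)
    for n in range(1, len(grid)):
        h = grid[n] - grid[n - 1]
        x = x + mu(grid[n - 1], x) * h + sigma(grid[n - 1], x) @ dW[n - 1]
    return x

def strong_error(N, paths, rng, a=0.5, b=0.8, T=1.0, x0=1.0):
    """Monte Carlo estimate of the L^2-error at time T for a scalar geometric Brownian motion."""
    grid = np.linspace(0.0, T, N + 1)
    mu = lambda t, x: a * x
    sigma = lambda t, x: b * x.reshape(1, 1)  # d = m = 1
    sq = 0.0
    for _ in range(paths):
        dW = rng.normal(0.0, np.sqrt(T / N), size=(N, 1))
        approx = euler_maruyama(mu, sigma, [x0], grid, dW)
        exact = x0 * np.exp((a - 0.5 * b * b) * T + b * dW.sum())  # exact solution on the same path
        sq += float((approx[0] - exact) ** 2)
    return np.sqrt(sq / paths)
```

Comparing, for instance, `strong_error(8, 300, rng)` with `strong_error(128, 300, rng)` for a generator `rng = np.random.default_rng(1)` gives a rough empirical impression of how the error decreases with the maximal step size; it does not replace the explicit estimate in (80).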

Corollary 2.12.

Let $d,m,N\in\mathbb{N}$ , $T\in(0,\infty)$ , $t\in[0,T]$ , $s\in[t,T]$ , $L\in[0,\infty)$ , let $\left\|\cdot\right\|\colon\mathbb{R}^{d}\to[0,\infty)$ be the Euclidean norm on $\mathbb{R}^{d}$ , let ${\left|\kern-1.07639pt\left|\kern-1.07639pt\left|\cdot\right|\kern-1.07639pt\right|\kern-1.07639pt\right|}\colon\mathbb{R}^{d\times m}\to[0,\infty)$ be the Frobenius norm on $\mathbb{R}^{d\times m}$ , let $(\Omega,\mathcal{F},\mathbb{P},(\mathbb{F}_{t})_{t\in[0,T]})$ be a filtered probability space which satisfies the usual conditions, let $W\colon[0,T]\times\Omega\to\mathbb{R}^{m}$ be a standard $(\Omega,\mathcal{F},\mathbb{P},(\mathbb{F}_{t})_{t\in[0,T]})$ -Brownian motion, let $\zeta\colon\Omega\to\mathbb{R}^{d}$ be an $\mathbb{F}_{t}$ / $\mathcal{B}(\mathbb{R}^{d})$ -measurable function with $\mathbb{E}\!\left[\left\|\zeta\right\|^{2}\right]<\infty$ , let $\mu\colon[0,T]\times\mathbb{R}^{d}\to\mathbb{R}^{d}$ , $\sigma\colon[0,T]\times\mathbb{R}^{d}\to\mathbb{R}^{d\times m}$ be functions which satisfy for all $r,h\in[0,T]$ , $x,y\in\mathbb{R}^{d}$ that

[TABLE]

let $X\colon[t,s]\times\Omega\to\mathbb{R}^{d}$ be an $(\mathbb{F}_{r})_{r\in[t,s]}$ / $\mathcal{B}(\mathbb{R}^{d})$ -adapted stochastic process with continuous sample paths which satisfies that $\mathbb{E}\!\left[\left\|X_{t}\right\|^{2}\right]<\infty$ and which satisfies that for all $r\in[t,s]$ it holds $\mathbb{P}$ -a.s. that

[TABLE]

let $r_{0},r_{1},\ldots,r_{N}\in[0,T]$ satisfy that

[TABLE]

and let $\mathcal{X}\colon\{0,1,\ldots,N\}\times\Omega\to\mathbb{R}^{d}$ be the stochastic process which satisfies for all $n\in\{1,2,\ldots,N\}$ that

[TABLE]

Then it holds that

[TABLE]

Proof of Corollary 2.12.

Throughout this proof assume w.l.o.g. that $s>t$ . Observe that Proposition 2.11 (with $d=d$ , $m=m$ , $N=N$ , $T=s-t$ , $L=L$ , $(\Omega,\mathcal{F},\mathbb{P},(\mathbb{F}_{r})_{r\in[0,T]})=(\Omega,\mathcal{F},\mathbb{P},(\mathbb{F}_{t+r})_{r\in[0,s-t]})$ , $(W_{r})_{r\in[0,T]}=(W_{t+r}-W_{t})_{r\in[0,s-t]}$ , $\zeta=\zeta$ , $(\mu(r,x))_{r\in[0,T],x\in\mathbb{R}^{d}}=(\mu(t+r,x))_{r\in[0,s-t],x\in\mathbb{R}^{d}}$ , $(\sigma(r,x))_{r\in[0,T],x\in\mathbb{R}^{d}}=(\sigma(t+r,x))_{r\in[0,s-t],x\in\mathbb{R}^{d}}$ , $(X_{r})_{r\in[0,T]}=(X_{t+r})_{r\in[0,s-t]}$ , $(t_{n})_{n\in\{0,1,\ldots,N\}}=(r_{n}-t)_{n\in\{0,1,\ldots,N\}}$ , $(\mathcal{X}_{n})_{n\in\{0,1,\ldots,N\}}=(\mathcal{X}_{n})_{n\in\{0,1,\ldots,N\}}$ in the notation of Proposition 2.11) establishes that

[TABLE]

This implies (98). The proof of Corollary 2.12 is thus completed. ∎

2.5 On identically distributed random variables

The next elementary and well-known result, Lemma 2.13 below, provides a sufficient condition for two random variables to have the same distribution.

Lemma 2.13.

Let $(\Omega,\mathcal{F},\mathbb{P})$ be a probability space, let $(E,d)$ be a metric space, let $X,Y\colon\Omega\to E$ be random variables which satisfy that for all globally bounded and Lipschitz continuous functions $g\colon E\to\mathbb{R}$ it holds that

[TABLE]

Then it holds that $X$ and $Y$ are identically distributed random variables.

Proof of Lemma 2.13.

Throughout this proof for every $n\in\mathbb{N}$ let $h_{n}\colon[0,\infty)\to[0,1]$ be the function which satisfies for all $r\in[0,\infty)$ that

[TABLE]

for every closed and non-empty set $A\subseteq E$ let $D_{A}\colon E\to[0,\infty)$ be the function which satisfies for all $e\in E$ that

[TABLE]

and for every $n\in\mathbb{N}$ and every closed and non-empty set $A\subseteq E$ let $f_{A,n}\colon E\to[0,1]$ be the function which satisfies for all $e\in E$ that

[TABLE]

Note that the triangle inequality assures that for all closed and non-empty sets $A\subseteq E$ and all $e_{1},e_{2}\in E$ , $a\in A$ , $\varepsilon\in(0,\infty)$ with $D_{A}(e_{1})\geq D_{A}(e_{2})$ and $d(e_{2},a)\leq D_{A}(e_{2})+\varepsilon$ it holds that

[TABLE]

The fact that for all closed and non-empty sets $A\subseteq E$ and all $e\in E$ , $\varepsilon\in(0,\infty)$ there exists $a\in A$ such that $d(e,a)\leq D_{A}(e)+\varepsilon$ hence assures that for all closed and non-empty sets $A\subseteq E$ and all $e_{1},e_{2}\in E$ it holds that

[TABLE]

Moreover, note that for all $n\in\mathbb{N}$ , $r_{1},r_{2}\in[0,\infty)$ with $r_{1}\leq r_{2}$ it holds that

[TABLE]

Combining this with (105) establishes that for all closed and non-empty sets $A\subseteq E$ and all $n\in\mathbb{N}$ , $e_{1},e_{2}\in E$ it holds that

[TABLE]

This demonstrates that for every closed and non-empty set $A\subseteq E$ and every $n\in\mathbb{N}$ it holds that $f_{A,n}\colon E\to[0,1]$ is a globally bounded and Lipschitz continuous function. Next observe that the fact that for all closed and non-empty sets $A\subseteq E$ and all $e\in A$ it holds that $D_{A}(e)=0$ assures that for all closed and non-empty sets $A\subseteq E$ and all $n\in\mathbb{N}$ , $e\in A$ it holds that

[TABLE]

Moreover, note that the fact that for all closed and non-empty sets $A\subseteq E$ and all $e\in E\setminus A$ there exists $n\in\mathbb{N}$ such that $D_{A}(e)>\frac{1}{n}$ and the fact that for all $n\in\mathbb{N}$ it holds that $h_{n}$ is a non-increasing function assure that for all closed and non-empty sets $A\subseteq E$ and all $e\in E\setminus A$ there exists $n\in\mathbb{N}$ such that for all $m\in\{n,n+1,\ldots\}$ it holds that

[TABLE]

Combining this and (108) establishes that for all closed and non-empty sets $A\subseteq E$ and all $e\in E$ it holds that

[TABLE]

The theorem of dominated convergence, the fact that for all closed and non-empty sets $A\subseteq E$ and all $n\in\mathbb{N}$ it holds that $f_{A,n}\colon E\to[0,1]$ is a globally bounded and Lipschitz continuous function, and (100) therefore imply that for all closed and non-empty sets $A\subseteq E$ it holds that

[TABLE]

The fact that $\mathcal{B}(E)=\mathfrak{S}(\{A\subseteq E\colon A\text{ is closed}\})$ , the fact that $\{A\subseteq E\colon A\text{ is closed}\}$ is closed under intersections, and the uniqueness theorem for measures (see, e.g., Klenke [66, Lemma 1.42]) hence assure that for all $B\in\mathcal{B}(E)$ it holds that

[TABLE]

The proof of Lemma 2.13 is thus completed. ∎
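The approximation of indicator functions of closed sets by bounded Lipschitz continuous functions, which drives the proof above, can be illustrated with a short Python sketch. The displayed definitions of $h_{n}$ , $D_{A}$ , and $f_{A,n}$ are not reproduced here; the sketch assumes the choices $h_{n}(r)=\max\{1-nr,0\}$ , $D_{A}(e)=\inf_{a\in A}d(e,a)$ , and $f_{A,n}(e)=h_{n}(D_{A}(e))$ , which are consistent with the properties used in the proof. For simplicity the set $A$ is taken to be finite.

```python
def h(n, r):
    """Assumed cut-off function: equal to 1 at r = 0, non-increasing, and 0 for r >= 1/n."""
    return max(1.0 - n * r, 0.0)

def dist_to_set(e, A, d):
    """Distance from the point e to the finite (hence closed) set A with respect to the metric d."""
    return min(d(e, a) for a in A)

def f_A(n, e, A, d):
    """Bounded Lipschitz continuous approximation of the indicator function of A."""
    return h(n, dist_to_set(e, A, d))

metric = lambda a, b: abs(a - b)
A = [0.0, 1.0]
values = [f_A(n, 0.3, A, metric) for n in (1, 2, 4, 8)]  # decreases to 0 because 0.3 is not in A
```

On points of $A$ the functions are identically $1$ , and at every point outside of $A$ they vanish for all sufficiently large $n$ , which is the pointwise convergence to the indicator function that the dominated convergence argument requires.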

2.6 On random evaluations of random fields

This subsection collects elementary and well-known results about random variables originating from evaluations of random fields at random indices.

Lemma 2.14.

Let $(\Omega,\mathcal{F})$ , $(S,\mathcal{S})$ , $(E,\mathcal{E})$ be measurable spaces, let $U=(U(s))_{s\in S}=(U(s,\omega))_{s\in S,\omega\in\Omega}\colon S\times\Omega\to E$ be an $(\mathcal{S}\otimes\mathcal{F})/\mathcal{E}$ -measurable function, and let $X\colon\Omega\to S$ be an $\mathcal{F}/\mathcal{S}$ -measurable function. Then it holds that the function $U(X)=(U(X(\omega),\omega))_{\omega\in\Omega}\colon\Omega\to E$ is $\mathcal{F}/\mathcal{E}$ -measurable.

Proof of Lemma 2.14.

Throughout this proof let $\mathcal{X}\colon\Omega\to S\times\Omega$ be the function which satisfies for all $\omega\in\Omega$ that

[TABLE]

Observe that the hypothesis that $X\colon\Omega\to S$ is an $\mathcal{F}/\mathcal{S}$ -measurable function assures that $\mathcal{X}\colon\Omega\to S\times\Omega$ is an $\mathcal{F}/(\mathcal{S}\otimes\mathcal{F})$ -measurable function. Combining this with the fact that $U\colon S\times\Omega\to E$ is an $(\mathcal{S}\otimes\mathcal{F})/\mathcal{E}$ -measurable function demonstrates that

[TABLE]

is an $\mathcal{F}/\mathcal{E}$ -measurable function. The proof of Lemma 2.14 is thus completed. ∎

Proofs of the next two elementary and well-known results (see Lemma 2.15 and Lemma 2.16 below) can be found, e.g., in [59, Lemma 2.3 and Lemma 2.4].

Lemma 2.15.

Let $(\Omega,\mathcal{F},\mathbb{P})$ be a probability space, let $(S,\delta)$ be a separable metric space, let $U=(U(s))_{s\in S}\colon S\times\Omega\to[0,\infty)$ be a continuous random field, let $X\colon\Omega\to S$ be a random variable, and assume that $U$ and $X$ are independent. Then it holds that

[TABLE]

Lemma 2.16.

Let $(\Omega,\mathcal{F},\mathbb{P})$ be a probability space, let $(S,\delta)$ be a separable metric space, let $U=(U(s))_{s\in S}\colon S\times\Omega\to\mathbb{R}$ be a continuous random field, let $X\colon\Omega\to S$ be a random variable, assume that $U$ and $X$ are independent, and assume that $\int_{S}\mathbb{E}\!\left[|U(s)|\right](X(\mathbb{P})_{\mathcal{B}(S)})(ds)<\infty$ . Then it holds that $(X(\mathbb{P})_{\mathcal{B}(S)})(\{s\in S\colon\mathbb{E}\!\left[|U(s)|\right]=\infty\})=0$ , $\mathbb{E}\!\left[|U(X)|\right]<\infty$ , and

[TABLE]

2.7 Brownian motions and right-continuous filtrations

The next result, Lemma 2.17 below, states that a Brownian motion with respect to a filtration is also a Brownian motion with respect to the smallest right-continuous filtration containing the original filtration (cf. (117)). Lemma 2.17 and its proof are very similar to Prévôt & Röckner [86, Proposition 2.1.13].

Lemma 2.17.

Let $m\in\mathbb{N}$ , $T\in(0,\infty)$ , let $(\Omega,\mathcal{F},\mathbb{P},(\mathbb{F}_{t})_{t\in[0,T]})$ be a filtered probability space, let $W\colon[0,T]\times\Omega\to\mathbb{R}^{m}$ be a standard $(\Omega,\mathcal{F},\mathbb{P},(\mathbb{F}_{t})_{t\in[0,T]})$ -Brownian motion, and let $\mathbb{H}_{t}\subseteq\mathcal{F}$ , $t\in[0,T]$ , satisfy for all $t\in[0,T]$ that

[TABLE]

Then it holds that $W$ is a standard $(\Omega,\mathcal{F},\mathbb{P},(\mathbb{H}_{t})_{t\in[0,T]})$ -Brownian motion.

Proof of Lemma 2.17.

Throughout this proof let $\left\|\cdot\right\|\colon\mathbb{R}^{d}\to[0,\infty)$ be the Euclidean norm on $\mathbb{R}^{d}$ , for every $n\in\mathbb{N}$ let $h_{n}\colon[0,\infty)\to[0,1]$ be the function which satisfies for all $r\in[0,\infty)$ that

[TABLE]

for every closed and non-empty set $A\subseteq\mathbb{R}^{d}$ let $D_{A}\colon\mathbb{R}^{d}\to[0,\infty)$ be the function which satisfies for all $x\in\mathbb{R}^{d}$ that

[TABLE]

and for every $n\in\mathbb{N}$ and every closed and non-empty set $A\subseteq\mathbb{R}^{d}$ let $f_{A,n}\colon\mathbb{R}^{d}\to[0,1]$ be the function which satisfies for all $x\in\mathbb{R}^{d}$ that

[TABLE]

Observe that the fact that $W$ has continuous sample paths, the fact that for all $t\in[0,T)$ , $s\in(t,T]$ , $k\in\mathbb{N}$ it holds that $W_{s}-W_{\min\{t+\nicefrac{{1}}{{k}},s\}}$ and $\mathbb{H}_{t}$ are independent, Klenke [66, Theorem 5.4], and the theorem of dominated convergence assure that for all $t\in[0,T)$ , $s\in(t,T]$ , $B\in\mathbb{H}_{t}$ and all globally bounded and continuous functions $g\colon\mathbb{R}^{d}\to\mathbb{R}$ it holds that

[TABLE]

Next note that the fact that for all closed and non-empty sets $A\subseteq\mathbb{R}^{d}$ and all $x\in\mathbb{R}^{d}$ it holds that $D_{A}(x)=0\Leftrightarrow x\in A$ assures that for all closed and non-empty sets $A\subseteq\mathbb{R}^{d}$ and all $x\in\mathbb{R}^{d}$ it holds that

[TABLE]

Moreover, note that the fact that for every $n\in\mathbb{N}$ it holds that $h_{n}\colon[0,\infty)\to[0,1]$ is a continuous function and the fact that for every closed and non-empty set $A\subseteq\mathbb{R}^{d}$ it holds that $D_{A}\colon\mathbb{R}^{d}\to[0,\infty)$ is a continuous function assure that for every $n\in\mathbb{N}$ and every closed and non-empty set $A\subseteq\mathbb{R}^{d}$ it holds that $f_{A,n}\colon\mathbb{R}^{d}\to[0,1]$ is a continuous function. Combining this, (121), (122), and the theorem of dominated convergence shows that for all $t\in[0,T)$ , $s\in(t,T]$ , $B\in\mathbb{H}_{t}$ and all closed and non-empty sets $A\subseteq\mathbb{R}^{d}$ it holds that

[TABLE]

This proves that for all $t\in[0,T)$ , $s\in(t,T]$ , $B\in\mathbb{H}_{t}$ it holds that $(\mathbbm{1}_{B})^{-1}(\{\},\{0\},\{1\},\{0,1\})$ and $(W_{s}-W_{t})^{-1}(\{A\subseteq\mathbb{R}^{d}\colon A\text{ is a closed set}\})$ are independent. The fact that $\{A\subseteq\mathbb{R}^{d}\colon A\text{ is a closed set}\}$ is closed under intersections, the fact that $\mathfrak{S}(\{A\subseteq\mathbb{R}^{d}\colon A\text{ is a closed set}\})=\mathcal{B}(\mathbb{R}^{d})$ , and Klenke [66, Theorem 2.16] hence assure that for all $t\in[0,T)$ , $s\in(t,T]$ , $B\in\mathbb{H}_{t}$ it holds that $W_{s}-W_{t}$ and $B$ are independent. This implies that for all $t\in[0,T]$ , $s\in[t,T]$ it holds that $W_{s}-W_{t}$ and $\mathbb{H}_{t}$ are independent. Combining this with the hypothesis that $W$ is a Brownian motion and the fact that $W\colon[0,T]\times\Omega\to\mathbb{R}^{m}$ is an $(\mathbb{H}_{t})_{t\in[0,T]}$ / $\mathcal{B}(\mathbb{R}^{m})$ -adapted stochastic process establishes that $W\colon[0,T]\times\Omega\to\mathbb{R}^{m}$ is a standard $(\Omega,\mathcal{F},\mathbb{P},(\mathbb{H}_{t})_{t\in[0,T]})$ -Brownian motion. The proof of Lemma 2.17 is thus completed. ∎

2.8 On a distributional flow property for solutions of SDEs

In this subsection we prove a distributional flow property for solutions of SDEs in Lemma 2.19 below. The idea for the proof of Lemma 2.19 is based on the observation that if we replace solution processes of SDEs by Euler-Maruyama approximations the flow-type condition trivially holds (cf. the argument below (150) in the proof of Lemma 2.19 below). To prove Lemma 2.19 below we also need, besides several auxiliary results of the previous subsections, the following well-known statement (see Lemma 2.18 below).

Lemma 2.18.

Let $d,m\in\mathbb{N}$ , $T\in(0,\infty)$ , $t\in[0,T]$ , $s\in[t,T]$ , let $(\Omega,\mathcal{F},\mathbb{P},(\mathbb{F}_{t})_{t\in[0,T]})$ be a filtered probability space which satisfies the usual conditions, let $W\colon[0,T]\times\Omega\to\mathbb{R}^{m}$ be a standard $(\Omega,\mathcal{F},\mathbb{P},(\mathbb{F}_{r})_{r\in[0,T]})$ -Brownian motion, let $\mu\colon[0,T]\times\mathbb{R}^{d}\to\mathbb{R}^{d}$ and $\sigma\colon[0,T]\times\mathbb{R}^{d}\to\mathbb{R}^{d\times m}$ be globally Lipschitz continuous functions, let $X=(X_{r}(x))_{r\in[t,s],x\in\mathbb{R}^{d}}\colon[t,s]\times\mathbb{R}^{d}\times\Omega\to\mathbb{R}^{d}$ be a continuous random field which satisfies for every $x\in\mathbb{R}^{d}$ that $(X_{r}(x))_{r\in[t,s]}\colon[t,s]\times\Omega\to\mathbb{R}^{d}$ is an $(\mathbb{F}_{r})_{r\in[t,s]}$ / $\mathcal{B}(\mathbb{R}^{d})$ -adapted stochastic process and which satisfies that for all $r\in[t,s]$ , $x\in\mathbb{R}^{d}$ it holds $\mathbb{P}$ -a.s. that

[TABLE]

and let $\xi\colon\Omega\to\mathbb{R}^{d}$ be an $\mathbb{F}_{t}$ / $\mathcal{B}(\mathbb{R}^{d})$ -measurable function with $\mathbb{E}\!\left[\left\|\xi\right\|^{2}\right]<\infty$ . Then for all $r\in[t,s]$ it holds $\mathbb{P}$ -a.s. that

[TABLE]

Proof of Lemma 2.18.

Throughout this proof assume w.l.o.g. that $s>t$ , let $(u^{N,r}_{n})_{n\in\{0,1,2,\ldots,N\},N\in\mathbb{N},r\in(t,s]}\subseteq[t,s]$ satisfy for all $N\in\mathbb{N}$ , $n\in\{0,1,2,\ldots,N\}$ , $r\in(t,s]$ that $u^{N,r}_{n}=t+\frac{n(r-t)}{N}$ , for every $N\in\mathbb{N}$ , $r\in(t,s]$ let $\mathcal{X}^{N,r}=(\mathcal{X}^{N,r}_{n}(x))_{n\in\{0,1,2,\ldots,N\},x\in\mathbb{R}^{d}}\colon\{0,1,2,\ldots,N\}\times\mathbb{R}^{d}\times\Omega\to\mathbb{R}^{d}$ be the continuous random field which satisfies for all $n\in\{1,2,\ldots,N\}$ , $x\in\mathbb{R}^{d}$ that $\mathcal{X}^{N,r}_{0}(x)=x$ and

[TABLE]

let $\left\|\cdot\right\|\colon\mathbb{R}^{d}\to[0,\infty)$ be the Euclidean norm on $\mathbb{R}^{d}$ , let ${\left|\kern-1.07639pt\left|\kern-1.07639pt\left|\cdot\right|\kern-1.07639pt\right|\kern-1.07639pt\right|}\colon\mathbb{R}^{d\times m}\to[0,\infty)$ be the Frobenius norm on $\mathbb{R}^{d\times m}$ , and let $L\in[0,\infty)$ satisfy for all $r,h\in[0,T]$ , $x,y\in\mathbb{R}^{d}$ that

[TABLE]

Note that (124), (126), (127), Corollary 2.12 (with $d=d$ , $m=m$ , $N=N$ , $T=T$ , $t=t$ , $s=r$ , $L=L$ , $(\Omega,\mathcal{F},\mathbb{P},(\mathbb{F}_{h})_{h\in[0,T]})=(\Omega,\mathcal{F},\mathbb{P},(\mathbb{F}_{h})_{h\in[0,T]})$ , $W=W$ , $\zeta=x$ , $\mu=\mu$ , $\sigma=\sigma$ , $(X_{h})_{h\in[t,s]}=(X_{h})_{h\in[t,r]}$ , $(r_{n})_{n\in\{0,1,\ldots,N\}}=(u^{N,r}_{n})_{n\in\{0,1,\ldots,N\}}$ , $(\mathcal{X}_{n})_{n\in\{0,1,\ldots,N\}}=(\mathcal{X}^{N,r}_{n}(x))_{n\in\{0,1,\ldots,N\}}$ for $N\in\mathbb{N}$ , $x\in\mathbb{R}^{d}$ , $r\in(t,s]$ in the notation of Corollary 2.12), and Lemma 2.10 (with $d=d$ , $m=m$ , $T=r-t$ , $\xi=x$ , $L=L$ , $(\Omega,\mathcal{F},\mathbb{P},(\mathbb{F}_{h})_{h\in[0,T]})=(\Omega,\mathcal{F},\mathbb{P},(\mathbb{F}_{t+h})_{h\in[0,r-t]})$ , $(W_{h})_{h\in[0,T]}=(W_{t+h}-W_{t})_{h\in[0,r-t]}$ , $(\mu(h,x))_{h\in[0,T],x\in\mathbb{R}^{d}}=(\mu(t+h,x))_{h\in[0,r-t],x\in\mathbb{R}^{d}}$ , $(\sigma(h,x))_{h\in[0,T],x\in\mathbb{R}^{d}}=(\sigma(t+h,x))_{h\in[0,r-t],x\in\mathbb{R}^{d}}$ , $(X_{h})_{h\in[0,T]}=(X_{t+h})_{h\in[0,r-t]}$ for $x\in\mathbb{R}^{d}$ , $r\in(t,s]$ in the notation of Lemma 2.10) assure that for all $x\in\mathbb{R}^{d}$ , $N\in\mathbb{N}$ , $r\in(t,s]$ it holds that

[TABLE]

This ensures that for all $r\in[t,s]$ , $x\in\mathbb{R}^{d}$ it holds that $\limsup_{N\to\infty}\mathbb{E}[\|X_{r}(x)-\mathcal{X}^{N,r}_{N}(x)\|^{2}]=0$ . This and the fact that for all $r\in[t,s]$ , $x\in\mathbb{R}^{d}$ , $N\in\mathbb{N}$ it holds that $\mathcal{X}^{N,r}_{N}(x)\colon\Omega\to\mathbb{R}^{d}$ is $\mathfrak{S}(W_{h}-W_{t}\colon h\in[t,r])$ / $\mathcal{B}(\mathbb{R}^{d})$ -measurable imply that for all $r\in[t,s]$ , $x\in\mathbb{R}^{d}$ it holds that $X_{r}(x)\colon\Omega\to\mathbb{R}^{d}$ is $\mathfrak{S}(\mathfrak{S}(W_{h}-W_{t}\colon h\in[t,r])\cup\{A\in\mathcal{F}\colon\mathbb{P}(A)=0\})$ / $\mathcal{B}(\mathbb{R}^{d})$ -measurable. Combining this with the fact that $\xi\colon\Omega\to\mathbb{R}^{d}$ is an $\mathbb{F}_{t}$ / $\mathcal{B}(\mathbb{R}^{d})$ -measurable function and the fact that $W\colon[0,T]\times\Omega\to\mathbb{R}^{m}$ is a standard $(\Omega,\mathcal{F},\mathbb{P},(\mathbb{F}_{r})_{r\in[0,T]})$ -Brownian motion demonstrates that for all $r\in[t,s]$ , $N\in\mathbb{N}$ it holds that $(X_{r}(x)-\mathcal{X}^{N,r}_{N}(x))_{x\in\mathbb{R}^{d}}$ and $\xi$ are independent. Lemma 2.15 and (128) hence assure that for all $N\in\mathbb{N}$ , $r\in(t,s]$ it holds that

[TABLE]

Next observe that the hypothesis that $\mu$ and $\sigma$ are globally Lipschitz continuous functions, the hypothesis that $\mathbb{E}\!\left[\left\|\xi\right\|^{2}\right]<\infty$ , and the existence theorem for the solutions of SDEs (see, e.g., Karatzas & Shreve [64, Proposition 5.2.9]) prove that there exists an $(\mathbb{F}_{r})_{r\in[t,s]}$ / $\mathcal{B}(\mathbb{R}^{d})$ -adapted stochastic process $Y\colon[t,s]\times\Omega\to\mathbb{R}^{d}$ with continuous sample paths which satisfies that for all $r\in[t,s]$ it holds $\mathbb{P}$ -a.s. that

[TABLE]

Moreover, observe that (126) ensures that for all $N\in\mathbb{N}$ , $n\in\{1,2,\ldots,N\}$ , $r\in(t,s]$ and all functions $\zeta\colon\Omega\to\mathbb{R}^{d}$ it holds that $\mathcal{X}^{N,r}_{0}(\zeta)=\zeta$ and

[TABLE]

Combining this, (127), the fact that $\mathbb{E}\!\left[\left\|Y_{t}\right\|^{2}\right]=\mathbb{E}\!\left[\left\|\xi\right\|^{2}\right]<\infty$ , and (130) with Corollary 2.12 (with $d=d$ , $m=m$ , $N=N$ , $T=T$ , $t=t$ , $s=r$ , $L=L$ , $(\Omega,\mathcal{F},\mathbb{P},(\mathbb{F}_{h})_{h\in[0,T]})=(\Omega,\mathcal{F},\mathbb{P},(\mathbb{F}_{h})_{h\in[0,T]})$ , $W=W$ , $\zeta=\xi$ , $\mu=\mu$ , $\sigma=\sigma$ , $(X_{h})_{h\in[t,s]}=(Y_{h})_{h\in[t,r]}$ , $(r_{n})_{n\in\{0,1,\ldots,N\}}=(u^{N,r}_{n})_{n\in\{0,1,\ldots,N\}}$ , $(\mathcal{X}_{n})_{n\in\{0,1,\ldots,N\}}=(\mathcal{X}^{N,r}_{n}(\xi))_{n\in\{0,1,\ldots,N\}}$ for $N\in\mathbb{N}$ , $r\in(t,s]$ in the notation of Corollary 2.12) demonstrates that for all $N\in\mathbb{N}$ , $r\in(t,s]$ it holds that

[TABLE]

The triangle inequality and (129) hence show that for all $r\in(t,s]$ it holds that

[TABLE]

Combining this with the fact that $(X_{r}(\xi))_{r\in[t,s]}$ and $(Y_{r})_{r\in[t,s]}$ are continuous random fields demonstrates that

[TABLE]

This and (130) prove that for all $r\in[t,s]$ it holds $\mathbb{P}$ -a.s. that

[TABLE]

The proof of Lemma 2.18 is thus completed. ∎

Lemma 2.19.

Let $d,m\in\mathbb{N}$ , $T\in(0,\infty)$ , let $\mu\colon[0,T]\times\mathbb{R}^{d}\to\mathbb{R}^{d}$ and $\sigma\colon[0,T]\times\mathbb{R}^{d}\to\mathbb{R}^{d\times m}$ be globally Lipschitz continuous functions, let $(\Omega,\mathcal{F},\mathbb{P})$ be a complete probability space, let $(\mathbb{F}^{1}_{t})_{t\in[0,T]}$ and $(\mathbb{F}^{2}_{t})_{t\in[0,T]}$ be filtrations on $(\Omega,\mathcal{F},\mathbb{P})$ which satisfy the usual conditions, assume that $\mathbb{F}^{1}_{T}$ and $\mathbb{F}^{2}_{T}$ are independent, for every $i\in\{1,2\}$ let $W^{i}\colon[0,T]\times\Omega\to\mathbb{R}^{m}$ be a standard $(\Omega,\mathcal{F},\mathbb{P},(\mathbb{F}^{i}_{t})_{t\in[0,T]})$ -Brownian motion, and for every $i\in\{1,2\}$ let $X^{i}=(X^{i}_{t,s}(x))_{s\in[t,T],t\in[0,T],x\in\mathbb{R}^{d}}\colon\{(t,s)\in[0,T]^{2}\colon t\leq s\}\times\mathbb{R}^{d}\times\Omega\to\mathbb{R}^{d}$ be a continuous random field which satisfies for every $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ that $(X^{i}_{t,s}(x))_{s\in[t,T]}\colon[t,T]\times\Omega\to\mathbb{R}^{d}$ is an $(\mathbb{F}^{i}_{s})_{s\in[t,T]}$ / $\mathcal{B}(\mathbb{R}^{d})$ -adapted stochastic process and which satisfies that for all $t\in[0,T]$ , $s\in[t,T]$ , $x\in\mathbb{R}^{d}$ it holds $\mathbb{P}$ -a.s. that

[TABLE]

Then it holds for all $r,s,t\in[0,T]$ , $x\in\mathbb{R}^{d}$ , $B\in\mathcal{B}(\mathbb{R}^{d})$ with $t\leq s\leq r$ that $\mathbb{P}(X^{1}_{t,t}(x)=x)=1$ and

[TABLE]

Proof of Lemma 2.19.

Throughout this proof let $r,s,t\in[0,T]$ , $x\in\mathbb{R}^{d}$ satisfy that $t\leq s\leq r$ , let $(u^{N}_{n})_{n\in\{0,1,2,\ldots,N\},N\in\mathbb{N}}\subseteq[t,s]$ , $(v^{N}_{n})_{n\in\{0,1,2,\ldots,N\},N\in\mathbb{N}}\subseteq[s,r]$ satisfy for all $N\in\mathbb{N}$ , $n\in\{0,1,2,\ldots,N\}$ that $u^{N}_{n}=t+\frac{n(s-t)}{N}$ and $v^{N}_{n}=s+\frac{n(r-s)}{N}$ , for every $N\in\mathbb{N}$ let $\mathcal{X}^{N}\colon\{0,1,2,\ldots,2N\}\times\Omega\to\mathbb{R}^{d}$ and $\mathcal{Y}^{N},\mathcal{Z}^{N}\colon\{0,1,2,\ldots,N\}\times\Omega\to\mathbb{R}^{d}$ be the stochastic processes which satisfy for all $n\in\{1,2,\ldots,N\}$ that

[TABLE]

let $\mathbb{G}_{h}\subseteq\mathcal{F}$ , $h\in[0,T]$ , and $\mathbb{H}_{h}\subseteq\mathcal{F}$ , $h\in[0,T]$ , be the sigma-algebras which satisfy for all $h\in[0,T]$ that

[TABLE]

let $\left<\cdot,\cdot\right>\colon\mathbb{R}^{d}\times\mathbb{R}^{d}\to\mathbb{R}$ be the Euclidean scalar product on $\mathbb{R}^{d}$ , let $\left\|\cdot\right\|\colon\mathbb{R}^{d}\to[0,\infty)$ be the Euclidean norm on $\mathbb{R}^{d}$ , and let ${\left|\kern-1.07639pt\left|\kern-1.07639pt\left|\cdot\right|\kern-1.07639pt\right|\kern-1.07639pt\right|}\colon\mathbb{R}^{d\times m}\to[0,\infty)$ be the Frobenius norm on $\mathbb{R}^{d\times m}$ . Note that the hypothesis that $(\mathbb{F}^{1}_{t})_{t\in[0,T]}$ and $(\mathbb{F}^{2}_{t})_{t\in[0,T]}$ are filtrations on $(\Omega,\mathcal{F},\mathbb{P})$ which satisfy the usual conditions and (142) imply that $(\mathbb{H}_{t})_{t\in[0,T]}$ is a filtration on $(\Omega,\mathcal{F},\mathbb{P})$ which satisfies the usual conditions. Moreover, observe that (136) assures that

[TABLE]

Furthermore, note that the hypothesis that $\mu$ and $\sigma$ are globally Lipschitz continuous, (136), (138), (139), (140), and Corollary 2.12 demonstrate that there exists a real number $C\in(0,\infty)$ which satisfies that for all $N\in\mathbb{N}$ it holds that

[TABLE]

This implies that

[TABLE]

Moreover, observe that the hypothesis that $\mu$ and $\sigma$ are globally Lipschitz continuous implies that

[TABLE]

Lemma 2.6 therefore demonstrates that

[TABLE]

Next note that the fact that for all $h\in[0,T]$ , $l\in[h,T]$ it holds that $W^{1}_{l}-W^{1}_{h}$ , $\mathbb{F}^{1}_{h}$ , and $\mathbb{F}^{2}_{h}$ are independent assures that for all $h\in[0,T]$ , $l\in[h,T]$ it holds that $W^{1}_{l}-W^{1}_{h}$ and $\mathbb{G}_{h}$ are independent. This, the fact that $W^{1}\colon[0,T]\times\Omega\to\mathbb{R}^{m}$ is a Brownian motion, and the fact that $W^{1}\colon[0,T]\times\Omega\to\mathbb{R}^{m}$ is an $(\mathbb{G}_{h})_{h\in[0,T]}$ / $\mathcal{B}(\mathbb{R}^{m})$ -adapted stochastic process imply that $W^{1}\colon[0,T]\times\Omega\to\mathbb{R}^{m}$ is a standard $(\Omega,\mathcal{F},\mathbb{P},(\mathbb{G}_{h})_{h\in[0,T]})$ -Brownian motion. Lemma 2.17 and (142) hence ensure that $W^{1}\colon[0,T]\times\Omega\to\mathbb{R}^{m}$ is a standard $(\Omega,\mathcal{F},\mathbb{P},(\mathbb{H}_{h})_{h\in[0,T]})$ -Brownian motion. Combining this, the fact that $(\Omega,\mathcal{F},\mathbb{P},(\mathbb{H}_{h})_{h\in[0,T]})$ is a filtered probability space which satisfies the usual conditions, the fact that for all $y\in\mathbb{R}^{d}$ it holds that $(X^{1}_{s,h}(y))_{h\in[s,r]}\colon[s,r]\times\Omega\to\mathbb{R}^{d}$ is an $(\mathbb{H}_{h})_{h\in[s,r]}$ / $\mathcal{B}(\mathbb{R}^{d})$ -adapted stochastic process, (136), the fact that $X^{2}_{t,s}(x)\colon\Omega\to\mathbb{R}^{d}$ is $\mathbb{H}_{s}$ / $\mathcal{B}(\mathbb{R}^{d})$ -measurable, and (147) with Lemma 2.18 (with $d=d$ , $m=m$ , $T=T$ , $t=s$ , $s=r$ , $(\Omega,\mathcal{F},\mathbb{P},(\mathbb{F}_{h})_{h\in[0,T]})=(\Omega,\mathcal{F},\mathbb{P},(\mathbb{H}_{h})_{h\in[0,T]})$ , $W=W^{1}$ , $\mu=\mu$ , $\sigma=\sigma$ , $(X_{h}(y))_{h\in[t,s],y\in\mathbb{R}^{d}}=(X^{1}_{s,h}(y))_{h\in[s,r],y\in\mathbb{R}^{d}}$ , $\xi=X^{2}_{t,s}(x)$ in the notation of Lemma 2.18) proves that for all $h\in[s,r]$ it holds $\mathbb{P}$ -a.s. that

[TABLE]

The fact that $(\Omega,\mathcal{F},\mathbb{P},(\mathbb{H}_{h})_{h\in[0,T]})$ is a filtered probability space which satisfies the usual conditions, the fact that $W^{1}\colon[0,T]\times\Omega\to\mathbb{R}^{m}$ is a standard $(\Omega,\mathcal{F},\mathbb{P},(\mathbb{H}_{h})_{h\in[0,T]})$ -Brownian motion, the fact that $\mathcal{Y}^{N}_{N}\colon\Omega\to\mathbb{R}^{d}$ is $\mathbb{H}_{s}$ / $\mathcal{B}(\mathbb{R}^{d})$ -measurable, the hypothesis that $\mu$ and $\sigma$ are globally Lipschitz continuous functions, the fact that $(X^{1}_{s,h}(X^{2}_{t,s}(x)))_{h\in[s,r]}\colon[s,r]\times\Omega\to\mathbb{R}^{d}$ is an $(\mathbb{H}_{h})_{h\in[s,r]}$ / $\mathcal{B}(\mathbb{R}^{d})$ -adapted stochastic process with continuous sample paths, (147), (141), and Corollary 2.12 (with $d=d$ , $m=m$ , $N=N$ , $T=T$ , $t=s$ , $s=r$ , $L=\sup_{h,l\in[0,T],y,z\in\mathbb{R}^{d}\colon(h,y)\neq(l,z)}\frac{\left\|\mu(h,y)-\mu(l,z)\right\|+{\left|\kern-0.75346pt\left|\kern-0.75346pt\left|\sigma(h,y)-\sigma(l,z)\right|\kern-0.75346pt\right|\kern-0.75346pt\right|}}{|h-l|+\left\|y-z\right\|}$ , $(\Omega,\mathcal{F},\mathbb{P},(\mathbb{F}_{h})_{h\in[0,T]})=(\Omega,\mathcal{F},\mathbb{P},(\mathbb{H}_{h})_{h\in[0,T]})$ , $W=W^{1}$ , $\zeta=\mathcal{Y}^{N}_{N}$ , $\mu=\mu$ , $\sigma=\sigma$ , $(X_{h})_{h\in[t,s]}=(X^{1}_{s,h}(X^{2}_{t,s}(x)))_{h\in[s,r]}$ , $(r_{n})_{n\in\{0,1,\ldots,N\}}=(v^{N}_{n})_{n\in\{0,1,\ldots,N\}}$ , $(\mathcal{X}_{n})_{n\in\{0,1,\ldots,N\}}=(\mathcal{Z}^{N}_{n})_{n\in\{0,1,\ldots,N\}}$ for $N\in\mathbb{N}$ in the notation of Corollary 2.12) hence demonstrate that there exists a real number $K\in(0,\infty)$ which satisfies that for all $N\in\mathbb{N}$ it holds that

[TABLE]

This and (144) imply that

[TABLE]

Furthermore, observe that (138)–(141) assure that for all $N\in\mathbb{N}$ it holds that $\mathcal{X}^{N}_{2N}$ and $\mathcal{Z}^{N}_{N}$ have the same distribution. This, (145), and (150) imply that for all globally bounded and Lipschitz continuous functions $g\colon\mathbb{R}^{d}\to\mathbb{R}$ it holds that

[TABLE]

Lemma 2.13 hence assures that $X^{1}_{s,r}(X^{2}_{t,s}(x))$ and $X^{1}_{t,r}(x)$ are identically distributed. Combining this with (143) completes the proof of Lemma 2.19. ∎
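The observation behind the proof of Lemma 2.19, namely that the flow-type property is immediate at the level of Euler-Maruyama approximations, can be checked empirically. The following is a minimal sketch under stated assumptions: a scalar SDE with the hypothetical coefficients $\mu(t,x)=0.05\,x$ and $\sigma(t,x)=0.2\,x$, the times $t=0$, $s=\frac{1}{2}$, $r=1$, and three independent NumPy generators standing in for independent Brownian drivers. It compares sample statistics of the two-stage approximation (one driver on $[t,s]$, an independent driver on $[s,r]$) with those of a single approximation on $[t,r]$ with $2N$ steps.

```python
import numpy as np


def euler_maruyama(x0, t0, t1, n_steps, mu, sigma, rng):
    """Euler-Maruyama terminal values on [t0, t1]; x0 holds one entry per path."""
    h = (t1 - t0) / n_steps
    x = np.array(x0, dtype=float)
    for k in range(n_steps):
        dw = np.sqrt(h) * rng.standard_normal(x.shape)  # Brownian increments
        x = x + mu(t0 + k * h, x) * h + sigma(t0 + k * h, x) * dw
    return x


def flow_check(n_paths=20000, n_steps=10, seed=0):
    """Return (mean, std) of the two-stage and of the one-stage approximation."""
    mu = lambda t, x: 0.05 * x      # hypothetical drift
    sigma = lambda t, x: 0.2 * x    # hypothetical diffusion
    t, s, r = 0.0, 0.5, 1.0
    x0 = np.ones(n_paths)
    rng_a = np.random.default_rng(seed)      # driver used on [s, r]
    rng_b = np.random.default_rng(seed + 1)  # independent driver on [t, s]
    rng_c = np.random.default_rng(seed + 2)  # driver for the one-stage run
    y = euler_maruyama(x0, t, s, n_steps, mu, sigma, rng_b)
    z = euler_maruyama(y, s, r, n_steps, mu, sigma, rng_a)
    w = euler_maruyama(x0, t, r, 2 * n_steps, mu, sigma, rng_c)
    return (z.mean(), z.std()), (w.mean(), w.std())
```

Both runs use the same step size, so the two terminal values have the same distribution and their sample means and standard deviations agree up to Monte Carlo error. This is only an empirical illustration and does not replace the limiting argument in the proof.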

3 Full history recursive multilevel Picard (MLP) approximation algorithms

In this section we present the proposed MLP scheme and perform a rigorous complexity analysis. First, we introduce our MLP scheme (cf. (156) in Subsection 3.1 below) as an approximation algorithm for a solution (cf. $u$ in Setting 3.1 in Subsection 3.1 below) of a certain type of stochastic fixed point equation (cf. (155) in Subsection 3.1 below) in Subsection 3.1. Subsequently, the goal of Subsections 3.2–3.4 is to obtain an estimate for the $L^{2}$ -error between the MLP scheme and the solution of the stochastic fixed point equation. This results in Proposition 3.15 and Corollary 3.16 in Subsection 3.4 below. In Subsection 3.5 we estimate the computational effort needed to simulate realizations of the MLP scheme and combine this with the $L^{2}$ -error estimate in Corollary 3.16 to obtain a computational complexity analysis for the MLP algorithm in Proposition 3.18. Finally, in Subsection 3.6, we exploit a connection between stochastic fixed point equations and viscosity solutions of semilinear Kolmogorov PDEs to carry over the complexity analysis of Subsection 3.5 to semilinear Kolmogorov PDEs (cf. (300) in Theorem 3.20 below), demonstrating that our proposed MLP algorithm overcomes the curse of dimensionality in the approximation of semilinear Kolmogorov PDEs in Theorem 3.20, the main result of this paper.

3.1 Stochastic fixed point equations and MLP approximations

Setting 3.1.

Let $d\in\mathbb{N}$ , $T\in(0,\infty)$ , $L\in[0,\infty)$ , $\Theta=\cup_{n=1}^{\infty}\mathbb{Z}^{n}$ , $u\in C([0,T]\times\mathbb{R}^{d},\mathbb{R})$ , $g\in C(\mathbb{R}^{d},\mathbb{R})$ , $f\in C([0,T]\times\mathbb{R}^{d}\times\mathbb{R},\mathbb{R})$ satisfy for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ , $v,w\in\mathbb{R}$ that

[TABLE]

let $(\Omega,\mathcal{F},\mathbb{P})$ be a probability space, let $\mathcal{R}^{\theta}\colon\Omega\to[0,1]$ , $\theta\in\Theta$ , be independent $\mathcal{U}_{[0,1]}$ -distributed random variables, let $R^{\theta}=(R^{\theta}_{t})_{t\in[0,T]}\colon[0,T]\times\Omega\to[0,T]$ , $\theta\in\Theta$ , be the stochastic processes which satisfy for all $t\in[0,T]$ , $\theta\in\Theta$ that

[TABLE]

let $X^{\theta}=(X^{\theta}_{t,s}(x))_{s\in[t,T],t\in[0,T],x\in\mathbb{R}^{d}}\colon\{(t,s)\in[0,T]^{2}\colon t\leq s\}\times\mathbb{R}^{d}\times\Omega\to\mathbb{R}^{d}$ , $\theta\in\Theta$ , be independent continuous random fields which satisfy for all $r,s,t\in[0,T]$ , $x\in\mathbb{R}^{d}$ , $\theta,\vartheta\in\Theta$ , $B\in\mathcal{B}(\mathbb{R}^{d})$ with $t\leq s\leq r$ and $\theta\neq\vartheta$ that $\mathbb{P}(X^{\theta}_{t,t}(x)=x)=1$ and

[TABLE]

assume that $(X^{\theta})_{\theta\in\Theta}$ and $(\mathcal{R}^{\theta})_{\theta\in\Theta}$ are independent, assume for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ that $\mathbb{E}\big{[}|g(X^{0}_{t,T}(x))|+\int_{t}^{T}|f(r,X^{0}_{t,r}(x),u(r,X^{0}_{t,r}(x)))|\,dr\big{]}<\infty$ and

[TABLE]

and let $V^{\theta}_{M,n}\colon[0,T]\times\mathbb{R}^{d}\times\Omega\to\mathbb{R}$ , $M,n\in\mathbb{Z}$ , $\theta\in\Theta$ , be functions which satisfy for all $M,n\in\mathbb{N}$ , $\theta\in\Theta$ , $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ that $V^{\theta}_{M,-1}(t,x)=V^{\theta}_{M,0}(t,x)=0$ and

[TABLE]
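To make the recursive structure of the MLP approximations in Setting 3.1 more concrete, the following minimal sketch simulates one realization of $V_{M,n}(t,x)$. Several ingredients are assumptions that the text above does not spell out: the recursion is written in the form common in the MLP literature (a Monte Carlo average of $g$ plus, for each level $k\in\{0,1,\ldots,n-1\}$, an average over $M^{n-k}$ samples of differences of $f$ evaluated at the level $k$ and level $k-1$ approximations), the random times are taken as $R_{t}=t+(T-t)\mathcal{R}$, and the helper `sample_X` uses Brownian motion as a stand-in for the random fields $X^{\theta}$. Drawing fresh random numbers in every recursive call plays the role of the independent index families $\theta\in\Theta$.

```python
import numpy as np


def sample_X(t, s, x, rng):
    """Hypothetical dynamics: Brownian motion started at x at time t, observed at s."""
    return x + np.sqrt(s - t) * rng.standard_normal(np.shape(x))


def mlp(n, M, t, x, T, g, f, rng):
    """One realization of the MLP approximation V_{M,n}(t, x) (assumed form)."""
    if n <= 0:
        return 0.0  # V_{M,-1} = V_{M,0} = 0
    # Monte Carlo average of the terminal condition over M^n samples.
    value = sum(g(sample_X(t, T, x, rng)) for _ in range(M ** n)) / M ** n
    for k in range(n):
        m = M ** (n - k)
        acc = 0.0
        for _ in range(m):
            r = t + (T - t) * rng.uniform()   # random time in [t, T]
            y = sample_X(t, r, x, rng)        # state at the random time
            acc += f(r, y, mlp(k, M, r, y, T, g, f, rng))
            if k >= 1:                        # indicator 1_N(k) on the correction term
                acc -= f(r, y, mlp(k - 1, M, r, y, T, g, f, rng))
        value += (T - t) * acc / m
    return value
```

A call such as `mlp(3, 3, 0.0, np.zeros(10), 1.0, g, f, np.random.default_rng(0))` returns one sample of the approximation in dimension $10$. For a nonlinearity that does not depend on its last argument, the corrections at levels $k\geq 1$ cancel and the scheme reduces to a plain Monte Carlo estimator, which provides a simple sanity check.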

3.2 A priori bounds for solutions of stochastic fixed point equations

In our $L^{2}$ -error analysis (see Subsection 3.4 below) of the MLP scheme introduced in Setting 3.1 we need to estimate expectations involving the solution of the stochastic fixed point equation. This estimate is carried out in Lemma 3.3 below. In order to prove Lemma 3.3 we need the elementary and well-known time-reversed Gronwall inequality in Lemma 3.2.

Lemma 3.2 (Time reversed time-continuous Gronwall inequality).

Let $T,\alpha,\beta\in[0,\infty)$ and let $\epsilon\colon[0,T]\to[0,\infty]$ be a $\mathcal{B}([0,T])/\mathcal{B}([0,\infty])$ -measurable function which satisfies for all $t\in[0,T]$ that $\int_{0}^{T}\epsilon(r)\,dr<\infty$ and $\epsilon(t)\leq\alpha+\beta\int_{t}^{T}\epsilon(r)\,dr.$ Then

(i)

it holds for all $t\in[0,T]$ that $\epsilon(t)\leq\alpha\exp(\beta(T-t))$ and

(ii)

it holds that $\sup_{t\in[0,T]}\epsilon(t)\leq\alpha\exp(\beta T)<\infty$ .

Proof of Lemma 3.2.

Throughout this proof let $\Phi\colon[0,T]\to[0,T]$ and $\varepsilon\colon[0,T]\to[0,\infty]$ be the functions which satisfy for all $t\in[0,T]$ that

[TABLE]

Observe that the integral transformation theorem (see, e.g., Klenke [66, Theorem 4.10]) implies that for all $t\in[0,T]$ it holds that

[TABLE]

Hence, we obtain that

[TABLE]

Moreover, observe that (157), (158), and the hypothesis that for all $t\in[0,T]$ it holds that $\epsilon(t)\leq\alpha+\beta\int_{t}^{T}\epsilon(r)\,dr$ assure that for all $t\in[0,T]$ it holds that

[TABLE]

Combining this and (159) with Gronwall’s integral inequality (cf., e.g., Grohs et al. [48, Lemma 2.11]) demonstrates that for all $t\in[0,T]$ it holds that

[TABLE]

Hence, we obtain that for all $t\in[0,T]$ it holds that

[TABLE]

This establishes items (i)–(ii). The proof of Lemma 3.2 is thus completed. ∎
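The bound in item (i) of Lemma 3.2 cannot be improved in general: the function $\epsilon(t)=\alpha\exp(\beta(T-t))$ satisfies the integral hypothesis with equality. The following minimal sketch checks this numerically with a trapezoidal rule. The function name `gronwall_gap`, the grid size, and the parameter values in the usage note are illustrative assumptions.

```python
import numpy as np


def gronwall_gap(alpha, beta, T, num=2001):
    """Max over a grid of |eps(t) - (alpha + beta * int_t^T eps(r) dr)|
    for the extremal function eps(t) = alpha * exp(beta * (T - t))."""
    ts = np.linspace(0.0, T, num)
    eps = alpha * np.exp(beta * (T - ts))
    dt = ts[1] - ts[0]
    # Trapezoidal contribution of each subinterval [ts[i], ts[i+1]].
    seg = 0.5 * (eps[1:] + eps[:-1]) * dt
    # tail[i] approximates the integral of eps over [ts[i], T].
    tail = np.concatenate([np.cumsum(seg[::-1])[::-1], [0.0]])
    rhs = alpha + beta * tail
    return float(np.max(np.abs(eps - rhs)))
```

For instance, `gronwall_gap(1.0, 2.0, 1.0)` is of the order of the quadrature error, which reflects that the hypothesis holds with equality for this choice of $\epsilon$.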

Lemma 3.3.

Assume Setting 3.1, let $\xi\in\mathbb{R}^{d}$ , $C\in[0,\infty]$ satisfy that

[TABLE]

and assume that $\int_{0}^{T}\left(\mathbb{E}\!\left[|u(t,X^{0}_{0,t}(\xi))|^{2}\right]\right)^{\nicefrac{{1}}{{2}}}\,dt<\infty$ . Then

(i)

it holds for all $t\in[0,T]$ that $\left(\mathbb{E}\!\left[|u(t,X^{0}_{0,t}(\xi))|^{2}\right]\right)^{\!\nicefrac{{1}}{{2}}}\leq C\exp(L(T-t))$ and

(ii)

it holds that $\sup_{t\in[0,T]}\left(\mathbb{E}\!\left[|u(t,X^{0}_{0,t}(\xi))|^{2}\right]\right)^{\!\nicefrac{{1}}{{2}}}\leq C\exp(LT)$ .

Proof of Lemma 3.3.

Throughout this proof assume w.l.o.g. that $C<\infty$ and let $\mu_{t}\colon\mathcal{B}(\mathbb{R}^{d})\to[0,1]$ , $t\in[0,T]$ , be the probability measures which satisfy for all $t\in[0,T]$ , $B\in\mathcal{B}(\mathbb{R}^{d})$ that

[TABLE]

(cf. item (iv) in Lemma 3.6). Note that (155) and the triangle inequality ensure that for all $t\in[0,T]$ it holds that

[TABLE]

Jensen’s inequality hence assures that for all $t\in[0,T]$ it holds that

[TABLE]

Furthermore, observe that (164), the fact that $X^{0}$ and $X^{1}$ are independent and continuous random fields, (154), and Lemma 2.15 demonstrate that for all $t\in[0,T]$ it holds that

[TABLE]

In addition, note that Minkowski’s integral inequality (cf., e.g., Jentzen & Kloeden [61, Proposition 8 in Appendix A.1]), (164), the fact that $X^{0}$ and $X^{1}$ are independent and continuous random fields, (154), and Lemma 2.15 imply that for all $t\in[0,T]$ it holds that

[TABLE]

Moreover, observe that (152) ensures that for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ , $v\in\mathbb{R}$ it holds that

[TABLE]

This, (168), and the triangle inequality imply that for all $t\in[0,T]$ it holds that

[TABLE]

Furthermore, note that Lemma 2.9 assures that for all $t\in[0,T]$ it holds that

[TABLE]

Combining this with (163), (166), (167), and (170) implies that for all $t\in[0,T]$ it holds that

[TABLE]

The hypothesis that $\int_{0}^{T}\left(\mathbb{E}\!\left[|u(t,X^{0}_{0,t}(\xi))|^{2}\right]\right)^{\nicefrac{{1}}{{2}}}\,dt<\infty$ and Lemma 3.2 (with $T=T$ , $\alpha=C$ , $\beta=L$ , $(\epsilon(t))_{t\in[0,T]}=\big{(}(\mathbb{E}[|u(t,X^{0}_{0,t}(\xi))|^{2}])^{\nicefrac{{1}}{{2}}}\big{)}_{t\in[0,T]}$ in the notation of Lemma 3.2) hence establish items (i)–(ii). The proof of Lemma 3.3 is thus completed. ∎

3.3 Properties of MLP approximations

In this subsection we establish in Lemma 3.6 below some elementary properties of the MLP approximations (cf. (156) in Setting 3.1 above) introduced in Setting 3.1 above. For this we need two elementary and well-known results on identically distributed random variables (see Lemma 3.4 and Lemma 3.5 below).

Lemma 3.4.

Let $d,N\in\mathbb{N}$ , let $(\Omega,\mathcal{F},\mathbb{P})$ be a probability space, let $X_{k}\colon\Omega\to\mathbb{R}^{d}$ , $k\in\{1,2,\ldots,N\}$ , be independent random variables, let $Y_{k}\colon\Omega\to\mathbb{R}^{d}$ , $k\in\{1,2,\ldots,N\}$ , be independent random variables, and assume for every $k\in\{1,2,\ldots,N\}$ that $X_{k}$ and $Y_{k}$ are identically distributed. Then it holds that $\big{(}\sum_{k=1}^{N}X_{k}\big{)}\colon\Omega\to\mathbb{R}^{d}$ and $\big{(}\sum_{k=1}^{N}Y_{k}\big{)}\colon\Omega\to\mathbb{R}^{d}$ are identically distributed random variables.

Proof of Lemma 3.4.

Throughout this proof let $\mathfrak{X},\mathfrak{Y}\colon\Omega\to\mathbb{R}^{Nd}$ be the random variables which satisfy that

[TABLE]

and let $f\in C(\mathbb{R}^{Nd},\mathbb{R}^{d})$ be the function which satisfies for all $v_{1},v_{2},\ldots,v_{N}\in\mathbb{R}^{d}$ that $f(v_{1},v_{2},\ldots,v_{N})=\sum_{k=1}^{N}v_{k}$ . Observe that the hypothesis that $(X_{k})_{k\in\{1,2,\ldots,N\}}$ are independent, the hypothesis that $(Y_{k})_{k\in\{1,2,\ldots,N\}}$ are independent, and the hypothesis that for every $k\in\{1,2,\ldots,N\}$ it holds that $X_{k}$ and $Y_{k}$ are identically distributed random variables assure that for all $(B_{k})_{k\in\{1,2,\ldots,N\}}\subseteq\mathcal{B}(\mathbb{R}^{d})$ it holds that

[TABLE]

This, the fact that

[TABLE]

and the uniqueness theorem for measures (see, e.g., Klenke [66, Lemma 1.42]) imply that it holds for all $B\in\mathcal{B}(\mathbb{R}^{Nd})$ that

[TABLE]

Hence, we obtain that for all $B\in\mathcal{B}(\mathbb{R}^{d})$ it holds that

[TABLE]

This shows that $\big{(}\sum_{k=1}^{N}X_{k}\big{)}\colon\Omega\to\mathbb{R}^{d}$ and $\big{(}\sum_{k=1}^{N}Y_{k}\big{)}\colon\Omega\to\mathbb{R}^{d}$ are identically distributed random variables. The proof of Lemma 3.4 is thus completed. ∎

Lemma 3.5.

Let $(\Omega,\mathcal{F},\mathbb{P})$ be a probability space, let $(S,\delta)$ be a separable metric space, let $(E,\delta)$ be a metric space, let $U,V\colon S\times\Omega\to E$ be continuous random fields, let $X,Y\colon\Omega\to S$ be random variables, assume that $U$ and $X$ are independent, assume that $V$ and $Y$ are independent, assume for all $s\in S$ that $U(s)$ and $V(s)$ are identically distributed, and assume that $X$ and $Y$ are identically distributed. Then it holds that $U(X)=(U(X(\omega),\omega))_{\omega\in\Omega}\colon\Omega\to E$ and $V(Y)=(V(Y(\omega),\omega))_{\omega\in\Omega}\colon\Omega\to E$ are identically distributed random variables.

Proof of Lemma 3.5.

First, note that Grohs et al. [3, Lemma 2.4], the fact that $U$ and $V$ are continuous random fields, and Lemma 2.14 ensure that $U(X)$ and $V(Y)$ are random variables. Next observe that the hypothesis that $U$ and $X$ are independent, the hypothesis that $V$ and $Y$ are independent, the hypothesis that for all $s\in S$ it holds that $U(s)$ and $V(s)$ are identically distributed, the hypothesis that $X$ and $Y$ are identically distributed, and Lemma 2.16 demonstrate that for all globally bounded and Lipschitz continuous functions $g\colon E\to\mathbb{R}$ it holds that

[TABLE]

Combining this with Lemma 2.13 assures that $U(X)$ and $V(Y)$ are identically distributed. The proof of Lemma 3.5 is thus completed. ∎

Lemma 3.6 (Properties of MLP approximations).

Assume Setting 3.1 and let $M\in\mathbb{N}$ . Then

(i)

for all $\theta\in\Theta$ , $n\in\mathbb{N}_{0}$ it holds that $V^{\theta}_{M,n}\colon[0,T]\times\mathbb{R}^{d}\times\Omega\to\mathbb{R}$ is a continuous random field,

(ii)

for all $\theta\in\Theta$ , $n\in\mathbb{N}_{0}$ it holds that $V^{\theta}_{M,n}$ is $\left(\mathcal{B}([0,T]\times\mathbb{R}^{d})\otimes\mathfrak{S}((\mathcal{R}^{(\theta,\vartheta)})_{\vartheta\in\Theta},(X^{(\theta,\vartheta)})_{\vartheta\in\Theta})\right)\!/\mathcal{B}(\mathbb{R})$ -measurable,

(iii)

for all $\theta\in\Theta$ , $n\in\mathbb{N}_{0}$ , $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ it holds that

[TABLE]

is an independent family of random variables,

(iv)

for all $t\in[0,T]$ , $s\in[t,T]$ , $x\in\mathbb{R}^{d}$ it holds that $X^{\theta}_{t,s}(x)\colon\Omega\to\mathbb{R}^{d}$ , $\theta\in\Theta$ , are identically distributed random variables, and

(v)

for all $n\in\mathbb{N}_{0}$ , $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ it holds that $V^{\theta}_{M,n}(t,x)\colon\Omega\to\mathbb{R}$ , $\theta\in\Theta$ , are identically distributed random variables.

Proof of Lemma 3.6.

We first prove item (i) by induction on $n\in\mathbb{N}_{0}$ . For the base case $n=0$ observe that the hypothesis that for all $\theta\in\Theta$ it holds that $V^{\theta}_{M,0}=0$ demonstrates that for all $\theta\in\Theta$ it holds that $V^{\theta}_{M,0}\colon[0,T]\times\mathbb{R}^{d}\times\Omega\to\mathbb{R}$ is a continuous random field. This establishes item (i) in the base case $n=0$ . For the induction step $\mathbb{N}_{0}\ni(n-1)\to n\in\mathbb{N}$ let $n\in\mathbb{N}$ and assume that for every $k\in\mathbb{N}_{0}\cap[0,n)$ , $\theta\in\Theta$ it holds that $V^{\theta}_{M,k}\colon[0,T]\times\mathbb{R}^{d}\times\Omega\to\mathbb{R}$ is a continuous random field. Combining this, the hypothesis that $g$ and $f$ are continuous functions, and the fact that for all $\theta\in\Theta$ it holds that $R^{\theta}\colon[0,T]\times\Omega\to[0,T]$ and $X^{\theta}\colon\{(t,s)\in[0,T]^{2}\colon t\leq s\}\times\mathbb{R}^{d}\times\Omega\to\mathbb{R}^{d}$ are continuous random fields with (156), Grohs et al. [3, Lemma 2.4], and Lemma 2.14 proves that for all $\theta\in\Theta$ it holds that $V^{\theta}_{M,n}\colon[0,T]\times\mathbb{R}^{d}\times\Omega\to\mathbb{R}$ is a continuous random field. Induction thus establishes item (i).

Next we prove item (ii) by induction on $n\in\mathbb{N}_{0}$ . For the base case $n=0$ observe that the hypothesis that for all $\theta\in\Theta$ it holds that $V^{\theta}_{M,0}=0$ demonstrates that for all $\theta\in\Theta$ it holds that $V^{\theta}_{M,0}\colon[0,T]\times\mathbb{R}^{d}\times\Omega\to\mathbb{R}$ is $\left(\mathcal{B}([0,T]\times\mathbb{R}^{d})\otimes\mathfrak{S}((\mathcal{R}^{(\theta,\vartheta)})_{\vartheta\in\Theta},(X^{(\theta,\vartheta)})_{\vartheta\in\Theta})\right)/\mathcal{B}(\mathbb{R})$ -measurable. This implies item (ii) in the base case $n=0$ .
For the induction step $\mathbb{N}_{0}\ni(n-1)\to n\in\mathbb{N}$ let $n\in\mathbb{N}$ and assume that for all $k\in\mathbb{N}_{0}\cap[0,n)$ , $\theta\in\Theta$ it holds that $V^{\theta}_{M,k}$ is $\left(\mathcal{B}([0,T]\times\mathbb{R}^{d})\otimes\mathfrak{S}((\mathcal{R}^{(\theta,\vartheta)})_{\vartheta\in\Theta},(X^{(\theta,\vartheta)})_{\vartheta\in\Theta})\right)/\mathcal{B}(\mathbb{R})$ -measurable. Combining this, the fact that $f$ and $g$ are Borel measurable, and the fact that for all $\theta\in\Theta$ it holds that $X^{\theta}\colon\{(t,s)\in[0,T]^{2}\colon t\leq s\}\times\mathbb{R}^{d}\times\Omega\to\mathbb{R}^{d}$ is a continuous random field with (156) and Lemma 2.14 proves that for all $\theta\in\Theta$ , $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ it holds that

[TABLE]

Moreover, observe that item (i) and Grohs et al. [3, Lemma 2.4] ensure that for all $\theta\in\Theta$ it holds that $V^{\theta}_{M,n}$ is $\left(\mathcal{B}([0,T]\times\mathbb{R}^{d})\otimes\mathfrak{S}(V^{\theta}_{M,n})\right)\!/\mathcal{B}(\mathbb{R})$ -measurable. Combining this with (180) demonstrates that for all $\theta\in\Theta$ it holds that $V^{\theta}_{M,n}$ is $\left(\mathcal{B}([0,T]\times\mathbb{R}^{d})\otimes\mathfrak{S}((\mathcal{R}^{(\theta,\vartheta)})_{\vartheta\in\Theta},(X^{(\theta,\vartheta)})_{\vartheta\in\Theta})\right)/\mathcal{B}(\mathbb{R})$ -measurable. Induction thus establishes item (ii). Furthermore, observe that item (ii), the hypothesis that $(X^{\theta})_{\theta\in\Theta}$ are independent, the hypothesis that $(\mathcal{R}^{\theta})_{\theta\in\Theta}$ are independent, the hypothesis that $(X^{\theta})_{\theta\in\Theta}$ and $(\mathcal{R}^{\theta})_{\theta\in\Theta}$ are independent, and Lemma 2.14 prove item (iii). Next observe that (154), the hypothesis that $(X^{\theta})_{\theta\in\Theta}$ are independent, Lemma 2.16 (with $S=\mathbb{R}^{d}$ , $U=g(X^{\theta}_{s,s}(\cdot))$ , $X=X^{\vartheta}_{t,s}(x)$ for $g\in C(\mathbb{R}^{d},\mathbb{R})$ , $t\in[0,T]$ , $s\in[t,T]$ , $x\in\mathbb{R}^{d}$ , $\theta,\vartheta\in\Theta$ in the notation of Lemma 2.16), and the fact that for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ , $\theta\in\Theta$ it holds that $\mathbb{P}(X^{\theta}_{t,t}(x)=x)=1$ assure that for all $t\in[0,T]$ , $s\in[t,T]$ , $x\in\mathbb{R}^{d}$ , $\theta,\vartheta\in\Theta$ with $\theta\neq\vartheta$ and all globally bounded and continuous functions $g\colon\mathbb{R}^{d}\to\mathbb{R}$ it holds that

[TABLE]

Combining this with Lemma 2.13 demonstrates that for all $t\in[0,T]$ , $s\in[t,T]$ , $x\in\mathbb{R}^{d}$ , $\theta,\vartheta\in\Theta$ it holds that $X^{\theta}_{t,s}(x)\colon\Omega\to\mathbb{R}^{d}$ and $X^{\vartheta}_{t,s}(x)\colon\Omega\to\mathbb{R}^{d}$ are identically distributed random variables. This establishes item (iv).

Next we prove item (v) by induction on $n\in\mathbb{N}_{0}$ . For the base case $n=0$ observe that the hypothesis that for all $\theta\in\Theta$ it holds that $V^{\theta}_{M,0}=0$ demonstrates that for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ it holds that $V^{\theta}_{M,0}(t,x)\colon\Omega\to\mathbb{R}$ , $\theta\in\Theta$ , are identically distributed random variables. This establishes item (v) in the base case $n=0$ . For the induction step $\mathbb{N}_{0}\ni(n-1)\to n\in\mathbb{N}$ let $n\in\mathbb{N}$ and assume that for all $k\in\mathbb{N}_{0}\cap[0,n)$ , $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ it holds that $V^{\theta}_{M,k}(t,x)\colon\Omega\to\mathbb{R}$ , $\theta\in\Theta$ , are identically distributed random variables.
This, the hypothesis that $(X^{\theta})_{\theta\in\Theta}$ are independent, the hypothesis that $(R^{\theta})_{\theta\in\Theta}$ are independent, the hypothesis that $(X^{\theta})_{\theta\in\Theta}$ and $(\mathcal{R}^{\theta})_{\theta\in\Theta}$ are independent, item (ii), Lemma 3.4, and Lemma 3.5 (with $S=[0,T]\times\mathbb{R}^{d}$ , $E=\mathbb{R}$ , $U=\big{(}f(s,y,V^{(\theta,k,m)}_{M,k}(s,y))-\mathbbm{1}_{\mathbb{N}}(k)f(s,y,V^{(\theta,k,-m)}_{M,k-1}(s,y))\big{)}_{(s,y)\in[0,T]\times\mathbb{R}^{d}}$ , $V=\big{(}f(s,y,V^{(\vartheta,k,m)}_{M,k}(s,y))-\mathbbm{1}_{\mathbb{N}}(k)f(s,y,V^{(\vartheta,k,-m)}_{M,k-1}(s,y))\big{)}_{(s,y)\in[0,T]\times\mathbb{R}^{d}}$ , $X=(R^{(\theta,k,m)}_{t},X^{(\theta,k,m)}_{t,R^{(\theta,k,m)}_{t}}(x))$ , $Y=(R^{(\vartheta,k,m)}_{t},X^{(\vartheta,k,m)}_{t,R^{(\vartheta,k,m)}_{t}}(x))$ for $\theta,\vartheta\in\Theta$ , $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ , $k\in\mathbb{N}_{0}\cap[0,n)$ , $m\in\mathbb{N}$ with $\theta\neq\vartheta$ in the notation of Lemma 3.5) assure that for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ , $k\in\mathbb{N}_{0}\cap[0,n)$ , $m\in\mathbb{N}$ it holds that

[TABLE]

are identically distributed random variables. Items (iii)–(iv), (156), and Lemma 3.4 therefore ensure that for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ it holds that $V^{\theta}_{M,n}(t,x)\colon\Omega\to\mathbb{R}$ , $\theta\in\Theta$ , are identically distributed random variables. Induction thus establishes item (v). The proof of Lemma 3.6 is thus completed. ∎

3.4 Analysis of approximation errors of MLP approximations

Proposition 3.15 and Corollary 3.16 in Subsection 3.4.5 below present estimates for the $L^{2}$ -approximation error of the MLP scheme introduced in Setting 3.1 (cf. (156)) with respect to the solution of the stochastic fixed point equation (cf. (155) in Setting 3.1) for every iteration (cf. $n\in\mathbb{N}$ in (156)) and every Monte Carlo accuracy (cf. $M\in\mathbb{N}$ in (156)) of the MLP scheme. The essential idea of the proofs is to decompose the $L^{2}$ -approximation error into a bias part and a variance part and to analyze the two parts separately (see Subsections 3.4.1–3.4.3). This approach leads to a recursive inequality (cf. (240) in the proof of Proposition 3.15 below), which can be treated using an elementary Gronwall inequality proven in Subsection 3.4.4 (see Lemma 3.12). The proofs in this subsection also rely on some elementary and well-known results (see Lemma 3.7, Lemma 3.10, and Lemma 3.14), which we state and prove where they are used.

3.4.1 Expectations of MLP approximations

Lemma 3.7.

Assume Setting 3.1, let $\theta\in\Theta$ , $t\in[0,T]$ , let $U_{1}\colon[t,T]\times\Omega\to[0,\infty]$ and $U_{2}\colon[t,T]\times\Omega\to\mathbb{R}$ be continuous random fields which satisfy for all $i\in\{1,2\}$ that $U_{i}$ and $\mathcal{R}^{\theta}$ are independent and $\int_{t}^{T}\mathbb{E}\!\left[|U_{2}(r)|\right]dr<\infty$ . Then it holds for all $i\in\{1,2\}$ that $\operatorname{Borel}_{[t,T]}(\{r\in[t,T]\colon\mathbb{E}[|U_{2}({r})|]=\infty\})=0$ , $\mathbb{E}\!\left[|U_{2}(R^{\theta}_{t})|\right]<\infty$ , and

[TABLE]

Proof of Lemma 3.7.

Throughout this proof assume w.l.o.g. that $t<T$ . Observe that (153) implies that $R^{\theta}_{t}$ is $\mathcal{U}_{[t,T]}$ -distributed. Combining this with the fact that $U_{1}$ is continuous, the fact that $U_{1}$ and $R^{\theta}_{t}$ are independent, and Lemma 2.15 assures that

[TABLE]

In addition, note that the fact that $R^{\theta}_{t}$ is $\mathcal{U}_{[t,T]}$ -distributed, the fact that $U_{2}$ is continuous, the fact that $U_{2}$ and $R^{\theta}_{t}$ are independent, the hypothesis that $\int_{t}^{T}\mathbb{E}\!\left[|U_{2}(r)|\right]dr<\infty$ , and Lemma 2.16 ensure that $\operatorname{Borel}_{[t,T]}(\{r\in[t,T]\colon\mathbb{E}[|U_{2}({r})|]=\infty\})=0$ , $\mathbb{E}\!\left[|U_{2}(R^{\theta}_{t})|\right]<\infty$ , and

[TABLE]

Combining this with (184) establishes (183). The proof of Lemma 3.7 is thus completed. ∎
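The role of Lemma 3.7 can be illustrated numerically. The following is a minimal sketch, assuming (as the proof indicates) that $R^{\theta}_{t}$ is uniformly distributed on $[t,T]$ and that it is generated as $t+(T-t)\mathcal{R}^{\theta}$ with $\mathcal{R}^{\theta}$ uniform on $[0,1]$; the helper name `time_integral_by_sampling` and the deterministic integrand are hypothetical choices made only for this illustration. It replaces a time integral over $[t,T]$ by $(T-t)$ times an average over uniformly sampled times.

```python
import random


def time_integral_by_sampling(u, t, T, num_samples, rng):
    """Estimate int_t^T u(r) dr as (T - t) * mean(u(R)), R uniform on [t, T].

    Illustrative helper (not from the paper): R is generated as
    t + (T - t) * U with U uniform on [0, 1).
    """
    total = 0.0
    for _ in range(num_samples):
        r = t + (T - t) * rng.random()  # uniformly distributed time in [t, T)
        total += u(r)
    return (T - t) * total / num_samples


# Example: u(r) = r^2 on [0.5, 2], whose exact integral is (8 - 0.125) / 3 = 2.625.
estimate = time_integral_by_sampling(lambda r: r * r, 0.5, 2.0, 200_000, random.Random(0))
```

In the MLP scheme the same device turns the time integral of the nonlinearity into an expectation that can be approximated by Monte Carlo averages.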

Lemma 3.8 (Expectations of MLP approximations).

Assume Setting 3.1 and assume for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ that $\int_{t}^{T}\mathbb{E}\!\left[|f(r,X^{0}_{t,r}(x),0)|\right]\,dr<\infty$ . Then

(i)

for all $M\in\mathbb{N}$ , $n\in\mathbb{N}_{0}$ , $t\in[0,T]$ , $s\in[t,T]$ , $x\in\mathbb{R}^{d}$ it holds that

[TABLE]

and

(ii)

for all $M,n\in\mathbb{N}$ , $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ it holds that

[TABLE]

Proof of Lemma 3.8.

Throughout this proof let $M\in\mathbb{N}$ , $x\in\mathbb{R}^{d}$ . Observe that Lemma 3.7, items (i)–(ii) in Lemma 3.6, and the fact that for all $n\in\mathbb{N}_{0}$ it holds that $V_{M,n}^{0}$ , $X^{0}$ , and $\mathcal{R}^{0}$ are independent demonstrate that for all $n\in\mathbb{N}_{0}$ , $t\in[0,T]$ it holds that

[TABLE]

Next we claim that for all $n\in\mathbb{N}_{0}$ , $t\in[0,T]$ , $s\in[t,T]$ it holds that

[TABLE]

We now prove (189) by induction on $n\in\mathbb{N}_{0}$ . For the base case $n=0$ observe that the hypothesis that $V^{0}_{M,0}=0$ and the hypothesis that for all $t\in[0,T]$ it holds that $\int_{t}^{T}\mathbb{E}\!\left[|f(r,X^{0}_{t,r}(x),0)|\right]\,dr<\infty$ imply that for all $t\in[0,T]$ , $s\in[t,T]$ it holds that

[TABLE]

This establishes (189) in the case $n=0$ . For the induction step $\mathbb{N}_{0}\ni(n-1)\to n\in\mathbb{N}$ let $n\in\mathbb{N}$ and assume that for all $k\in\mathbb{N}_{0}\cap[0,n)$ , $t\in[0,T]$ , $s\in[t,T]$ it holds that

[TABLE]

Note that (156) and the triangle inequality ensure that for all $t\in[0,T]$ , $s\in[t,T]$ it holds that

[TABLE]

Furthermore, observe that (154), (155), and item (iv) in Lemma 3.6 assure that for all $m\in\mathbb{Z}$ , $t\in[0,T]$ , $s\in[t,T]$ it holds that

[TABLE]

Moreover, note that Lemma 3.7, the hypothesis that $(X^{\theta})_{\theta\in\Theta}$ are independent, the hypothesis that $(\mathcal{R}^{\theta})_{\theta\in\Theta}$ are independent, the hypothesis that $(X^{\theta})_{\theta\in\Theta}$ and $(\mathcal{R}^{\theta})_{\theta\in\Theta}$ are independent, items (i)–(ii) & (iv)–(v) in Lemma 3.6, (154), and Lemma 2.15 demonstrate that for all $i,j,l,m\in\mathbb{Z}$ , $k\in\mathbb{N}_{0}$ , $t\in[0,T]$ , $s\in[t,T]$ it holds that

[TABLE]

Combining this with (191), (192), and (193) establishes that for all $t\in[0,T]$ , $s\in[t,T]$ it holds that

[TABLE]

Hence, we obtain that for all $t\in[0,T]$ it holds that

[TABLE]

The hypothesis that for all $t\in[0,T]$ it holds that $\int_{t}^{T}\mathbb{E}\!\left[|f(r,X^{0}_{t,r}(x),0)|\right]\,dr<\infty$ and the fact that for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ , $v\in\mathbb{R}$ it holds that $|f(t,x,v)|\leq|f(t,x,0)|+L|v|$ therefore assure that for all $t\in[0,T]$ it holds that

[TABLE]

This, (195), and (196) establish that for all $t\in[0,T]$ , $s\in[t,T]$ it holds that

[TABLE]

Induction thus proves (189). Combining (188) and (189) establishes item (i). Next observe that (156), (189), items (i)–(ii) & (iv)–(v) in Lemma 3.6, the hypothesis that $(X^{\theta})_{\theta\in\Theta}$ are independent, the hypothesis that $(\mathcal{R}^{\theta})_{\theta\in\Theta}$ are independent, the hypothesis that $(X^{\theta})_{\theta\in\Theta}$ and $(\mathcal{R}^{\theta})_{\theta\in\Theta}$ are independent, and Lemma 3.5 ensure that for all $n\in\mathbb{N}$ , $t\in[0,T]$ it holds that

[TABLE]

Lemma 3.7, items (i)–(ii) in Lemma 3.6, the fact that for all $n\in\mathbb{N}_{0}$ it holds that $V^{0}_{M,n}$ , $X^{0}$ , and $\mathcal{R}^{0}$ are independent, (189), and Fubini’s theorem therefore imply that for all $n\in\mathbb{N}$ , $t\in[0,T]$ it holds that

[TABLE]

This establishes item (ii). The proof of Lemma 3.8 is thus completed. ∎

3.4.2 Biases of MLP approximations

Lemma 3.9 (Biases of MLP approximations).

Assume Setting 3.1 and assume for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ that $\int_{t}^{T}\mathbb{E}\!\left[|f(r,X^{0}_{t,r}(x),0)|\right]\,dr<\infty$ . Then it holds for all $M,n\in\mathbb{N}$ , $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ that

[TABLE]

Proof of Lemma 3.9.

Note that Lemma 3.8, the hypothesis that for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ it holds that $\int_{t}^{T}\mathbb{E}\!\left[|f(r,X^{0}_{t,r}(x),0)|\right]\,dr<\infty$ , (152), (155), and Tonelli’s theorem demonstrate that for all $M,n\in\mathbb{N}$ , $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ it holds that

[TABLE]

Lemma 2.9 and Jensen’s inequality hence show that for all $M,n\in\mathbb{N}$ , $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ it holds that

[TABLE]

The proof of Lemma 3.9 is thus completed. ∎

3.4.3 Estimates for the variances of MLP approximations

Lemma 3.10.

Let $n\in\mathbb{N}$ , let $(\Omega,\mathcal{F},\mathbb{P})$ be a probability space, and let $X_{1},X_{2},\ldots,X_{n}\colon\Omega\to\mathbb{R}$ be independent random variables which satisfy for all $i\in\{1,2,\ldots,n\}$ that $\mathbb{E}\!\left[|X_{i}|\right]<\infty$ . Then it holds that

[TABLE]

Proof of Lemma 3.10.

Note that the fact that for all independent random variables $Y,Z\colon\Omega\to\mathbb{R}$ with $\mathbb{E}[|Y|+|Z|]<\infty$ it holds that $\mathbb{E}[|YZ|]<\infty$ and $\mathbb{E}[YZ]=\mathbb{E}[Y]\,\mathbb{E}[Z]$ (cf., e.g., Klenke [66, Theorem 5.4]) and the hypothesis that $X_{i}\colon\Omega\to\mathbb{R}$ , $i\in\{1,2,\dots,n\}$ , are independent random variables assure that

[TABLE]

The proof of Lemma 3.10 is thus completed. ∎
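The displayed identity of Lemma 3.10 is not reproduced in this text; the sketch below assumes it is the usual additivity of the variance for independent random variables, $\operatorname{Var}(\sum_{i}X_{i})=\sum_{i}\operatorname{Var}(X_{i})$. The check is exact (rational arithmetic) for finitely supported distributions; the representation of a distribution as a list of `(value, probability)` pairs and all function names are hypothetical.

```python
from fractions import Fraction
from itertools import product


def mean(dist):
    """Expectation of a finitely supported distribution [(value, prob), ...]."""
    return sum(p * v for v, p in dist)


def variance(dist):
    """Variance of a finitely supported distribution [(value, prob), ...]."""
    m = mean(dist)
    return sum(p * (v - m) ** 2 for v, p in dist)


def sum_distribution(dists):
    """Distribution of X_1 + ... + X_n for independent X_i (product law)."""
    out = {}
    for combo in product(*dists):
        value = sum(v for v, _ in combo)
        prob = Fraction(1)
        for _, p in combo:
            prob *= p  # independence: joint probability is the product
        out[value] = out.get(value, Fraction(0)) + prob
    return list(out.items())


X1 = [(0, Fraction(1, 2)), (1, Fraction(1, 2))]   # variance 1/4
X2 = [(-1, Fraction(1, 3)), (2, Fraction(2, 3))]  # variance 2
X3 = [(0, Fraction(1, 4)), (4, Fraction(3, 4))]   # variance 3
lhs = variance(sum_distribution([X1, X2, X3]))
rhs = sum(variance(d) for d in (X1, X2, X3))
```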

Lemma 3.11 (Estimates for the variances of MLP approximations).

Assume Setting 3.1 and assume for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ that $\int_{t}^{T}\mathbb{E}\!\left[|f(r,X^{0}_{t,r}(x),0)|\right]\,dr<\infty$ . Then it holds for all $M,n\in\mathbb{N}$ , $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ that

[TABLE]

Proof of Lemma 3.11.

Throughout this proof let $M,n\in\mathbb{N}$ , $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ . Observe that Lemma 3.10, item (i) in Lemma 3.8, the fact that for all $\theta\in\Theta$ it holds that $\mathbb{E}\!\left[|g(X^{\theta}_{t,T}(x))|\right]<\infty$ , item (iii) in Lemma 3.6, and (156) imply that

[TABLE]

Moreover, note that item (iv) in Lemma 3.6 and the fact that for all $Z\in\mathcal{L}^{1}(\mathbb{P},\mathbb{R})$ it holds that $\operatorname{Var}(Z)\leq\mathbb{E}[|Z|^{2}]$ ensure that

[TABLE]

In addition, note that items (i)–(ii) & (iv)–(v) in Lemma 3.6, the hypothesis that $(X^{\theta})_{\theta\in\Theta}$ are independent, the hypothesis that $(\mathcal{R}^{\theta})_{\theta\in\Theta}$ are independent, the hypothesis that $(X^{\theta})_{\theta\in\Theta}$ and $(\mathcal{R}^{\theta})_{\theta\in\Theta}$ are independent, the fact that for all $Z\in\mathcal{L}^{1}(\mathbb{P},\mathbb{R})$ it holds that $\operatorname{Var}(Z)\leq\mathbb{E}[|Z|^{2}]$ , and Lemma 3.5 show that for all $k\in\mathbb{N}_{0}\cap[0,n)$ it holds that

[TABLE]

Lemma 3.7, the fact that $X^{0}$ and $R^{0}$ are independent, and the hypothesis that for all $\theta\in\Theta$ it holds that $V^{\theta}_{M,0}=0$ therefore demonstrate that

[TABLE]

In addition, observe that (152), (209), the fact that for all $x,y\in[0,\infty)$ it holds that $|x+y|^{2}\leq 2(|x|^{2}+|y|^{2})$ , items (i)–(ii) & (v) in Lemma 3.6, the hypothesis that $(X^{\theta})_{\theta\in\Theta}$ are independent, the hypothesis that $(\mathcal{R}^{\theta})_{\theta\in\Theta}$ are independent, the hypothesis that $(X^{\theta})_{\theta\in\Theta}$ and $(\mathcal{R}^{\theta})_{\theta\in\Theta}$ are independent, and Lemma 3.5 assure that for all $k\in\mathbb{N}\cap[1,n)$ it holds that

[TABLE]

Lemma 3.7, items (i)–(ii) in Lemma 3.6, the hypothesis that $(X^{\theta})_{\theta\in\Theta}$ are independent, the hypothesis that $(\mathcal{R}^{\theta})_{\theta\in\Theta}$ are independent, and the hypothesis that $(X^{\theta})_{\theta\in\Theta}$ and $(\mathcal{R}^{\theta})_{\theta\in\Theta}$ are independent hence ensure that for all $k\in\mathbb{N}\cap[1,n)$ it holds that

[TABLE]

Combining this with (207), (208), and (210) establishes that

[TABLE]

The proof of Lemma 3.11 is thus completed. ∎

3.4.4 On a geometric time-discrete Gronwall inequality

Lemma 3.12.

Let $\alpha,\beta\in[0,\infty)$ , $M\in(0,\infty)$ , $(\epsilon_{n,q})_{n,q\in\mathbb{N}_{0}}\subseteq[0,\infty]$ satisfy for all $n,q\in\mathbb{N}_{0}$ that

[TABLE]

Then it holds for all $n,q\in\mathbb{N}_{0}$ that

[TABLE]

Proof of Lemma 3.12.

Throughout this proof assume w.l.o.g. that $\beta>0$ . We prove (215) by induction on $n\in\mathbb{N}_{0}$ . For the base case $n=0$ observe that (214) assures that for all $q\in\mathbb{N}_{0}$ it holds that

[TABLE]

This proves (215) in the base case $n=0$ . For the induction step $\mathbb{N}_{0}\ni(n-1)\to n\in\mathbb{N}$ observe that (214) ensures that for all $n\in\mathbb{N}$ , $q\in\mathbb{N}_{0}$ with $\forall\,k\in\mathbb{N}_{0}\cap[0,n),p\in\mathbb{N}_{0}\colon\epsilon_{k,p}\leq\alpha\frac{(1+\beta)^{k}}{M^{k+p}}$ it holds that

[TABLE]

Induction hence establishes (215). The proof of Lemma 3.12 is thus completed. ∎
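The recursive inequality (214) is not reproduced in this text. The following sketch assumes that it has the form $\epsilon_{n,q}\leq\frac{\alpha}{M^{n+q}}+\beta\sum_{k=0}^{n-1}\frac{\epsilon_{k,q+1}}{M^{n-(k+1)}}$, which is consistent with the induction hypothesis $\epsilon_{k,p}\leq\alpha\frac{(1+\beta)^{k}}{M^{k+p}}$ used in the proof. Taking the assumed recursion with equality, the code checks in exact rational arithmetic that the resulting sequence coincides with $\alpha\frac{(1+\beta)^{n}}{M^{n+q}}$, so under this assumption the bound cannot be improved. The names `make_eps` and `gronwall_bound` are hypothetical.

```python
from fractions import Fraction
from functools import lru_cache


def make_eps(alpha, beta, M):
    """Return eps(n, q) solving the ASSUMED recursion with equality."""

    @lru_cache(maxsize=None)
    def eps(n, q):
        # Assumed form of (214), taken with equality (worst case).
        tail = sum(eps(k, q + 1) / M ** (n - (k + 1)) for k in range(n))
        return alpha / M ** (n + q) + beta * tail

    return eps


def gronwall_bound(alpha, beta, M, n, q):
    """Right-hand side alpha * (1 + beta)^n / M^(n + q) of the claimed estimate."""
    return alpha * (1 + beta) ** n / M ** (n + q)


alpha, beta, M = Fraction(3), Fraction(1, 2), Fraction(4)
eps = make_eps(alpha, beta, M)
matches = all(
    eps(n, q) == gronwall_bound(alpha, beta, M, n, q)
    for n in range(6)
    for q in range(4)
)
```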

3.4.5 Error estimates for MLP approximations

Corollary 3.13.

Assume Setting 3.1 and assume for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ that $\int_{t}^{T}\mathbb{E}\!\left[|f(r,X^{0}_{t,r}(x),0)|\right]\,dr<\infty$ . Then it holds for all $M,n\in\mathbb{N}$ , $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ that

[TABLE]

Proof of Corollary 3.13.

Throughout this proof let $M,n\in\mathbb{N}$ , $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ , $C\in[0,\infty]$ , $(e_{k})_{k\in\mathbb{N}_{0}\cap[0,n)}\subseteq[0,\infty]$ satisfy for all $k\in\mathbb{N}_{0}\cap[0,n)$ that

[TABLE]

and

[TABLE]

Note that item (i) in Lemma 3.8, the bias variance decomposition of the mean square error (cf., e.g., Jentzen & von Wurstemberger [62, Lemma 2.2]), the hypothesis that for all $s\in[0,T]$ , $z\in\mathbb{R}^{d}$ it holds that $\int_{s}^{T}\mathbb{E}\!\left[|f(r,X^{0}_{s,r}(z),0)|\right]\,dr<\infty$ , Lemma 3.9, and Lemma 3.11 demonstrate that

[TABLE]

The proof of Corollary 3.13 is thus completed. ∎
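The bias-variance decomposition of the mean square error used in the proof above states that for a square integrable random variable $Z$ and a real number $a$ it holds that $\mathbb{E}[|Z-a|^{2}]=|\mathbb{E}[Z]-a|^{2}+\operatorname{Var}(Z)$. A minimal exact check for a finitely supported $Z$, with hypothetical helper names and a distribution stored as `(value, probability)` pairs, could look as follows.

```python
from fractions import Fraction


def expectation(dist, func=lambda v: v):
    """E[func(Z)] for a finitely supported distribution [(value, prob), ...]."""
    return sum(p * func(v) for v, p in dist)


def mean_square_error(dist, target):
    """E[|Z - target|^2]."""
    return expectation(dist, lambda v: (v - target) ** 2)


def bias_squared(dist, target):
    """|E[Z] - target|^2."""
    return (expectation(dist) - target) ** 2


def variance(dist):
    """Var(Z) = E[|Z - E[Z]|^2]."""
    m = expectation(dist)
    return expectation(dist, lambda v: (v - m) ** 2)


Z = [(0, Fraction(1, 2)), (2, Fraction(1, 2))]  # mean 1, variance 1
target = 3
mse = mean_square_error(Z, target)
decomposed = bias_squared(Z, target) + variance(Z)
```

In the error analysis the bias part is controlled by Lemma 3.9 and the variance part by Lemma 3.11.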

Lemma 3.14.

Let $T\in[0,\infty)$ , $q\in\mathbb{N}$ and let $U\colon[0,T]\to[0,\infty]$ be a $\mathcal{B}([0,T])/\mathcal{B}([0,\infty])$ -measurable function. Then

[TABLE]

Proof of Lemma 3.14.

Observe that Tonelli’s theorem assures that

[TABLE]

The proof of Lemma 3.14 is thus completed. ∎

Proposition 3.15.

Assume Setting 3.1, let $\xi\in\mathbb{R}^{d}$ , $C\in[0,\infty]$ satisfy that

[TABLE]

and assume for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ that $\int_{0}^{T}\left(\mathbb{E}\!\left[|u(r,X^{0}_{0,r}(\xi))|^{2}\right]\right)^{\nicefrac{{1}}{{2}}}dr+\int_{t}^{T}\mathbb{E}\!\left[|f(r,X^{0}_{t,r}(x),0)|\right]dr<\infty$ . Then it holds for all $M\in\mathbb{N}$ , $n\in\mathbb{N}_{0}$ that

[TABLE]

Proof of Proposition 3.15.

Throughout this proof assume w.l.o.g. that $C<\infty$ , let $M\in\mathbb{N}$ , let $\epsilon_{n,q}\in[0,\infty]$ , $n,q\in\mathbb{N}_{0}$ , be the extended real numbers which satisfy for all $n,q\in\mathbb{N}_{0}$ that

[TABLE]

and let $\mu_{t}\colon\mathcal{B}(\mathbb{R}^{d})\to[0,1]$ , $t\in[0,T]$ , be the probability measures which satisfy for all $t\in[0,T]$ , $B\in\mathcal{B}(\mathbb{R}^{d})$ that

[TABLE]

(cf. item (iv) in Lemma 3.6). Note that the fact that for all $x,y\in[0,\infty)$ it holds that $(x+y)^{2}\geq x^{2}+y^{2}$ assures that

[TABLE]

Next observe that items (i)–(ii) in Lemma 3.6, the hypothesis that $(X^{\theta})_{\theta\in\Theta}$ are independent, the hypothesis that $(\mathcal{R}^{\theta})_{\theta\in\Theta}$ are independent, the hypothesis that $(X^{\theta})_{\theta\in\Theta}$ and $(\mathcal{R}^{\theta})_{\theta\in\Theta}$ are independent, Tonelli’s theorem, Corollary 3.13, and Lemma 2.15 ensure that for all $n\in\mathbb{N}$ , $t\in[0,T]$ it holds that

[TABLE]

Moreover, observe that (228), (229), the fact that $X^{0}$ and $X^{1}$ are independent and continuous random fields, (154), and Lemma 2.15 imply that for all $t\in[0,T]$ it holds that

[TABLE]

In addition, note that (228), items (i)–(ii) in Lemma 3.6, the hypothesis that $(X^{\theta})_{\theta\in\Theta}$ are independent, the hypothesis that $(\mathcal{R}^{\theta})_{\theta\in\Theta}$ are independent, the hypothesis that $(X^{\theta})_{\theta\in\Theta}$ and $(\mathcal{R}^{\theta})_{\theta\in\Theta}$ are independent, (154), Lemma 2.15, and Lemma 3.5 assure that for all $n\in\mathbb{N}_{0}$ , $t\in[0,T]$ , $r\in[t,T]$ it holds that

[TABLE]

Combining this with (230) and (231) ensures that for all $n\in\mathbb{N}$ , $t\in[0,T]$ it holds that

[TABLE]

The fact that $\mathbb{P}(X^{0}_{0,0}(\xi)=\xi)=1$ , the fact that for all $n\in\mathbb{N}$ it holds that $V^{0}_{M,n}$ , $X^{0}$ , and $\mathcal{R}^{0}$ are independent, Lemma 3.5, and (226) hence imply that for all $n\in\mathbb{N}$ it holds that

[TABLE]

Moreover, observe that Lemma 3.14 (with $T=T$ , $q=q$ , $(U(r))_{r\in[0,T]}=(\mathbb{E}[|u(r,X^{0}_{0,r}(\xi))-V^{0}_{M,n}(r,X^{0}_{0,r}(\xi))|^{2}])_{r\in[0,T]}$ for $n\in\mathbb{N}_{0}$ , $q\in\mathbb{N}$ in the notation of Lemma 3.14) demonstrates that for all $n\in\mathbb{N}_{0}$ , $q\in\mathbb{N}$ it holds that

[TABLE]

This and (233) imply that for all $n,q\in\mathbb{N}$ it holds that

[TABLE]

Furthermore, note that the fact that $\int_{0}^{T}\left(\mathbb{E}\!\left[|u(r,X^{0}_{0,r}(\xi))|^{2}\right]\right)^{\nicefrac{{1}}{{2}}}dr<\infty$ and Lemma 3.3 prove that

[TABLE]

The fact that $\mathbb{P}(X^{0}_{0,0}(\xi)=\xi)=1$ and the fact that $V^{0}_{M,0}=0$ hence assure that

[TABLE]

Moreover, observe that (237) and the fact that $V^{0}_{M,0}=0$ ensure that for all $q\in\mathbb{N}$ it holds that

[TABLE]

Combining this, (234), (236), and (238) demonstrates that for all $n,q\in\mathbb{N}_{0}$ it holds that

[TABLE]

Lemma 3.12 (with $\alpha=C^{2}\exp(M)$ , $\beta=4L^{2}T^{2}$ , $M=M$ , $(\epsilon_{n,q})_{n,q\in\mathbb{N}_{0}}=(\epsilon_{n,q})_{n,q\in\mathbb{N}_{0}}$ in the notation of Lemma 3.12) therefore proves that for all $n,q\in\mathbb{N}_{0}$ it holds that

[TABLE]

This implies that for all $n\in\mathbb{N}_{0}$ it holds that

[TABLE]

The fact that for all $x,y\in[0,\infty)$ it holds that $\sqrt{x+y}\leq\sqrt{x}+\sqrt{y}$ hence demonstrates that for all $n\in\mathbb{N}_{0}$ it holds that

[TABLE]

The proof of Proposition 3.15 is thus completed. ∎

Corollary 3.16.

Assume Setting 3.1, let $\xi\in\mathbb{R}^{d}$ , $C\in[0,\infty]$ satisfy that

[TABLE]

and assume for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ that $\int_{0}^{T}\left(\mathbb{E}\!\left[|u(r,X^{0}_{0,r}(\xi))|^{2}\right]\right)^{\nicefrac{{1}}{{2}}}dr+\int_{t}^{T}\mathbb{E}\!\left[|f(r,X^{0}_{t,r}(x),0)|\right]dr<\infty$ . Then it holds for all $N\in\mathbb{N}$ that

[TABLE]

Proof of Corollary 3.16.

Proposition 3.15 establishes Corollary 3.16. The proof of Corollary 3.16 is thus completed. ∎

3.5 Complexity analysis for MLP approximation algorithms

In this subsection we consider the computational effort of the MLP scheme (cf. (156) in Setting 3.1 above) introduced in Setting 3.1 and combine it with the $L^{2}$ -error estimate in Corollary 3.16 to obtain a complexity analysis for the MLP scheme in Proposition 3.18 below. In Lemma 3.17 we think for all $M,n\in\mathbb{N}$ of $\mathcal{C}_{M,n}$ as the number of realizations of $1$ -dimensional random variables needed to simulate one realization of $V^{\theta}_{M,n}(t,x)$ for any $\theta\in\Theta$ , $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ . The recursive inequality in (246) in Lemma 3.17 is based on (156) and the assumption that the number of realizations of $1$ -dimensional random variables needed to simulate $X^{\theta}_{t,r}(x)$ for any $\theta\in\Theta$ , $t\in[0,T]$ , $r\in[t,T]$ , $x\in\mathbb{R}^{d}$ is bounded by $\alpha d$ .
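To make the cost count concrete, here is a minimal sketch of one realization of an MLP approximation. The scheme (156) is not reproduced in this text, so the recursion below is an assumption: it uses the structure visible in the proofs above (a Monte Carlo average of $g$ over $M^{n}$ samples plus, for each level $k\in\{0,1,\ldots,n-1\}$, an average over $M^{n-k}$ samples of $f$ evaluated at $V_{M,k}$ minus $\mathbbm{1}_{\mathbb{N}}(k)$ times $f$ evaluated at $V_{M,k-1}$, at a uniformly sampled time). The functions `mlp` and `brownian_flow` are hypothetical names, and the choice of a one-dimensional Brownian motion for $X$ is made only for illustration.

```python
import math
import random


def brownian_flow(t, s, x, rng):
    """Sample X_{t,s}(x) for a 1-d Brownian motion started at x at time t (illustrative)."""
    return x + math.sqrt(s - t) * rng.gauss(0.0, 1.0)


def mlp(n, M, t, x, T, f, g, sample_x, rng):
    """One realization of an (assumed) MLP approximation V_{M,n}(t, x)."""
    if n <= 0:
        return 0.0  # V_{M,0} = V_{M,-1} = 0
    # Monte Carlo average of the terminal condition over M^n samples of X_{t,T}(x).
    total = sum(g(sample_x(t, T, x, rng)) for _ in range(M ** n)) / M ** n
    # Multilevel corrections for the nonlinearity f.
    for k in range(n):
        num = M ** (n - k)
        acc = 0.0
        for _ in range(num):
            r = t + (T - t) * rng.random()  # uniformly distributed time in [t, T)
            y = sample_x(t, r, x, rng)      # sample of X_{t,r}(x)
            acc += f(r, y, mlp(k, M, r, y, T, f, g, sample_x, rng))
            if k >= 1:
                # Independent realization of the previous level.
                acc -= f(r, y, mlp(k - 1, M, r, y, T, f, g, sample_x, rng))
        total += (T - t) * acc / num
    return total


# Sanity check: with f = 0 and g = 1 every realization equals 1.
trivial = mlp(3, 2, 0.0, 0.5, 1.0, lambda r, y, v: 0.0, lambda y: 1.0,
              brownian_flow, random.Random(0))
```

Every call to `sample_x` and every draw of a uniform time consumes scalar random numbers, and every recursive call to `mlp` consumes the cost of a lower level; counting these draws is what the recursive inequality (246) formalizes.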

Lemma 3.17.

Let $d\in\mathbb{N}$ , $\alpha\in[1,\infty)$ , $(\mathcal{C}_{M,n})_{M,n\in\mathbb{Z}}\subseteq[0,\infty)$ satisfy for all $n,M\in\mathbb{N}$ that $\mathcal{C}_{M,0}=0$ and

[TABLE]

Then it holds for all $n,M\in\mathbb{N}$ that $\mathcal{C}_{M,n}\leq\alpha d\,(5M)^{n}$ .

Proof of Lemma 3.17.

First, observe that (246) and the hypothesis that for all $M\in\mathbb{N}$ it holds that $\mathcal{C}_{M,0}=0$ imply that for all $n\in\mathbb{N}$ , $M\in\mathbb{N}\cap[2,\infty)$ it holds that

[TABLE]

The discrete Gronwall inequality in Corollary 2.2 (with $N=\infty$ , $\alpha=3\alpha d+2$ , $\beta=\left(1+\tfrac{1}{M}\right)$ , $(\epsilon_{n})_{n\in\mathbb{N}_{0}}=(M^{-(n+1)}\mathcal{C}_{M,(n+1)})_{n\in\mathbb{N}_{0}}$ in the notation of Corollary 2.2) hence ensures that for all $n\in\mathbb{N}_{0}$ , $M\in\mathbb{N}\cap[2,\infty)$ it holds that

[TABLE]

This establishes that for all $n\in\mathbb{N}$ , $M\in\mathbb{N}\cap[2,\infty)$ it holds that

[TABLE]

Moreover, observe that the fact that $\mathcal{C}_{1,0}=0$ and (246) demonstrate that for all $n\in\mathbb{N}$ it holds that

[TABLE]

Hence, we obtain for all $n\in\mathbb{N}$ , $k\in\mathbb{N}\cap(0,n]$ that

[TABLE]

Combining this with the discrete Gronwall inequality in Corollary 2.2 (with $N=n-1$ , $\alpha=\alpha d+n(\alpha d+1)$ , $\beta=2$ , $(\epsilon_{k})_{k\in\mathbb{N}_{0}\cap[0,N]}=(\mathcal{C}_{1,k+1})_{k\in\mathbb{N}_{0}\cap[0,n)}$ in the notation of Corollary 2.2) proves that for all $n\in\mathbb{N}$ , $k\in\mathbb{N}_{0}\cap[0,n)$ it holds that

[TABLE]

The fact that for all $n\in\mathbb{N}$ it holds that $(1+2n)3^{n-1}\leq 5^{n}$ hence shows that for all $n\in\mathbb{N}$ it holds that

[TABLE]

Combining this with (249) completes the proof of Lemma 3.17. ∎
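The bound of Lemma 3.17 can be checked numerically for small parameters. The inequality (246) is not reproduced in this text; the sketch assumes the form $\mathcal{C}_{M,n}\leq\alpha dM^{n}+\sum_{k=0}^{n-1}M^{(n-k)}\big(\alpha d+1+\mathcal{C}_{M,k}+\mathbbm{1}_{\mathbb{N}}(k)\,\mathcal{C}_{M,k-1}\big)$, which is consistent with the constants $3\alpha d+2$, $1+\tfrac{1}{M}$, and $\alpha d+n(\alpha d+1)$ appearing in the proof. Taking this assumed recursion with equality, the code compares the resulting cost with $\alpha d\,(5M)^{n}$. The name `make_cost` and the parameter `alpha_d` (standing for the product $\alpha d$) are hypothetical.

```python
from functools import lru_cache


def make_cost(M, alpha_d):
    """Return cost(n) solving the ASSUMED cost recursion with equality."""

    @lru_cache(maxsize=None)
    def cost(n):
        if n <= 0:
            return 0  # C_{M,0} = 0
        total = alpha_d * M ** n  # M^n samples of X for the terminal condition
        for k in range(n):
            previous = cost(k - 1) if k >= 1 else 0
            # One uniform time, one sample of X, and two lower-level approximations.
            total += M ** (n - k) * (alpha_d + 1 + cost(k) + previous)
        return total

    return cost


bound_holds = all(
    make_cost(M, a)(n) <= a * (5 * M) ** n
    for M in range(1, 5)
    for n in range(1, 7)
    for a in (1, 3, 10)
)
```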

Proposition 3.18.

Assume Setting 3.1, let $\xi\in\mathbb{R}^{d}$ , $C\in[0,\infty)$ , $\alpha\in[1,\infty)$ , $(\mathcal{C}_{M,n})_{M,n\in\mathbb{Z}}\subseteq\mathbb{N}_{0}$ satisfy for all $n,M\in\mathbb{N}$ that

[TABLE]

and assume for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ that $\int_{0}^{T}\left(\mathbb{E}\!\left[|u(r,X^{0}_{0,r}(\xi))|^{2}\right]\right)^{\nicefrac{{1}}{{2}}}dr+\int_{t}^{T}\mathbb{E}\!\left[|f(r,X^{0}_{t,r}(x),0)|\right]dr<\infty$ . Then there exists a function $N\colon(0,\infty)\to\mathbb{N}$ such that for all $\varepsilon,\delta\in(0,\infty)$ it holds that

[TABLE]

Proof of Proposition 3.18.

Throughout this proof let $\kappa\in(0,\infty)$ be given by

[TABLE]

let $N\colon(0,\infty)\to\mathbb{N}$ be the function which satisfies for all $\varepsilon\in(0,\infty)$ that

[TABLE]

and let $\delta\in(0,\infty)$ . Note that (259) and Corollary 3.16 assure that for all $\varepsilon\in(0,\infty)$ it holds that

[TABLE]

Moreover, observe that (259) ensures that for all $\varepsilon\in(0,\infty)$ with $N_{\varepsilon}\geq 2$ it holds that

[TABLE]

Lemma 3.17 and (254) hence show that for all $\varepsilon\in(0,\infty)$ with $N_{\varepsilon}\geq 2$ it holds that

[TABLE]

Next note that for all $n\in\mathbb{N}\cap[2,\infty)$ it holds that

[TABLE]

Furthermore, observe that the fact that $\kappa\geq\sqrt{e}$ and the fact that $\sqrt{5e}\leq 4$ imply that for all $n\in\mathbb{N}\cap[2,\infty)$ it holds that

[TABLE]

Combining this, (263), and the fact that for all $n\in\mathbb{N}$ it holds that $n\leq(4+8LT)^{n}$ demonstrates that

[TABLE]

In addition, observe that

[TABLE]

This, (262), and (265) prove that for all $\varepsilon\in(0,\infty)$ with $N_{\varepsilon}\geq 2$ it holds that

[TABLE]

Next note that the hypothesis that $\mathcal{C}_{1,0}=0$ , (254), and the fact that $3\leq\sup_{n\in\mathbb{N}}\left[\frac{(4+8LT)^{(n+1)(3+\delta)}}{n^{\nicefrac{{(n\delta)}}{{2}}}}\right]<\infty$ assure that for all $\varepsilon\in(0,\infty)$ with $N_{\varepsilon}=1$ it holds that

[TABLE]

This and (267) demonstrate that for all $\varepsilon\in(0,\infty)$ it holds that

[TABLE]

Combining this with (260) completes the proof of Proposition 3.18. ∎

3.6 MLP approximations for semilinear partial differential equations (PDEs)

Thanks to an equivalence between semilinear Kolmogorov PDEs and stochastic fixed point equations, we can carry over the complexity analysis of Subsection 3.5 for the approximation of solutions of stochastic fixed point equations to our proposed MLP scheme for the approximation of solutions of semilinear Kolmogorov PDEs (cf. (275) in Subsection 3.6.1 below), resulting in Proposition 3.19. Considering this complexity analysis over variable dimensions shows that our proposed MLP algorithm overcomes the curse of dimensionality in the approximation of solutions of certain semilinear Kolmogorov PDEs (see Theorem 3.20 in Subsection 3.6.2 below, the main result of this paper, for details).

3.6.1 MLP approximations in fixed space dimensions

Proposition 3.19.

Let $d,m\in\mathbb{N}$ , $T\in(0,\infty)$ , $L,K,p,C_{1},C_{2},\mathfrak{C}\in[0,\infty)$ , $\alpha\in[1,\infty)$ , $\xi\in\mathbb{R}^{d}$ , $\Theta=\cup_{n=1}^{\infty}\mathbb{Z}^{n}$ , let $\left<\cdot,\cdot\right>\colon\mathbb{R}^{d}\times\mathbb{R}^{d}\to\mathbb{R}$ be the Euclidean scalar product on $\mathbb{R}^{d}$ , let $\left\|\cdot\right\|\colon\mathbb{R}^{d}\to[0,\infty)$ be the Euclidean norm on $\mathbb{R}^{d}$ , let ${\left|\kern-1.07639pt\left|\kern-1.07639pt\left|\cdot\right|\kern-1.07639pt\right|\kern-1.07639pt\right|}\colon\mathbb{R}^{d\times m}\to[0,\infty)$ be the Frobenius norm on $\mathbb{R}^{d\times m}$ , assume that

[TABLE]

let $g\in C(\mathbb{R}^{d},\mathbb{R})$ , $f\in C([0,T]\times\mathbb{R}^{d}\times\mathbb{R},\mathbb{R})$ satisfy for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ , $v,w\in\mathbb{R}$ that

[TABLE]

let $\mu\colon[0,T]\times\mathbb{R}^{d}\to\mathbb{R}^{d}$ and $\sigma\colon[0,T]\times\mathbb{R}^{d}\to\mathbb{R}^{d\times m}$ be globally Lipschitz continuous functions which satisfy for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ that

[TABLE]

let $(\Omega,\mathcal{F},\mathbb{P})$ be a complete probability space, let $\mathcal{R}^{\theta}\colon\Omega\to[0,1]$ , $\theta\in\Theta$ , be independent $\mathcal{U}_{[0,1]}$ -distributed random variables, let $R^{\theta}=(R^{\theta}_{t})_{t\in[0,T]}\colon[0,T]\times\Omega\to[0,T]$ , $\theta\in\Theta$ , be the stochastic processes which satisfy for all $t\in[0,T]$ , $\theta\in\Theta$ that

[TABLE]

let $(\mathbb{F}^{\theta}_{t})_{t\in[0,T]}$ , $\theta\in\Theta$ , be filtrations on $(\Omega,\mathcal{F},\mathbb{P})$ which satisfy the usual conditions, assume that $(\mathbb{F}^{\theta}_{T})_{\theta\in\Theta}$ is an independent family of sigma-algebras, assume that $(\mathbb{F}^{\theta}_{T})_{\theta\in\Theta}$ and $\left(\mathcal{R}^{\theta}\right)_{\theta\in\Theta}$ are independent, for every $\theta\in\Theta$ let $W^{\theta}\colon[0,T]\times\Omega\to\mathbb{R}^{m}$ be a standard $(\Omega,\mathcal{F},\mathbb{P},(\mathbb{F}^{\theta}_{t})_{t\in[0,T]})$ -Brownian motion, for every $\theta\in\Theta$ let $X^{\theta}=(X^{\theta}_{t,s}(x))_{s\in[t,T],t\in[0,T],x\in\mathbb{R}^{d}}\colon\{(t,s)\in[0,T]^{2}\colon t\leq s\}\times\mathbb{R}^{d}\times\Omega\to\mathbb{R}^{d}$ be a continuous random field which satisfies for every $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ that $(X^{\theta}_{t,s}(x))_{s\in[t,T]}\colon[t,T]\times\Omega\to\mathbb{R}^{d}$ is an $(\mathbb{F}^{\theta}_{s})_{s\in[t,T]}$ / $\mathcal{B}(\mathbb{R}^{d})$ -adapted stochastic process and which satisfies that for all $t\in[0,T]$ , $s\in[t,T]$ , $x\in\mathbb{R}^{d}$ it holds $\mathbb{P}$ -a.s. that

[TABLE]

let $V^{\theta}_{M,n}\colon[0,T]\times\mathbb{R}^{d}\times\Omega\to\mathbb{R}$ , $M,n\in\mathbb{Z}$ , $\theta\in\Theta$ , be functions which satisfy for all $M,n\in\mathbb{N}$ , $\theta\in\Theta$ , $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ that $V^{\theta}_{M,-1}(t,x)=V^{\theta}_{M,0}(t,x)=0$ and

[TABLE]

and let $(\mathcal{C}_{M,n})_{M,n\in\mathbb{Z}}\subseteq\mathbb{N}_{0}$ satisfy for all $n,M\in\mathbb{N}$ that $\mathcal{C}_{M,0}=0$ and

[TABLE]

Then

(i)

there exists a unique at most polynomially growing function $u\in C([0,T]\times\mathbb{R}^{d},\mathbb{R})$ which satisfies that $u|_{(0,T)\times\mathbb{R}^{d}}\colon(0,T)\times\mathbb{R}^{d}\to\mathbb{R}$ is a viscosity solution of

[TABLE]

for $(t,x)\in(0,T)\times\mathbb{R}^{d}$ and which satisfies for all $x\in\mathbb{R}^{d}$ that $u(T,x)=g(x)$ ,

(ii)

it holds for all $M\in\mathbb{N}$ , $n\in\mathbb{N}_{0}$ that

[TABLE]

and

(iii)

there exists a function $N\colon(0,\infty)\to\mathbb{N}$ such that for all $\varepsilon,\delta\in(0,\infty)$ it holds that

[TABLE]
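The displayed recursion defining $V^{\theta}_{M,n}$ is not reproduced in this extract. As a rough illustration only, the following Python sketch implements a multilevel Picard recursion of the kind commonly used in the MLP literature. It assumes (these are assumptions, not statements from the proposition) that the random times are $R^{\theta}_{t}=t+(T-t)\mathcal{R}^{\theta}$, that level $n$ uses $M^{n-l}$ samples for the $l$-th correction term, and that a user-supplied function `sample_x` draws $X_{t,s}(x)$. The names `mlp`, `sample_x`, and `brownian` are hypothetical.

```python
import numpy as np

def mlp(n, M, t, x, T, f, g, sample_x, rng):
    """Hypothetical MLP estimate of u(t, x) at level n; returns 0.0 for n <= 0."""
    if n <= 0:
        return 0.0
    # Plain Monte Carlo average of the terminal condition g(X_{t,T}(x)).
    total = sum(g(sample_x(t, T, x, rng)) for _ in range(M ** n)) / M ** n
    # Multilevel correction terms for the nonlinearity f (assumed form).
    for l in range(n):
        m = M ** (n - l)
        acc = 0.0
        for _ in range(m):
            r = t + (T - t) * rng.uniform()   # assumed: uniform time in [t, T]
            y = sample_x(t, r, x, rng)        # one sample of X_{t,r}(x)
            acc += f(r, y, mlp(l, M, r, y, T, f, g, sample_x, rng))
            if l >= 1:
                # Independent level-(l-1) estimate at the same (r, y).
                acc -= f(r, y, mlp(l - 1, M, r, y, T, f, g, sample_x, rng))
        total += (T - t) * acc / m
    return total

def brownian(t, s, x, rng):
    """Illustrative dynamics only: X_{t,s}(x) = x + (W_s - W_t)."""
    return x + np.sqrt(s - t) * rng.standard_normal(x.shape)
```

For instance, `mlp(3, 3, 0.0, np.zeros(10), 1.0, f, g, brownian, np.random.default_rng(0))` would return one random realization of the level-3 estimate for user-chosen `f(t, x, v)` and `g(x)`. The recursive calls make the cost grow quickly in `n`, which is what the cost quantities $\mathcal{C}_{M,n}$ are meant to track.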

Proof of Proposition 3.19.

Throughout this proof let $(\rho_{1}^{(q)})_{q\in[0,\infty)}$ , $(\rho_{2}^{(q)})_{q\in[0,\infty)}\subseteq(0,\infty)$ , $C\in[0,\infty]$ satisfy for all $q\in[0,\infty)$ that

[TABLE]

Observe that the fact that $\mu$ and $\sigma$ are globally Lipschitz continuous functions and (271) assure that there exists a unique at most polynomially growing function $u\in C([0,T]\times\mathbb{R}^{d},\mathbb{R})$ which satisfies that $u|_{(0,T)\times\mathbb{R}^{d}}\colon(0,T)\times\mathbb{R}^{d}\to\mathbb{R}$ is a viscosity solution of

[TABLE]

for $(t,x)\in(0,T)\times\mathbb{R}^{d}$ and which satisfies for all $x\in\mathbb{R}^{d}$ that $u(T,x)=g(x)$ (cf., e.g., Hairer et al. [50, Section 4]). This proves item (i). In addition, note that the fact that $\mu$ and $\sigma$ are globally Lipschitz continuous functions, (271), (274), and the Feynman-Kac formula assure that for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ it holds that

[TABLE]

(cf., e.g., Hairer et al. [50, Section 4]). Moreover, observe that the hypothesis that $\mu$ and $\sigma$ are globally Lipschitz continuous functions, the fact that for all $\theta,\vartheta\in\Theta$ with $\theta\neq\vartheta$ it holds that $\mathbb{F}^{\theta}_{T}$ and $\mathbb{F}^{\vartheta}_{T}$ are independent, (274), and Lemma 2.19 assure that for all $\theta,\vartheta\in\Theta$ , $r,s,t\in[0,T]$ , $x\in\mathbb{R}^{d}$ , $B\in\mathcal{B}(\mathbb{R}^{d})$ with $t\leq s\leq r$ and $\theta\neq\vartheta$ it holds that $\mathbb{P}(X^{\theta}_{t,t}(x)=x)=1$ and

[TABLE]

Next note that the hypothesis that $\mu$ and $\sigma$ are globally Lipschitz continuous functions, (272), (274), and Lemma 2.6 (with $d=d$ , $m=m$ , $T=T-t$ , $C_{1}=C_{1}$ , $C_{2}=C_{2}$ , $\xi=x$ , $(\mu(r,y))_{r\in[0,T],y\in\mathbb{R}^{d}}=(\mu(t+r,y))_{r\in[0,T-t],y\in\mathbb{R}^{d}}$ , $(\sigma(r,y))_{r\in[0,T],y\in\mathbb{R}^{d}}=(\sigma(t+r,y))_{r\in[0,T-t],y\in\mathbb{R}^{d}}$ , $(\Omega,\mathcal{F},\mathbb{P},(\mathbb{F}_{r})_{r\in[0,T]})=(\Omega,\mathcal{F},\mathbb{P},(\mathbb{F}_{t+r})_{r\in[0,T-t]})$ , $(W_{r})_{r\in[0,T]}=(W^{0}_{t+r}-W^{0}_{t})_{r\in[0,T-t]}$ , $(X_{r})_{r\in[0,T]}=(X^{0}_{t,t+r}(x))_{r\in[0,T-t]}$ for $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ in the notation of Lemma 2.6) assure that for all $x\in\mathbb{R}^{d}$ , $t\in[0,T]$ , $s\in[t,T]$ , $q\in[0,\infty)$ it holds that

[TABLE]

For the next step let $\mathfrak{K},\mathfrak{p}\in[0,\infty)$ satisfy for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ that

[TABLE]

This, Tonelli’s theorem, and (271) assure that for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ it holds that

[TABLE]

Moreover, observe that (271), (286), (287), and the triangle inequality demonstrate that for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ it holds that

[TABLE]

Combining this, (271), (275), (284), (285), (288), the fact that $(X^{\theta})_{\theta\in\Theta}$ are independent, and the fact that $(X^{\theta})_{\theta\in\Theta}$ and $(\mathcal{R}^{\theta})_{\theta\in\Theta}$ are independent with Proposition 3.15 (with $d=d$ , $T=T$ , $L=L$ , $u=u$ , $g=g$ , $f=f$ , $\mathcal{R}^{\theta}=\mathcal{R}^{\theta}$ , $X^{\theta}=X^{\theta}$ , $V^{\theta}_{M,n}=V^{\theta}_{M,n}$ , $\xi=\xi$ , $C=C$ for $M,n\in\mathbb{Z}$ , $\theta\in\Theta$ in the notation of Proposition 3.15) proves that for all $M\in\mathbb{N}$ , $n\in\mathbb{N}_{0}$ it holds that

[TABLE]

Next observe that (271), (286), and the triangle inequality imply that

[TABLE]

In addition note that (271), (286), and the triangle inequality imply that

[TABLE]

Combining this and (291) with (281) and (282) demonstrates that

[TABLE]

This and (290) establish item (ii). In addition, observe that (271), (275), (276), (284), (285), (288), (289), the fact that $(X^{\theta})_{\theta\in\Theta}$ are independent, the fact that $(X^{\theta})_{\theta\in\Theta}$ and $(\mathcal{R}^{\theta})_{\theta\in\Theta}$ are independent, (293), and Proposition 3.18 (with $d=d$ , $T=T$ , $L=L$ , $u=u$ , $g=g$ , $f=f$ , $X^{\theta}=X^{\theta}$ , $V^{\theta}_{M,n}=V^{\theta}_{M,n}$ , $\xi=\xi$ , $C=C$ , $\alpha=\alpha$ , $\mathcal{C}_{M,n}=\mathcal{C}_{M,n}$ for $M,n\in\mathbb{Z}$ , $\theta\in\Theta$ in the notation of Proposition 3.18) prove that there exists a function $N\colon(0,\infty)\to\mathbb{N}$ such that for all $\varepsilon,\delta\in(0,\infty)$ it holds that

[TABLE]

This establishes item (iii). The proof of Proposition 3.19 is thus completed. ∎

3.6.2 MLP approximations in variable space dimensions

Theorem 3.20.

Let $T\in(0,\infty)$ , $\alpha,c,K\in[1,\infty)$ , $L,p,P,\mathfrak{P},q,C_{1},C_{2}\in[0,\infty)$ , for every $d\in\mathbb{N}$ let $\left\|\cdot\right\|_{\mathbb{R}^{d}}\colon\mathbb{R}^{d}\to[0,\infty)$ be the Euclidean norm on $\mathbb{R}^{d}$ , let $\left<\cdot,\cdot\right>_{\mathbb{R}^{d}}\colon\mathbb{R}^{d}\times\mathbb{R}^{d}\to\mathbb{R}$ be the Euclidean scalar product on $\mathbb{R}^{d}$ , and let ${\left|\kern-1.07639pt\left|\kern-1.07639pt\left|\cdot\right|\kern-1.07639pt\right|\kern-1.07639pt\right|}_{d}\colon\mathbb{R}^{d\times d}\to[0,\infty)$ be the Frobenius norm on $\mathbb{R}^{d\times d}$ , for every $d\in\mathbb{N}$ let $\xi_{d}\in\mathbb{R}^{d}$ satisfy that $\left\|\xi_{d}\right\|_{\mathbb{R}^{d}}\leq cd^{q}$ , for every $d\in\mathbb{N}$ let $g_{d}\in C(\mathbb{R}^{d},\mathbb{R})$ , $f_{d}\in C([0,T]\times\mathbb{R}^{d}\times\mathbb{R},\mathbb{R})$ satisfy for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ , $v,w\in\mathbb{R}$ that

[TABLE]

for every $d\in\mathbb{N}$ let $\mu_{d}\colon[0,T]\times\mathbb{R}^{d}\to\mathbb{R}^{d}$ and $\sigma_{d}\colon[0,T]\times\mathbb{R}^{d}\to\mathbb{R}^{d\times d}$ be globally Lipschitz continuous functions which satisfy for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ that

[TABLE]

let $(\Omega,\mathcal{F},\mathbb{P})$ be a complete probability space, let $\mathcal{R}^{\theta}\colon\Omega\to[0,1]$ , $\theta\in\Theta$ , be independent $\mathcal{U}_{[0,1]}$ -distributed random variables, let $R^{\theta}=(R^{\theta}_{t})_{t\in[0,T]}\colon[0,T]\times\Omega\to[0,T]$ , $\theta\in\Theta$ , be the stochastic processes which satisfy for all $t\in[0,T]$ , $\theta\in\Theta$ that

[TABLE]

let $(\mathbb{F}^{d,\theta}_{t})_{t\in[0,T]}$ , $d\in\mathbb{N}$ , $\theta\in\Theta$ , be filtrations on $(\Omega,\mathcal{F},\mathbb{P})$ which satisfy the usual conditions, assume for every $d\in\mathbb{N}$ that $(\mathbb{F}^{d,\theta}_{T})_{\theta\in\Theta}$ is an independent family of sigma-algebras, assume that $(\mathbb{F}^{d,\theta}_{T})_{d\in\mathbb{N},\theta\in\Theta}$ and $\left(\mathcal{R}^{\theta}\right)_{\theta\in\Theta}$ are independent, for every $d\in\mathbb{N}$ , $\theta\in\Theta$ let $W^{d,\theta}\colon[0,T]\times\Omega\to\mathbb{R}^{d}$ be a standard $(\Omega,\mathcal{F},\mathbb{P},(\mathbb{F}^{d,\theta}_{t})_{t\in[0,T]})$ -Brownian motion, for every $d\in\mathbb{N}$ , $\theta\in\Theta$ let $X^{d,\theta}=(X^{d,\theta}_{t,s}(x))_{s\in[t,T],t\in[0,T],x\in\mathbb{R}^{d}}\colon\{(t,s)\in[0,T]^{2}\colon t\leq s\}\times\mathbb{R}^{d}\times\Omega\to\mathbb{R}^{d}$ be a continuous random field which satisfies for every $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ that $(X^{d,\theta}_{t,s}(x))_{s\in[t,T]}\colon[t,T]\times\Omega\to\mathbb{R}^{d}$ is an $(\mathbb{F}^{d,\theta}_{s})_{s\in[t,T]}$ / $\mathcal{B}(\mathbb{R}^{d})$ -adapted stochastic process and which satisfies that for all $t\in[0,T]$ , $s\in[t,T]$ , $x\in\mathbb{R}^{d}$ it holds $\mathbb{P}$ -a.s. that

[TABLE]

let $V^{d,\theta}_{M,n}\colon[0,T]\times\mathbb{R}^{d}\times\Omega\to\mathbb{R}$ , $M,n\in\mathbb{Z}$ , $\theta\in\Theta$ , $d\in\mathbb{N}$ , be functions which satisfy for all $d,M,n\in\mathbb{N}$ , $\theta\in\Theta$ , $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ that $V^{d,\theta}_{M,-1}(t,x)=V^{d,\theta}_{M,0}(t,x)=0$ and

[TABLE]

and let $(\mathcal{C}_{d,M,n})_{M,n\in\mathbb{Z},d\in\mathbb{N}}\subseteq\mathbb{N}_{0}$ satisfy for all $d,n,M\in\mathbb{N}$ that $\mathcal{C}_{d,M,0}=0$ and

[TABLE]

Then

(i)

for every $d\in\mathbb{N}$ there exists a unique at most polynomially growing function $u_{d}\in C([0,T]\times\mathbb{R}^{d},\mathbb{R})$ which satisfies that $u_{d}|_{(0,T)\times\mathbb{R}^{d}}\colon(0,T)\times\mathbb{R}^{d}\to\mathbb{R}$ is a viscosity solution of

[TABLE]

for $(t,x)\in(0,T)\times\mathbb{R}^{d}$ and which satisfies for all $x\in\mathbb{R}^{d}$ that $u_{d}(T,x)=g_{d}(x)$ and

(ii)

there exists a function $N=(N_{d,\varepsilon})_{d\in\mathbb{N},\varepsilon\in(0,\infty)}\colon\mathbb{N}\times(0,\infty)\to\mathbb{N}$ such that for all $d\in\mathbb{N}$ , $\varepsilon,\delta\in(0,\infty)$ it holds that

[TABLE]

Proof of Theorem 3.20.

Throughout this proof let $(\beta_{\delta})_{\delta\in(0,\infty)}\subseteq(0,\infty)$ , $(\mathfrak{C}_{d})_{d\in\mathbb{N}}\subseteq[0,\infty)$ satisfy for all $\delta\in(0,\infty)$ , $d\in\mathbb{N}$ that $\beta_{\delta}=\left[\sup_{n\in\mathbb{N}}\tfrac{(4+8LT)^{(3+\delta)(n+1)}}{n^{(n\delta/2)}}\right]$ and

[TABLE]

Observe that Proposition 3.19 (with $d=d$ , $m=d$ , $T=T$ , $L=L$ , $K=Kd^{\mathfrak{P}}$ , $p=p$ , $C_{1}=C_{1}d^{P}$ , $C_{2}=C_{2}$ , $\alpha=\alpha$ , $\xi=\xi_{d}$ , $g=g_{d}$ , $f=f_{d}$ , $\mu=\mu_{d}$ , $\sigma=\sigma_{d}$ , $\mathcal{R}^{\theta}=\mathcal{R}^{\theta}$ , $\mathbb{F}^{\theta}=\mathbb{F}^{d,\theta}$ , $W^{\theta}=W^{d,\theta}$ , $X^{\theta}=X^{d,\theta}$ , $V^{\theta}_{M,n}=V^{d,\theta}_{M,n}$ , $\mathcal{C}_{M,n}=\mathcal{C}_{d,M,n}$ for $d\in\mathbb{N}$ , $M,n\in\mathbb{Z}$ , $\theta\in\Theta$ in the notation of Proposition 3.19) proves that for every $d\in\mathbb{N}$

(I)

there exists a unique at most polynomially growing function $u_{d}\in C([0,T]\times\mathbb{R}^{d},\mathbb{R})$ which satisfies that $u_{d}|_{(0,T)\times\mathbb{R}^{d}}\colon(0,T)\times\mathbb{R}^{d}\to\mathbb{R}$ is a viscosity solution of

[TABLE]

for $(t,x)\in(0,T)\times\mathbb{R}^{d}$ and which satisfies for all $x\in\mathbb{R}^{d}$ that $u_{d}(T,x)=g_{d}(x)$ and

(II)

there exists a function $N_{d}=(N_{d,\varepsilon})_{\varepsilon\in(0,\infty)}\colon(0,\infty)\to\mathbb{N}$ such that for all $\varepsilon,\delta\in(0,\infty)$ it holds that

[TABLE]

Observe that item (I) proves item (i). Moreover, note that the hypothesis that for all $d\in\mathbb{N}$ it holds that $\left\|\xi_{d}\right\|_{\mathbb{R}^{d}}\leq cd^{q}$ and the fact that $(2p+1)\leq 4^{p+1}$ imply that for all $d\in\mathbb{N}$ it holds that

[TABLE]

This and (308) demonstrate that for all $d\in\mathbb{N}$ , $\delta,\varepsilon\in(0,\infty)$ it holds that

[TABLE]

Combining this and (307) establishes item (ii). The proof of Theorem 3.20 is thus completed. ∎
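The cost recursion for $\mathcal{C}_{d,M,n}$ is not reproduced in this extract. To give a feeling for how such a recursion behaves, the sketch below assumes a form that is common in the MLP literature, namely $\mathcal{C}_{d,M,n}\leq\alpha dM^{n}+\sum_{l=0}^{n-1}M^{n-l}(\alpha d+\mathcal{C}_{d,M,l}+\mathcal{C}_{d,M,l-1})$, and evaluates it with equality (the worst case). Both this form and the function name `mlp_cost` are assumptions made for illustration and may differ from the display in the theorem.

```python
from functools import lru_cache

def mlp_cost(d, M, n, alpha=1.0):
    """Worst-case cost under the ASSUMED recursion, taken with equality."""
    @lru_cache(maxsize=None)
    def c(k):
        if k <= 0:
            return 0.0  # levels 0 and -1 cost nothing
        # alpha*d per sample of X, plus the cost of the nested estimates.
        return alpha * d * M ** k + sum(
            M ** (k - l) * (alpha * d + c(l) + c(l - 1)) for l in range(k)
        )
    return c(n)
```

Under this assumed recursion the cost is proportional to $d$ for fixed $M$ and $n$, and an induction on $n$ gives the bound $\alpha d(5M)^{n}$. The dimension therefore enters the cost only as a linear factor, while the remaining dimension dependence in a complexity estimate comes from how large $n$ must be chosen for a given accuracy.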

4 MLP approximations for PDE models

The MLP scheme for semilinear Kolmogorov PDEs (cf. (300) in Theorem 3.20 above) proposed in Subsection 3.6 can only be implemented for semilinear Kolmogorov PDEs for which an explicit solution of the corresponding SDE is known. In this section, we consider the MLP algorithm for two examples of such semilinear Kolmogorov PDEs: semilinear heat equations (see Subsection 4.1 below) and semilinear Black-Scholes equations (see Subsections 4.2–4.3 below). Apart from specifying the linear part of the PDE, we also choose a particular nonlinearity (cf. (357) in Corollary 4.5 below) in Subsection 4.3 to obtain a PDE that is used in the pricing of financial derivatives with default risk (cf., e.g., Han et al. [51, (10)] and Duffie et al. [33]).

4.1 MLP approximations for semilinear heat equations

Theorem 4.1.

Let $T\in(0,\infty)$ , $\kappa,p,\mathfrak{P},q\in[0,\infty)$ , $\Theta=\cup_{n=1}^{\infty}\mathbb{Z}^{n}$ , for every $d\in\mathbb{N}$ let $\left\|\cdot\right\|_{\mathbb{R}^{d}}\colon\mathbb{R}^{d}\to[0,\infty)$ be the Euclidean norm on $\mathbb{R}^{d}$ , for every $d\in\mathbb{N}$ let $\xi_{d}\in\mathbb{R}^{d}$ satisfy that $\left\|\xi_{d}\right\|_{\mathbb{R}^{d}}\leq\kappa d^{q}$ , for every $d\in\mathbb{N}$ let $g_{d}\in C(\mathbb{R}^{d},\mathbb{R})$ , $f_{d}\in C([0,T]\times\mathbb{R}^{d}\times\mathbb{R},\mathbb{R})$ satisfy for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ , $v,w\in\mathbb{R}$ that

[TABLE]

let $(\Omega,\mathcal{F},\mathbb{P})$ be a probability space, let $\mathcal{R}^{\theta}\colon\Omega\to[0,1]$ , $\theta\in\Theta$ , be independent $\mathcal{U}_{[0,1]}$ -distributed random variables, let $R^{\theta}=(R^{\theta}_{t})_{t\in[0,T]}\colon[0,T]\times\Omega\to[0,T]$ , $\theta\in\Theta$ , be the stochastic processes which satisfy for all $t\in[0,T]$ , $\theta\in\Theta$ that

[TABLE]

for every $d\in\mathbb{N}$ let $W^{d,\theta}\colon[0,T]\times\Omega\to\mathbb{R}^{d}$ , $\theta\in\Theta$ , be independent standard Brownian motions, assume that $\left(W^{d,\theta}\right)_{d\in\mathbb{N},\theta\in\Theta}$ and $\left(\mathcal{R}^{\theta}\right)_{\theta\in\Theta}$ are independent, for every $d\in\mathbb{N}$ , $\theta\in\Theta$ let $X^{d,\theta}=(X^{d,\theta}_{t,s}(x))_{s\in[t,T],t\in[0,T],x\in\mathbb{R}^{d}}\colon\{(t,s)\in[0,T]^{2}\colon t\leq s\}\times\mathbb{R}^{d}\times\Omega\to\mathbb{R}^{d}$ be the function which satisfies for all $t\in[0,T]$ , $s\in[t,T]$ , $x\in\mathbb{R}^{d}$ that

[TABLE]

let $V^{d,\theta}_{M,n}\colon[0,T]\times\mathbb{R}^{d}\times\Omega\to\mathbb{R}$ , $M,n\in\mathbb{Z}$ , $\theta\in\Theta$ , $d\in\mathbb{N}$ , be functions which satisfy for all $d,M,n\in\mathbb{N}$ , $\theta\in\Theta$ , $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ that $V^{d,\theta}_{M,-1}(t,x)=V^{d,\theta}_{M,0}(t,x)=0$ and

[TABLE]

and let $(\mathcal{C}_{d,M,n})_{M,n\in\mathbb{Z},d\in\mathbb{N}}\subseteq\mathbb{N}_{0}$ satisfy for all $d,n,M\in\mathbb{N}$ that $\mathcal{C}_{d,M,0}=0$ and

[TABLE]

Then

(i)

for every $d\in\mathbb{N}$ there exists a unique at most polynomially growing function $u_{d}\in C([0,T]\times\mathbb{R}^{d},\mathbb{R})$ which satisfies that $u_{d}|_{(0,T)\times\mathbb{R}^{d}}\colon(0,T)\times\mathbb{R}^{d}\to\mathbb{R}$ is a viscosity solution of

[TABLE]

for $(t,x)\in(0,T)\times\mathbb{R}^{d}$ and which satisfies for all $x\in\mathbb{R}^{d}$ that $u_{d}(T,x)=g_{d}(x)$ and

(ii)

there exist functions $N=(N_{d,\varepsilon})_{d\in\mathbb{N},\varepsilon\in(0,\infty)}\colon\mathbb{N}\times(0,\infty)\to\mathbb{N}$ and $C=(C_{\delta})_{\delta\in(0,\infty)}\colon(0,\infty)\to(0,\infty)$ such that for all $d\in\mathbb{N}$ , $\varepsilon,\delta\in(0,\infty)$ it holds that

[TABLE]

Proof of Theorem 4.1.

Throughout this proof assume w.l.o.g. that $\kappa\geq 1$ , assume w.l.o.g. that $(\Omega,\mathcal{F},\mathbb{P})$ is a complete probability space, for every $d\in\mathbb{N}$ let $\left<\cdot,\cdot\right>_{\mathbb{R}^{d}}\colon\mathbb{R}^{d}\times\mathbb{R}^{d}\to\mathbb{R}$ be the Euclidean scalar product on $\mathbb{R}^{d}$ and let ${\left|\kern-1.07639pt\left|\kern-1.07639pt\left|\cdot\right|\kern-1.07639pt\right|\kern-1.07639pt\right|}_{d}\colon\mathbb{R}^{d\times d}\to[0,\infty)$ be the Frobenius norm on $\mathbb{R}^{d\times d}$ , let $\mu_{d}\in C([0,T]\times\mathbb{R}^{d},\mathbb{R}^{d})$ , $d\in\mathbb{N}$ , and $\sigma_{d}\in C([0,T]\times\mathbb{R}^{d},\mathbb{R}^{d\times d})$ , $d\in\mathbb{N}$ , satisfy for all $d\in\mathbb{N}$ , $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ that

[TABLE]

and for every $d\in\mathbb{N}$ , $\theta\in\Theta$ , $t\in[0,T]$ let $\mathbb{F}^{d,\theta}_{t}\subseteq\mathcal{F}$ be the sigma-algebra which satisfies that

[TABLE]

Note that (320) implies that for every $d\in\mathbb{N}$ , $\theta\in\Theta$ it holds that $(\mathbb{F}^{d,\theta}_{t})_{t\in[0,T]}$ is a filtration on $(\Omega,\mathcal{F},\mathbb{P})$ which satisfies the usual conditions. Moreover, observe that (320) and Lemma 2.17 demonstrate that for every $d\in\mathbb{N}$ , $\theta\in\Theta$ it holds that $W^{d,\theta}\colon[0,T]\times\Omega\to\mathbb{R}^{d}$ is a standard $(\Omega,\mathcal{F},\mathbb{P},(\mathbb{F}^{d,\theta}_{t})_{t\in[0,T]})$ -Brownian motion. Next note that (313) and (319) assure that for every $d\in\mathbb{N}$ , $\theta\in\Theta$ it holds that $X^{d,\theta}$ is a continuous random field which satisfies for every $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ that $(X^{d,\theta}_{t,s}(x))_{s\in[t,T]}\colon[t,T]\times\Omega\to\mathbb{R}^{d}$ is an $(\mathbb{F}^{d,\theta}_{s})_{s\in[t,T]}$ / $\mathcal{B}(\mathbb{R}^{d})$ -adapted stochastic process and which satisfies that for all $t\in[0,T]$ , $s\in[t,T]$ , $x\in\mathbb{R}^{d}$ it holds $\mathbb{P}$ -a.s. that

[TABLE]

In addition, note that for all $d\in\mathbb{N}$ , $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ it holds that

[TABLE]

This, (311), (312), (314), (315), (319), (321), and Theorem 3.20 (with $T=T$ , $\alpha=1$ , $c=\kappa$ , $K=\kappa$ , $L=\kappa$ , $p=p$ , $P=1$ , $\mathfrak{P}=\mathfrak{P}$ , $q=q$ , $C_{1}=1$ , $C_{2}=0$ , $\xi_{d}=\xi_{d}$ , $g_{d}=g_{d}$ , $f_{d}=f_{d}$ , $\mu_{d}=\mu_{d}$ , $\sigma_{d}=\sigma_{d}$ , $\mathcal{R}^{\theta}=\mathcal{R}^{\theta}$ , $\mathbb{F}^{d,\theta}=\mathbb{F}^{d,\theta}$ , $W^{d,\theta}=W^{d,\theta}$ , $X^{d,\theta}=X^{d,\theta}$ , $V^{d,\theta}_{M,n}=V^{d,\theta}_{M,n}$ , $\mathcal{C}_{d,M,n}=\mathcal{C}_{d,M,n}$ for $d\in\mathbb{N}$ , $M,n\in\mathbb{Z}$ , $\theta\in\Theta$ in the notation of Theorem 3.20) establish that

(I)

for every $d\in\mathbb{N}$ there exists a unique at most polynomially growing function $u_{d}\in C([0,T]\times\mathbb{R}^{d},\mathbb{R})$ which satisfies that $u_{d}|_{(0,T)\times\mathbb{R}^{d}}\colon(0,T)\times\mathbb{R}^{d}\to\mathbb{R}$ is a viscosity solution of

[TABLE]

for $(t,x)\in(0,T)\times\mathbb{R}^{d}$ and which satisfies for all $x\in\mathbb{R}^{d}$ that $u_{d}(T,x)=g_{d}(x)$ and

(II)

there exists a function $N=(N_{d,\varepsilon})_{d\in\mathbb{N},\varepsilon\in(0,\infty)}\colon\mathbb{N}\times(0,\infty)\to\mathbb{N}$ such that for all $d\in\mathbb{N}$ , $\varepsilon,\delta\in(0,\infty)$ it holds that

[TABLE]

Note that item (I) establishes item (i). Moreover, observe that item (II) establishes item (ii). The proof of Theorem 4.1 is thus completed. ∎

4.2 MLP approximations for semilinear Black-Scholes equations

Lemma 4.2.

Let $d\in\mathbb{N}$ , $T\in(0,\infty)$ , $(\alpha_{i})_{i\in\{1,2,\ldots,d\}}$ , $(\beta_{i})_{i\in\{1,2,\ldots,d\}}\subseteq\mathbb{R}$ , let $\left<\cdot,\cdot\right>\colon\mathbb{R}^{d}\times\mathbb{R}^{d}\to\mathbb{R}$ be the Euclidean scalar product on $\mathbb{R}^{d}$ , let $\Sigma=(\zeta_{1},\ldots,\zeta_{d})\in\mathbb{R}^{d\times d}$ satisfy for all $i\in\{1,2,\ldots,d\}$ that $\left<\zeta_{i},\zeta_{i}\right>=1$ , let $(\Omega,\mathcal{F},\mathbb{P})$ be a complete probability space, let $W\colon[0,T]\times\Omega\to\mathbb{R}^{d}$ be a $d$ -dimensional standard Brownian motion, and let $X=(X^{(i)}_{t,s}(x))_{s\in[t,T],t\in[0,T],x\in\mathbb{R}^{d},i\in\{1,2,\ldots,d\}}\colon\{(t,s)\in[0,T]^{2}\colon t\leq s\}\times\mathbb{R}^{d}\times\Omega\to\mathbb{R}^{d}$ be the function which satisfies for all $i\in\{1,2,\ldots,d\}$ , $t\in[0,T]$ , $s\in[t,T]$ , $x=(x_{1},x_{2},\ldots,x_{d})\in\mathbb{R}^{d}$ that

[TABLE]

Then it holds that $X$ is a continuous random field which satisfies that for all $t\in[0,T]$ , $s\in[t,T]$ , $x\in\mathbb{R}^{d}$ it holds $\mathbb{P}$ -a.s. that

[TABLE]

Proof of Lemma 4.2.

Throughout this proof let $t\in[0,T]$ , $s\in[t,T]$ , $x=(x_{1},x_{2},\ldots,x_{d})\in\mathbb{R}^{d}$ , let $f_{i}\colon[0,T]\times\mathbb{R}^{d}\to\mathbb{R}$ , $i\in\{1,2,\ldots,d\}$ , be the functions which satisfy for all $i\in\{1,2,\ldots,d\}$ , $r\in[0,T]$ , $w\in\mathbb{R}^{d}$ that

[TABLE]

let $B=(B^{(i)})_{i\in\{1,2,\ldots,d\}}\colon[0,s-t]\times\Omega\to\mathbb{R}^{d}$ satisfy for all $r\in[0,s-t]$ that $B_{r}=W_{t+r}-W_{t}$ , and let $\zeta_{i}^{(j)}\in\mathbb{R}$ , $i,j\in\{1,2,\ldots,d\}$ , satisfy for all $i\in\{1,2,\ldots,d\}$ that $\zeta_{i}=(\zeta_{i}^{(j)})_{j\in\{1,2,\ldots,d\}}$ . Observe that Itô’s formula (cf., e.g., Karatzas & Shreve [64, Theorem 3.3.6]) assures that for all $i\in\{1,2,\ldots,d\}$ it holds $\mathbb{P}$ -a.s. that

[TABLE]

The fact that for all $i\in\{1,2,\ldots,d\}$ it holds that $\sum_{j=1}^{d}\big{|}\zeta_{i}^{(j)}\big{|}^{2}=\left<\zeta_{i},\zeta_{i}\right>=1$ and the fact that for all $i\in\{1,2,\ldots,d\}$ , $r\in[0,s-t]$ it holds that $f_{i}(r,B_{r})=X^{(i)}_{t,t+r}(x)$ hence assure that for all $i\in\{1,2,\ldots,d\}$ it holds $\mathbb{P}$ -a.s. that

[TABLE]

This implies (327). The proof of Lemma 4.2 is thus completed. ∎
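The explicit formula for $X$ in Lemma 4.2 is not reproduced in this extract. The sketch below assumes it is the usual geometric Brownian motion solution $X^{(i)}_{t,s}(x)=x_{i}\exp\big((\alpha_{i}-\tfrac{|\beta_{i}|^{2}}{2})(s-t)+\beta_{i}\left<\zeta_{i},W_{s}-W_{t}\right>\big)$, which is consistent with the use of Itô's formula and of $\left<\zeta_{i},\zeta_{i}\right>=1$ in the proof above. It further assumes that $\zeta_{i}$ is the $i$-th column of $\Sigma$. The function name `gbm_flow` is hypothetical.

```python
import numpy as np

def gbm_flow(x, alpha, beta, Sigma, dt, dW):
    """ASSUMED closed form: x_i * exp((alpha_i - beta_i^2 / 2) * dt + beta_i * <zeta_i, dW>).

    x, alpha, beta, dW are arrays of shape (d,); Sigma has shape (d, d) and
    its i-th column is taken to be zeta_i (an assumption); dt = s - t >= 0.
    """
    drift = (alpha - 0.5 * beta ** 2) * dt
    noise = beta * (Sigma.T @ dW)  # component i equals beta_i * <zeta_i, dW>
    return x * np.exp(drift + noise)
```

A Brownian increment can be drawn as `dW = np.sqrt(dt) * rng.standard_normal(d)` with `rng = np.random.default_rng()`. Since no time discretization is involved, such a sampler is exact in distribution, which is what makes the MLP scheme directly implementable for this class of SDEs.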

Theorem 4.3.

Let $T\in(0,\infty)$ , $\kappa,p,\mathfrak{P},q\in[0,\infty)$ , $(\alpha_{d,i})_{i\in\{1,2,\ldots,d\},d\in\mathbb{N}}$ , $(\beta_{d,i})_{i\in\{1,2,\ldots,d\},d\in\mathbb{N}}\subseteq\mathbb{R}$ , $\Theta=\cup_{n=1}^{\infty}\mathbb{Z}^{n}$ satisfy that $\sup_{d\in\mathbb{N},i\in\{1,2,\ldots,d\}}\max\{|\alpha_{d,i}|,|\beta_{d,i}|^{2}\}\leq\kappa$ , for every $d\in\mathbb{N}$ let $\left<\cdot,\cdot\right>_{\mathbb{R}^{d}}\colon\mathbb{R}^{d}\times\mathbb{R}^{d}\to\mathbb{R}$ be the Euclidean scalar product on $\mathbb{R}^{d}$ and let $\left\|\cdot\right\|_{\mathbb{R}^{d}}\colon\mathbb{R}^{d}\to[0,\infty)$ be the Euclidean norm on $\mathbb{R}^{d}$ , for every $d\in\mathbb{N}$ let $\xi_{d}\in\mathbb{R}^{d}$ , $\Sigma_{d}=(\zeta_{d,1},\ldots,\zeta_{d,d})\in\mathbb{R}^{d\times d}$ satisfy for all $i\in\{1,2,\ldots,d\}$ that $\left\|\xi_{d}\right\|_{\mathbb{R}^{d}}\leq\kappa d^{q}$ and $\|\zeta_{d,i}\|_{\mathbb{R}^{d}}=1$ , for every $d\in\mathbb{N}$ let $\mu_{d}\colon[0,T]\times\mathbb{R}^{d}\to\mathbb{R}^{d}$ and $\sigma_{d}\colon[0,T]\times\mathbb{R}^{d}\to\mathbb{R}^{d\times d}$ be the functions which satisfy for all $t\in[0,T]$ , $x=(x_{1},x_{2},\ldots,x_{d})\in\mathbb{R}^{d}$ that

[TABLE]

for every $d\in\mathbb{N}$ let $g_{d}\in C(\mathbb{R}^{d},\mathbb{R})$ , $f_{d}\in C([0,T]\times\mathbb{R}^{d}\times\mathbb{R},\mathbb{R})$ satisfy for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ , $v,w\in\mathbb{R}$ that

[TABLE]

let $(\Omega,\mathcal{F},\mathbb{P})$ be a probability space, let $\mathcal{R}^{\theta}\colon\Omega\to[0,1]$ , $\theta\in\Theta$ , be independent $\mathcal{U}_{[0,1]}$ -distributed random variables, let $R^{\theta}=(R^{\theta}_{t})_{t\in[0,T]}\colon[0,T]\times\Omega\to[0,T]$ , $\theta\in\Theta$ , be the stochastic processes which satisfy for all $t\in[0,T]$ , $\theta\in\Theta$ that

[TABLE]

for every $d\in\mathbb{N}$ let $W^{d,\theta}\colon[0,T]\times\Omega\to\mathbb{R}^{d}$ , $\theta\in\Theta$ , be independent standard Brownian motions, assume that $\left(W^{d,\theta}\right)_{d\in\mathbb{N},\theta\in\Theta}$ and $\left(\mathcal{R}^{\theta}\right)_{\theta\in\Theta}$ are independent, for every $d\in\mathbb{N}$ , $\theta\in\Theta$ let $X^{d,\theta}=(X^{d,\theta,i}_{t,s}(x))_{s\in[t,T],t\in[0,T],x\in\mathbb{R}^{d},i\in\{1,2,\ldots,d\}}\colon\{(t,s)\in[0,T]^{2}\colon t\leq s\}\times\mathbb{R}^{d}\times\Omega\to\mathbb{R}^{d}$ be the function which satisfies for all $t\in[0,T]$ , $s\in[t,T]$ , $x=(x_{1},x_{2},\ldots,x_{d})\in\mathbb{R}^{d}$ , $i\in\{1,2,\ldots,d\}$ that

[TABLE]

let $V^{d,\theta}_{M,n}\colon[0,T]\times\mathbb{R}^{d}\times\Omega\to\mathbb{R}$ , $M,n\in\mathbb{Z}$ , $\theta\in\Theta$ , $d\in\mathbb{N}$ , be functions which satisfy for all $d,M,n\in\mathbb{N}$ , $\theta\in\Theta$ , $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ that $V^{d,\theta}_{M,-1}(t,x)=V^{d,\theta}_{M,0}(t,x)=0$ and

[TABLE]

and let $(\mathcal{C}_{d,M,n})_{M,n\in\mathbb{Z},d\in\mathbb{N}}\subseteq\mathbb{N}_{0}$ satisfy for all $d,n,M\in\mathbb{N}$ that $\mathcal{C}_{d,M,0}=0$ and

[TABLE]

Then

(i)

for every $d\in\mathbb{N}$ there exists a unique at most polynomially growing function $u_{d}\in C([0,T]\times\mathbb{R}^{d},\mathbb{R})$ which satisfies that $u_{d}|_{(0,T)\times\mathbb{R}^{d}}\colon(0,T)\times\mathbb{R}^{d}\to\mathbb{R}$ is a viscosity solution of

[TABLE]

for $(t,x)\in(0,T)\times\mathbb{R}^{d}$ and which satisfies for all $x\in\mathbb{R}^{d}$ that $u_{d}(T,x)=g_{d}(x)$ and

(ii)

there exist functions $N=(N_{d,\varepsilon})_{d\in\mathbb{N},\varepsilon\in(0,\infty)}\colon\mathbb{N}\times(0,\infty)\to\mathbb{N}$ and $C=(C_{\delta})_{\delta\in(0,\infty)}\colon(0,\infty)\to(0,\infty)$ such that for all $d\in\mathbb{N}$ , $\varepsilon,\delta\in(0,\infty)$ it holds that

[TABLE]

Proof of Theorem 4.3.

Throughout this proof assume w.l.o.g. that $\kappa\geq 1$ , assume w.l.o.g. that $(\Omega,\mathcal{F},\mathbb{P})$ is a complete probability space, for every $d\in\mathbb{N}$ let ${\left|\kern-1.07639pt\left|\kern-1.07639pt\left|\cdot\right|\kern-1.07639pt\right|\kern-1.07639pt\right|}_{d}\colon\mathbb{R}^{d\times d}\to[0,\infty)$ be the Frobenius norm on $\mathbb{R}^{d\times d}$ , for every $d\in\mathbb{N}$ , $i\in\{1,2,\ldots,d\}$ let $\zeta_{d,i}^{(j)}\in\mathbb{R}$ , $j\in\{1,2,\ldots,d\}$ , satisfy that $\zeta_{d,i}=(\zeta_{d,i}^{(j)})_{j\in\{1,2,\ldots,d\}}$ , and for every $d\in\mathbb{N}$ , $\theta\in\Theta$ , $t\in[0,T]$ let $\mathbb{F}^{d,\theta}_{t}\subseteq\mathcal{F}$ be the sigma-algebra which satisfies that

[TABLE]

Note that (340) implies that for every $d\in\mathbb{N}$ , $\theta\in\Theta$ it holds that $(\mathbb{F}^{d,\theta}_{t})_{t\in[0,T]}$ is a filtration on $(\Omega,\mathcal{F},\mathbb{P})$ which satisfies the usual conditions. Moreover, observe that (340) and Lemma 2.17 demonstrate that for every $d\in\mathbb{N}$ , $\theta\in\Theta$ it holds that $W^{d,\theta}\colon[0,T]\times\Omega\to\mathbb{R}^{d}$ is a standard $(\Omega,\mathcal{F},\mathbb{P},(\mathbb{F}^{d,\theta}_{t})_{t\in[0,T]})$ -Brownian motion. In addition, note that (331) and the fact that $\sup_{d\in\mathbb{N},i\in\{1,2,\ldots,d\}}|\alpha_{d,i}|\leq\kappa$ imply that for all $d\in\mathbb{N}$ , $t\in[0,T]$ , $x=(x_{1},x_{2},\ldots,x_{d})\in\mathbb{R}^{d}$ it holds that

[TABLE]

Furthermore, observe that (331), the fact that $\sup_{d\in\mathbb{N},i\in\{1,2,\ldots,d\}}|\beta_{d,i}|^{2}\leq\kappa$ , and the hypothesis that for all $d\in\mathbb{N}$ , $i\in\{1,2,\ldots,d\}$ it holds that $\left\|\zeta_{d,i}\right\|_{\mathbb{R}^{d}}=1$ assure that for all $d\in\mathbb{N}$ , $t\in[0,T]$ , $x=(x_{1},x_{2},\ldots,x_{d})\in\mathbb{R}^{d}$ it holds that

[TABLE]

This and (341) ensure that for all $d\in\mathbb{N}$ , $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ it holds that

[TABLE]

Next note that (331), (334), and Lemma 4.2 (with $d=d$ , $T=T$ , $(\alpha_{i})_{i\in\{1,\ldots,d\}}=(\alpha_{d,i})_{i\in\{1,\ldots,d\}}$ , $(\beta_{i})_{i\in\{1,\ldots,d\}}=(\beta_{d,i})_{i\in\{1,\ldots,d\}}$ , $\Sigma=\Sigma_{d}$ , $W=W^{d,\theta}$ , $X=X^{d,\theta}$ for $\theta\in\Theta$ , $d\in\mathbb{N}$ in the notation of Lemma 4.2) demonstrate that for all $d\in\mathbb{N}$ , $\theta\in\Theta$ it holds that $X^{d,\theta}$ is a continuous random field which satisfies for every $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ that $(X^{d,\theta}_{t,s}(x))_{s\in[t,T]}\colon[t,T]\times\Omega\to\mathbb{R}^{d}$ is an $(\mathbb{F}^{d,\theta}_{s})_{s\in[t,T]}$ / $\mathcal{B}(\mathbb{R}^{d})$ -adapted stochastic process and which satisfies that for all $t\in[0,T]$ , $s\in[t,T]$ , $x\in\mathbb{R}^{d}$ it holds $\mathbb{P}$ -a.s. that

[TABLE]

Combining this, (332), the fact that for all $d\in\mathbb{N}$ it holds that $\mu_{d}$ and $\sigma_{d}$ are globally Lipschitz continuous functions, and (343) with Theorem 3.20 (with $T=T$ , $\alpha=1$ , $c=\kappa$ , $K=\kappa$ , $L=\kappa$ , $p=p$ , $P=0$ , $\mathfrak{P}=\mathfrak{P}$ , $q=q$ , $C_{1}=0$ , $C_{2}=\kappa$ , $\xi_{d}=\xi_{d}$ , $g_{d}=g_{d}$ , $f_{d}=f_{d}$ , $\mu_{d}=\mu_{d}$ , $\sigma_{d}=\sigma_{d}$ , $\mathcal{R}^{\theta}=\mathcal{R}^{\theta}$ , $\mathbb{F}^{d,\theta}=\mathbb{F}^{d,\theta}$ , $W^{d,\theta}=W^{d,\theta}$ , $X^{d,\theta}=X^{d,\theta}$ , $V^{d,\theta}_{M,n}=V^{d,\theta}_{M,n}$ , $\mathcal{C}_{d,M,n}=\mathcal{C}_{d,M,n}$ for $d\in\mathbb{N}$ , $\theta\in\Theta$ , $M,n\in\mathbb{Z}$ , in the notation of Theorem 3.20) establishes that

(I)

for every $d\in\mathbb{N}$ there exists a unique at most polynomially growing function $u_{d}\in C([0,T]\times\mathbb{R}^{d},\mathbb{R})$ which satisfies that $u_{d}|_{(0,T)\times\mathbb{R}^{d}}\colon(0,T)\times\mathbb{R}^{d}\to\mathbb{R}$ is a viscosity solution of

[TABLE]

for $(t,x)\in(0,T)\times\mathbb{R}^{d}$ and which satisfies for all $x\in\mathbb{R}^{d}$ that $u_{d}(T,x)=g_{d}(x)$ and

(II)

there exists a function $N=(N_{d,\varepsilon})_{d\in\mathbb{N},\varepsilon\in(0,\infty)}\colon\mathbb{N}\times(0,\infty)\to\mathbb{N}$ such that for all $d\in\mathbb{N}$ , $\varepsilon,\delta\in(0,\infty)$ it holds that

[TABLE]

Observe that item (I) proves item (i). Furthermore, note that item (II) establishes item (ii). The proof of Theorem 4.3 is thus completed. ∎
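The coefficient bounds used in the proof above (cf. (341)–(343)) are not reproduced in this extract. The following computation is a sketch of how bounds of this type can be obtained, under the assumption that $\mu_{d}(t,x)=(\alpha_{d,i}x_{i})_{i\in\{1,\ldots,d\}}$ and that the $i$-th row of $\sigma_{d}(t,x)$ is $\beta_{d,i}x_{i}\zeta_{d,i}^{*}$. These explicit forms are assumptions chosen to match the Black-Scholes setting and the parameter choice $C_{1}=0$ , $C_{2}=\kappa$ in the application of Theorem 3.20.

```latex
% Assumed: \mu_d(t,x) = (\alpha_{d,i} x_i)_{i}, row i of \sigma_d(t,x) is \beta_{d,i} x_i \zeta_{d,i}^{*}.
\begin{aligned}
\langle x,\mu_{d}(t,x)\rangle_{\mathbb{R}^{d}}
  &= \sum_{i=1}^{d}\alpha_{d,i}\,|x_{i}|^{2}
   \leq \kappa\,\|x\|_{\mathbb{R}^{d}}^{2}, \\
|||\sigma_{d}(t,x)|||_{d}^{2}
  &= \sum_{i=1}^{d}|\beta_{d,i}|^{2}\,|x_{i}|^{2}\,\|\zeta_{d,i}\|_{\mathbb{R}^{d}}^{2}
   = \sum_{i=1}^{d}|\beta_{d,i}|^{2}\,|x_{i}|^{2}
   \leq \kappa\,\|x\|_{\mathbb{R}^{d}}^{2}.
\end{aligned}
```

Both bounds use only $\sup_{d,i}\max\{|\alpha_{d,i}|,|\beta_{d,i}|^{2}\}\leq\kappa$ and $\|\zeta_{d,i}\|_{\mathbb{R}^{d}}=1$ , so the constants do not depend on the dimension $d$ .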

Theorem 4.4.

Let $T\in(0,\infty)$ , $\kappa,p,\mathfrak{P},q\in[0,\infty)$ , $(\alpha_{d,i})_{i\in\{1,2,\ldots,d\},d\in\mathbb{N}}$ , $(\beta_{d,i})_{i\in\{1,2,\ldots,d\},d\in\mathbb{N}}\subseteq\mathbb{R}$ , $\Theta=\cup_{n=1}^{\infty}\mathbb{Z}^{n}$ satisfy that $\sup_{d\in\mathbb{N},i\in\{1,2,\ldots,d\}}\max\{|\alpha_{d,i}|,|\beta_{d,i}|^{2}\}\leq\kappa$ , for every $d\in\mathbb{N}$ let $\left<\cdot,\cdot\right>_{\mathbb{R}^{d}}\colon\mathbb{R}^{d}\times\mathbb{R}^{d}\to\mathbb{R}$ be the Euclidean scalar product on $\mathbb{R}^{d}$ and let $\left\|\cdot\right\|_{\mathbb{R}^{d}}\colon\mathbb{R}^{d}\to[0,\infty)$ be the Euclidean norm on $\mathbb{R}^{d}$ , for every $d\in\mathbb{N}$ let $\xi_{d}\in\mathbb{R}^{d}$ , $\Sigma_{d}=(\zeta_{d,1},\ldots,\zeta_{d,d})\in\mathbb{R}^{d\times d}$ satisfy for all $i\in\{1,2,\ldots,d\}$ that $\left\|\xi_{d}\right\|_{\mathbb{R}^{d}}\leq\kappa d^{q}$ and $\|\zeta_{d,i}\|_{\mathbb{R}^{d}}=1$ , for every $d\in\mathbb{N}$ let $\mu_{d}\colon[0,T]\times\mathbb{R}^{d}\to\mathbb{R}^{d}$ and $\sigma_{d}\colon[0,T]\times\mathbb{R}^{d}\to\mathbb{R}^{d\times d}$ be the functions which satisfy for all $t\in[0,T]$ , $x=(x_{1},x_{2},\ldots,x_{d})\in\mathbb{R}^{d}$ that

[TABLE]

let $f\colon\mathbb{R}\to\mathbb{R}$ be a Lipschitz continuous function, for every $d\in\mathbb{N}$ let $g_{d}\in C(\mathbb{R}^{d},\mathbb{R})$ satisfy for all $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ that

[TABLE]

let $(\Omega,\mathcal{F},\mathbb{P})$ be a probability space, let $\mathcal{R}^{\theta}\colon\Omega\to[0,1]$ , $\theta\in\Theta$ , be independent $\mathcal{U}_{[0,1]}$ -distributed random variables, let $R^{\theta}=(R^{\theta}_{t})_{t\in[0,T]}\colon[0,T]\times\Omega\to[0,T]$ , $\theta\in\Theta$ , be the stochastic processes which satisfy for all $t\in[0,T]$ , $\theta\in\Theta$ that

$$R^{\theta}_{t}=t+(T-t)\mathcal{R}^{\theta},$$

for every $d\in\mathbb{N}$ let $W^{d,\theta}\colon[0,T]\times\Omega\to\mathbb{R}^{d}$ , $\theta\in\Theta$ , be independent standard Brownian motions, assume that $\left(W^{d,\theta}\right)_{d\in\mathbb{N},\theta\in\Theta}$ and $\left(\mathcal{R}^{\theta}\right)_{\theta\in\Theta}$ are independent, for every $d\in\mathbb{N}$ , $\theta\in\Theta$ let $X^{d,\theta}=(X^{d,\theta,i}_{t,s}(x))_{s\in[t,T],t\in[0,T],x\in\mathbb{R}^{d},i\in\{1,2,\ldots,d\}}\colon\{(t,s)\in[0,T]^{2}\colon t\leq s\}\times\mathbb{R}^{d}\times\Omega\to\mathbb{R}^{d}$ be the function which satisfies for all $t\in[0,T]$ , $s\in[t,T]$ , $x=(x_{1},x_{2},\ldots,x_{d})\in\mathbb{R}^{d}$ , $i\in\{1,2,\ldots,d\}$ that

[TABLE]

let $V^{d,\theta}_{M,n}\colon[0,T]\times\mathbb{R}^{d}\times\Omega\to\mathbb{R}$ , $M,n\in\mathbb{Z}$ , $\theta\in\Theta$ , $d\in\mathbb{N}$ , be functions which satisfy for all $d,M,n\in\mathbb{N}$ , $\theta\in\Theta$ , $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ that $V^{d,\theta}_{M,-1}(t,x)=V^{d,\theta}_{M,0}(t,x)=0$ and

[TABLE]

and let $(\mathcal{C}_{d,M,n})_{M,n\in\mathbb{Z},d\in\mathbb{N}}\subseteq\mathbb{N}_{0}$ satisfy for all $d,n,M\in\mathbb{N}$ that $\mathcal{C}_{d,M,0}=0$ and

[TABLE]

Then

(i)

for every $d\in\mathbb{N}$ there exists a unique at most polynomially growing function $u_{d}\in C([0,T]\times\mathbb{R}^{d},\mathbb{R})$ which satisfies that $u_{d}|_{(0,T)\times\mathbb{R}^{d}}\colon(0,T)\times\mathbb{R}^{d}\to\mathbb{R}$ is a viscosity solution of

[TABLE]

for $(t,x)\in(0,T)\times\mathbb{R}^{d}$ and which satisfies for all $x\in\mathbb{R}^{d}$ that $u_{d}(T,x)=g_{d}(x)$ and

(ii)

there exist functions $N=(N_{d,\varepsilon})_{d\in\mathbb{N},\varepsilon\in(0,\infty)}\colon\mathbb{N}\times(0,\infty)\to\mathbb{N}$ and $C=(C_{\delta})_{\delta\in(0,\infty)}\colon(0,\infty)\to(0,\infty)$ such that for all $d\in\mathbb{N}$ , $\varepsilon,\delta\in(0,\infty)$ it holds that

[TABLE]
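To get a feel for how a cost sequence such as $\mathcal{C}_{d,M,n}$ behaves, the following Python sketch evaluates one natural cost accounting for an MLP scheme. It is an illustration under stated assumptions, not a transcription of the recursion in the theorem, whose display is not reproduced in this extract. The function name `mlp_cost` is hypothetical, and the accounting is assumed: each sampled path costs $d$ scalar random numbers, each sampled time costs one, and level $n$ calls levels $k$ and $k-1$ recursively for $k<n$.

```python
from functools import lru_cache


@lru_cache(maxsize=None)
def mlp_cost(d, M, n):
    """Assumed count of scalar random numbers for one realization of V_{M,n}.

    Accounting (an assumption, see the lead-in):
      * M**n terminal samples, each needing d normal random numbers;
      * for each level k < n, M**(n-k) samples, each needing one uniform
        time, d normal random numbers, one level-k evaluation and, for
        k > 0, one level-(k-1) evaluation.
    """
    if n <= 0:
        return 0  # levels 0 and -1 are identically zero and cost nothing
    total = M ** n * d
    for k in range(n):
        inner = d + 1 + mlp_cost(d, M, k)
        if k > 0:
            inner += mlp_cost(d, M, k - 1)
        total += M ** (n - k) * inner
    return total


if __name__ == "__main__":
    # Cost along the diagonal M = n for a few dimensions.
    for d in (1, 10, 100):
        print(d, [mlp_cost(d, n, n) for n in range(1, 6)])
```

Under this accounting the cost is affine in $d$ for fixed $M$ and $n$, which is the mechanism behind a polynomial dependence on the dimension; the growth in $n$ along the diagonal $M=n$ is what has to be balanced against the accuracy $\varepsilon$.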

4.3 MLP approximations for the pricing of financial derivatives with default risks

Corollary 4.5.

Let $T,R,\gamma_{l},\gamma_{h},v_{l},v_{h}\in(0,\infty)$ , $p,q\in[0,\infty)$ , $\epsilon\in[0,1)$ , $\alpha,\beta\in\mathbb{R}$ , $f\in C(\mathbb{R},\mathbb{R})$ , $\Theta=\cup_{n=1}^{\infty}\mathbb{Z}^{n}$ satisfy for all $u\in\mathbb{R}$ that $\gamma_{l}<\gamma_{h}$ , $v_{l}>v_{h}$ , and

[TABLE]

let $\xi_{d}\in\mathbb{R}^{d}$ , $d\in\mathbb{N}$ , satisfy that $\sup_{d\in\mathbb{N}}\frac{\left\|\xi_{d}\right\|_{\mathbb{R}^{d}}}{d^{q}}<\infty$ , let $g_{d}\in C(\mathbb{R}^{d},\mathbb{R})$ , $d\in\mathbb{N}$ , satisfy that $\sup_{d\in\mathbb{N},x\in\mathbb{R}^{d}}\tfrac{|g_{d}(x)|}{1+\left\|x\right\|_{\mathbb{R}^{d}}^{p}}<\infty$ , let $(\Omega,\mathcal{F},\mathbb{P})$ be a probability space, let $\mathcal{R}^{\theta}\colon\Omega\to[0,1]$ , $\theta\in\Theta$ , be independent $\mathcal{U}_{[0,1]}$ -distributed random variables, let $R^{\theta}=(R^{\theta}_{t})_{t\in[0,T]}\colon[0,T]\times\Omega\to[0,T]$ , $\theta\in\Theta$ , be the stochastic processes which satisfy for all $t\in[0,T]$ , $\theta\in\Theta$ that $R^{\theta}_{t}=t+(T-t)\mathcal{R}^{\theta}$ , for every $d\in\mathbb{N}$ let $W^{d,\theta}=(W^{d,\theta,i})_{i\in\{1,2,\ldots,d\}}\colon[0,T]\times\Omega\to\mathbb{R}^{d}$ , $\theta\in\Theta$ , be independent standard Brownian motions, assume that $\left(W^{d,\theta}\right)_{d\in\mathbb{N},\theta\in\Theta}$ and $\left(\mathcal{R}^{\theta}\right)_{\theta\in\Theta}$ are independent, for every $d\in\mathbb{N}$ , $\theta\in\Theta$ let $X^{d,\theta}=(X^{d,\theta,i}_{t,s}(x))_{s\in[t,T],t\in[0,T],x\in\mathbb{R}^{d},i\in\{1,2,\ldots,d\}}\colon\{(t,s)\in[0,T]^{2}\colon t\leq s\}\times\mathbb{R}^{d}\times\Omega\to\mathbb{R}^{d}$ be the function which satisfies for all $i\in\{1,2,\ldots,d\}$ , $t\in[0,T]$ , $s\in[t,T]$ , $x=(x_{1},x_{2},\ldots,x_{d})\in\mathbb{R}^{d}$ that

[TABLE]

let $V^{d,\theta}_{M,n}\colon[0,T]\times\mathbb{R}^{d}\times\Omega\to\mathbb{R}$ , $M,n\in\mathbb{Z}$ , $\theta\in\Theta$ , $d\in\mathbb{N}$ , be functions which satisfy for all $d,M,n\in\mathbb{N}$ , $\theta\in\Theta$ , $t\in[0,T]$ , $x\in\mathbb{R}^{d}$ that $V^{d,\theta}_{M,-1}(t,x)=V^{d,\theta}_{M,0}(t,x)=0$ and

[TABLE]

and let $(\mathcal{C}_{d,M,n})_{M,n\in\mathbb{Z},d\in\mathbb{N}}\subseteq\mathbb{N}_{0}$ satisfy for all $d,n,M\in\mathbb{N}$ that $\mathcal{C}_{d,M,0}=0$ and

[TABLE]

Then

(i)

for every $d\in\mathbb{N}$ there exists a unique at most polynomially growing function $u_{d}\in C([0,T]\times\mathbb{R}^{d},\mathbb{R})$ which satisfies that $u_{d}|_{(0,T)\times\mathbb{R}^{d}}\colon(0,T)\times\mathbb{R}^{d}\to\mathbb{R}$ is a viscosity solution of

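$$\big(\tfrac{\partial u_{d}}{\partial t}\big)(t,x)+\left[\sum_{i=1}^{d}\tfrac{|\beta|^{2}|x_{i}|^{2}}{2}\big(\tfrac{\partial^{2}u_{d}}{\partial(x_{i})^{2}}\big)(t,x)\right]+\left[\sum_{i=1}^{d}\alpha x_{i}\big(\tfrac{\partial u_{d}}{\partial x_{i}}\big)(t,x)\right]+f(u_{d}(t,x))=0$$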

for $(t,x)=(t,(x_{1},x_{2},\ldots,x_{d}))\in(0,T)\times\mathbb{R}^{d}$ and which satisfies for all $x\in\mathbb{R}^{d}$ that $u_{d}(T,x)=g_{d}(x)$ and

(ii)

there exist functions $N=(N_{d,\varepsilon})_{d\in\mathbb{N},\varepsilon\in(0,1]}\colon\mathbb{N}\times(0,1]\to\mathbb{N}$ and $C=(C_{\delta})_{\delta\in(0,\infty)}\colon(0,\infty)\to(0,\infty)$ such that for all $d\in\mathbb{N}$ , $\varepsilon\in(0,1]$ , $\delta\in(0,\infty)$ it holds that $\mathcal{C}_{d,N_{d,\varepsilon},N_{d,\varepsilon}}\leq C_{\delta}\,d^{1+qp(2+\delta)}\varepsilon^{-(2+\delta)}$ and

[TABLE]
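The structure of the approximations $V^{d,\theta}_{M,n}$ in Corollary 4.5 can be illustrated with a short Python sketch. It is a minimal sketch under stated assumptions, not the authors' implementation: the recursion is written in the form commonly used for full-history MLP schemes (a Monte Carlo average of $g_d$ at the terminal time plus telescoping corrections in the nonlinearity $f$, evaluated at the uniformly sampled times $R^{\theta}_{t}=t+(T-t)\mathcal{R}^{\theta}$), and each component of $X^{d,\theta}$ is assumed to be a geometric Brownian motion with drift $\alpha$ and volatility $\beta$, matching a Black-Scholes-type PDE as in item (i). The names `mlp` and `sample_x` are hypothetical, and fresh random numbers drawn from a single generator stand in for the independent families indexed by $\theta$.

```python
import math
import random


def mlp(n, M, t, x, T, f, g, alpha, beta, rng):
    """One realization of an assumed MLP approximation of u_d(t, x).

    f: nonlinearity R -> R, g: terminal condition on lists of length d,
    rng: a random.Random instance supplying all randomness.
    """
    if n <= 0:
        return 0.0  # V_{M,-1} = V_{M,0} = 0

    def sample_x(s):
        # Exact geometric Brownian motion transition from time t to s >= t
        # (assumed dynamics: drift alpha * x_i, volatility beta * x_i).
        dt = s - t
        return [xi * math.exp((alpha - 0.5 * beta ** 2) * dt
                              + beta * math.sqrt(dt) * rng.gauss(0.0, 1.0))
                for xi in x]

    # Monte Carlo average of the terminal condition over M**n samples.
    num_terminal = M ** n
    value = sum(g(sample_x(T)) for _ in range(num_terminal)) / num_terminal

    # Telescoping corrections: level k uses M**(n-k) samples.
    for k in range(n):
        num_samples = M ** (n - k)
        acc = 0.0
        for _ in range(num_samples):
            r = t + (T - t) * rng.random()  # R_t = t + (T - t) * U
            y = sample_x(r)
            acc += f(mlp(k, M, r, y, T, f, g, alpha, beta, rng))
            if k > 0:
                acc -= f(mlp(k - 1, M, r, y, T, f, g, alpha, beta, rng))
        value += (T - t) * acc / num_samples
    return value


if __name__ == "__main__":
    rng = random.Random(2020)
    d = 5
    # Illustrative choices only: linear discounting f and an averaged payoff g.
    estimate = mlp(3, 3, 0.0, [1.0] * d, 1.0,
                   lambda u: -0.05 * u,
                   lambda x: max(sum(x) / len(x) - 1.0, 0.0),
                   0.02, 0.2, rng)
    print(estimate)
```

The number of samples shrinks geometrically as the level $k$ increases, so the expensive high-level evaluations are called rarely. The recursion still grows quickly in $n$, so small values of $n$ and $M$ are advisable when experimenting.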

Acknowledgments

This project has been partially supported through the SNSF-Research project 200020_175699 “Higher order numerical approximation methods for stochastic partial differential equations”.

