Strong convergence rates for Markovian representations of fractional processes

Philipp Harms

arXiv:1902.01471·q-fin.MF·August 6, 2020

TL;DR

This paper investigates the numerical discretization of Markovian representations of fractional processes, demonstrating high-order strong convergence rates and analyzing their implications for Monte Carlo methods in fractional volatility modeling.

Contribution

It establishes that discretizations of these representations can achieve arbitrarily high polynomial order convergence, clarifying their effectiveness and limitations in Monte Carlo simulations.

Findings

1. Discretizations of the Markovian representation have strong convergence rates of arbitrarily high polynomial order.

2. These rates explain both the potential and some limitations of such representations as the basis of Monte Carlo schemes.

3. The results apply to fractional volatility models such as the rough Bergomi model.

Abstract

Many fractional processes can be represented as an integral over a family of Ornstein-Uhlenbeck processes. This representation naturally lends itself to numerical discretizations, which are shown in this paper to have strong convergence rates of arbitrarily high polynomial order. This explains the potential, but also some limitations of such representations as the basis of Monte Carlo schemes for fractional volatility models such as the rough Bergomi model.

Tables

Table 1. Complexity of several numerical methods for sampling a fractional process $(W^{H}_{i/k})_{i\in\{1,\dots,k\}}$ with Hurst index $H\in(0,1/2)$ at $k$ equidistant time points.

Method	Structure	Error	Complexity
Cholesky	Static	0	$k^{3}$
Hosking, Dieker [28, 17]	Recursive	0	$k^{2}$
Dietrich, Newsam [18]	Static	0	$k\log k$
Bennedsen, Lunde, Pakkanen [12]	Recursive	$\epsilon=k^{-H}$	$k\log k$
Carmona, Coutin, Montseny [15]	Recursive	$\epsilon$	$k\epsilon^{-3/(4H)}$
This paper	Recursive	$\epsilon$	$k\epsilon^{-1/r}$ for $r\in(0,\infty)$

Equations

W_{t}^{H}

Y\colon\Omega\to C\big([0,T],C^{\infty}((0,\infty),\mathbb{R})\cap L^{1}((0,\infty),\mu)\big),

\forall t\in[0,\infty),\ \forall x\in(0,\infty)\colon\quad \mathbb{P}\Big[Y_{t}(x)=\frac{1}{\Gamma(\frac{1}{2}-H)}\int_{0}^{t}e^{-(t-s)x}\,dW_{s}\Big]=1.

\forall t\in[0,T]\colon\quad \mathbb{P}\Big[\int_{0}^{\infty}Y_{t}(x)\frac{dx}{x^{\alpha}}=\int_{0}^{t}(t-s)^{\alpha-1}\,dW_{s}\Big]=1.

\Big\|\sup_{t\in[0,T]}\sup_{x\in(0,\infty)}\big|x^{\beta}\partial_{x}^{m}Y_{t}(x)\big|\Big\|_{L^{p}(\Omega)}

\sup_{x_{0}\in[0,1]}x_{0}^{-\gamma}\Big\|\sup_{t\in[0,T]}\Big|\int_{0}^{x_{0}}Y_{t}(x)\frac{dx}{x^{\alpha}}\Big|\Big\|_{L^{p}(\Omega)}

\sup_{x_{1}\in[1,\infty)}x_{1}^{\delta}\Big\|\sup_{t\in[0,T]}\Big|\int_{x_{1}}^{\infty}Y_{t}(x)\frac{dx}{x^{\alpha}}\Big|\Big\|_{L^{p}(\Omega)}

Y_{t}(x):=\frac{1}{\Gamma(\frac{1}{2}-H)}\Big(W_{t}-\int_{0}^{t}W_{s}\,x\,e^{-(t-s)x}\,ds\Big),\qquad t\in[0,T],\ x\in(0,\infty),

\int_{0}^{\infty}Y_{t}(x)\frac{dx}{x^{\alpha}}=\frac{1}{\Gamma(\frac{1}{2}-H)}\int_{0}^{t}\int_{0}^{\infty}e^{-(t-s)x}\frac{dx}{x^{\alpha}}\,dW_{s}=\int_{0}^{t}(t-s)^{\alpha-1}\,dW_{s}.

\forall x\in(0,\infty)\colon\quad \mathbb{E}\Big[\sup_{t\in[0,T]}|Y_{t}(x)|\Big]\leq C_{1}\sqrt{\frac{\log(1+Tx)}{x}}.

C_{2}=\sup_{t\in(-\infty,0]}\sup_{x\in(0,\infty)}\big|x^{m-1}\partial_{t}\partial_{x}^{m}e^{tx}\big|=\sup_{t\in(-\infty,0]}\sup_{x\in(0,\infty)}\big|x^{m-1}\partial_{t}(t^{m}e^{tx})\big|=\sup_{t\in(-\infty,0]}\sup_{x\in(0,\infty)}\big|\big(m(tx)^{m-1}+(tx)^{m}\big)e^{tx}\big|=\sup_{y\in(-\infty,0]}\big|\big(my^{m-1}+y^{m}\big)e^{y}\big|<\infty,

C_{3}

\mathbb{E}\Big[\sup_{t\in[0,T]}\sup_{x\in(0,\infty)}\big|x^{\beta}\partial_{x}^{m}Y_{t}(x)\big|\Big]=\mathbb{E}\Big[\sup_{t\in[0,T]}\sup_{x\in(0,\infty)}\Big|\int_{0}^{t}W_{s}\,x^{\beta}\partial_{x}^{m}\big(xe^{-(t-s)x}\big)\,ds\Big|\Big]\leq C_{2}T\,\mathbb{E}\Big[\sup_{t\in[0,T]}|W_{t}|\Big]<\infty,

\sup_{x_{0}\in[0,1]}x_{0}^{-\gamma}\,\mathbb{E}\Big[\sup_{t\in[0,T]}\Big|\int_{0}^{x_{0}}Y_{t}(x)\frac{dx}{x^{\alpha}}\Big|\Big]\leq C_{1}\sup_{x_{0}\in[0,1]}x_{0}^{-\gamma}\int_{0}^{x_{0}}\sqrt{\frac{\log(1+Tx)}{x}}\,\frac{dx}{x^{\alpha}}\leq C_{1}\sup_{x_{0}\in[0,1]}x_{0}^{-\gamma}\int_{0}^{x_{0}}\sqrt{T}\,\frac{dx}{x^{\alpha}}=C_{1}\sqrt{T}\,\gamma^{-1}<\infty,

\sup_{x_{1}\in[1,\infty)}x_{1}^{\delta}\,\mathbb{E}\Big[\sup_{t\in[0,T]}\Big|\int_{x_{1}}^{\infty}Y_{t}(x)\frac{dx}{x^{\alpha}}\Big|\Big]\leq C_{1}\sup_{x_{1}\in[1,\infty)}x_{1}^{\delta}\int_{x_{1}}^{\infty}\sqrt{\frac{\log(1+Tx)}{x}}\,\frac{dx}{x^{\alpha}}\leq C_{1}C_{3}\sup_{x_{1}\in[1,\infty)}x_{1}^{\delta}\int_{x_{1}}^{\infty}x^{-1-\delta}\,dx=C_{1}C_{3}\,\delta^{-1}<\infty.

\forall k\in\{0,\dots,m-1\}\colon\quad \int_{a}^{b}x^{k}w(x)\,\mu(dx)=\int_{a}^{b}x^{k}w(x)\,dx.

Y\colon\Omega\to C\big([0,T],C^{m}((0,\infty))\cap L^{1}((0,\infty),\mu)\big)

\limsup_{x_{0}\downarrow 0}x_{0}^{-\gamma}\Big\|\sup_{t\in[0,T]}\Big|\int_{0}^{x_{0}}Y_{t}(x)x^{-\alpha}\,dx\Big|\Big\|_{L^{p}(\Omega)}

\limsup_{x_{1}\uparrow\infty}x_{1}^{\delta}\Big\|\sup_{t\in[0,T]}\Big|\int_{x_{1}}^{\infty}Y_{t}(x)x^{-\alpha}\,dx\Big|\Big\|_{L^{p}(\Omega)}

\xi_{n,0}=n^{-r/\gamma},\qquad \xi_{n,n}=n^{r/\delta},\qquad \xi_{n,i}=\xi_{n,0}(\xi_{n,n}/\xi_{n,0})^{i/n},

\sup_{n\in\mathbb{N}}n^{r}\Big\|\sup_{t\in[0,T]}\Big|\int_{0}^{\infty}Y_{t}(x)x^{-\alpha}\big(\mu_{n}(dx)-dx\big)\Big|\Big\|_{L^{p}(\Omega)}<\infty.

\eta

C_{1}

C_{2}

C_{3}

C_{4}

\forall\lambda\in[0,\infty)\colon\quad \lambda^{1-\alpha-\beta+m}=(1+(\lambda-1))^{1-\alpha-\beta+m}\geq 1+(1-\alpha-\beta+m)(\lambda-1),

\forall\xi\in[1,\infty),\ \forall n\in[\log(\xi),\infty)\colon\quad \xi^{1/n}-1

\int_{\xi_{n,i}}^{\xi_{n,i+1}}Y_{t}(x)x^{-\alpha}\big(\mu_{n}(dx)-dx\big)=\int_{\xi_{n,i}}^{\xi_{n,i+1}}\partial_{x}^{m}Y_{t}(x)K_{n,i}(x)\,dx,

Full text

Strong convergence rates for Markovian representations of fractional processes

Philipp Harms

Department of Stochastics

University of Freiburg

Abstract.

Many fractional processes can be represented as an integral over a family of Ornstein–Uhlenbeck processes. This representation naturally lends itself to numerical discretizations, which are shown in this paper to have strong convergence rates of arbitrarily high polynomial order. This explains the potential, but also some limitations of such representations as the basis of Monte Carlo schemes for fractional volatility models such as the rough Bergomi model.

2010 Mathematics Subject Classification:

60G22, 60G15, 65C05, 91G60

The author gratefully acknowledges support in the form of a Junior Fellowship of the Freiburg Institute of Advanced Studies.

1. Introduction

This paper establishes strong convergence rates for certain numerical approximations of fractional processes. These approximations are inspired by Markovian representations of fractional Brownian motion [14, 15, 31, 26] and of more general Volterra processes with singular kernels [32, 4, 2, 3, 16]. The simplest such representation takes the form

[TABLE]

where $W$ is standard Brownian motion, $W^{H}$ is Volterra Brownian motion (also known as Riemann–Liouville fractional Brownian motion or Lévy’s definition of fractional Brownian motion) with Hurst index $H\in(0,1/2)$, and $Y(x)$ is an Ornstein–Uhlenbeck process with speed of mean reversion $x\in(0,\infty)$. The random field $Y_{t}(x)$, which is depicted in Figure 1, has a version which is Hölder continuous in $t$ and smooth in $x$; see Lemma 1 for the precise statement. Thanks to this spatial smoothness, the integral over $x$ can be approximated efficiently using high-order quadrature rules, following and extending [15, 26, 1, 3]. This leads to numerical approximations of the Volterra Brownian motion $W^{H}$.

The main result of this article is that Volterra Brownian motion can be approximated at arbitrarily high polynomial convergence rates by weighted sums of Ornstein–Uhlenbeck processes; see Theorem 1 for the precise statement and error criterion. By arbitrarily high polynomial convergence rates we mean that $m$-point interpolatory quadrature on $n$ suitably chosen spatial quadrature intervals leads to a discretization error of order $n^{-r}$ for all $r\in(0,2Hm/3)$; see Remark 3. Thus, a given rate $r>0$ can be achieved by choosing $m>3r/(2H)$. Note that low Hurst indices $H$ require high spatial quadrature orders $m$ to achieve a given approximation rate $r$. A visual impression of the quality of this approximation can be obtained from Figure 2. The upper bound $2Hm/3$ on the convergence rate closely matches the numerically observed rate; see Figure 3.
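As a small worked illustration of the condition $m>3r/(2H)$, the following sketch computes the smallest admissible quadrature order for a target rate. The function name `min_quadrature_order` is hypothetical, and exact rational arithmetic is used only to avoid rounding artifacts at the boundary.

```python
from fractions import Fraction
from math import floor


def min_quadrature_order(r, H):
    """Smallest integer m with m > 3r/(2H), i.e. with r < 2Hm/3."""
    bound = Fraction(3) * Fraction(r) / (2 * Fraction(H))
    # The inequality is strict, so an integer bound must be exceeded by one.
    return floor(bound) + 1


# Illustrative values: a rough Hurst index H = 1/10 and target rate r = 1.
m_rough = min_quadrature_order(Fraction(1), Fraction(1, 10))
# A smoother case H = 1/4 with the same target rate.
m_smooth = min_quadrature_order(Fraction(1), Fraction(1, 4))
```

In this example the rougher process needs a considerably higher quadrature order for the same rate, in line with the remark above about low Hurst indices.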

The motivation of this article is to develop efficient Monte Carlo methods for fractional (or rough) volatility models [24, 7, 11, 8, 27], which have been introduced on the grounds of extensive empirical evidence [24, 7, 11] and theoretical results [6, 20, 19, 9]. Under our discretization, put prices in the rough Bergomi model converge at the same rate as the underlying fractional volatility process; see Theorem 1. By put-call parity, this extends to call prices if the asset and volatility processes are driven by negatively correlated Brownian motions, as explained at the end of Remark 2. A fully discrete Monte Carlo scheme for the rough Bergomi model can be obtained by discretizing the Ornstein–Uhlenbeck processes of Theorem 1 in time. This can be done efficiently because the covariance matrix of the Ornstein–Uhlenbeck increments has low numerical rank if the time steps are small.
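The low-rank claim can be explored numerically with a minimal sketch. It assumes unnormalized Ornstein–Uhlenbeck increments $\int_{t}^{t+\Delta}e^{-(t+\Delta-s)x_{i}}dW_{s}$ driven by one common Brownian motion, whose covariance follows from the Itô isometry as $(1-e^{-(x_{i}+x_{j})\Delta})/(x_{i}+x_{j})$. The speeds of mean reversion, the time step, the tolerance, and the function names are illustrative choices, not values from the paper.

```python
import numpy as np


def ou_increment_covariance(x, dt):
    """Covariance matrix of the OU increments over one time step of length dt.

    Entry (i, j) equals (1 - exp(-(x_i + x_j) dt)) / (x_i + x_j).
    """
    z = x[:, None] + x[None, :]
    return -np.expm1(-z * dt) / z


def numerical_rank(cov, rel_tol=1e-10):
    """Number of eigenvalues above rel_tol times the largest eigenvalue."""
    eig = np.linalg.eigvalsh(cov)
    return int(np.sum(eig > rel_tol * eig.max()))


# Assumed example: 20 geometrically spaced speeds and a small time step.
x = np.geomspace(0.1, 100.0, 20)
cov = ou_increment_covariance(x, dt=1e-3)
rank = numerical_rank(cov)
```

For these assumed parameters the numerical rank is well below the number of Ornstein–Uhlenbeck processes, so the increments could be sampled from far fewer independent Gaussians than there are factors.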

To evaluate the computational complexity of our method, we consider the task of sampling a fractional process $(W^{H}_{i/k})_{i\in\{1,\dots,k\}}$ with Hurst index $H\in(0,1/2)$ at a temporal grid of $k$ equidistant time points. Our method has some additional parameters, which determine the spatial discretization of the integral representation, namely the number $n$ of spatial quadrature intervals and the order $m$ of the spatial quadrature. These are described in detail in Lemma 2. On the above-mentioned task, our method achieves accuracy $n^{-r}$ at complexity $kn$ if the order of spatial quadrature is sufficiently high, i.e., if $m>3r/(2H)$ (see Remark 3). Equivalently, accuracy $\epsilon$ can be achieved at complexity $k\epsilon^{-1/r}$, as stated in Table 1. Typically, one is interested in temporal grids of size $k=\epsilon^{-s}$ for some $s\in(0,\infty)$. For instance, a value of $s$ slightly above $1/H$ guarantees that the piecewise constant interpolation of an $\epsilon$-accurate time-discrete approximation defines a continuous-time approximation of the same order of accuracy in the supremum norm. This is because the sample paths of the fractional process $W^{H}$ are nearly $H$-Hölder continuous. Under the assumption $k=\epsilon^{-s}$, Table 1 shows that our method outperforms the methods of Hosking and Dieker [28, 17] and of Carmona, Coutin, and Montseny [15] but is outperformed by the hybrid scheme of Bennedsen, Lunde, and Pakkanen [12] and by the circulant embedding method of Dietrich and Newsam [18]. This can be verified by substituting $k=\epsilon^{-s}$ in Table 1. Using exponentially converging quadrature rules such as Chebychev [22, 21], one could at best hope to reduce the complexity of our method from $k\epsilon^{-1/r}$ down to $k\log\epsilon^{-1}$. In the important special case $k=\epsilon^{-s}$ with $s=1/H$, this would result in exactly the same complexity $\epsilon^{-1/H}\log\epsilon^{-1}$ as the hybrid scheme [12] and the circulant embedding method [18].
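The substitution $k=\epsilon^{-s}$ in Table 1 can be spelled out as a short sketch that returns, for each method, the exponent $e$ in a cost of order $\epsilon^{-e}$. Logarithmic factors are ignored, the hybrid scheme [12] is omitted because its error is tied to $k$ through $\epsilon=k^{-H}$, and the function name and parameter values are illustrative assumptions.

```python
def complexity_exponents(H, s, r):
    """Exponents e such that the cost is of order eps**(-e) when k = eps**(-s).

    Derived from the complexity column of Table 1; log factors are dropped.
    """
    return {
        "Cholesky": 3 * s,                            # k^3
        "Hosking, Dieker": 2 * s,                     # k^2
        "Dietrich, Newsam": s,                        # k log k
        "Carmona, Coutin, Montseny": s + 3 / (4 * H),  # k eps^(-3/(4H))
        "This paper": s + 1 / r,                      # k eps^(-1/r)
    }


# Assumed example: H = 0.1, s = 1/H = 10, and a target rate r = 2.
exponents = complexity_exponents(H=0.1, s=10, r=2.0)
```

With these assumed values, the exponent for this paper lies between that of circulant embedding and those of the other recursive methods. The comparison with Hosking and Dieker requires $r>1/s$, which can always be arranged by raising the quadrature order.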

Several directions for future generalization and improvement come to mind. Theorem 1 is proved by approximation in the Laplace domain, which implies convergence in the time domain by the continuity of the Laplace transform. As Volterra processes with Lipschitz drift and volatility coefficients depend continuously on the kernel in the $L^{2}$ norm, it would be interesting to check if similar convergence results hold also in this more general setting. The rate of convergence could potentially be improved using Chebychev quadrature, taking advantage of the real analyticity of the random field $Y_{t}(x)$ in the spatial variable $x$. Finally, following [12, 30], one could aim for more careful treatments of the singularity of the kernel near the diagonal and apply some variance reduction techniques.

2. Setting and notation

We will frequently make the following assumptions. Let $H\in(0,1/2)$, let $\alpha=H+1/2$, let $\mu$ be the sigma-finite measure $x^{-\alpha}dx$ on the interval $(0,\infty)$, let $p\in[1,\infty)$, let $T\in(0,\infty)$, let $(\Omega,\mathcal{F},\mathbb{P},(\mathcal{F}_{t})_{t\in[0,T]})$ be a stochastic basis, and let $W,B\colon[0,T]\times\Omega\to\mathbb{R}$ be $(\mathcal{F}_{t})_{t\in[0,T]}$-Brownian motions.

3. Integral representation

Recall from the introduction that Volterra Brownian motion $W^{H}$ can be lifted to a random field $Y_{t}(x)$ indexed by a temporal variable $t\in[0,\infty)$ and a spatial variable $x\in(0,\infty)$ [14, 15, 31, 26]. The following lemma constructs a version of this random field which is continuous in the temporal variable and smooth in the spatial variable. Moreover, it establishes bounds on the spatial derivatives and tails of the random field. These bounds are needed for the subsequent error analysis in Section 4.

The constants $m,\alpha,\beta,\gamma,\delta$ appearing in Lemma 1 are used consistently throughout the paper: $m$ stands for the number of quadrature points in Definition 1 below, $\alpha=H+1/2$ denotes the Hurst index shifted by one half, $\beta$ describes spatial integrability of $\partial_{x}^{m}Y_{t}(x)$, $\gamma$ describes the integrability of the tail of $Y_{t}(x)$ as $x\to 0$, and $\delta$ describes the integrability of the tail of $Y_{t}(x)$ as $x\to\infty$. The spaces of continuous, smooth, and integrable functions appearing in Lemma 1 carry their natural topologies and Borel sigma algebras; see Appendix A.

Lemma 1.

Assume the setting of Section 2.

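(a) There exists a measurable mapping

$$Y\colon\Omega\to C\big([0,T],C^{\infty}((0,\infty),\mathbb{R})\cap L^{1}((0,\infty),\mu)\big)$$

such that

$$\forall t\in[0,\infty),\ \forall x\in(0,\infty)\colon\quad \mathbb{P}\Big[Y_{t}(x)=\frac{1}{\Gamma(\frac{1}{2}-H)}\int_{0}^{t}e^{-(t-s)x}\,dW_{s}\Big]=1.$$

(b) Volterra Brownian motion is a linear functional of $Y$ in the sense that

$$\forall t\in[0,T]\colon\quad \mathbb{P}\Big[\int_{0}^{\infty}Y_{t}(x)\frac{dx}{x^{\alpha}}=\int_{0}^{t}(t-s)^{\alpha-1}\,dW_{s}\Big]=1.$$

(c) The following integrability conditions hold: for all $m\in\mathbb{N}_{>0}$, $\beta:=m-1$, $\gamma:=1-\alpha$, and $\delta\in[0,\alpha-1/2)$,

$$\Big\|\sup_{t\in[0,T]}\sup_{x\in(0,\infty)}\big|x^{\beta}\partial_{x}^{m}Y_{t}(x)\big|\Big\|_{L^{p}(\Omega)}<\infty,$$

$$\sup_{x_{0}\in[0,1]}x_{0}^{-\gamma}\Big\|\sup_{t\in[0,T]}\Big|\int_{0}^{x_{0}}Y_{t}(x)\frac{dx}{x^{\alpha}}\Big|\Big\|_{L^{p}(\Omega)}<\infty,$$

$$\sup_{x_{1}\in[1,\infty)}x_{1}^{\delta}\Big\|\sup_{t\in[0,T]}\Big|\int_{x_{1}}^{\infty}Y_{t}(x)\frac{dx}{x^{\alpha}}\Big|\Big\|_{L^{p}(\Omega)}<\infty.$$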

Proof.

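(a) By Lemmas 4 and 6, the formula

$$Y_{t}(x):=\frac{1}{\Gamma(\frac{1}{2}-H)}\Big(W_{t}-\int_{0}^{t}W_{s}\,x\,e^{-(t-s)x}\,ds\Big),\qquad t\in[0,T],\ x\in(0,\infty),$$

defines a measurable map

$$Y\colon\Omega\to C\big([0,T],C^{\infty}((0,\infty),\mathbb{R})\cap L^{1}((0,\infty),\mu)\big).$$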

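(b) follows from the above and the stochastic Fubini theorem [34]: for each $t\in[0,T]$, one has almost surely that

$$\int_{0}^{\infty}Y_{t}(x)\frac{dx}{x^{\alpha}}=\frac{1}{\Gamma(\frac{1}{2}-H)}\int_{0}^{t}\int_{0}^{\infty}e^{-(t-s)x}\frac{dx}{x^{\alpha}}\,dW_{s}=\int_{0}^{t}(t-s)^{\alpha-1}\,dW_{s}.$$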
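The last equality in the stochastic Fubini step rests on a Laplace-transform identity for the power weight, which can be checked by the substitution $y=\tau x$ with $\tau=t-s>0$. The following short derivation is a sketch of that step, using only $\alpha=H+1/2$ from Section 2:

```latex
\int_{0}^{\infty} e^{-\tau x}\,\frac{dx}{x^{\alpha}}
  = \tau^{\alpha-1}\int_{0}^{\infty} e^{-y}\,y^{-\alpha}\,dy
  = \Gamma(1-\alpha)\,\tau^{\alpha-1}
  = \Gamma\big(\tfrac{1}{2}-H\big)\,\tau^{H-\frac{1}{2}} .
```

The factor $\Gamma(\frac{1}{2}-H)$ cancels against the normalization of $Y_{t}(x)$, which leaves the Volterra kernel $(t-s)^{\alpha-1}=(t-s)^{H-1/2}$.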

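(c) Let $C_{1}\in(0,\infty)$ be the constant in the maximal inequality for Ornstein–Uhlenbeck processes (see Lemma 5), i.e.,

$$\forall x\in(0,\infty)\colon\quad \mathbb{E}\Big[\sup_{t\in[0,T]}|Y_{t}(x)|\Big]\leq C_{1}\sqrt{\frac{\log(1+Tx)}{x}}.$$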

Recall that $\beta=m-1$ , and define $C_{2},C_{3}\in(0,\infty)$ as

[TABLE]

By the inequality $\log(1+Tx)\leq Tx$ , one obtains the following three estimates:

[TABLE]

This shows (c) for $p=1$ . The generalization to $p\in[1,\infty)$ is immediate because the $L^{p}$ norms of a Banach-valued Gaussian random variable are mutually equivalent thanks to the Kahane–Khintchine inequality [33, Theorem V.5.3] applied to the Karhunen–Loève expansion [33, Theorem V.5.7]. ∎

4. Discretization

In this section, the measure $\mu$ in the integral representation of Volterra Brownian motion is approximated by a weighted sum of Dirac measures. More specifically, for each $n\in\mathbb{N}$, the positive half line is truncated to a finite interval $[\xi_{n,0},\xi_{n,n}]$. This interval is then split into subintervals by a geometric sequence $(\xi_{n,i})_{i\in\{1,\dots,n\}}$, and on each subinterval $[\xi_{n,i},\xi_{n,i+1}]$ the measure $\mu$ is approximated by an $m$-point interpolatory quadrature rule $\mu_{n,i}$ such as the Gauss rule. Classical error analysis for interpolatory quadrature rules (see e.g. [13]) then yields the desired convergence result.

Definition 1.

Let $a,b\in\mathbb{R}$ satisfy $a<b$, let $w\colon[a,b]\to[0,\infty)$ be a continuous function such that $\int_{a}^{b}w(x)dx>0$, and let $m\in\mathbb{N}_{>0}$. Then a measure $\mu$ on $[a,b]$ is called a non-negative $m$-point interpolatory quadrature rule on $[a,b]$ with respect to the weight function $w$ if there are grid points $x_{1},\dots,x_{m}\in[a,b]$ and weights $w_{1},\dots,w_{m}\in[0,\infty)$ such that $\mu=\sum_{j=1}^{m}w_{j}\delta_{x_{j}}$ and

$$\forall k\in\{0,\dots,m-1\}\colon\quad \int_{a}^{b}x^{k}w(x)\,\mu(dx)=\int_{a}^{b}x^{k}w(x)\,dx.$$

The following lemma discretizes the integral representation of Volterra Brownian motion using interpolatory quadrature rules and bounds the discretization error. The assumptions of the lemma are satisfied thanks to the bounds of Lemma 1, where the same constants $\alpha,\beta,\gamma,\delta,m$ are used.

Lemma 2.

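Assume the setting of Section 2, let $m\in\mathbb{N}_{>0}$ and $\alpha,\beta,\gamma,\delta\in(0,\infty)$ satisfy $1-\alpha-\beta+m>0$, let

$$Y\colon\Omega\to C\big([0,T],C^{m}((0,\infty))\cap L^{1}((0,\infty),\mu)\big)$$

be a measurable function which satisfies the integrability conditions

$$\Big\|\sup_{t\in[0,T]}\sup_{x\in(0,\infty)}\big|x^{\beta}\partial_{x}^{m}Y_{t}(x)\big|\Big\|_{L^{p}(\Omega)}<\infty,$$

$$\limsup_{x_{0}\downarrow 0}x_{0}^{-\gamma}\Big\|\sup_{t\in[0,T]}\Big|\int_{0}^{x_{0}}Y_{t}(x)x^{-\alpha}\,dx\Big|\Big\|_{L^{p}(\Omega)}<\infty,$$

$$\limsup_{x_{1}\uparrow\infty}x_{1}^{\delta}\Big\|\sup_{t\in[0,T]}\Big|\int_{x_{1}}^{\infty}Y_{t}(x)x^{-\alpha}\,dx\Big|\Big\|_{L^{p}(\Omega)}<\infty,$$

let $r\in(0,\delta m/(1-\alpha-\beta+\delta+m))$, for each $n\in\mathbb{N}$ and $i\in\{0,\dots,n-1\}$ let

$$\xi_{n,0}=n^{-r/\gamma},\qquad \xi_{n,n}=n^{r/\delta},\qquad \xi_{n,i}=\xi_{n,0}(\xi_{n,n}/\xi_{n,0})^{i/n},$$

let $\mu_{n,i}$ be a non-negative $m$-point interpolatory quadrature rule on $[\xi_{n,i},\xi_{n,i+1}]$ with respect to the weight function $x\mapsto x^{-\alpha}$, and let $\mu_{n}=\sum_{i=0}^{n-1}\mu_{n,i}$. Then

$$\sup_{n\in\mathbb{N}}n^{r}\Big\|\sup_{t\in[0,T]}\Big|\int_{0}^{\infty}Y_{t}(x)x^{-\alpha}\big(\mu_{n}(dx)-dx\big)\Big|\Big\|_{L^{p}(\Omega)}<\infty.$$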
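The construction in Lemma 2 can be sketched in a few lines of NumPy. This is an illustration under stated assumptions rather than the author's implementation: the nodes on each subinterval are taken to be Gauss–Legendre points (one possible interpolatory choice), the parameters $\gamma=1/2-H$ and $\delta=H$ are borrowed from the proof of Theorem 1, and the function name `ou_quadrature` is hypothetical. The returned values `c` are effective weights, i.e. they already include the factor $x^{-\alpha}$, so that $\sum_{j}c_{j}f(x_{j})\approx\int f(x)x^{-\alpha}dx$ on the truncated interval. Non-negativity of the weights is not enforced here and would have to be checked separately.

```python
import numpy as np


def ou_quadrature(H, m, n, r):
    """Nodes x and effective weights c on the geometric grid of Lemma 2.

    The rule is exact for f(x) = x^k, k < m, integrated against x^(-alpha)
    over each subinterval [xi_i, xi_{i+1}].
    """
    alpha, gamma, delta = H + 0.5, 0.5 - H, H
    xi0, xin = n ** (-r / gamma), n ** (r / delta)
    lam = (xin / xi0) ** (1.0 / n)            # common ratio of the grid
    xi = xi0 * lam ** np.arange(n + 1)        # xi_0, ..., xi_n

    # Reference rule on [1, lam]; by scaling it serves every subinterval.
    t, _ = np.polynomial.legendre.leggauss(m)
    u = 1.0 + 0.5 * (lam - 1.0) * (t + 1.0)   # Gauss-Legendre nodes in [1, lam]
    k = np.arange(m)
    moments = (lam ** (k + 1 - alpha) - 1.0) / (k + 1 - alpha)
    vander = np.vander(u, m, increasing=True).T   # vander[k, j] = u_j ** k
    c_ref = np.linalg.solve(vander, moments)      # interpolatory weights

    # Scale the reference rule to [xi_i, lam * xi_i] for i = 0, ..., n - 1.
    x = np.outer(xi[:-1], u).ravel()
    c = np.outer(xi[:-1] ** (1.0 - alpha), c_ref).ravel()
    return x, c


# Assumed example: H = 0.1, m = 3 points per interval, n = 10 intervals,
# and r = 0.15, which is below the bound 2Hm/3 = 0.2.
x, c = ou_quadrature(H=0.1, m=3, n=10, r=0.15)
```

Because the grid is geometric, the moment problem only has to be solved once on the reference interval $[1,\lambda_{n}]$ with $\lambda_{n}=(\xi_{n,n}/\xi_{n,0})^{1/n}$; all other subintervals follow by scaling. The nodes `x` play the role of the speeds of mean reversion in the discretized representation.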

Proof.

We define the constants

[TABLE]

where the upper bound on $C_{2}$ follows from Bernoulli’s inequality

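$$\forall\lambda\in[0,\infty)\colon\quad \lambda^{1-\alpha-\beta+m}=(1+(\lambda-1))^{1-\alpha-\beta+m}\geq 1+(1-\alpha-\beta+m)(\lambda-1),$$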

the finiteness of $C_{3}$ follows from the inequality

[TABLE]

and the finiteness of $C_{4}$ follows from the fact that $n^{-1}\log(n)$ tends to zero as $n\to\infty$ . Recall that the measures $\mu_{n,i}$ are by assumption non-negative $m$ -point interpolatory quadrature rules. Therefore, the corresponding quadrature error can be expressed as follows [13, Theorem 4.2.3]: for each $t\in[0,T]$ , $n\in\mathbb{N}$ , and $i\in\{0,\dots,n-1\}$ , one has

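$$\int_{\xi_{n,i}}^{\xi_{n,i+1}}Y_{t}(x)x^{-\alpha}\big(\mu_{n}(dx)-dx\big)=\int_{\xi_{n,i}}^{\xi_{n,i+1}}\partial_{x}^{m}Y_{t}(x)K_{n,i}(x)\,dx,$$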

where the Peano kernel $K_{n,i}\colon[\xi_{n,i},\xi_{n,i+1}]\to\mathbb{R}$ is a measurable function which satisfies [13, Theorem 5.7.1]

[TABLE]

Thus, one has for each $n\in\mathbb{N}$ that

[TABLE]

This can be expressed as a geometric series: letting $\lambda_{n}=(\xi_{n,n}/\xi_{n,0})^{1/n}$ , one has for each $n\in\mathbb{N}$ that

[TABLE]

Absorbing the denominator into one of the factors $(\lambda_{n}-1)$ and discarding the term $\xi_{n,0}$ yields for each $n\in\mathbb{N}$ that

[TABLE]

For each $n\in\mathbb{N}\cap[C_{4},\infty)$ , this can be estimated by

[TABLE]

Therefore, noting that $n^{r}=\xi_{n,0}^{-\gamma}=\xi_{n,n}^{\delta}$ , one has

[TABLE]

Remark 1.

The choice of the quadrature rule in Lemma 2 is admittedly somewhat arbitrary but produces good results. The use of the geometric grid $\xi_{n,i}$ goes back to [15] and simplifies the error analysis compared to more complex subdivisions which distribute the error more equally. It would be interesting to explore if the holomorphicity of $x\mapsto Y_{t}(x)$ permits the use of quadrature rules with exponential convergence rates such as Chebychev quadrature; see the discussion in Section 3.

5. Rough Bergomi model

The following lemma establishes that prices of put options in the rough Bergomi model converge at the same rate as the approximated Volterra processes. This holds not only for the Ornstein–Uhlenbeck approximations of Lemma 2, but more generally for any approximation of the log-volatility in the $L^{2}([0,T]\times\Omega)$ norm. Below, the space of real-valued Lipschitz functions $f\colon\mathbb{R}\to\mathbb{R}$ is denoted by $\operatorname{Lip}(\mathbb{R})$ and endowed with the norm $\|f\|_{\operatorname{Lip}(\mathbb{R})}=|f(0)|+\sup_{x\neq y}|f(y)-f(x)||y-x|^{-1}$ .

Lemma 3.

Assume the setting of Section 2, let $V,\vphantom{V}\smash{\tilde{V}},S,\vphantom{S}\smash{\tilde{S}}\colon[0,T]\times\Omega\to\mathbb{R}$ be continuous stochastic processes with $V_{0}=\vphantom{V}\smash{\tilde{V}}_{0}=0$ and

[TABLE]

and let $f\colon(0,\infty)\to\mathbb{R}$ be a measurable function such that $f\circ\exp\in\operatorname{Lip}(\mathbb{R})$ . Then

[TABLE]

Proof.

It is sufficient to control the log prices in $L^{1}$ because

[TABLE]

The basic inequality

[TABLE]

and the Burkholder–Davis–Gundy inequality [10, Theorem 1.2] imply that

[TABLE]

Remark 2.

For each $K\in(0,\infty)$ the put-option payoff

[TABLE]

satisfies the assumption of Lemma 3 that $f\circ\exp\in\operatorname{Lip}(\mathbb{R})$ because

[TABLE]

The call-option payoff does not have this property, but the prices of call options can be obtained by put-call parity if $W$ and $B$ are negatively correlated because this implies that $S$ is a martingale [23].
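One way to see the Lipschitz property of the put payoff is sketched below. It assumes the standard payoff $f(s)=(K-s)^{+}$ and writes $g:=f\circ\exp$; this is an illustrative argument and not necessarily the one given in the paper.

```latex
g(x) := (K-e^{x})^{+}, \qquad
g'(x) = -e^{x}\,\mathbf{1}_{\{x<\log K\}} \quad (x\neq\log K), \qquad
|g'(x)| \le K,
\\[4pt]
|g(y)-g(x)| \le K\,|y-x|, \qquad
\|g\|_{\operatorname{Lip}(\mathbb{R})} \le (K-1)^{+} + K .
```

For the call payoff $(s-K)^{+}$ the corresponding derivative $e^{x}\mathbf{1}_{\{x>\log K\}}$ is unbounded, which is why the Lipschitz assumption fails in that case.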

6. Main result

The following theorem combines the analyses of Lemmas 1–3 to show that Volterra Brownian motion can be approximated numerically at arbitrarily high polynomial convergence rates $r$ . The same convergence rate $r$ is inherited by the associated put prices in the rough Bergomi model.

Theorem 1.

Assume the setting of Section 2. For any given $r\in(0,\infty)$ , the following statements hold:

(a)

Volterra Brownian motion can be approximated at rate $n^{-r}$ by a sum of $n$ Ornstein–Uhlenbeck processes in the following sense: for each $n\in\mathbb{N}$ there are speeds of mean reversion $x_{n,i}\in(0,\infty)$ and weights $w_{n,i}\in(0,\infty)$ , $1\leq i\leq n$ , such that the continuous versions $W^{H}$ and $W^{H,n}$ of the stochastic integrals

[TABLE]

satisfy

[TABLE]

(b)

Under the above approximation, put prices in the rough Bergomi model converge at rate $n^{-r}$ in the following sense: the processes $S$ and $S^{n}$ defined for all $t\in[0,T]$ and $n\in\mathbb{N}$ by

[TABLE]

satisfy for all strikes $K\in[0,\infty)$ that

[TABLE]

Proof.

(a) follows from the integral representation in Lemma 1 and its discretization in Lemma 2. More precisely, the $m$-point quadrature rule in Lemma 2 converges at any rate $r<\delta m/(1-\alpha-\beta+\delta+m)=2Hm/3$, where the parameters $\alpha=H+1/2$, $\beta=m-1$, $\gamma=1/2-H$, and $\delta=H$ are as in Lemma 1. The speeds of mean reversion $x_{n,i}$ and weights $w_{n,i}$ are determined by the relation $\mu_{n}=\sum_{i}w_{n,i}\delta_{x_{n,i}}$, where $\mu_{n}$ is as in Lemma 2. Moreover, (b) follows from (a) and Lemma 3. ∎
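The identity for the rate bound used in the proof can be verified by direct substitution of $\alpha=H+1/2$, $\beta=m-1$, and $\delta=H$:

```latex
1-\alpha-\beta+\delta+m
  = 1-\big(H+\tfrac{1}{2}\big)-(m-1)+H+m
  = \tfrac{3}{2},
\qquad
\frac{\delta m}{1-\alpha-\beta+\delta+m}
  = \frac{Hm}{3/2}
  = \frac{2Hm}{3}.
```

The same substitution gives $1-\alpha-\beta+m=\tfrac{3}{2}-H>0$, so the standing assumption of Lemma 2 is satisfied for every $H\in(0,1/2)$.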

Remark 3.

The proof of Theorem 1 shows that $m$ -point interpolatory quadrature on $n$ suitably chosen spatial quadrature intervals leads to a discretization error of order $n^{-r}$ for all $r\in(0,2Hm/3)$ .

Appendix A Auxiliary results

The space $C([0,T],\mathbb{R})$ of continuous real-valued functions on an interval $[0,T]$ is Banach with the supremum norm. Moreover, the space $C^{\infty}((0,\infty),\mathbb{R})$ of smooth real-valued functions on $(0,\infty)$ is locally convex with the family of seminorms $f\mapsto\sup_{x\in K}|\partial_{x}^{k}f(x)|$ , where $K$ runs through the compact subsets of $(0,\infty)$ and $k$ through the natural numbers. Similarly, the space $C([0,T],C^{\infty}((0,\infty),\mathbb{R}))$ is locally convex with the family of seminorms $f\mapsto\sup_{t\in[0,T]}\sup_{x\in K}|\partial_{x}^{k}f(t)(x)|$ for $K$ and $k$ as before.

Lemma 4.

Assume the setting of Section 2. Then the following function is continuous:

[TABLE]

Proof.

It is sufficient to show for each $k\in\mathbb{N}$ and each compact $K\subset(0,\infty)$ that the following mapping is continuous:

[TABLE]

This is obvious because this is a bounded linear map between Banach spaces. ∎

The following maximal inequality for Ornstein–Uhlenbeck processes has been shown by [25, Theorem 2.5 and Remark 2.6].

Lemma 5.

Assume the setting of Section 2. For each $x\in(0,\infty)$ , let $Y(x)\colon\Omega\to C([0,T],\mathbb{R})$ be a measurable map such that

[TABLE]

Then there exists a universal constant $C_{1}\in(0,2)$ such that the following maximal inequality holds:

[TABLE]

The following result has been shown in [26, Theorem 2.11]. We reproduce the argument here and give a simpler proof of measurability. Recall from Section 2 that $\mu=x^{-\alpha}dx$ is a sigma-finite measure on $(0,\infty)$ and that, accordingly, the space $L^{1}((0,\infty),\mu)$ of $\mu$ -integrable functions is a separable Banach space. Its intersection with the locally convex space $C((0,\infty),\mathbb{R})$ is again locally convex with the union of the corresponding families of seminorms.

Lemma 6.

Assume the setting of Section 2, and let $Y\colon\Omega\to C([0,T],C((0,\infty),\mathbb{R}))$ be a measurable map such that

[TABLE]

Then $Y$ almost surely takes values in the space $C([0,T],L^{1}((0,\infty),\mu))$ and is measurable as a map

[TABLE]

Proof.

The expression

[TABLE]

is well-defined thanks to the continuity in $t$ of $Y_{t}(x)$ , and is finite thanks to Lemma 5. Thus, the dominated convergence theorem implies that $Y$ has continuous sample paths in $L^{1}((0,\infty),\mu)$ . It remains to show that $Y\colon\Omega\to C([0,T],L^{1}((0,\infty),\mu))$ is measurable. As the Borel sigma algebra on $C([0,T],L^{1}((0,\infty),\mu))$ is generated by point evaluations at $t\in[0,T]$ [5, Lemma 4.53], it suffices to show for each $t\in[0,T]$ that $Y_{t}\colon\Omega\to L^{1}((0,\infty))$ is measurable. Moreover, by Pettis’ measurability theorem [29, Proposition 1.1.1] and the separability of $L^{1}((0,\infty),\mu)$ , it suffices to show that $Y_{t}$ is weakly measurable, i.e., that $\int_{0}^{\infty}Y_{t}(x)f(x)\mu(dx)\colon\Omega\to\mathbb{R}$ is measurable for each $f\in L^{\infty}((0,\infty),\mu)$ . This follows by approximation

[TABLE]

where for each $n\in\mathbb{N}$ , $(\mu_{n,m})_{m\in\mathbb{N}}$ is a sequence of atomic signed measures on the interval $[1/n,n]$ which converges weakly to the signed measure $f\mu$ on the same interval. ∎

Bibliography

[1] Eduardo Abi Jaber. “Lifting the Heston model”. In: Quantitative Finance 19.12. Taylor & Francis, 2019, pp. 1995–2013.
[2] Eduardo Abi Jaber and Omar El Euch. “Markovian structure of the Volterra Heston model”. In: Statistics & Probability Letters 149. Elsevier, 2019, pp. 63–72.
[3] Eduardo Abi Jaber and Omar El Euch. “Multifactor approximation of rough volatility models”. In: SIAM Journal on Financial Mathematics 10.2. SIAM, 2019, pp. 309–349.
[4] Eduardo Abi Jaber, Martin Larsson and Sergio Pulido. “Affine Volterra processes”. In: The Annals of Applied Probability 29.5. Institute of Mathematical Statistics, 2019, pp. 3155–3200.
[5] Charalambos D. Aliprantis and Kim C. Border. “Infinite dimensional analysis. A Hitchhiker’s guide”. Springer, 2006.
[6] Elisa Alòs, Jorge A. León and Josep Vives. “On the short-time behavior of the implied volatility for jump-diffusion models with stochastic volatility”. In: Finance and Stochastics 11.4. Springer, 2007, pp. 571–589.
[7] Christian Bayer, Peter Friz and Jim Gatheral. “Pricing under rough volatility”. In: Quantitative Finance 16.6. Taylor & Francis, 2016, pp. 887–904.
[8] Christian Bayer et al. “A regularity structure for rough volatility”. In: Mathematical Finance 30.3. Wiley Online Library, 2020, pp. 782–832.
