Short-time near-the-money skew in rough fractional volatility models

Christian Bayer; Peter K. Friz; Archil Gulisashvili; Blanka Horvath,; Benjamin Stemper

arXiv:1703.05132·q-fin.PR·March 12, 2018

Short-time near-the-money skew in rough fractional volatility models

Christian Bayer, Peter K. Friz, Archil Gulisashvili, Blanka Horvath,, Benjamin Stemper

PDF

TL;DR

This paper advances the understanding of short-time near-the-money skew in rough fractional volatility models by deriving higher order moderate deviation estimates, extending the applicability of skew approximation formulas in option pricing.

Contribution

It sharpens large deviation results for rough volatility models, enabling analysis in a broader moderate deviations regime around the money.

Findings

01

Extended the range of at-the-money skew approximation formulas

02

Derived higher order moderate deviation estimates for rough volatility models

03

Enhanced analytical tractability in the near-the-money regime

Abstract

We consider rough stochastic volatility models where the driving noise of volatility has fractional scaling, in the "rough" regime of Hurst parameter $H < 1/2$ . This regime recently attracted a lot of attention both from the statistical and option pricing point of view. With focus on the latter, we sharpen the large deviation results of Forde-Zhang (2017) in a way that allows us to zoom-in around the money while maintaining full analytical tractability. More precisely, this amounts to proving higher order moderate deviation estimates, only recently introduced in the option pricing context. This in turn allows us to push the applicability range of known at-the-money skew approximation formulae from CLT type log-moneyness deviations of order $t^{1/2}$ (recent works of Al\`{o}s, Le\'{o}n & Vives and Fukasawa) to the wider moderate deviations regime.

Figures3

Click any figure to enlarge with its caption.

Equations400

\frac{d S _{t}}{S _{t}} = σ (B_{t}) d (\overline{ρ} W_{t} + ρ B_{t}) .

\frac{d S _{t}}{S _{t}} = σ (B_{t}) d (\overline{ρ} W_{t} + ρ B_{t}) .

σ_{stoch} (t, ω) := σ (B_{t} (ω)) \equiv σ (B) .

σ_{stoch} (t, ω) := σ (B_{t} (ω)) \equiv σ (B) .

B_{t} = \int_{0}^{t} K (t, s) d B_{s}, t \geq 0,

B_{t} = \int_{0}^{t} K (t, s) d B_{s}, t \geq 0,

d X_{t} = - \frac{1}{2} σ^{2} (B_{t}) d t + σ (B_{t}) d (\overline{ρ} W + ρB), X_{0} = 0.

d X_{t} = - \frac{1}{2} σ^{2} (B_{t}) d t + σ (B_{t}) d (\overline{ρ} W + ρB), X_{0} = 0.

(B_{t s}, W_{t s})_{s \geq 0} = l a w ε (B_{s}, W_{s})_{s \geq 0}, where ε \equiv ε (t) \equiv t^{1/2} .

(B_{t s}, W_{t s})_{s \geq 0} = l a w ε (B_{s}, W_{s})_{s \geq 0}, where ε \equiv ε (t) \equiv t^{1/2} .

(B_{t s} : 0 \leq s \leq t_{0}) = l a w (ε B_{s} : 0 \leq s \leq t_{0}) .

(B_{t s} : 0 \leq s \leq t_{0}) = l a w (ε B_{s} : 0 \leq s \leq t_{0}) .

ε \equiv ε (t) \equiv t^{H} = ε^{2 H},

ε \equiv ε (t) \equiv t^{H} = ε^{2 H},

{\frac{1}{2} ∥ h ∥_{H_{0}^{1}}^{2} + \frac{1}{2} ∥ f ∥_{H_{0}^{1}}^{2}, + \infty, f, h \in H_{0}^{1} and f = K \dot{f}, otherwise,

{\frac{1}{2} ∥ h ∥_{H_{0}^{1}}^{2} + \frac{1}{2} ∥ f ∥_{H_{0}^{1}}^{2}, + \infty, f, h \in H_{0}^{1} and f = K \dot{f}, otherwise,

K \dot{f} (t) : = \int_{0}^{t} K (t, s) \dot{f} (s) d s

K \dot{f} (t) : = \int_{0}^{t} K (t, s) \dot{f} (s) d s

H_{0}^{1} : = {f : [0, 1] \to R continuous ∥ f ∥_{H_{0}^{1}}^{2} : = \int_{0}^{1} \dot{f} (s)^{2} d s < \infty, f (0) = 0} .

H_{0}^{1} : = {f : [0, 1] \to R continuous ∥ f ∥_{H_{0}^{1}}^{2} : = \int_{0}^{1} \dot{f} (s)^{2} d s < \infty, f (0) = 0} .

d X_{t}^{ε} = σ (ε B_{t}) ε d (\overline{ρ} W_{t} + ρ B_{t}) - \frac{1}{2} ε^{2} σ^{2} (ε B_{t}) d t, X_{0}^{ε} = 0.

d X_{t}^{ε} = σ (ε B_{t}) ε d (\overline{ρ} W_{t} + ρ B_{t}) - \frac{1}{2} ε^{2} σ^{2} (ε B_{t}) d t, X_{0}^{ε} = 0.

d X_{t}^{ε} \equiv d (\frac{ε}{ε} X_{t}^{ε}) = σ (ε B_{t}) ε d (\overline{ρ} W_{t} + ρ B_{t}) - \frac{1}{2} ε ε σ^{2} (ε B_{t}) d t, X_{0}^{ε} = 0.

d X_{t}^{ε} \equiv d (\frac{ε}{ε} X_{t}^{ε}) = σ (ε B_{t}) ε d (\overline{ρ} W_{t} + ρ B_{t}) - \frac{1}{2} ε ε σ^{2} (ε B_{t}) d t, X_{0}^{ε} = 0.

φ_{1} (h, f) : = Φ_{1} (h, f, f) = \int_{0}^{1} σ (f) d (\overline{ρ} h + ρ f),

φ_{1} (h, f) : = Φ_{1} (h, f, f) = \int_{0}^{1} σ (f) d (\overline{ρ} h + ρ f),

I (x) = h, f \in H_{0}^{1} in f {\frac{1}{2} \int_{0}^{1} \dot{h}^{2} d t + \frac{1}{2} \int_{0}^{1} \dot{f}^{2} d t : φ_{1} (h, f) = x} = f \in H_{0}^{1} in f ⎩ ⎨ ⎧ \frac{1}{2} \frac{( x - ρ ⟨ σ ( f ) , f ˙ ⟩ ) ^{2}}{ρ ^{2} ⟨ σ ^{2} ( f ) , 1 ⟩} + \frac{1}{2} \int_{0}^{1} \dot{f}^{2} d t ⎭ ⎬ ⎫,

I (x) = h, f \in H_{0}^{1} in f {\frac{1}{2} \int_{0}^{1} \dot{h}^{2} d t + \frac{1}{2} \int_{0}^{1} \dot{f}^{2} d t : φ_{1} (h, f) = x} = f \in H_{0}^{1} in f ⎩ ⎨ ⎧ \frac{1}{2} \frac{( x - ρ ⟨ σ ( f ) , f ˙ ⟩ ) ^{2}}{ρ ^{2} ⟨ σ ^{2} ( f ) , 1 ⟩} + \frac{1}{2} \int_{0}^{1} \dot{f}^{2} d t ⎭ ⎬ ⎫,

E [(e^{X_{1}^{ε}} - e^{x_{ε}})^{+}] \leq exp (- \frac{x ^{2} + o ( 1 )}{2 σ _{0}^{2} ε ^{4 H - 4 β}}) .

E [(e^{X_{1}^{ε}} - e^{x_{ε}})^{+}] \leq exp (- \frac{x ^{2} + o ( 1 )}{2 σ _{0}^{2} ε ^{4 H - 4 β}}) .

E [(e^{X_{1}^{ε}} - e^{x ε^{1 - 2 H}})^{+}] = exp (- \frac{I ( x ) + o ( 1 )}{ε ^{4 H}}) .

E [(e^{X_{1}^{ε}} - e^{x ε^{1 - 2 H}})^{+}] = exp (- \frac{I ( x ) + o ( 1 )}{ε ^{4 H}}) .

I (x ε^{2 β}) \sim \frac{1}{2} I^{''} (0) x^{2} ε^{4 β} = \frac{1}{2 σ _{0}^{2}} x^{2} ε^{4 β},

I (x ε^{2 β}) \sim \frac{1}{2} I^{''} (0) x^{2} ε^{4 β} = \frac{1}{2 σ _{0}^{2}} x^{2} ε^{4 β},

H = {K \dot{f} ∣ f \in H_{0}^{1}} .

H = {K \dot{f} ∣ f \in H_{0}^{1}} .

I (x) = \frac{1}{σ _{0}^{2}} \frac{x ^{2}}{2} - (6 ρ \frac{σ _{0}^{'}}{σ _{0}^{4}} \int_{0}^{1} \int_{0}^{t} K (t, s) d s d t) \frac{x ^{3}}{3 !} + O (x^{4}) .

I (x) = \frac{1}{σ _{0}^{2}} \frac{x ^{2}}{2} - (6 ρ \frac{σ _{0}^{'}}{σ _{0}^{4}} \int_{0}^{1} \int_{0}^{t} K (t, s) d s d t) \frac{x ^{3}}{3 !} + O (x^{4}) .

c (x, t)

c (x, t)

J (ε, x) := E [e^{- \frac{I ^{'} ( x )}{ε ^{2}} U^{ε}} (exp (\frac{ε}{ε} U^{ε}) - 1) e^{I^{'} (x) R_{2}^{ε}} 1_{U^{ε} \geq 0}]

J (ε, x) := E [e^{- \frac{I ^{'} ( x )}{ε ^{2}} U^{ε}} (exp (\frac{ε}{ε} U^{ε}) - 1) e^{I^{'} (x) R_{2}^{ε}} 1_{U^{ε} \geq 0}]

U^{ε} = ε g_{1} + ε^{2} R_{2}^{ε}

U^{ε} = ε g_{1} + ε^{2} R_{2}^{ε}

U^{ε} = ε g_{1} + ε^{2} R_{2}^{ε} \equiv ε σ W_{1} - ε^{2} σ^{2} /2

U^{ε} = ε g_{1} + ε^{2} R_{2}^{ε} \equiv ε σ W_{1} - ε^{2} σ^{2} /2

J (ε, x)

J (ε, x)

M (β) := E [e^{β W_{1}} 1_{{W_{1} \geq \frac{ε σ}{2}}}] = e^{β^{2} /2} Φ (β - \frac{ε σ}{2}) .

M (β) := E [e^{β W_{1}} 1_{{W_{1} \geq \frac{ε σ}{2}}}] = e^{β^{2} /2} Φ (β - \frac{ε σ}{2}) .

J (ε, x) \sim \frac{e ^{- x /2}}{2 π} \frac{ε ^{3} σ ^{3}}{x ^{2}} .

J (ε, x) \sim \frac{e ^{- x /2}}{2 π} \frac{ε ^{3} σ ^{3}}{x ^{2}} .

\forall x > 0, β \in [0, 1/2) : J (ε, x ε^{2 β}) \sim \frac{1}{2 π} \frac{ε ^{3 - 4 β} σ ^{3}}{x ^{2}} .

\forall x > 0, β \in [0, 1/2) : J (ε, x ε^{2 β}) \sim \frac{1}{2 π} \frac{ε ^{3 - 4 β} σ ^{3}}{x ^{2}} .

\forall x > 0, β > 1/2 : J (ε, x ε^{2 β}) \sim \frac{1}{2 π} ε σ = const \times ε

\forall x > 0, β > 1/2 : J (ε, x ε^{2 β}) \sim \frac{1}{2 π} ε σ = const \times ε

\forall x > 0 : J (ε, x ε) \sim a (x; σ) ε e^{\frac{x ^{2}}{2 σ ^{2}}} = const \times ε .

\forall x > 0 : J (ε, x ε) \sim a (x; σ) ε e^{\frac{x ^{2}}{2 σ ^{2}}} = const \times ε .

- lo g c_{B S} (k_{t}, t) = \frac{1}{t ^{1 - 2 β}} \frac{k ^{2}}{2 σ ^{2}} (1 + o (1)) as t ↓ 0.

- lo g c_{B S} (k_{t}, t) = \frac{1}{t ^{1 - 2 β}} \frac{k ^{2}}{2 σ ^{2}} (1 + o (1)) as t ↓ 0.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Short-time near-the-money skew in rough fractional volatility models

C. Bayer, P. K. Friz, A. Gulisashvili, B. Horvath, B. Stemper

WIAS Berlin, TU and WIAS Berlin, Ohio University, Imperial College London, TU and WIAS Berlin

[email protected], [email protected], [email protected], [email protected], [email protected]

Abstract.

We consider rough stochastic volatility models where the driving noise of volatility has fractional scaling, in the ”rough” regime of Hurst parameter $H<1/2$ . This regime recently attracted a lot of attention both from the statistical and option pricing point of view. With focus on the latter, we sharpen the large deviation results of Forde-Zhang (2017) in a way that allows us to zoom-in around the money while maintaining full analytical tractability. More precisely, this amounts to proving higher order moderate deviation estimates, only recently introduced in the option pricing context. This in turn allows us to push the applicability range of known at-the-money skew approximation formulae from CLT type log-moneyness deviations of order $t^{1/2}$ (recent works of Alòs, León & Vives and Fukasawa) to the wider moderate deviations regime.

Key words and phrases:

rough stochastic volatility model, European option pricing, small-time asymptotics, moderate deviations

2010 Mathematics Subject Classification:

91G20, 60H30, 60F10, 60H07, 60G22, 60G18

We gratefully acknowledge financial support through DFG research grants FR2943/2 and BA5484/1 (C. Bayer, P.K. Friz, B. Stemper), European Research Council Grant CoG-683166 (P.K. Friz), and SNF Early Postdoc Mobility Grant 165248 (B. Horvath) respectively.

1 Introduction
2 Exposition and assumptions
3 Main results
4 Simulation results
5 Proof of the energy expansion
5.1 Smoothness of the energy
5.1.1 The uncorrelated case
5.1.2 The general case
5.2 Energy expansion
5.2.1 Expansion of the minimizing configuration
5.2.2 Energy expansion in the general case
5.2.3 Energy expansion for the Riemann-Liouville kernel
6 Proof of the pricing formula
7 Proof of the moderate deviation expansions
8 Proof of the implied volatility expansion
A Auxiliary lemmas

1. Introduction

Since the groundbreaking work of Gatheral, Jaisson and Rosenbaum [GJR14a], the past two years have brought about a gradual shift in volatility modeling, leading away from classical diffusive stochastic volatility models towards so-called rough volatility models. The term was coined in [GJR14a] and [BFG16], and it essentially describes a family of (continuous-path) stochastic volatility models where the driving noise of the volatility process has Hölder regularity lower than Brownian motion, typically achieved by modeling the fundamental noise innovations of the volatility process as a fractional Brownian motion with Hurst exponent (and hence Hölder regularity) $H<1/2$ . Here, we would also like to mention pioneering work on asymptotics for rough volatility models in [ALV07] and [Fuk11]. A major appeal of such rough volatility models lies in the fact that they effectively capture several stylized facts of financial markets both from a statistical [GJR14a, BLP16] and an option-pricing point of view [BFG16]. In particular, with regards to the latter point of view, a widely observed empirical phenomenon in equity markets is the “steepness of the smile on the short end” describing the fact that as time to maturity becomes small the empirical implied volatility skew follows a power law with negative exponent, and thus becomes arbitrarily large near zero. While standard stochastic volatility models with continuous paths struggle to capture this phenomenon, predicting instead a constant at-the-money implied volatility behaviour on the short end [Gat11], models in the fractional stochastic volatility family (and more specifically so-called rough volatility models) constitute a class, well-tailored to fit empirical implied volatilities for short dated options.

Typically, the popularity of asset pricing models hinges on the availability of efficient numerical pricing methods. In the case of diffusions, these include Monte Carlo estimators, PDE discretization schemes, asymptotic expansions and transform methods. With fractional Brownian motion being the prime example of a process beyond the semimartingale framework, most currently prevalent option pricing methods – particularly the ones assuming semimartingality or Markovianity – may not easily carry over to the rough setting. In fact, the memory property (aka non-Markovianity) of fractional Brownian motion rules out PDE methods, heat kernel methods and all related methods involving a Feynman-Kac-type Ansatz. Previous work has thus focused on finding efficient Monte Carlo simulation schemes [BFG16, BLP17, BFG*+*17] or – in the special case of the Rough Heston model – on an explicit formula for the characteristic function of the log-price (see [ER16]), thus in this particular model making pricing amenable to Fourier based methods. In our work, we rely on small-maturity approximations of option prices. This is a well-studied topic. See, e.g., [ALV07, GVZ15] (the at-the-money (ATM) regime) or [DFJV14a, DFJV14b, GJR14b, GHJ16, GVZ15] (the out-of-the-money (OTM) regime, where large deviations results are used). We also refer the reader to the papers [Fuk11, Fuk17, FZ17] concerning large deviations, and to [MT16, Osa07, MS03, MS07] for related work. Based on the moderate deviations regime, Friz et al. [FGP17] have recently introduced another regime called moderately-out-of-the-money (MOTM), which, in a sense, effectively navigates between the two regimes mentioned above, by rescaling the strike with respect to the time to maturity. This approach has various advantages. On the one hand, it reflects the market reality that as time to maturity approaches zero, strikes with acceptable bid-ask spreads tend to move closer to the money (see [FGP17] for more details). On the other hand, it allows us to zoom in on the term structure of implied volatility around the money at a high resolution scale. To be more specific, our paper adds to the existing literature in two ways. First, we obtain a generalization of the Osajima energy expansion [Osa15] to a non-Markovian case, and using the new expansion, we extend the analysis of [FGP17] to the case, where the volatility is driven by a rough $(H<1/2)$ fractional Brownian motion. Indeed, Laplace approximation methods on Wiener space in the spirit of Ben Arous [BA88] and Bismut [Bis84] remain valid in the present context, and our analysis builds upon this framework in a fractional setting. Second, we use an asymptotic expansion going back to Azencott [Aze85] to bypass the need for deriving an asymptotic expansion of the density of the underlying process to obtain asymptotics for option prices. We display the potential prowess of this approach by applying it to our specific model, and derive asymptotics for call prices directly, irrespectively of corresponding density asymptotics. Finally, using a version of the ”rough Bergomi model” [BFG16], we demonstrate numerically that our implied volatility asymptotics capture very well the geometry of the term structure of implied volatility over a wide array of maturities, extending up to a year.

The paper is organized as follows: In Section 2 we set the scene, describing the class of models included in our framework ((2.1) and (2.2)) and recalling some known results ((2.4) and (2.7)), which are the starting point of our analysis. Most importantly, we argue that for small-time considerations it would suffice to restrict our attention to a class of stochastic volatility models of the form (2.3) with a volatility process driven by a Gaussian Volterra process such as in (2.2). We formulate general assumptions on the Volterra kernel (Assumptions 2.1 and 2.5) and on the function $\sigma$ in (2.3) (Assumption 2.4) under which our results are valid. In Section 3 we gather our main results, concerning a higher order expansion of the energy (Theorem 3.1), and a general expansion formula for the corresponding call prices. We derive the classical Black-Scholes expansion for the call price, using the latter result mentioned above. In addition, in Section 3 we formulate moderate deviation expansions, which allow us to derive the corresponding asymptotic formulae for implied volatilities and implied volatility skews. Finally, Section 4 displays our simulation results. Sections 5, 6 and 7 are devoted to proofs of the energy expansion, the price expansion and the moderate deviations expansion, respectively. In the appendix, we have collected some auxiliary lemmas, which are used in different sections.

2. Exposition and assumptions

We consider a rough stochastic volatility model, normalized to $r=0$ and $S_{0}=1$ , of the form suggested by Forde-Zhang [FZ17]

[TABLE]

Here $\left(W,B\right)$ are two independent standard Brownian motions, $\rho\in(-1,1)$ a correlation parameter, and $\overline{\rho}^{2}=1-\rho^{2}$ . Then $\overline{\rho}W+\rho B$ is another standard Brownian motion which has constant correlation $\rho$ with the factor $B$ , which drives the stochastic volatility

[TABLE]

Here $\sigma(.)$ is some real-valued function, typically smooth but not bounded, and we will denote by $\sigma_{0}:=\sigma(0)$ the spot volatility, with $\widehat{B}$ a Gaussian (Volterra) process of the form

[TABLE]

for some kernel $K$ , which shall be further specified in Assumptions 2.1 and 2.5 below. The log-price ${X}_{t}=\log\left(S_{t}\right)$ satisfies

[TABLE]

Recall that by Brownian scaling, for fixed $t>0$ ,

[TABLE]

As a direct consequence, classical short-time SDE problems can be analyzed as small-noise problems on a unit time horizon. For our analysis, it will also be crucial to impose such a scaling property on the Gaussian process $\widehat{B}$ (more precisely, on the kernel $K$ in (2.2)) driving the volatility process in our model:

Assumption 2.1 (Small time self-similarity).

There exists a number $t_{0}$ with $0<t_{0}\leq 1$ and a function $t\mapsto\widehat{\varepsilon}=\widehat{\varepsilon}(t)$ , $0\leq t\leq t_{0}$ , such that

[TABLE]

In fact, we will always have

[TABLE]

which covers the examples of interest, in particular standard fractional Brownian motion $\widehat{B}=B^{H}$ or Riemann-Liouville fBM with explicit kernel $K\left(t,s\right)=\sqrt{2H}\left\lvert t-s\right\rvert^{H-1/2}$ . (This is very natural, even from a general perspective of self-similar processes, see [Lam62].)

We insist that no (global) self-similarity of $\widehat{B}$ is required, as only $\widehat{B}|_{[0,t]}$ for arbitrarily small $t$ matters.

*Remark 2.2**.*

It should be possible to replace the fractional Brownian motion by a certain fractional Ornstein-Uhlenbeck process in the results obtained in this paper. Intuitively, this replacement creates a negligible perturbation (for $t\ll 1$ ) of the fBm environment. A similar situation was in fact encountered in [CF10], where fractional scaling at times near zero was important. To quantify the perturbation, the authors of [CF10] introduced an easy to verify coupling condition (see Corollary 2 in [CF10]). It should be possible to employ a version of this condition in the present paper to justify the replacement mentioned above. We will however not pursue this point further here.

*Remark 2.3**.*

Throughout this article, one can consider a classical (Markovian, diffusion) stochastic volatility setting by taking $K\equiv 1$ , or equivalently $H\equiv 1/2$ , by simply ignoring all hats ( $\widehat{\cdot}$ ) in the sequel. In particular then, $\frac{\widehat{\varepsilon}}{\varepsilon}\equiv 1$ in all subsequent formulae.

General facts on large deviations of Gaussian measures on Banach spaces [DS89] such as the path space $C([0,1],\mathbb{R}^{3})$ imply that a large deviation principle holds for the triple $\{\widehat{\varepsilon}(W,B,\widehat{B}):\widehat{\varepsilon}>0\}$ , with speed $\widehat{\varepsilon}^{2}$ and rate function

[TABLE]

where

[TABLE]

for $f\in H_{0}^{1}$ , the space of absolutely continuous paths with $L^{2}$ derivative

[TABLE]

This enables us to derive a large deviations principle for $X$ in (2.3): the (local) small-time self-similarity property of $\widehat{B}$ (Assumption 2.1) implies that $X_{t}\overset{law}{=}X_{1}^{\varepsilon}$ where

[TABLE]

For what follows, it will be convenient to consider a rescaled version of (2.3)

[TABLE]

Under a linear growth condition on the function $\sigma$ , Forde–Zhang [FZ17] use the extended contraction principle to establish a large deviations principle for ( $\widehat{X}^{\varepsilon}_{1}$ ) with speed $\widehat{\varepsilon}^{2}$ . More precisely, with

[TABLE]

the rate function is given by

[TABLE]

where $\left\langle\cdot\,,\cdot\right\rangle$ denotes the inner product on $L^{2}\left([0,1],dt\right)$ . Several other proofs (under varying assumptions on $\sigma$ ) have appeared since [JPS17, BFG*+*17, Gul17].

As a matter of fact, this paper relies on moderate - rather than large - deviations, as emphasized in (iiic) below. To this end, let us make

Assumption 2.4.

(i)

(Positive spot vol) Assume $\sigma:\mathbb{R}\rightarrow\mathbb{R}$ is smooth with $\sigma_{0}:=\sigma(0)>0$ . 2. (ii)

(Roughness) The Hurst parameter $H$ satisfies $H\in(0,1/2]$ . 3. (iiia)

(Martingality) The price process $S=\exp X$ is a martingale. 4. (iiib)

(Short-time moments) $\forall m<\infty\ \exists t>0$ : $\ E(S_{t}^{m})<\infty$ .

While condition (iiia) hardly needs justification, we emphasize that conditions (iiia-b) are only used to the extent that they imply condition (iiic) given below (which thus may replace (iiia-b) as an alternative, if more technical, assumption). The reason we point this out explicitly is that all the conditions (iiia-c) are implicit (growth) conditions on the function $\sigma(.)$ . For instance, (iiia-b) was seen to hold under a linear growth assumption [FZ17], whereas the log-normal volatility case (think of $\sigma(x)=e^{x}$ ) is complicated. Martingality, for instance, requires $\rho\leq 0$ and there is a critical moment $m^{*}=m^{*}(\rho)$ , even when $\rho<0$ . See [Sin98, Jou04, LM07] for the case $H=1/2$ and the forthcoming work [FG18] for the general rough case $H\in(0,1]$ . We view (iiic) simply as a more flexible condition that can hold in situations where (iiib) fails.

(iiic)

(Call price upper moderate deviation bound) For every $\beta\in(0,H)$ , and every fixed $x>0$ , and $\widehat{x}_{\varepsilon}:=x\varepsilon^{1-2H+2\beta}$ ,

[TABLE]

This condition is reminiscent of the “upper part” of the large deviation estimate obtained in [FZ17]

[TABLE]

If fact, if one formally applies this with $x$ replaced by $x\varepsilon^{2\beta}$ , followed by Taylor expanding the rate function,

[TABLE]

one readily arrives at the estimate (iiic). Unfortunately, $o(1)=o_{x}(1)$ in (2.8), which is a serious obstacle in making this argument rigorous. Instead, we will give a direct argument (Lemma 7.1) to see how (iiia-b) implies (iiic).

In the sequel, we will use another mild assumption on the kernel.

Assumption 2.5.

The kernel $K$ has the following properties

(i)

$\widehat{B}_{t}=\int_{0}^{t}K(t,s)dB_{s}$ has a continuous (in $t$ ) version on $[0,1]$ .

(ii)

$\forall t\in[0,1]:\ \int_{0}^{t}K(t,s)^{2}ds<\infty$ .

Note that the Riemann-Liouville kernel $K(t,s)=\sqrt{2H}(t-s)^{\gamma}$ , $\gamma=H-1/2$ satisfies Assumption 2.5.

*Remark 2.6**.*

Assumption 2.5 implies that the Cameron-Martin space $\mathcal{H}$ of $\widehat{B}$ is given by the image of $H^{1}_{0}$ under $K$ , i.e.,

[TABLE]

See Lemma 5.3 and Remark 5.4 for more details. A reference and also a sufficient condition for Assumption 2.5 (i) can be found e.g. in [Dec05, Section 3].

3. Main results

The following result can be seen as a non-Markovian extension of work by Osajima [Osa15]. The statement here is a combination of Theorem 5.10 and Proposition (5.14) below. Recall that $\sigma_{0}=\sigma\left(0\right)$ represents spot-volatility. We also set $\sigma_{0}^{\prime}\equiv\sigma^{\prime}\left(0\right)$ .

Theorem 3.1 (Energy expansion).

The rate function (or energy) $I$ in (2.7) is smooth in a neighbourhood of $x=0$ (at-the-money) and it is of the form

[TABLE]

The next result is an exact representation of call prices, valid in a non-Markovian generality, and amenable to moderate- and large-deviation analysis (Theorem 3.4 below) as well as to full asymptotic expansions, which will be explored in forthcoming work.

Theorem 3.2 (Pricing formula).

For a fixed log-strike $x\geq 0$ and time to maturity $t>0$ , set $\widehat{x}:=\frac{\varepsilon}{\widehat{\varepsilon}}x$ , where $\varepsilon=t^{1/2}$ and $\widehat{\varepsilon}=t^{H}=\varepsilon^{2H}$ , as before. Then we have

[TABLE]

where

[TABLE]

and $\widehat{U}^{\varepsilon}$ is a random variable of the form

[TABLE]

with $g_{1}$ a centred Gaussian random variable, explicitly given in equation (6.3) below, and $R^{\varepsilon}_{2}$ is a (random) remainder term, in the sense of a stochastic Taylor expansion in $\widehat{\varepsilon}$ , see Lemma 6.2 for more details.

Example 3.3 (Black-Scholes model).

We fix volatility $\sigma\left(\cdot\right)\equiv\sigma>0$ , and $H=1/2$ so that $\widehat{\varepsilon}=\varepsilon$ and all $\widehat{\cdot}$ can be omitted. Energy is given by $I\left(x\right)=\frac{x^{2}}{2\sigma^{2}}$ and

[TABLE]

with $R^{\varepsilon}_{2}=R_{2}\equiv-\sigma^{2}/2$ independent of $\varepsilon$ . Moreover,

[TABLE]

with $\alpha:=\frac{I^{\prime}\left(x\right)\sigma}{\varepsilon}=\frac{1}{\sigma}(x/\varepsilon)$ , and, in terms of the standard Gaussian cdf $\Phi$ ,

[TABLE]

Using the expansion $\Phi(-y)=\frac{1}{y\sqrt{2\pi}}e^{-y^{2}/2}(1-y^{-2}+...)$ , as $y\to\infty$ one deduces, for fixed $x>0$ , the asymptotic relation, as $\varepsilon\to 0$ ,

[TABLE]

We will be interested (cf. Theorem 3.4) in replacing $x$ by $\widetilde{x}=x\varepsilon^{2\beta}\to 0$ for $\beta>0$ . This gives $\widetilde{\alpha}=\frac{1}{\sigma}(x/\varepsilon^{1-2\beta})$ and the above analysis, now based on $\widetilde{\alpha}\to\infty$ , remains valid111More terms in the expansion of $\Phi$ are needed. for $\beta$ in the “moderate” regime $\beta\in[0,1/2)$ and we obtain

[TABLE]

Let us point out, for the sake of completeness, that a similar expansion is not valid for $\beta>1/2$ . To see this, first note that (3.1) implies that $J(\varepsilon,x)|_{x=0}$ is precisely the ATM call price with time $t=\varepsilon^{2}$ from expiration. Well-known ATM asymptotics then imply that $J(\varepsilon,x)|_{x=0}\sim\tfrac{1}{\sqrt{2\pi}}\varepsilon\sigma$ as $\varepsilon\to 0$ . These asymptotics are unchanged in case of $o(t^{1/2})=o(\varepsilon)$ out-of-moneyness (“almost-at-the-money” in the terminology of [FGP17]), which readily implies

[TABLE]

At last, we have the borderline case $\beta=1/2$ , or $\widetilde{x}=x\varepsilon$ . From e.g. [MKN11, Thm 3.1], we see that $c(x\varepsilon,\varepsilon^{2})\sim a(x;\sigma)\varepsilon$ with positive constant $a(x;\sigma)$ . A look at (3.1) then reveals

[TABLE]

For the call price expansion in the large / moderate deviations regime, $\beta\in[0,1/2)$ , the polynomial in $\varepsilon$ -behaviour of (3.5) implies that the $J$ -term in the pricing formula will be negligible on the moderate / large deviation scale, in the sense for any $\theta>0$ , we have $\varepsilon^{\theta}\log J(\varepsilon,x\varepsilon^{2\beta})\to 0$ as $\varepsilon\to 0$ . Consequently, with $k_{t}=kt^{\beta}$ , for $t=\varepsilon^{2}$ , $k>0$ , $\beta\in[0,1/2)$ , we get the “moderate” Black-Scholes call price expansion,

[TABLE]

While the above can be confirmed by elementary analysis of the Black–Scholes formula, the following theorem exhibits it as an instance of a general principle. See [FGP17] for a general diffusion statement.

Theorem 3.4 (Moderate Deviations).

In the rough volatility regime $H\in(0,1/2]$ , consider log-strikes of the form

[TABLE]

(i) For $\beta\in(0,H)$ , and every $\theta>0$ , we have

[TABLE]

(ii) For $\beta\in(0,\frac{2}{3}H)$ , and every $\theta>0$ , we have

[TABLE]

Moreover,

[TABLE]

where $\left\langle\cdot\,,\cdot\right\rangle$ is the inner product in $L^{2}\left([0,1]\right)$ .

*Remark 3.5**.*

In principle, further terms (of order $t^{i\beta-2H}$ ) can be added to this expansion of log call prices, given that the energy has sufficient regularity, see Theorem 3.6. We also note that, for small enough $\beta$ , the error term $O(t^{-\theta})$ can be omitted. In any case, one can replace the additive error bounds by (cruder) ones, where the right-most term in the expansion is multiplied with $(1+o(1))$ , as was done in [FGP17].

Proof of Theorem 3.4.

We apply Theorem 3.2 with $\widehat{x}=k_{t}=kt^{1/2-H+\beta}$ , i.e., with $x=kt^{\beta}=k\varepsilon^{2\beta}$ . In particular, we so get, with $\widehat{\varepsilon}=t^{H}$ and $\varepsilon=t^{1/2}$ ,

[TABLE]

The technical Proposition 7.3 asserts that, for fixed $k>0$ , the factor $J$ is negligible in the sense that, for every $\theta>0$ ,

[TABLE]

The theorem now follows immediately from the Taylor expansion of $I(x)$ around $x=0$ (see Theorem 3.1), plugging in $x=kt^{\beta}$ . Indeed, replacing $I(x)$ by the Taylor-jet seen in (i),(ii), leads exactly to an error term $O(t^{3\beta-2H})$ , resp. $O(t^{4\beta-2H})$ . ∎

Fix real numbers $k>0$ , $0<H<\frac{1}{2}$ , $0<\beta<H$ , and an integer $n\geq 2$ . For every $t>0$ , set

[TABLE]

and denote

[TABLE]

Here, $\theta>0$ can be arbitrarily small. It is clear that for all small $t$ and $\theta$ small enough,

[TABLE]

while

[TABLE]

The following statement provides an asymptotic formula for the implied variance.

Theorem 3.6.

Suppose $0<\beta<\frac{2H}{n}$ and $\theta>0$ small enough. Then as $t\rightarrow 0$ ,

[TABLE]

The $\mathcal{O}$ -estimate in (3.6) depends on $n$ , $H$ , $\beta$ , $\theta$ , and $k$ . It is uniform on compact subsets of $[0,\infty)$ with respect to the variable $k$ .

*Remark 3.7**.*

Using the multinomial formula, we can represent the expression on the left-hand side of (3.6) in terms of certain powers of $t$ . However, the coefficients become rather complicated.

*Remark 3.8**.*

Let an integer $n\geq 2$ be fixed, and suppose we would like to use only the derivatives $I^{(i)}(0)$ for $2\leq i\leq n$ in formula (3.6) to approximate $\sigma_{\rm impl}(k_{t},t)^{2}$ . Then, the optimal range for $\beta$ is the following: $\frac{2H}{n+1}\leq\beta<\frac{2H}{n}$ . On the other hand, if $\beta$ is outside of the interval $[\frac{2H}{n+1},\frac{2H}{n})$ , more derivatives of the energy function at zero may be needed to get a good approximation of the implied variance in formula (3.6).

We will next derive from Theorem 3.6 several asymptotic formulas for the implied volatility. In the next corollary, we take $n=2$ .

Corollary 3.9.

As $t\rightarrow 0$ ,

[TABLE]

Corollary 3.9 follows from Theorem 3.6 with $n=2$ , the equality

[TABLE]

given in Theorem 3.4, and the Taylor expansion $\sqrt{1+h}=1+\mathcal{O}(h)$ as $h\rightarrow 0$ .

In the next corollary, we consider the case where $n=3$ .

Corollary 3.10.

Suppose $\beta<\frac{2H}{3}$ . Then, as $t\rightarrow 0$ ,

[TABLE]

Corollary 3.10 follows from Theorem 3.6 with $n=3$ , formula (3.8), the equality

[TABLE]

(see Theorem 3.4), and the expansion $\sqrt{1+h}=1+\frac{1}{2}h+\mathcal{O}(h^{2})$ as $h\rightarrow 0$ .

Using Corollary 3.10, we establish the following implied volatility skew formula in the moderate deviation regime.

Corollary 3.11.

Let $0<H<\frac{1}{2}$ , $0<\beta<\frac{2}{3}H$ , and fix $y,z>0$ with $y\neq z$ . Then as $t\rightarrow 0$ ,

[TABLE]

*Remark 3.12**.*

Corollary 3.11 complements earlier works of Alòs et al. [ALV07] and Fukasawa [Fuk11, Fuk17]. For instance, the following formula can be found in [Fuk17, p. 6], see also [Fuk11, p. 14]:

[TABLE]

In formula (3.12), we employ the notation used in the present paper. Our analysis shows that the applicability range of skew approximation formulas is by no means restricted to the Central Limit Theorem type log-moneyness deviations of order $t^{1/2}$ . It also includes the moderate deviations regime of order $t^{1/2-H+\beta}$ . The previous rate is clearly $\gg t^{1/2}$ as $t\to 0$ .

*Remark 3.13** (Symmetry).*

Write $\Phi_{1}(W,B,\widehat{B};\rho;\sigma)$ for the “Itô-type map”

[TABLE]

It equals, in law, $\Phi_{1}(W,-B,-\widehat{B};-\rho;\sigma(-\cdot))$ , and indeed all our formulae are invariant under this transformation. In particular, the skew remains unchanged when the pair $(\rho,\sigma_{0}^{\prime})$ is replaced by $(-\rho,-\sigma_{0}^{\prime})$ .

4. Simulation results

We verify our theoretical results numerically with a variant of the rough Bergomi model [BFG16] which fits nicely into the general rough volatility framework considered in this paper. As before, the model has been normalized such that $S_{0}=1$ and $r=0$ . We let $(W,B)$ be two independent Brownian motions and $\rho\in(-1,1)$ with $\overline{\rho}^{2}=1-\rho^{2}$ such that $Z=\overline{\rho}W+\rho B$ is another Brownian motion having constant correlation $\rho$ with $B$ . For some spot volatility $\sigma_{0}$ and volatility of volatility parameter $\eta$ , we then assume the following dynamics for some asset $S$ :

[TABLE]

where $\widehat{B}$ is a Riemann-Liouville fBM given by

[TABLE]

The approach taken for the Monte Carlo simulations of the quantities we are interested in is the one initially explored in the original rough Bergomi pricing paper [BFG16]. That is, exploiting their joint Gaussianity, where we use the well-known Cholesky method to simulate the joint paths of $(Z,\widehat{B})$ on some discretization grid $\mathcal{D}$ . With (4.2) being an explicit function in terms of the rough driver, an Euler discretisation of the Ito SDE (4.1) on $\mathcal{D}$ then yields estimates for the price paths.

The Cholesky algorithm critically hinges on the availability and explicit computability of the joint covariance matrix of $(Z,\widehat{B})$ whose terms we readily compute below.222 Note that expressions for the exact same scenario have have been computed before in the original pricing paper [BFG16], yet in that version the expression for the autocorrelation of the fBM $\widehat{B}$ was incorrect. We compute and state here all the relevant terms for the sake of completeness.

Lemma 4.1.

For convenience, define constants $\gamma=\frac{1}{2}-H\in[0,\frac{1}{2})$ and $D_{H}=\frac{\sqrt{2H}}{H+\frac{1}{2}}$ and define an auxiliary function $G:[1,\infty)\rightarrow\mathbb{R}$ by

[TABLE]

*where ${}_{2}F_{1}$ denotes the Gaussian hypergeometric function [Olv10]. Then the joint process $(Z,\widehat{B})$ has zero mean and covariance structure governed by

$\begin{cases}\operatorname{Var}[\widehat{B}_{t}^{2}]=t^{2H},&\text{for$ t\geq 0 $,}\\ \operatorname{Cov}[\widehat{B}_{s}\widehat{B}_{t}]=t^{2H}G\left(s/t\right),&\text{for$ s>t\geq 0 $,}\\ \operatorname{Cov}[\widehat{B}_{s}Z_{t}]=\rho D_{H}\left(s^{H+\frac{1}{2}}-\left(s-\min(t,s)\right)^{H+\frac{1}{2}}\right),&\text{for$ t,s\geq 0 $,}\\ \operatorname{Cov}[Z_{t}Z_{s}]=\min(t,s),&\text{for$ t,s\geq 0 $.}\end{cases}$ **

Numerical simulations333 The Python 3 code used to run the simulations can be found at github.com/RoughStochVol. confirm the theoretical results obtained in the last section. In particular - as can be seen in Figure LABEL:fig:pub_roughimpvol – the asymptotic formula for the implied volatility (3.9) captures very well the geometry of the term structure of implied volatility, with particularly good results for higher $H$ and worsening results as $H\downarrow 0$ . Quite surprisingly, despite being an asymptotic formula, it seems to be fairly accurate over a wide array of maturities extending up to a single year.

5. Proof of the energy expansion

Consider

[TABLE]

where $\widehat{B}_{t}=\int_{0}^{t}K\left(t,s\right)dB_{s}$ for a fixed Volterra kernel (recall (2.3) in the previous section). We study the small noise problem $\left(X^{\varepsilon},Y^{\varepsilon}\right)$ where $\left(W,B,\widehat{B}\right)$ is replaced by $\left(\varepsilon W,\varepsilon B,\widehat{\varepsilon}\widehat{B}\right)$ . The following proposition roughly says that

[TABLE]

Proposition 5.1 (Forde-Zhang [FZ17]).

Under suitable assumptions (cf. Section 2), the rescaled process $\left(\frac{\widehat{\varepsilon}}{\varepsilon}X_{1}^{\varepsilon}:\varepsilon\geq 0\right)$ satisfies an LDP (with speed $\widehat{\varepsilon}^{2}$ ) and rate function

[TABLE]

where

[TABLE]

The rest of this section is devoted to analysis of the function $I$ as defined in (5.1). First, we derive the first order optimality condition for the above minimization problem.

Proposition 5.2 (First order optimality condition).

For any $x\in\mathbb{R}$ we have at any local minimizer $f=f^{x}$ of the functional $\mathcal{I}_{x}$ in (5.1) that

[TABLE]

for all $t\in\left[0,1\right]$ .

Proof.

We denote $a\approx b$ whenever $a=b+o\left(\delta\right)$ for a small parameter $\delta$ . We expand

[TABLE]

If $f=f^{x}$ is a minimizer then $\delta\mapsto\mathcal{I}_{x}\left(f+\delta g\right)$ has a minimum at $\delta=0$ for all $g$ . We expand

[TABLE]

As a consequence, we must have, for $f=f^{x}$ and every $\dot{g}\in L^{2}\left[0,1\right]$

[TABLE]

Recall $f_{0}^{x}=0$ , any $x$ . We now test with $\dot{g}=1_{\left[0,t\right]}$ for a fixed $t\in\left[0,1\right]$ and obtain

[TABLE]

5.1. Smoothness of the energy

Having formally identified the first order condition for minimality in (5.1), we will now show that the energy $x\mapsto I(x)$ is a smooth function. More precisely, we will use the implicit function theorem to show that the minimizing configuration $f^{x}$ is a smooth function in $x$ (locally at $x=0$ ). As $\mathcal{I}_{x}$ is a smooth function, too, this will imply smoothness of $x\mapsto\mathcal{I}_{x}(f^{x})=I(x)$ , at least in a neighborhood of [math].

As the Cameron-Martin space $\mathcal{H}$ of the process $\widehat{B}$ continuously embeds into $C\left([0,1]\right)$ , $K$ maps $H^{1}_{0}$ continuously into $C\left([0,1]\right)$ , i.e., there is a constant $C>0$ such that for any $f\in H^{1}_{0}$ we have

[TABLE]

This result will follow from

Lemma 5.3.

Let $\left(V_{t}:0\leq t\leq 1\right)$ be a continuous, centred Gaussian process and $\mathcal{H}$ its Cameron-Martin space. Then we have the continuous embedding $\mathcal{H}\hookrightarrow C\left[0,1\right]$ . That is, for some constant $C$ ,

[TABLE]

Proof.

By a fundamental result of Fernique, applied to the law of $V$ as Gaussian measure on the Banach space $(C\left[0,1\right],\left\|\cdot\right\|_{\infty})$ , the random variable $\left\|V\right\|_{\infty}$ has Gaussian integrability. In particular,

[TABLE]

On the other hand, a generic element $h\in\mathcal{H}$ can be written as $h_{t}=E\left[V_{t}Z\right]$ where $Z$ is a centred Gaussian random variable with variance $\left\|h\right\|_{\mathcal{H}}^{2}$ , see, e.g., [FH14, page 150]. By Cauchy–Schwarz,

[TABLE]

and conclude by taking the $\sup$ over on the l.h.s. over $t\in\left[0,1\right]$ . ∎

*Remark 5.4**.*

Assume $V$ is of Volterra form, i.e. $V_{t}=\int_{0}^{t}K\left(t,s\right)dB_{s}$ . Then it can be shown (see [Dec05, Section 3]) that $\mathcal{H}$ is the image of $L^{2}$ under the map

[TABLE]

and $\left\|K\dot{f}\right\|_{\mathcal{H}}=\left\|\dot{f}\right\|_{L^{2}}.$ In particular then, applying the above with $h=K\dot{f}\in\mathcal{H}$ , gives

[TABLE]

5.1.1. The uncorrelated case

We start with the case $\rho=0$ as the formulas are much simpler in this case.

By Proposition 5.2, any local optimizer $f=f^{x}$ of the functional $\mathcal{I}_{x}:H_{0}^{1}\to\mathbb{R}$ in the uncorrelated case $\rho=0$ satisfies for any $t\in[0,1]$

[TABLE]

We define a map $H:H_{0}^{1}\times\mathbb{R}\to H_{0}^{1}$ by

[TABLE]

Hence, for given $x\in\mathbb{R}$ , any local optimizer $f$ must solve $H(f,x)=0$ . As one particular solution is given by the pair $(0,0)$ , we are in the realm of the implicit function theorem. We need to prove that

•

$(f,x)\mapsto H(f,x)$ is locally smooth (in the sense of Fréchet);

•

$DH(f,x)\coloneqq\frac{\partial}{\partial f}H(f,x)$ is invertible in $(0,0)$ .

Note that invertibility should hold for $x$ small enough, as $DH(f,x)=\operatorname{id}_{H_{0}^{1}}-x^{2}R$ for some $R$ , which is invertible as long as $R$ has a bounded norm for sufficiently small $x$ .

*Remark 5.5**.*

The method of proof in this section is purely local in $H^{1}_{0}$ . Hence, we only really need smoothness of $\sigma$ locally around [math]. Note, however, that stochastic Taylor expansions used in Section 6 will actually require global smoothness of $\sigma$ .

Lemma 5.6.

The functions $F:H_{0}^{1}\to\mathbb{R}$ and $R_{1}:H_{0}^{1}\to C\left([0,1]\right)$ defined by

[TABLE]

are smooth in the sense of Fréchet.

Proof.

For $N\geq 1$ we note that the Gateaux derivative of $F$ satisfies

[TABLE]

By Lemma 5.3, we can bound

[TABLE]

for $\mathrm{const}=\left\lVert\frac{d^{n}}{dx^{n}}\sigma^{2}\right\rVert_{\infty}$ .444More precisely, since neither $\sigma$ nor its derivatives need to be bounded, we need to actually work with a local version of the above estimate, for instance by replacing the max with a sup over a compact set containing $\{(K\dot{f})(t):0\leq t\leq 1\}$ . Thus, $D^{N}F(f)$ is a multi-linear form on $H^{1}_{0}$ with operator norm $\left\lVert D^{N}F(f)\right\rVert\leq\left\lVert\frac{d^{n}}{dx^{n}}\sigma^{2}\right\rVert_{\infty}C^{N}$ independent of $f$ . As $f\mapsto D^{N}F(f)$ is continuous, we conclude that $D^{N}F(f)$ as given above is, in fact, a Fréchet derivative.

Let us next consider the functional $R_{1}$ . Note that

[TABLE]

for $\mathfrak{s}_{N}(x)\coloneqq\frac{d^{N}}{dx^{N}}\sigma(x)\sigma^{\prime}(x)$ . Hence, Assumption 2.5 implies that

[TABLE]

We see that the multi-linear map $D^{N}R_{1}(f)$ has operator norm bounded by

[TABLE]

independent of $f$ . From continuity of $f\mapsto D^{N}R_{1}(f)$ , it follows that $D^{N}R_{1}(f)$ is the $N$ ’th Fréchet derivative. ∎

Theorem 5.7 (Zero correlation).

Assuming $\rho=0$ , the energy $I(x)$ (as defined in (5.1)) is smooth in a neighborhood of $x=0$ .

Proof.

By construction, we have

[TABLE]

for $A:H_{0}^{1}\to\mathcal{L}(H_{0}^{1},H_{0}^{1})$ defined by

[TABLE]

Here,

[TABLE]

As verified above, $H$ is smooth in the sense of Fréchet. Trivially, $DH(0,0)=\operatorname{id}_{H^{1}_{0}}$ is invertible and $H(0,0)=0$ . Therefore, the implicit function theorem implies that there are open neighborhoods $U$ and $V$ of $0\in H^{1}_{0}$ and $0\in\mathbb{R}$ , respectively, and a smooth map $x\mapsto f^{x}$ from $V$ to $U$ such that $H(f^{x},x)\equiv 0$ and $f^{x}$ is unique in $U$ with this property.

For the energy, we prove that $I(x)=\mathcal{I}_{x}(f^{x})$ in a neighborhood of $x=0$ . First of all, we show that a minimizer exists. If not, there is a function $g\in H^{1}_{0}$ with $\mathcal{I}_{x}(g)<\mathcal{I}_{x}(f^{x})$ . For small enough $x$ such a $g$ must be inside a ball with radius $\epsilon$ around $0\in H^{1}_{0}$ , as $\mathcal{I}_{x}(g)\geq\frac{1}{2}\left\lVert g\right\rVert^{2}_{H_{0}^{1}}$ and $\lim_{x\to 0}\mathcal{I}_{x}(f^{x})=0$ . Then note that for any $g\in H^{1}_{0}$

[TABLE]

where $D^{2}\mathcal{I}_{x}(f)$ denotes the second derivative of $f\mapsto\mathcal{I}_{x}(f)$ . By continuity, $D^{2}\mathcal{I}_{x}(f)$ stays positive definite for $(x,f)$ in a neighborhood of $(0,0)$ . As noted, for $x$ small enough, both $g$ and $f^{x}$ (and the line connecting them) lie in this neighborhood. For $h\coloneqq g-f^{x}$ , this implies

[TABLE]

since $D\mathcal{I}_{x}(f_{x})\cdot h=0$ and $D^{2}\mathcal{I}_{x}(f^{x}+tsh)\cdot(h,h)>0$ . This contradicts the assumption that $\mathcal{I}_{x}(g)<\mathcal{I}_{x}(f^{x})$ , and we conclude that $f^{x}$ is, indeed, a minimizer of $\mathcal{I}_{x}$ , implying that $I(x)=\mathcal{I}_{x}(f^{x})$ locally.

Finally, as $x\mapsto f^{x}$ is smooth and $(f,x)\mapsto\mathcal{I}_{x}(f)=\frac{x^{2}}{2F(f)}+\frac{1}{2}\left\lVert f\right\rVert_{H^{1}_{0}}^{2}$ is smooth, we see that $x\mapsto I(x)=\mathcal{I}_{x}(f^{x})$ is smooth in a neighborhood of [math]. (Note that this arguments relies on $\sigma(0)\neq 0$ , implying that $F(f)\neq 0$ for $f$ in a neighborhood to [math].) ∎

*Remark 5.8**.*

Classical counter-examples in the context of the direct method of calculus of variations show that the step of verifying the existence of a minimizer should not be taken too lightly. For instance, the functional

[TABLE]

does not have a minimizer in $H^{1}_{0}$ , but $J$ can be made arbitrarily close to [math] by choosing piecewise-linear functions $u$ with slope $\left\lvert u^{\prime}\right\rvert=1$ oscillating around [math]. We refer to any text book on calculus of variations. In the situation above, local “convexity” in the sense of a positive definite second derivative prevents this phenomenon. An alternative method of proof for the existence of a minimizer is to show that $J$ is (lower semi-) continuous in the weak sense.

5.1.2. The general case

In the general case (cf. Proposition 5.2), we define the function $H:H_{0}^{1}\times\mathbb{R}\to H^{1}_{0}$ by

[TABLE]

where $R_{2},R_{3}:H_{0}^{1}\to H^{1}_{0}$ are defined by

[TABLE]

$t\in[0,1]$ .

One easily checks that $G$ , $R_{2}$ , $R_{3}$ are smooth in the Fréchet sense.

Lemma 5.9.

The functions $G:H^{1}_{0}\to\mathbb{R}$ , $R_{2}:H^{1}_{0}\to H^{1}_{0}$ and $R_{3}:H^{1}_{0}\to H^{1}_{0}$ are smooth in Fréchet sense.

Proof.

The proof of smoothness is clear. We report the actual derivatives. For $G$ we get

[TABLE]

For $R_{2}$ and, respectively, $R_{3}$ , we obtain

[TABLE]

and

[TABLE]

Theorem 5.10.

Let $\sigma$ be smooth with $\sigma(0)\neq 0$ . Then the energy $I(x)$ as defined in (5.1) is smooth in a neighborhood of $x=0$ .

Proof.

The proof is similar to the proof of Theorem 5.7. In fact, the only difference is in establishing invertibility of $DH(0,0)$ and the existence of a minimizer.

Note that (5.5) contains three terms. The derivative of the first term ( $f\mapsto f$ ) is always equal to $\operatorname{id}_{H_{0}^{1}}$ . For the second term, we note that

[TABLE]

Hence, the only non-vanishing contribution to the derivative of the second term evaluated in direction $g\in H_{0}^{1}$ at $x=0$ , $f=0$ and $t\in[0,1]$ is

[TABLE]

For the same reason, the derivative of the third term at $(f,x)=(0,0)$ vanishes entirely. Hence,

[TABLE]

It is easy to see that $g\mapsto DH(0,0)\cdot g$ is invertible. Indeed, let us construct the pre-image $g=DH(0,0)^{-1}\cdot h$ of some $h\in H^{1}_{0}$ . At $t=1$ we have

[TABLE]

implying $g(1)=\overline{\rho}^{2}h(1)$ . For $0\leq t<1$ , we then get

[TABLE]

or $g(t)=h(t)-\rho^{2}h(1)t$ .

For existence of the minimizer, note that

[TABLE]

which is again positive definite. ∎

*Remark 5.11**.*

Note that we do not really need infinite smoothness of $\sigma$ if we only want partial smoothness of $I$ . Indeed, it is easy to show that $\sigma\in C^{k}$ implies that $I\in C^{k-1}$ (locally at [math]).

5.2. Energy expansion

Having established smoothness of the energy $I$ as well as of the minimizing configuration $x\mapsto f^{x}$ locally around $x=0$ , we can proceed with computing the Taylor expansion of $f^{x}$ around $x=0$ . We will once more rely on the first order optimality condition given in Proposition 5.2. Plugging the Taylor expansion of $f^{x}$ into $\mathcal{I}_{x}$ will then give us the local Taylor expansion of $I(x)$ .

5.2.1. Expansion of the minimizing configuration

Theorem 5.12.

We have

[TABLE]

*Remark 5.13** (Non-Markovian transversality).*

In the RL-fBM case, $K\left(t,s\right)=\sqrt{2H}\left|t-s\right|^{\gamma}$ with $\gamma=H-1/2$ one computes

[TABLE]

Interestingly, the transversality condition known from the Markovian setting ( $q_{1}=0$ , which readily translates to $\dot{f}_{1}^{x}=0$ there) remains valid here (for $\rho=0$ ), at least to order $x^{2}$ , in the sense that

[TABLE]

Proof of Theorem 5.12.

**First order expansion:

**Up to the order needed in order to get the first order term, we have

[TABLE]

Therefore,

[TABLE]

This yields for the first order term in (5.2)

[TABLE]

Setting $t=1$ , we get

[TABLE]

which is solved by $\alpha_{1}=\frac{\rho}{\sigma_{0}}$ . Inserting this term back into the equation for $\alpha_{t}$ , we get

[TABLE]

Second order expansion:

Using (5.8) and the ansatz $f^{x}_{t}=\alpha_{t}x+\frac{1}{2}\beta_{t}x^{2}+\mathcal{O}(x^{3})$ , we re-compute the relevant terms appearing in the (5.2). We have

[TABLE]

and analogously for $\sigma$ replaced by $\sigma^{\prime}$ , $\sigma\sigma^{\prime}$ . This implies

[TABLE]

Using the notation introduced earlier, we have

[TABLE]

This directly implies

[TABLE]

We next compute some auxiliary terms appearing in (5.2).

[TABLE]

The corresponding denominator is $\overline{\rho}^{2}F(f^{x})$ . Using the formula

[TABLE]

we obtain

[TABLE]

For the second term in (5.2), let

[TABLE]

The corresponding denominator is $\overline{\rho}^{2}F(f^{x})^{2}=\overline{\rho}^{2}\sigma_{0}^{4}+\mathcal{O}(x)$ . Hence,

[TABLE]

Combining (5.9) and (5.10), we get

[TABLE]

We shall next compute $\beta_{1}$ . Taking the second order terms on both sides and letting $t=1$ , we obtain

[TABLE]

Moving $\beta_{1}$ to the other side with $1+\frac{\rho^{2}}{\overline{\rho}^{2}}=\frac{1}{\overline{\rho}^{2}}$ and collecting terms on the right hand side, we arrive at

[TABLE]

We conclude that

[TABLE]

Hence, we obtain

[TABLE]

5.2.2. Energy expansion in the general case

Now we compute the Taylor expansion of $I(x)$ as defined in Proposition 5.1. We start with the second term. Plugging in the optimal path $f_{t}^{x}=\alpha_{t}x+\frac{1}{2}\beta_{t}x^{2}+\mathcal{O}(x^{3})$ (and using $\left\langle\dot{\beta}\,,1\right\rangle=\beta_{1}$ as $\beta_{0}=0$ ) we obtain

[TABLE]

Inserting $\beta_{1}=2(1-2\rho^{2})\frac{\sigma_{0}^{\prime}}{\sigma_{0}^{3}}\left\langle K1\,,1\right\rangle$ into the above formula for $\left(x-\rho G(f^{x})\right)^{2}$ , we get

[TABLE]

Recall the denominator

[TABLE]

Using the expansion of a fraction

[TABLE]

we obtain from

[TABLE]

We note that

[TABLE]

Adding both terms, we arrive at the

Proposition 5.14.

The energy expansion to third order gives

[TABLE]

5.2.3. Energy expansion for the Riemann-Liouville kernel

Let us specialize the energy expansion given in Proposition 5.14 for the Riemann-Liouville fBm. Choose $\gamma=H-\frac{1}{2}$ and recall that the kernel $K$ takes the form $K(t,s)=(t-s)^{\gamma}$ . We get

[TABLE]

The key term $\left\langle K1\,,1\right\rangle$ appearing in the energy expansion now gives

[TABLE]

Plugging formula (5.2.3) into the energy expansion, we obtain the energy expansion for the Riemann-Liouville fractional Browian motion

[TABLE]

For completeness, let us also fully describe the time-dependence of the second order term $\beta_{t}$ in the expansion of the optimal trajectory $f^{x}_{t}$ . Unlike the first order time, here we do not have a linear movement any more. Indeed

[TABLE]

6. Proof of the pricing formula

Fix $x\geq 0$ and $\widehat{x}=\frac{\varepsilon}{\widehat{\varepsilon}}x$ where $\varepsilon=t^{1/2}$ and $\widehat{\varepsilon}=t^{H}=\varepsilon^{2H}$ . We have

[TABLE]

where we recall

[TABLE]

Consider a Cameron-Martin perturbation of $\widehat{X}_{1}^{\varepsilon}$ . That is, for a Cameron-Martin path $\mathrm{h}=(h,f)\in H_{0}^{1}\times H_{0}^{1}$ consider a measure change corresponding to a transformation $\widehat{\varepsilon}\left(W,B\right)\rightsquigarrow\widehat{\varepsilon}\left(W,B\right)+\left(h,f\right)$ (transforming the Brownian motions to Brownian motions with drift), we obtain the Girsanov density

[TABLE]

Under the new measure, $\widehat{X}_{1}^{\varepsilon}$ becomes $\widehat{Z}_{1}^{\varepsilon}$ , where

[TABLE]

Definition 6.1.

For fixed $x\geq 0$ , write $\left(h,f\right)\in\mathcal{K}^{x}$ if $\Phi_{1}\left(h,f,\widehat{f}\right)=x$ . Call such $\left(h,f\right)$ admissible for arrival at log-strike $x$ . Call $\left(h^{x},f^{x}\right)$ the cheapest admissible control, which attains

[TABLE]

where we recall that $\widehat{f}=K\dot{f}$ and

[TABLE]

For any Cameron-Martin path $(h,f)$ , the perturbed random variable $\widehat{Z}_{1}^{\varepsilon}$ admits a stochastic Taylor expansion with respect to $\widehat{\varepsilon}$ .

Lemma 6.2.

Fix $\left(h,f\right)\in\mathcal{K}^{x}$ and define $\widehat{Z}_{1}^{\varepsilon}$ accordingly. Then

[TABLE]

where $g_{1}$ is a Gaussian random variable, given explicitly by

[TABLE]

and

[TABLE]

Proof.

By a stochastic Taylor expansion for the controlled process $\widehat{Z}_{t}^{\varepsilon}$ with control $(h,f)\in\mathcal{K}^{x}$ as in Definition 6.1 and thanks to $\sigma\in C^{2}$ , we have at $t=1$

[TABLE]

Collecting terms in powers of $\widehat{\varepsilon}$ and with the random variable $g_{1}$ as in (6.3) (recalling that $\widehat{\varepsilon}\varepsilon\in\mathcal{O}(\widehat{\varepsilon}^{2})$ ), we have

[TABLE]

furthermore, since $(h,f)\in\mathcal{K}^{x}$ , by the definition of $\Phi_{1}$ , it holds that

[TABLE]

This proves the statement (6.2) and the statement that $g_{1}$ is Gaussian is immediate from the form (6.3). ∎

Finally, we determine an explicit form of the Girsanov density $G_{\varepsilon}$ for the choice where $(h^{x},f^{x})$ in (6.1) are chosen the cheapest admissible control (cf. Definition 6.1. Similarly to classical works of Azencott, Ben Arous and others, see, for instance, [BA88], we show that the stochastic integrals in the exponent of $G_{\varepsilon}$ are proportional to the first order term $g_{1}$ (with factor $I^{\prime}(x)$ ) when evaluated at the minimizing configuration $(h^{x},f^{x})$ .

Lemma 6.3.

We have

[TABLE]

Proof.

See Lemma A.2. ∎

With these preparations in place, we are now ready to prove the pricing formula from Section 3.

Proof of Theorem 3.2.

With a Girsanov factor (all integrals on $\left[0,1\right]$ )

[TABLE]

and (evaluated at the minimizer)

[TABLE]

we have, setting $\widehat{U}^{\varepsilon}\coloneqq\widehat{Z}_{1}^{\varepsilon}-x=\widehat{\varepsilon}g_{1}+\widehat{\varepsilon}^{2}R_{2}^{\varepsilon}$

[TABLE]

7. Proof of the moderate deviation expansions

In Section 2, we pointed out that (iiic) is exactly what one get from (call price) large deviations (2.8), if heuristically applied to $x\varepsilon^{2\beta}$ . We now sketch a proper derivation based on moderate deviations.

Lemma 7.1.

Assume (iiia-b) from Assumption 2.4. Then an upper moderate deviation estimate holds both for calls and digital calls. That is, we have

(iiic)

For every $\beta\in(0,H)$ , and every fixed $x>0$ , and $\widehat{x}_{\varepsilon}:=x\varepsilon^{1-2H+2\beta}$ ,

[TABLE]

and also

[TABLE]

Proof.

(Sketch) Recall $\sigma(.)$ smooth but unbounded and recall $\widehat{x}_{\varepsilon}:=x\varepsilon^{1-2H+2\beta}$ . In case of $\beta=0$ and $H=1/2$ a large deviation principle (LDP) for $(X^{\varepsilon}_{1}\widehat{\varepsilon}/\varepsilon)$ is readily reduced, via exponential equivalence, to a LDP for the family of stochastic Itô integrals given by $\int\sigma(\widehat{\varepsilon}\widehat{B})\widehat{\varepsilon}dZ$ for some Brownian $Z$ , $\rho$ -correlated with $B$ . There are then many ways to establish a LDP for this family. A particularly convenient one, that requires no growth restriction on $\sigma$ , uses continuity of stochastic integration with respect to the rough path $(B,Z,\int BdZ)=(B,Z,\int\widehat{B}dZ)$ in suitable metrics, for which a LDP is known [FH14, Ch 9.3]. It was pointed out in [BFG*+*17] that a similar reasoning is possible when $H<1/2$ , the rough path is then replaced by a “richer enhancement” of $(B,Z)$ , the precise size of which depends on $H$ , for which again one has a LDP. A moderate deviation priniple (MDP) for $(X^{\varepsilon}_{1}\widehat{\varepsilon}/\varepsilon)$ is a LDP for $(\varepsilon^{-2\beta}X^{\varepsilon}_{1}\widehat{\varepsilon}/\varepsilon)$ for $\beta\in(0,H)$ . This can be reduced to a LDP, with $\overline{\varepsilon}:=\varepsilon^{-2\beta}\widehat{\varepsilon}=\varepsilon^{2H-2\beta}$ , for

[TABLE]

with speed $\overline{\varepsilon}^{2}$ . Also, $\sigma_{\varepsilon}(\cdot)\equiv\sigma(\varepsilon^{2\beta}\cdot)$ convergens (with all derivatives) locally uniformly to the constant function $\sigma_{0}$ , and one checks that $\varepsilon^{-2\beta}\int_{0}^{1}\sigma(\widehat{\varepsilon}\widehat{B})$ is exponentially equivalent to the (Gaussian) family given by $\sigma_{0}\overline{\varepsilon}Z_{1}$ , with law $\mathcal{N}(0,\sigma_{0}^{2}\overline{\varepsilon}^{2})=\mathcal{N}(0,\sigma_{0}^{2}\varepsilon^{4H-4\beta})$ which gives (7.1), even with equality. (Showing this exponential equivalence can again be done for $\sigma$ without growth restrictions.)

We have not yet used either assumption (iiia-b). These become important in order to extend estimate (7.1) to the case of genuine call payoffs. We can follow here a well-known argument (Forde-Jacquier, Pham, …) with the “moderate” caveat to carry along a factor $\varepsilon^{2\beta}$ . In fact, this is close in spirit to what already happens with rough volatility where one has to carry along a factor $\widehat{\varepsilon}/\varepsilon=\varepsilon^{2H-1}$ . The remaining details then follow essentially “Appendix C. Proof of Corollary 4.13., part (ii) upper bound” of [FZ17], noting perhaps that the authors use their assumptions to show validity of what we simply assumed as condition (iiib), and also that one works with the quadratic rate function $I^{\prime\prime}(0)x^{2}=\frac{x^{2}}{2\sigma_{0}^{2}}$ throughout. ∎

*Remark 7.2**.*

By an easy argument similar to “Appendix C. Proof of Corollary 4.13., part (i) lower bound” of [FZ17] one sees that validity of the call price upper bound (iiic) implies the corresponding digital call price upper bound (7.1.) For this reason, we only emphasized (iiic) but not (7.1) in Section 2.

In a classical work Azencott [Aze82] (see also [Aze85], [BA88, Théorème 2]) obtained asymptotic expansions of functionals of Laplace type on Wiener space, of the type “ $E[\exp(-F(X^{\varepsilon})/\varepsilon^{2})]$ ”, for small noise diffusions $X^{\varepsilon}$ . This refines the large deviation (equivalently: Laplace) principle of Freidlin–Wentzell for small noise diffusions. In a nutshell, for fixed $X_{0}=x$ , Azencott gets expansions of the form $e^{-c/\varepsilon^{2}}(\alpha_{0}+\alpha_{1}\varepsilon...)$ . His ideas (used by virtually all subsequent works in this direction) are a Girsanov transform, to make the minimizing path “typical”, followed by localization around the minimizer (justified by a good large deviation principle), and finally a local (stochastic Taylor) type analysis near the minimizer. None of these ingredients rely on the Markovian structure (or, relatedly, PDE arguments). As a consequence (and motivation for this work) such expansions were also obtained in the (non-Markovian) context of rough differential equations driven by fractional Brownian motion [Ina13, BO15] with $H<1/2$ .

And yet, our situation is different in the sense that call price Wiener functionals do not fit the form studied by Azencott and others, nor can we in fact expect a similar expansion: Example 3.3 gives a Black-Scholes call price expansion of the form constant times $e^{-c\varepsilon^{2}}(\varepsilon^{3}+...)$ . Azencott’s ideas are nonetheless very relevant to us: we already used the Girsanov formula in Theorem 3.2 in order to have a tractable expression for $J$ . It thus “only” remains to carry out the localization and do some local analysis. We again content ourselves with a sketch and leave full technical details as well as some extensions to a forthcoming technical note.

Proposition 7.3.

In the context of Theorem 3.4, let $x>0$ . Then the factor $J$ is negligible in the sense that, for every $\theta>0$ ,

[TABLE]

Proof.

(Sketch). Step 1. Localization One shows that

[TABLE]

can be replaced, in the sense that the error $|J\left(\varepsilon,x\varepsilon^{2\beta}\right)-J_{\delta}\left(\varepsilon,x\varepsilon^{2\beta}\right)|$ is exponentially small (cf. [BA88, Lemme 1.32]), with

[TABLE]

Unlike the works of [Aze82, BA88], however, this is not a simple consequence of large (or here: moderate) deviation upper estimates alone, but requires the corresponding call price estimate (iiic), as provided by Lemma 7.1.

Step 2. Local analysis. Recall that $\widehat{U}^{\varepsilon}$ decomposes into a Gaussian random variable $g_{1}$ and remainder $R^{\varepsilon}_{2}$ . In order to control this remainder without imposing boundedness assumption on $\sigma(.)$ , we can show that it is well concentrated on the relevant parts of the probability space in the sense of a “localized remainder tail estimate” (cf. [Aze82, Prop. 4.3.]), of the form

[TABLE]

(It is in this step that we exploit $C^{2}$ -regularity of $\sigma$ , which allows to write the remainder in terms of local martingales, stopped after leaving a $\kappa$ -neighbourhood of zero, whose quadratic variation then can be estimated and leads to the claimed tail estimate.) One then estimates $J_{\delta}$ from above, separately on $\{\widehat{\varepsilon}|\widehat{B}|_{\infty;\left[0,1\right]}<\kappa\}$ (using the above estimate) and its complement, using Fernique estimates. For the lower bound, use again localized remainder tail estimate, plus some elementary calculus estimates of the form

[TABLE]

for some positive constant, and $u/\widehat{\varepsilon}^{2}$ small enough. ∎

8. Proof of the implied volatility expansion

With Theorem 3.2 in place, we now turn to the proof of the implied volatility expansion, formulated in Theorem 3.6.

Proof of Theorem 3.6.

We will use an asymptotic formula for the dimensionless implied variance

[TABLE]

obtained in [GL14]. It follows from the first formula in Remark 7.3 in [GL14] that

[TABLE]

where $L_{t}=-\log c(k_{t},t)$ , $t>0$ .

We will need the following formula that was established in the proof of Theorem 3.4:

[TABLE]

as $t\rightarrow 0$ , for all $x\geq 0$ and $\beta\in[0,H)$ and any $\theta>0$ . Let us first assume $\frac{2H}{n+1}\leq\beta<\frac{2H}{n}$ . Using the energy expansion, we obtain from (8.2) that

[TABLE]

as $t\rightarrow 0$ . The second term in the brackets on the right-hand side of (8.3) disappears if $n=2$ .

*Remark 8.1**.*

Suppose $n\geq 2$ and $\frac{2H}{n+1}\leq\beta<\frac{2H}{n}$ . Then formula (8.3) is optimal. Next, suppose $n\geq 2$ and $0<\beta<\frac{2H}{n+1}$ . In this case, there exists $m\geq n+1$ such that $\frac{2H}{m+1}\leq\beta<\frac{2H}{m}$ , and hence (8.3) holds with $m$ instead of $n$ . However, we can replace $m$ by $n$ , by making the error term worse. It is not hard to see that the following formula holds for all $n\geq 2$ and $0<\beta<\frac{2H}{n+1}$ :

[TABLE]

as $t\rightarrow 0$ provided we choose $\theta$ small enough.

Let us continue the proof of Theorem 3.6. Since $k_{t}\approx t^{\frac{1}{2}-H+\beta}$ and $L_{t}\approx t^{2\beta-2H}$ as $t\rightarrow 0$ , (8.1) implies that

[TABLE]

Next, using the Taylor formula for the function $u\mapsto\frac{1}{1+u}$ , and setting

[TABLE]

we obtain from (8.3) that

[TABLE]

as $t\rightarrow 0$ . It follows from $\frac{2H}{n+1}\leq\beta<\frac{2H}{n}$ that $(n-1)\beta\geq 2H-2\beta$ , and hence

[TABLE]

as $t\rightarrow 0$ . Now, (8.5) gives

[TABLE]

as $t\rightarrow 0$ . Finally, by cancelling a factor of $t$ in the previous formula, we obtain formula (3.6) for $\frac{2H}{n+1}\leq\beta<\frac{2H}{n}$ . The proof in the case where $\beta\leq\frac{2H}{n+1}$ is similar. Here we take into account Remark 8.1. This completes the proof of Theorem 3.6. ∎

Appendix A Auxiliary lemmas

In this section we provide and prove some auxiliary lemmas, which are used in the preparations to the proof of Theorem 3.2. We start with a technical Lemma, that justifies the derivation.

Lemma A.1.

Assume $\sigma\left(.\right)>0$ and $\left|\rho\right|<1$ . Then $\mathcal{K}^{x}$ is a Hilbert manifold near any $\mathfrak{h}\coloneqq(h,f)\in\mathcal{K}^{x}\subset\mathfrak{H}\coloneqq H^{1}_{0}\times H^{1}_{0}$ .

Proof.

Similar to Bismut [Bis84, p. 25] we need to show that $D\varphi_{1}\left(\mathfrak{h}\right)$ is surjective where $\varphi_{1}\left(\mathfrak{h}\right):$ $\mathfrak{H}\rightarrow\mathbb{R}$ with

[TABLE]

From

[TABLE]

the functional derivative $D\varphi_{1}\left(\mathfrak{h}\right)$ can be computed explicitly. In fact, even the computation

[TABLE]

is sufficient to guarantee surjectivity of $D\varphi_{1}\left(\mathfrak{h}\right)$ . ∎

We now deliver the proof of Lemma 6.2, which determines the form of the Girsanov measure change (6.1) for the minimizing configuration (Definition 6.1), denoted by $(h^{x}.f^{x})\in\mathcal{K}^{x}$ .

Lemma A.2.

(i) Any optimal control $\mathfrak{h}^{0}=\left(h^{x},f^{x}\right)\in\mathcal{K}^{x}$ is a critical point of

[TABLE]

(ii) it holds that

[TABLE]

Proof.

(Step 1) Write $\mathfrak{h}=\left(h,f\right)$ and

[TABLE]

Let $\mathfrak{h}^{0}=\left(h^{x},f^{x}\right)\in\mathcal{K}^{x}$ an optimal control. Then

[TABLE]

(This requires $\mathcal{K}^{x}$ to be a Hilbert manifold near $\mathfrak{h}^{0}$ , as was seen in the last lemma.)(Step 2) For fixed $\mathfrak{h}\in\mathfrak{H}$ , define

[TABLE]

with equality at $t=0$ (since $x=\varphi_{1}^{\mathfrak{h}^{0}}$ and $I\left(x\right)=\frac{1}{2}\left\|\mathfrak{h}^{0}\right\|_{\mathfrak{H}}^{2}$ ) and non-negativity for all $t$ because $\mathfrak{h}^{0}+t\mathfrak{h}$ is an admissible control for reaching $\widetilde{x}=\varphi_{1}^{\mathfrak{h}^{0}+t\mathfrak{h}}$ (so that $I\left(\widetilde{x}\right)=\inf\left\{...\right\}\leq\frac{1}{2}\left\|\mathfrak{h}^{0}+t\mathfrak{h}\right\|_{\mathfrak{H}}^{2}$ .)

(Step 3) We note that $\dot{u}\left(0\right)=0$ is a consequence of $u\in C^{1}$ near [math], $u\left(0\right)=0$ and $u\geq 0$ . In other words, $\mathfrak{h}^{0}$ is a critical point for

[TABLE]

(Step 4) The functional derivative of this map at $\mathfrak{h}^{0}$ must hence be zero. In particular, for all $\mathfrak{h}\in\mathfrak{H}$ ,

[TABLE]

(Step 5) With $\mathfrak{h}^{0}=\left(h^{x},f^{x}\right)$ and $\mathfrak{h}=\left(h,f\right)$

[TABLE]

By continuous extension, replace $\mathfrak{h}=\left(h,f\right)$ by $\left(W,B\right)$ above and note that

[TABLE]

since indeed $g_{1}=\int_{0}^{1}\sigma(\widehat{f}_{t})d\left(\overline{\rho}W_{t}+\rho B_{t}\right)+\sigma^{\prime}(\widehat{f}_{t})\widehat{B}_{t}d\left(\overline{\rho}h_{t}+\rho f_{t}\right)$ . Hence

[TABLE]

Bibliography42

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[ALV 07] Elisa Alòs, Jorge A León, and Josep Vives. On the short-time behavior of the implied volatility for jump-diffusion models with stochastic volatility. Finance and Stochastics , 11(4):571–589, 2007.
2[Aze 82] Robert Azencott. Formule de Taylor stochastique et développement asymptotique d’intégrales de Feynman. In Seminar on Probability, XVI, Supplement , volume 921 of Lecture Notes in Math. , pages 237–285. Springer, Berlin-New York, 1982.
3[Aze 85] Robert Azencott. Petites perturbations aléatoires des systemes dynamiques: développements asymptotiques. Bulletin des sciences mathématiques , 109(3):253–308, 1985.
4[BA 88] Gérard Ben Arous. Methods de Laplace et de la phase stationnaire sur l’espace de Wiener. Stochastics , 25(3):125–153, 1988.
5[BFG 16] Christian Bayer, Peter K. Friz, and Jim Gatheral. Pricing under rough volatility. Quantitative Finance , 16(6):887–904, 2016.
6[BFG + 17] Christian Bayer, Peter K. Friz, Paul Gassiat, Jörg Martin, and Benjamin Stemper. A regularity structure for rough volatility. Preprint, 2017. ar Xiv:1710.07481.
7[Bis 84] Jean-Michel Bismut. Large deviations and the Malliavin calculus , volume 45 of Progress in Mathematics . Birkhäuser Boston, Inc., Boston, MA, 1984.
8[BLP 16] Mikkel Bennedsen, Asger Lunde, and Mikko S Pakkanen. Decoupling the short- and long-term behavior of stochastic volatility. Preprint, 2016. ar Xiv:1610.00332.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Short-time near-the-money skew in rough fractional volatility models

Abstract.

Key words and phrases:

2010 Mathematics Subject Classification:

Contents

1. Introduction

2. Exposition and assumptions

Assumption 2.1** (Small time self-similarity).**

Remark 2.2*.*

Remark 2.3*.*

Assumption 2.4**.**

Assumption 2.5**.**

Remark 2.6*.*

3. Main results

Theorem 3.1** (Energy expansion).**

Theorem 3.2** (Pricing formula).**

Example 3.3** (Black-Scholes model).**

Theorem 3.4** (Moderate Deviations).**

Remark 3.5*.*

Proof of Theorem 3.4.

Theorem 3.6**.**

Remark 3.7*.*

Remark 3.8*.*

Corollary 3.9**.**

Corollary 3.10**.**

Corollary 3.11**.**

Remark 3.12*.*

Remark 3.13* (Symmetry).*

4. Simulation results

Lemma 4.1**.**

5. Proof of the energy expansion

Proposition 5.1** (Forde-Zhang [FZ17]).**

Proposition 5.2** (First order optimality condition).**

Proof.

5.1. Smoothness of the energy

Lemma 5.3**.**

Proof.

Remark 5.4*.*

5.1.1. The uncorrelated case

Remark 5.5*.*

Lemma 5.6**.**

Proof.

Theorem 5.7** (Zero correlation).**

Proof.

Remark 5.8*.*

5.1.2. The general case

Lemma 5.9**.**

Proof.

Theorem 5.10**.**

Proof.

Remark 5.11*.*

5.2. Energy expansion

5.2.1. Expansion of the minimizing configuration

Theorem 5.12**.**

Remark 5.13* (Non-Markovian transversality).*

Proof of Theorem 5.12.

5.2.2. Energy expansion in the general case

Proposition 5.14**.**

5.2.3. Energy expansion for the Riemann-Liouville kernel

6. Proof of the pricing formula

Definition 6.1**.**

Lemma 6.2**.**

Proof.

Lemma 6.3**.**

Proof.

Proof of Theorem 3.2.

7. Proof of the moderate deviation expansions

Lemma 7.1**.**

Proof.

Remark 7.2*.*

Proposition 7.3**.**

Proof.

8. Proof of the implied volatility expansion

Proof of Theorem 3.6.

Assumption 2.1 (Small time self-similarity).

*Remark 2.2**.*

*Remark 2.3**.*

Assumption 2.4.

Assumption 2.5.

*Remark 2.6**.*

Theorem 3.1 (Energy expansion).

Theorem 3.2 (Pricing formula).

Example 3.3 (Black-Scholes model).

Theorem 3.4 (Moderate Deviations).

*Remark 3.5**.*

Theorem 3.6.

*Remark 3.7**.*

*Remark 3.8**.*

Corollary 3.9.

Corollary 3.10.

Corollary 3.11.

*Remark 3.12**.*

*Remark 3.13** (Symmetry).*

Lemma 4.1.

Proposition 5.1 (Forde-Zhang [FZ17]).

Proposition 5.2 (First order optimality condition).

Lemma 5.3.

*Remark 5.4**.*

*Remark 5.5**.*

Lemma 5.6.

Theorem 5.7 (Zero correlation).

*Remark 5.8**.*

Lemma 5.9.

Theorem 5.10.

*Remark 5.11**.*

Theorem 5.12.

*Remark 5.13** (Non-Markovian transversality).*

Proposition 5.14.

Definition 6.1.

Lemma 6.2.

Lemma 6.3.

Lemma 7.1.

*Remark 7.2**.*

Proposition 7.3.

*Remark 8.1**.*

Lemma A.1.

Lemma A.2.