Consistent Time-Homogeneous Modeling of SPX and VIX Derivatives

Andrew Papanicolaou

arXiv:1812.05859·q-fin.PR·March 16, 2022

Consistent Time-Homogeneous Modeling of SPX and VIX Derivatives

Andrew Papanicolaou

PDF

Open Access

TL;DR

This paper develops a method to recover a consistent stochastic volatility model for SPX and VIX derivatives from market data, ensuring the models are aligned and non-negative, with analysis of solution uniqueness.

Contribution

It introduces a novel inverse problem approach to derive a stochastic volatility function from market models of VIX futures, ensuring consistency with SPX derivatives.

Findings

01

Method for recovering stochastic volatility functions from market models.

02

Conditions for uniqueness of the inverse solution.

03

Illustrations of potential negativity and inconsistency issues.

Abstract

This paper shows how to recover a stochastic volatility model (SVM) from a market model of the VIX futures term structure. Market models have more flexibility for fitting of curves than do SVMs, and therefore are better suited for pricing VIX futures and VIX derivatives. But the VIX itself is a derivative of the S&P500 (SPX) and it is common practice to price SPX derivatives using an SVM. Therefore, consistent modeling for both SPX and VIX should involve an SVM that can be obtained by inverting the market model. This paper's main result is a method for the recovery of a stochastic volatility function by solving an inverse problem where the input is the VIX function given by a market model. Analysis will show conditions necessary for there to be a unique solution to this inverse problem. The models are consistent if the recovered volatility function is non-negative. Examples are…

Equations332

\frac{d S _{t}}{S _{t}}

\frac{d S _{t}}{S _{t}}

d X_{t}

d B_{t} d W_{t}^{i} = ρ_{i} d t for 1 \leq i \leq d .

d B_{t} d W_{t}^{i} = ρ_{i} d t for 1 \leq i \leq d .

\mbox{VIX}_{t}=\sqrt{\mathbb{E}\left[\frac{1}{\tau}\int_{t}^{t+\tau}v^{2}(X_{u})du\Big{|}\mathcal{F}_{t}\right]}\ ,

\mbox{VIX}_{t}=\sqrt{\mathbb{E}\left[\frac{1}{\tau}\int_{t}^{t+\tau}v^{2}(X_{u})du\Big{|}\mathcal{F}_{t}\right]}\ ,

h_{T - t} (X_{t}) = E [\mbox V I X_{T} ∣ F_{t}] for t \leq T .

h_{T - t} (X_{t}) = E [\mbox V I X_{T} ∣ F_{t}] for t \leq T .

\frac{d F _{t, T}}{F _{t, T}} = ν (t, T) d W_{t},

\frac{d F _{t, T}}{F _{t, T}} = ν (t, T) d W_{t},

h_{0}^{2}(x)=\mathbb{E}\left[\frac{1}{\tau}\int_{t}^{t+\tau}v^{2}(X_{u})du\Big{|}X_{t}=x\right]\ .

h_{0}^{2}(x)=\mathbb{E}\left[\frac{1}{\tau}\int_{t}^{t+\tau}v^{2}(X_{u})du\Big{|}X_{t}=x\right]\ .

h_{θ} (X_{t}) = E [\mbox V I X_{t + θ} ∣ F_{t}],

h_{θ} (X_{t}) = E [\mbox V I X_{t + θ} ∣ F_{t}],

F_{t, t + θ} = E [F_{t + θ, t + θ} ∣ F_{t}] .

F_{t, t + θ} = E [F_{t + θ, t + θ} ∣ F_{t}] .

\frac{d F _{t, t + θ}}{F _{t, t + θ}} = Y_{t}^{θ} d t + ν (t, t + θ) d W_{t},

\frac{d F _{t, t + θ}}{F _{t, t + θ}} = Y_{t}^{θ} d t + ν (t, t + θ) d W_{t},

Y_{t}^{θ}

Y_{t}^{θ}

F_{t, t + θ} = h_{θ} (X_{t}) a.s. \forall t \geq 0 and \forall θ \geq 0,

F_{t, t + θ} = h_{θ} (X_{t}) a.s. \forall t \geq 0 and \forall θ \geq 0,

F_{t, t} = h_{0} (X_{t}) a.s. \forall t \geq 0 .

F_{t, t} = h_{0} (X_{t}) a.s. \forall t \geq 0 .

\frac{d F _{t, t + θ}}{F _{t, t + θ}} = f_{θ} (X_{t}) d t + ν_{θ} (X_{t}) d W_{t},

\frac{d F _{t, t + θ}}{F _{t, t + θ}} = f_{θ} (X_{t}) d t + ν_{θ} (X_{t}) d W_{t},

E exp (\frac{1}{2} \int_{0}^{T} ∥ ν_{T - t} (X_{t}) ∥^{2} d t) < \infty for all 0 \leq T < \infty .

E exp (\frac{1}{2} \int_{0}^{T} ∥ ν_{T - t} (X_{t}) ∥^{2} d t) < \infty for all 0 \leq T < \infty .

\frac{dF_{t,T}}{F_{t,T}}=\frac{dF_{t,t+\theta}}{F_{t,t+\theta}}\Bigg{|}_{\theta=T-t}-f_{T-t}(X_{t})dt=\nu_{T-t}(X_{t})dW_{t}\ ,

\frac{dF_{t,T}}{F_{t,T}}=\frac{dF_{t,t+\theta}}{F_{t,t+\theta}}\Bigg{|}_{\theta=T-t}-f_{T-t}(X_{t})dt=\nu_{T-t}(X_{t})dW_{t}\ ,

L = \frac{1}{2} trace [σ σ^{*} (x) \nabla \nabla^{*}] + μ^{*} (x) \nabla,

L = \frac{1}{2} trace [σ σ^{*} (x) \nabla \nabla^{*}] + μ^{*} (x) \nabla,

L h_{θ} (X_{t})

L h_{θ} (X_{t})

σ^{*} (X_{t}) \nabla h_{θ} (X_{t})

F_{t, t + θ} = F_{0, θ} exp (\int_{0}^{t} (f_{θ} (X_{u}) - \frac{1}{2} ∥ ν_{θ} (X_{u}) ∥^{2}) d u + \int_{0}^{t} ν_{θ} (X_{u}) d W_{u}),

F_{t, t + θ} = F_{0, θ} exp (\int_{0}^{t} (f_{θ} (X_{u}) - \frac{1}{2} ∥ ν_{θ} (X_{u}) ∥^{2}) d u + \int_{0}^{t} ν_{θ} (X_{u}) d W_{u}),

h^{2}(x)=\mathbb{E}\left[\frac{1}{\tau}\int_{t}^{t+\tau}v^{2}(X_{u})du\Big{|}X_{t}=x\right]\ ,

h^{2}(x)=\mathbb{E}\left[\frac{1}{\tau}\int_{t}^{t+\tau}v^{2}(X_{u})du\Big{|}X_{t}=x\right]\ ,

⟨ g ⟩ := \int g (x) d ω (x),

⟨ g ⟩ := \int g (x) d ω (x),

⟨ (e^{L t} g)^{2} ⟩ \leq e^{- λ t} ⟨ g^{2} ⟩,

⟨ (e^{L t} g)^{2} ⟩ \leq e^{- λ t} ⟨ g^{2} ⟩,

e^{\mathcal{L}t}g(x)=\mathbb{E}\left[g(X_{t})\Big{|}X_{0}=x\right]

e^{\mathcal{L}t}g(x)=\mathbb{E}\left[g(X_{t})\Big{|}X_{0}=x\right]

h^{2} (x) - ⟨ h^{2} ⟩ = Φ ξ (x),

h^{2} (x) - ⟨ h^{2} ⟩ = Φ ξ (x),

Φ = \frac{1}{τ} \int_{0}^{τ} e^{L u} d u .

Φ = \frac{1}{τ} \int_{0}^{τ} e^{L u} d u .

⟨ ξ ⟩ = 0,

⟨ ξ ⟩ = 0,

Φ ξ (x) = h^{2} (x) - ⟨ h^{2} ⟩ .

Φ ξ (x) = h^{2} (x) - ⟨ h^{2} ⟩ .

L Φ ξ (x) = L h^{2} (x) .

L Φ ξ (x) = L h^{2} (x) .

L Φ = \frac{1}{τ} \int_{0}^{τ} L e^{L u} d u = \frac{1}{τ} (e^{L τ} - I),

L Φ = \frac{1}{τ} \int_{0}^{τ} L e^{L u} d u = \frac{1}{τ} (e^{L τ} - I),

- τ L h^{2} (x) = - τ L Φ ξ (x) = (I - (I + τ L Φ)) ξ = (I - e^{L τ}) ξ (x),

- τ L h^{2} (x) = - τ L Φ ξ (x) = (I - (I + τ L Φ)) ξ = (I - e^{L τ}) ξ (x),

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic processes and financial applications · Complex Systems and Time Series Analysis · Financial Risk and Volatility Modeling

Full text

Consistent Time-Homogeneous Modeling of SPX and VIX Derivatives111Data Sharing and Data Accessibility: Data sharing is not applicable to this article as no new data were created or analyzed in this study. 222This work was partially supported by NSF grant DMS-1907518.

A. Papanicolaou Department of Mathematics, North Carolina State University, 2311 Stinson Drive, Raleigh, NC 27695. [email protected]

Abstract

This paper shows how to recover a stochastic volatility model (SVM) from a market model of the VIX futures term structure. Market models have more flexibility for fitting of curves than do SVMs, and therefore are better suited for pricing VIX futures and VIX derivatives. But the VIX itself is a derivative of the S&P500 (SPX) and it is common practice to price SPX derivatives using an SVM. Therefore, consistent modeling for both SPX and VIX should involve an SVM that can be obtained by inverting the market model. This paper’s main result is a method for the recovery of a stochastic volatility function by solving an inverse problem where the input is the VIX function given by a market model. Analysis will show conditions necessary for there to be a unique solution to this inverse problem. The models are consistent if the recovered volatility function is non-negative. Examples are presented to illustrate the theory, to highlight the issue of negativity in solutions, and to show the potential for inconsistency in non-Markov settings.

Keywords: stochastic volatility, market models, VIX futures, consistent pricing.

AMS Subject Codes: 91B24, 91B70, 45Q05

1 Motivation and Formulation
1.1 Problem Formulation
1.2 Background Literature
1.3 Results and Organization of this Paper
2 Definitions and Main Result
2.1 Consistency
2.2 Main Result: Markovian Inverse Problem for $v^{2}(x)$
2.2.1 General Solvability
2.2.2 Solution via Eigenseries Expansion
3 Application to Tractable Models
3.1 The Scalar Bergomi Model
3.2 The Multi-Factor Bergomi Model
3.3 The $3/2$ Model
3.4 The Double Nelson Model
3.5 Non-Negative Solutions for Brownian Motion Factor
4 Non-Markovian Market Models
4.1 Scalar Consistency with Constant $\nu_{\theta}(t)$
4.2 An Inconsistent Example
5 Summary and Conclusion

1 Motivation and Formulation

Volatility trading has increased in the 21st century with the introduction of derivatives on the VIX index. Two such derivatives are VIX futures, which began trading on the CBOE in 2004, and VIX (European) options, which began trading on the CBOE in 2006. There are also exchange traded notes (ETNs) written on these futures, and options written on these ETNs. Pricing of VIX derivatives uses so-called market models, which are stochastic models for the futures term structure. One thing to keep in mind when using a market model is that the published VIX index is computed from European S&P500 (SPX) options, which means that the VIX is really an SPX derivative. Therefore, VIX prices may be partially determinable if there are established prices for SPX options. Moreover, pricing of SPX derivatives uses stochastic volatility models (SVMs) rather than market models. Hence, there is potential for conflicting prices if market models and SVMs are being used simultaneously to price VIX and SPX derivatives, respectively.

To understand why such a conflict would be a problem, consider a situation where a single financial institution has two separate trading floors: one for SPX derivatives and another for VIX derivatives. Each floor has its own traders who quote prices from their own respective models. If these models have substantially different assessments on the outcome of 30-day variance, then there could be inter-desk arbitrage, i.e., mispricings that allow for a third party (external to the institution) to take SPX derivative prices offered by the SPX desk and arbitrage them against VIX derivative prices offered by the VIX desk. A solution to this problem should provide a criterion for consistency of the models, and in a practical setting should provide a method for specification of one model in terms of the other. This paper presents such a solution.

1.1 Problem Formulation

Let $S_{t}$ denote the scalar price process for the SPX, and let $X_{t}$ denote a $d$ -dimensional factor process with $d$ a positive integer. Consider a model where SPX returns are given by a risk-neutral SVM,

[TABLE]

where $r\geq 0$ is the risk-free rate, $v(x)$ is a scalar-valued volatility function, $\mu(x)$ is a $d$ -dimensional drift, $\sigma(x)$ is a $d\times d$ diffusion matrix, $W_{t}$ is a $d$ -dimensional risk-neutral vector-valued Standard Brownian motion, and $B_{t}$ is a risk-neutral scalar Brownian motion, with correlations between $W_{t}$ and $B_{t}$ denoted with $\rho$ ,

[TABLE]

Denote by $(\mathcal{F}_{t})_{t\geq 0}$ the filtration generated by $W_{t}$ and $B_{t}$ . The VIX is the square root of the risk-neutral expected realized 30-day variance, which in the continuous diffusion model of (1) and (2) is

[TABLE]

where $\tau=$ 30 days, and with a VIX future being given by

[TABLE]

For this SVM it is clear that the asset price $S_{t}$ , the VIX, and all VIX futures are $\mathcal{F}_{t}$ -adapted Markov processes.

Separate from SVMs are market models that are designed to describe directly the VIX and VIX futures. Let $F_{t,T}$ be a market model’s price for a VIX future with maturity $T$ at time $t\leq T$ . These prices come from the following system of SDEs,

[TABLE]

where $W_{t}$ is the same Brownian motion from equation (2), and where the volatility $\nu(t,T)$ is an $\mathcal{F}_{t}$ -adapted $d$ -dimensional row vector function that is specific for a given $T$ . The model is applied simultaneously for multiple or a continuum of $T$ ’s, thereby forming an entire curve of VIX futures. It is important to keep in mind that equation (4) is generally a time inhomogeneous and non-Markovian model, but results in this paper apply to time-homogenous Markovian market models that are also driven by the factor process $X_{t}$ of equation (2). Section 2.1 will set forth assumptions for time homogeneity and Markovianity along with some explanation, but further discussion will come in Section 4 where it will be shown how there is essentially a contradiction when trying to specify a consistent Markovian SVM with a non-Markovian market model. Market models considered in this paper include: a Bergomi-type market model333This paper throughout will refer to the “Bergomi model” when perhaps it should say “a Bergomi-type model for VIX futures”, but this would be unwieldy so instead the name is used in a general sense. The original Bergomi model is for the curve of future instantaneous variances, not VIX directly. where $\nu(t,T)=\gamma^{*}e^{-\mathbf{k}(T-t)}\sigma$ with $\mathbf{k}$ and $\sigma$ being $d\times d$ positive-definite matrices and $\gamma$ a $d\times 1$ volatility vector; a 3/2 market model where all futures are an expectation of a VIX given by $F_{t,t}=1/X_{t}$ with $X_{t}$ being a Cox-Ingersol-Ross (CIR) process; a double Nelson (or double mean reverting model) similar to the model in [6]; a non-stationary model where $X_{t}$ is Brownian motion.

In practice, it is a good idea to use market models because VIX futures are very liquid with a richness of information for understanding the state of volatility. Therefore, it is sensible to first define a market model, and second to build an SVM with the structure of the market model taken into consideration. If the market model is Markovian with the same factors as the SVM, then the instantaneous variance $v^{2}(x)$ is the solution to an inverse problem. The formulation of this inverse problem is as follows: if the coefficients of the factor process $\mu(x)$ and $\sigma(x)$ are known for all $x$ , and the market model has provided the function $h_{0}(x)$ , then the inverse problem for $v^{2}(x)$ is expressed as

[TABLE]

In Section 2 it is shown that if the process $X_{t}$ is ergodic and has an invariant law relative to which its infinitesimal generator is symmetric, making $X_{t}$ reversible, and has also a spectral gap [4], then under some integrability conditions for $h_{0}^{2}$ there exists a unique $v^{2}(x)$ that is the solution to equation (5). If in addition the solution is non-negative, then the market model has provided a valid volatility function for equation (1), leaving the correlation coefficients $\rho_{1},\ldots,\rho_{d}$ as the only remaining parameters to be determined for the SVM. However, non-negativity of the solution is difficult to prove; this issue is explored further in the examples of Section 3. Similar non-negativity issues arise elsewhere in the literature, for example in quantum inverse scattering theory [13, 12] and in Fourier analysis [39]. For the most part, results in the literature proving positivity are less general than theory for existence of solutions.

1.2 Background Literature

Background for SVMs, including the Heston model, can be found in various books and papers, including [18, 23], and more general models involving jumps in [16]. The Bergomi model for future variance is introduced in [7, 8, 9], and a consistency condition for the drift in futures curves for variance is given in [10]. Term structure and the associated Heath-Jarrow-Morton (HJM) framework are discussed in [11, 22, 36, 37]. In [25] a market model with stochastic volatility is used to derive formulae for VIX futures.

In the past decade there has been a lot of research on joint models for SPX and VIX options, which includes some re-evaluation of widely-used SVMs and a search for new models to fit both markets. One such example is the so-called 3/2 model, which is analyzed in [5, 15] and is a popular choice because it is able to reproduce the increasing right-hand implied-volatility skew in VIX options. The search for a model to simultaneously calibrate an SVM to both SPX and VIX options is done in [6, 19] using a two-factor diffusion model; in [14] a numerically efficient model for joint calibration is proposed using affine jump diffusions; in [28] there is exploration of the Heston model with jumps revealing evidence in VIX options that suggests there are jumps in the volatility process; in [26] there is further exploration of the Heston model with jumps and the role played by the Feller condition; in [30] a displacement or volatility push-up is proposed to improve the joint fit of affine models; in [33] a model for joint calibration is proposed using a regime-switching extension of the Heston model [23]; in [17] it is proposed to use a Heston vol-of-vol model. A different approach is taken in [21] wherein a joint distribution from a non-parametric family is fit satisfying both the marginal distributions from VIX and SPX options, and it is shown to be arbitrage free. Non-model-specific analysis of the joint SPX and VIX markets includes [32] and the data analysis of future variance-swap rates in [31]. The problem of consistency between SVMs and market models is also studied in the PhD thesis of Alex Badran [2].

1.3 Results and Organization of this Paper

Section 2.1 introduces Definition 2.1, stating formally the meaning of consistency between an SVM and a market model. Section 2.2 has the main result of this paper, which is a theorem for the solvability of equation (5). Section 3 has examples of tractable models: Sections 3.1 and 3.2 explore the scalar and multivariate Bergomi models, respectively, for which the inverse problem has an explicit eigenfunction expansion; Section 3.3 looks at the market model where $\mbox{VIX}_{t}^{2}$ is a 3/2 process, which also has an explicit eigenfunction expansion; Section 3.4 looks at the double Nelson market model that is tractable with nice statistical features to fit the data but with an inverse problem that does not have a positive solution for all $x$ ; Section 3.5 looks at a non-stationary model with a Brownian motion factor process for which an application of Bochner’s theorem ensures non-negativity. Section 4 has further discussion on the issue of non-Markovian market models.

2 Definitions and Main Result

The value of a future contract with a fixed horizon is referred to as a constant maturity future (CMF), that is, for a constant $\theta\geq 0$ the CMF price with horizon $\theta$ given by the SVM is,

[TABLE]

and the CMF given by the market model is,

[TABLE]

Unlike regular futures, CMFs are not risk-neutral martingales. Instead, their differential has a non-zero drift,

[TABLE]

where $Y_{t}^{\theta}=\frac{\partial}{\partial T}\log(F_{t,T})\Big{|}_{T=t+\theta}$ . In the stationary case there is the following representation of $Y_{t}^{\theta}$ ,

[TABLE]

from which it is seen that stationarity requires some integrability of $\nu(t,t+\theta)$ and $\frac{\partial}{\partial T}\nu(t,T)\Big{|}_{T=t+\theta}$ . For example, $\nu(t,T)=\gamma e^{-k(T-t)}$ leads to stationary CMFs. The quantity $Y_{t}^{\theta}$ has a financial significance because it is the roll-yield of a trading strategy to track the CMF’s returns (see [1]).

2.1 Consistency

It is first necessary to define this paper’s meaning for consistency:

Definition 2.1 (Consistency).

Assume there exists a unique strong solution for the SDE appearing in equation (2) of the SVM, and there also exists unique strong solutions for the SDEs for CMFS given by (8). The SVM and the market model have consistent prices if the CMFs agree, that is, if

[TABLE]

where $h_{\theta}(x)$ is the SVM’s CMF as defined in (6).

Remark 1.

The essential step in confirming consistency between an SVM and a market model is to prove the statement expressed by equation (10). However, the SVM is such that $h_{T-t}(X_{t})=\mathbb{E}[h_{0}(X_{T})|\mathcal{F}_{t}]$ for all $t\geq 0$ and the market model is such that $F_{t,T}=\mathbb{E}[F_{T,T}|\mathcal{F}_{t}]$ for all $T\geq t$ , and so it is sufficient to show

[TABLE]

That is, the SVM and the market model share the same filtration $(\mathcal{F}_{t})_{t\geq 0}$ , and both $h_{T-t}(X_{t})$ and $F_{t,T}$ are martingales by construction, and so all that needs to be checked is that the models have agreement between their respective VIX processes.

If Definition 2.1 does not hold then there is potential for arbitrage. For example in [32] it is shown how prices should respect certain structural bounds, otherwise there are options portfolios that produce arbitrage. From Definition 2.1, the first thing to notice is that solutions to equation (8) are functions of $X_{t}$ and do not depend separately on $t$ . Thus, Markovian SVM future prices need to be equal to those of the market model, and it stands to reason that the market model should also be a Markov process. Therefore, the simplest approach is to assume that both the SVM and the market model are Markovian and both driven by the factor process $X_{t}$ ; discussion and a counterexample related to this issue will come in Section 4.

The second thing to notice from Definition 2.1 is that time homogeneity in the SVM implies time homogeneity in the market model. The reason being, that the differential of $h_{\theta}(X_{t})$ obtained from Itô’s lemma and the SDE in (2) have time-independent coefficients, and therefore the differential of $F_{t,t+\theta}$ must also have time-independent coefficients.

Assumption 2.1 (Time-Homogeneous Markovian Market Model Driven by $X_{t}$ ).

The market model with CMFs given by equation (8) is a Time-Homogeneous Markov model driven by the same factor process as the SVM. In particular, the CMF $F_{t,t+\theta}$ has roll-yield functions $f_{\theta}(x)$ and volatility row vector functions $\nu_{\theta}(x)$ for each horizon $\theta\geq 0$ , such that the CMF dynamics are,

[TABLE]

where $X_{t}$ is the factor process given by equation (2). Moreover, there is an initial curve $F_{0,\theta}$ such that $F_{0,\theta}=\mathbb{E}[F_{\theta,\theta}|\mathcal{F}_{0}]$ for all $\theta\geq 0$ , and there is an initial value $X_{0}$ such that $X_{0}\in h_{0}^{-1}(F_{0,0})$ a.s.

At this point it is appropriate to formally state sufficient conditions on the SDE coefficients.

Assumption 2.2.

Coefficients $\mu(x)$ and $\sigma(x)$ are globally Lipschitz continuous so that equation (2) has a strong solution. Coefficients $f_{\theta}(x)$ and $\nu_{\theta}(x)$ are bounded so that (11) and (12) have unique strong solutions and the futures of (12) are true martingales.

From Assumptions 2.1 and 2.2 it is assured that Definition 2.1 is meaningful. Assumption 2.1 is necessary for the theory in this paper, but Assumption 2.2 is not always needed. Indeed, there are several important non-Lipschitz examples, such as the Heston SVM, or various other models where $X_{t}$ is a CIR process (see the 3/2 model of Section 3.3). Assumption 2.2 asserts boundedness of $f_{\theta}(x)$ and $\nu_{\theta}(x)$ , but a less restrictive criterion is for $f_{\theta}(x)$ to allow a well-defined Riemann integral $\int_{0}^{t}f_{\theta}(X_{u})du$ and for $\nu_{\theta}(x)$ to satisfy the Novikov condition,

[TABLE]

Assumption 2.1 says that the roll-yields from equation (8) are now functions of $X_{t}$ , namely, $Y_{t}^{\theta}=f_{\theta}(X_{t})$ for all $\theta\geq 0$ . Given Assumption 2.1, the market model’s future dynamics can be rewritten as

[TABLE]

Under Assumptions 2.1 and 2.2, Itô’s lemma can be applied to check whether or not a model satisfies the consistency of Definition 2.1 and equation (10). Indeed, denote the operator $\mathcal{L}$ ,

[TABLE]

which is the infinitesimal generator of the factor process $X_{t}$ . If $h_{\theta}(x)$ has sufficient differentiability, then $dh_{\theta}(X_{t})$ is set equal to the right-hand side of (11) to obtain the following pair of consistency equations,

[TABLE]

with the initial condition satisfying and $X_{0}\in h_{0}^{-1}(F_{0,0})$ .

Remark 2 (Buehler’s Condition).

Equation (14) is Buehler’s condition, which was identified for expected variance in [10].

Remark 3 (Initializing Curve Models with Market Data).

Practical use of market models often involves the insertion of VIX curve data as an initial condition. The advantage to this approach is that the model is able to directly assimilate futures prices observed in the market. If the initial curve $F_{0,\theta}$ is given, then the solution to equation (11) is

[TABLE]

which may not be a time-homogeneous market model driven by $X_{t}$ if the initial conditions stated in Assumption 2.1 are not satisfied, that is, time inhomogeneity could arise if the initial curve cannot be written as a function of $X_{0}$ . Within the framework of Assumption 2.1, a time-homogeneous market model can be initialized with curve data $F_{0,\theta}$ if $F_{0,\theta}=\mathbb{E}[F_{\theta,\theta}|\mathcal{F}_{0}]$ and if there exists $X_{0}$ with $X_{0}\in h_{0}^{-1}(F_{0,0})$ . In practice, the dimension of $X_{t}$ should be sufficiently high so that $h_{0}^{-1}(F_{0,0})$ contains $X_{0}$ , i.e., so that there exists $x$ in the domain of $X_{0}$ with $x\in h_{0}^{-1}(F_{0,0})$ .

2.2 Main Result: Markovian Inverse Problem for $v^{2}(x)$

Let $h(x)=h_{0}(x)$ denote the VIX. Suppose that $h(x)$ and the market model satisfy Definition 2.1 and Assumption 2.1. A function $v^{2}(x)$ should be found for consistent specification of the SVM. If $h(x)$ is already known, then finding $v^{2}(x)$ amounts to solving an inverse problem,

[TABLE]

where a solution is a function $v^{2}:\mathbb{R}^{d}\rightarrow\mathbb{R}$ that satisfies equation (16). This solution admits a valid SVM if it is non-negative.

2.2.1 General Solvability

The inverse problem can be solved for a general class of factor processes. Let the factor process $X_{t}$ be a stationary ergodic process with infinitesimal generator $\mathcal{L}$ given by (13). Let $\omega$ denote $X_{t}$ ’s invariant measure. Here and in the sequel, expectation with respect to the invariant measure for any (integrable) test function $g$ is denoted by

[TABLE]

and all calculations to come will follow the analytical framework of semigroups for diffusions defined in [4]. It will be necessary to assume existence of a unique invariant measure and that the operator $\mathcal{L}$ has a spectral gap:

Assumption 2.3 (Unique Invariant Measure).

There is a unique invariant measure $\omega$ such that $\left<\mathcal{L}g\right>=0$ for any test function $g(x)$ .444Conditions for existence of a unique invariant measure are given in [34]. They include boundedness and uniform ellipticity of matrices $\sigma\sigma^{*}(x)$ , and also that $\lim\sup_{\|x\|\rightarrow\infty}x^{*}\mu(x)\leq-c\|x\|^{1+\alpha}$ for some $c>0$ and $\alpha\geq-1$ .

Assumption 2.4 (Spectral Gap).

The operator $\mathcal{L}$ is symmetric, that is, $\left<g_{1}\mathcal{L}g_{2}\right>=\left<g_{2}\mathcal{L}g_{1}\right>$ for any test functions $g_{1}(x)$ and $g_{2}(x)$ , with a spectrum that is non-positive with a gap at zero. In other words, there is a constant $\lambda>0$ such that,

[TABLE]

for all $t\geq 0$ and for any $g(x)$ such that $\left<g\right>=0$ and $\left<g^{2}\right><\infty$ . Here $e^{\mathcal{L}t}g$ denotes the contraction semigroup generated by $\mathcal{L}$ , and given by

[TABLE]

for bounded $g(x)$ as well as for square integrable ones.

Clearly $|e^{\mathcal{L}t}g(x)|\leq\sup_{y}|g(y)|$ and also $\left<(e^{\mathcal{L}t}g)^{2}\right>\leq\left<g^{2}\right>$ for all suitable $g(x)$ , $t\geq 0$ . Conditions on the symmetric diffusion generator $\mathcal{L}$ to have a spectral gap are given in [3, 4], with the Ornstein-Uhlenbeck generator being the canonical case that motivates the more general theory555The theory of Pardoux and Veretennikov [34] can also be used for Theorem 2.1.. The examples of Section 3 explore further the scope of the theory.

Theorem 2.1 (General Solvability of Inverse Problem).

Assume $h^{2}(x)$ is such that $\left<h^{4}\right><\infty$ and $\left<(\mathcal{L}h^{2})^{2}\right><\infty$ , where $\mathcal{L}$ is the operator from equation (13). Given Assumptions 2.3 and 2.4, a square-integrable solution to equation (16) exists.

Proof of Theorem 2.1.

By writing the solution as $v^{2}(x)=\left<h^{2}\right>+\xi(x)$ , the inverse problem of equation (16) can be rewritten as

[TABLE]

where the operator $\Phi$ is defined by

[TABLE]

Using the invariant measure it is clear the solution $\xi$ is now centered,

[TABLE]

because $0=\left<h^{2}-\left<h^{2}\right>\right>=\left<\Phi\xi\right>=\int(\Phi\xi)d\omega=\int\xi d\omega=\left<\xi\right>$ , and the inverse problem is posed as

[TABLE]

The operator $\Phi$ is an averaging operator, and so it stands to reason that $h^{2}(x)$ is more regular than $\xi(x)$ . The operator $\mathcal{L}$ is applied to both sides of equation (19), and because by assumption the quantity $\mathcal{L}h^{2}(x)$ is well defined, it follows that

[TABLE]

Using the algebraic properties of the semigroup operator (see [4, 38]),

[TABLE]

which can be rearranged to obtain,

[TABLE]

and due to the spectral gap of Assumption 2.4 the solution can be written with a (convergent) geometric series,

[TABLE]

Note the solvability assumption: given the spectral gap there is a solution if and only if $\left<\mathcal{L}h^{2}\right>=0$ , which is the same as equation (22) after applying consistency equations (14) and (15). In addition, it is needed to use the fact that,

[TABLE]

Uniqueness of a square integrable solution with $\left<\xi\right>=0$ also follows from equation (20): for any two solutions $\xi(x)$ and $\xi^{\prime}(x)$ having $\left<\xi^{2}\right>+\left<\xi^{\prime 2}\right><\infty$ it must be that $\mathcal{L}\Phi\xi(x)=\mathcal{L}\Phi\xi^{\prime}(x)$ , or $(I-e^{\mathcal{L}\tau})\xi(x)=(I-e^{\mathcal{L}\tau})\xi^{\prime}(x)$ . By inverting the operator $I-e^{\mathcal{L}\tau}$ it is clear that $\xi(x)=\xi^{\prime}(x)$ for a.e. $x$ .

Multiplying both sides of equation (20) by $\xi(x)$ and taking brackets yields,

[TABLE]

From symmetry of $\mathcal{L}$ and the spectral gap in equation (17) there is the following estimate,

[TABLE]

which is inserted into the previous equation to obtain,

[TABLE]

Rearranging and applying Cauchy-Schwartz yields the estimate,

[TABLE]

which for $\lambda>0$ is rearranged to obtain an estimate on the norm of the solution,

[TABLE]

The bound (21) shows that the solution is square integrable against the invariant density, given our assumptions about $h^{2}(x)$ and the spectral gap. ∎

Remark 4.

The implication of Theorem 2.1 is that, if $h(x)$ is given by a market model and a unique solution to equation (16) is non-negative for a.e. $x$ , and if equation (2) has a strong solution, then there is an SVM that is consistent in the sense of Definition 2.1.

Remark 5.

It may be the case that equation (16) is solvable but does not have a solution that is non-negative for a.e. $x$ , even though it is denoted by $v^{2}(x)$ because that is how the problem is posed. In this case, for the proposed market model there does not exists an SVM that is consistent in the sense of Definition 2.1.

Remark 6.

If a square-integrable solution $v^{2}(x)$ to equation (16) exists, and if the SVM and market model are consistent (in the sense of Definition 2.1), then from the consistency equations of (14) and (15) there is the following solvability condition for the inverse problem,

[TABLE]

where $f(x)$ is the roll yield and $\nu(x)$ the volatility in (11) with $\theta=0$ .

Remark 7.

The solvability condition in equation (22) is analogous to the Fredholm alternative in finite Euclidean space (see [4, 38]). It is an integral condition that involves the roll yield $f(x)$ and volatility of the market model $\nu(x)$ , the VIX $h(x)$ , and the invariant measure of the factor process $\omega$ .

Remark 8 (Symmetric Operators).

For $X_{t}$ given by equation (2), if there is an invariant density $\omega(x)$ , then the operator $\mathcal{L}$ of equation (13) is symmetric if there are matrices $A(x)$ such that $\mathcal{L}$ can be written in self-adjoint form,

[TABLE]

for any test function $g(x)$ . In other words, $\sigma(x)$ and $\mu(x)$ need to satisfy

[TABLE]

This shows us that the symmetry of Assumption 2.4 is somewhat restrictive. However, symmetry is not always required to solve the inverse problem, as will be shown in the examples of Section 3.

2.2.2 Solution via Eigenseries Expansion

If the operator $\mathcal{L}$ has a complete basis of orthogonal eigenfunctions, then so does $\Phi$ given in (18), and then the solution to the inverse problem (16) can be found by computing eigencoefficients in a series expansion of $v^{2}(x)$ . For many such cases there are transition densities for the factor process $X_{t}$ given $X_{0}$ , and so equation (16) can be written using a kernel,

[TABLE]

where the kernel is,

[TABLE]

Suppose there are eigenfunctions $\psi_{n}:\mathbb{R}^{d}\rightarrow\mathbb{R}$ such that for an index value $n\in\{0,1,2,\dots\}$ ,

[TABLE]

where $\lambda_{n}\neq 0$ , and suppose there is an invariant density $\omega(x)>0$ such that any pair is orthogonal,

[TABLE]

Suppose additionally that these eigenfunctions form a complete basis in $L^{2}(\mathbb{R}^{d};\omega)$ , i.e,. if $\left<v^{4}\right><\infty$ then there are coefficients $a_{0},a_{1},a_{2}\dots$ such that,

[TABLE]

If $v^{2}$ is the solution to the inverse problem then there is eigenseries expansion,

[TABLE]

and via orthogonality the $a_{n}$ ’s are solved for,

[TABLE]

This provides a (unique) solution to equation (16).

Remark 9.

The eigenfunction expansion presented in this section can be reformulated without the assumption of a transition density; see [29] for spectral theory of general semi-group operators.

3 Application to Tractable Models

This section presents some examples of models that are applicable in practice, i.e., simulation, numerics, data calibration, etc., can be done within a reasonable amount of time. All models considered are Markov with strong SDE solutions, as per Assumptions 2.1 and 2.2. It will be assumed that $h_{\theta}(x)=F_{t,t+\theta}$ for all $\theta\geq 0$ , and then the emphasis will be placed on calculations for finding the solution to the inverse problem of equation (16).

3.1 The Scalar Bergomi Model

For $d=1$ the Bergomi market model has the volatility function,

[TABLE]

where $\gamma$ is a scalar constant, and $\sigma>0$ and $\kappa>0$ . Define the factor process to be the Ornstein-Uhlenbeck (OU) process $X_{t}$ given by

[TABLE]

which has invariant density

[TABLE]

for this model the drift and diffusion are $\mu(x)=-\kappa x$ , $\sigma(x)=\sigma$ . Given $X_{t}$ , the market model’s futures price is

[TABLE]

where $F^{\infty}=\lim_{T\rightarrow\infty}F_{t,T}$ and is also a model parameter; it is straightforward to check that this expression for $F_{t,T}$ satisfies the market-model equation $dF_{t,T}=F_{t,T}\nu(t,T)dW_{t}$ . For this model, the roll-yields of equation (11) are

[TABLE]

and the volatilities take the form $\nu_{\theta}(X_{t})=\gamma\sigma e^{-\kappa\theta}$ . The consistency equation (15) can be solved to obtain $h_{\theta}(x)=h_{\theta}(0)\exp\left(\gamma e^{-\kappa\theta}x\right)$ , and it is easily verified that the solvability condition (22) holds.

The inverse problem in (16) is solved by an eigenfunction expansion. The OU process has a complete orthogonal basis of eigenfunctions given by the Hermite polynomials. Hence, the inverse problem is solved with an eigenseries expansion like that of Section 2.2.2.

Consider the process

[TABLE]

where $dW_{t}dW_{t}=dt$ . The generator of this process is

[TABLE]

and the eigenfunctions of $\mathcal{L}$ satisfy equations,

[TABLE]

where each $\psi_{n}$ is a Hermite polynomial,

[TABLE]

i.e.,

[TABLE]

Theses polynomials are orthogonal with respect to $Z_{t}$ ’s invariant measure,

[TABLE]

where

[TABLE]

These eigenfunctions form a complete orthogonal basis in $L^{2}(\mathbb{R};\omega)$ , and are convenient because,

[TABLE]

The transition density for the $Z_{t}$ ’s is the following kernel,

[TABLE]

and when applied to the Hermite polynomials,

[TABLE]

which yields the eigenvalues

[TABLE]

For the scalar Bergomi model driven by the OU process $X_{t}$ with mean-reversion rate $\kappa$ and diffusion parameter $\sigma$ , there is the following weak equivalence with $Z_{t}$ ,

[TABLE]

Define the scaled domain variance function,

[TABLE]

and then notice

[TABLE]

If the SVM and market model are consistent, then $\mbox{VIX}_{t}^{2}=h^{2}(X_{t})$ is given explicitly by the market model,

[TABLE]

Then, in terms of $z$ and the scaled eigenfunction $\tilde{v}^{2}(z)$ , the solution to the inverse problem has the expansion,

[TABLE]

and the inverse problem (16) can be written in terms of the scaled variable and variance function,

[TABLE]

for all $z\in\mathbb{R}$ . Then using orthogonality the coefficients are,

[TABLE]

This is clearly an expansion convergent in $L^{2}(\mathbb{R};\omega)$ and uniformly on compact sets. Finally, in terms of $x$ the solution is,

[TABLE]

This expansion is also convergent in $L^{2}$ and uniformly on compact sets. Numerical calculations indicate that the solution $v^{2}(x)$ is positive and therefore there is an acceptable volatility function. It is interesting to note that the market model for the VIX is an exponential function, leading to an exponential OU VIX futures process. However, the consistent SVM in this case does not have an exponential OU volatility function. Numerical calculations show that the instantaneous variance $v^{2}(x)$ has exponential-like behavior but is not an exact exponential. Figure 1 shows a numerical example of the simulated VIX and the recovered volatility funciton in this scalar OU example.

3.2 The Multi-Factor Bergomi Model

VIX futures from the multidimensional Bergomi model are given by

[TABLE]

where $\mathbf{k}$ is a $d\times d$ matrix with positive eigenvalues, $\sigma$ is a $d\times d$ constant matrix, $W_{t}$ is $d$ -dimensional uncorrelated Brownian motion, and $\gamma$ is a $d\times 1$ vector. Let $X_{t}$ be the multidimensional OU process given by

[TABLE]

To ensure stationarity of $X_{t}$ , it is enough to assume that the eigenvalues of $\mathbf{k}$ have positive real parts and that $(-\mathbf{k},\sigma)$ is a controllable pair, i.e.,

[TABLE]

Under these assumptions the distribution of the OU process $X_{t}$ will converge to a stationary state. The invariant density of $X_{t}$ is a $d$ -dimensional Gaussian density with mean zero and $d\times d$ covariance matrix $\Sigma$ is

[TABLE]

which is finite and non-singular if the pair is controllable. From the integral formula of (23) it is seen that $\Sigma$ satisfies the stationary Lyapunov equation,

[TABLE]

Thus, the solution to the market model’s SDE for $F_{t,T}$ is

[TABLE]

where $F^{\infty}=\lim_{T\rightarrow\infty}F_{t,T}$ . For this multidimensional model, the log-future’s derivative with respect to $T$ is

[TABLE]

The volatility function of equation (11) is

[TABLE]

The formula for the VIX is explicit and obtained from (15) (up to the initial value),

[TABLE]

As in the scalar case of Section 3.1, it is easily verified that solvability condition (22) holds here.

When $\mathbf{k}$ is diagonalizable with linearly independent eigenvectors then the generator $\mathcal{L}$ has a discrete set of eigenvalues and a complete bi-orthogonal (in general) basis of eigenfunctions given by multivariate Hermite polynomials (see [24, 27, 40]), and therefore the method of Section 2.2.2 applies even though the generator is in general not symmetric in the sense of Assumption 2.4.

As an example, consider the 2-dimensional model from [1, 7], where the factors are

[TABLE]

with $\kappa_{i}>0$ for $i=1$ and $2$ , $dW_{t}^{1}dW_{t}^{2}=\rho dt$ , the VIX being

[TABLE]

where it is assumed for simplicity that $\gamma=\frac{1}{2}\mathbf{1}$ with $\mathbf{1}=(1,1)^{*}$ . The generator of $X_{t}$ is

[TABLE]

and the invariant density is

[TABLE]

where

[TABLE]

so that (24) holds. Following [40], the eigenfunctions $\phi_{n}$ for the adjoint operator $\mathcal{L}^{*}$ are

[TABLE]

where $n_{1}$ and $n_{2}$ are non-negative integers; notice that $\mathcal{L}^{*}\omega=0$ . These $\phi_{n}$ ’s are the solutions to the equations

[TABLE]

where $\alpha_{n}=n_{1}\kappa_{1}+n_{2}\kappa_{2}$ . Then, the eigenfunctions $\psi_{n}$ for the operator $\mathcal{L}$ are multivariate Hermite polynomials, which are

[TABLE]

and satisfy the equation

[TABLE]

each of these $\psi_{n}$ ’s is a polynomial of degree equal to $n_{1}+n_{2}$ . In this case the transition-density kernel is

[TABLE]

where $X_{t}\leq y$ denotes element-wise inequality, and when applied to the multivariate Hermite polynomials, similar to the scalar OU example of Section 3.1, there are eigenvalues

[TABLE]

The set of $\psi_{n}$ ’s forms a complete basis in $L^{2}(\mathbb{R}^{2};\omega)$ , which satisfy a bi-orthogonality relation relative to a second basis. Define this second set of basis functions to be

[TABLE]

which are bi-orthogonal in the sense that

[TABLE]

Denoting $\mathbf{1}=(1,1)^{*}$ , the inverse problem is

[TABLE]

for all $x\in\mathbb{R}^{2}$ , which via the bi-orthogonality relation has the solution

[TABLE]

As with the scalar Bergomi, it is not needed to check for solvability, existence or uniqueness because the solution has eigencoefficients that are explicit. Figures 2 and 3 show the simulation of this 2-factor Bergomi model along with the recovered $v(x)$ , which is appears to be positive, and Figure 4 looks at the difference $Q(x)=v(x)-h(x)$ to gain a sense of the differing factor sensitivities in $v(x)$ and VIX function $h(x)$ . The approximated $v(x)$ uses all multivariate Hermite polynomials up to and including powers of 6, $v(x)\approx\sqrt{\sum_{\mathcal{N}_{6}}a_{n}\psi_{n}(x)}$ where $\mathcal{N}_{6}=\{n:n_{1}+n_{2}\leq 6\}$ . Using only 6-degree polynomials is sufficiently accurate, as the average error in approximating is of order $10^{-6}$ , i.e., $\sqrt{\frac{1}{|\mbox{x}|}\sum_{i,j}\left(\sqrt{\sum_{\mathcal{N}_{6}}a_{n}\psi_{n}(\mbox{x}_{ij})}-h(\mbox{x}_{ij})\right)^{2}}=\mathcal{O}(10^{-6})$ where $\mbox{x}_{ij}$ denotes a discrete evaluation point in $\mathbb{R}^{2}$ and $|\mbox{x}|$ denotes the total number of discrete points evaluated. Notice that $\int v^{2}(x)\omega(x)dx=\int h^{2}(x)\omega(x)dx$ (to see why multiply both sides of (16) by $\omega(x)$ and integrate). From this surface plot it can be seen that rises in the persistent factor $x_{1}$ have more effect on VIX than on $v(x)$ when the fast-mean-reverting factor is low (i.e., when $x_{2}<0$ ); this is seen in the corner of the surface plot where $Q(x_{1},x_{2})$ is most negative. This is an interesting caveat of the solution to the inverse problem, as it says that the VIX can be more persistent than instantaneous volatility, but this should not be too much of a surprise because VIX is the square-root of the expectation of a moving average of square instantaneous volatility.

3.3 The $3/2$ Model

Consider a market model constructed upon the squared VIX being a 3/2 process,

[TABLE]

where $X_{t}$ is a CIR process,

[TABLE]

with $\frac{2\kappa\bar{x}}{\sigma^{2}}>2$ .666The model proposed in this section is similar to that used in [20], wherein $\mbox{VIX}_{t}=1/X_{t}$ , which could be done here as well but will require $\frac{2\bar{x}\kappa}{\sigma^{2}}>4$ to have $L^{2}$ integrability of the series expansion. Applying Itô’s lemma yields

[TABLE]

from which the 3/2 power in the diffusion is seen, thus giving the process $V_{t}$ its name. Note that this 3/2 model is based on Assumption 2.2 because $X_{t}$ ’s SDE has non-Lipschitz coefficients. However, $X_{t}$ does have strong solutions, and the futures are $F_{t,T}=\mathbb{E}[\sqrt{V_{T}}|\mathcal{F}_{t}]$ for all $t\leq T$ , which are martingales by construction. Therefore, Assumption 2.2 is not needed and Definition 2.1 for consistency applies to this model.

Consider first the normalized CIR process, which has a complete orthogonal basis of eigenfunctions for its generator, given by the generalized Laguerre polynomials. Hence, the inverse problem is again solved with an eigenseries expansion and the method of Section 2.2.2 applies. Consider the normalized CIR process,

[TABLE]

where $\alpha>0$ . The generator of this process is

[TABLE]

and the eigenfunctions of $\mathcal{L}$ satisfy equations

[TABLE]

where each $\psi_{n}$ is a generalized Laguerre polynomial,

[TABLE]

that is,

[TABLE]

These polynomials are orthogonal with respect to $Z$ ’s invariant measure,

[TABLE]

where

[TABLE]

and

[TABLE]

with $\Gamma(\alpha)$ the Gamma function evaluated at $\alpha>1$ . These eigenfunctions form a complete orthogonal basis in $L^{2}(\mathbb{R}^{+};\omega)$ , and are convenient because

[TABLE]

For the CIR process $X_{t}$ defined above, there is the following weak equivalence with a scaled $Z_{t}$ ,

[TABLE]

with $\alpha=\frac{2\bar{x}\kappa}{\sigma^{2}}-1$ . Define also the scaled domain variance or volatility function,

[TABLE]

and then notice

[TABLE]

Therefore it is useful to define the kernel for the $Z_{t}$ ’s, $\Phi_{z}(y,z)=\frac{1}{\tau}\int_{0}^{\tau}\frac{\partial}{\partial y}\mathbb{P}(Z_{\kappa t}\leq y|Z_{0}=z)dt$ , and when applied to the Laguerre polynomials, similar to the scalar OU example,

[TABLE]

there are the eigenvalues,

[TABLE]

Hence, if the SVM and market model are consistent, then $\mbox{VIX}_{t}^{2}=h^{2}(X_{t})$ is given explicitly by the market model,

[TABLE]

which is in $L^{2}(\mathbb{R}^{+};\omega)$ if $\alpha>1$ . Then, in terms of $z$ and the scaled function $\tilde{v}^{2}(z)$ , the solution to the inverse problem has the expansion,

[TABLE]

and therefore

[TABLE]

for all $z>0$ . Using orthogonality, the coefficients are

[TABLE]

For $n$ large there is the behavior $a_{n}\approx n^{-\alpha+1}$ , which requires $\alpha>2$ for square integrability of the expansion of $\tilde{v}^{2}(z)$ . Finally, in terms of $x$ , the solution is

[TABLE]

Figure 5 shows a simulation of the 3/2 process and the two approximations of the recovered function $v(x)$ using 25 and 30 Laguerre polynomials. In the figure, the simulation is run for 10 years with time step $\Delta t=1/365$ , and produces empirical statistics $\frac{1}{N}\sum_{i=1}^{N}\mbox{VIX}_{t_{i}}=19.89\%$ and $\hbox{mode}_{i\leq N}(\mbox{VIX}_{t_{i}})=16.0\%$ (in the summation $N=10\times 365=3,650$ ). The figure shows a recovered $v^{2}(x)$ from which it is clear that, compared to the VIX, instantaneous volatility is more affected by low values of $X_{t}$ ; i.e., stochastic volatility is more sensitive to the left-hand tail distribution of $X_{t}$ . Note also from the figure that the numerical solution is positive.

3.4 The Double Nelson Model

Consider the 2-dimensional mean reverting process $X_{t}=(X_{t}^{1},X_{t}^{2})$ with dynamics,

[TABLE]

where $\bar{x}>0$ , $\kappa_{1}>0$ , $\kappa_{2}>0$ , and $dW_{t}^{1}dW_{t}^{2}=\rho dt$ . This is the double Nelson model, which is the continuous-time limit of a double GARCH model. Defining the VIX to be

[TABLE]

the futures curve $F_{t,T}=\mathbb{E}[h(X_{T})|X_{t}]$ is

[TABLE]

This is a market model for which the inverse problem will look to find $v^{2}(x)$ from an SVM driven by the same factors $X_{t}^{1}$ and $X_{t}^{2}$ .

This model’s infinitesimal generator is not symmetric and so the general theory of Theorem 2.1 does not apply directly. However, the factor process satisfies a linear system of stochastic differential equations for which there are closed equations for moments of all orders, and so the solvability condition given by (22) from Section 2.2.1 can be applied.

The zero-maturity roll yield is

[TABLE]

the volatility is $\nu(t,t)=\sigma_{1}$ , and so the solvability condition of equation (22) is

[TABLE]

Invariant moments can be calculated using Itô’s lemma and then taking expectations,

[TABLE]

Hence, provided that $2\kappa_{1}-\sigma_{1}^{2}>0$ and $2\kappa_{2}-\sigma_{2}^{2}>0$ to ensure that $X_{t}$ has finite (invariant) second moments, and that $\left<x_{1}x_{2}\right>$ is finite,

[TABLE]

it follows that equation (26) holds.

The inverse problem is

[TABLE]

with $h^{2}(x)=x_{1}^{2}$ , and is solved explicitly by looking for the solution in the form,

[TABLE]

The coefficients $a_{ij}$ and $b_{i}$ for $i,j=1,2$ are obtained by solving explicitly for the moments $u_{11}(t)=\mathbb{E}[(X_{t}^{1})^{2}|X_{0}=x],~{}u_{12}(t)=\mathbb{E}[X_{t}^{1}X_{t}^{2}|X_{0}=x]\ ,\ldots\ ,$ which satisfy a linear system of ordinary differential equations obtained by Ito’s formula from the stochastic differential equations of the factor process (25),

[TABLE]

Note that the invariant moments obtained above are simply the limit of these moments as $t\to\infty$ , and this requires that the relations between $\kappa_{1},\kappa_{2},\sigma_{1},\sigma_{2},\rho$ introduced above hold here too. Hence,

[TABLE]

and by adjusting the coefficients $a_{11},a_{12},\ldots$ the solution is found to be $v^{2}(x)=v^{2}(x_{1},x_{2})$ for $x_{1}\geq 0$ and $x_{2}\geq 0$ , which is a quadratic polynomial in $(x_{1},x_{2})$ . However, this solution will not be non-negative and therefore it is not acceptable for an SVM.

To see how the solution $v^{2}(x_{1},x_{2})$ can go negative, consider the simplified inverse problem,

[TABLE]

which requires only a linear expression for its solution

[TABLE]

Thus

[TABLE]

with

[TABLE]

Inserting these expressions and doing the time averaging it is seen that in order to solve the inverse problem it must be that

[TABLE]

so that the coefficient of $x_{1}$ on the right is one. Then taking $b_{2}$ to make the coefficient of $x_{2}$ equal to zero, this leads to

[TABLE]

After solving for $b_{1}$ and $b_{2}$ , the constant $c$ equals to the remaining terms. Finally it is seen that $b_{2}$ is negative for any $\kappa_{1},\kappa_{2}$ , and this makes the solution $v^{2}(x)=b_{1}x_{1}+b_{2}x_{2}+c,~{}x_{1}\geq 0,x_{2}\geq 0,$ take negative values for $x_{1}$ near [math] and $x_{2}$ large. This indicates that there cannot be consistency in the sense of Definition 2.1.

3.5 Non-Negative Solutions for Brownian Motion Factor

Consider another example that does not have the stationarity of Assumptions 2.3 and 2.4, but instead is a market model driven by Brownian motion $Z_{t}$ . This is an example that has a general condition on $h^{2}$ to ensure non-negativity of the recovered volatility function.

The inverse problem is

[TABLE]

where $Z_{t}$ is standard Brownian motion. The Fourier transform is used to solve this problem. The space to consider is $L^{2}(\mathbb{R},dz)$ , and the Fourier elements are

[TABLE]

which have generalized orthogonality with the delta function $\delta(k-k^{\prime})=\frac{1}{2\pi}\int e^{ik^{\prime}z}e^{-ikz}dz$ . The market model’s VIX function and the SVM function have Fourier transforms

[TABLE]

The Fourier basis is used to transform the inverse problem,

[TABLE]

Hence, the solution to the problem is

[TABLE]

If $k^{2}\widehat{h^{2}}(k)$ is in $L^{2}(\mathbb{R},dk)$ , then Parseval’s identity says the solution $v^{2}(z)$ is in $L^{2}(\mathbb{R},dz)$ ,

[TABLE]

If $\widehat{v^{2}}(k)$ is continuous and positive definite, that is, if for any $k_{\ell}\in\mathbb{R}$ and $c_{\ell}\in\mathbb{C}$ for $\ell=1,2,3,\dots,M$ for $M$ any positive integer,

[TABLE]

then Bochner’s theorem applies [35] and $v^{2}(z)=\frac{1}{\sqrt{2\pi}}\int e^{ikz}\widehat{v^{2}}(k)dk$ is non negative. This is a general criterion for the solution to the inverse problem to be non-negative, but this application of Bochner’s theorem is special to the case of Fourier eigenfunctions.

To further illustrate, consider the specific example

[TABLE]

and with $Z_{t}$ a standard Brownian motion. For inverse problem

[TABLE]

it is easy to check,

[TABLE]

Hence, there is non-negative solution if $c\geq\frac{\tau\gamma}{2}$ .

4 Non-Markovian Market Models

Let $h_{\theta}(x)$ denote the CMFs derived from the Markovian SVM. Suppose that Assumption 2.1 does not hold so that it’s possible to have non-Markovian dynamics. Then to check for the consistency of Definition 2.1, there are the following pair of equations that are the generalization of (14) and (15),

[TABLE]

where $Y_{t}^{\theta}$ is the roll yield as shown in equation (8). From equations (27) and (28) it should be clear that a Markovian representation of the market model must be imposed. Namely, $Y_{t}^{\theta}=f_{\theta}(X_{t})$ where $f_{\theta}(X_{t})$ equals the left-hand side of equation (27), and $\nu_{\theta}(t)=\nu_{\theta}(X_{t})$ where $\nu_{\theta}(X_{t})$ equals the transpose of the left-hand side of equation (28).

4.1 Scalar Consistency with Constant $\nu_{\theta}(t)$

Consider the case where $X_{t}$ and $W_{t}$ in equation (2) are scalar processes. Suppose that $\nu_{\theta}$ is a scalar, constant deterministic function,

[TABLE]

Then solving equations (27) and (28) leads to the following VIX futures and roll yields,

[TABLE]

It is assumed in this equation that $\sigma(x)$ is strictly positive and its inverse is integrable.

4.2 An Inconsistent Example

There are non-trivial cases where there is a violation of the scalar consistency formula of equation (29). For example, suppose there is algebraic decay in the market model’s volatility function,

[TABLE]

Then the SDE for the CMF can be computed via Itô’s lemma, which yields the following roll yield,

[TABLE]

There is no function of the Markov process $X_{t}$ that can equal this process almost surely, as $Y_{t}^{\theta}$ itself is not a Markov process. Hence, formula (29) cannot hold.

5 Summary and Conclusion

The achievement of this paper is the derivation of a consistent SVM for the SPX given a market model for the VIX. The main result is Theorem 2.1, which gives conditions for the unique determination of the volatility function of the SVM from a VIX function given by the market model, provided both models are driven by the same underlying stationary ergodic factor process. The theorem’s conditions involve moments of the VIX function, the uniqueness of the invariant measure of the factor process, and require that the operator semi-group have a spectral gap. At the time of this article there are no known structural conditions that will make the resulting volatility function non-negative, and therefore no theoretical guarantees for consistency can be made. There are special cases where positivity can be guaranteed, such as models where $X_{t}$ is a Brownian motion. Detailed analysis and numerical calculations for several market models indicate that for the commonly used Bergomi market models (Sections 3.1 and 3.2) the volatility function appears to be positive. For another market model where square VIX is the reciprocal of a CIR process (Section 3.3), the volatility is again shown numerically to be positive. The double Nelson model in Section 3.4 is a counter example, wherein the market model’s factor process is a linear SDE that is stationary ergodic, but the inverse problem leads to a (unique) volatility function that cannot be everywhere non-negative. Positivity can be guaranteed for the example in Section 3.5 because the factor process is Brownian motion, and therfore Bochner’s theorem gives general conditions for non-negativity.

Future problems to consider include general results for non-negativity of recovered volatility functions under the OU and the CIR processes, and also to generalize this inverse problem formulation to jump-diffusion models like that of [16]. From a computational standpoint, it would be worth solving the inverse problem not as an exact equality, but instead as a minimization subject to the constraint that $v^{2}\geq 0$ . Then, under a loosening of conditions for consistency, it could be possible that this constrained minimization will produce useful SVMs from a broader class of market models.

Bibliography40

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] M. Avellaneda and A. Papanicolaou. Statistics of VIX futures and applications to trading volatility exchange-traded products. International Journal of Theoretical and Applied Finance , 22(01):1850061, 2019.
2[2] A. Badran. Arbitrage-free models for VIX and equity derivatives . Ph D thesis, University of Sydney, 2014. https://ses.library.usyd.edu.au/handle/2123/13082 .
3[3] D. Bakry, P. Cattiaux, and A. Guillinde. Rate of convergence for ergodic continuous Markov processes: Lyapunov versus Poincaré. Journal of Functional Analysis , 254(3):727–759, 2008.
4[4] D. Bakry, I. Gentil, and M. Ledoux. Analysis and geometry of Markov diffusion operators , volume 348. Springer Science & Business Media, 2013.
5[5] J. Baldeaux and A. Badran. Consistent modelling of vix and equity derivatives using a 3/2 plus jumps model. Applied Mathematical Finance , 21(4):299–312, 2014.
6[6] C. Bayer, J. Gatheral, and M. Karlsmark. Fast Ninomiya-Victoir calibration of the double-mean-reverting model. Quantitative Finance , 13(11):1813–1829, 2013.
7[7] L. Bergomi. Smile dynamics ii. Risk , pages 117–123, October 2005.
8[8] L. Bergomi. Smile dynamics iii. Risk , pages 90–96, October 2008.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Consistent Time-Homogeneous Modeling of SPX and VIX Derivatives111Data Sharing and Data Accessibility: Data sharing is not applicable to this article as no new data were created or analyzed in this study. 222This work was partially supported by NSF grant DMS-1907518.

Abstract

Contents

1 Motivation and Formulation

1.1 Problem Formulation

1.2 Background Literature

1.3 Results and Organization of this Paper

2 Definitions and Main Result

2.1 Consistency

Definition 2.1** (Consistency).**

** Remark 1****.**

Assumption 2.1** (Time-Homogeneous Markovian Market Model Driven by XtX_{t}Xt​).**

Assumption 2.2**.**

** Remark 2**** (Buehler’s Condition).**

** Remark 3**** (Initializing Curve Models with Market Data).**

2.2 Main Result: Markovian Inverse Problem for v2(x)v^{2}(x)v2(x)

2.2.1 General Solvability

Assumption 2.3** (Unique Invariant Measure).**

Assumption 2.4** (Spectral Gap).**

** Theorem 2.1**** (General Solvability of Inverse Problem).**

Proof of Theorem 2.1.

** Remark 4****.**

** Remark 5****.**

** Remark 6****.**

** Remark 7****.**

** Remark 8**** (Symmetric Operators).**

2.2.2 Solution via Eigenseries Expansion

** Remark 9****.**

3 Application to Tractable Models

3.1 The Scalar Bergomi Model

3.2 The Multi-Factor Bergomi Model

3.3 The 3/23/23/2 Model

3.4 The Double Nelson Model

3.5 Non-Negative Solutions for Brownian Motion Factor

4 Non-Markovian Market Models

4.1 Scalar Consistency with Constant νθ(t)\nu_{\theta}(t)νθ​(t)

4.2 An Inconsistent Example

5 Summary and Conclusion

Definition 2.1 (Consistency).

Remark 1.

Assumption 2.1 (Time-Homogeneous Markovian Market Model Driven by $X_{t}$ ).

Assumption 2.2.

Remark 2 (Buehler’s Condition).

Remark 3 (Initializing Curve Models with Market Data).

2.2 Main Result: Markovian Inverse Problem for $v^{2}(x)$

Assumption 2.3 (Unique Invariant Measure).

Assumption 2.4 (Spectral Gap).

Theorem 2.1 (General Solvability of Inverse Problem).

Remark 4.

Remark 5.

Remark 6.

Remark 7.

Remark 8 (Symmetric Operators).

Remark 9.

3.3 The $3/2$ Model

4.1 Scalar Consistency with Constant $\nu_{\theta}(t)$