Consistent Time-Homogeneous Modeling of SPX and VIX Derivatives
Andrew Papanicolaou

TL;DR
This paper develops a method to recover a consistent stochastic volatility model for SPX and VIX derivatives from market data, ensuring the models are aligned and non-negative, with analysis of solution uniqueness.
Contribution
It introduces a novel inverse problem approach to derive a stochastic volatility function from market models of VIX futures, ensuring consistency with SPX derivatives.
Findings
Method for recovering stochastic volatility functions from market models.
Conditions for uniqueness of the inverse solution.
Illustrations of potential negativity and inconsistency issues.
Abstract
This paper shows how to recover a stochastic volatility model (SVM) from a market model of the VIX futures term structure. Market models have more flexibility for fitting of curves than do SVMs, and therefore are better suited for pricing VIX futures and VIX derivatives. But the VIX itself is a derivative of the S&P500 (SPX) and it is common practice to price SPX derivatives using an SVM. Therefore, consistent modeling for both SPX and VIX should involve an SVM that can be obtained by inverting the market model. This paper's main result is a method for the recovery of a stochastic volatility function by solving an inverse problem where the input is the VIX function given by a market model. Analysis will show conditions necessary for there to be a unique solution to this inverse problem. The models are consistent if the recovered volatility function is non-negative. Examples are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic processes and financial applications · Complex Systems and Time Series Analysis · Financial Risk and Volatility Modeling
Consistent Time-Homogeneous Modeling of SPX and VIX Derivatives111Data Sharing and Data Accessibility: Data sharing is not applicable to this article as no new data were created or analyzed in this study. 222This work was partially supported by NSF grant DMS-1907518.
A. Papanicolaou Department of Mathematics, North Carolina State University, 2311 Stinson Drive, Raleigh, NC 27695. [email protected]
Abstract
This paper shows how to recover a stochastic volatility model (SVM) from a market model of the VIX futures term structure. Market models have more flexibility for fitting of curves than do SVMs, and therefore are better suited for pricing VIX futures and VIX derivatives. But the VIX itself is a derivative of the S&P500 (SPX) and it is common practice to price SPX derivatives using an SVM. Therefore, consistent modeling for both SPX and VIX should involve an SVM that can be obtained by inverting the market model. This paper’s main result is a method for the recovery of a stochastic volatility function by solving an inverse problem where the input is the VIX function given by a market model. Analysis will show conditions necessary for there to be a unique solution to this inverse problem. The models are consistent if the recovered volatility function is non-negative. Examples are presented to illustrate the theory, to highlight the issue of negativity in solutions, and to show the potential for inconsistency in non-Markov settings.
Keywords: stochastic volatility, market models, VIX futures, consistent pricing.
AMS Subject Codes: 91B24, 91B70, 45Q05
Contents
1 Motivation and Formulation
Volatility trading has increased in the 21st century with the introduction of derivatives on the VIX index. Two such derivatives are VIX futures, which began trading on the CBOE in 2004, and VIX (European) options, which began trading on the CBOE in 2006. There are also exchange traded notes (ETNs) written on these futures, and options written on these ETNs. Pricing of VIX derivatives uses so-called market models, which are stochastic models for the futures term structure. One thing to keep in mind when using a market model is that the published VIX index is computed from European S&P500 (SPX) options, which means that the VIX is really an SPX derivative. Therefore, VIX prices may be partially determinable if there are established prices for SPX options. Moreover, pricing of SPX derivatives uses stochastic volatility models (SVMs) rather than market models. Hence, there is potential for conflicting prices if market models and SVMs are being used simultaneously to price VIX and SPX derivatives, respectively.
To understand why such a conflict would be a problem, consider a situation where a single financial institution has two separate trading floors: one for SPX derivatives and another for VIX derivatives. Each floor has its own traders who quote prices from their own respective models. If these models have substantially different assessments on the outcome of 30-day variance, then there could be inter-desk arbitrage, i.e., mispricings that allow for a third party (external to the institution) to take SPX derivative prices offered by the SPX desk and arbitrage them against VIX derivative prices offered by the VIX desk. A solution to this problem should provide a criterion for consistency of the models, and in a practical setting should provide a method for specification of one model in terms of the other. This paper presents such a solution.
1.1 Problem Formulation
Let denote the scalar price process for the SPX, and let denote a -dimensional factor process with a positive integer. Consider a model where SPX returns are given by a risk-neutral SVM,
[TABLE]
where is the risk-free rate, is a scalar-valued volatility function, is a -dimensional drift, is a diffusion matrix, is a -dimensional risk-neutral vector-valued Standard Brownian motion, and is a risk-neutral scalar Brownian motion, with correlations between and denoted with ,
[TABLE]
Denote by the filtration generated by and . The VIX is the square root of the risk-neutral expected realized 30-day variance, which in the continuous diffusion model of (1) and (2) is
[TABLE]
where 30 days, and with a VIX future being given by
[TABLE]
For this SVM it is clear that the asset price , the VIX, and all VIX futures are -adapted Markov processes.
Separate from SVMs are market models that are designed to describe directly the VIX and VIX futures. Let be a market model’s price for a VIX future with maturity at time . These prices come from the following system of SDEs,
[TABLE]
where is the same Brownian motion from equation (2), and where the volatility is an -adapted -dimensional row vector function that is specific for a given . The model is applied simultaneously for multiple or a continuum of ’s, thereby forming an entire curve of VIX futures. It is important to keep in mind that equation (4) is generally a time inhomogeneous and non-Markovian model, but results in this paper apply to time-homogenous Markovian market models that are also driven by the factor process of equation (2). Section 2.1 will set forth assumptions for time homogeneity and Markovianity along with some explanation, but further discussion will come in Section 4 where it will be shown how there is essentially a contradiction when trying to specify a consistent Markovian SVM with a non-Markovian market model. Market models considered in this paper include: a Bergomi-type market model333This paper throughout will refer to the “Bergomi model” when perhaps it should say “a Bergomi-type model for VIX futures”, but this would be unwieldy so instead the name is used in a general sense. The original Bergomi model is for the curve of future instantaneous variances, not VIX directly. where with and being positive-definite matrices and a volatility vector; a 3/2 market model where all futures are an expectation of a VIX given by with being a Cox-Ingersol-Ross (CIR) process; a double Nelson (or double mean reverting model) similar to the model in [6]; a non-stationary model where is Brownian motion.
In practice, it is a good idea to use market models because VIX futures are very liquid with a richness of information for understanding the state of volatility. Therefore, it is sensible to first define a market model, and second to build an SVM with the structure of the market model taken into consideration. If the market model is Markovian with the same factors as the SVM, then the instantaneous variance is the solution to an inverse problem. The formulation of this inverse problem is as follows: if the coefficients of the factor process and are known for all , and the market model has provided the function , then the inverse problem for is expressed as
[TABLE]
In Section 2 it is shown that if the process is ergodic and has an invariant law relative to which its infinitesimal generator is symmetric, making reversible, and has also a spectral gap [4], then under some integrability conditions for there exists a unique that is the solution to equation (5). If in addition the solution is non-negative, then the market model has provided a valid volatility function for equation (1), leaving the correlation coefficients as the only remaining parameters to be determined for the SVM. However, non-negativity of the solution is difficult to prove; this issue is explored further in the examples of Section 3. Similar non-negativity issues arise elsewhere in the literature, for example in quantum inverse scattering theory [13, 12] and in Fourier analysis [39]. For the most part, results in the literature proving positivity are less general than theory for existence of solutions.
1.2 Background Literature
Background for SVMs, including the Heston model, can be found in various books and papers, including [18, 23], and more general models involving jumps in [16]. The Bergomi model for future variance is introduced in [7, 8, 9], and a consistency condition for the drift in futures curves for variance is given in [10]. Term structure and the associated Heath-Jarrow-Morton (HJM) framework are discussed in [11, 22, 36, 37]. In [25] a market model with stochastic volatility is used to derive formulae for VIX futures.
In the past decade there has been a lot of research on joint models for SPX and VIX options, which includes some re-evaluation of widely-used SVMs and a search for new models to fit both markets. One such example is the so-called 3/2 model, which is analyzed in [5, 15] and is a popular choice because it is able to reproduce the increasing right-hand implied-volatility skew in VIX options. The search for a model to simultaneously calibrate an SVM to both SPX and VIX options is done in [6, 19] using a two-factor diffusion model; in [14] a numerically efficient model for joint calibration is proposed using affine jump diffusions; in [28] there is exploration of the Heston model with jumps revealing evidence in VIX options that suggests there are jumps in the volatility process; in [26] there is further exploration of the Heston model with jumps and the role played by the Feller condition; in [30] a displacement or volatility push-up is proposed to improve the joint fit of affine models; in [33] a model for joint calibration is proposed using a regime-switching extension of the Heston model [23]; in [17] it is proposed to use a Heston vol-of-vol model. A different approach is taken in [21] wherein a joint distribution from a non-parametric family is fit satisfying both the marginal distributions from VIX and SPX options, and it is shown to be arbitrage free. Non-model-specific analysis of the joint SPX and VIX markets includes [32] and the data analysis of future variance-swap rates in [31]. The problem of consistency between SVMs and market models is also studied in the PhD thesis of Alex Badran [2].
1.3 Results and Organization of this Paper
Section 2.1 introduces Definition 2.1, stating formally the meaning of consistency between an SVM and a market model. Section 2.2 has the main result of this paper, which is a theorem for the solvability of equation (5). Section 3 has examples of tractable models: Sections 3.1 and 3.2 explore the scalar and multivariate Bergomi models, respectively, for which the inverse problem has an explicit eigenfunction expansion; Section 3.3 looks at the market model where is a 3/2 process, which also has an explicit eigenfunction expansion; Section 3.4 looks at the double Nelson market model that is tractable with nice statistical features to fit the data but with an inverse problem that does not have a positive solution for all ; Section 3.5 looks at a non-stationary model with a Brownian motion factor process for which an application of Bochner’s theorem ensures non-negativity. Section 4 has further discussion on the issue of non-Markovian market models.
2 Definitions and Main Result
The value of a future contract with a fixed horizon is referred to as a constant maturity future (CMF), that is, for a constant the CMF price with horizon given by the SVM is,
[TABLE]
and the CMF given by the market model is,
[TABLE]
Unlike regular futures, CMFs are not risk-neutral martingales. Instead, their differential has a non-zero drift,
[TABLE]
where Y_{t}^{\theta}=\frac{\partial}{\partial T}\log(F_{t,T})\Big{|}_{T=t+\theta}. In the stationary case there is the following representation of ,
[TABLE]
from which it is seen that stationarity requires some integrability of and \frac{\partial}{\partial T}\nu(t,T)\Big{|}_{T=t+\theta}. For example, leads to stationary CMFs. The quantity has a financial significance because it is the roll-yield of a trading strategy to track the CMF’s returns (see [1]).
2.1 Consistency
It is first necessary to define this paper’s meaning for consistency:
Definition 2.1** (Consistency).**
Assume there exists a unique strong solution for the SDE appearing in equation (2) of the SVM, and there also exists unique strong solutions for the SDEs for CMFS given by (8). The SVM and the market model have consistent prices if the CMFs agree, that is, if
[TABLE]
where is the SVM’s CMF as defined in (6).
** Remark 1****.**
The essential step in confirming consistency between an SVM and a market model is to prove the statement expressed by equation (10). However, the SVM is such that for all and the market model is such that for all , and so it is sufficient to show
[TABLE]
That is, the SVM and the market model share the same filtration , and both and are martingales by construction, and so all that needs to be checked is that the models have agreement between their respective VIX processes.
If Definition 2.1 does not hold then there is potential for arbitrage. For example in [32] it is shown how prices should respect certain structural bounds, otherwise there are options portfolios that produce arbitrage. From Definition 2.1, the first thing to notice is that solutions to equation (8) are functions of and do not depend separately on . Thus, Markovian SVM future prices need to be equal to those of the market model, and it stands to reason that the market model should also be a Markov process. Therefore, the simplest approach is to assume that both the SVM and the market model are Markovian and both driven by the factor process ; discussion and a counterexample related to this issue will come in Section 4.
The second thing to notice from Definition 2.1 is that time homogeneity in the SVM implies time homogeneity in the market model. The reason being, that the differential of obtained from Itô’s lemma and the SDE in (2) have time-independent coefficients, and therefore the differential of must also have time-independent coefficients.
Assumption 2.1** (Time-Homogeneous Markovian Market Model Driven by ).**
The market model with CMFs given by equation (8) is a Time-Homogeneous Markov model driven by the same factor process as the SVM. In particular, the CMF has roll-yield functions and volatility row vector functions for each horizon , such that the CMF dynamics are,
[TABLE]
where is the factor process given by equation (2). Moreover, there is an initial curve such that for all , and there is an initial value such that a.s.
At this point it is appropriate to formally state sufficient conditions on the SDE coefficients.
Assumption 2.2**.**
Coefficients and are globally Lipschitz continuous so that equation (2) has a strong solution. Coefficients and are bounded so that (11) and (12) have unique strong solutions and the futures of (12) are true martingales.
From Assumptions 2.1 and 2.2 it is assured that Definition 2.1 is meaningful. Assumption 2.1 is necessary for the theory in this paper, but Assumption 2.2 is not always needed. Indeed, there are several important non-Lipschitz examples, such as the Heston SVM, or various other models where is a CIR process (see the 3/2 model of Section 3.3). Assumption 2.2 asserts boundedness of and , but a less restrictive criterion is for to allow a well-defined Riemann integral and for to satisfy the Novikov condition,
[TABLE]
Assumption 2.1 says that the roll-yields from equation (8) are now functions of , namely, for all . Given Assumption 2.1, the market model’s future dynamics can be rewritten as
[TABLE]
Under Assumptions 2.1 and 2.2, Itô’s lemma can be applied to check whether or not a model satisfies the consistency of Definition 2.1 and equation (10). Indeed, denote the operator ,
[TABLE]
which is the infinitesimal generator of the factor process . If has sufficient differentiability, then is set equal to the right-hand side of (11) to obtain the following pair of consistency equations,
[TABLE]
with the initial condition satisfying and .
** Remark 2**** (Buehler’s Condition).**
Equation (14) is Buehler’s condition, which was identified for expected variance in [10].
** Remark 3**** (Initializing Curve Models with Market Data).**
Practical use of market models often involves the insertion of VIX curve data as an initial condition. The advantage to this approach is that the model is able to directly assimilate futures prices observed in the market. If the initial curve is given, then the solution to equation (11) is
[TABLE]
which may not be a time-homogeneous market model driven by if the initial conditions stated in Assumption 2.1 are not satisfied, that is, time inhomogeneity could arise if the initial curve cannot be written as a function of . Within the framework of Assumption 2.1, a time-homogeneous market model can be initialized with curve data if and if there exists with . In practice, the dimension of should be sufficiently high so that contains , i.e., so that there exists in the domain of with .
2.2 Main Result: Markovian Inverse Problem for
Let denote the VIX. Suppose that and the market model satisfy Definition 2.1 and Assumption 2.1. A function should be found for consistent specification of the SVM. If is already known, then finding amounts to solving an inverse problem,
[TABLE]
where a solution is a function that satisfies equation (16). This solution admits a valid SVM if it is non-negative.
2.2.1 General Solvability
The inverse problem can be solved for a general class of factor processes. Let the factor process be a stationary ergodic process with infinitesimal generator given by (13). Let denote ’s invariant measure. Here and in the sequel, expectation with respect to the invariant measure for any (integrable) test function is denoted by
[TABLE]
and all calculations to come will follow the analytical framework of semigroups for diffusions defined in [4]. It will be necessary to assume existence of a unique invariant measure and that the operator has a spectral gap:
Assumption 2.3** (Unique Invariant Measure).**
There is a unique invariant measure such that for any test function .444Conditions for existence of a unique invariant measure are given in [34]. They include boundedness and uniform ellipticity of matrices , and also that for some and .
Assumption 2.4** (Spectral Gap).**
The operator is symmetric, that is, for any test functions and , with a spectrum that is non-positive with a gap at zero. In other words, there is a constant such that,
[TABLE]
for all and for any such that and . Here denotes the contraction semigroup generated by , and given by
[TABLE]
for bounded as well as for square integrable ones.
Clearly and also for all suitable , . Conditions on the symmetric diffusion generator to have a spectral gap are given in [3, 4], with the Ornstein-Uhlenbeck generator being the canonical case that motivates the more general theory555The theory of Pardoux and Veretennikov [34] can also be used for Theorem 2.1.. The examples of Section 3 explore further the scope of the theory.
** Theorem 2.1**** (General Solvability of Inverse Problem).**
Assume is such that and , where is the operator from equation (13). Given Assumptions 2.3 and 2.4, a square-integrable solution to equation (16) exists.
Proof of Theorem 2.1.
By writing the solution as , the inverse problem of equation (16) can be rewritten as
[TABLE]
where the operator is defined by
[TABLE]
Using the invariant measure it is clear the solution is now centered,
[TABLE]
because , and the inverse problem is posed as
[TABLE]
The operator is an averaging operator, and so it stands to reason that is more regular than . The operator is applied to both sides of equation (19), and because by assumption the quantity is well defined, it follows that
[TABLE]
Using the algebraic properties of the semigroup operator (see [4, 38]),
[TABLE]
which can be rearranged to obtain,
[TABLE]
and due to the spectral gap of Assumption 2.4 the solution can be written with a (convergent) geometric series,
[TABLE]
Note the solvability assumption: given the spectral gap there is a solution if and only if , which is the same as equation (22) after applying consistency equations (14) and (15). In addition, it is needed to use the fact that,
[TABLE]
Uniqueness of a square integrable solution with also follows from equation (20): for any two solutions and having it must be that , or . By inverting the operator it is clear that for a.e. .
Multiplying both sides of equation (20) by and taking brackets yields,
[TABLE]
From symmetry of and the spectral gap in equation (17) there is the following estimate,
[TABLE]
which is inserted into the previous equation to obtain,
[TABLE]
Rearranging and applying Cauchy-Schwartz yields the estimate,
[TABLE]
which for is rearranged to obtain an estimate on the norm of the solution,
[TABLE]
The bound (21) shows that the solution is square integrable against the invariant density, given our assumptions about and the spectral gap. ∎
** Remark 4****.**
The implication of Theorem 2.1 is that, if is given by a market model and a unique solution to equation (16) is non-negative for a.e. , and if equation (2) has a strong solution, then there is an SVM that is consistent in the sense of Definition 2.1.
** Remark 5****.**
It may be the case that equation (16) is solvable but does not have a solution that is non-negative for a.e. , even though it is denoted by because that is how the problem is posed. In this case, for the proposed market model there does not exists an SVM that is consistent in the sense of Definition 2.1.
** Remark 6****.**
If a square-integrable solution to equation (16) exists, and if the SVM and market model are consistent (in the sense of Definition 2.1), then from the consistency equations of (14) and (15) there is the following solvability condition for the inverse problem,
[TABLE]
where is the roll yield and the volatility in (11) with .
** Remark 7****.**
The solvability condition in equation (22) is analogous to the Fredholm alternative in finite Euclidean space (see [4, 38]). It is an integral condition that involves the roll yield and volatility of the market model , the VIX , and the invariant measure of the factor process .
** Remark 8**** (Symmetric Operators).**
For given by equation (2), if there is an invariant density , then the operator of equation (13) is symmetric if there are matrices such that can be written in self-adjoint form,
[TABLE]
for any test function . In other words, and need to satisfy
[TABLE]
This shows us that the symmetry of Assumption 2.4 is somewhat restrictive. However, symmetry is not always required to solve the inverse problem, as will be shown in the examples of Section 3.
2.2.2 Solution via Eigenseries Expansion
If the operator has a complete basis of orthogonal eigenfunctions, then so does given in (18), and then the solution to the inverse problem (16) can be found by computing eigencoefficients in a series expansion of . For many such cases there are transition densities for the factor process given , and so equation (16) can be written using a kernel,
[TABLE]
where the kernel is,
[TABLE]
Suppose there are eigenfunctions such that for an index value ,
[TABLE]
where , and suppose there is an invariant density such that any pair is orthogonal,
[TABLE]
Suppose additionally that these eigenfunctions form a complete basis in , i.e,. if then there are coefficients such that,
[TABLE]
If is the solution to the inverse problem then there is eigenseries expansion,
[TABLE]
and via orthogonality the ’s are solved for,
[TABLE]
This provides a (unique) solution to equation (16).
** Remark 9****.**
The eigenfunction expansion presented in this section can be reformulated without the assumption of a transition density; see [29] for spectral theory of general semi-group operators.
3 Application to Tractable Models
This section presents some examples of models that are applicable in practice, i.e., simulation, numerics, data calibration, etc., can be done within a reasonable amount of time. All models considered are Markov with strong SDE solutions, as per Assumptions 2.1 and 2.2. It will be assumed that for all , and then the emphasis will be placed on calculations for finding the solution to the inverse problem of equation (16).
3.1 The Scalar Bergomi Model
For the Bergomi market model has the volatility function,
[TABLE]
where is a scalar constant, and and . Define the factor process to be the Ornstein-Uhlenbeck (OU) process given by
[TABLE]
which has invariant density
[TABLE]
for this model the drift and diffusion are , . Given , the market model’s futures price is
[TABLE]
where and is also a model parameter; it is straightforward to check that this expression for satisfies the market-model equation . For this model, the roll-yields of equation (11) are
[TABLE]
and the volatilities take the form . The consistency equation (15) can be solved to obtain , and it is easily verified that the solvability condition (22) holds.
The inverse problem in (16) is solved by an eigenfunction expansion. The OU process has a complete orthogonal basis of eigenfunctions given by the Hermite polynomials. Hence, the inverse problem is solved with an eigenseries expansion like that of Section 2.2.2.
Consider the process
[TABLE]
where . The generator of this process is
[TABLE]
and the eigenfunctions of satisfy equations,
[TABLE]
where each is a Hermite polynomial,
[TABLE]
i.e.,
[TABLE]
Theses polynomials are orthogonal with respect to ’s invariant measure,
[TABLE]
where
[TABLE]
These eigenfunctions form a complete orthogonal basis in , and are convenient because,
[TABLE]
The transition density for the ’s is the following kernel,
[TABLE]
and when applied to the Hermite polynomials,
[TABLE]
which yields the eigenvalues
[TABLE]
For the scalar Bergomi model driven by the OU process with mean-reversion rate and diffusion parameter , there is the following weak equivalence with ,
[TABLE]
Define the scaled domain variance function,
[TABLE]
and then notice
[TABLE]
If the SVM and market model are consistent, then is given explicitly by the market model,
[TABLE]
Then, in terms of and the scaled eigenfunction , the solution to the inverse problem has the expansion,
[TABLE]
and the inverse problem (16) can be written in terms of the scaled variable and variance function,
[TABLE]
for all . Then using orthogonality the coefficients are,
[TABLE]
This is clearly an expansion convergent in and uniformly on compact sets. Finally, in terms of the solution is,
[TABLE]
This expansion is also convergent in and uniformly on compact sets. Numerical calculations indicate that the solution is positive and therefore there is an acceptable volatility function. It is interesting to note that the market model for the VIX is an exponential function, leading to an exponential OU VIX futures process. However, the consistent SVM in this case does not have an exponential OU volatility function. Numerical calculations show that the instantaneous variance has exponential-like behavior but is not an exact exponential. Figure 1 shows a numerical example of the simulated VIX and the recovered volatility funciton in this scalar OU example.
3.2 The Multi-Factor Bergomi Model
VIX futures from the multidimensional Bergomi model are given by
[TABLE]
where is a matrix with positive eigenvalues, is a constant matrix, is -dimensional uncorrelated Brownian motion, and is a vector. Let be the multidimensional OU process given by
[TABLE]
To ensure stationarity of , it is enough to assume that the eigenvalues of have positive real parts and that is a controllable pair, i.e.,
[TABLE]
Under these assumptions the distribution of the OU process will converge to a stationary state. The invariant density of is a -dimensional Gaussian density with mean zero and covariance matrix is
[TABLE]
which is finite and non-singular if the pair is controllable. From the integral formula of (23) it is seen that satisfies the stationary Lyapunov equation,
[TABLE]
Thus, the solution to the market model’s SDE for is
[TABLE]
where . For this multidimensional model, the log-future’s derivative with respect to is
[TABLE]
The volatility function of equation (11) is
[TABLE]
The formula for the VIX is explicit and obtained from (15) (up to the initial value),
[TABLE]
As in the scalar case of Section 3.1, it is easily verified that solvability condition (22) holds here.
When is diagonalizable with linearly independent eigenvectors then the generator has a discrete set of eigenvalues and a complete bi-orthogonal (in general) basis of eigenfunctions given by multivariate Hermite polynomials (see [24, 27, 40]), and therefore the method of Section 2.2.2 applies even though the generator is in general not symmetric in the sense of Assumption 2.4.
As an example, consider the 2-dimensional model from [1, 7], where the factors are
[TABLE]
with for and , , the VIX being
[TABLE]
where it is assumed for simplicity that with . The generator of is
[TABLE]
and the invariant density is
[TABLE]
where
[TABLE]
so that (24) holds. Following [40], the eigenfunctions for the adjoint operator are
[TABLE]
where and are non-negative integers; notice that . These ’s are the solutions to the equations
[TABLE]
where . Then, the eigenfunctions for the operator are multivariate Hermite polynomials, which are
[TABLE]
and satisfy the equation
[TABLE]
each of these ’s is a polynomial of degree equal to . In this case the transition-density kernel is
[TABLE]
where denotes element-wise inequality, and when applied to the multivariate Hermite polynomials, similar to the scalar OU example of Section 3.1, there are eigenvalues
[TABLE]
The set of ’s forms a complete basis in , which satisfy a bi-orthogonality relation relative to a second basis. Define this second set of basis functions to be
[TABLE]
which are bi-orthogonal in the sense that
[TABLE]
Denoting , the inverse problem is
[TABLE]
for all , which via the bi-orthogonality relation has the solution
[TABLE]
As with the scalar Bergomi, it is not needed to check for solvability, existence or uniqueness because the solution has eigencoefficients that are explicit. Figures 2 and 3 show the simulation of this 2-factor Bergomi model along with the recovered , which is appears to be positive, and Figure 4 looks at the difference to gain a sense of the differing factor sensitivities in and VIX function . The approximated uses all multivariate Hermite polynomials up to and including powers of 6, where . Using only 6-degree polynomials is sufficiently accurate, as the average error in approximating is of order , i.e., where denotes a discrete evaluation point in and denotes the total number of discrete points evaluated. Notice that (to see why multiply both sides of (16) by and integrate). From this surface plot it can be seen that rises in the persistent factor have more effect on VIX than on when the fast-mean-reverting factor is low (i.e., when ); this is seen in the corner of the surface plot where is most negative. This is an interesting caveat of the solution to the inverse problem, as it says that the VIX can be more persistent than instantaneous volatility, but this should not be too much of a surprise because VIX is the square-root of the expectation of a moving average of square instantaneous volatility.
3.3 The Model
Consider a market model constructed upon the squared VIX being a 3/2 process,
[TABLE]
where is a CIR process,
[TABLE]
with .666The model proposed in this section is similar to that used in [20], wherein , which could be done here as well but will require to have integrability of the series expansion. Applying Itô’s lemma yields
[TABLE]
from which the 3/2 power in the diffusion is seen, thus giving the process its name. Note that this 3/2 model is based on Assumption 2.2 because ’s SDE has non-Lipschitz coefficients. However, does have strong solutions, and the futures are for all , which are martingales by construction. Therefore, Assumption 2.2 is not needed and Definition 2.1 for consistency applies to this model.
Consider first the normalized CIR process, which has a complete orthogonal basis of eigenfunctions for its generator, given by the generalized Laguerre polynomials. Hence, the inverse problem is again solved with an eigenseries expansion and the method of Section 2.2.2 applies. Consider the normalized CIR process,
[TABLE]
where . The generator of this process is
[TABLE]
and the eigenfunctions of satisfy equations
[TABLE]
where each is a generalized Laguerre polynomial,
[TABLE]
that is,
[TABLE]
These polynomials are orthogonal with respect to ’s invariant measure,
[TABLE]
where
[TABLE]
and
[TABLE]
with the Gamma function evaluated at . These eigenfunctions form a complete orthogonal basis in , and are convenient because
[TABLE]
For the CIR process defined above, there is the following weak equivalence with a scaled ,
[TABLE]
with . Define also the scaled domain variance or volatility function,
[TABLE]
and then notice
[TABLE]
Therefore it is useful to define the kernel for the ’s, , and when applied to the Laguerre polynomials, similar to the scalar OU example,
[TABLE]
there are the eigenvalues,
[TABLE]
Hence, if the SVM and market model are consistent, then is given explicitly by the market model,
[TABLE]
which is in if . Then, in terms of and the scaled function , the solution to the inverse problem has the expansion,
[TABLE]
and therefore
[TABLE]
for all . Using orthogonality, the coefficients are
[TABLE]
For large there is the behavior , which requires for square integrability of the expansion of . Finally, in terms of , the solution is
[TABLE]
Figure 5 shows a simulation of the 3/2 process and the two approximations of the recovered function using 25 and 30 Laguerre polynomials. In the figure, the simulation is run for 10 years with time step , and produces empirical statistics and (in the summation ). The figure shows a recovered from which it is clear that, compared to the VIX, instantaneous volatility is more affected by low values of ; i.e., stochastic volatility is more sensitive to the left-hand tail distribution of . Note also from the figure that the numerical solution is positive.
3.4 The Double Nelson Model
Consider the 2-dimensional mean reverting process with dynamics,
[TABLE]
where , , , and . This is the double Nelson model, which is the continuous-time limit of a double GARCH model. Defining the VIX to be
[TABLE]
the futures curve is
[TABLE]
This is a market model for which the inverse problem will look to find from an SVM driven by the same factors and .
This model’s infinitesimal generator is not symmetric and so the general theory of Theorem 2.1 does not apply directly. However, the factor process satisfies a linear system of stochastic differential equations for which there are closed equations for moments of all orders, and so the solvability condition given by (22) from Section 2.2.1 can be applied.
The zero-maturity roll yield is
[TABLE]
the volatility is , and so the solvability condition of equation (22) is
[TABLE]
Invariant moments can be calculated using Itô’s lemma and then taking expectations,
[TABLE]
Hence, provided that and to ensure that has finite (invariant) second moments, and that is finite,
[TABLE]
it follows that equation (26) holds.
The inverse problem is
[TABLE]
with , and is solved explicitly by looking for the solution in the form,
[TABLE]
The coefficients and for are obtained by solving explicitly for the moments which satisfy a linear system of ordinary differential equations obtained by Ito’s formula from the stochastic differential equations of the factor process (25),
[TABLE]
Note that the invariant moments obtained above are simply the limit of these moments as , and this requires that the relations between introduced above hold here too. Hence,
[TABLE]
and by adjusting the coefficients the solution is found to be for and , which is a quadratic polynomial in . However, this solution will not be non-negative and therefore it is not acceptable for an SVM.
To see how the solution can go negative, consider the simplified inverse problem,
[TABLE]
which requires only a linear expression for its solution
[TABLE]
Thus
[TABLE]
with
[TABLE]
Inserting these expressions and doing the time averaging it is seen that in order to solve the inverse problem it must be that
[TABLE]
so that the coefficient of on the right is one. Then taking to make the coefficient of equal to zero, this leads to
[TABLE]
After solving for and , the constant equals to the remaining terms. Finally it is seen that is negative for any , and this makes the solution take negative values for near [math] and large. This indicates that there cannot be consistency in the sense of Definition 2.1.
3.5 Non-Negative Solutions for Brownian Motion Factor
Consider another example that does not have the stationarity of Assumptions 2.3 and 2.4, but instead is a market model driven by Brownian motion . This is an example that has a general condition on to ensure non-negativity of the recovered volatility function.
The inverse problem is
[TABLE]
where is standard Brownian motion. The Fourier transform is used to solve this problem. The space to consider is , and the Fourier elements are
[TABLE]
which have generalized orthogonality with the delta function . The market model’s VIX function and the SVM function have Fourier transforms
[TABLE]
The Fourier basis is used to transform the inverse problem,
[TABLE]
Hence, the solution to the problem is
[TABLE]
If is in , then Parseval’s identity says the solution is in ,
[TABLE]
If is continuous and positive definite, that is, if for any and for for any positive integer,
[TABLE]
then Bochner’s theorem applies [35] and is non negative. This is a general criterion for the solution to the inverse problem to be non-negative, but this application of Bochner’s theorem is special to the case of Fourier eigenfunctions.
To further illustrate, consider the specific example
[TABLE]
and with a standard Brownian motion. For inverse problem
[TABLE]
it is easy to check,
[TABLE]
Hence, there is non-negative solution if .
4 Non-Markovian Market Models
Let denote the CMFs derived from the Markovian SVM. Suppose that Assumption 2.1 does not hold so that it’s possible to have non-Markovian dynamics. Then to check for the consistency of Definition 2.1, there are the following pair of equations that are the generalization of (14) and (15),
[TABLE]
where is the roll yield as shown in equation (8). From equations (27) and (28) it should be clear that a Markovian representation of the market model must be imposed. Namely, where equals the left-hand side of equation (27), and where equals the transpose of the left-hand side of equation (28).
4.1 Scalar Consistency with Constant
Consider the case where and in equation (2) are scalar processes. Suppose that is a scalar, constant deterministic function,
[TABLE]
Then solving equations (27) and (28) leads to the following VIX futures and roll yields,
[TABLE]
It is assumed in this equation that is strictly positive and its inverse is integrable.
4.2 An Inconsistent Example
There are non-trivial cases where there is a violation of the scalar consistency formula of equation (29). For example, suppose there is algebraic decay in the market model’s volatility function,
[TABLE]
Then the SDE for the CMF can be computed via Itô’s lemma, which yields the following roll yield,
[TABLE]
There is no function of the Markov process that can equal this process almost surely, as itself is not a Markov process. Hence, formula (29) cannot hold.
5 Summary and Conclusion
The achievement of this paper is the derivation of a consistent SVM for the SPX given a market model for the VIX. The main result is Theorem 2.1, which gives conditions for the unique determination of the volatility function of the SVM from a VIX function given by the market model, provided both models are driven by the same underlying stationary ergodic factor process. The theorem’s conditions involve moments of the VIX function, the uniqueness of the invariant measure of the factor process, and require that the operator semi-group have a spectral gap. At the time of this article there are no known structural conditions that will make the resulting volatility function non-negative, and therefore no theoretical guarantees for consistency can be made. There are special cases where positivity can be guaranteed, such as models where is a Brownian motion. Detailed analysis and numerical calculations for several market models indicate that for the commonly used Bergomi market models (Sections 3.1 and 3.2) the volatility function appears to be positive. For another market model where square VIX is the reciprocal of a CIR process (Section 3.3), the volatility is again shown numerically to be positive. The double Nelson model in Section 3.4 is a counter example, wherein the market model’s factor process is a linear SDE that is stationary ergodic, but the inverse problem leads to a (unique) volatility function that cannot be everywhere non-negative. Positivity can be guaranteed for the example in Section 3.5 because the factor process is Brownian motion, and therfore Bochner’s theorem gives general conditions for non-negativity.
Future problems to consider include general results for non-negativity of recovered volatility functions under the OU and the CIR processes, and also to generalize this inverse problem formulation to jump-diffusion models like that of [16]. From a computational standpoint, it would be worth solving the inverse problem not as an exact equality, but instead as a minimization subject to the constraint that . Then, under a loosening of conditions for consistency, it could be possible that this constrained minimization will produce useful SVMs from a broader class of market models.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] M. Avellaneda and A. Papanicolaou. Statistics of VIX futures and applications to trading volatility exchange-traded products. International Journal of Theoretical and Applied Finance , 22(01):1850061, 2019.
- 2[2] A. Badran. Arbitrage-free models for VIX and equity derivatives . Ph D thesis, University of Sydney, 2014. https://ses.library.usyd.edu.au/handle/2123/13082 .
- 3[3] D. Bakry, P. Cattiaux, and A. Guillinde. Rate of convergence for ergodic continuous Markov processes: Lyapunov versus Poincaré. Journal of Functional Analysis , 254(3):727–759, 2008.
- 4[4] D. Bakry, I. Gentil, and M. Ledoux. Analysis and geometry of Markov diffusion operators , volume 348. Springer Science & Business Media, 2013.
- 5[5] J. Baldeaux and A. Badran. Consistent modelling of vix and equity derivatives using a 3/2 plus jumps model. Applied Mathematical Finance , 21(4):299–312, 2014.
- 6[6] C. Bayer, J. Gatheral, and M. Karlsmark. Fast Ninomiya-Victoir calibration of the double-mean-reverting model. Quantitative Finance , 13(11):1813–1829, 2013.
- 7[7] L. Bergomi. Smile dynamics ii. Risk , pages 117–123, October 2005.
- 8[8] L. Bergomi. Smile dynamics iii. Risk , pages 90–96, October 2008.
