The Heston stochastic volatility model in Hilbert space
Fred Espen Benth, Iben Cathrine Simonsen

TL;DR
This paper generalizes the Heston stochastic volatility model to a Hilbert space setting, enabling advanced modeling of energy market forward curves with tensor processes and operator analysis.
Contribution
It introduces a tensor Heston model in Hilbert space, extending classical models with new tensor and operator techniques for energy market applications.
Findings
Defined a Hilbert-valued Ornstein-Uhlenbeck process with stochastic volatility.
Derived characteristic functional and covariance operator for the process.
Analyzed the model's dynamics and its projection to real line for comparison.
Abstract
We extend the Heston stochastic volatility model to a Hilbert space framework. The tensor Heston stochastic variance process is defined as a tensor product of a Hilbert-valued Ornstein-Uhlenbeck process with itself. The volatility process is then defined by a Cholesky decomposition of the variance process. We define a Hilbert-valued Ornstein-Uhlenbeck process with Wiener noise perturbed by this stochastic volatility, and compute the characteristic functional and covariance operator of this process. This process is then applied to the modelling of forward curves in energy markets. Finally, we compute the dynamics of the tensor Heston volatility model when the generator is bounded, and study its projection down to the real line for comparison with the classical Heston dynamics.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
The Heston stochastic volatility model in Hilbert space
Fred Espen Benth and Iben Cathrine Simonsen
Fred Espen Benth and Iben Cathrine Simonsen
Department of Mathematics
University of Oslo
P.O. Box 1053, Blindern
N–0316 Oslo, Norway
Abstract.
We extend the Heston stochastic volatility model to a Hilbert space framework. The tensor Heston stochastic variance process is defined as a tensor product of a Hilbert-valued Ornstein-Uhlenbeck process with itself. The volatility process is then defined by a Cholesky decomposition of the variance process. We define a Hilbert-valued Ornstein-Uhlenbeck process with Wiener noise perturbed by this stochastic volatility, and compute the characteristic functional and covariance operator of this process. This process is then applied to the modelling of forward curves in energy markets. Finally, we compute the dynamics of the tensor Heston volatility model when the generator is bounded, and study its projection down to the real line for comparison with the classical Heston dynamics.
Key words and phrases:
Heston stochastic volatility, Infinite dimensional Ornstein-Uhlenbeck processes, forward prices, commodity markets
F. E. Benth acknowledges financial support from the project FINEWSTOCH, funded by the Norwegian Research Council.
1. Introduction
Ornstein-Uhlenbeck processes in Hilbert space has received some attention in the literature in recent years (see Applebaum [1]), one reason being that it is a basic process for the dynamics of commodity forward prices (see Benth and Krühner [3]). In the modelling of financial prices, the stochastic volatility dynamics plays an important role, and in this paper we propose an infinite dimensional version of the classical Heston model (see Heston [12]).
On a separable Hilbert space , an Ornstein-Uhlenbeck process takes the form
[TABLE]
where is some densely defined linear operator and is an -valued Wiener process. Usually, is some non-random bounded linear operator on , being a scaling of the noise which is referred to as the volatility. We propose to model as a time-dependent stochastic process with values in the space of bounded linear operators. More specifically, we consider a stochastic variance process being defined as the tensor product of another Ornstein-Uhlenbeck process with itself, which will become a positive definite stochastic process in the space of Hilbert-Schmidt operators on . We use its square root process as a volatility process in the dynamics of . Our construction is an extension of the classical Heston stochastic volatility model.
If is some suitable space of real-valued functions on , the non-negative real numbers, and , one can view as the risk-neutral forward price at time for some contract delivering a given commodity at time . Such forward price models (with generalisations) have been extensively analysed in Benth and Krühner [3], being stochastic models in the so-called Heath-Jarrow-Morton framework (see Heath, Jarrow and Morton [11]) with the Musiela parametrisation. The analysis relates closely to a long stream of literature on forward rate modelling in fixed-income markets (see Filipovic [9] and references therein). However, stochastic volatility models from the infinite dimensional perspective have not, to the best of our knowledge, been studied to any significant extent. An exception is the paper by Benth, Rüdiger and Süss [4], who propose and analyse an infinite dimensional generalisation of the Barndorff-Nielsen and Shephard stochastic volatility model (see Barndorff-Nielsen and Shephard [2]).
As indicated, we define , where is an -valued Gaussian Ornstein-Uhlenbeck process. We prove several properties of the tensor Heston variance process , and show that the square-root process is explicitly available. Moreover, we present a family of Cholesky-type decompositions of , which will be our choice as stochastic volatility in the dynamics of . We study probabilistic properties of both and , and specialize to the situation of a commodity forward market where we provide expressions for the implied covariance structure between forward prices with different times to maturity. In the situation when the Ornstein-Uhlenbeck process is governed by a bounded generator, we can present a stochastic dynamics of which can be related to the Heston model in the finite dimensional case. In particular, our model is an alternative to the Wishart process of Bru [5].
1.1. Notation
We let be a filtered probability space and be a separable Hilbert space with inner product and associated norm . Furthermore, we let denote the space of bounded linear operators from into itself, which is a Banach space with the operator norm denoted . The adjoint of an operator is denoted . Furthermore, denotes the space of Hilbert-Schmidt operators in . is also a separable Hilbert space, and we denote its inner product by and the associated norm by .
2. The tensor Heston stochastic variance process
Let be an -Wiener process in with covariance operator , where is a symmetric and positive definite trace class operator. Define the Ornstein-Uhlenbeck process in by
[TABLE]
where is a densely defined operator on generating a -semigroup , and . From Peszat and Zabczyk [14, Sect. 9.4], the unique mild solution of (1) is given by
[TABLE]
for . The next lemma gives the characteristic functional of .
Lemma 1**.**
For we have
[TABLE]
where the integral on the right-hand side is the Bochner integral on .
Proof.
From the mild solution of in (2), we find
[TABLE]
Hence, from the Gaussianity and independent increment property of the Wiener process,
[TABLE]
As is a -semigroup, its operator norm satisfies an exponential growth bound in time by the Hille-Yoshida Theorem (see Engel and Nagel [7, Prop. I.5.5]). Hence, the Bochner integral is well-defined, and the result follows. ∎
From the lemma above we conclude that is an -valued Gaussian process with mean and covariance operator
[TABLE]
Following Applebaum [1], admits an invariant Gaussian distribution with zero mean if the -semigroup is exponentially stable. The covariance operator for the invariant mean zero Gaussian distribution of then becomes
[TABLE]
We define the tensor Heston stochastic variance process by
[TABLE]
where we recall the tensor product to be for . By the Cauchy-Schwartz inequality, it follows straightforwardly that . Hence, the tensor Heston stochastic variance process defines an -adapted stochastic process in . The next proposition shows that defines a family of symmetric, positive definite Hilbert-Schmidt operators.
Proposition 2**.**
It holds that for all . Furthermore, is a symmetric and positive definite operator.
Proof.
Let be an orthonormal basis (ONB) of . By Parseval’s identity applied twice,
[TABLE]
Since for every , the first conclusion of the proposition follows.
We find for that
[TABLE]
Moreover, with ,
[TABLE]
This proves the second part. ∎
The proposition shows that for all . The Gaussianity of the process implies that the real-valued stochastic process has finite exponential moments up to a certain order:
Lemma 3**.**
It holds that
[TABLE]
for and .
Proof.
From Prop. 2, , and then by the triangle inequality
[TABLE]
From the mild solution of in (2),
[TABLE]
which is a centered Gaussian random variable. Hence, Fernique’s Theorem (see Fernique [8] or Thm. 3.31 in Peszat and Zabczyk [14]) implies that and
[TABLE]
for . ∎
From this lemma we can conclude that all moments of the real-valued random variable are finite, as for arbitrary small .
If , then we see that
[TABLE]
Recalling Lemma 1, is normally distributed with mean and variance . Moreover,
[TABLE]
A straightforward (but tedious) calculation reveals that the characteristic functional of evaluated at becomes,
[TABLE]
where we have assumed for simplicity. This characteristic functional is related to a noncentral -distribution with one degree of freedom. We recall that the variance process in the classical Heston model has a noncentral -distribution (see Heston [12]).
Since is symmetric and positive definite, we can define its square root , which turns out to have an explicit expression.
Proposition 4**.**
The square root process of is given by
[TABLE]
Proof.
If , it follows that , and thus also . Assume . Let . Define , which is symmetric and positive definite by Prop. 2. Then,
[TABLE]
But,
[TABLE]
Hence, , and the result follows. ∎
Consider for a moment the operator defined as for and .
Lemma 5**.**
The operator is locally Lipschitz continuous.
Proof.
It holds for that
[TABLE]
Hence, if in , then in , so is continuous in zero. Next, suppose that are both non-zero. Then, by a simple application of the triangle inequality and its reverse,
[TABLE]
Again, by triangle inequality and the elementary inequality , we find for an ONB in ,
[TABLE]
Therefore,
[TABLE]
which shows locally Lipschitz continuity of . ∎
By a result of Kotelenez [13] (see Peszat and Zabczyk [14, Thm. 9.20]), there exists a continuous version of in (2) if the -semigroup is quasi-contractive, that is, if for some constant , for all . Thus, from Lemma 5, we can conclude that there exists a version of (namely defined by the version of with continuous paths) which has continuous paths in , when is quasi-contractive. We remark that This holds true since by Parseval’s identity
[TABLE]
for an ONB of . By Lemma 1, is a Gaussian random variable for all , and . If , then we must have for all . But this happens with probability zero, and it follows that .
We move our attention to a Cholesky-type of decomposition of the tensor Heston stochastic variance process . To this end, introduce an -adapted -valued stochastic process for which , i.e., a process living on the unit ball of . Define the operator for by
[TABLE]
The following lemma collects the elementary properties of this operator-valued process.
Lemma 6**.**
It holds that is an -adapted stochastic process with values in .
Proof.
By definition becomes a linear operator, where boundedness follows readily from the Cauchy-Schwartz inequality. For an ONB in , we have from Parseval’s identity
[TABLE]
The -measurability follows by assumption on and definition of . The proof is complete. ∎
We notice that with the convention , we can define and recover for all such that . We show that for general unitary processes, defines a Cholesky decomposition of the tensor Heston stochastic variance process:
Proposition 7**.**
The tensor Heston stochastic variance process can be decomposed as
[TABLE]
for all , where .
Proof.
Since, for any and ,
[TABLE]
we have that . It follows that for any ,
[TABLE]
The result follows. ∎
A simple choice of an -valued stochastic process is , where with .
3. Ornstein-Uhlenbeck process with stochastic volatility
Define the -valued Ornstein-Uhlenbeck process by
[TABLE]
where is a densely defined operator on generating a -semigroup , and is a Wiener process in with covariance operator (i.e., is a symmetric and positive definite trace class operator). We assume that is independent of and recall from (4).
The next lemma validates the existence of the stochastic integral in (5):
Lemma 8**.**
It holds that
[TABLE]
Proof.
Let be an ONB in . By Parseval’s identity, we have
[TABLE]
As is a symmetric, positive definite trace class operator, we can find an ONB of eigenvectors in with corresponding positive eigenvalues of , such that for all and therefore . We have by Parseval’s identity and Cauchy-Schwartz’ inequality,
[TABLE]
since by assumption . Next we show that . From the expression in (2), it follows from an elementary inequality that
[TABLE]
where the last equality is a consequence of the Itô isometry. The Hille-Yoshida Theorem (see Engel and Nagel [7, Prop. I.5.5]) implies that for constants and . Thus,
[TABLE]
Finally, we observe that , and hence the lemma follows. ∎
The integral is well-defined, and therefore according to Peszat and Zabczyk [14, Sect. 9.4] (5) has a unique mild solution given by
[TABLE]
We remark in passing that the stochastic integral is well-defined in (6) since with an operator norm which is growing at most exponentially by the Hille-Yoshida Theorem (see Engel and Nagel [7, Prop. I.5.5]).
We analyse the characteristic functional of . To this end, denote by the filtration generated by .
Proposition 9**.**
Assume that the process in defined in (4) is -adapted. It holds for any
[TABLE]
*where the -integral on the right-hand side is a Bochner integral in . *
Proof.
With we get from (6),
[TABLE]
Recall that and are independent. Since is assumed -adapted, we will have that and , and therefore , are independent of . By the tower property of conditional expectation and the Gaussianity of conditional on ,
[TABLE]
Recalling from Prop. 7, we have . Hence,
[TABLE]
and
[TABLE]
The proof is complete. ∎
From the proposition we see that is a Gaussian process conditional on , with mean . The covariance operator of , defined by the relationship
[TABLE]
for , can be computed as follows: since , and for a fixed , the process is an -martingale, it follows from Peszat and Zabczyk [14, Thm. 8.7 (iv)],
[TABLE]
for . Now, let , and we find
[TABLE]
Note that
[TABLE]
for any . Thus we recover the covariance functional that we can read off from Prop. 9;
[TABLE]
By Peszat and Zabczyk [14, Thm. 8.7 (iv)] and the zero expectation of the stochastic integral with respect to ,
[TABLE]
for . Hence, in the particular case of , with , we find that the covariance becomes
[TABLE]
We next apply our Ornstein-Uhlenbeck process with tensor Heston stochastic volatility to the modelling of forward prices of commodity markets. For this purpose, we let be the Filipovic space , which was introduced by Filipovic in [9]. For a measurable and increasing function with and , the Filipovic space is defined as the space of absolutely continuous functions such that
[TABLE]
Here, denotes the weak derivative of . The space is a separable Hilbert space with inner product and associated norm .
We let be defined as in (5), with being the derivative operator . The derivative operator is densely defined on (see Filipovic [9]), with the left-shift operator being its -semigroup. For , the left-shift semigroup acts as . Furthermore, we let be the evaluation functional, i.e. for and , . We have that , that is, the evaluation map is a linear functional on . Letting be given by
[TABLE]
we have that (see Benth and Krühner [3]).
The arbitrage-free price at time of a contract delivering a commodity at a future time , is modelled by (see Benth and Krühner [3]). We adopt the Musiela notation and express the price in terms of time to delivery rather than time of delivery , letting . The next corollary gives the covariance between two contracts with different times to delivery.
Corollary 10**.**
For all , we have
[TABLE]
*where for . *
Proof.
Since , we find
[TABLE]
with given in (9). Since , it follows,
[TABLE]
The claim follows. ∎
The above corollary yields that the entire covariance structure between contracts with different times of maturity is determined by only. We notice that the choice of in the definition of only scales the covariance. Consider the special case of . Using similar arguments to those in the derivation of yield,
[TABLE]
Thus, when , we find the covariance to be
[TABLE]
since . The covariance of and is determined by the parameters of the -process, more specifically, its volatility , the operator (yielding a semigroup ), the initial field and the covariance operator of the Wiener noise driving its dynamics. We also observe that the time integrals sample the parameters of over the intervals and to build up the covariance of the field , not only taking the values at and into account.
4. The case when is bounded
In this section, we analyse the tensor Heston stochastic variance process when in (1) is a bounded operator. Moreover, we make comparison with the classical Heston model on the real line (see Heston [12]) and discuss its extensions.
When is bounded, (1) has a strong solution and we can compute the dynamics of by an infinite dimensional version of Itô’s formula.
Proposition 11**.**
Assume is bounded. Then we have the following representation of ,
[TABLE]
where is the -valued process
[TABLE]
and is the -valued process
[TABLE]
Proof.
When is bounded, the unique strong solution of (1) is given by
[TABLE]
Define the function by and observe that . To use the infinite dimensional Itô formula by Curtain and Falb [6], we need to find the first and second order Frechét derivatives of . Define the functions and by
[TABLE]
and
[TABLE]
A direct calculation reveals that,
[TABLE]
Thus, we find,
[TABLE]
which converges to 0 when . This shows that is the Frechét derivative of , which we denote by . Next, for any ,
[TABLE]
which shows that is the Frechét derivative of , and hence the second order Frechét derivative of , which we denote by . It follows from the infinite dimensional Itô formula in Curtain and Falb [6] that
[TABLE]
where is an ONB of of eigenvectors of with corresponding eigenvalues . Inserting and for respectively and , gives us
[TABLE]
For , we find,
[TABLE]
The proof is complete. ∎
We can formulate the dynamics of as an operator-valued stochastic differential equation.
Proposition 12**.**
Assume that is bounded. Then
[TABLE]
where Id is the identity operator on and is the Cholesky decomposition of defined in (4).
Proof.
For , we see from a direct computation that
[TABLE]
or,
[TABLE]
Next, for any bounded operator , we have from linearity of that
[TABLE]
Thus,
[TABLE]
With and recalling the definition of in Prop. 11, this shows the drift of .
For , it holds that
[TABLE]
whenever , with defined in Prop. 11. The result follows. ∎
Recall from Lemma 6 that . Hence, for any ,
[TABLE]
Informally, we can say that the diffusion term of is given as the sum of the ”square root” of and its adjoint.
Let us consider our tensor Heston stochastic variance process in the particular case of finite dimensions, that is, for . We assume is a -dimensional standard Brownian motion, and the -dimensional stochastic process is defined by the dynamics (1) with . It is straightforward to see that for any , , where means the transpose of . Hence, . Moreover, if ,
[TABLE]
and
[TABLE]
Hence, since , the identity matrix, we find from Prop. 11 that
[TABLE]
This is a different dynamics than the Wishart processes on defined by Bru [5], and proposed as a multifactor extension of the Heston stochastic volatility model in Fonseca, Grasselli and Tebaldi [10]. The drift term in the Wishart process is analogous to the one in (10), while the diffusion term in the Wishart process takes the form
[TABLE]
where is a matrix-valued Brownian motion and is a matrix. Our tensor model in infinite dimensions yields a simplified diffusion in finite dimensions compared to the Wishart process of Bru [5], where one is using a Cholesky-type of representation of the square root of , involving also the ”volatility” of the Ornstein-Uhlenbeck dynamics of . To ensure a positive definite process, Bru [5] introduces strong conditions on and , while our Heston model is positive definite by construction.
Let us now slightly turn the perspective, going back to the general infinite dimensional situation, and study the projection of the -valued process to the real line in the sense of studying the process expanded along a given element .
To this end, for introduce the linear functions by
[TABLE]
We note that for ,
[TABLE]
and, in particular, . We define the real-valued stochastic process as
[TABLE]
for . It is immediate from the definition that is an -adapted process taking values on , the positive real line (including zero).
Proposition 13**.**
Assume that is bounded. Then the dynamics of defined in (12) is
[TABLE]
where is a real-valued Wiener process.
Proof.
From Props. 11 and 12, we have
[TABLE]
First,
[TABLE]
Next,
[TABLE]
This proves the drift term of .
We finally consider the projection of the stochastic integral. From Thm. 2.1 in Benth and Krühner [3],
[TABLE]
where is a real-valued Wiener process and with being the -valued stochastic process defined by . Since
[TABLE]
we have . Observe in passing, recalling Lemma 8, that is an -adapted stochastic process such that for any , and thus -integrable. The integrand is therefore given by
[TABLE]
Thus, and the proof is complete. ∎
We see that the process shares some similarities with a classical real-valued Heston volatility model (see Heston [12]). has a square-root diffusion term, and a linear drift term. However, there are also some additional drift terms which are not expressible in .
If is an eigenvector of with an eigenvalue , we find that and , and hence by Prop. 13,
[TABLE]
which corresponds to a classical Heston stochastic variance process.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] D. Applebaum (2015). Infinite dimensional Ornstein-Uhlenbeck processes driven by Lévy processes. Probab. Surveys , 12 , pp. 33–54.
- 2[2] O. E. Barndorff-Nielsen and N. Shephard (2001). Non-Gaussian Ornstein-Uhlenbeck-based models and some of their uses in financial economics. J. Royal Stat. Soc.: Series B (Stat. Method.) , 63 , pp. 167–241.
- 3[3] F. E. Benth and P. Krühner (2014). Representation of infinite-dimensional forward price models in commodity markets. Comm. Math. Statist. , 2 (1), pp. 47–106.
- 4[4] F. E. Benth, B. Rüdiger and A. Süss (2015). Ornstein-Uhlenbeck processes in Hilbert space with non-Gaussian stochastic volatility. To appear in Stoch. Proc. Applic. Available on ar Xiv:1506.07245
- 5[5] M. F. Bru (1991). Wishart Processes. J. Theor. Probab. , 4 , pp. 725–743.
- 6[6] R. F. Curtain and P. L. Falb (1970). Itô’s Lemma in infinite dimensions. J. Math. Analysis Appl. , 31 , pp. 434–448.
- 7[7] K.-J. Engel and R. Nagel (2000). One-Parameter Semigroups for Linear Evolution Equations . Springer Verlag, New York.
- 8[8] X. Fernique (1975). Regularite des trajectoires des functions aleatoires gausiennes. In Ecole d’Ete de Probabilites de Saint-Flour, IV-1974. , Vol. 480, Lecture Notes in Mathematics, Springer-Verlag, Berlin Heidelberg, pp. 1–96.
