Stochastic equation and exponential ergodicity in Wasserstein distances for affine processes
Martin Friesen, Peng Jin, Barbara R\"udiger

TL;DR
This paper investigates the long-term behavior of affine processes, proving exponential ergodicity in Wasserstein distances under certain moment conditions, and establishes their representation as solutions to stochastic equations.
Contribution
It demonstrates that affine processes can be uniquely represented as solutions to stochastic equations and proves their exponential ergodicity in Wasserstein distances.
Findings
Affine processes are solutions to stochastic equations driven by Brownian motions and Poisson measures.
Subcritical affine processes exhibit exponential ergodicity in Wasserstein distances.
Moment conditions are crucial for establishing ergodicity.
Abstract
This work is devoted to the study of conservative affine processes on the canonical state space R_+^m \times \R^nm + n > 0$. We show that each affine process can be obtained as the pathwise unique strong solution to a stochastic equation driven by Brownian motions and Poisson random measures. Then we study the long-time behavior of affine processes, i.e., we show that under first moment condition on the state-dependent and log-moment conditions on the state-independent jump measures, respectively, each subcritical affine process is exponentially ergodic in a suitably chosen Wasserstein distance. Moments of affine processes are studied as well.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Stochastic equation and exponential ergodicity in Wasserstein distances for affine processes
Martin Friesen111Fakultät für Mathematik und Naturwissenschaften, Bergische Universität Wuppertal, Gaußstraße 20, 42119 Wuppertal, Germany, [email protected]
Peng Jin222Department of Mathematics, Shantou University, Shantou, Guangdong 515063, China, [email protected]
Barbara Rüdiger333Fakultät für Mathematik und Naturwissenschaften, Bergische Universität Wuppertal, Gaußstraße 20, 42119 Wuppertal, Germany, [email protected]
Abstract: This work is devoted to the study of conservative affine processes on the canonical state space , where . We show that each affine process can be obtained as the pathwise unique strong solution to a stochastic equation driven by Brownian motions and Poisson random measures. Then we study the long-time behavior of affine processes, i.e., we show that under first moment condition on the state-dependent and -moment conditions on the state-independent jump measures, respectively, each subcritical affine process is exponentially ergodic in a suitably chosen Wasserstein distance. Moments of affine processes are studied as well.
AMS Subject Classification: 37A25; 60H10; 60J25
Keywords: affine process; ergodicity; Wasserstein distance; coupling; stochastic differential equation
1 Introduction and statement of the result
1.1 General introduction
An affine process is a time-homogeneous Markov processes whose characteristic function satisfies
[TABLE]
where is the time and the starting point of the process. The general theory of affine processes, including a full characterization on the canonical state space where and , was discussed in [14]. In particular, it is shown that the functions and should satisfy certain generalized Riccati equations. Common applications of affine processes in mathematical finance are interest rate models (e.g., the Cox-Ingersoll-Ross, Vašiček or general affine term structure short rate models), option pricing (e.g., the Heston model) and credit risk models, see also [1] and the references therein. After [14], the mathematical theory of affine processes was developed in various directions. Regularity of affine processes was studied in [37] and [38]. Based on a Hörmander-type condition, existence and smoothness of transition densities were obtained in [21]. Exponential moments for affine processes were studied in [30] and [35]. The theory of affine diffusions, i.e., processes without jumps, was developed in [20], while its application to large deviations for affine diffusions was studied in [32]. The possibility to obtain affine processes as multi-parameter time changes of Lévy processes was recently discussed in [12]. It is worthwhile to mention that the above list is, by far, not complete. For further references and additional details on the general theory of affine processes we refer to the book [1].
Below we describe two important sub-classes of affine processes. Continuous-state branching processes with immigration (shorted as CBI processes) are affine processes with state space . Such processes have been first introduced in 1958 by Jiřina [26] and then studied in [52, 40, 48], where it was also shown that these processes arise as scaling limits of Galton-Watson processes. Various properties of one-dimensional CBI processes were studied in [25, 17, 11, 34, 22, 13] and [10]. For results applicable in arbitrary dimension we refer to [5], [7] and [19]. Let us mention that CBI processes are also measure-valued Markov processes as studied in [41]. Another important class of affine processes corresponds to the state space and is consisted of processes of Ornstein-Uhlenbeck (OU) type. These processes include also Lévy processes as a particular case.
1.2 Affine processes
Let us describe affine processes in more detail. For let , and suppose that . In this work we study affine processes on the canonical state space . Let
[TABLE]
If , then let and . Denote by the space of -matrices. For we write
[TABLE]
where , , , and . Denote by the space of symmetric and positive semidefinite -matrices. Finally, let , , stand for the Kronecker-Delta.
Definition 1.1**.**
We call a tuple admissible parameters, if they satisfy the following conditions:
- (i)
* with , and .* 2. (ii)
* with and if or .* 3. (iii)
. 4. (iv)
* is such that for all and , and .* 5. (v)
* is a Borel measure on such that and*
[TABLE] 6. (vi)
* where are Borel measures on such that*
[TABLE]
In contrast to [14], we do not consider killing for affine processes and, moreover, we suppose that integrate , i.e., the first moment for big jumps is finite. It is well-known that without killing and under first moment condition for the big jumps of , the corresponding affine process (introduced below) is conservative (see [14, Lemma 9.2]). In this paper we work with Definition 1.1 and thus restrict our study to conservative affine processes. In order to simplify the notation, we have also set and , for . Hence all integrals with respect to the measures can be taken over instead of .
Denote by the Banach space of bounded measurable functions over . This space is equipped with the supremum norm . Define
[TABLE]
Note that is bounded for any . Here denotes the Euclidean scalar product on . By abuse of notation, we later also use for the scalar product on or The following is due to [14].
Theorem 1.2**.**
Let be admissible parameters. Then there exists a unique conservative Feller semigroup on with generator such that and, for and ,
[TABLE]
where . Moreover, is a core for the generator. Let be the transition probabilities. Then
[TABLE]
where and are uniquely determined by the generalized Riccati differential equations: for ,
[TABLE]
and , are of Lévy-Khintchine form
[TABLE]
Consequently, there exists a unique Feller process with generator . This process is called affine process with admissible parameters .
Remark 1.3**.**
Let be admissible parameters. According to [14, Lemma 10.1 and Lemma 10.2], the martingale problem with generator and domain is well-posed in the Skorokhod space over equipped with the usual Skorokhod topology. Hence, we can characterise an affine process with admissible parameters as the unique solution to the martingale problem with generator and domain . In any case, it can be constructed as a Markov process on the Skorokhod space over .
Affine processes are thus constructed on the canonical state space. In order to prove the main result of this work, we provide in Section 4 a pathwise construction of affine processes. The latter one extends previous cases from the literature such as [15, 20, 43] and [5].
1.3 Ergodicity in Wasserstein distance for affine processes
Let be the space of all Borel probability measures over . By abuse of notation, we extend the transition semigroup (given by Theorem 1.2) onto via
[TABLE]
Then describes the distribution of the affine process at time such that it has at initial time law . Note that , and is a semigroup on in the sense that , for any and . Such semigroup property is simply a compact notation for the Chapman-Kolmogorov equations satisfied by . Since the martingale problem with generator and domain is well-posed, and is a core (see Theorem 1.2 and Remark 1.3), it follows from [16, Proposition 9.2] that, for some given , the following properties are equivalent:
- (i)
, for all . 2. (ii)
, for all . 3. (iii)
, for all and all .
A distribution which satisfies one of these properties (i) – (iii) is called invariant distribution for the semigroup . In this work we will prove that, under some appropriate assumptions, has a unique invariant distribution , this distribution has some finite -moment and, moreover, with exponential rate. For this purpose we use the Wasserstein distance on introduced below. Given , a coupling of is a Borel probability measure on which has marginals and , respectively, i.e., for it holds that
[TABLE]
Denote by the collection of all such couplings. Let us now introduce two different metrics on as follows:
- (a)
Define, for , , , and let
[TABLE] 2. (b)
Introduce , , and let
[TABLE]
Let . The Wasserstein distance on is defined by
[TABLE]
The appearance of the additional factor is purely technical, it is a consequence of the estimates proved in Section 6. By general theory of Wasserstein distances we see that is a complete seperable metric space, see, e.g., [50, Theorem 6.18]. Convergence with respect to this distances is explained in the following remark, see also [50, Theorem 6.9].
Remark 1.4**.**
Let , and . The following are equivalent
- (i)
* as .* 2. (ii)
For each continuous function with , it holds that
[TABLE] 3. (iii)
* weakly as , and*
[TABLE] 4. (iv)
* weakly as , and*
[TABLE]
For simplicity of notation, we let , , , and . Then it is easy to see that and , for some constant , i.e., is stronger then . The following is our main result.
Theorem 1.5**.**
Let be admissible parameters. Suppose that has only eigenvalues with negative real parts, and
[TABLE]
Then has a unique invariant distribution and the following assertions hold:
- (a)
* and there exist constants such that, for all ,*
[TABLE] 2. (b)
If there exists satisfying
[TABLE]
then and there exists constants such that, for all ,
[TABLE]
It is worthwhile to mention that to our knowledge a convergence rate solely under a -moment condition on the state-independent jump measure was not even obtained for one-dimensional CBI processes. In order that and are well-defined, we need to show that belongs to or , respectively. This will be shown in Section 5, where general moment estimates for affine processes are studied. Using combined with Remark 1.4 we conclude the following.
Remark 1.6**.**
Under the conditions of Theorem 1.5, there exist constants such that
[TABLE]
where . Let be the Wasserstein distance given by (1.4) with replaced by . Then similarly to Remark 1.4, convergence with respect to is equivalent to weak convergence of probability measures on . Since , we conclude from (1.9) that weakly as with exponential rate.
Let be an affine process. For the parameter estimation of affine models, see, e.g., [3], [42] and [2], it is often necessary to prove a Birkhoff ergodic theorem, i.e.,
[TABLE]
holds almost surely for sufficiently many test functions . Using classical theory, see, e.g., [45, Theorem 17.1.7] and [47], such convergence is implied by the ergodicity in the total variation distance, i.e., by
[TABLE]
where denotes the total variation distance. Unfortunately, it is typically a very difficult mathematical task to prove (1.11) even for particular models. An extension of (1.10) applicable in the case where holds in the Wasserstein distance generated by the metric was recently studied in [47]. Applying the main result of [47] to the case of affine processes and using the fact that each affine process can be obtained as a pathwise unique strong solution to some stochastic equation with jumps (see Section 4), yields the following corollary.
Corollary 1.7**.**
Let be admissible parameters. Suppose that has only eigenvalues with negative real parts, and (1.5) is satisfied. Let be the corresponding affine process constructed as the pathwise unique strong solution on a complete probability space in Section 4. Let for some , then (1.10) holds in .
Although we have formulated (1.10) in continuous time, the discrete-time analog can be obtained in the same manner.
1.4 Comparison with related works
Consider an Ornstein-Uhlenbeck process on , i.e., an affine process on state space with admissible parameters . If has only eigenvalues with negative real parts and (1.5) is satisfied, then [49] is applicable and hence the corresponding Ornstein-Uhlenbeck process satisfies, for all , weakly as . Under additional technical conditions on the measure , it follows that the corresponding process also satisfies (1.11) with exponential rate, see [51]. Since in view of Theorem 1.5 the convergence (in the Wasserstein distance) has already exponential rate, we conclude that the additional restriction on imposed in [51] is only used to guarantee that convergence takes place in the stronger total variation distance, i.e., it is not necessary for the speed of convergence.
Consider a subcritical multi-type CBI process on , i.e., an affine process on state space for which the parameter has only eigenvalues with negative real parts. In dimension , Pinsky [46] announced (without proof) the existence of a limiting distribution under condition (1.5). A proof of this fact was then given in [36, Theorem 3.16], while in [41, Theorem 3.20 and Corollary 3.21] it was shown that is equivalent to (1.5). Some properties of the invariant distribution have been studied in [34]. In [42] exponential ergodicity in total variation distance, see (1.11), was established for one-dimensional subcritical CBI processes with , while some other related results for stochastic equations on have been recently considered in [18]. An extension of the techniques from [42] to arbitrary dimension is still an interesting open problem. Recently, in [44] another approach for the exponential ergodicity in the total variation distance for affine processes on cones, including multi-type CBI processes, was provided. Their techniques were closely related to stochastic stability of Markov processes in the sense of Meyn and Tweedie [45], see also the references therein. More precisely, it was shown that each subcritical CBI process which is -irreducible, aperiodic and has finite second moments, where is a reference measure with its support having non-empty interior, is exponentially ergodic in the total variation distance. As such a result is formulated in a very general way, it becomes a delicate mathematical task to show that such conditions are satisfied for CBI processes with jumps of infinite activity or with degenerate diffusion components. Moreover, assuming that has at least finite second moments rules out some natural examples as studied in [42] for and in Section 2 of this work. In contrast, our results can be applied in arbitrary dimension without the need to prove irreducibility or aperiodicity, paying the price that we use the Wasserstein distance instead. Let us mention that recently also asymptotic results for supercritical CBI processes have been obtained in [33, 9, 8].
Consider now the general case of an affine process on the canonical state space . Based on the stability theory of Markov chains in the sense of Meyn and Tweedie the long-time behavior of some particular two-dimensional models on state space was studied in [4, 28].These results have been further developed in [53] for arbitrary dimensions, where also functional limit theorems were obtained. In order to prove irreducibility and aperiodicity, the authors supposed that the diffusion compnent is non-degenerate and that and are probability measures, i.e., the corresponding affine process has only jumps of finite variation. Independently in [29] the following result was obtained.
Theorem 1.8**.**
[29]** Let be admissible parameters. Suppose that has only eigenvalues with negative real parts and (1.5) is satisfied. Then there exists a unique invariant distribution for . Moreover, has Laplace transform
[TABLE]
and one has, for all , weakly as .
The proof of Theorem 1.8 is based on a fine stability analysis of the Riccati equations (1.2). Comparing with our main result Theorem 1.5, the authors have, in addition, established a formula for the Laplace transform of , but have not studied any convergence rate. We emphasize that the main aim of our Theorem 1.5 is to establish the exponential convergence speed (1.6) and (1.8) with respect to the corresponding Wasserstein metrics. However, in the process of proving (1.6) we also obtain the existence and uniqueness of an invariant distribution as a natural by-product. Moreover, in Theorem 1.5 and Theorem 1.8 existence and uniqueness of an invariant distribution is shown by essentially different techniques.
1.5 Main idea of proof and structure of the work
The proof of Theorem 1.5 is divided in 4 steps as explained below.
Step 1. Provide a stochastic description of conservative affine processes. More precisely, in Section 3 we discuss a stochastic equation for multi-type CBI processes and recall a comparison principle due to [5]. In Section 4 we prove that each affine process can be obtained as the pathwise unique strong solution to a certain stochastic equation, where denotes the initial condition. The particular structure of this equation shows that the process takes the form , where is a CBI process with initial condition and is an OU-type process with initial condition whose coefficients depend on the process .
Step 2. Let be an affine process. Based on the stochastic equation from the first step, we study in Section 5 finiteness of the moments and . Since the proofs in this section are rather standard, we only outline the main steps, while technical details are postponed to the appendix.
Step 3. Let and be the affine processes with initial states , respectively, obtained as the unique strong solutions to the stochastic equation discussed in Section 4. Suppose that (1.7) is satisfied for . The following key estimate is proved in Section 6:
[TABLE]
where are some constants. Indeed, write and , respectively. Using the comparison principle for the CBI component we prove that
[TABLE]
where is some constant. From this and the particular structure of the stochastic equation solved by and we then easily deduce (1.13). In the literature the proof of similar inequalities to (1.13) and (1.14) is often based on the construction of a successfull coupling which is typically a difficult task. In the framework of affine processes a surprisingly simple proof of such estimates is given in Section 6 by using monotone couplings as explained above.
Step 4. The results obtained in Steps 1 – 3 provide us all necessary tools to give a full proof of Theorem 1.5 in Section 7. For the sake of simplicity, we explain below how (1.8) is shown. Estimate (1.6) can be obtained in the same way. Using classical arguments, we may deduce assertion (1.8) from the contraction estimate
[TABLE]
Next observe that, by the convexity of the Wasserstein distance (see Lemma 8.4) combined with (1.3), property (1.15) is implied by
[TABLE]
Let be the transition semigroup for the affine process with admissible parameters . In view of (1.1) one has , where denotes the usual convolution of measures. A similar decomposition for affine processes was also used in [29]. Applying now Lemma 8.3 and the Jensen inequality gives
[TABLE]
where the last inequality follows from Step 3 applied to .
2 Examples
2.1 Anisotropic -root process
Let be independent one-dimensional Lévy processes with symbols
[TABLE]
where . Let be another -dimensional Lévy process with symbol
[TABLE]
where is a measure on with and
[TABLE]
Suppose that and are independent. Applying the results of [5] to this particular case shows that, for each , there exists a pathwise unique strong solution to
[TABLE]
This process is an affine process on with admissible parameters
[TABLE]
and corresponding Lévy measures ,
[TABLE]
Applying our main result to this particular case gives the following.
Corollary 2.1**.**
If has only eigenvalues with negative real parts and satisfies
[TABLE]
then the assertions of Theorem 1.5 are true.
Convergence in total variation distance for a similar one-dimensional model was studied in [42]. Similar two-dimensional processes were also studied in [4] and [27]. In view of our main result Theorem 1.5, it is straightforward to extend this model to arbitrary dimension , with possibly non-vanishing diffusion part and more general driving noise of Lévy type.
2.2 Stochastic volatility model
Let , i.e., . Let be the unique strong solution to
[TABLE]
where , , , is the correlation coefficient, is a two-dimensional Brownian motion, is a one-dimensional Lévy subordinator with Lévy measure , and a one-dimensional Lévy process with Lévy measure . Suppose that and are mutually independent. It is not difficult to see that is an affine process with admissible parameters
[TABLE]
and measures
[TABLE]
Then we obtain the following.
Corollary 2.2**.**
If and
[TABLE]
then the assertions of Theorem 1.5 are true.
It is straightforward to extend this model to higher dimensions and more general driving noises.
3 Stochastic equation for multi-type CBI processes
In this section we recall some results for the particular case of multi-type CBI processes, i.e. affine processes on state space (that is, ). For further references and additional explanations we refer to [5] and [8]. Let be a complete probability space rich enough to support the following objects:
- (B1)
A -dimensional Brownian motion . 2. (B2)
A Poisson random measure on with compensator , where is a Borel measure supported on satisfying
[TABLE] 3. (B3)
Poisson random measures on with compensators , , where are Borel measures supported on satisfying
[TABLE]
The objects are supposed to be mutually independent. Let and be the corresponding compensated Poisson random measures. Here and below we consider the natural augmented filtration generated by . Finally let
- (a)
. 2. (b)
such that , for and . 3. (c)
A matrix , where .
For , consider the stochastic equation
[TABLE]
where . Pathwise uniqueness for a slightly more complicated equation was recently obtained in [5], while (3.1) in this form appeared first in [8]. The following is essentially due to [5].
Proposition 3.1**.**
Let be as in (a) – (c), and consider objects that are given in (B1) – (B3). Then the following assertions hold:
- (a)
For each , there exists a pathwise unique strong solution to (3.1). 2. (b)
Let be any solution to (3.1). Then is a multi-type CBI process starting from , and the generator of is of the following form: for
[TABLE]
Conversely, given any multi-type CBI process with generator and starting point , we can find a solution to (3.1) such that and have the same law.
The proof of the pathwise uniqueness is based on a comparison principle for multi-type CBI processes, see [5, Lemma 4.2]. This comparison principle is stated below.
Lemma 3.2**.**
[5, Lemma 4.2]** Let be a weak solution to (3.1) with parameters , let be another weak solution to (3.1) with parameters , where and satisfy (a) – (c). Both solutions are supposed to be defined on the same probability space and with respect to the same noises that satisfy (B1) – (B3). Suppose that, for all , and . Then
[TABLE]
4 Stochastic equation for affine processes
Below we show that any affine process can also be obtained as the pathwise unique strong solution to a certain stochastic equation. In the two-dimensinoal case such a result was first obtained in [15]. Indepedently, the case of affine diffusions on the canoncical state space (i.e., processes without jumps) was studied in [20]. The main obstacle there is related with the diffusion component which is degenerate at the boundary but also has a nontrival structure in higher dimensions. In order to take this into account we use, compared to [20], another representation of the diffusion matrix (see (A0) and (A1) below). Such a representation is used to decompose the corresponding affine process into a CBI and an OU component which are then treated seperately. Consequently, based on the avaliable results for CBI processes, the proofs in this section become relatively simple.
Let be admissible parameters. For the parameters and consider the following objects:
- (A0)
An -matrix such that . 2. (A1)
Matrices such that, for all , and
[TABLE]
Let us remark the following.
Remark 4.1**.**
- (i)
The first condition is simple to check. Indeed, by definition, one has , thus is symmetric and positive semidefinite. Hence denotes the non-negative square root of . 2. (ii)
Concerning the second condition, recall that and hence is positive semidefinite. Moreover, by definition of admissible parameters, is everywhere zero except at the entry . Hence is well-defined. Existence of satisfying (4.1) follows from the characterization of positive semidefiniteness for symmetric block matrices, see, e.g., **[23, Theorem 16.1]**. The latter result is based on pseudo-inverses and properties of the Schur-complement for block matrices.
Below we describe the noises appearing in the stochastic equation for affine processes. Let be a complete probability space rich enough to support the following objects:
- (A2)
A -dimensional Brownian motion . 2. (A3)
For each , a -dimensional Brownian motion . 3. (A4)
A Poisson random measure with compensator on . 4. (A5)
For each , a Poisson random measure with compensator on .
We suppose that all objects are mutually independent. Denote by and , , the corresponding compensated Poisson random measures. Here and below we consider the natural augmented filtration generated by these noise terms. For , consider the stochastic equation
[TABLE]
where and are, for , given by
[TABLE]
Note that we have changed the drift coefficients to and in order to change the compensators in the stochastic integrals. Such change is, under the given moment conditions on , always possible and does not affect our results. Concerning existence and uniqueness for (4.2), we obtain the following.
Theorem 4.2**.**
Let be admissible parameters. Then, for each , there exists a pathwise unique -valued strong solution to (4.2).
This result will be proved later in this Section. Let us first relate (4.2) to affine processes.
Proposition 4.3**.**
Let be admissible parameters. Then each solution to (4.2) is an affine process with admissible parameters and starting point .
Proof.
Let be a solution to (4.2) and . Applying the Itô formula shows that
[TABLE]
is a local martingale. Note that is bounded. Hence
[TABLE]
and we conclude that is a true martingale. It follows from Remark 1.3 that is an affine process with admissible parameters . ∎
The rest of this section is devoted to the proof of Theorem 4.2. As often in the theory of stochastic equations, existence of weak solutions is the easy part.
Lemma 4.4**.**
Let be admissible parameters. Then, for each , there exists a weak solution to (4.2).
Proof.
Since existence of a solution to the martingale problem with sample paths in the Skorokhod space over is known, the assertion is a consequence of [39], namely, the equivalence between weak solutions to stochastic equations and martingale problems. Alternatively, following [14, p.993] we can show that each solution to the martingale problem with generator and domain is a semimartingale and compute its semimartingale characteristics (see [14, Theorem 2.12]). The assertion is then a consequence of the equivalence between weak solutions to stochastic equations and semimartingales (see [31, Chapter III, Theorem 2.26]). ∎
In view of the Yamada-Watanabe Theorem (see [6]), Theorem 4.2 is proved, provided we can show pathwise uniqueness for (4.2). For this purpose we rewrite (4.2) into its components , where and . Introduce the notation , where and . Moreover, let and write for the initial condition . Finally, let denote the canonical basis vectors in . Then (4.2) is equivalent to the system of equations
[TABLE]
Observe that the first equation for does not involve . We will show that (4.4) is precisely (3.1), i.e., is a multi-type CBI process and pathwise uniqueness holds for . The second equation for describes an OU-type process with random coefficients depending on . If we regard as conditionally fixed, then pathwise uniqueness for (4.5) is obvious. These ideas are summarized in the next lemma.
Lemma 4.5**.**
Let be admissible parameters. Then pathwise uniqueness holds for (4.4) and (4.5), and hence for (4.2).
Proof.
Let and be two solutions to (4.2) with the same initial condition both defined on the same probability space. Then and both satisfy (4.4). Let us show that (4.4) is precisely (3.1), from which we deduce . Set , , and define
- •
A -dimensional Brownian motion .
- •
A Poisson random measure on by
[TABLE]
where and is a Borel set.
- •
Poisson random measures on by
[TABLE]
where , and is a Borel set.
Note that the random objects are mutually independent. Moreover, it is not difficult to see that and have compensators
[TABLE]
where and . Finally let , , and
[TABLE]
Then (4.4) is precisely (3.1), and it follows from Proposition 3.1.(a) that .
It remains to prove pathwise uniqueness for (4.5). Define, for , a stopping time . Since and both satisfy (4.5) for the same , we obtain
[TABLE]
and hence, for some constant ,
[TABLE]
The Grownwall lemma gives , for all and . Note that and have no explosion. Taking proves the assertion. ∎
5 Moments for affine processes
The stochastic equation introduced in Section 4 can be used to provide a simple proof for the finiteness of moments for affine processes. The following is our main result for this section.
Proposition 5.1**.**
Let be admissible parameters. For , let be the unique solution to (4.2).
- (a)
Suppose that there exists such that
[TABLE]
Then there exists a constant (independent of and ) such that
[TABLE] 2. (b)
Suppose that (1.5) is satisfied. Then there exists a constant (independent of and ) such that
[TABLE]
Proof.
Define and , where . Applying the Itô formula for , , gives
[TABLE]
where and are given by
[TABLE]
where was defined in (4.3). Define, for , a stopping time . Then it is not difficult to see that is a martingale, for any . Moreover, we will prove in the appendix that there exists a constant such that
[TABLE]
Hence taking expectations in (5.1) gives
[TABLE]
Applying the Gronwall lemma gives , for all and some constant . Since has cádlág paths and is independent of , we may take the limit and conclude the assertion by the lemma of Fatou. ∎
We close this section with a formula for the first moment of general affine processes. The particular case of multi-type CBI processes was treated in [5, Lemma 3.4], while recursion formulas for higher-order moments of multi-type CBI processes were provided in [7].
Lemma 5.2**.**
Let be admissible parameters and suppose that
[TABLE]
Let be an affine process obtained from (4.2) with . Then
[TABLE]
where . and , then
[TABLE]
Proof.
First observe that, by definition of admissible parameters and (5.3), we may apply Proposition 5.1 (a) and deduce that has finite first moment. Taking expectations in (4.2) gives
[TABLE]
Solving this equation gives the desired formula for . Taking expectations in (3.1) (or (4.4)) gives
[TABLE]
which implies the desired formula for . Finally, taking expectations in (4.5) gives
[TABLE]
Solving this equation and using previous formula for , we obtain the assertion. ∎
6 Contraction estimate for trajectories of affine processes
The following is our main estimate for this section.
Proposition 6.1**.**
Let be admissible parameters, suppose that (5.3) is satisfied, and assume that has only eigenvalues with negative real parts. Let , and let and be the unique strong solutions to (4.2) with initial condition and , respectively. Then there exist constants independent of and such that, for all ,
[TABLE]
Proof.
Let us first prove (6.1). Note that and are multi-type CBI processes with the same parameters. If for all , then we obtain from Lemma 3.2 and Lemma 5.2
[TABLE]
where we have used that has only eigenvalues with negative real parts (since has this property and ). For general , let be such that
[TABLE]
where denote the canonical basis vectors in . Then, for each , the elements are comparable in the sense that if , and either or . In any case, we obtain from the previous consideration
[TABLE]
where we have used . This completes the proof of (6.1).
If , then (6.2) is trivial. Suppose that . Applying the Itô formula to and , and then taking the difference, gives
[TABLE]
Here and below we denote by a generic constant which may vary from line to line. Moreover, we find and such that
[TABLE]
The stochastic integral against the Brownian motion is estimated by the BDG-inequality as follows
[TABLE]
where we have used (6.1) and (6.3). For the stochastic integral against we consider the cases and separately. For we apply first the BDG-inequality and then the Jensen inequality to obtain, for each ,
[TABLE]
For , we apply first the BDG-inequality and then use the sub-additivity of to obtain
[TABLE]
where we have used . Collecting all estimates proves the assertion. ∎
7 Proof of Theorem 1.5
7.1 The -Wasserstein estimate
Based on the results of Section 6, we first deduce the following estimate with respect to the -Wasserstein distance.
Proposition 7.1**.**
Let be the transition semigroup with admissible parameters , suppose that has only eigenvalues with negative real parts, and (1.5) is satisfied. Then there exist constants such that, for any , one has
[TABLE]
Proof.
Let be the transition semigroup with admissible parameters given by Theorem 1.2. Take and let and , respectively, be the corresponding affine processes obtained from (4.2) with admissible parameters . Since has law and has law , there exist by Proposition 6.1 constants such that
[TABLE]
Next observe that, for , one has
[TABLE]
Combining this with (1.1) proves , where denotes the convolution of measures on . Let us now prove the desired -estimate. Using Lemma 8.3 from the appendix and then the Jensen inequality for the concave function , gives for some generic constant
[TABLE]
where we have used, for , the elementary inequality
[TABLE]
which is proved in the appendix. Applying now Lemma 8.4 from the appendix gives for any
[TABLE]
Choosing as the optimal coupling of , i.e.,
[TABLE]
proves the assertion. ∎
Based on previous proposition, the proof of Theorem 1.5 is easy. It is given below.
Lemma 7.2**.**
Let be the transition semigroup with admissible parameters . Suppose that has only eigenvalues with negative real parts, and (1.5) is satisfied. Then has a unique invariant distribution . Moreover, this distribution belongs to and, for any , one has (1.6).
Proof.
Let us first prove existence of an invariant distribution . Observe that, by Proposition 5.1, we easily deduce that , for any . Fix any and let with . Then
[TABLE]
Since the right-hand side tends to zero as , we conclude that is a Cauchy sequence in . In particular, there exists a limit , i.e., as . Let us show that is an invariant distribution for . Indeed, take , then
[TABLE]
Since as , we conclude that all terms tend to zero. Hence , i.e., , for all . Next we prove that is the unique invariant distribution. Let be any two invariant distributions and define as in (1.4) with replaced by . Then we obtain, for any and all , by the proof of Proposition 7.1 (see (7.1))
[TABLE]
Fix any , then using the invariance of together with the convexity of the Wasserstein distance gives
[TABLE]
By dominated convergence we deduce that the right-hand side tends to zero as and hence . The last assertion can now be deduced from
[TABLE]
where we have first used the invariance of and then Proposition 7.1. ∎
7.2 The -Wasserstein estimate
As before, we start with an estimate with respect to the Wasserstein distance .
Proposition 7.3**.**
Let be the transition semigroup with admissible parameters . Suppose that has only eigenvalues with negative real parts, and (1.7) is satisfied for some . Then there exist constants such that, for any , one has
[TABLE]
Proof.
Let be the transition semigroup with admissible parameters given by Theorem 1.2. Arguing as in the proof of Proposition 7.1, we obtain
[TABLE]
and . Then we obtain from Lemma 8.3 from the appendix
[TABLE]
where the second inequality follows from the Jensen inequality and the third is a consequence of (7.2). Using now Lemma 8.4 from the appendix, we conclude that
[TABLE]
This proves the assertion. ∎
Based on previous proposition, the proof of the -estimate in Theorem 1.5 can be deduced by exactly the same arguments as in Lemma 7.2. So Theorem 1.5 is proved.
8 Appendix
8.1 Moment estimates for and
In this section we prove (5.2).
Lemma 8.1**.**
Suppose that the same conditions as in Proposition 5.1 (a) are satisfied. Then there exists a constant such that
[TABLE]
Proof.
Observe that . Using gives , and hence we obtain for some generic constant
[TABLE]
For the second order term we first observe that, for ,
[TABLE]
where denotes the Kronecker-Delta symbol. Using gives . This implies that
[TABLE]
Let us now estimate the integrals against and . Consider first the case . The mean value theorem gives
[TABLE]
where we have used in the last inequality. If , then
[TABLE]
If , then . In any case, we obtain, for ,
[TABLE]
Using and
[TABLE]
for the integral against , gives
[TABLE]
where we have used , . It remains to estimate the corresponding integrals for . Applying twice the mean value theorem gives
[TABLE]
where we have used . Using, for and ,
[TABLE]
we conclude that
[TABLE]
Collecting all estimates proves the desired estimate for . ∎
Let us now prove the desired estimate for .
Lemma 8.2**.**
Suppose that the same conditions as in Proposition 5.1 (b) are satisfied. Then there exists a constant such that
[TABLE]
Proof.
Observe that . Hence we obtain for some generic constant
[TABLE]
Observe that, for ,
[TABLE]
Using gives . This implies that
[TABLE]
Let us estimate the integrals against and . Consider first the case . Then
[TABLE]
and hence we obtain
[TABLE]
From the mean value theorem we obtain
[TABLE]
In view of for , we obtain . Using gives
[TABLE]
It remains to estimate the corresponding integrals for . As in (8.1), we get
[TABLE]
This implies
[TABLE]
For , by , we get and hence
[TABLE]
Collecting all estimates proves the desired estimate for . ∎
8.2 Some estimate on the Wasserstein distance
Here and below we let . Below we provide two simple and known estimates for Wasserstein distances.
Lemma 8.3**.**
Let . Then
[TABLE]
Proof.
Using the Kantorovich duality (see [50, Theorem 5.10, Case 5.16], we obtain
[TABLE]
where . Using now the definition of the convolution on the right-hand side gives
[TABLE]
where . Since , we conclude that
[TABLE]
where we have used again the Kantorovich duality. This completes the proof. ∎
The next estimate shows that the Wasserstein distance is convex. For additional details we refer to [50, Theorem 4.8].
Lemma 8.4**.**
Let be a Markov transition function on . Then, for any and any coupling of , it holds that
[TABLE]
8.3 Proof of the elementary inequality with respect to
Below we prove the following inequality.
Lemma 8.5**.**
For any one has
[TABLE]
Proof.
Using the elementary inequality , see [24], we easily obtain
[TABLE]
from which we readily deduce
[TABLE]
Fix any . If , then we obtain
[TABLE]
The case can be treated in the same way. Finally, if , then we obtain
[TABLE]
Collecting both estimates gives, for all , the estimate
[TABLE]
where . A simple extreme value analysis shows that attains its maximum at which gives . ∎
Acknowledgments
The authors would like to thank Jonas Kremer for several discussions on affine processes and pointing out some interesting references on this topic. Peng Jin is supported by the STU Scientific Research Foundation for Talents (No. NTF18023).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Alf [15] Aurélien Alfonsi. Affine diffusions and related processes: simulation, theory and applications , volume 6 of Bocconi & Springer Series . Springer, Cham; Bocconi University Press, Milan, 2015.
- 2BBAKP [18] Mátyás Barczy, Mohamed Ben Alaya, Ahmed Kebaier, and Gyula Pap. Asymptotic properties of maximum likelihood estimator for the growth rate for a jump-type CIR process based on continuous time observations. Stochastic Process. Appl. , 128(4):1135–1164, 2018.
- 3BDLP [13] Mátyás Barczy, Leif Döring, Zenghu Li, and Gyula Pap. On parameter estimation for critical affine processes. Electron. J. Stat. , 7:647–696, 2013.
- 4BDLP [14] Mátyás Barczy, Leif Döring, Zenghu Li, and Gyula Pap. Stationarity and ergodicity for an affine two-factor model. Adv. in Appl. Probab. , 46(3):878–898, 2014.
- 5[5] Mátyás Barczy, Zenghu Li, and Gyula Pap. Stochastic differential equation with jumps for multi-type continuous state and continuous time branching processes with immigration. ALEA Lat. Am. J. Probab. Math. Stat. , 12(1):129–169, 2015.
- 6[6] Mátyás Barczy, Zenghu Li, and Gyula Pap. Yamada-Watanabe results for stochastic differential equations with jumps. Int. J. Stoch. Anal. , pages Art. ID 460472, 23, 2015.
- 7BLP [16] Mátyás Barczy, Zenghu Li, and Gyula Pap. Moment formulas for multitype continuous state and continuous time branching process with immigration. J. Theoret. Probab. , 29(3):958–995, 2016.
- 8[8] Mátyás Barczy, Sandra Palau, and Gyula Pap. Almost sure, L 1 subscript 𝐿 1 L_{1} - and L 2 subscript 𝐿 2 L_{2} -growth behavior of supercritical multi-type continuous state and continuous time branching processes with immigration. ar Xiv:1803.10176 [math.PR] , 2018.
